Sunday, 14 Dec 2025
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > HyperWrite debuts Reflection 70B, most powerful open source LLM
AI

HyperWrite debuts Reflection 70B, most powerful open source LLM

Last updated: September 6, 2024 8:48 am
Published September 6, 2024
Share
HyperWrite debuts Reflection 70B, most powerful open source LLM
SHARE

Be part of our every day and weekly newsletters for the most recent updates and unique content material on industry-leading AI protection. Be taught Extra


There’s a brand new king on the town: Matt Shumer, co-founder and CEO of AI writing startup HyperWrite, immediately unveiled Reflection 70B, a brand new massive language mannequin (LLM) based mostly on Meta’s open supply Llama 3.1-70B Instruct that leverages a brand new error self-correction approach and boasts superior efficiency on third-party benchmarks.

As Shumer introduced in a post on the social network X, Reflection-70B now seems to be “the world’s high open-source AI mannequin.”

I am excited to announce Reflection 70B, the world’s high open-source mannequin.

Skilled utilizing Reflection-Tuning, a method developed to allow LLMs to repair their very own errors.

405B coming subsequent week – we anticipate it to be the perfect mannequin on this planet.

Constructed w/ @GlaiveAI.

Learn on ⬇️: pic.twitter.com/kZPW1plJuo

— Matt Shumer (@mattshumer_) September 5, 2024

He posted the next chart displaying its benchmark efficiency right here:

Reflection 70B has been rigorously examined throughout a number of benchmarks, together with MMLU and HumanEval, utilizing LMSys’s LLM Decontaminator to make sure the outcomes are free from contamination. These benchmarks present Reflection constantly outperforming fashions from Meta’s Llama sequence and competing head-to-head with high industrial fashions.

You possibly can try it yourself here as a demo on a “playground” web site, however as Shumer noted on X, the announcement of the brand new king of open-source AI fashions has flooded the demo web site with visitors and his workforce is scrambling to search out sufficient GPUs (graphics processing items, the precious chips from Nvidia and others used to coach and run most generative AI fashions) to spin as much as meet the demand.

How Reflection 70B stands aside

Shumer emphasised that Reflection 70B isn’t simply aggressive with top-tier fashions however brings distinctive capabilities to the desk, particularly, error identification and correction.

As Shumer advised VentureBeat over DM: “I’ve been fascinated with this concept for months now. LLMs hallucinate, however they’ll’t course-correct. What would occur when you taught an LLM the way to acknowledge and repair its personal errors?”

See also  Synthesia launches LLM-powered assistant to turn any text file or link into AI video

Therefore the identify, “Reflection” — a mannequin that may replicate on its generated textual content and assess its accuracy earlier than delivering it as outputs to the consumer.

The mannequin’s benefit lies in a method referred to as reflection tuning, which permits it to detect errors in its personal reasoning and proper them earlier than finalizing a response.

The approach that drives Reflection 70B is easy, however very highly effective.

Present LLMs tend to hallucinate, and might’t acknowledge once they accomplish that.

Reflection-Tuning allows LLMs to acknowledge their errors, after which right them earlier than committing to a solution. pic.twitter.com/pW78iXSwwb

— Matt Shumer (@mattshumer_) September 5, 2024

Reflection 70B introduces a number of new particular tokens for reasoning and error correction, making it simpler for customers to work together with the mannequin in a extra structured manner. Throughout inference, the mannequin outputs its reasoning inside particular tags, permitting for real-time corrections if it detects a mistake.

The playground demo web site contains urged prompts for the consumer to make use of, asking Reflection 70B what number of letter “r” cases there are within the phrase “Strawberry” and which quantity is bigger, 9.11 or 9.9, two easy issues many AI fashions — together with main proprietary ones — fail to get proper constantly. Our assessments of it had been sluggish, however Reflection 70B in the end offered the right response after 60+ seconds.

This makes the mannequin notably helpful for duties requiring excessive accuracy, because it separates reasoning into distinct steps to enhance precision. The mannequin is out there for obtain by way of the AI code repository Hugging Face, and API entry is about to be accessible later immediately via GPU service supplier Hyperbolic Labs.

An much more highly effective, bigger mannequin on the way in which

The discharge of Reflection 70B is just the start of the Reflection sequence. Shumer has introduced that an excellent bigger mannequin, Reflection 405B, will probably be made accessible subsequent week.

He additionally advised VentureBeat that HyperWrite is engaged on integrating the Reflection 70B mannequin into its main AI writing assistant product.

See also  Lenovo AI Innovator Graymatics debuts in the Sunlight app library

“We’re exploring a variety of methods to combine the mannequin into HyperWrite — I’ll share extra on this quickly,” he pledged.

Reflection 405B is anticipated to outperform even the highest closed-source fashions in the marketplace immediately. Shumer additionally stated HyperWrite would launch a report detailing the coaching course of and benchmarks, offering insights into the improvements that energy Reflection fashions.

The underlying mannequin for Reflection 70B is constructed on Meta’s Llama 3.1 70B Instruct and makes use of the inventory Llama chat format, guaranteeing compatibility with current instruments and pipelines.

Shumer credit Glaive for enabling speedy AI mannequin coaching

A key contributor to Reflection 70B’s success is the artificial information generated by Glaive, a startup specializing within the creation of use-case-specific datasets.

Glaive’s platform allows the speedy coaching of small, extremely centered language fashions, serving to to democratize entry to AI instruments. Based by Dutch engineer Sahil Chaudhary, Glaive focuses on fixing one of many greatest bottlenecks in AI growth: the provision of high-quality, task-specific information.

I need to be very clear — @GlaiveAI is the rationale this labored so effectively.

The management they provide you to generate artificial information is insane.

I will probably be utilizing them for practically each mannequin I construct transferring ahead, and it’s best to too. https://t.co/I789UIa5Yg

— Matt Shumer (@mattshumer_) September 5, 2024

Glaive’s method is to create artificial datasets tailor-made to particular wants, permitting firms to fine-tune fashions rapidly and affordably. The corporate has already demonstrated success with smaller fashions, similar to a 3B parameter mannequin that outperformed many bigger open-source alternate options on duties like HumanEval. Spark Capital led a $3.5 million seed round for Glaive greater than a 12 months in the past, supporting Sahil’s imaginative and prescient of making a commoditized AI ecosystem the place specialist fashions may be skilled simply for any process.

By leveraging Glaive’s know-how, the Reflection workforce was capable of quickly generate high-quality artificial information to coach Reflection 70B. Shumer credit Sahil and the Glaive AI platform for accelerating the event course of, with information generated in hours quite than weeks.

See also  Flood of interest in Europe’s AI Gigafactories plan

In complete, the coaching course of took three weeks, in keeping with Shumer in a direct message to VentureBeat. “We skilled 5 iterations of the mannequin over three weeks,” he wrote. “The dataset is solely customized, constructed utilizing Glaive’s artificial information era programs.”

HyperWrite is a uncommon Lengthy Island AI startup

At first look, it looks like Reflection 70B got here from nowhere. However Shumer has been on the AI sport for years.

He based his firm, initially referred to as Otherside AI, in 2020 alongside Jason Kuperberg. It was initially based mostly in Melville, New York, a hamlet about an hour’s drive east of New York Metropolis on Lengthy Island.

It gained traction round its signature product, HyperWrite, which began as a Chrome extension for shoppers to craft emails and responses based mostly on bullet factors, however has developed to deal with duties similar to drafting essays, summarizing textual content, and even organizing emails. HyperWrite counted two million customers as of November 2023 and earned the co-founding duo a spot on Forbes‘ annual “30 Under 30” List, in the end spurring Shumer and Kuperberg and their rising workforce to vary the identify of the corporate to it.

HyperWrite’s newest spherical, disclosed in March 2023, noticed a $2.8 million injection from buyers together with Madrona Enterprise Group. With this funding, HyperWrite has launched new AI-driven options, similar to turning net browsers into digital butlers that may deal with duties starting from reserving flights to discovering job candidates on LinkedIn.

Shumer notes that accuracy and security stay high priorities for HyperWrite, particularly as they discover complicated automation duties. The platform remains to be refining its private assistant device by monitoring and making enhancements based mostly on consumer suggestions. This cautious method, much like the structured reasoning and reflection embedded in Reflection 70B, exhibits Shumer’s dedication to precision and accountability in AI growth.

What’s subsequent for HyperWrite and the Reflection AI mannequin household?

Trying forward, Shumer has even greater plans for the Reflection sequence. With Reflection 405B set to launch quickly, he believes it’ll surpass the efficiency of even proprietary or closed-source LLMs similar to OpenAI’s GPT-4o, presently the worldwide chief, by a big margin.

That’s unhealthy information not just for OpenAI — which is reportedly looking for to boost a big new spherical of personal funding from the likes of Nvidia and Apple — however different closed-source mannequin suppliers similar to Anthropic and even Microsoft.

It seems that as soon as once more within the fast-moving gen AI area, the stability of energy has shifted.

For now, the discharge of Reflection 70B marks a big milestone for open-source AI, giving builders and researchers entry to a robust device that rivals the capabilities of proprietary fashions. As AI continues to evolve, Reflection’s distinctive method to reasoning and error correction could set a brand new commonplace for what open-source fashions can obtain.


Source link
TAGGED: 70B, Debuts, HyperWrite, LLM, Open, powerful, reflection, source
Share This Article
Twitter Email Copy Link Print
Previous Article Cloud Computing News Navigating the Shift to Cloud and Agile Practices
Next Article datacenter Edgecore unveils high-performance 400G spine switch for data centers
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Security, sustainability, and overcoming silos

NetApp has make clear the urgent points confronted by organisations globally as they attempt to…

December 11, 2024

AMD Strikes Blow in AI Chip War With OpenAI Deal

Superior Micro Gadgets (AMD) on Monday struck a large take care of OpenAI for six…

October 6, 2025

Legacy data centres are the missing lever in Europe’s AI plan

Simon Harris, Director of Crucial Infrastructure at BCS, makes the case that refurbishments, focused electrical…

November 17, 2025

OpenAI updates Operator to o3, making its $200 monthly ChatGPT Pro subscription more enticing

Be part of our each day and weekly newsletters for the newest updates and unique…

May 24, 2025

NiaHealth Raises $5.75M in Seed Funding

NiaHealth, a Toronto, Canada-based well being tech startup offering proactive care, raised $5.75M in Seed…

June 28, 2025

You Might Also Like

Why most enterprise AI coding pilots underperform (Hint: It's not the model)
AI

Why most enterprise AI coding pilots underperform (Hint: It's not the model)

By saad
Newsweek: Building AI-resilience for the next era of information
AI

Newsweek: Building AI-resilience for the next era of information

By saad
Google’s new framework helps AI agents spend their compute and tool budget more wisely
AI

Google’s new framework helps AI agents spend their compute and tool budget more wisely

By saad
BBVA embeds AI into banking workflows using ChatGPT Enterprise
AI

BBVA embeds AI into banking workflows using ChatGPT Enterprise

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.