Stability AI unveils smaller, more efficient 1.6B language model as part of ongoing innovation

Last updated: January 22, 2024 5:23 am
Published January 22, 2024

Size certainly matters when it comes to large language models (LLMs) as it impacts where a model can run.

Stability AI, the vendor perhaps best known for its Stable Diffusion text-to-image generative AI technology, today released one of its smallest models yet with the debut of Stable LM 2 1.6B. Stable LM is a text content generation LLM that Stability AI first launched in April 2023 with both 3 billion and 7 billion parameter models. The new Stable LM model is the second model released in 2024 by Stability AI, following the company’s Stable Code 3B, launched earlier this week.
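
As a rough illustration of why parameter count affects where a model can run, the sketch below estimates the memory needed just to hold the weights, using parameters multiplied by bytes per parameter. The precisions listed are assumptions chosen for illustration, not formats the article says Stability AI ships, and the estimate ignores activations and runtime overhead, so real memory use will be higher.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Ignores activations, caches, and runtime overhead, so real usage is higher.

# Assumed numeric precisions (illustrative, not taken from the article).
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1, "int4": 0.5}

# Nominal parameter counts mentioned in the article.
MODELS = {
    "Stable LM 2 1.6B": 1.6e9,
    "Stable LM 3B": 3e9,
    "Stable LM 7B": 7e9,
}


def weight_memory_gb(num_params: float, precision: str) -> float:
    """Return approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    for name, params in MODELS.items():
        row = ", ".join(
            f"{p}: {weight_memory_gb(params, p):.1f} GB" for p in BYTES_PER_PARAM
        )
        print(f"{name} -> {row}")
```

Under these assumptions, a 1.6 billion parameter model needs about 3.2 GB for its weights at 16-bit precision, compared with about 14 GB for a 7 billion parameter model.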

The new compact yet powerful Stable LM model aims to lower barriers and enable more developers to participate in the generative AI ecosystem. It incorporates multilingual data in seven languages: English, Spanish, German, Italian, French, Portuguese, and Dutch. The model uses recent algorithmic advancements in language modeling to strike what Stability AI hopes is an optimal balance between speed and performance.

“In general, larger models trained on similar data with a similar training recipe tend to do better than smaller ones,” Carlos Riquelme, Head of the Language Team at Stability AI, told VentureBeat. “However, over time, as new models get to implement better algorithms and are trained on more and higher quality data, we sometimes witness recent smaller models outperforming older larger ones.”

Why smaller is better (this time) with Stable LM

According to Stability AI, the model outperforms other small language models with under 2 billion parameters on most benchmarks, including Microsoft’s Phi-2 (2.7B), TinyLlama 1.1B, and Falcon 1B.

The new smaller Stable LM is even able to surpass some larger models, including Stability AI’s own earlier Stable LM 3B model.

“Stable LM 2 1.6B performs better than some larger models that were trained a few months ago,” Riquelme said. “If you think about computers, televisions or microchips, we could roughly see a similar trend, they got smaller, thinner and better over time.”

To be clear, the smaller Stable LM 2 1.6B does have some drawbacks due to its size. Stability AI, in its release for the new model, cautions that “… due to the nature of small, low-capacity language models, Stable LM 2 1.6B may similarly exhibit common issues such as high hallucination rates or potential toxic language.”

Transparency and more data are core to the new model release

The move toward smaller, more powerful LLM options is one that Stability AI has been pursuing for the last few months.

In December 2023, the StableLM Zephyr 3B model was released, delivering better performance at a smaller size than the initial Stable LM iteration back in April.

Riquelme explained that the new Stable LM 2 models are trained on more data, including multilingual documents in six languages in addition to English (Spanish, German, Italian, French, Portuguese and Dutch). Another interesting aspect highlighted by Riquelme is the order in which data is shown to the model during training. He noted that it may pay off to focus on different types of data during different training stages.

Going a step further, Stability AI is making the new models available with pre-trained and fine-tuned options, as well as in a format that the researchers describe as “… the last model checkpoint before the pre-training cooldown.”

“Our goal here is to provide more tools and artifacts for individual developers to innovate, transform and build on top of our current model,” Riquelme said. “Here we are providing a specific half-cooked model for people to play with.”

Riquelme explained that during training, the model gets sequentially updated and its performance increases. In that scenario, the very first model knows nothing, while the last one has consumed and hopefully learned most aspects of the data. At the same time, Riquelme said that models may become less malleable towards the end of their training as they are forced to wrap up learning.
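
The article does not describe Stability AI's actual training schedule. As a minimal sketch, assuming "cooldown" means a final stage in which the learning rate is decayed toward zero, the example below shows a hypothetical warmup, plateau, and linear cooldown schedule and picks out the step just before the cooldown begins. All step counts, rates, and function names here are made up for illustration.

```python
# Hypothetical learning-rate schedule: warmup, constant plateau, linear cooldown.
# None of these numbers come from Stability AI; they only illustrate the idea.

TOTAL_STEPS = 1000
WARMUP_STEPS = 100
COOLDOWN_START = 800  # step at which the assumed final cooldown stage begins
PEAK_LR = 1e-3
FINAL_LR = 0.0


def learning_rate(step: int) -> float:
    """Return the learning rate used at a given training step."""
    if step < WARMUP_STEPS:
        # Linear ramp up to the peak learning rate.
        return PEAK_LR * (step + 1) / WARMUP_STEPS
    if step < COOLDOWN_START:
        # Plateau: the model is still updated at the full learning rate.
        return PEAK_LR
    # Linear decay from PEAK_LR toward FINAL_LR over the cooldown stage.
    progress = (step - COOLDOWN_START) / (TOTAL_STEPS - COOLDOWN_START)
    return PEAK_LR + (FINAL_LR - PEAK_LR) * progress


def pre_cooldown_checkpoint_step() -> int:
    """Last step trained at the full learning rate, just before cooldown."""
    return COOLDOWN_START - 1


if __name__ == "__main__":
    steps = (0, 99, 500, pre_cooldown_checkpoint_step(), 900, TOTAL_STEPS - 1)
    for step in steps:
        print(f"step {step:4d}: lr = {learning_rate(step):.6f}")
```

In a schedule like this, a checkpoint saved at the step returned by `pre_cooldown_checkpoint_step()` has not yet gone through the shrinking updates of the final stage, which is one plausible reading of why such a checkpoint might be easier to continue training on other data.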

“We decided to provide the model in its current form right before we started the last stage of training, so that, hopefully, it’s easier to specialize it to other tasks or datasets people may want to use,” he said. “We are not sure if this will work well, but we really believe in people’s ability to leverage new tools and models in awesome and surprising ways.”
