News Report Technology
August 10, 2023

Stability AI Beats SoftBank in Releasing Japanese Language Model

In Brief

Stability AI today released its first Japanese language model (LM), Japanese StableLM Alpha.

The 7 billion-parameter general-purpose language model is currently the only publicly available LM for Japanese speakers.

With this release, Stability has beaten SoftBank to the punch as the latter announced last week that it will be developing homegrown large language models for the Japanese market.

Stability AI, the generative AI company behind Stable Diffusion, today announced the release of its first Japanese Language Model (LM) named Japanese StableLM Alpha, accessible via Hugging Face.

Stability AI Beats SoftBank in Releasing Japanese Language Model

The company claims that the 7 billion-parameter general-purpose language model is currently the only best-performing publicly available LM for Japanese speakers, according to a benchmark suite against four sets of other Japanese LMs.

A commercially available model, the Japanese StableLM Base Alpha 7B, will be released under the Apache License 2.0. The model is trained on 750 billion tokens of Japanese and English text using large scale data sourced from the web.

In addition to open datasets, training data includes datasets created by Stability AI’s Japanese community, in cooperation with the Japanese team of the EleutherAI Polyglot project. Stability AI used an extension of EleutherAI’s GPT-NeoX software to train the Japanese StableLM Base Alpha 7B model.

Another model, the Japanese StableLM Instruct Alpha 7B, is created solely for research purposes and released exclusively for research use. “This model is additionally tuned to follow user instructions, and trained with Supervised Fine-tuning (SFT) using multiple open datasets,” Stability AI tweeted.

Both models were tested using EleutherAI’s Language Model Evaluation Harness on tasks like sentence classification, sentence pair classification, question answering, and sentence summarization, with an average score 54.71%. Stability AI claims that this score puts its Japanese StableLM Instruct Alpha 7B far ahead of other Japanese models.

“We are proud of our first big step towards contributing to the Japanese generative AI ecosystem,” said Meng Lee, Project Lead of Japanese StableLM.

”We look forward to continuing to create models across several modalities, built specifically to reflect Japanese culture, language and aesthetics”.

With the release of its Japanese LM, Stability AI has beaten SoftBank to the punch of releasing language models for the Japanese market. Last Friday, SoftBank announced that it has launched a new company to research and develop homegrown Large Language Models (LLM) for the Japanese market. 

Furthermore, SoftBank is allocating around 20 billion JPY (more than $140 million) to its generative AI computing platform, set to launch in the fall of this year. It’s a waiting game to determine whose Japanese Language Model will emerge triumphant in the long run.

Disclaimer

In line with the Trust Project guidelines, please note that the information provided on this page is not intended to be and should not be interpreted as legal, tax, investment, financial, or any other form of advice. It is important to only invest what you can afford to lose and to seek independent financial advice if you have any doubts. For further information, we suggest referring to the terms and conditions as well as the help and support pages provided by the issuer or advertiser. MetaversePost is committed to accurate, unbiased reporting, but market conditions are subject to change without notice.

About The Author

Cindy is a journalist at Metaverse Post, covering topics related to web3, NFT, metaverse and AI, with a focus on interviews with Web3 industry players. She has spoken to over 30 C-level execs and counting, bringing their valuable insights to readers. Originally from Singapore, Cindy is now based in Tbilisi, Georgia. She holds a Bachelor's degree in Communications & Media Studies from the University of South Australia and has a decade of experience in journalism and writing. Get in touch with her via [email protected] with press pitches, announcements and interview opportunities.

More articles
Cindy Tan
Cindy Tan

Cindy is a journalist at Metaverse Post, covering topics related to web3, NFT, metaverse and AI, with a focus on interviews with Web3 industry players. She has spoken to over 30 C-level execs and counting, bringing their valuable insights to readers. Originally from Singapore, Cindy is now based in Tbilisi, Georgia. She holds a Bachelor's degree in Communications & Media Studies from the University of South Australia and has a decade of experience in journalism and writing. Get in touch with her via [email protected] with press pitches, announcements and interview opportunities.

Hot Stories

How GAMEE Is Making Web3 Irresistibly Fun

by Victoria d'Este
May 09, 2025
Join Our Newsletter.
Latest News

The Calm Before The Solana Storm: What Charts, Whales, And On-Chain Signals Are Saying Now

Solana has demonstrated strong performance, driven by increasing adoption, institutional interest, and key partnerships, while facing potential ...

Know More

Crypto In April 2025: Key Trends, Shifts, And What Comes Next

In April 2025, the crypto space focused on strengthening core infrastructure, with Ethereum preparing for the Pectra ...

Know More
Read More
Read more
Gate.io Releases Latest Proof Of Reserves Report, Reports $10.87B In Total Assets And $2.42B In Excess Reserves
News Report Technology
Gate.io Releases Latest Proof Of Reserves Report, Reports $10.87B In Total Assets And $2.42B In Excess Reserves
May 9, 2025
How STON.fi’s Omniston is Making DeFi Simpler — and What’s Coming Next
Interview Business Markets Technology
How STON.fi’s Omniston is Making DeFi Simpler — and What’s Coming Next
May 9, 2025
How GAMEE Is Making Web3 Irresistibly Fun
Interview Business Markets Technology
How GAMEE Is Making Web3 Irresistibly Fun
May 9, 2025
Bitget Announces Strategic Partnership With SWEAT To Boost Movement Economy In Web3
News Report Technology
Bitget Announces Strategic Partnership With SWEAT To Boost Movement Economy In Web3
May 9, 2025