News Report Technology
August 10, 2023

Stability AI Beats SoftBank in Releasing Japanese Language Model

In Brief

Stability AI today released its first Japanese language model (LM), Japanese StableLM Alpha.

The 7 billion-parameter general-purpose language model is currently the only publicly available LM for Japanese speakers.

With this release, Stability has beaten SoftBank to the punch as the latter announced last week that it will be developing homegrown large language models for the Japanese market.

Stability AI, the generative AI company behind Stable Diffusion, today announced the release of its first Japanese Language Model (LM) named Japanese StableLM Alpha, accessible via Hugging Face.

The company claims that the 7 billion-parameter general-purpose language model is currently the only best-performing publicly available LM for Japanese speakers, according to a benchmark suite against four sets of other Japanese LMs.

A commercially available model, the Japanese StableLM Base Alpha 7B, will be released under the Apache License 2.0. The model is trained on 750 billion tokens of Japanese and English text using large scale data sourced from the web.

In addition to open datasets, training data includes datasets created by Stability AI’s Japanese community, in cooperation with the Japanese team of the EleutherAI Polyglot project. Stability AI used an extension of EleutherAI’s GPT-NeoX software to train the Japanese StableLM Base Alpha 7B model.

Another model, the Japanese StableLM Instruct Alpha 7B, is created solely for research purposes and released exclusively for research use. “This model is additionally tuned to follow user instructions, and trained with Supervised Fine-tuning (SFT) using multiple open datasets,” Stability AI tweeted.

Both models were tested using EleutherAI’s Language Model Evaluation Harness on tasks like sentence classification, sentence pair classification, question answering, and sentence summarization, with an average score 54.71%. Stability AI claims that this score puts its Japanese StableLM Instruct Alpha 7B far ahead of other Japanese models.

“We are proud of our first big step towards contributing to the Japanese generative AI ecosystem,” said Meng Lee, Project Lead of Japanese StableLM.

”We look forward to continuing to create models across several modalities, built specifically to reflect Japanese culture, language and aesthetics”.

With the release of its Japanese LM, Stability AI has beaten SoftBank to the punch of releasing language models for the Japanese market. Last Friday, SoftBank announced that it has launched a new company to research and develop homegrown Large Language Models (LLM) for the Japanese market. 

Furthermore, SoftBank is allocating around 20 billion JPY (more than $140 million) to its generative AI computing platform, set to launch in the fall of this year. It’s a waiting game to determine whose Japanese Language Model will emerge triumphant in the long run.

Disclaimer

In line with the Trust Project guidelines, please note that the information provided on this page is not intended to be and should not be interpreted as legal, tax, investment, financial, or any other form of advice. It is important to only invest what you can afford to lose and to seek independent financial advice if you have any doubts. For further information, we suggest referring to the terms and conditions as well as the help and support pages provided by the issuer or advertiser. MetaversePost is committed to accurate, unbiased reporting, but market conditions are subject to change without notice.

About The Author

Cindy is a journalist at Metaverse Post, covering topics related to web3, NFT, metaverse and AI, with a focus on interviews with Web3 industry players. She has spoken to over 30 C-level execs and counting, bringing their valuable insights to readers. Originally from Singapore, Cindy is now based in Tbilisi, Georgia. She holds a Bachelor's degree in Communications & Media Studies from the University of South Australia and has a decade of experience in journalism and writing. Get in touch with her via [email protected] with press pitches, announcements and interview opportunities.

More articles
Cindy Tan
Cindy Tan

Cindy is a journalist at Metaverse Post, covering topics related to web3, NFT, metaverse and AI, with a focus on interviews with Web3 industry players. She has spoken to over 30 C-level execs and counting, bringing their valuable insights to readers. Originally from Singapore, Cindy is now based in Tbilisi, Georgia. She holds a Bachelor's degree in Communications & Media Studies from the University of South Australia and has a decade of experience in journalism and writing. Get in touch with her via [email protected] with press pitches, announcements and interview opportunities.

Hot Stories
Join Our Newsletter.
Latest News

Institutional Appetite Grows Toward Bitcoin ETFs Amid Volatility

Disclosures through 13F filings reveal notable institutional investors dabbling in Bitcoin ETFs, underscoring a growing acceptance of ...

Know More

Sentencing Day Arrives: CZ’s Fate Hangs in Balance as US Court Considers DOJ’s Plea

Changpeng Zhao is poised to face sentencing in a U.S. court in Seattle today.

Know More
Join Our Innovative Tech Community
Read More
Read more
Injective Joins Forces With AltLayer To Bring Restaking Security To inEVM
Business News Report Technology
Injective Joins Forces With AltLayer To Bring Restaking Security To inEVM
May 3, 2024
Masa Teams Up With Teller To Introduce MASA Lending Pool, Enables USDC Borrowing On Base
Markets News Report Technology
Masa Teams Up With Teller To Introduce MASA Lending Pool, Enables USDC Borrowing On Base
May 3, 2024
Velodrome Launches Superchain Beta Version In Coming Weeks And Expands Across OP Stack Layer 2 Blockchains
Markets News Report Technology
Velodrome Launches Superchain Beta Version In Coming Weeks And Expands Across OP Stack Layer 2 Blockchains
May 3, 2024
CARV Announces Partnership With Aethir To Decentralize Its Data Layer And Distribute Rewards
Business News Report Technology
CARV Announces Partnership With Aethir To Decentralize Its Data Layer And Distribute Rewards
May 3, 2024