News Report Technology
December 13, 2023

Sarvam AI Releases India’s First Hindi LLM ‘OpenHathi’, a Week After Raising $41 million

In Brief

Sarvam AI’s OpenHathi is the first Hindi LLM aimed at development of open models and datasets, to promote AI innovation for Indian languages.

Sarvam AI Releases India’s First Hindi LLM 'OpenHathi', A Week After Raising $41 million

Indian Generative AI startup Sarvam AI released OpenHathi-Hi-v0.1, the first Hindi large language model (LLM) in the OpenHathi series, just a week after raising $41 million in its Series A funding round led by Lightspeed Ventures.

Hindi is India’s most widely spoken language, with over 43% of the population declaring it as their mother tongue. The model is built on Meta AI’s Llama2-7B architecture and delivers performance on par with GPT-3.5 for Indic languages.

“We show that our model works as well as, if not better than GPT-3.5 on various Hindi tasks while maintaining its English performance,” the company said on social media platform X (formerly Twitter).

According to Sarvam AI, the goal of the OpenHathi series is to contribute to the development of open models and datasets to promote innovation in Indian language AI and is collaborating with academic partners at AI4Bharat, who have provided language resources and benchmarks for this initiative.

AI4Bharat is a research lab at the Indian Institute of Technology (IIT) Madras – a public technical university in India, which works on developing open-source datasets, tools, models and applications for Indian languages.

OpenHathi uses a 48,000-token extension of Llama2-7B’s tokenizer and undergoes a two-step training process. In the first phase, the model concentrates on embedding alignment, aligning randomly initialized Hindi embeddings. The second phase involves bilingual language modeling, where the model learns cross-lingual attention across tokens.

Full-Stack Generative AI Platform to be Launched Soo

Further, the company is encouraging people to be creative and come up with new ideas by using and improving upon this release. The company invites developers to build specialized models for various purposes using the OpenHathi-Hi-v0.1 model as a starting point.

Sarvam AI also plans to release advanced models for businesses on its comprehensive generative AI platform, which according to the startup will be launched soon.

While raising its Series A funding, the company announced its plan to develop the “full-stack” for Generative AI ranging from research-led innovations in training custom AI models to an enterprise-grade platform for authoring and deployment.

The company added that this full-stack approach will accelerate the adoption of generative AI in India, especially given that enterprises see the potential but are grappling with how to leverage it for their business.

Established in July 2023 by Vivek Raghavan and Pratyush Kumar, have previously worked at AI4Bharat – backed by Infosys co-founder Nandan Nilekani.

Recently, India launched BharatGPT, an LLM-based solution developed in collaboration with Indian conversational AI startup CoRover.ai, which is positioned as the answer to existing generative AI or LLMs. It intends to fill a critical gap in the market, providing an indigenous solution available in over 12 Indian languages, covering video, voice and text interactions.

Disclaimer

In line with the Trust Project guidelines, please note that the information provided on this page is not intended to be and should not be interpreted as legal, tax, investment, financial, or any other form of advice. It is important to only invest what you can afford to lose and to seek independent financial advice if you have any doubts. For further information, we suggest referring to the terms and conditions as well as the help and support pages provided by the issuer or advertiser. MetaversePost is committed to accurate, unbiased reporting, but market conditions are subject to change without notice.

About The Author

Kumar is an experienced Tech Journalist with a specialization in the dynamic intersections of AI/ML, marketing technology, and emerging fields such as crypto, blockchain, and NFTs. With over 3 years of experience in the industry, Kumar has established a proven track record in crafting compelling narratives, conducting insightful interviews, and delivering comprehensive insights. Kumar's expertise lies in producing high-impact content, including articles, reports, and research publications for prominent industry platforms. With a unique skill set that combines technical knowledge and storytelling, Kumar excels at communicating complex technological concepts to diverse audiences in a clear and engaging manner.

More articles
Kumar Gandharv
Kumar Gandharv

Kumar is an experienced Tech Journalist with a specialization in the dynamic intersections of AI/ML, marketing technology, and emerging fields such as crypto, blockchain, and NFTs. With over 3 years of experience in the industry, Kumar has established a proven track record in crafting compelling narratives, conducting insightful interviews, and delivering comprehensive insights. Kumar's expertise lies in producing high-impact content, including articles, reports, and research publications for prominent industry platforms. With a unique skill set that combines technical knowledge and storytelling, Kumar excels at communicating complex technological concepts to diverse audiences in a clear and engaging manner.

Hot Stories

Big Bets: The Biggest Gambling Cryptocurrencies of 2024

by Viktoriia Palchik
April 22, 2024
Join Our Newsletter.
Latest News

Big Bets: The Biggest Gambling Cryptocurrencies of 2024

by Viktoriia Palchik
April 22, 2024

The DOGE Frenzy: Analysing Dogecoin’s (DOGE) Recent Surge in Value

The cryptocurrency industry is rapidly expanding, and meme coins are preparing for a significant upswing. Dogecoin (DOGE), ...

Know More

The Evolution of AI-Generated Content in the Metaverse

The emergence of generative AI content is one of the most fascinating developments inside the virtual environment ...

Know More
Join Our Innovative Tech Community
Read More
Read more
Exponential Bitcoin Bull Market Is Yet To Commence, Notes Crypto Analyst Willy Woo
Markets News Report Technology
Exponential Bitcoin Bull Market Is Yet To Commence, Notes Crypto Analyst Willy Woo
April 22, 2024
Unibot Unveils Three Growth Initiatives, Outlines Plans For Referral System, Token Buyback, And Rewards
Markets News Report Technology
Unibot Unveils Three Growth Initiatives, Outlines Plans For Referral System, Token Buyback, And Rewards
April 22, 2024
Big Bets: The Biggest Gambling Cryptocurrencies of 2024
Digest Gambling Wiki Technology
Big Bets: The Biggest Gambling Cryptocurrencies of 2024
April 22, 2024
Galxe Community Initiates Vote On Proposal For Token Symbol Change And Contract Migration
Markets News Report Technology
Galxe Community Initiates Vote On Proposal For Token Symbol Change And Contract Migration
April 22, 2024