ElevenLabs Emerges from Beta with Foundational AI Speech Model for 28 Languages

by Cindy Tan

Published: August 23, 2023 at 9:38 am Updated: August 23, 2023 at 9:39 am

by Victor Dey

Edited and fact-checked: August 23, 2023 at 9:38 am

In Brief

Voice AI platform ElevenLabs has launched out of beta.

Simultaneously, the platform has released Eleven Multilingual v2, a new foundational deep-learning model that supports 28 languages.

Voice AI platform ElevenLabs today launched a new foundational AI speech model as the company emerges from beta. The company said the new model, named Eleven Multilingual v2, can accurately produce ‘emotionally rich’ AI audio in 28 languages.

ElevenLabs said its latest AI speech model was built through in-house research over an 18-month development period. During this time, the company studied the intricacies of human speech and built new mechanisms that let the model comprehend context, express emotions in generated speech, and synthesize new, unique voices.

Previously available only in English, Polish, German, Spanish, French, Italian, Hindi and Portuguese, the model now also supports 20 more languages: Chinese, Korean, Dutch, Turkish, Swedish, Indonesian, Filipino, Japanese, Ukrainian, Greek, Czech, Finnish, Romanian, Danish, Bulgarian, Malay, Slovak, Croatian, Classic Arabic and Tamil.

ElevenLabs highlighted that the expanded language support will enable content creators to craft localized audio material aimed at global markets spanning Europe, Asia, and the Middle East.

To generate speech with Eleven Multilingual v2, users enter text in any of the supported languages into the text-to-speech platform.

The company explained that, whether a user employs a synthetic or a cloned voice, the speaker's distinct vocal attributes, including the original accent, remain consistent across all languages. Moreover, a single voice can be used to generate speech across the 28 supported languages.
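As a rough illustration of the "one voice, many languages" workflow described above, the sketch below builds one text-to-speech request per language while reusing the same voice. Nothing in it comes from the article: the endpoint URL, the `xi-api-key` header, the `eleven_multilingual_v2` model identifier, and the `YOUR_VOICE_ID` and `YOUR_API_KEY` placeholders are assumptions based on ElevenLabs' public REST API, and `build_tts_request` is a hypothetical helper. The requests are only constructed, not sent.

```python
import json
import urllib.request

# Assumptions, not taken from the article: the REST base URL and the
# identifier under which Eleven Multilingual v2 is exposed.
API_BASE = "https://api.elevenlabs.io/v1"
MODEL_ID = "eleven_multilingual_v2"


def build_tts_request(text: str, voice_id: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a text-to-speech request for one piece of text."""
    body = json.dumps({"text": text, "model_id": MODEL_ID}).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/text-to-speech/{voice_id}",
        data=body,
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


# One voice, several languages: only the input text changes.
samples = {
    "English": "Hello, and welcome.",
    "Spanish": "Hola y bienvenidos.",
    "Polish": "Witamy serdecznie.",
}
requests_by_language = {
    language: build_tts_request(text, voice_id="YOUR_VOICE_ID", api_key="YOUR_API_KEY")
    for language, text in samples.items()
}
```

Passing one of these requests to `urllib.request.urlopen` with a valid API key and voice ID would be expected to return the generated audio as bytes; the placeholders here will not authenticate.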

“Our text-to-speech generation tools help level the playing field and bring top quality spoken audio capabilities to all the creators out there,” Mati Staniszewski, CEO and co-founder of ElevenLabs, said in a statement. “Those benefits now extend to multilingual applications across almost 30 languages. Eventually we hope to cover even more languages and voices with the help of AI, and eliminate the linguistic barriers to content.”

The roll-out of Eleven Multilingual v2 follows the public release of Professional Voice Cloning earlier this month. The offering allows users to generate an accurate digital replica of their voices. With the latest update, the tool will now enable users to directly translate their voice audio to any of the newly added languages.

ElevenLabs says it has amassed more than 1 million registered users across the creative, entertainment and publishing spaces since its beta launch in January. The company announced a $19 million Series A round in June, led by former GitHub CEO Nat Friedman, ex-Y Combinator partner Daniel Gross, and Andreessen Horowitz.

ElevenLabs also recently partnered with D-ID, the generative AI video content platform, to combine their generative AI tools.


About The Author

Cindy is a journalist at Metaverse Post, covering topics related to web3, NFT, metaverse and AI, with a focus on interviews with Web3 industry players. She has spoken to over 30 C-level execs and counting, bringing their valuable insights to readers. Originally from Singapore, Cindy is now based in Tbilisi, Georgia. She holds a Bachelor's degree in Communications & Media Studies from the University of South Australia and has a decade of experience in journalism and writing.