Cohesive AI Voice: Turn Your Text into Top-quality Spoken Audio in Minutes

by Damir Yalalov

Published: June 22, 2023 at 2:16 am Updated: June 22, 2023 at 2:16 am

by Karolina Gaszcz

Edited and fact-checked: June 22, 2023 at 2:16 am

Cohesive AI Voice is a new tool that offers a comprehensive solution for users looking to add professional voiceovers to their content. With Cohesive, you can generate high-quality scripts for your videos or podcasts. The user-friendly interface lets you easily distribute roles among the application’s diverse set of two dozen voices, whether you need a voiceover in English, Spanish, French, or another supported language.

What sets Cohesive apart from competitors such as Google’s SoundStorm is its full-fledged editor and the fact that it is already available to users. You can try Cohesive for free and experience its range of features firsthand.

Not only does Cohesive excel in voice acting, but it also offers assistance in various other forms of content creation. From writing tweets and blog posts to drafting non-disclosure agreements and even crafting song lyrics, Cohesive is a versatile tool for creative expression.

Transforming your storytelling has never been easier with Cohesive AI’s human-like voices. Each sentence is meticulously crafted to ensure a convincing and lifelike delivery, adding depth and authenticity to your content. Moreover, you have the ability to generate a wide range of emotions and styles, from joy to anger, and even whispering.

This week, Meta unveiled Voicebox, a generative text-to-speech model that aims to do for speech what ChatGPT and DALL-E did for text and image generation. The system is a non-autoregressive flow-matching model trained to infill speech, given audio context and text. It was trained on over 50,000 hours of unfiltered audio, using recorded speech and transcripts from public domain audiobooks in various languages. According to Meta, the model outperforms current state-of-the-art systems in intelligibility and audio similarity while operating up to 20 times faster than current TTS systems. The Voicebox app and source code are not being released to the public, but the company has released a series of audio examples and a research paper. The research team hopes the technology will find its way into prosthetics, in-game NPCs, and digital assistants in the future.

In other news, London-based voice AI startup ElevenLabs has raised $19 million in a Series A funding round to advance its voice AI research projects and product deployments. The company’s valuation is estimated to be around $100 million. The round was led by former GitHub CEO Nat Friedman, former Head of AI at Y Combinator Daniel Gross, and Andreessen Horowitz. ElevenLabs’ technology, which turns text into speech using synthetic voices, cloned voices, or new voices tailored to gender, age, and accent preferences, has attracted interest from various creative sectors, including independent authors, video game developers, visually impaired users, and the world’s first AI radio channel, Super Hi-Fi.
