News Report Technology
January 08, 2023

VALL-E: Microsoft’s new zero-shot text-to-speech model can duplicate anyone’s voice from a three-second sample

Since the release of the first text-to-speech (TTS) model, researchers have been looking for ways to improve the way these systems generate speech. The latest model from Microsoft, VALL-E, is a significant step forward in this regard.

VALL-E is a transformer-based TTS model that can generate speech in any voice after hearing only a three-second sample of that voice. This is a significant improvement over previous models, which required a much longer training period to generate a new voice.

VALL-E is an amazing technological feat that has the potential to change the way we interact with digital media.

Additionally, the generated speech preserves the intonation, emotional tone, and style of the original voice. This is an important step toward making TTS systems sound more natural.

The model is transformer-based, with an architecture that resembles DALL-E 1, not to be confused with the diffusion-based DALL-E 2. The code has not been released, and some users are skeptical that Microsoft will publish it.

However, Microsoft has released a few examples of the model in action, and it is clear that this is a major advance in TTS technology.


Damir is the team leader, product manager, and editor at Metaverse Post, covering topics such as AI/ML, AGI, LLMs, Metaverse, and Web3-related fields. His articles attract an audience of over a million users every month. He has 10 years of experience in SEO and digital marketing. Damir has been mentioned in Mashable, Wired, Cointelegraph, The New Yorker, Inside.com, Entrepreneur, BeInCrypto, and other publications. He travels between the UAE, Turkey, Russia, and the CIS as a digital nomad. Damir earned a bachelor's degree in physics, which he believes has given him the critical thinking skills needed to succeed in the ever-changing landscape of the internet.

Damir Yalalov