June 19, 2023

Meta Unveils Voicebox, Text-to-Speech Generative AI Tool

Voicebox is Meta’s breakthrough in generative speech AI, a model that transforms text into realistic and expressive speech. Like ChatGPT or DALL-E, it is a generative model, and thanks to in-context learning it can perform speech generation tasks such as content editing, sampling, and style conversion without being specifically trained for them.

Voicebox sets itself apart from other text-to-speech models by excelling at a range of tasks, including noise removal, text-to-speech synthesis, and cross-lingual style transfer, pushing the boundaries of synthetic speech generation. It also surpasses current models in speed, running 20 times faster.

Voicebox underwent extensive training using a dataset comprising over 50,000 hours of unfiltered audio. The AI model was trained using Meta’s innovative “Flow Matching” technique, a versatile alternative to diffusion-based learning methods employed by other generative models. 
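The article does not describe how Flow Matching works, so the following is only a minimal sketch of the general idea as it appears in the published conditional flow matching objective, not Meta's implementation. A model is trained to predict the velocity that carries a noise sample toward a data sample along a straight-line path. The function names, the `SIGMA_MIN` value, and the toy "model" are all assumptions made for illustration; Voicebox additionally conditions on text and audio context, which is omitted here.

```python
import random

SIGMA_MIN = 1e-4  # assumed small constant; not a value from the article


def flow_matching_pair(x0, x1, t, sigma_min=SIGMA_MIN):
    """Return (x_t, target_velocity) for one training example.

    x0 is a noise sample, x1 is a data sample (lists of floats),
    and t is a time in [0, 1]. x_t lies on a straight-line path
    from x0 (at t=0) to roughly x1 (at t=1).
    """
    scale = 1.0 - (1.0 - sigma_min) * t
    x_t = [scale * a + t * b for a, b in zip(x0, x1)]
    # The target is the time derivative of x_t along that path.
    target = [b - (1.0 - sigma_min) * a for a, b in zip(x0, x1)]
    return x_t, target


def flow_matching_loss(predict_velocity, x1, rng):
    """Mean squared error between predicted and target velocity."""
    x0 = [rng.gauss(0.0, 1.0) for _ in x1]  # fresh Gaussian noise
    t = rng.random()                        # random time in [0, 1)
    x_t, target = flow_matching_pair(x0, x1, t)
    predicted = predict_velocity(x_t, t)
    return sum((p - u) ** 2 for p, u in zip(predicted, target)) / len(x1)


def zero_model(x_t, t):
    """Hypothetical stand-in for a neural network: always predicts zero."""
    return [0.0 for _ in x_t]


if __name__ == "__main__":
    rng = random.Random(0)
    data_sample = [0.5, -0.25, 1.0]  # made-up "audio feature" vector
    print(flow_matching_loss(zero_model, data_sample, rng))
```

In a real system the stand-in `zero_model` would be a neural network, and this loss would be averaged over many samples and minimized by gradient descent. Unlike diffusion training, the regression target here is a fixed velocity along a simple path, which is one reason the approach is described as a versatile alternative.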

Meta’s training dataset includes recorded speech and transcripts from public-domain audiobooks in multiple languages, such as English, French, Spanish, German, Polish, and Portuguese.

According to Mark Zuckerberg, Voicebox is “the first ever generative AI speech model that can do tasks it wasn’t specifically trained on.”

In the future, Voicebox and similar AI models could provide natural-sounding voices for virtual assistants and non-player characters in the metaverse. They could also let visually impaired people hear written messages read in familiar voices and give creators easy tools for editing audio tracks in videos.

Voicebox and the Dangers of Deepfakes

However, Voicebox may pose ethical and social challenges, especially when it comes to deepfakes. Deepfakes are AI-generated synthetic media that imitate a person’s voice or likeness, often with malicious intent. Voicebox could be used to create convincing deepfakes that impersonate someone’s voice or make them appear to say things they never said. This could have serious implications for privacy, security, and trust.

Microsoft’s president, Brad Smith, raised concerns last month about the harm caused by deepfakes. He emphasized the need for mechanisms to differentiate between genuine and AI-generated material, particularly in cases of malicious intent. He called for accountability and safety measures to maintain human control over critical infrastructure governed by AI systems. He also proposed a system, similar to a Know Your Customer (KYC) approach, in which developers monitor how their technology is used and provide transparency to help identify manipulated videos.

Meta says it is aware of the potential harm that Voicebox could cause and is working on an effective way to distinguish authentic speech from audio generated by Voicebox. The model is still under development and is not currently available to the public.

About The Author

Agne Cimerman is a journalist who covers the latest trends and developments in the metaverse, AI, and Web3 industries for the Metaverse Post. Her passion for storytelling has led her to conduct numerous interviews with experts in these fields, always seeking to uncover exciting and engaging stories. Agne holds a Bachelor’s degree in literature and has an extensive background in writing about a wide range of topics, including travel, art, and culture. She has also volunteered as an editor for an animal rights organization, where she helped raise awareness about animal welfare issues. Contact her on [email protected].
