Are you ready to hear how top executives are integrating and optimizing AI investments for success? Join us in San Francisco on July 11-12 to learn from the leaders themselves. Learn More


Last week, Meta Platforms’ artificial intelligence research arm introduced Voicebox, a machine learning model that can generate speech from text. What sets Voicebox apart from other text-to-speech models is its ability to perform many tasks that it has not been trained for, including editing, noise removal and style transfer.

The model was trained using a special method developed by Meta researchers. While Meta has not released Voicebox due to ethical concerns about misuse, the initial results are promising and could power many applications in the future.

‘Flow Matching’

Voicebox is a generative model that can synthesize speech across six languages: English, French, Spanish, German, Polish and Portuguese. But what makes it truly unique is its ability to perform many text-guided speech generation tasks through in-context learning. This means that it can replicate voices across languages, edit out mistakes in speech, and more.

>>Don’t miss our special issue: Building the foundation for customer data quality.<<

Event

Transform 2023

Join us in San Francisco on July 11-12, where top executives will share how they have integrated and optimized AI investments for success and avoided common pitfalls.