Amazon develops world’s largest text-to-speech model with ‘emergent’ qualities

The new model is called Big Adaptive Streamable TTS with Emergent abilities — BASE TTS.

In what is being called the largest text-to-speech model ever developed, researchers at Amazon AGI have made waves after creating the Big Adaptive Streamable TTS with Emergent abilities (BASE TTS).

Text-to-Speech (TTS) models are used in the development of voice assistants for smart devices and are employed to convert written text into spoken words, allowing voice assistants to communicate with users in a natural and human-like manner.

Furthermore, TTS models produce outputs that closely resemble natural speech, incorporating elements such as intonation, emphasis, and inflection.

Blog