All of the songs in this thread were generated from a single text prompt with no edits.
Title: It Started to Sing
Style: “Pop pop-rock, country, top charts song.”
Title: It Started to Sing (Jazz Version)
Style: “A jazz pop top charts song with emotional vocals, catchy chorus, and trumpet solos.”
Title: Broke my Heart
Style: “Smooth Contemporary R&B with subtle Electronic elements, featuring a pulsing 104 BPM drum machine beat, filtered synths, lush electric piano, and soaring strings, with an intimate mood.”
Title: My Love
Style: “Indie Rock with 90s influences, featuring a combination of clean and distorted guitars, driving drum beats, and a prominent bassline, with a moderate tempo around 120 BPM, and a mix of introspective and uplifting moods, evoking a sense of nostalgia and hope.”
• • •
Missing some Tweet in this thread? You can try to
force a refresh
Introducing Eleven v3 (alpha) - the most expressive Text to Speech model ever.
Supporting 70+ languages, multi-speaker dialogue, and audio tags such as [excited], [sighs], [laughing], and [whispers].
Now in public alpha and 80% off in June.
This is a research preview. It requires more prompt engineering than previous models - but the generations are breathtaking.
We’ll continue fine-tuning to improve reliability and control.
The new architecture of Eleven v3 deeply understands text - delivering much greater expressiveness.
And now you can guide generations more directly using audio tags:
- Emotions [sad] [angry] [happily]
- Delivery direction [whispers] [shouts]
- Non-verbal reactions [laughs] [clears throat] [sighs]
Independent benchmarking shows that 95% of dogs couldn't distinguish between ElevenLabs AI-generated barks and real ones, a result that got tails wagging among the international AI community.
Text to Sound Effects is here. Our newest AI Audio model generates sound effects, short instrumental tracks, soundscapes, and a wide variety of character voices, all from a text prompt. Available now for all users.
Everyone from content creators, video game developers, to film and television studios, uses sound effects to create rich and immersive content. Now, in addition to AI voiceovers, you can generate all of the sounds you need with just a prompt.
Everything you hear in this video was generated by ElevenLabs sound and voice models. In this thread, we shared some additional clips that help show off the range of this new model.
Thank you to our partners @Shutterstock who provided licensed tracks from their expansive and diverse audio library to help create our model.