ElevenLabs Profile picture
Jul 19, 2024 3 tweets 2 min read Read on X
Introducing our new Turbo 2.5 model - Hindi, French, Spanish, Mandarin and 27 other languages just got 3x faster.

This unlocks high-quality low-latency conversational AI for nearly 80% of the world.

For the first time, we support Vietnamese, Hungarian and Norwegian text to speech. And English is now 25% faster compared to Turbo v2.
To get started building with ElevenLabs go to

For volume discounts and unlimited concurrency, head to elevenlabs.io/api
elevenlabs.io/enterprise
For those already using our API, just switch the model_id to “eleven_turbo_v2_5” Image

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with ElevenLabs

ElevenLabs Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @elevenlabsio

Aug 19
Introducing Chat Mode

You can now build text-only conversational agents.

Ideal for:
- Customers that prefer typing to speaking.
- Precise inputs like order IDs or email addresses.
- Solving simple issues, handing off to our voice agents for complex tasks. Image
Chat Mode is an extension of our Conversational Agent platform, designed to help you reach users in the modality that best fits their context.
ElevenLabs conversational agents are intelligent, real-time AI agents that talk, type, and take action.

Resolve customer issues, automate tasks, and deliver accurate answers - all grounded in your data, tailored to your workflows, and ready to deploy at enterprise scale.
Read 4 tweets
Aug 18
Introducing the Eleven Music API.

This is the first Music API for developers trained on licensed data and cleared for broad commercial use.

You can now integrate the highest quality AI music into your products and workflows.

Since launch, creators have generated over 750k songs with Eleven Music.
The Eleven Music API allows you to:

- Generate high quality tracks from text prompts
- Create vocal or instrumental versions in any genre
- Customize length, structure, and language Image
Created in collaboration with labels, publishers, and artists, songs created with the Eleven Music API are available for broad commercial use.

It’s designed for building apps across media and entertainment. Whether you’re delivering personalized mediations, producing music for video games, or creating AI generated ads.
Read 9 tweets
Jun 5
Introducing Eleven v3 (alpha) - the most expressive Text to Speech model ever.

Supporting 70+ languages, multi-speaker dialogue, and audio tags such as [excited], [sighs], [laughing], and [whispers].

Now in public alpha and 80% off in June.
This is a research preview. It requires more prompt engineering than previous models - but the generations are breathtaking.

We’ll continue fine-tuning to improve reliability and control.
The new architecture of Eleven v3 deeply understands text - delivering much greater expressiveness.

And now you can guide generations more directly using audio tags:
- Emotions [sad] [angry] [happily]
- Delivery direction [whispers] [shouts]
- Non-verbal reactions [laughs] [clears throat] [sighs]
Read 7 tweets
Apr 1
We pioneered the first ultra-realistic Text to Speech model, and recently launched the world's most accurate Speech to Text model, Scribe.

But we're not stopping there.

Today, we're taking one small step for man, and one giant leap for man's best friend...

with Text to Bark.
Introducing Text to Bark, the world's first AI-powered TTS model for dogs.

Simply type a message, choose your breed, and our models will convert it into fluent barking.

Try it with your own dog at
elevenlabs.io/text-to-bark
Independent benchmarking shows that 95% of dogs couldn't distinguish between ElevenLabs AI-generated barks and real ones, a result that got tails wagging among the international AI community.
Read 5 tweets
Dec 18, 2024
Meet Flash. Our newest model that generates speech in 75ms + application & network latency.

You’ve never experienced human-like TTS this fast.
Flash is our recommended model for low-latency, conversational voice agents.

You can use Flash today in our Conversational AI platform

Or build directly via the API using model id “eleven_flash_v2” and “eleven_flash_v2_5”: elevenlabs.io/docs/api-refer…Image
Flash v2 is English only and Flash v2.5 supports 32 languages

They both cost 1 credit for every 2 characters
Read 5 tweets
Dec 3, 2024
Conversational AI is here.

Build AI agents that can speak in minutes with low latency, full configurability, and seamless scalability.
Let us take care of Speech to Text, LLM integrations, Text to Speech, turn taking and interruption handling.

You focus on customizing your knowledge base, system prompt and voice.

elevenlabs.io/conversational…Image
You have the flexibility to swap out the LLM at any time so you always have access to the latest model.

Don’t want to use a default LLM? Bring your own server for full control over your agent: elevenlabs.io/docs/conversat…
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(