Build with one flexible voice AI API. Get started with $200 in credit for transcription or text-to-speech: https://t.co/ssASTgz2u1
Oct 4, 2024 • 8 tweets • 2 min read
After spending time with #OpenAI’s Voice Mode in #ChatGPT, we were eager to explore the API behind it.
A few weeks ago, we launched our Voice Agent API, and we’ve been curious to see how the two compare. Here’s what we found—just some early thoughts. 👇
⚡Latency: Both solutions performed similarly when it came to response time. Whether handling simple or complex tasks, the latency felt roughly equal at ~<1sec.
Nov 14, 2023 • 19 tweets • 5 min read
The wait is over! With 60M+ minutes transcribed, our next-gen speech-to-text model Nova-2 is now available.
What's new?
✅ Expanded languages: Spanish, Hindi, German, French, Portuguese
✅ Custom model training
✅ On-prem deployment
Let's dive in...🧵 dpgr.am/9f52615
In our early access release, Nova-2 impressed developers with its unmatched performance and value compared to competitors.
✅ An average 30% reduction in word error rate (WER)
✅ 5-40x faster speed
✅ 3-5x lower costs
✅ Full feature set: diarization, smart formatting, and more