1 link tagged with all of: transcription + speech-to-text + real-time + voice-ai + language-detection
Links
ElevenLabs introduced Scribe v2 Realtime, a speech-to-text model that transcribes live speech with latency under 150 ms. It supports multiple languages and includes automatic language detection and voice activity detection, which makes it suitable for voice agents and real-time captioning. The model achieves 93.5% accuracy across various languages and is available through the ElevenLabs API.
Tags: speech-to-text, voice-ai, transcription, real-time, language-detection