Built for voice agents that can't wait 600ms
Models
Realtime voice: Compare first-audio latency and barge-in behavior
Medical dictation: Transcribe with diarization and drug-term accuracy
Multi-voice TTS: Generate two speakers with cloned voices
Telephony stream: Return mu-law with region pinned to us-east
provider.sort=latency • provider.region=auto • require_parameters=true
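
For readers who want to see how the dotted settings above might look as a structured request body, here is a minimal sketch. It only expands the three key=value pairs exactly as written into nested JSON. The helper name `expand_dotted` is hypothetical, and the resulting shape is an assumption, not a documented request schema. `require_parameters` is left at the top level because that is how the line spells it.

```python
import json


def expand_dotted(settings: str) -> dict:
    """Expand 'a.b=c • d=e' style settings into a nested dict (hypothetical helper)."""
    result: dict = {}
    for pair in settings.split("•"):
        key, _, raw = pair.strip().partition("=")
        # Treat bare true/false as booleans; keep every other value as a string.
        value = {"true": True, "false": False}.get(raw, raw)
        *parents, leaf = key.split(".")
        node = result
        for part in parents:
            # Create intermediate objects for each dotted prefix.
            node = node.setdefault(part, {})
        node[leaf] = value
    return result


settings = "provider.sort=latency • provider.region=auto • require_parameters=true"
body = expand_dotted(settings)
print(json.dumps(body, indent=2))
```

The two `provider.*` keys end up grouped under one `provider` object, and `require_parameters` becomes a boolean rather than the string "true".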