Built for voice agents that can't wait 600ms

OpenRouter for Audio

One endpoint for every TTS, STT, and realtime voice model. Route by latency, region, provider, cost, or capability.

LIVE
p50 87ms
p95 214ms
best sonic-3.5
310 model routes • 14 providers • 7 regions
client
language
Building voice products? You're choosing between dozens of APIs, model IDs, voice cloning flows, stream protocols, codecs, regions, and billing units. We normalize it into one audio API.
Voice portability

Keep the voice. Switch the model.

Your users know the voice they hear every day. Bring that voice to another provider when you need lower latency, better quality, a different region, or a different model family.

Send us a source provider and voice, choose a destination provider, and Allmodels returns a portable voice slug like elevenlabs/edward that can be used on compatible routes such as cartesia/sonic.

POST /v1/voices/port
{
  "source": {
    "provider": "elevenlabs",
    "voice": "edward"
  },
  "destination": {
    "provider": "cartesia"
  }
}

{
  "voice": "elevenlabs/edward",
  "provider": "cartesia",
  "status": "ready"
}

One API call. Advanced audio features.

Ask for the capability you need. Allmodels maps the request to the provider-specific schema, stream, and model route that can satisfy it.

cloning

Voice cloning

Use one clone ID across providers. We translate provider-specific voice creation, voice IDs, and generation parameters.

dialogue

Multi-voice

Send speaker arrays for dialogue, dubbing, podcasts, and agent handoffs without reimplementing each provider's dialogue API.

language

Multilingual STT/TTS

Route by language, accent, diarization, translation, timestamps, and model-level capability constraints.

streams

Output stream you want

Request WebSocket, SSE, chunked HTTP, file response, PCM16, Opus, MP3, WAV, mu-law, verbose JSON, or word timestamps.

routing

Region-aware routing

Default to the closest Allmodels edge and closest provider server, or pin provider.region directly.

billing

Provider pass-through

One account, one usage surface, and routing controls inspired by OpenRouter's provider selection model.

Live stats

Mocked for the prototype, designed to become the public routing telemetry surface.

0 routes • 0 providers • 0 regions

Last 60s - TTFB per provider

Cartesia Deepgram ElevenLabs OpenAI Groq

What we measure

TTFB, first audio, P50/P95 by provider, model, and region; error rates; fallback time; and stream format compatibility.

7routing regions
<50msfallback target
5stream formats
24/7probe cadence