Azure Voice Live
converts spoken audio directly into spoken audio, skipping the intermediate text step. OpenAI Realtime, Ultravox.fact-checked, all required fields present, publicly visiblereal-time
speech.dev has not published an operational assessment for this entry yet. Facts below are source-linked where available.
Overview
- Publisher
- Microsoft
- Lab
- azure-speech
- Last updated
- 2026-05-23
- Modality
STS converts spoken audio directly into spoken audio, skipping the intermediate text step. OpenAI Realtime, Ultravox.
Languages a voice model trained on multiple languages. Quality varies enormously by language — "supports 40 languages" may be great in 3 and mediocre in the other 37.
multilingual
Pricing
- Billing model
- Per token
Last verified May 23, 2026
Technical
- Streaming
- Supported · bidirectional
- Hosting
- saas-vendor-cloud
- Self-hostable
- No
API access
WSS
API endpoints
wss
SDK languages
python, typescript, csharp, java
Regions (vendor buckets)
Macro areas as published by the vendor (often broad; not a country list).
Compliance (vendor-published)
FedRAMP ModerateGDPR DPAHIPAA BAAISO 27001PCI DSSSOC 2 Type II
Suggest an update
Facts-layer corrections only — source URLs required. Opens a GitHub issue; a maintainer runs the content agent after triage. Not for operational notes.
Lab metadata: Update Azure Speech · All request types · Full guide