Deepgram Nova-3
converts spoken audio into text. Sometimes called "ASR" (automatic speech recognition). Deepgram, AssemblyAI.fact-checked, all required fields present, publicly visiblereal-time
speech.dev has not published an operational assessment for this entry yet. Facts below are source-linked where available.
Overview
- Publisher
- Deepgram
- Lab
- deepgram
- Last updated
- 2026-05-23
- Modality
STT converts spoken audio into text. Sometimes called "ASR" (automatic speech recognition). Deepgram, AssemblyAI.
Languages a voice model trained on multiple languages. Quality varies enormously by language — "supports 40 languages" may be great in 3 and mediocre in the other 37.
en-us, en-gb, es, fr, de, pt, nl, hi, ja, ko, zh
Pricing
Rate tiers
Streaming monolingual
Per minute · $0.0048 per minute
Streaming multilingual
Per minute · $0.0058 per minute
Pre-recorded monolingual
Per minute · $0.0077 per minute
Pre-recorded multilingual
Per minute · $0.0092 per minute
- Default billing model
- Per minute
- Headline rate
- $0.0048 per minute
Last verified May 23, 2026
Technical
- Streaming
- Supported · bidirectional
- Hosting
- saas-vendor-cloud
- Self-hostable
- No
API access
WSS
API endpoints
wsshttps
SDK languages
python, typescript, go, csharp
Regions (vendor buckets)
Macro areas as published by the vendor (often broad; not a country list).
Compliance (vendor-published)
CCPAGDPR DPAHIPAA BAAPCI DSSSOC 2 Type ISOC 2 Type II
LiveKit Inference
Listed on LiveKit Agents inference catalog · verified 2026-05-23
| Listed model | Build & Ship | Scale |
|---|---|---|
| Nova-3 (Monolingual) | $0.0048 | $0.0042 |
| Nova-3 (Multilingual) | $0.0058 | $0.0050 |
| Nova-3 Medical | $0.0077 | $0.0065 |
LiveKit inference pricing (per minute)
Suggest an update
Facts-layer corrections only — source URLs required. Opens a GitHub issue; a maintainer runs the content agent after triage. Not for operational notes.
Lab metadata: Update Deepgram · All request types · Full guide