Company
OpenAI
13 listed entries across models, skills, and agents.
| Model | Type | Description |
|---|---|---|
| Whisper large v3 | Speech-to-text | Converts spoken audio into text. Sometimes called "ASR" (automatic speech recognition). Examples: Deepgram, AssemblyAI. |
| OpenAI GPT-4o Transcribe | Speech-to-text | Converts spoken audio into text. Sometimes called "ASR" (automatic speech recognition). Examples: Deepgram, AssemblyAI. |
| OpenAI gpt-4o-mini-transcribe | Speech-to-text | Converts spoken audio into text. Sometimes called "ASR" (automatic speech recognition). Examples: Deepgram, AssemblyAI. |
| OpenAI gpt-4o-transcribe-diarize | Speech-to-text | Converts spoken audio into text. Sometimes called "ASR" (automatic speech recognition). Examples: Deepgram, AssemblyAI. |
| OpenAI GPT-Realtime-Whisper | Speech-to-text | Converts spoken audio into text. Sometimes called "ASR" (automatic speech recognition). Examples: Deepgram, AssemblyAI. |
| OpenAI tts-1 | Text-to-speech | Converts written text into spoken audio. Examples: ElevenLabs, Cartesia. |
| OpenAI tts-1-hd | Text-to-speech | Converts written text into spoken audio. Examples: ElevenLabs, Cartesia. |
| OpenAI GPT-4o mini Realtime | Speech-to-speech | Converts spoken audio directly into spoken audio, skipping the intermediate text step. Examples: OpenAI Realtime, Ultravox. |
| OpenAI GPT-Realtime-2 | Speech-to-speech | Converts spoken audio directly into spoken audio, skipping the intermediate text step. Examples: OpenAI Realtime, Ultravox. |
| OpenAI Realtime | Speech-to-speech | Converts spoken audio directly into spoken audio, skipping the intermediate text step. Examples: OpenAI Realtime, Ultravox. |
| OpenAI GPT-Realtime-Translate | Speech translation | Converts speech in one language to text or speech in another, often in real time. |