Company
Microsoft
11 listed entries across models, skills, and agents.
Order by
| Model | Type |
|---|---|
| Azure Batch Transcription | converts spoken audio into text. Sometimes called "ASR" (automatic speech recognition). Deepgram, AssemblyAI. |
| Azure OpenAI Whisper | converts spoken audio into text. Sometimes called "ASR" (automatic speech recognition). Deepgram, AssemblyAI. |
| Azure Speech Service | converts spoken audio into text. Sometimes called "ASR" (automatic speech recognition). Deepgram, AssemblyAI. |
| Azure Custom Neural Voice | converts written text into spoken audio. ElevenLabs, Cartesia. |
| Azure Speech HD (DragonHD) | converts written text into spoken audio. ElevenLabs, Cartesia. |
| Azure Speech Service | converts written text into spoken audio. ElevenLabs, Cartesia. |
| MAI-Voice-1 | converts written text into spoken audio. ElevenLabs, Cartesia. |
| VibeVoice 1.5B | converts written text into spoken audio. ElevenLabs, Cartesia. |
| VibeVoice 7B | converts written text into spoken audio. ElevenLabs, Cartesia. |
| Azure Voice Live | converts spoken audio directly into spoken audio, skipping the intermediate text step. OpenAI Realtime, Ultravox. |