Automatic Speech Recognition (ASR)
We support the following third-party service providers for ASR services:| Type | ASR Provider | Deployment | Languages | Regions | Notes |
|---|---|---|---|---|---|
| Native | Microsoft Azure Speech Services ASR | Cloud & On-Prem | 100+ languages/locales | Global Azure regions | Supports custom speech models and phrase lists for improving recognition accuracy. |
| Native | Google Cloud Speech-to-Text | Cloud | 125+ languages | Global | Supports automatic punctuation and speaker diarization features. |
| Native | Amazon Transcribe | Cloud | 100+ languages/dialects | Global AWS regions | Supports channel identification and domain-specific vocabularies. |
| Native | Deepgram | Cloud & On-Prem | Multiple languages | Global | Low latency real-time transcription; custom model training supported. |
| Native | OpenAI ASR | Cloud | 100+ languages | Global | Uses Whisper models for multilingual transcription and translation. |
| Native | Amivoice ASR | Cloud | Japanese, English, Chinese, Korean | - | - |
| Custom | Emotech ASR | - | - | - | - |
| Custom | RTZR ASR | - | - | - | - |
Text to Speech (TTS)
We support the following third-party service providers for TTS services:| Type | TTS Provider | Deployment | Languages | Regions | Notes |
|---|---|---|---|---|---|
| Native | Microsoft Azure Speech Services TTS | Cloud & On-Prem | 100+ languages | Global Azure regions | Neural voices and custom neural voice training available. |
| Native | Google Cloud Text-to-Speech | Cloud | 50+ languages | Global | Includes WaveNet neural voices. |
| Native | Amazon Polly | Cloud | 30+ languages | Global AWS regions | Supports neural voices and SSML speech marks. |
| Native | Deepgram TTS | Cloud & On-Prem | Limited languages | Global | Real-time conversational TTS optimized for voice agents. |
| Native | ElevenLabs | Cloud | 30+ languages | Global | Human-like voices with stability, speed, and style controls. |
| Native | OpenAI TTS | Cloud | Multiple languages | Global | Natural sounding voices with limited voice options. |
| Native | NvIdia Riva | On-Prem | 11+ languages | - | - |
| Custom | Sarvam TTS | - | - | - | - |
| Custom | IST TTS | - | - | - | - |
| Custom | Emotech TTS | - | - | - | - |
Voice Biometrics
We support the following third-party service providers for voice biometrics:| Type | Vendor | Engine | Deployment | Notes |
|---|---|---|---|---|
| Custom | ID R&D | ID Voice | Cloud & On-Prem | Voice authentication and fraud detection engine. |