Riva provides deep learning-based automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) models for AI practitioners and developers. ASR, TTS, and NMT are voice interfaces in speech-AI -based applications, such as call center agent assists, digital assistants, and video call transcriptions.
ASR converts speech to text and usually is the first step in a speech pipeline, so its transcription accuracy influences all downstream tasks. TTS generates human-like voices from text. NMT translates words from one language to another.
Riva is used across all industries—from telecommunications and finance to healthcare, retail, and automotive—since every company needs to interact with its customers.