Tag: speech recognition

What is Automatic Speech Recognition?

How many times a day do you talk to a computer? We’re not referring to the exasperated exclamation you direct at your laptop when it overheats and crashes. We want you to think about the moments you speak to a device and it actually listens.

Tags: ASR, speech recognition, speech-to-text

Enhanced speech recognition model is now available

62% Word Error Rate (WER) improvement for US English

Tags: ASR, ivr, speech recognition

High quality Speech Recognition is now available

We are happy to announce high-quality speech recognition for both call recording transcription and real-time recognition scenarios.

Tags: TTS, ASR, Integration, voice ai

OpenAI Client update: gpt-realtime GA alignment

OpenAI recently announced the GA version of its Realtime API, which Voximplant now fully supports.

Tags: TTS, text-to-speech, voice ai, realtime

Inworld Text-to-Speech now available in Voximplant

Voximplant has new realtime speech generation for voice AI from Inworld, our latest Voice AI text-to-speech (TTS) partner. Together, we combine state-of-the-art TTS with carrier-grade connectivity so you can build voice agents that sound like your brand, not a generic robot.

Tags: voice agent, voice ai, multimodal, ultravox

Introducing Voximplant integration with Ultravox.ai

The new integration enables instant connection of any Voximplant call to an Ultravox agent, delivering seamless voice-to-voice conversations.

Extend Cartesia Line Agents to SIP, WhatsApp, and Global Phone Networks

Voximplant now includes a native Cartesia Line / Agents connector that connects any Voximplant call to a Cartesia Line voice agent for real-time, speech-to-speech conversations—over PSTN, SIP, WebRTC, or WhatsApp Business Calling—without building custom media gateways or WebSocket streaming infrastructure.

Deepgram Voice Agent now available in Voximplant

Voximplant now includes a native Deepgram module that connects any Voximplant call to Deepgram’s Voice Agent API for real-time, speech‑to‑speech conversations. You can stream audio from phone numbers, SIP trunks, WhatsApp, or WebRTC into Deepgram’s unified agent environment—combining STT, LLM reasoning, and TTS—and play responses via Voximplant’s serverless runtime with minimal latency.

New WebSocket privacy feature for compliance-oriented environments

Sign Up for a free Voximplant developer account or talk to our experts

Voice AI agents can now act, not just talk: introducing the VoxEngine MCP Client

Voximplant adds enhanced pipeline options for Voice AI
