Tag: speech-to-text

Blog
>
Tag: speech-to-text

Enhanced speech recognition model is now available

62% Word Error Rate (WER) improvement for US English

ASR speech-to-text

Hot Summer Speech-to-Text Updates

Following Google’s release of new Speech API, we are happy to announce improved quality of call records transcription.

TTS text-to-speech voice ai realtime

Inworld Text-to-Speech now available in Voximplant

Voximplant has new realtime speech generation for voice AI from Inworld, our latest Voice AI text-to-speech (TTS) partner. Together, we combine state-of-the-art TTS with carrier-grade connectivity so you can build voice agents that sound like your brand, not a generic robot.

Extend Cartesia Line Agents to SIP, WhatsApp, and Global Phone Networks

Voximplant now includes a native Cartesia Line / Agents connector that connects any Voximplant call to a Cartesia Line voice agent for real-time, speech-to-speech conversations—over PSTN, SIP, WebRTC, or WhatsApp Business Calling—without building custom media gateways or WebSocket streaming infrastructure.

Cartesia Realtime TTS now available in Voximplant

Voximplant now includes a native Cartesia module for streaming, low-latency text-to-speech (TTS). You can use a single VoxEngine API to synthesize speech in real time, connect it to any call (PSTN, SIP, WebRTC, WhatsApp) and control playback from a Large Language Model (LLM) or other source, all inside VoxEngine.

Ultravox adds SIP to its Voice AI Services using Voximplant

Today Ultravox announced they are directly integrating Voximplant into their platform to provide SIP capabilities. The integration builds on Voximplant’s deep telephony and Voice AI tooling

What Is a Voice AI Orchestration Platform?

Learn how a Voice AI Orchestration Platform connects LLMs, STT/TTS, turn‑taking, and telephony (PSTN, SIP, WebRTC) to build reliable real‑time voice agents. See benefits, architecture, and how Voximplant helps.

Deepgram Voice Agent now available in Voximplant

Voximplant now includes a native Deepgram module that connects any Voximplant call to Deepgram’s Voice Agent API for real-time, speech‑to‑speech conversations. You can stream audio from phone numbers, SIP trunks, WhatsApp, or WebRTC into Deepgram’s unified agent environment—combining STT, LLM reasoning, and TTS—and play responses via Voximplant’s serverless runtime with minimal latency.

Voximplant adds enhanced pipeline options for Voice AI

Voximplant now lets developers build full-cascade voice AI pipelines in VoxEngine without sacrificing turn-taking quality.

Grok Voice Agent API now available in Voximplant

Voximplant now includes a native Grok module that connects any Voximplant call to xAI’s Grok Voice Agent API for real-time, speech-to-speech conversations. With a single VoxEngine scenario, you can interact via audio with Grok over phone numbers, SIP trunks and infrastructure, WhatsApp Business, or WebRTC into Grok — all without building custom media gateways or WebSocket streaming infrastructure.

voximplant kit podcast voximplant-kit-cc-news product management voximplant-kit-automation-news web sdk webrtc video kit-updates call center ios sdk sip voximplant pstn api

Tag: speech-to-text

Enhanced speech recognition model is now available

Hot Summer Speech-to-Text Updates

Sign Up for a free Voximplant developer account or talk to our experts

Inworld Text-to-Speech now available in Voximplant

Extend Cartesia Line Agents to SIP, WhatsApp, and Global Phone Networks

Cartesia Realtime TTS now available in Voximplant

Ultravox adds SIP to its Voice AI Services using Voximplant

What Is a Voice AI Orchestration Platform?

Deepgram Voice Agent now available in Voximplant

Voximplant adds enhanced pipeline options for Voice AI

Grok Voice Agent API now available in Voximplant

Sign Up for a free Voximplant developer account or talk to our experts

Tag: speech-to-text

Sign Up for a free Voximplant developer account or talk to our experts

Sign Up for a free Voximplant developer account or talk to our experts

Contact Us