New!
Inworld Text-to-Speech is available now
Voximplant
New!
Inworld Text-to-Speech is available now
NewsEventsVoximplant KitGlossary

Tag: speech recognition

What is Automatic Speech Recognition?

What is Automatic Speech Recognition?

How many times a day do you talk to a computer? We’re not referring to the exasperated exclamation you direct at your laptop when it overheats and crashes. We want you to think about the moments you speak to a device and it actually listens.

Cartesia Realtime TTS now available in Voximplant

Cartesia Realtime TTS now available in Voximplant

Voximplant now includes a native Cartesia module for streaming, low-latency text-to-speech (TTS). You can use a single VoxEngine API to synthesize speech in real time, connect it to any call (PSTN, SIP, WebRTC, WhatsApp) and control playback from a Large Language Model (LLM) or other source, all inside VoxEngine.

What Is a Voice AI Orchestration Platform?

What Is a Voice AI Orchestration Platform?

Learn how a Voice AI Orchestration Platform connects LLMs, STT/TTS, turn‑taking, and telephony (PSTN, SIP, WebRTC) to build reliable real‑time voice agents. See benefits, architecture, and how Voximplant helps.

Deepgram Voice Agent now available in Voximplant

Deepgram Voice Agent now available in Voximplant

Voximplant now includes a native Deepgram module that connects any Voximplant call to Deepgram’s Voice Agent API for real-time, speech‑to‑speech conversations. You can stream audio from phone numbers, SIP trunks, WhatsApp, or WebRTC into Deepgram’s unified agent environment—combining STT, LLM reasoning, and TTS—and play responses via Voximplant’s serverless runtime with minimal latency.