Remote business has become far more relevant as conditions in world markets change. Several companies are struggling because they are not set up to move their employees to remote work, yet situations like these call for immediate measures.
Unified Communications (UC) is a combination of real-time communication technologies, such as chat and document collaboration, integrated with offline communication methods that do not require the other person to be present (e-mail, voicemail, SMS, fax).
We started with audio, then added video calls, and now it's time to let our developers use instant messaging and presence, two very important features of the UC stack.
Voximplant now includes a native Grok module that connects any Voximplant call to xAI’s Grok Voice Agent API for real-time, speech-to-speech conversations. With a single VoxEngine scenario, you can talk to Grok over phone numbers, SIP trunks and infrastructure, WhatsApp Business, or WebRTC, all without building custom media gateways or WebSocket streaming infrastructure.
Voximplant now includes a native MCP Client for VoxEngine, giving developers direct connectivity to any MCP server and full control over every tool call.
Voximplant now supports Inworld's Realtime API, so you can bring Inworld's expressive, conversation-aware agents into real phone calls, SIP, and WhatsApp without custom media infrastructure.
Voximplant has added a WebSocket privacy option that redacts message payloads from logs across all WebSocket-based services (Voice AI connectors and external speech systems) and speech control modules.
Voximplant now includes a native Cartesia module for streaming, low-latency text-to-speech (TTS). You can use a single VoxEngine API to synthesize speech in real time, connect it to any call (PSTN, SIP, WebRTC, WhatsApp), and control playback from a large language model (LLM) or other source, all inside VoxEngine.
Voximplant now includes a native Deepgram module that connects any Voximplant call to Deepgram’s Voice Agent API for real-time, speech-to-speech conversations. You can stream audio from phone numbers, SIP trunks, WhatsApp, or WebRTC into Deepgram’s unified agent environment, which combines STT, LLM reasoning, and TTS, and play responses via Voximplant’s serverless runtime with minimal latency.