xAI Launches Voice Agent Builder: No-Code Platform Harnesses Grok Voice to Beat GPT and Gemini in Telephony Benchmarks
Technology | July 1, 2026 | FreeReadText Team

Elon Musk's xAI enters the voice AI market with Voice Agent Builder, a no-code platform powered by Grok Voice Think Fast 1.0 that scores 67.3% on the τ-voice Bench — far outpacing Google Gemini 3.1 Flash Live (43.8%) and OpenAI GPT Realtime 1.5 (35.3%) — with pricing starting at $0.05 per minute.

On July 1, 2026, xAI — Elon Musk's AI company — launched Voice Agent Builder in public beta, marking its first major entry into the enterprise voice agent market. The no-code platform lets users create production-grade voice agents in under two minutes through a browser interface, powered by xAI's proprietary Grok Voice Think Fast 1.0 model. The launch was announced via xAI's official channels alongside detailed documentation on the company's website.

Voice Agent Builder uses a native end-to-end speech-to-speech architecture that processes raw audio directly, bypassing the traditional ASR-to-LLM-to-TTS pipeline that most competitors rely on. On τ-voice Bench, a benchmark that evaluates real-world telephony performance under background noise, accents, and interruptions, Grok Voice Think Fast 1.0 scored 67.3%, ahead of Google Gemini 3.1 Flash Live at 43.8% (a 23.5-point gap) and OpenAI GPT Realtime 1.5 at 35.3% (a 32.0-point gap). The platform supports 25+ languages, includes 80+ built-in voice tones, and can clone a voice from two minutes of reference audio.

The platform bundles a full telephony infrastructure stack: each account receives a free phone number for inbound and outbound calls, along with built-in knowledge retrieval, tool calling that supports the Model Context Protocol for CRM and order management integrations, and compliance guardrails. SOC 2, HIPAA, and GDPR compliance is in place for regulated industry deployments. Pricing is set at $0.05 per minute for the model API, with an optional $0.01 per minute for xAI-provided telephony, or $0.06 per minute combined. That positions it competitively against combinations of Vapi or Retell with third-party TTS providers.
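To make the per-minute pricing concrete, here is a minimal sketch of the arithmetic in Python. The two rates come from the figures quoted above. The function name, the linear billing model, and the absence of minimums, rounding increments, or volume discounts are assumptions for illustration, not details confirmed by xAI.

```python
from decimal import Decimal

# Rates quoted in the article (USD per minute).
MODEL_RATE = Decimal("0.05")      # Grok Voice model API
TELEPHONY_RATE = Decimal("0.01")  # optional xAI-provided telephony

def estimate_monthly_cost(minutes: int, use_xai_telephony: bool = True) -> Decimal:
    """Estimate cost assuming simple linear per-minute billing.

    Hypothetical helper: real invoices may apply minimums, rounding
    increments, or discounts that are not modeled here.
    """
    rate = MODEL_RATE + (TELEPHONY_RATE if use_xai_telephony else Decimal("0"))
    return (rate * minutes).quantize(Decimal("0.01"))

# Example: a contact center handling 10,000 call minutes per month.
with_telephony = estimate_monthly_cost(10_000)
model_only = estimate_monthly_cost(10_000, use_xai_telephony=False)
print(f"With xAI telephony: ${with_telephony}")
print(f"Model API only:     ${model_only}")
```

Under these assumptions, 10,000 minutes would come to $600.00 with xAI telephony and $500.00 for the model API alone.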

The launch intensifies an already crowded voice agent market where Five9, Vapi, Bland.ai, and ElevenLabs have been competing for enterprise contact center workloads. xAI brings structural advantages: Grok Voice is already deployed in Tesla vehicles, the platform integrates with the broader Grok ecosystem including Gmail, Google Calendar, and Outlook tool connectors, and xAI's distribution reach — amplified by Musk's X platform — gives it a marketing channel that pure-play voice startups cannot match. The beta is available immediately, with general availability expected later in 2026.

Tags: xAI, Grok Voice, Voice Agent Builder, No-Code, Speech-to-Speech, Call Center AI
