previous icon Back to blog
Jun 12, 2026
5 minutes read

The Bar for Voice AI Is Higher Than You Think. Here Is What We Rebuilt to Clear It

The customers who call are not a random sample of your customer base. They tried the FAQs. They sent a message. They waited, or they did not bother, because what they need to say is too complicated, too urgent, or too important to type. By the time they pick up the phone, they are already a little frustrated and already invested in getting an answer. The next three minutes will shape how they feel about your business more than any chat ever could. That is who voice AI is talking to. And most voice AI is not built with that person in mind.

Why voice is the strictest channel in CX

The emotional weight has a direct technical consequence. A two-second delay in a chat response is invisible. On a phone call, it is the moment a caller starts wondering if the line dropped. A generic waiting sound in a chat widget is mildly annoying. On a phone call, on loop, it signals immediately that no one really thought about this experience. A wrong answer in chat can be corrected in the next message. On a phone call, it is the last thing the caller hears before they ask for a human.

Every requirement that is forgiving on other channels is strict on voice. Response speed. Turn-taking. What the caller hears while the AI is thinking. How your brand sounds. Whether the AI can actually do something, or only answer questions. Any single failure is enough to make the conversation feel broken and send that high-value caller straight to a human, or worse, to a competitor.

Most voice AI products treat this as a feature list. We treated it as a rebuild.

Already using HALO? Read everything about HALO Voice in our Knowledge Center

What we rebuilt in HALO Voice

A new speech pipeline. We rebuilt speech-to-text and text-to-speech from the ground up. Transcription is faster and more accurate. Speech output is smoother and more consistent. Time-to-first-word was a deliberate priority, because the gap between a caller finishing their sentence and the AI starting its response is felt on voice in a way it simply is not anywhere else.

Intelligent turn detection. HALO Voice identifies when a caller has finished speaking and responds at natural intervals. No talking over each other. No silences that make the caller wonder if the system is still working. The conversation moves the way a real one does, because the AI knows when it is its turn.

Contextual filler audio and the Audioverse. When the AI is processing, the caller is waiting. We replaced generic processing sounds with contextual filler phrases that match what the agent is actually doing. During a knowledge search: "Let me see what I can find." During tool execution: "Hang on, almost got it." The Audioverse adds subtle ambient background sound that mimics a real customer service environment. It is the difference between a caller feeling like they reached someone and feeling like they reached something.

ElevenLabs and the Lexicon. We moved to ElevenLabs as the primary voice provider for HALO, deprecating Azure TTS. The difference in naturalness and expressiveness is significant. Voice personas are configurable per language and per environment, previewable before deployment. The Lexicon gives you exact control over pronunciation: prevent a brand name from being translated, set an alias for a URL or abbreviation, spell out an acronym letter by letter. Small details that add up to an experience that sounds designed, not generated.

Agentic tool integration. HALO Voice connects natively to the same agents, tools, and knowledge base that power the rest of HALO. The voice agent executes real workflows: booking, looking up account information, routing, escalating. Not just retrieving pre-written answers. DTMF input support extends this to callers who need to enter precise numeric information.

WhatsApp Calling. Customers want to call the way they want to call. For a growing share of them, that means WhatsApp, not the dial pad. HALO Voice handles both. A WhatsApp call reaches the same voice agent, with the same speech pipeline, the same turn detection, the same tools, the same context. No separate setup, no degraded experience, no second-class channel. Whether the caller dials your number or taps the call button in a WhatsApp thread they already had open, the conversation works the same way.

The part that changes everything

All of this makes HALO Voice a better voice AI. But there is one requirement no amount of speech quality or filler audio can solve on its own: the caller should not have to repeat themselves.

The caller who already tried other channels almost certainly has a history with your business. A WhatsApp message sent last week. An order placed yesterday. A complaint logged this morning. If the voice AI does not know any of that, every other improvement is undermined the moment it asks them to start over.

This is where HALO is built differently. HALO Voice is not a stand alone voice product. It is the same agent that handles your WhatsApp messages, your chat, your messaging, now picking up the phone. Same knowledge base. Same tools. Same customer data layer. Context set in any channel is automatically available in every other one, in both directions. The voice agent knows who it is talking to before the first word is spoken, because it has already been talking to them.

That is what separates a voice automation project from a genuine CX advantage. Not a voice AI that sounds natural in isolation. An agent that already knows the customer, on whichever channel they reach for next.

Get started with HALO: how to create AI agents in HALO

The channel worth getting right

Voice has been treated as a legacy channel for too long. Too expensive to scale, too risky to automate, too complex to modernize. The result is that most businesses have invested heavily in every other channel and left their most emotionally charged touchpoint running on infrastructure from a decade ago.

The customers who call are already your most engaged ones. They needed help badly enough to pick up the phone. How that call goes shapes how they feel about your business far more than any chat interaction will. The technology to handle those calls well exists now. The question is whether the platform behind it is built to match.

That is what HALO is.

Want to see how it works in your setup? Read the full documentation in our Knowledge Center or get in touch with our team.

Was this article interesting?
Share it!
Tags
CM.com
connects tens of thousands of companies with millions of consumers via their mobile phone each day. Behind the scenes, from our innovative platform, CM.com makes sure companies can use these millions of messages, phone calls and payments to become part of people’s lives.

Latest Articles

HALO Whatsapp voice
Dec 10, 2025 • HALO

HALO Introduces WhatsApp Voice

Customers are increasingly communicating through apps rather than traditional phone calls. That is why we are introducing an important expansion of HALO today: Voice Agents can now automatically handle incoming calls made through WhatsApp. We are also looking back at improvements we have added to HALO over the past months. Developments that bring voice and chat closer together and help organizations automate faster and more consistently.

blog-christmas-carol
Nov 25, 2025 • CM.com

An eCommerce Christmas Carol: The Customer Journey in One Package

' Tis the season for conversational commerce - and CM.com can deliver the whole customer journey in one package! From getting your promotional material seen to facilitating payments within the conversation, and some post-purchase customer care to turn holiday shoppers into loyal fans of your brand.

3 types of AI
Oct 27, 2025 • AI

Unleashing the Power of AI: How Generative, Agentic, and Predictive AI Are Transforming Customer Experience

Artificial Intelligence (AI) has been in development for decades, but the way we use it today has changed dramatically. With the advent of ChatGPT and other applications, AI has suddenly become tangible for the general public. While it was previously used primarily for specific, often invisible applications (think fraud detection in banking or predictive maintenance in industry), it now actively assists in content creation, enhancing customer experiences, and streamlining processes. Within customer experience, three forms of AI are particularly relevant: generative, agentic, and predictive AI. In this article, we’ll break them down and explain how to leverage them effectively.

halo-insurance
Oct 25, 2025 • AI

From Claims to Customer Questions: How AI Agents Help Insurers

The insurance industry is known for its complex processes and heavy administrative load. Fragmented communication, outdated systems, and complicated policy conditions mean that finding the right information or processing changes often takes far longer than it should. AI agents can change that. They answer questions, pull real-time data from internal systems, and seamlessly trigger processes.

Implementation checklist for AI agents
Oct 02, 2025 • AI

Your AI Agent Implementation Checklist

AI agents aren’t just shaping the future they’re transforming how companies serve and connect with their customers right now. From answering service requests instantly, to guiding shoppers through a purchase, to spotting upsell opportunities in real time, the question is no longer if you should implement AI, but how quickly you can put it to work.

blog-picking-ai-platform
Sep 29, 2025 • HALO

From Selection to Success: How to Choose the Right AI Platform

An AI platform isn’t just another tool you purchase. It’s the foundation on which your organization operates and innovates. The choices you make today will shape how you work in the future. While you may start with just a few agents supporting specific use cases, over time more processes will be taken over by agents. That’s why it’s critical to ensure the foundation you lay now is cohesive, scalable, and backed by solid governance and compliance.

blog-halo-ecommerce
Sep 27, 2025 • AI

AI Agents: The Accelerators of Conversational Commerce

The way consumers search for and process information online is rapidly changing thanks to AI. Where we used to type in search terms, scroll through dozens of results, and manually filter them, we are now getting used to having conversations. With ChatGPT, Google’s AI features, and other assistants, answers come faster and are more relevant. That same way of interacting is now taking over e-commerce at high speed. For retailers, this is the moment to step in: the webshop as we know it—where customers have to actively search themselves—is giving way to personal conversations that directly lead to action.

Is this region a better fit for you?
Go
close icon