previous icon Back to blog
May 19, 2026
5 minutes read

The Bar for Voice AI Is Higher Than You Think. Here Is What We Rebuilt to Clear It

The customers who call are not a random sample of your customer base. They tried the FAQs. They sent a message. They waited, or they did not bother, because what they need to say is too complicated, too urgent, or too important to type. By the time they pick up the phone, they are already a little frustrated and already invested in getting an answer. The next three minutes will shape how they feel about your business more than any chat ever could. That is who voice AI is talking to. And most voice AI is not built with that person in mind.

Rutger de Ruiter
Rutger de Ruiter,
Head of Product Engage

Why voice is the strictest channel in CX

The emotional weight has a direct technical consequence. A two-second delay in a chat response is invisible. On a phone call, it is the moment a caller starts wondering if the line dropped. A generic waiting sound in a chat widget is mildly annoying. On a phone call, on loop, it signals immediately that no one really thought about this experience. A wrong answer in chat can be corrected in the next message. On a phone call, it is the last thing the caller hears before they ask for a human.

Every requirement that is forgiving on other channels is strict on voice. Response speed. Turn-taking. What the caller hears while the AI is thinking. How your brand sounds. Whether the AI can actually do something, or only answer questions. Any single failure is enough to make the conversation feel broken and send that high-value caller straight to a human, or worse, to a competitor.

Most voice AI products treat this as a feature list. We treated it as a rebuild.

Already using HALO? Read everything about HALO Voice in our Knowledge Center

What we rebuilt in HALO Voice

A new speech pipeline. We rebuilt speech-to-text and text-to-speech from the ground up. Transcription is faster and more accurate. Speech output is smoother and more consistent. Time-to-first-word was a deliberate priority, because the gap between a caller finishing their sentence and the AI starting its response is felt on voice in a way it simply is not anywhere else.

Intelligent turn detection. HALO Voice identifies when a caller has finished speaking and responds at natural intervals. No talking over each other. No silences that make the caller wonder if the system is still working. The conversation moves the way a real one does, because the AI knows when it is its turn.

Contextual filler audio and the Audioverse. When the AI is processing, the caller is waiting. We replaced generic processing sounds with contextual filler phrases that match what the agent is actually doing. During a knowledge search: "Let me see what I can find." During tool execution: "Hang on, almost got it." The Audioverse adds subtle ambient background sound that mimics a real customer service environment. It is the difference between a caller feeling like they reached someone and feeling like they reached something.

ElevenLabs and the Lexicon. We moved to ElevenLabs as the primary voice provider for HALO, deprecating Azure TTS. The difference in naturalness and expressiveness is significant. Voice personas are configurable per language and per environment, previewable before deployment. The Lexicon gives you exact control over pronunciation: prevent a brand name from being translated, set an alias for a URL or abbreviation, spell out an acronym letter by letter. Small details that add up to an experience that sounds designed, not generated.

Agentic tool integration. HALO Voice connects natively to the same agents, tools, and knowledge base that power the rest of HALO. The voice agent executes real workflows: booking, looking up account information, routing, escalating. Not just retrieving pre-written answers. DTMF input support extends this to callers who need to enter precise numeric information.

WhatsApp Calling. Customers want to call the way they want to call. For a growing share of them, that means WhatsApp, not the dial pad. HALO Voice handles both. A WhatsApp call reaches the same voice agent, with the same speech pipeline, the same turn detection, the same tools, the same context. No separate setup, no degraded experience, no second-class channel. Whether the caller dials your number or taps the call button in a WhatsApp thread they already had open, the conversation works the same way.

The part that changes everything

All of this makes HALO Voice a better voice AI. But there is one requirement no amount of speech quality or filler audio can solve on its own: the caller should not have to repeat themselves.

The caller who already tried other channels almost certainly has a history with your business. A WhatsApp message sent last week. An order placed yesterday. A complaint logged this morning. If the voice AI does not know any of that, every other improvement is undermined the moment it asks them to start over.

This is where HALO is built differently. HALO Voice is not a stand alone voice product. It is the same agent that handles your WhatsApp messages, your chat, your messaging, now picking up the phone. Same knowledge base. Same tools. Same customer data layer. Context set in any channel is automatically available in every other one, in both directions. The voice agent knows who it is talking to before the first word is spoken, because it has already been talking to them.

That is what separates a voice automation project from a genuine CX advantage. Not a voice AI that sounds natural in isolation. An agent that already knows the customer, on whichever channel they reach for next.

Get started with HALO: how to create AI agents in HALO

The channel worth getting right

Voice has been treated as a legacy channel for too long. Too expensive to scale, too risky to automate, too complex to modernize. The result is that most businesses have invested heavily in every other channel and left their most emotionally charged touchpoint running on infrastructure from a decade ago.

The customers who call are already your most engaged ones. They needed help badly enough to pick up the phone. How that call goes shapes how they feel about your business far more than any chat interaction will. The technology to handle those calls well exists now. The question is whether the platform behind it is built to match.

That is what HALO is.

Want to see how it works in your setup? Read the full documentation in our Knowledge Center or get in touch with our team.

Was this article interesting?
Share it!
Tags
Rutger de Ruiter
Rutger de Ruiter,
Head of Product Engage
logo linkedin icon

Latest Articles

3 types of ai
Mar 20, 2026 • HALO

New in HALO: Analytics That Show You What Your AI Is Actually Doing

Customer service teams are used to precise analytics. Average handle time, first contact resolution, CSAT scores, numbers that are clean, comparable, and easy to report on. AI agents change that dynamic in a fundamental way.

HALO Whatsapp voice
Nov 28, 2025 • HALO

HALO Introduces WhatsApp Voice

Customers are increasingly communicating through apps rather than traditional phone calls. That is why we are introducing an important expansion of HALO today: Voice Agents can now automatically handle incoming calls made through WhatsApp. We are also looking back at improvements we have added to HALO over the past months. Developments that bring voice and chat closer together and help organizations automate faster and more consistently.

blog-christmas-carol
Nov 20, 2025 • CM.com

An eCommerce Christmas Carol: The Customer Journey in One Package

' Tis the season for conversational commerce - and CM.com can deliver the whole customer journey in one package! From getting your promotional material seen to facilitating payments within the conversation, and some post-purchase customer care to turn holiday shoppers into loyal fans of your brand.

Ai governance
Nov 14, 2025 • AI

AI Is Getting Smarter and More Independent. How Do You Stay in Control?

More and more companies are rolling out AI agents that go far beyond generating content or answering simple questions. They handle customer requests, update systems, trigger workflows, and even complete transactions. This brings speed, efficiency, and relief from manual, repetitive work. But once AI starts doing instead of merely assisting, something fundamental shifts. You need more than clever technology. You need a clear way to stay in control of what it does, why it does it, and the limits within which it can operate. This is where AI governance comes in. Not as a tick-box compliance layer, but as a practical foundation that keeps AI predictable, explainable, and safe as it becomes more autonomous.

blog-black-friday-customer-retention
Nov 13, 2025 • AI

From Casual Consumers to Loyal Fans: How to retain customers after Black Friday

Black Friday is a huge shopping event, drawing crowds of eager shoppers hunting for deals. But once the rush fades, the real challenge begins: turning one-time shoppers into returning customers. Customer retention is vital for long-term success, and the period after Black Friday is the ideal moment to turn short-term excitement into lasting relationships. In this blog, you’ll learn how to keep customers engaged and coming back long after the rush is over.

AI-now and future
Oct 28, 2025 • HALO

The Business Impact of Agentic AI: now and in the future

Imagine customer inquiries being answered 24/7 without wait times, automatic updates on delayed orders, or fully digital shopping assistants guiding customers from browsing to purchase. These are just a few examples of what Agentic AI can achieve. In this article, CM.com’s Marketing Lead AI & SaaS, Sander Harryvan, and Product Marketer, Tom Faas, share their insights on where businesses currently stand in adopting Agentic AI, what the next phase looks like, and why Agentic AI will have a profound impact on the way we do business in the coming years.

3 types of AI
Oct 15, 2025 • AI

Unleashing the Power of AI: How Generative, Agentic, and Predictive AI Are Transforming Customer Experience

Artificial Intelligence (AI) has been in development for decades, but the way we use it today has changed dramatically. With the advent of ChatGPT and other applications, AI has suddenly become tangible for the general public. While it was previously used primarily for specific, often invisible applications (think fraud detection in banking or predictive maintenance in industry), it now actively assists in content creation, enhancing customer experiences, and streamlining processes. Within customer experience, three forms of AI are particularly relevant: generative, agentic, and predictive AI. In this article, we’ll break them down and explain how to leverage them effectively.

halo-insurance
Oct 10, 2025 • AI

From Claims to Customer Questions: How AI Agents Help Insurers

The insurance industry is known for its complex processes and heavy administrative load. Fragmented communication, outdated systems, and complicated policy conditions mean that finding the right information or processing changes often takes far longer than it should. AI agents can change that. They answer questions, pull real-time data from internal systems, and seamlessly trigger processes.

Is this region a better fit for you?
Go
close icon