Menu

IBM Pushes Voice AI Into watsonx. Can It Drive Enterprise Adoption?

IBM Pushes Voice AI Into watsonx. Can It Drive Enterprise Adoption

Analyst(s): Nick Patience
Publication Date: March 31, 2026

IBM and ElevenLabs have partnered to integrate advanced voice AI into watsonx Orchestrate, enabling multilingual, natural-sounding AI agents. The move targets enterprise automation use cases with a focus on security, scalability, and compliance across industries.

What is Covered in This Article:

  • IBM and ElevenLabs integrate text-to-speech and speech-to-text into watsonx Orchestrate to enable voice-based AI agents.
  • The platform supports over 70 languages and 10,000+ voices, targeting enterprise use cases across regulated industries.
  • Security features such as PCI compliance, Zero Retention Mode, and data residency aim to meet enterprise deployment standards.
  • The collaboration expands agentic AI capabilities from text-based to voice-first interactions across workflows.
  • The move aligns with IBM’s broader focus on AI-driven automation and enterprise AI platforms.

The News: IBM and ElevenLabs announced a collaboration to integrate ElevenLabs’ text-to-speech (TTS) and speech-to-text (STT) technologies into IBM watsonx Orchestrate. The integration enables enterprises to build voice-enabled AI agents capable of interacting in more than 70 languages with multiple accents, supported by a library of over 10,000 voices. The platform is designed to support customer and employee workflows across industries such as government, banking, healthcare, insurance, and utilities.

The combined offering includes enterprise-grade security features such as PCI compliance for payment processing, Zero Retention Mode aligned with HIPAA requirements, and data residency controls. IBM watsonx Orchestrate connects to existing systems and automation tools, allowing organisations to build, deploy, manage, and govern AI agents at scale. The collaboration is intended to expand agentic AI capabilities from text-based systems to voice-first interactions while supporting high-volume and concurrent enterprise deployments.

IBM Pushes Voice AI Into watsonx. Can It Drive Enterprise Adoption?

Analyst Take: IBM ElevenLabs voice AI integration reflects a targeted expansion of watsonx Orchestrate into voice-first enterprise workflows, combining multilingual capabilities, security controls, and scalability to address enterprise automation needs. The integration enables organisations to deploy AI agents that move beyond text-based interfaces and support customer and employee interactions across regulated industries. With support for over 70 languages and access to more than 10,000 voices, the offering addresses multilingual communication requirements for global enterprises. Enterprise-grade protections, including PCI compliance and HIPAA-aligned Zero Retention Mode, position the offering for industries with strict regulatory requirements. The move aligns with IBM’s broader focus on agentic AI as a driver of enterprise automation and workflow orchestration.

Expanding Agentic AI into Voice-based Enterprise Workflows

The integration extends watsonx Orchestrate from text-based automation to voice-first interactions, allowing AI agents to handle customer support, sales, and internal operations through natural language conversations. Enterprises are increasingly deploying agentic AI to automate workflows and interact with customers and employees, creating demand for more intuitive communication interfaces. Legacy voice systems used in enterprise environments have been limited by rigid call flows, long wait times, and robotic-sounding outputs. By incorporating ElevenLabs’ TTS and STT capabilities, IBM enables more natural, responsive, and accessible voice interactions. IBM positions this as voice becoming a core interface for agentic AI rather than a secondary feature.

Multilingual Capabilities and Inclusivity in Global Deployments

The platform supports over 70 languages and multiple regional accents, addressing the need for global enterprises to serve diverse user bases. This is particularly relevant for sectors such as government, healthcare, and financial services, where communication must be accessible across different languages and regions. The availability of more than 10,000 voices allows organisations to tailor interactions to specific audiences and use cases. Multilingual support also enables enterprises to scale AI-driven interactions across geographies without relying on separate systems. This capability strengthens watsonx Orchestrate’s positioning for global deployments requiring consistent and inclusive communication.

Security and Compliance are Central to Enterprise Adoption

IBM combines ElevenLabs’ voice capabilities with enterprise-grade security features, including PCI compliance, HIPAA-aligned Zero Retention Mode, and data residency controls. These features are designed to address concerns around handling sensitive data in regulated industries such as banking, healthcare, and insurance. The integration emphasises secure deployment of voice-enabled AI agents at scale, particularly in high-volume and concurrent interaction environments. Compliance and data governance remain critical barriers for enterprise AI adoption, and this offering directly targets those requirements.

Strategic Positioning Within a Growing AI Agents Market

The integration comes as the AI agents market is projected to see high growth. IBM is positioning watsonx Orchestrate as a platform for building, managing, and governing AI agents across enterprise workflows. The partnership with ElevenLabs adds voice-first capabilities to this positioning, targeting demand for automated interactions in customer-facing and internal operations. IBM’s broader strategy includes expanding AI-enabled automation and integrating partner capabilities into its ecosystem. This move reinforces IBM’s focus on making watsonx Orchestrate central to enterprise AI deployments.

What to Watch:

  • Whether regulated-industry customers move beyond pilots, there’s an existing AI pattern of strong enterprise interest that stalls at production scale, and voice AI’s compliance requirements raise the bar further.
  • How ElevenLabs’ voice quality holds up under high-concurrency enterprise load compared with established CCaaS players such as Genesys and NICE, where the benchmark for naturalness is already reasonably high.
  • Whether IBM can demonstrate measurable workflow outcomes, such as cost per interaction, containment rates, and handle time, rather than capability breadth, which is what enterprise buyers will demand before committing at scale.
  • How IBM positions this against Salesforce Agentforce and Microsoft Copilot Studio, both of which are building voice capabilities into broader agent platforms – the competitive question for buyers is whether watsonx Orchestrate’s security posture justifies any integration overhead.
  • ElevenLabs’ ability to maintain enterprise commitments as it scales: the company is growing fast, and IBM enterprise customers will want assurance on SLAs, support tiers, and long-term roadmap stability.

See the complete press release on the ElevenLabs and IBM collaboration to integrate voice AI into watsonx Orchestrate on the company’s website.

Declaration of generative AI and AI-assisted technologies in the writing process: This content has been generated with the support of artificial intelligence technologies. Due to the fast pace of content creation and the continuous evolution of data and information, The Futurum Group and its analysts strive to ensure the accuracy and factual integrity of the information presented. However, the opinions and interpretations expressed in this content reflect those of the individual author/analyst. The Futurum Group makes no guarantees regarding the completeness, accuracy, or reliability of any information contained herein. Readers are encouraged to verify facts independently and consult relevant sources for further clarification.

Disclosure: Futurum is a research and advisory firm that engages or has engaged in research, analysis, and advisory services with many technology companies, including those mentioned in this article. The author does not hold any equity positions with any company mentioned in this article.

Analysis and opinions expressed herein are specific to the analyst individually and data and other information that might have been provided for validation, not those of Futurum as a whole.

Other Insights from Futurum:

IBM Q4 FY 2025: Software and Z Cycle Lift Growth and FCF

IBM’s New FlashSystem Might Be the Blueprint for AI-Driven Storage Resilience

IBM vs. Anthropic: A Tale of the COBOL Modernization Tape

Author Information

Nick Patience is VP and Practice Lead for AI Platforms at The Futurum Group. Nick is a thought leader on AI development, deployment, and adoption - an area he has researched for 25 years. Before Futurum, Nick was a Managing Analyst with S&P Global Market Intelligence, responsible for 451 Research’s coverage of Data, AI, Analytics, Information Security, and Risk. Nick became part of S&P Global through its 2019 acquisition of 451 Research, a pioneering analyst firm that Nick co-founded in 1999. He is a sought-after speaker and advisor, known for his expertise in the drivers of AI adoption, industry use cases, and the infrastructure behind its development and deployment. Nick also spent three years as a product marketing lead at Recommind (now part of OpenText), a machine learning-driven eDiscovery software company. Nick is based in London.

Related Insights
Glean Doubles ARR to $200M. Can Its Knowledge Graph Beat Copilot
April 3, 2026

Glean Doubles ARR to $200M. Can Its Knowledge Graph Beat Copilot?

Nick Patience, VP & Practice Lead at Futurum, examines Glean's platform evolution from enterprise search to agentic AI, as it doubles ARR to $200M and battles Microsoft 365 Copilot for...
HP IQ Finally Brings Useful On-Device AI To Workspaces
April 3, 2026

HP IQ Finally Brings Useful On-Device AI To Workspaces

Olivier Blanchard, Research Director at Futurum, shares insights on HP IQ, HP’s workplace intelligence layer combining on-device AI, proximity-based connectivity, and IT control across devices and workflows....
Can UK Public Sector Security Keep Up With Its Own Digital Growth?
April 2, 2026

Can UK Public Sector Security Keep Up With Its Own Digital Growth?

The UK public sector's complex digital infrastructure has outpaced manual audits. Palo Alto Networks offers visibility to uncover critical security gaps in government and NHS environments....
Are Browsers the New Enterprise Attack Surface No One Is Ready to Defend?
April 2, 2026

Are Browsers the New Enterprise Attack Surface No One Is Ready to Defend?

Browser security is now the primary enterprise attack surface, with 95% of organizations experiencing browser-originated incidents that legacy tools cannot defend....
Will NVIDIA Investment Accelerate Marvell’s XPU Growth?
April 2, 2026

Will NVIDIA Investment Accelerate Marvell’s XPU Growth?

Brendan Burke, Research Director at Futurum, reviews the NVIDIA-Marvell NVLink Fusion partnership, showing how heterogeneous AI infrastructure, custom silicon, and optical networking reshape ecosystem control and enterprise deployment flexibility....

Book a Demo

Newsletter Sign-up Form

Get important insights straight to your inbox, receive first looks at eBooks, exclusive event invitations, custom content, and more. We promise not to spam you or sell your name to anyone. You can always unsubscribe at any time.

All fields are required






Thank you, we received your request, a member of our team will be in contact with you.