Gen-AI-Today

GenAI TODAY NEWS

Free eNews Subscription

Twilio Announces Enhanced Partnership with OpenAI to Improve Speech-to-Speech

By Tracey E. Schelmetic

In the past, customers have found that speaking with AI-driven chatbots has felt very much less like “chat” and more like “bot.” Responses, even if they were correct, rarely sounded particularly natural.

Now. technology has grown to ideally fix this problem; namely, through speech-to-speech. This is an emerging solution that allows for voice conversations by AI virtual agents to feel much more like real human dialogue.

In this vein, many contact center companies are turning to OpenAI’s Realtime API to reduce latency and improve key components like conversation pacing, interruption handling, tone and balance between speaking and listening – all critical user experience elements that make conversation with a virtual agent more human-like.

And so, customer engagement solutions provider Twilio recently announced an integration with OpenAI to bring the latter company’s new Realtime API to the Twilio platform. The integration of streaming speech-to-speech capabilities, which are part of the Realtime API, will enable 300,000+ Twilio customers and more than 10 million developers to build conversational AI virtual agents leveraging OpenAI’s flagship multilingual and multimodal GPT-4o model. The new integration builds on existing OpenAI and Twilio product integrations announced last year to bring the power of large language models (LLMs) to the customer engagement platform.

In the announcement, the companies noted that the combined technology "is especially relevant for customer service and sales, delivering both operational efficiency and exceptional customer outcomes." Speech-to-speech is also set to support social impact at scale, empowering nonprofit and public sector organizations to deploy novel use cases like voice translation in real time between constituents and staff members who speak different languages.

“Integrating OpenAI’s Realtime API with Twilio’s platform enables businesses to offer more natural, real-time AI voice interactions at scale,” said Inbal Shani, Chief Product Officer, Twilio Communications. “Businesses can use this to create voice experiences that feel more human and can reduce operational costs and drive higher customer satisfaction.”




Edited by Alex Passett
Get stories like this delivered straight to your inbox. [Free eNews Subscription]

GenAIToday Contributor

SHARE THIS ARTICLE
Related Articles

Building Personalized AI Agents

By: Special Guest    4/4/2025

It's tempting to build an AI Agent that can do everything, but that's a recipe for a diluted and, ultimately, less effective generic workflow.

Read More

Salad Redefines AI Transcription with Unmatched Accuracy and Ultra-Low Pricing

By: Erik Linask    3/31/2025

Salad looks to upend the AI transcription market with its low-cost, highly accurate artificial intelligence-driven Salad Transcription API.

Read More

The Human-AI Partnership: Elevating Customer Service Without Losing the Personal Touch

By: Special Guest    3/26/2025

How businesses can leverage AI to improve customer experiences without losing the human touch of customer interactions.

Read More

Boomi AI Studio Launched to Centralize Control and Governance of Enterprise AI Agents

By: Erik Linask    3/10/2025

Boomi AI Studio allows businesses to harness the power of AI-driven automation by delivering the necessary oversight and guardrails to enable scaling …

Read More

IBM Strengthens GenAI Portfolio with DataStax Acquisition

By: Erik Linask    2/25/2025

Bolstering its Generative AI portfolio, IBM announced its plan to acquire AI and data solutions provider DataStax.

Read More

-->