OpenAI adds MCP and SIP support to gpt-realtime for smarter voice-based agents
Briefly

"Additionally, the model provider said that the updated gpt-realtime model has shown improvements in following complex instructions, calling tools with precision, and producing speech that 'sounds more natural and expressive.'"
"These improvements, according to Dai, would help enterprises use the API for enabling low-latency, natural voice interactions for a spectrum of use cases, such as real-time medical transcription, conversational booking assistants, customer service for banking, insurance, and telco, and employee enablement across major verticals."
"Enterprises accessing the model through the API can use two new voices, Cedar and Marin, the model provider said."
According to OpenAI, the updated gpt-realtime model is better at following complex instructions, calling tools with precision, and producing speech that sounds more natural and expressive. These improvements are meant to help enterprises use the API for low-latency, natural voice interactions across a range of use cases, including real-time medical transcription, conversational booking assistants, customer service for banking, insurance, and telco, and employee enablement across major verticals. Enterprises accessing the model through the API can also use two new voices, Cedar and Marin.
Read at InfoWorld