
"OpenAI, the company that developed the models and products associated with ChatGPT, plans to announce a new audio language model in the first quarter of 2026, and that model will be an intentional step along the way to an audio-based physical hardware device, according to a report in The Information. Citing a variety of sources familiar with the plans, including both current and former employees, The Information claims that OpenAI has taken efforts to combine multiple teams across engineering, product, and research under one initiative focused on improving audio models, which researchers in the company believe lag behind the models used for written text in terms of both accuracy and speed."
"They have also seen that relatively few ChatGPT users opt to use the voice interface, with most people preferring the text one. The hope may be that substantially improving the audio models could shift user behavior toward voice interfaces, allowing the models and products to be deployed in a wider range of devices, such as in cars. OpenAI plans to release a family of physical devices in the coming years, starting with an audio-focused one. People inside the company have discussed a variety of forms for future devices, including smart speakers and glasses, but the emphasis across the line is on audio interfaces rather than screen-based ones."
OpenAI plans to announce a new audio language model in the first quarter of 2026 as a deliberate step toward an audio-based physical hardware device. The company has combined engineering, product, and research teams under an initiative to improve audio models, which researchers consider less accurate and slower than text models. Few ChatGPT users currently prefer the voice interface over text. The company hopes improved audio capabilities will shift behavior toward voice interfaces and enable deployment across more devices, including cars. OpenAI intends to launch a family of physical devices beginning with an audio-focused product, emphasizing audio interfaces over screens.
Read at Ars Technica
Unable to calculate read time
Collection
[
|
...
]