
"For a while now, Open AI has been working on an AI gadget with built-in cameras, microphones and speakers that will be able to record what is happening in the environment and answer users' questions. According to The Information, the device will be mostly controlled by voice, which makes sense since the device has no screen."
"Thus, the focus is currently on developing a new voice model that sounds more natural and is capable of speaking and understanding speech at the same time, which is not as easy as one might think."
OpenAI is developing a compact AI gadget with built-in cameras, microphones, and speakers to record environmental context and answer user questions. The device lacks a display and therefore relies primarily on voice-based control. Development focuses on a new voice model that sounds more natural and can speak while understanding incoming speech simultaneously. Concurrent speaking and listening creates technical challenges such as latency, echo cancellation, and model coordination. Hardware integration and real-time bidirectional audio processing are prioritized to enable seamless, screenless conversational interactions.
Read at Computerworld
Unable to calculate read time
Collection
[
|
...
]