Google DeepMind's latest model promises realistic audio

from Theregister 8 months ago

The model can generate audio from video and text inputs without manual alignment, utilizing datasets with AI-generated annotations and transcriptions. However, audio quality is tied to video source quality, with challenges such as lip sync.
Theregisterhttps://www.theregister.com/2024/06/18/google_deepmind_video/

Read at Theregister

#ai-models #audio-generation #video-to-audio #deepmind #diffusion-models

Collection

[

...

]

Google DeepMind's latest model promises realistic audioGoogle DeepMind's latest model promises realistic audio Briefly

Google DeepMind's latest model promises realistic audio
Google DeepMind's latest model promises realistic audio
Briefly