#speech-synthesis tag

An AI announcer mispronounced and skipped names during a graduation

AI name-announcement tools can mispronounce or skip names during graduations due to timing issues, requiring pauses and human do-overs.

Artificial intelligence

fromTheregister

1 month ago

Microsoft shivs OpenAI with new AI models for speech, images

Microsoft launched public preview versions of machine learning models for speech recognition, speech synthesis, and image generation, competing directly with OpenAI.

Artificial intelligence

fromBusiness Matters

4 months ago

Free AI Dubbing Tool with Audiobook Support - Convert Text to Speech Instantly

AI audiobook generators and dubbing engines let anyone convert text or video into realistic, human-like audio quickly, affordably, and across languages.

fromComputerWeekly.com

5 months ago

How digital twins are helping people with motor neurone disease speak | Computer Weekly

An initiative by a UK-based charity, supported by technology companies and universities, has developed an artificial intelligence (AI)-powered digital twin that allows people with communications disabilities to speak in a natural way. The technology, known as VoxAI, represents a step-change from the computer-assisted voice used by late physicist Stephen Hawking, one of the first well-known public figures with motor neurone disease (MND).

Artificial intelligence

fromPitchfork

7 months ago

Kraftwerk Co-Founder Florian Schneider's Synths, Volkswagen Van, and More Available at Auction

Among the recording equipment and memorabilia available to bid on are Schneider's 1964 Volkswagen van, the Panasonic bicycle he rode in the 1984 music video for a remix of "Tour De France," a number of woodwind and brass instruments-including the 1960s Orsi alto flute that appeared on the back cover of Kraftwerk's 1970 self-titled debut-and a rack case of Votrax speech synthesizer units, which the band used to create the robot voices that opened all of their concerts between 1981 and 2002.

Music

fromTechCrunch

7 months ago

Sesame, the conversational AI startup from Oculus founders, raises $250M and launches beta | TechCrunch

The startup, headed by former Oculus co-founder and CEO Brendan Iribe and Ankit Kumar, former CTO of AR startup Ubiquity6, is working to create a personal AI agent that interacts with users using a natural-sounding human voice. The company plans to embed the personal AI agent into lightweight eyewear that is designed to be worn throughout the day and which users can interact with via voice.

Wearables

fromTheregister

7 months ago

Humans flunk the Turing test for voices as bots get chattier

Think you can distinguish between a human voice and a robot? Think again, because the numbers are starting to say otherwise. Researchers at Queen Mary University of London and University College London found that people can no longer reliably distinguish between genuine speech and cloned AI voices. Their study, published in open-access journal PLOS One, found that when people were played recordings of real people together with AI-generated versions of the same voices, their judgments were little better than random chance.

Artificial intelligence

fromAcm

8 months ago

Unlocking the Potential of Arabic Voice-Generation Technologies

Voice-generation technology enables machines to synthesize human-like speech-text-to-speech (TTS)-revolutionizing digital communication by fostering more inclusive and accessible experiences. What began as simple robotic speech synthesis has evolved into highly sophisticated voice-cloning systems that can produce natural, coherent, expressive, and personalized voices using minimal data. These technologies empower individuals with cross-lingual communication through virtual agents, assist in overcoming visual or speech impairments or literacy challenges via assistive tools, and support educators and industries such as entertainment with creative content generation.

Artificial intelligence

fromThe Verge

8 months ago

Microsoft launches its first in-house AI models

Microsoft released MAI-Voice-1 for rapid speech generation and MAI-1-preview as a consumer-focused instruction-following model for Copilot text use cases.

fromArs Technica

10 months ago

A neural brain implant provides near instantaneous speech

"Our main goal is creating a flexible speech neuroprosthesis that enables a patient with paralysis to speak as fluently as possible, managing their own cadence, and be more expressive by letting them modulate their intonation," says Maitreyee Wairagkar, a neuroprosthetics researcher at UC Davis who led the study.

Science

#speech-synthesis#speech-synthesis

An AI announcer mispronounced and skipped names during a graduation

Microsoft shivs OpenAI with new AI models for speech, images

Free AI Dubbing Tool with Audiobook Support - Convert Text to Speech Instantly

How digital twins are helping people with motor neurone disease speak | Computer Weekly

Kraftwerk Co-Founder Florian Schneider's Synths, Volkswagen Van, and More Available at Auction

Sesame, the conversational AI startup from Oculus founders, raises $250M and launches beta | TechCrunch

Humans flunk the Turing test for voices as bots get chattier

Unlocking the Potential of Arabic Voice-Generation Technologies

Microsoft launches its first in-house AI models

A neural brain implant provides near instantaneous speech

#speech-synthesis
#speech-synthesis