AI chatbots fail at accurate news, major study reveals DW 10/22/2025
Briefly

"Journalists from a range of public service broadcasters, including the BBC (UK) and NPR (US), evaluated the responses of four AI assistants, or chatbots: ChatGPT, Microsoft's Copilot, Google's Gemini and Perplexity AI. Measuring criteria such as accuracy, sourcing, providing context, the ability to editorialize appropriately and the ability to distinguish fact from opinion, the study found that almost half of all answers had at least one significant issue, while 31% contained serious sourcing problems and 20% contained major factual errors."
"DW found that 53% of the answers provided by the AI assistants to its questions had significant issues, with 29% experiencing specific issues with accuracy. Among the factual errors made in response to DW questions was Olaf Scholz being named as German Chancellor, even though Friedrich Merz had been made Chancellor one month earlier. Another saw Jens Stoltenberg named as NATO secretary general after Mark Rutte had already taken over the role."
Twenty-two public service media organizations evaluated responses from ChatGPT, Microsoft's Copilot, Google's Gemini and Perplexity AI across languages and territories. Evaluators assessed accuracy, sourcing, context, appropriate editorializing and the distinction between fact and opinion. Overall, 45% of responses contained at least one significant issue, 31% had serious sourcing problems and 20% contained major factual errors. For DW's own questions, 53% of answers had significant issues and 29% had accuracy problems. Errors included naming Olaf Scholz as German chancellor after Friedrich Merz had taken office, and naming Jens Stoltenberg as NATO secretary general after Mark Rutte had taken over. AI chatbots are increasingly used for news, especially by younger audiences, yet the failings were systemic, appearing across languages and countries.
Read at www.dw.com