| From | Fritz Wuehler <fritz@spamexpire-202511.rodent.frell.theremailer.net> |
|---|---|
| Subject | AI Chatbot Mangles 76% Of News Summary |
| Message-ID | <b55032b0870d8a28e70c9f2e65ff57ed@msgid.frell.theremailer.net> (permalink) |
| Date | 2025-11-17 01:36 +0100 |
| Newsgroups | misc.consumers, alt.privacy |
| Organization | dizum.com - The Internet Problem Provider |
Cross-posted to 2 groups.
BBC probe finds AI chatbots mangle nearly half of news summaries

Google Gemini worst offender with 76% error rate

A major study [PDF] led by the BBC on behalf of the European Broadcasting Union (EBU) found that OpenAI's ChatGPT, Microsoft Copilot, Google Gemini, and Perplexity misrepresented news content in almost half of the cases.

An analysis of more than 3,000 responses from the AI assistants found that 45 percent of answers given contained at least one significant issue, 31 percent had serious sourcing problems, and a fifth had "major accuracy issues, including hallucinated details and outdated information."

When accounting for smaller slip-ups, a whopping 81 percent of responses included a mistake of some sort.

Gemini was identified as the worst performer, with researchers identifying "significant issues" in 76 percent of responses it provided – double the error rate of the other AI bots.

The researchers blamed this on Gemini's poor performance in sourcing information, with researchers finding significant inaccuracies in 72 percent of responses. This was three times as many as ChatGPT (24 percent), followed by Perplexity and Copilot (both 15 percent).

More here: https://www.theregister.com/2025/10/24/bbc_probe_ai_news/