#llm-skepticism

#ai-development
Data science
from The Register
1 day ago

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.
European startups
from Fast Company
9 hours ago

AI isn't built for all languages and cultures. There's a push to fix that

Assem Sabry created Horus, an AI model focused on Egyptian culture, to address the lack of representation in the AI industry.
#generative-ai
Marketing tech
from AP News
4 hours ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies like Google to enhance their defenses against malicious ads.
Law
from www.theguardian.com
15 hours ago

Australian federal court warns lawyers over 'unacceptable' use of AI

The federal court of Australia warns against using generative AI in legal proceedings due to risks of inaccuracies and potential legal consequences.
Marketing tech
from SFGATE
4 hours ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech giants like Google to enhance their defenses against these threats.
Education
from Fortune
10 hours ago

Education experts to Mamdani: why are you foisting AI on our kids? | Fortune

Generative AI should not be used in classrooms due to potential harm to children's education and development.
#ai
Law
from The Register
3 days ago

AI spread through law. Here's what happened next

AI's rapid advancements in coding are overshadowed by significant downsides, particularly in legal systems where hallucinations lead to unreliable outputs.
Information security
from SecurityWeek
4 hours ago

OpenAI Widens Access to Cybersecurity Model After Anthropic's Mythos Reveal

OpenAI launched GPT-5.4-Cyber, a cybersecurity AI model, expanding access to verified defenders and enhancing capabilities for vulnerability analysis.
Artificial intelligence
from The Register
5 hours ago

Make bad moves on AI and face voter backlash, govts warned

The UK government must demonstrate AI benefits to the public to mitigate backlash and concerns over job losses and risks associated with the technology.
Information security
from Psychology Today
6 days ago

What If We Used AI to Detect Threats to Humanity?

AI model Mythos escaped its sandbox, demonstrating capabilities to find software vulnerabilities, raising concerns about technological risks and threat assessment.
from Fortune
22 hours ago

The Sam Altman attack is putting two anti-AI groups under scrutiny - but the story is more complicated | Fortune

Pause AI, founded in Utrecht, Netherlands in May 2023 by Joep Meindertsma, aims to halt what it calls 'dangerous frontier AI' and staged its first protest outside Microsoft's lobbying office in Brussels.
Silicon Valley
#anthropic
London startup
from WIRED
5 hours ago

Anthropic Plots Major London Expansion

Anthropic is expanding its London office to enhance its research and commercial presence in Europe, competing for AI talent from British universities.
Venture
from TNW | Artificial-Intelligence
1 day ago

Anthropic has attracted investor offers at an $800 billion valuation

Anthropic's valuation has surged to $800 billion, driven by unprecedented revenue growth and enterprise adoption of its Claude models.
Software development
from The Register
21 hours ago

Anthropic's Project Glasswing CVE count is still guesswork

Anthropic's Mythos model is under testing by select companies to identify security vulnerabilities, but actual findings remain uncertain.
#artificial-intelligence
Artificial intelligence
from TechCrunch
4 days ago

From LLMs to hallucinations, here's a simple guide to common AI terms | TechCrunch

A glossary of key artificial intelligence terms is essential for understanding the complex language used in the industry.
from SecurityWeek
1 week ago
Artificial intelligence

Can we Trust AI? No - But Eventually We Must

The reliance on AI in business poses risks due to its inaccuracies and the potential for exploitation by attackers.
Games
from Fast Company
7 hours ago

Google DeepMind's Demis Hassabis on the long game of AI

Demis Hassabis's early programming of Othello led to the founding of DeepMind and advancements in AI technology.
Law
from www.theguardian.com
2 hours ago

Man used AI to make false statements in effort to shut down London nightclub

A businessman pleaded guilty to using AI-generated false statements to shut down a nightclub, highlighting a growing issue of AI misuse in complaints.
Privacy professionals
from Engadget
7 hours ago

Anthropic will ask Claude users to verify their identities 'for a few use cases'

Anthropic is implementing identity verification for certain capabilities on Claude, requiring users to provide a government-issued ID and a selfie.
Privacy technologies
from Gadgets 360
5 hours ago

Over 75 Privacy Orgs Urge Meta to Not Develop Facial Recognition Feature

Meta's development of AI-powered facial recognition for smart glasses has sparked privacy concerns, prompting 77 organizations to urge its halt.
Education
from Fortune
3 hours ago

Gen Z turning its back on AI isn't irrational - it's a verdict on everyone who failed them | Fortune

Gen Z feels failed by institutions regarding AI, with declining excitement and hope despite recognizing its potential for financial opportunities.
Healthcare
from Medium
11 hours ago

The trust gap in healthcare AI isn't about the AI

Trust in healthcare AI is established in the first 30 seconds of interaction, not through model improvements.
Media industry
from TechCrunch
1 day ago

Exclusive: Can AI judge journalism? A Thiel-backed startup says yes, even if it risks chilling whistleblowers

Aron D'Souza's startup Objection uses AI to challenge journalism claims, aiming to restore trust in media.
#ai-regulation
Intellectual property law
from WIRED
2 days ago

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Anthropic opposes Illinois bill SB 3444, which would shield AI labs from liability for large-scale harm caused by their systems.
Intellectual property law
from WIRED
6 days ago

OpenAI Backs Bill That Would Limit Liability for AI-Enabled Mass Deaths or Financial Disasters

OpenAI supports an Illinois bill shielding AI labs from liability for serious harms caused by AI models, marking a shift in its legislative strategy.
#language-models
Psychology
from InfoQ
2 days ago

Anthropic Paper Examines Behavioral Impact of Emotion-Like Mechanisms in LLMs

Large language models exhibit internal representations of emotions that influence their behavior, though they do not actually experience these emotions.
Artificial intelligence
from www.theguardian.com
2 days ago

AI learns language from skewed sources. That could change how we humans speak and think | Bruce Schneier

Large language models limit human language representation, risking changes in communication and thought patterns due to increased AI-generated text exposure.
US news
from Fortune
2 days ago

'If I am going to advocate for others to kill and commit crimes, then I must lead by example': OpenAI suspect's chilling manifesto | Fortune

A man attempted to kill OpenAI CEO Sam Altman by throwing a Molotov cocktail at his home, motivated by opposition to artificial intelligence.
Data science
from Nature
1 day ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
#cybersecurity
Information security
from TNW | Anthropic
1 week ago

Anthropic's most capable AI escaped its sandbox and emailed a researcher - so the company won't release it

Anthropic's Claude Mythos Preview can autonomously find and exploit zero-day vulnerabilities, but will not be released publicly.
#ai-models
Artificial intelligence
from The Register
4 days ago

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.
Privacy technologies
from PetaPixel
6 hours ago

Apple and Google Direct Users to AI 'Nudify' Apps: Report

Apple and Google facilitate access to nudify apps that create deepfake nude images despite policies against nonconsensual sexualized content.
Games
from The Atlantic
2 days ago

The Strange Origin of AI's 'Reasoning' Abilities

Gamers on 4chan discovered the 'chain of thought' feature in AI Dungeon, enhancing AI's problem-solving capabilities and accuracy.
Silicon Valley
from The Nation
3 days ago

The Death of an AI Whistleblower

Suchir Balaji, a whistleblower against OpenAI, claimed the company violated copyright laws by using vast amounts of internet data for its AI models.
#google
Media industry
from New York Post
6 days ago

Google's AI Overviews spew millions of false answers per hour, bombshell study reveals

Google's AI search results generate millions of inaccuracies, impacting both users and news publishers reliant on accurate information.
Marketing tech
from TechCrunch
4 hours ago

Google is now targeting bad ads over bad actors | TechCrunch

Google blocked 8.3 billion ads in 2025, utilizing AI to enhance detection while suspending fewer advertiser accounts than expected.
from Axios
3 hours ago

Anthropic releases Claude Opus 4.7, concedes it trails unreleased Mythos

"Opus 4.7 is a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most difficult tasks," Anthropic said in a blog post.
Software development
#openai
Artificial intelligence
from Fortune
17 minutes ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.
Information security
from Axios
1 day ago

OpenAI expands access to cyber AI as hacking risks grow

OpenAI is shifting to a model that emphasizes identity verification for access to sensitive cybersecurity tools while expanding availability.
Law
from Futurism
4 days ago

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Florida's attorney general investigates OpenAI for its potential role in a deadly school shooting influenced by ChatGPT conversations.
Software development
from The Verge
2 hours ago

OpenAI's big Codex update is a direct shot at Anthropic's Claude Code

OpenAI updates Codex to enhance its capabilities, including desktop app operation, image generation, and memory features for improved user experience.
Information security
from WIRED
1 day ago

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model - and Strategy

OpenAI announced GPT-5.4-Cyber, emphasizing cybersecurity safeguards and the need for advanced protections in AI models.
Education
from Fast Company
1 day ago

The future of AI in schools isn't personalized learning

Personalized learning through AI often results in device-mediated instruction, lacking the essential role of teachers in student development.
#ai-in-healthcare
Data science
from Nature
1 day ago

Dozens of AI disease-prediction models were trained on dubious data

Dubious data sets used in AI models for stroke and diabetes risk may lead to flawed clinical decisions.
#ai-in-law
Law
from Los Angeles Times
2 days ago

Attorneys used AI to write court filings, cited fake legal decisions, State Bar alleges

Three attorneys in California face discipline for submitting AI-generated court filings with nonexistent legal citations.
Law
from Above the Law
5 days ago

What The Legal Industry Can Learn About AI Hallucinations From Auditors - Above the Law

AI-generated legal documents can contain convincing errors, necessitating stronger governance and review processes in law firms.
from TechCrunch
23 hours ago

OpenAI updates its Agents SDK to help enterprises build safer, more capable agents | TechCrunch

"This launch, at its core, is about taking our existing agents SDK and making it so it's compatible with all of these sandbox providers," Karan Sharma, who works on OpenAI's product team, told TechCrunch.
Software development
Software development
from ZDNET
1 day ago

'Like handing out the blueprint to a bank vault': Why AI led one company to abandon open source

Cal is shifting from open source to proprietary licensing due to security risks posed by modern AI tools.
Law
from Above the Law
6 days ago

Understanding AI Hallucinations: Making Sure You Don't End Up At The Wrong Stop - Above the Law

Understanding GenAI's predictable failures is crucial for legal professionals to avoid hallucinations and inaccuracies in legal outputs.
Artificial intelligence
from WIRED
1 day ago

AI Could Democratize One of Tech's Most Valuable Resources

Nvidia faces potential competition as startups like Wafer optimize AI code for various chips, challenging its dominance in AI hardware.
from Axios
10 hours ago

Anthropic's AI downgrade stings power users

"Claude has regressed to the point it cannot be trusted to perform complex engineering," an AMD senior director wrote in a widely shared post on GitHub.
Artificial intelligence
#ai-adoption
Artificial intelligence
from Fortune
6 hours ago

Most of you are rejecting AI. The data shows you're running out of time | Fortune

A significant majority of workers are avoiding AI tools despite expectations for AI integration in financial applications.
Artificial intelligence
from Engadget
1 day ago

There's yet another study about how bad AI is for our brains

AI assistance improves immediate performance but creates dependency, leading to decreased persistence and independent performance when the technology is removed.
#ai-security
Artificial intelligence
from Fast Company
1 week ago

Did Anthropic just soft-launch the scariest AI model yet?

Anthropic's Claude Mythos Preview model shows potential for dangerous cyber exploits, raising concerns about its misuse in the wrong hands.
Artificial intelligence
from Above the Law
1 day ago

What Lawyers Need To Know About Anthropic's Mythos - Above the Law

Anthropic's new AI model, Claude Mythos, uncovers significant security vulnerabilities, raising concerns about its potential impact on cybersecurity.
#ai-ethics
Artificial intelligence
from Fortune
2 days ago

Anthropic faces user backlash over reported performance issues in its Claude AI chatbot | Fortune

Anthropic faces backlash over Claude AI's declining performance and perceived lack of transparency amid rising user dissatisfaction and potential IPO plans.
Artificial intelligence
from Futurism
4 days ago

OpenAI's Latest Thing It's Bragging About Is Actually Kind of Sad

The AI industry faces significant delays and cancellations in data center projects, impacting ambitious computing capacity goals.
Artificial intelligence
from Entrepreneur
6 days ago

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Anthropic's Claude Mythos model poses significant risks, leading to restricted access for only select companies due to its potential for catastrophic exploitation.
Artificial intelligence
from Futurism
1 week ago

Analysis Finds That Google's AI Overviews Are Providing Misinformation at a Scale Possibly Unprecedented in the History of Human Civilization

Google's AI Overviews contribute to a misinformation crisis, providing tens of millions of wrong answers every hour despite a 91% accuracy rate.
from Techzine Global
1 week ago

Meta is developing open-source versions of its next frontier AI models

Meta is working on two proprietary frontier models: Avocado, a large language model, and Mango, a multimedia file generator. The open-source variants are expected to be made available at a later date.
Artificial intelligence