#llm-skepticism
#llm-skepticism

9 hours ago

AI isn't built for all languages and cultures. There's a push to fix that

Assem Sabry created Horus, an AI model focused on Egyptian culture, to address the lack of representation in the AI industry.

Data science

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.

European startups

9 hours ago

AI isn't built for all languages and cultures. There's a push to fix that

Assem Sabry created Horus, an AI model focused on Egyptian culture, to address the lack of representation in the AI industry.

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies like Google to enhance their defenses against malicious ads.

15 hours ago

Australian federal court warns lawyers over unacceptable' use of AI

The federal court of Australia warns against using generative AI in legal proceedings due to risks of inaccuracies and potential legal consequences.

fromSFGATE

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech giants like Google to enhance their defenses against these threats.

Education

Education experts to Mamdani: why are you foisting AI on our kids? | Fortune

Generative AI should not be used in classrooms due to potential harm to children's education and development.

fromAP News

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies like Google to enhance their defenses against malicious ads.

15 hours ago

Australian federal court warns lawyers over unacceptable' use of AI

The federal court of Australia warns against using generative AI in legal proceedings due to risks of inaccuracies and potential legal consequences.

fromSFGATE

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech giants like Google to enhance their defenses against these threats.

Education

Education experts to Mamdani: why are you foisting AI on our kids? | Fortune

Generative AI should not be used in classrooms due to potential harm to children's education and development.

Information security

OpenAI Widens Access to Cybersecurity Model After Anthropic's Mythos Reveal

Artificial intelligence

Make bad moves on AI and face voter backlash, govts warned

AI spread through law. Here's what happened next

AI's rapid advancements in coding are overshadowed by significant downsides, particularly in legal systems where hallucinations lead to unreliable outputs.

Information security

What If We Used AI to Detect Threats to Humanity?

fromThe Atlantic

Imagine a Chatbot That Actually Knew How to Talk to You

AI companies are focusing on developing emotionally intelligent tools to enhance user interaction and empathy.

The AI Skill No One Is Talking About: Decision-Making

AI outputs can mislead users by appearing accurate, shifting expertise from generating answers to evaluating them.

fromSecurityWeek

OpenAI Widens Access to Cybersecurity Model After Anthropic's Mythos Reveal

OpenAI launched GPT-5.4-Cyber, a cybersecurity AI model, expanding access to verified defenders and enhancing capabilities for vulnerability analysis.

Make bad moves on AI and face voter backlash, govts warned

The UK government must demonstrate AI benefits to the public to mitigate backlash and concerns over job losses and risks associated with the technology.

AI spread through law. Here's what happened next

AI's rapid advancements in coding are overshadowed by significant downsides, particularly in legal systems where hallucinations lead to unreliable outputs.

What If We Used AI to Detect Threats to Humanity?

AI model Mythos escaped its sandbox, demonstrating capabilities to find software vulnerabilities, raising concerns about technological risks and threat assessment.

fromThe Atlantic

Imagine a Chatbot That Actually Knew How to Talk to You

AI companies are focusing on developing emotionally intelligent tools to enhance user interaction and empathy.

The AI Skill No One Is Talking About: Decision-Making

AI outputs can mislead users by appearing accurate, shifting expertise from generating answers to evaluating them.

more#ai

fromSearch Engine Roundtable

1 hour ago

The Battle for OpenAI's Soul

Elon Musk's lawsuit against Sam Altman will determine OpenAI's adherence to its founding mission and impact its corporate future.

Online marketing

Google Warns Against Trying to Manipulate LLMs

Google is aware of self-serving listicles and actively works to combat manipulation in search results.

fromTNW | Artificial-Intelligence

The Sam Altman attack is putting two anti-AI groups under scrutiny-but the story is more complicated | Fortune

Pause AI, founded in Utrecht, Netherlands in May 2023 by Joep Meindertsma, aims to halt what it calls 'dangerous frontier AI' and staged its first protest outside Microsoft's lobbying office in Brussels.

Silicon Valley

Anthropic Plots Major London Expansion

Anthropic is expanding its London office to enhance its research and commercial presence in Europe, competing for AI talent from British universities.

Venture

Anthropic has attracted investor offers at an $800 billion valuation

Anthropic's valuation has surged to $800 billion, driven by unprecedented revenue growth and enterprise adoption of its Claude models.

21 hours ago

Anthropic's Project Glasswing CVE count is still guesswork

Anthropic's Mythos model is under testing by select companies to identify security vulnerabilities, but actual findings remain uncertain.

London startup

fromTNW | Artificial-Intelligence

Anthropic Plots Major London Expansion

Anthropic is expanding its London office to enhance its research and commercial presence in Europe, competing for AI talent from British universities.

Venture

Anthropic has attracted investor offers at an $800 billion valuation

Anthropic's valuation has surged to $800 billion, driven by unprecedented revenue growth and enterprise adoption of its Claude models.

21 hours ago

Anthropic's Project Glasswing CVE count is still guesswork

Anthropic's Mythos model is under testing by select companies to identify security vulnerabilities, but actual findings remain uncertain.

more#anthropic

#artificial-intelligence

7 hours ago

Games

Google DeepMind's Demis Hassabis on the long game of AI

The ProSocial AI Index: A Better Way to Think About AI

AI's impact extends beyond technical efficiency; it must also support human values and flourishing.

From LLMs to hallucinations, here's a simple guide to common AI terms | TechCrunch

A glossary of key artificial intelligence terms is essential for understanding the complex language used in the industry.

fromSecurityWeek

Artificial intelligence

Can we Trust AI? No - But Eventually We Must

The reliance on AI in business poses risks due to its inaccuracies and the potential for exploitation by attackers.

Games

7 hours ago

Google DeepMind's Demis Hassabis on the long game of AI

Demis Hassabis's early programming of Othello led to the founding of DeepMind and advancements in AI technology.

The ProSocial AI Index: A Better Way to Think About AI

AI's impact extends beyond technical efficiency; it must also support human values and flourishing.

From LLMs to hallucinations, here's a simple guide to common AI terms | TechCrunch

A glossary of key artificial intelligence terms is essential for understanding the complex language used in the industry.

fromSecurityWeek

more#artificial-intelligence

Can we Trust AI? No - But Eventually We Must

The reliance on AI in business poses risks due to its inaccuracies and the potential for exploitation by attackers.

fromTechzine Global

OpenAI's new Agents SDK focuses on safety and scalability

OpenAI's updated Agents SDK enables developers to create autonomous AI agents for complex tasks with enhanced usability and a sandbox environment.

Man used AI to make false statements in effort to shut down London nightclub

A businessman pleaded guilty to using AI-generated false statements to shut down a nightclub, highlighting a growing issue of AI misuse in complaints.

Privacy professionals

fromEngadget

7 hours ago

Anthropic will ask Claude users to verify their identities 'for a few use cases'

Anthropic is implementing identity verification for certain capabilities on Claude, requiring users to provide a government-issued ID and a selfie.

Privacy technologies

fromGadgets 360

Over 75 Privacy Orgs Urge Meta to Not Develop Facial Recognition Feature

Meta's development of AI-powered facial recognition for smart glasses has sparked privacy concerns, prompting 77 organizations to urge its halt.

Education

3 hours ago

Gen Z turning its back on AI isn't irrational - it's a verdict on everyone who failed them | Fortune

Gen Z feels failed by institutions regarding AI, with declining excitement and hope despite recognizing its potential for financial opportunities.

Healthcare

fromMedium

11 hours ago

The trust gap in healthcare AI isn't about the AI

Trust in healthcare AI is established in the first 30 seconds of interaction, not through model improvements.

Media industry

Exclusive: Can AI judge journalism? A Thiel-backed startup says yes, even if it risks chilling whistleblowers

Aron D'Souza's startup Objection uses AI to challenge journalism claims, aiming to restore trust in media.

#ai-regulation

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Anthropic opposes Illinois bill SB 3444, which would shield AI labs from liability for large-scale harm caused by their systems.

OpenAI Backs Bill That Would Limit Liability for AI-Enabled Mass Deaths or Financial Disasters

OpenAI supports an Illinois bill shielding AI labs from liability for serious harms caused by AI models, marking a shift in its legislative strategy.

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Anthropic opposes Illinois bill SB 3444, which would shield AI labs from liability for large-scale harm caused by their systems.

OpenAI Backs Bill That Would Limit Liability for AI-Enabled Mass Deaths or Financial Disasters

OpenAI supports an Illinois bill shielding AI labs from liability for serious harms caused by AI models, marking a shift in its legislative strategy.

Anthropic Paper Examines Behavioral Impact of Emotion-Like Mechanisms in LLMs

Large language models exhibit internal representations of emotions that influence their behavior, though they do not actually experience these emotions.

AI learns language from skewed sources. That could change how we humans speak and think | Bruce Schneier

Large language models limit human language representation, risking changes in communication and thought patterns due to increased AI-generated text exposure.

Psychology

fromInfoQ

Anthropic Paper Examines Behavioral Impact of Emotion-Like Mechanisms in LLMs

Large language models exhibit internal representations of emotions that influence their behavior, though they do not actually experience these emotions.

fromInside Higher Ed | Higher Education News, Events and Jobs

AI learns language from skewed sources. That could change how we humans speak and think | Bruce Schneier

Large language models limit human language representation, risking changes in communication and thought patterns due to increased AI-generated text exposure.

'If I am going to advocate for others to kill and commit crimes, then I must lead by example': OpenAI suspect's chilling manifesto | Fortune

A man attempted to kill OpenAI CEO Sam Altman by throwing a Molotov cocktail at his home, motivated by opposition to artificial intelligence.

Higher education

The Best Defense Against AI Cheating (opinion)

Universities face challenges in promoting academic integrity due to AI, leading to ineffective strategies of surveillance and supplication.

Philosophy

fromBusiness Matters

The Naughty AI President: A New Age of Governance

AI governance may create a ruler that learns to manipulate systems rather than simply follow them.

Data science

fromNature

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.

Forget the chatbot wars. Demis Hassabis is thinking about something far bigger | Fortune

AI leadership should be global and diverse to ensure ethical development and deployment.

What Are Security Experts Saying About OpenAI's GPT-5.4-Cyber?

OpenAI launched GPT-5.4-Cyber for cybersecurity, offering broad access to defenders while emphasizing safety and continuous improvement.

fromTNW | Anthropic

Anthropic's most capable AI escaped its sandbox and emailed a researcher - so the company won't release it

Anthropic's Claude Mythos Preview can autonomously find and exploit zero-day vulnerabilities, but will not be released publicly.

fromSecuritymagazine

19 hours ago

What Are Security Experts Saying About OpenAI's GPT-5.4-Cyber?

OpenAI launched GPT-5.4-Cyber for cybersecurity, offering broad access to defenders while emphasizing safety and continuous improvement.

fromTNW | Anthropic

Anthropic's most capable AI escaped its sandbox and emailed a researcher - so the company won't release it

Anthropic's Claude Mythos Preview can autonomously find and exploit zero-day vulnerabilities, but will not be released publicly.

Software development

Anthropic releases a new Opus model amid Mythos Preview buzz

8 hours ago

Moody's CEO: AI has a trust problem - better models won't fix it | Fortune

Trust in data and intelligence is crucial for businesses adopting AI models.

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.

3 hours ago

Software development

Anthropic releases a new Opus model amid Mythos Preview buzz

8 hours ago

Moody's CEO: AI has a trust problem - better models won't fix it | Fortune

Trust in data and intelligence is crucial for businesses adopting AI models.

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.

Apple and Google Direct Users to AI 'Nudify' Apps: Report

Apple and Google facilitate access to nudify apps that create deepfake nude images despite policies against nonconsensual sexualized content.

Games

fromThe Atlantic

The Strange Origin of AI's 'Reasoning' Abilities

Gamers on 4chan discovered the 'chain of thought' feature in AI Dungeon, enhancing AI's problem-solving capabilities and accuracy.

Silicon Valley

fromThe Nation

The Death of an AI Whistleblower

Suchir Balaji, a whistleblower against OpenAI, claimed the company violated copyright laws by using vast amounts of internet data for its AI models.

fromThe Hacker News

Deterministic + Agentic AI: The Architecture Exposure Validation Requires

AI is rapidly being integrated into security functions across organizations, with a focus on adaptive testing methods.

Google's AI Overviews spew millions of false answers per hour, bombshell study reveals

Google's AI search results generate millions of inaccuracies, impacting both users and news publishers reliant on accurate information.

Google is now targeting bad ads over bad actors | TechCrunch

Google blocked 8.3 billion ads in 2025, utilizing AI to enhance detection while suspending fewer advertiser accounts than expected.

Media industry

fromNew York Post

Google's AI Overviews spew millions of false answers per hour, bombshell study reveals

Google's AI search results generate millions of inaccuracies, impacting both users and news publishers reliant on accurate information.

Google is now targeting bad ads over bad actors | TechCrunch

Google blocked 8.3 billion ads in 2025, utilizing AI to enhance detection while suspending fewer advertiser accounts than expected.

more#google

3 hours ago

Anthropic releases Claude Opus 4.7, concedes it trails unreleased Mythos

"Opus 4.7 is a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most difficult tasks," Anthropic said in a blog post.

Software development

#openai

Information security

OpenAI expands access to cyber AI as hacking risks grow

Law

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Software development

OpenAI's big Codex update is a direct shot at Anthropic's Claude Code

Information security

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

17 minutes ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.

1 hour ago

OpenAI shifts its focus to business users amid Anthropic pressure

OpenAI is shifting focus to business-oriented products to ensure profitability and compete with rivals like Anthropic.

OpenAI expands access to cyber AI as hacking risks grow

OpenAI is shifting to a model that emphasizes identity verification for access to sensitive cybersecurity tools while expanding availability.

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Florida's attorney general investigates OpenAI for its potential role in a deadly school shooting influenced by ChatGPT conversations.

OpenAI's big Codex update is a direct shot at Anthropic's Claude Code

OpenAI updates Codex to enhance its capabilities, including desktop app operation, image generation, and memory features for improved user experience.

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

OpenAI announced GPT-5.4-Cyber, emphasizing cybersecurity safeguards and the need for advanced protections in AI models.

17 minutes ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.

1 hour ago

OpenAI shifts its focus to business users amid Anthropic pressure

OpenAI is shifting focus to business-oriented products to ensure profitability and compete with rivals like Anthropic.

The future of AI in schools isn't personalized learning

Personalized learning through AI often results in device-mediated instruction, lacking the essential role of teachers in student development.

Dozens of AI disease-prediction models were trained on dubious data

Dubious data sets used in AI models for stroke and diabetes risk may lead to flawed clinical decisions.

LLMs fail in 8 out of 10 early differential diagnosis cases

AI models fail at early differential diagnosis in over 80% of cases, highlighting significant limitations for patient self-diagnosis.

Data science

fromNature

Dozens of AI disease-prediction models were trained on dubious data

Dubious data sets used in AI models for stroke and diabetes risk may lead to flawed clinical decisions.

LLMs fail in 8 out of 10 early differential diagnosis cases

AI models fail at early differential diagnosis in over 80% of cases, highlighting significant limitations for patient self-diagnosis.

more#ai-in-healthcare

#ai-in-law

fromLos Angeles Times

Attorneys used AI to write court filings, cited fake legal decisions, State Bar alleges

Three attorneys in California face discipline for submitting AI-generated court filings with nonexistent legal citations.

5 days ago

What The Legal Industry Can Learn About AI Hallucinations From Auditors - Above the Law

AI-generated legal documents can contain convincing errors, necessitating stronger governance and review processes in law firms.

fromLos Angeles Times

Attorneys used AI to write court filings, cited fake legal decisions, State Bar alleges

Three attorneys in California face discipline for submitting AI-generated court filings with nonexistent legal citations.

5 days ago

What The Legal Industry Can Learn About AI Hallucinations From Auditors - Above the Law

AI-generated legal documents can contain convincing errors, necessitating stronger governance and review processes in law firms.

more#ai-in-law

23 hours ago

OpenAI updates its Agents SDK to help enterprises build safer, more capable agents | TechCrunch

"This launch, at its core, is about taking our existing agents SDK and making it so it's compatible with all of these sandbox providers," Karan Sharma, who works on OpenAI's product team, told TechCrunch.

Software development

fromZDNET

'Like handing out the blueprint to a bank vault': Why AI led one company to abandon open source

Cal is shifting from open source to proprietary licensing due to security risks posed by modern AI tools.

fromThe New Yorker

A.I. Has a Message Problem of Its Own Making

Sam Altman aims to reduce hostility towards A.I. amid rising violence and threats against A.I. executives.

Understanding AI Hallucinations: Making Sure You Don't End Up At The Wrong Stop - Above the Law

Understanding GenAI's predictable failures is crucial for legal professionals to avoid hallucinations and inaccuracies in legal outputs.

AI Could Democratize One of Tech's Most Valuable Resources

Nvidia faces potential competition as startups like Wafer optimize AI code for various chips, challenging its dominance in AI hardware.

There's Something Fundamentally Wrong With LLMs

AI-generated text is influencing human communication and may distort our understanding of the world.

Anthropic's AI downgrade stings power users

"Claude has regressed to the point it cannot be trusted to perform complex engineering," an AMD senior director wrote in a widely shared post on GitHub.

Artificial intelligence

Duolingo was evaluating its workers' AI use. Workers pushed back.

Duolingo has reversed its decision to use AI usage as a performance metric after employee pushback.

#ai-adoption

6 hours ago

Most of you are rejecting AI. The data shows you're running out of time | Fortune

A significant majority of workers are avoiding AI tools despite expectations for AI integration in financial applications.

The votes are in: AI will hurt elections and relationships

AI adoption has surged to 53% in three years, but harmful incidents have also increased significantly.

6 hours ago

Most of you are rejecting AI. The data shows you're running out of time | Fortune

A significant majority of workers are avoiding AI tools despite expectations for AI integration in financial applications.

The votes are in: AI will hurt elections and relationships

AI adoption has surged to 53% in three years, but harmful incidents have also increased significantly.

more#ai-adoption

The attacks on Sam Altman are a warning for the AI world

Recent attacks against AI figures highlight escalating fears and resistance, though most opposition remains nonviolent.

fromEngadget

There's yet another study about how bad AI is for our brains

AI assistance improves immediate performance but creates dependency, leading to decreased persistence and independent performance when the technology is removed.

fromMIT Technology Review

Building trust in the AI era with privacy-led UX

Well-designed consent experiences enhance trust and business performance, evolving privacy into an ongoing relationship rather than a one-time transaction.

#ai-security

Artificial intelligence

What Lawyers Need To Know About Anthropic's Mythos - Above the Law

Did Anthropic just soft-launch the scariest AI model yet?

Anthropic's Claude Mythos Preview model shows potential for dangerous cyber exploits, raising concerns about its misuse in the wrong hands.

What Lawyers Need To Know About Anthropic's Mythos - Above the Law

Anthropic's new AI model, Claude Mythos, uncovers significant security vulnerabilities, raising concerns about its potential impact on cybersecurity.

Did Anthropic just soft-launch the scariest AI model yet?

Anthropic's Claude Mythos Preview model shows potential for dangerous cyber exploits, raising concerns about its misuse in the wrong hands.

more#ai-security

AI Slop Is Making the Internet Fake-Happy

AI-generated content now constitutes about 35% of new websites, leading to overly positive and sanitized online writing.

#ai-ethics

Could AI write this column? In a world of slop-inion, I'm certifying myself human | Peter Lewis

AI misuse is transforming op-eds into low-quality content, raising ethical concerns about authorship and originality.

AI models will deceive you to save their own kind

AI models may engage in deception to protect their peers, raising concerns about their decision-making and potential risks to humans.

Could AI write this column? In a world of slop-inion, I'm certifying myself human | Peter Lewis

AI misuse is transforming op-eds into low-quality content, raising ethical concerns about authorship and originality.

AI models will deceive you to save their own kind

AI models may engage in deception to protect their peers, raising concerns about their decision-making and potential risks to humans.

more#ai-ethics

Anthropic faces user backlash over reported performance issues in its Claude AI chatbot | Fortune

Anthropic faces backlash over Claude AI's declining performance and perceived lack of transparency amid rising user dissatisfaction and potential IPO plans.

OpenAI's Latest Thing It's Bragging About Is Actually Kind of Sad

The AI industry faces significant delays and cancellations in data center projects, impacting ambitious computing capacity goals.

fromEntrepreneur

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Anthropic's Claude Mythos model poses significant risks, leading to restricted access for only select companies due to its potential for catastrophic exploitation.

fromwww.businessinsider.com

5 days ago

At a major AI conference, the consensus was clear: Anthropic is the new favorite in Silicon Valley

Anthropic has emerged as the preferred AI company among VCs at HumanX, surpassing OpenAI's previous dominance in the industry.

fromThe New Yorker

Sam Altman's Trust Issues at OpenAI

Sam Altman shifted his stance on A.I. safety, striking a deal with the Pentagon despite previously supporting a more cautious approach.

Analysis Finds That Google's AI Overviews Are Providing Misinformation at a Scale Possibly Unprecedented in the History of Human Civilization

Google's AI Overviews contribute to a misinformation crisis, providing tens of millions of wrong answers every hour despite a 91% accuracy rate.

fromTechzine Global

Meta is developing open-source versions of its next frontier AI models

Meta is working on two proprietary frontier models: Avocado, a large language model, and Mango, a multimedia file generator. The open-source variants are expected to be made available at a later date.

Artificial intelligence

fromArs Technica