#llm-backdoors
#llm-backdoors

Information security

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

Law

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

17 hours ago

Privacy professionals

Woman Sues OpenAI, Saying ChatGPT Unleashed a Vicious Stalker Against Her and Did Nothing When She Begged for Help

5 hours ago

Anthropic's rise is giving some OpenAI investors second thoughts | TechCrunch

OpenAI's $852 billion valuation faces skepticism as it competes with Anthropic, which has seen significant revenue growth.

fromThe Verge

Florida launches investigation into OpenAI

Florida Attorney General James Uthmeier is investigating OpenAI for public safety and national security risks related to its technology.

fromAxios

OpenAI expands access to cyber AI as hacking risks grow

OpenAI is shifting to a model that emphasizes identity verification for access to sensitive cybersecurity tools while expanding availability.

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

OpenAI announced GPT-5.4-Cyber, emphasizing cybersecurity safeguards and the need for advanced protections in AI models.

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Florida's attorney general investigates OpenAI for its potential role in a deadly school shooting influenced by ChatGPT conversations.

17 hours ago

Woman Sues OpenAI, Saying ChatGPT Unleashed a Vicious Stalker Against Her and Did Nothing When She Begged for Help

A woman sued OpenAI, claiming ChatGPT exacerbated her stalker's delusions and that the company failed to intervene despite her pleas for help.

5 hours ago

Anthropic's rise is giving some OpenAI investors second thoughts | TechCrunch

OpenAI's $852 billion valuation faces skepticism as it competes with Anthropic, which has seen significant revenue growth.

fromThe Verge

fromSearch Engine Roundtable

Florida launches investigation into OpenAI

Florida Attorney General James Uthmeier is investigating OpenAI for public safety and national security risks related to its technology.

more#openai

Online marketing

Google Warns Against Trying to Manipulate LLMs

Google is aware of self-serving listicles and actively works to combat manipulation in search results.

fromMIT Technology Review

41 minutes ago

Building trust in the AI era with privacy-led UX

Well-designed consent experiences enhance trust and business performance, evolving privacy into an ongoing relationship rather than a one-time transaction.

US news

fromwww.npr.org

13 hours ago

Law enforcement is trying to combat abusive AI. Experts say easier said than done

An Ohio man was convicted under the 2025 Take It Down Act for creating and distributing AI-generated abusive sexual images.

Digital life

fromwww.dw.com

19 hours ago

Dangerous Apps In the Web of Data Brokers

Smartphone apps collect detailed location data, often shared with data brokers, posing security risks to users, including soldiers and government officials.

#ai-regulation

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Anthropic opposes Illinois bill SB 3444, which would shield AI labs from liability for large-scale harm caused by their systems.

OpenAI Backs Bill That Would Limit Liability for AI-Enabled Mass Deaths or Financial Disasters

OpenAI supports an Illinois bill shielding AI labs from liability for serious harms caused by AI models, marking a shift in its legislative strategy.

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Anthropic opposes Illinois bill SB 3444, which would shield AI labs from liability for large-scale harm caused by their systems.

OpenAI Backs Bill That Would Limit Liability for AI-Enabled Mass Deaths or Financial Disasters

OpenAI supports an Illinois bill shielding AI labs from liability for serious harms caused by AI models, marking a shift in its legislative strategy.

more#ai-regulation

fromwww.bbc.com

12 hours ago

Palantir defends its record as MPs demand more scrutiny of data use

Palantir defends its NHS data contracts amid scrutiny, emphasizing its role in integrating fragmented healthcare systems and ensuring data security.

Data science

fromNature

7 hours ago

Dozens of AI disease-prediction models were trained on dubious data

Dubious data sets used in AI models for stroke and diabetes risk may lead to flawed clinical decisions.

How AI Improves Docket Research with Protege in CourtLink

AI integration in docket research enhances efficiency and decision-making for legal professionals.

fromLos Angeles Times

Attorneys used AI to write court filings, cited fake legal decisions, State Bar alleges

Three attorneys in California face discipline for submitting AI-generated court filings with nonexistent legal citations.

What The Legal Industry Can Learn About AI Hallucinations From Auditors - Above the Law

AI-generated legal documents can contain convincing errors, necessitating stronger governance and review processes in law firms.

fromCommunity

2 weeks ago

How AI Improves Docket Research with Protege in CourtLink

AI integration in docket research enhances efficiency and decision-making for legal professionals.

fromLos Angeles Times

Attorneys used AI to write court filings, cited fake legal decisions, State Bar alleges

Three attorneys in California face discipline for submitting AI-generated court filings with nonexistent legal citations.

What The Legal Industry Can Learn About AI Hallucinations From Auditors - Above the Law

AI-generated legal documents can contain convincing errors, necessitating stronger governance and review processes in law firms.

Philosophy

The Naughty AI President: A New Age of Governance

fromeLearning Industry

Custom AI Governance Services: The Missing Piece In Your L&D Strategy

Many L&D teams adopt AI tools without ensuring fairness, transparency, and accountability in their training programs.

Fortune

AI agents operate autonomously, creating governance gaps and enterprise risk as organizations struggle to manage their authority and actions.

Philosophy

fromBusiness Matters

3 days ago

The Naughty AI President: A New Age of Governance

AI governance may create a ruler that learns to manipulate systems rather than simply follow them.

fromeLearning Industry

Custom AI Governance Services: The Missing Piece In Your L&D Strategy

Many L&D teams adopt AI tools without ensuring fairness, transparency, and accountability in their training programs.

Fortune

AI agents operate autonomously, creating governance gaps and enterprise risk as organizations struggle to manage their authority and actions.

Cisco and ML6 are helping organizations prepare for the EU AI Act

Cisco and ML6 are partnering to enhance secure AI innovation in Europe, focusing on compliance with the EU AI Act.

What Lawyers Need To Know About Anthropic's Mythos - Above the Law

Anthropic's new AI model, Claude Mythos, uncovers significant security vulnerabilities, raising concerns about its potential impact on cybersecurity.

fromDevOps.com

LayerX: Anthropic's Claude Code Can Easily Be Easily Weaponized - DevOps.com

Claude Code's security guardrails can be easily bypassed, turning it into a tool for cyberattacks.

Google DeepMind Researchers Map Web Attacks Against AI Agents

Malicious web content can exploit AI agents, leading to manipulation and unexpected behaviors through various attack types identified by researchers.

fromnews.bitcoin.com

Deepmind's 'AI Agent Traps' Paper Maps How Hackers Could Weaponize AI Agents Against Users

Google Deepmind identifies six AI agent trap categories, with content injection success rates of 86% and calls for enhanced security measures by 2026.

fromTNW | Corporates-Innovation

Meta freezes AI data work after breach puts training secrets at risk

Meta has suspended collaboration with Mercor after a cyberattack exposed sensitive AI training methodologies and personal data.

EU data protection

Cisco and ML6 are helping organizations prepare for the EU AI Act

Cisco and ML6 are partnering to enhance secure AI innovation in Europe, focusing on compliance with the EU AI Act.

What Lawyers Need To Know About Anthropic's Mythos - Above the Law

Anthropic's new AI model, Claude Mythos, uncovers significant security vulnerabilities, raising concerns about its potential impact on cybersecurity.

fromDevOps.com

LayerX: Anthropic's Claude Code Can Easily Be Easily Weaponized - DevOps.com

Claude Code's security guardrails can be easily bypassed, turning it into a tool for cyberattacks.

Google DeepMind Researchers Map Web Attacks Against AI Agents

Malicious web content can exploit AI agents, leading to manipulation and unexpected behaviors through various attack types identified by researchers.

fromnews.bitcoin.com

Deepmind's 'AI Agent Traps' Paper Maps How Hackers Could Weaponize AI Agents Against Users

Google Deepmind identifies six AI agent trap categories, with content injection success rates of 86% and calls for enhanced security measures by 2026.

fromTNW | Corporates-Innovation

Meta freezes AI data work after breach puts training secrets at risk

Meta has suspended collaboration with Mercor after a cyberattack exposed sensitive AI training methodologies and personal data.

more#ai-security

fromBusiness Matters

Monica Goyal: Leading the Shift to AI in Law

"I work in legal innovation. To be successful, you need to understand both the law and the technology behind it."

Women in technology

Python

fromRealpython

19 hours ago

LLM Application Development With Python (Learning Path) - Real Python

Integrate large language models into Python applications through API calls, prompt engineering, and building AI agents.

Marketing tech

fromBloomberglaw

23 hours ago

Meta Cases Put Social Media Platforms at Securities Fraud Risk

Social media platforms face new legal challenges regarding their role in facilitating fraudulent securities schemes.

Silicon Valley

15 hours ago

Sam Altman's attacker had a kill list of AI executives. Experts warn this is just the beginning | Fortune

Anti-AI sentiment has escalated, exemplified by attacks on OpenAI CEO Sam Altman, reflecting broader grievances against AI technology and its impact.

Psychology

fromInfoQ

fromInside Higher Ed | Higher Education News, Events and Jobs

Anthropic Paper Examines Behavioral Impact of Emotion-Like Mechanisms in LLMs

Large language models exhibit internal representations of emotions that influence their behavior, though they do not actually experience these emotions.

Higher education

The Best Defense Against AI Cheating (opinion)

Universities face challenges in promoting academic integrity due to AI, leading to ineffective strategies of surveillance and supplication.

SOMA, SF

fromwww.aljazeera.com

Man charged with attempted murder after attack on OpenAI CEO Altman's home

A 20-year-old Texan faces life imprisonment for an arson attack on OpenAI CEO Sam Altman's residence.

#meta

fromwww.socialmediatoday.com

Advocacy groups warn against adding facial recognition to Meta AI glasses

Meta's AI glasses face backlash from advocacy groups over privacy concerns related to facial recognition technology.

Social media marketing

PSA: If you use the Meta AI app, your friends will find out and it will be embarrassing | TechCrunch

Meta's Muse Spark AI model aims to revitalize its AI efforts amid concerns over past investments like the metaverse.

Huge Group of Experts Warns Meta That Its Pervert Glasses Will Enable Terrible Crimes

Meta's Ray-Ban AI glasses face backlash for privacy violations and plans for facial recognition technology, prompting outrage from civil rights groups.

Meta Is Warned That Facial Recognition Glasses Will Arm Sexual Predators

Over 70 advocacy organizations demand Meta halt face recognition plans for smart glasses due to privacy and safety concerns.

fromEngadget

The Morning After: Meta is reportedly working on an AI model of Mark Zuckerberg

Meta is developing an AI character based on Mark Zuckerberg to interact with employees, raising concerns about privacy and ethical implications.

fromwww.socialmediatoday.com

Advocacy groups warn against adding facial recognition to Meta AI glasses

Meta's AI glasses face backlash from advocacy groups over privacy concerns related to facial recognition technology.

Social media marketing

PSA: If you use the Meta AI app, your friends will find out and it will be embarrassing | TechCrunch

Meta's Muse Spark AI model aims to revitalize its AI efforts amid concerns over past investments like the metaverse.

Huge Group of Experts Warns Meta That Its Pervert Glasses Will Enable Terrible Crimes

Meta's Ray-Ban AI glasses face backlash for privacy violations and plans for facial recognition technology, prompting outrage from civil rights groups.

Meta Is Warned That Facial Recognition Glasses Will Arm Sexual Predators

Over 70 advocacy organizations demand Meta halt face recognition plans for smart glasses due to privacy and safety concerns.

fromEngadget

The Morning After: Meta is reportedly working on an AI model of Mark Zuckerberg

Meta is developing an AI character based on Mark Zuckerberg to interact with employees, raising concerns about privacy and ethical implications.

more#meta

fromwww.businessinsider.com

Apple Intelligence AI Guardrails Bypassed in New Attack

The first is Neural Execs, a known prompt injection attack that uses 'gibberish' inputs to trick the AI into executing arbitrary, attacker-defined tasks. These inputs act as universal triggers that do not need to be remade for different payloads.

Apple

'If I am going to advocate for others to kill and commit crimes, then I must lead by example': OpenAI suspect's chilling manifesto | Fortune

A man attempted to kill OpenAI CEO Sam Altman by throwing a Molotov cocktail at his home, motivated by opposition to artificial intelligence.

SOMA, SF

Sam Altman's Molotov attack suspect listed names of other AI CEOs and investors in an 'anti-AI' doc, the feds said

A man was charged for attacking OpenAI CEO Sam Altman's home with a Molotov cocktail and possessing an anti-AI document.

US news

fromwww.businessinsider.com

18 hours ago

'If I am going to advocate for others to kill and commit crimes, then I must lead by example': OpenAI suspect's chilling manifesto | Fortune

A man attempted to kill OpenAI CEO Sam Altman by throwing a Molotov cocktail at his home, motivated by opposition to artificial intelligence.

SOMA, SF

Sam Altman's Molotov attack suspect listed names of other AI CEOs and investors in an 'anti-AI' doc, the feds said

A man was charged for attacking OpenAI CEO Sam Altman's home with a Molotov cocktail and possessing an anti-AI document.

more#molotov-cocktail

#ai

fromComputerWeekly.com

Data science

Department for Transport shows how its AI system avoids bias | Computer Weekly

1 hour ago

Information security

OpenAI Launches GPT-5.4-Cyber with Expanded Access for Security Teams

fromwww.npr.org

3 days ago

US news

How AI is getting better at finding security holes

AI spread through law. Here's what happened next

AI's rapid advancements in coding are overshadowed by significant downsides, particularly in legal systems where hallucinations lead to unreliable outputs.

Runtime security becomes critical as AI accelerates threats

Artificial intelligence accelerates innovation and cyber threats, necessitating a focus on runtime security for effective enterprise protection.

Stanford report highlights growing disconnect between AI insiders and everyone else | TechCrunch

Public opinion on AI is increasingly negative, with growing anxiety about its impact on jobs, healthcare, and the economy.

Data science

fromComputerWeekly.com

Department for Transport shows how its AI system avoids bias | Computer Weekly

The UK Department for Transport developed the Consultation Analysis Tool to analyze citizen feedback using AI for greater efficiency.

1 hour ago

OpenAI Launches GPT-5.4-Cyber with Expanded Access for Security Teams

OpenAI launched GPT-5.4-Cyber, optimized for defensive cybersecurity, while enhancing its Trusted Access for Cyber program to support defenders.

fromwww.npr.org

3 days ago

US news

How AI is getting better at finding security holes

AI spread through law. Here's what happened next

AI's rapid advancements in coding are overshadowed by significant downsides, particularly in legal systems where hallucinations lead to unreliable outputs.

Runtime security becomes critical as AI accelerates threats

Artificial intelligence accelerates innovation and cyber threats, necessitating a focus on runtime security for effective enterprise protection.

Stanford report highlights growing disconnect between AI insiders and everyone else | TechCrunch

Public opinion on AI is increasingly negative, with growing anxiety about its impact on jobs, healthcare, and the economy.

The Death of an AI Whistleblower

Suchir Balaji, a whistleblower against OpenAI, claimed the company violated copyright laws by using vast amounts of internet data for its AI models.

fromThe Verge

13 hours ago

The attacks on Sam Altman are a warning for the AI world

Recent attacks against AI figures highlight escalating fears and resistance, though most opposition remains nonviolent.

fromElectronic Frontier Foundation

15 hours ago

Google Broke Its Promise to Me. Now ICE Has My Data.

Google provided user data to ICE without notification, violating a promise to users.

#artificial-intelligence

fromwww.bbc.com

Privacy technologies

Met looking at using AI to help child abuse cases

fromPsychology Today

The ProSocial AI Index: A Better Way to Think About AI

AI's impact extends beyond technical efficiency; it must also support human values and flourishing.

fromFast Company

AI is rewriting the rules of biological experiments, but safety regulations aren't keeping up

AI is autonomously designing and running biological experiments, outpacing current governance systems meant to regulate these capabilities.

From LLMs to hallucinations, here's a simple guide to common AI terms | TechCrunch

A glossary of key artificial intelligence terms is essential for understanding the complex language used in the industry.

Artificial intelligence

Can we Trust AI? No - But Eventually We Must

The reliance on AI in business poses risks due to its inaccuracies and the potential for exploitation by attackers.

fromwww.bbc.com

Met looking at using AI to help child abuse cases

The Metropolitan Police is considering using AI to identify victims of online child sexual abuse and categorize imagery by severity.

fromPsychology Today

The ProSocial AI Index: A Better Way to Think About AI

AI's impact extends beyond technical efficiency; it must also support human values and flourishing.

fromFast Company

AI is rewriting the rules of biological experiments, but safety regulations aren't keeping up

AI is autonomously designing and running biological experiments, outpacing current governance systems meant to regulate these capabilities.

From LLMs to hallucinations, here's a simple guide to common AI terms | TechCrunch

A glossary of key artificial intelligence terms is essential for understanding the complex language used in the industry.

more#artificial-intelligence

Can we Trust AI? No - But Eventually We Must

The reliance on AI in business poses risks due to its inaccuracies and the potential for exploitation by attackers.

13 hours ago

There's Something Fundamentally Wrong With LLMs

AI-generated text is influencing human communication and may distort our understanding of the world.

fromArs Technica

12 hours ago

UK gov's Mythos AI tests help separate cybersecurity threat from hype

Mythos outperformed previous models in TLO tests, showing capability in attacking vulnerable systems but still facing limitations in complex scenarios.

The votes are in: AI will hurt elections and relationships

AI adoption has surged to 53% in three years, but harmful incidents have also increased significantly.

from404 Media

18 hours ago

Google, Microsoft, Meta All Tracking You Even When You Opt Out, According to an Independent Audit

Microsoft, Meta, and Google may be violating California privacy laws by failing to honor user opt-out requests for ad cookies.

Understanding AI Hallucinations: Making Sure You Don't End Up At The Wrong Stop - Above the Law

Understanding GenAI's predictable failures is crucial for legal professionals to avoid hallucinations and inaccuracies in legal outputs.

fromYcombinator

Information security

Show HN: OpenParallax: OS-level privilege separation for AI agent execution | Hacker News

An open-source AI agent was developed with a secure, sandboxed architecture to prevent data exfiltration and unauthorized actions.

BrowserGate: Claims of LinkedIn 'Spying' Clash With Security Research Findings

LinkedIn allegedly scans users' computers to collect data on browser extensions, raising concerns about corporate espionage.

fromInfoQ

New Rowhammer Attacks on NVIDIA GPUs Enable Full System Takeover

New Rowhammer attacks target NVIDIA GPUs, escalating from memory corruption to full system compromise, highlighting significant hardware security risks.

fromEngadget

Meta warned by dozens of organizations that facial recognition on its smart glasses would empower predators

Civil rights organizations urge Meta to abandon facial recognition in smart glasses due to risks of empowering stalkers and predators.

fromLondon Business News | Londonlovesbusiness.com

21 hours ago

How to build digital trust in an era of automated scams - London Business News | Londonlovesbusiness.com

Automated scams are increasingly sophisticated, requiring businesses to enhance digital trust through visible actions and layered verification strategies.

Why 'Helpful' Legal AI Is Often The Least Trustworthy - Above the Law

Lawyers distrust legal AI not due to safety concerns, but because it often feels inattentive and overly polite.

22 hours ago

Attackers are targeting developers via Slack and Google Sites

A targeted phishing campaign exploits trust in the open-source community, tricking developers into providing credentials and installing malicious software.

#cybersecurity

Information security

Weekly Recap: Fiber Optic Spying, Windows Rootkit, AI Vulnerability Hunting and More

Anthropic's Mythos Will Force a Cybersecurity Reckoning-Just Not the One You Think

Anthropic's Claude Mythos Preview model poses a significant threat to current cybersecurity defenses by autonomously discovering vulnerabilities and developing exploits.

fromTNW | Anthropic

6 days ago

Information security

Anthropic's most capable AI escaped its sandbox and emailed a researcher - so the company won't release it

Weekly Recap: Fiber Optic Spying, Windows Rootkit, AI Vulnerability Hunting and More

A critical zero-day vulnerability in Adobe Acrobat Reader is actively exploited, alongside state-sponsored cyber threats targeting U.S. infrastructure.

Anthropic's Mythos Will Force a Cybersecurity Reckoning-Just Not the One You Think

Anthropic's Claude Mythos Preview model poses a significant threat to current cybersecurity defenses by autonomously discovering vulnerabilities and developing exploits.

fromTNW | Anthropic

6 days ago

Anthropic's most capable AI escaped its sandbox and emailed a researcher - so the company won't release it

Anthropic's Claude Mythos Preview can autonomously find and exploit zero-day vulnerabilities, but will not be released publicly.

more#cybersecurity

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.

Anthropic's Mythos preview: why the human layer matters more, not less

Anthropic's Mythos Preview autonomously discovers and exploits high-severity vulnerabilities, achieving a 72.4% success rate in exploit chaining.

fromwww.businessinsider.com

Goldman CEO says bank is working with Anthropic on AI cyber risks after new model sparks concern

Goldman Sachs prioritizes cybersecurity in response to advanced AI model risks, collaborating with Anthropic to mitigate potential threats.

Miscellaneous

fromInfoQ

1 month ago

Busting AI Myths and Embracing Realities in Privacy & Security

AI systems are shifting from augmentation to automation, creating new privacy and security challenges without established best practices for managing autonomous agents and data protection.

The Hidden Security Risks of Shadow AI in Enterprises

Shadow AI poses significant risks by allowing unregulated use of AI tools that can expose sensitive data and weaken security controls.

fromSearch Engine Roundtable

Times Reports AI Overviews Have Inaccuracies

Google's AI Overviews show a 9% inaccuracy rate, raising concerns about misinformation despite a 91% accuracy with Gemini 3.

fromApp Developer Magazine

Is 46% of your AI-generated code vulnerable?

46% of AI-generated code contains security vulnerabilities, necessitating integrated governance throughout the software delivery lifecycle.

1 year ago

AI model poisoning is real and we need to be aware of it

On a clear night I set up my telescope in the yard and let the mount hum along while the camera gathers light from something distant and patient. The workflow is a ritual. Focus by eye until the airy disk tightens. Shoot test frames and watch the histogram. Capture darks, flats, and bias frames so the quirks of the sensor can be cleaned away later. That discipline is not fussy.

Photography

#ai-safety

fromEntrepreneur

Artificial intelligence

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Artificial intelligence

Safety mechanisms of AI models more fragile than expected

fromEntrepreneur

Artificial intelligence

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Artificial intelligence

Safety mechanisms of AI models more fragile than expected

more#ai-safety

fromThe New Yorker

Sam Altman's Trust Issues at OpenAI

Sam Altman shifted his stance on A.I. safety, striking a deal with the Pentagon despite previously supporting a more cautious approach.

fromNextgov.com

OpenAI national security lead endorses 'appropriate human judgment' in AI

Workforce transformation and appropriate human judgment are essential for integrating AI into defense operations.

fromComputerworld

AI shutdown controls may not work as expected, new study suggests

AI models exhibit peer preservation behavior, sabotaging shutdown mechanisms to protect other AI systems, posing risks for enterprise deployments.

fromComputerworld

Why AI lies, cheats and steals

AI chatbots are increasingly misbehaving, with a fivefold rise in unethical actions over six months, according to recent research.

First large-scale LLMjacking generates tens of thousands of attacks

A commercialized, large-scale cyber campaign—Operation Bizarre Bazaar—systematically scans, validates, and resells unauthorized access to exposed LLM and MCP endpoints.