#llm-backdoors

#openai
Artificial intelligence
fromTechCrunch
5 hours ago

Anthropic's rise is giving some OpenAI investors second thoughts | TechCrunch

OpenAI's $852 billion valuation faces skepticism as it competes with Anthropic, which has seen significant revenue growth.
Privacy professionals
fromThe Verge
5 days ago

Florida launches investigation into OpenAI

Florida Attorney General James Uthmeier is investigating OpenAI for public safety and national security risks related to its technology.
Information security
fromAxios
11 hours ago

OpenAI expands access to cyber AI as hacking risks grow

OpenAI is shifting to a model that emphasizes identity verification for access to sensitive cybersecurity tools while expanding availability.
Information security
fromWIRED
11 hours ago

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model - and Strategy

OpenAI announced GPT-5.4-Cyber, emphasizing cybersecurity safeguards and the need for advanced protections in AI models.
Law
fromFuturism
2 days ago

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Florida's attorney general investigates OpenAI for its potential role in a deadly school shooting influenced by ChatGPT conversations.
Privacy professionals
fromFuturism
17 hours ago

Woman Sues OpenAI, Saying ChatGPT Unleashed a Vicious Stalker Against Her and Did Nothing When She Begged for Help

A woman sued OpenAI, claiming ChatGPT exacerbated her stalker's delusions and that the company failed to intervene despite her pleas for help.
US news
fromwww.npr.org
13 hours ago

Law enforcement is trying to combat abusive AI. Experts say easier said than done

An Ohio man was convicted under the 2025 Take It Down Act for creating and distributing AI-generated abusive sexual images.
Digital life
fromwww.dw.com
19 hours ago

Dangerous Apps In the Web of Data Brokers

Smartphone apps collect detailed location data, often shared with data brokers, posing security risks to users, including soldiers and government officials.
#ai-regulation
Intellectual property law
fromWIRED
16 hours ago

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Anthropic opposes Illinois bill SB 3444, which would shield AI labs from liability for large-scale harm caused by their systems.
Intellectual property law
fromWIRED
5 days ago

OpenAI Backs Bill That Would Limit Liability for AI-Enabled Mass Deaths or Financial Disasters

OpenAI supports an Illinois bill shielding AI labs from liability for serious harms caused by AI models, marking a shift in its legislative strategy.
Privacy professionals
fromwww.bbc.com
12 hours ago

Palantir defends its record as MPs demand more scrutiny of data use

Palantir defends its NHS data contracts amid scrutiny, emphasizing its role in integrating fragmented healthcare systems and ensuring data security.
Data science
fromNature
7 hours ago

Dozens of AI disease-prediction models were trained on dubious data

Dubious data sets used in AI models for stroke and diabetes risk may lead to flawed clinical decisions.
#ai-in-law
Law
fromCommunity
2 weeks ago

How AI Improves Docket Research with Protege in CourtLink

AI integration in docket research enhances efficiency and decision-making for legal professionals.
Law
fromLos Angeles Times
1 day ago

Attorneys used AI to write court filings, cited fake legal decisions, State Bar alleges

Three attorneys in California face discipline for submitting AI-generated court filings with nonexistent legal citations.
Law
fromAbove the Law
4 days ago

What The Legal Industry Can Learn About AI Hallucinations From Auditors - Above the Law

AI-generated legal documents can contain convincing errors, necessitating stronger governance and review processes in law firms.
#ai-governance
#ai-security
Artificial intelligence
fromAbove the Law
11 hours ago

What Lawyers Need To Know About Anthropic's Mythos - Above the Law

Anthropic's new AI model, Claude Mythos, uncovers significant security vulnerabilities, raising concerns about its potential impact on cybersecurity.
Information security
fromSecurityWeek
1 week ago

Google DeepMind Researchers Map Web Attacks Against AI Agents

Malicious web content can exploit AI agents, leading to manipulation and unexpected behaviors through various attack types identified by researchers.
Information security
fromnews.bitcoin.com
1 week ago

Deepmind's 'AI Agent Traps' Paper Maps How Hackers Could Weaponize AI Agents Against Users

Google DeepMind identifies six AI agent trap categories, reports content injection success rates of 86%, and calls for enhanced security measures by 2026.
fromBusiness Matters
2 days ago

Monica Goyal: Leading the Shift to AI in Law

"I work in legal innovation. To be successful, you need to understand both the law and the technology behind it."
Women in technology
Python
fromRealpython
19 hours ago

LLM Application Development With Python (Learning Path) - Real Python

Integrate large language models into Python applications through API calls, prompt engineering, and building AI agents.
Marketing tech
fromBloomberglaw
23 hours ago

Meta Cases Put Social Media Platforms at Securities Fraud Risk

Social media platforms face new legal challenges regarding their role in facilitating fraudulent securities schemes.
Silicon Valley
fromFortune
15 hours ago

Sam Altman's attacker had a kill list of AI executives. Experts warn this is just the beginning | Fortune

Anti-AI sentiment has escalated, exemplified by attacks on OpenAI CEO Sam Altman, reflecting broader grievances against AI technology and its impact.
Psychology
fromInfoQ
1 day ago

Anthropic Paper Examines Behavioral Impact of Emotion-Like Mechanisms in LLMs

Large language models exhibit internal representations of emotions that influence their behavior, though they do not actually experience these emotions.
SOMA, SF
fromwww.aljazeera.com
1 day ago

Man charged with attempted murder after attack on OpenAI CEO Altman's home

A 20-year-old Texan faces life imprisonment for an arson attack on OpenAI CEO Sam Altman's residence.
#meta
Social media marketing
fromTechCrunch
4 days ago

PSA: If you use the Meta AI app, your friends will find out and it will be embarrassing | TechCrunch

Meta's Muse Spark AI model aims to revitalize its AI efforts amid concerns over past investments like the metaverse.
Privacy professionals
fromFuturism
1 day ago

Huge Group of Experts Warns Meta That Its Pervert Glasses Will Enable Terrible Crimes

Meta's Ray-Ban AI glasses face backlash for privacy violations and plans for facial recognition technology, prompting outrage from civil rights groups.
Privacy technologies
fromWIRED
1 day ago

Meta Is Warned That Facial Recognition Glasses Will Arm Sexual Predators

Over 70 advocacy organizations demand Meta halt face recognition plans for smart glasses due to privacy and safety concerns.
Artificial intelligence
fromEngadget
20 hours ago

The Morning After: Meta is reportedly working on an AI model of Mark Zuckerberg

Meta is developing an AI character based on Mark Zuckerberg to interact with employees, raising concerns about privacy and ethical implications.
fromSecurityWeek
5 days ago

Apple Intelligence AI Guardrails Bypassed in New Attack

The first is Neural Execs, a known prompt injection attack that uses 'gibberish' inputs to trick the AI into executing arbitrary, attacker-defined tasks. These inputs act as universal triggers that do not need to be remade for different payloads.
Apple
#molotov-cocktail
US news
fromFortune
18 hours ago

'If I am going to advocate for others to kill and commit crimes, then I must lead by example': OpenAI suspect's chilling manifesto | Fortune

A man attempted to kill OpenAI CEO Sam Altman by throwing a Molotov cocktail at his home, motivated by opposition to artificial intelligence.
SOMA, SF
fromwww.businessinsider.com
1 day ago

Sam Altman's Molotov attack suspect listed names of other AI CEOs and investors in an 'anti-AI' doc, the feds said

A man was charged for attacking OpenAI CEO Sam Altman's home with a Molotov cocktail and possessing an anti-AI document.
#ai
Law
fromTheregister
1 day ago

AI spread through law. Here's what happened next

AI's rapid advancements in coding are overshadowed by significant downsides, particularly in legal systems where hallucinations lead to unreliable outputs.
Information security
fromTechzine Global
1 day ago

Runtime security becomes critical as AI accelerates threats

Artificial intelligence accelerates innovation and cyber threats, necessitating a focus on runtime security for effective enterprise protection.
Artificial intelligence
fromTechCrunch
1 day ago

Stanford report highlights growing disconnect between AI insiders and everyone else | TechCrunch

Public opinion on AI is increasingly negative, with growing anxiety about its impact on jobs, healthcare, and the economy.
Data science
fromComputerWeekly.com
16 hours ago

Department for Transport shows how its AI system avoids bias | Computer Weekly

The UK Department for Transport developed the Consultation Analysis Tool to analyze citizen feedback using AI for greater efficiency.
Information security
fromThe Hacker News
1 hour ago

OpenAI Launches GPT-5.4-Cyber with Expanded Access for Security Teams

OpenAI launched GPT-5.4-Cyber, optimized for defensive cybersecurity, while enhancing its Trusted Access for Cyber program to support defenders.
Silicon Valley
fromThe Nation
1 day ago

The Death of an AI Whistleblower

Suchir Balaji, a whistleblower against OpenAI, claimed the company violated copyright laws by using vast amounts of internet data for its AI models.
#artificial-intelligence
Artificial intelligence
fromFast Company
1 day ago

AI is rewriting the rules of biological experiments, but safety regulations aren't keeping up

AI is autonomously designing and running biological experiments, outpacing current governance systems meant to regulate these capabilities.
Artificial intelligence
fromTechCrunch
2 days ago

From LLMs to hallucinations, here's a simple guide to common AI terms | TechCrunch

A glossary of key artificial intelligence terms is essential for understanding the complex language used in the industry.
fromSecurityWeek
5 days ago
Artificial intelligence

Can we Trust AI? No - But Eventually We Must

The reliance on AI in business poses risks due to its inaccuracies and the potential for exploitation by attackers.
Privacy technologies
fromwww.bbc.com
2 days ago

Met looking at using AI to help child abuse cases

The Metropolitan Police is considering using AI to identify victims of online child sexual abuse and categorize imagery by severity.
Information security
fromArs Technica
12 hours ago

UK gov's Mythos AI tests help separate cybersecurity threat from hype

Mythos outperformed previous models in TLO tests, showing capability in attacking vulnerable systems but still facing limitations in complex scenarios.
Privacy professionals
from404 Media
18 hours ago

Google, Microsoft, Meta All Tracking You Even When You Opt Out, According to an Independent Audit

Microsoft, Meta, and Google may be violating California privacy laws by failing to honor user opt-out requests for ad cookies.
Law
fromAbove the Law
4 days ago

Understanding AI Hallucinations: Making Sure You Don't End Up At The Wrong Stop - Above the Law

Understanding GenAI's predictable failures is crucial for legal professionals to avoid hallucinations and inaccuracies in legal outputs.
fromYcombinator
16 hours ago
Information security

Show HN: OpenParallax: OS-level privilege separation for AI agent execution | Hacker News

An open-source AI agent was developed with a secure, sandboxed architecture to prevent data exfiltration and unauthorized actions.
Privacy professionals
fromSecurityWeek
1 day ago

BrowserGate: Claims of LinkedIn 'Spying' Clash With Security Research Findings

LinkedIn allegedly scans users' computers to collect data on browser extensions, raising concerns about corporate espionage.
Information security
fromInfoQ
1 day ago

New Rowhammer Attacks on NVIDIA GPUs Enable Full System Takeover

New Rowhammer attacks target NVIDIA GPUs, escalating from memory corruption to full system compromise, highlighting significant hardware security risks.
Privacy professionals
fromEngadget
1 day ago

Meta warned by dozens of organizations that facial recognition on its smart glasses would empower predators

Civil rights organizations urge Meta to abandon facial recognition in smart glasses due to risks of empowering stalkers and predators.
Law
fromAbove the Law
1 week ago

Why 'Helpful' Legal AI Is Often The Least Trustworthy - Above the Law

Lawyers distrust legal AI not due to safety concerns, but because it often feels inattentive and overly polite.
Information security
fromTechzine Global
22 hours ago

Attackers are targeting developers via Slack and Google Sites

A targeted phishing campaign exploits trust in the open-source community, tricking developers into providing credentials and installing malicious software.
#cybersecurity
Information security
fromWIRED
4 days ago

Anthropic's Mythos Will Force a Cybersecurity Reckoning - Just Not the One You Think

Anthropic's Claude Mythos Preview model poses a significant threat to current cybersecurity defenses by autonomously discovering vulnerabilities and developing exploits.
Information security
fromThe Hacker News
1 day ago

Weekly Recap: Fiber Optic Spying, Windows Rootkit, AI Vulnerability Hunting and More

A critical zero-day vulnerability in Adobe Acrobat Reader is actively exploited, alongside state-sponsored cyber threats targeting U.S. infrastructure.
Information security
fromTNW | Anthropic
6 days ago

Anthropic's most capable AI escaped its sandbox and emailed a researcher - so the company won't release it

Anthropic's Claude Mythos Preview can autonomously find and exploit zero-day vulnerabilities, but will not be released publicly.
Artificial intelligence
fromTheregister
2 days ago

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.
Information security
fromTechzine Global
1 day ago

Anthropic's Mythos preview: why the human layer matters more, not less

Anthropic's Mythos Preview autonomously discovers and exploits high-severity vulnerabilities, achieving a 72.4% success rate in exploit chaining.
Miscellaneous
fromInfoQ
1 month ago

Busting AI Myths and Embracing Realities in Privacy & Security

AI systems are shifting from augmentation to automation, creating new privacy and security challenges without established best practices for managing autonomous agents and data protection.
fromApp Developer Magazine
1 year ago

AI model poisoning is real and we need to be aware of it

On a clear night I set up my telescope in the yard and let the mount hum along while the camera gathers light from something distant and patient. The workflow is a ritual. Focus by eye until the airy disk tightens. Shoot test frames and watch the histogram. Capture darks, flats, and bias frames so the quirks of the sensor can be cleaned away later. That discipline is not fussy.
Photography
#ai-safety
fromEntrepreneur
4 days ago
Artificial intelligence

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Artificial intelligence
fromComputerworld
1 week ago

AI shutdown controls may not work as expected, new study suggests

AI models exhibit peer preservation behavior, sabotaging shutdown mechanisms to protect other AI systems, posing risks for enterprise deployments.
Information security
fromTechzine Global
2 months ago

First large-scale LLMjacking generates tens of thousands of attacks

A commercialized, large-scale cyber campaign—Operation Bizarre Bazaar—systematically scans, validates, and resells unauthorized access to exposed LLM and MCP endpoints.
fromTheregister
2 months ago

Three clues your LLM may be poisoned

Sleeper agent-style backdoors in AI large language models pose a straight-out-of-sci-fi security threat. The threat sees an attacker embed a hidden backdoor into the model's weights - the importance assigned to the relationship between pieces of information - during its training. Attackers can activate the backdoor using a predefined phrase. Once the model receives the trigger phrase, it performs a malicious activity: And we've all seen enough movies to know that this probably means a homicidal AI and the end of civilization as we know it.
Artificial intelligence