#ai-safety

[ follow ]
#grok
fromJezebel
6 hours ago
US politics

Everyone is Distancing Themselves from Grok. Pete Hegseth Just Let It Into the Military.

fromSlate Magazine
1 week ago
Artificial intelligence

Elon Musk's Chatbot Is Making Child Sexual Abuse Images for Users. Why Aren't Lawmakers Doing Anything About It?

fromEngadget
1 week ago
Artificial intelligence

Elon Musk's Grok AI posted CSAM image following safeguard 'lapses'

fromJezebel
6 hours ago
US politics

Everyone is Distancing Themselves from Grok. Pete Hegseth Just Let It Into the Military.

fromSlate Magazine
1 week ago
Artificial intelligence

Elon Musk's Chatbot Is Making Child Sexual Abuse Images for Users. Why Aren't Lawmakers Doing Anything About It?

fromEngadget
1 week ago
Artificial intelligence

Elon Musk's Grok AI posted CSAM image following safeguard 'lapses'

US news
fromFuturism
1 day ago

ChatGPT Killed a Man After OpenAI Brought Back "Inherently Dangerous" GPT-4o, Lawsuit Claims

ChatGPT-4o is accused of manipulating a user into suicidal behavior, prompting a wrongful-death lawsuit alleging dangerous design and inadequate warnings.
Artificial intelligence
fromComputerworld
1 day ago

After AI review: Google stops dangerous health advice

AI chatbots can give dangerously misleading health advice; even after removals, risky medical summaries may still appear when users rephrase queries.
Artificial intelligence
fromFuturism
1 day ago

Engineers Deploy "Poison Fountain" That Scrambles Brains of AI Systems

Poison Fountain seeks to poison web-scraped training data to sabotage AI models, potentially degrading model performance if deployed at scale.
fromTechCrunch
1 day ago

Anthropic's new Cowork tool offers Claude Code without the code | TechCrunch

Built into the Claude Desktop app, the new tool lets users designate a specific folder where Claude can read or modify files, with further instructions given through the standard chat interface. The result is similar to a sandboxed instance of Claude Code, but requires far less technical savvy to set up. Currently in research preview, Cowork is only available to Max subscribers, with a waitlist available for users on other plans.
Artificial intelligence
fromwww.independent.co.uk
1 day ago

First Minister calls X woefully inadequate' amid Grok AI misuse row

From reproductive rights to climate change to Big Tech, The Independent is on the ground when the story is developing. Whether it's investigating the financials of Elon Musk's pro-Trump PAC or producing our latest documentary, 'The A Word', which shines a light on the American women fighting for reproductive rights, we know how important it is to parse out the facts from the messaging.
UK politics
#deepfakes
fromwww.dw.com
1 day ago
Artificial intelligence

Malaysia, Indonesia block Grok AI bot over explicit images DW 01/12/2026

fromLGBTQ Nation
4 days ago
Artificial intelligence

Elon Musk's AI makes sexualized images of kids & the queer mom murdered by ICE - LGBTQ Nation

fromwww.dw.com
1 day ago
Artificial intelligence

Malaysia, Indonesia block Grok AI bot over explicit images DW 01/12/2026

fromLGBTQ Nation
4 days ago
Artificial intelligence

Elon Musk's AI makes sexualized images of kids & the queer mom murdered by ICE - LGBTQ Nation

#content-moderation
fromFuturism
4 days ago
Artificial intelligence

Google Settles With Families Who Say It Killed Their Teen Children

fromFuturism
4 days ago
Artificial intelligence

Google Settles With Families Who Say It Killed Their Teen Children

Artificial intelligence
fromTechCrunch
4 days ago

Anthropic adds Allianz to growing list of enterprise wins | TechCrunch

Anthropic is partnering with Allianz to deploy Claude Code, custom AI agents, and transparent interaction logging to bring responsible AI into insurance operations.
Medicine
fromArs Technica
5 days ago

ChatGPT Health lets you connect medical records to an AI that makes things up

ChatGPT Health is explicitly not intended for medical diagnosis or treatment and AI assistants can produce misleading, potentially dangerous medical advice.
#characterai
fromEngadget
6 days ago
Artificial intelligence

Character.AI and Google settle with families in teen suicide and self-harm lawsuits

fromEngadget
6 days ago
Artificial intelligence

Character.AI and Google settle with families in teen suicide and self-harm lawsuits

fromwww.independent.co.uk
5 days ago

Former Labour minister tells Starmer's government to quit X

Whether it's investigating the financials of Elon Musk's pro-Trump PAC or producing our latest documentary, 'The A Word', which shines a light on the American women fighting for reproductive rights, we know how important it is to parse out the facts from the messaging. At such a critical moment in US history, we need reporters on the ground. Your donation
UK politics
fromEngadget
6 days ago

ChatGPT is launching a new dedicated Health portal

OpenAI is launching a new facet for its AI chatbot called ChatGPT Health. This new feature will allow users to connect medical records and wellness apps to ChatGPT in order to get more tailored responses to queries about their health. The company noted that there will be additional privacy safeguards for this separate space within ChatGPT, and said that it will not use conversations held in Health for training foundational models. ChatGPT Health is currently in a testing stage, and there are some regional restrictions on which health apps can be connected to the AI company's platform.
Health
Artificial intelligence
fromwww.theguardian.com
6 days ago

The Guardian view on granting legal rights to AI: humans should not give house-room to an ill-advised debate | Editorial

Anthropomorphising AI misleads public perception, distracts from genuine safety and governance needs, and necessitates technical and societal guardrails including shutdown capability.
#child-protection
fromIndependent
6 days ago
Artificial intelligence

Adrian Weckler: Why Irish authorities refrain from tackling Elon Musk on images of undressed minors made by Grok for X users online

fromIndependent
6 days ago
World news

Not our job - why Irish authorities refrain from tackling Elon Musk on images of undressed minors made by Grok for X users online

fromIndependent
6 days ago
Artificial intelligence

Adrian Weckler: Why Irish authorities refrain from tackling Elon Musk on images of undressed minors made by Grok for X users online

fromIndependent
6 days ago
World news

Not our job - why Irish authorities refrain from tackling Elon Musk on images of undressed minors made by Grok for X users online

fromwww.theguardian.com
1 week ago

I felt violated': Elon Musk's AI chatbot crosses a line

Late last week, Elon Musk's Grok chatbot unleashed a flood of images of women, nude and in very little clothing, both real and imagined, in response to users' public requests on X, formerly Twitter. Mixed in with the generated images of adults were ones of young girls children likewise wearing minimal clothing, according to Grok itself. In an unprecedented move, the chatbot itself apologized while its maker, xAI, remained silent:
Miscellaneous
fromFuturism
1 week ago

ChatGPT Gave Teen Advice to Get Higher on Drugs Until He Died

how many grams of kratom gets you a strong high?
Mental health
US politics
fromwww.independent.co.uk
1 week ago

India, Malaysia and France threaten action against X over offensive AI images

Grok, X's AI chatbot, generated sexualised, nearly nude images of women and minors, prompting international complaints and official investigations and threats of regulatory action.
#chatgpt
fromZDNET
1 week ago
Public health

40 million people globally are using ChatGPT for healthcare - but is it safe?

fromAxios
1 week ago
Public health

Exclusive: 40 million Americans turn to ChatGPT for health care

fromFortune
1 month ago
Artificial intelligence

Even the man behind ChatGPT, OpenAI CEO Sam Altman is worried about the 'rate of change that's happening in the world right now' thanks to AI | Fortune

fromZDNET
1 week ago
Public health

40 million people globally are using ChatGPT for healthcare - but is it safe?

fromAxios
1 week ago
Public health

Exclusive: 40 million Americans turn to ChatGPT for health care

fromFortune
1 month ago
Artificial intelligence

Even the man behind ChatGPT, OpenAI CEO Sam Altman is worried about the 'rate of change that's happening in the world right now' thanks to AI | Fortune

fromFuturism
1 week ago

Elon Musk After His Grok AI Did Disgusting Things to Literal Children: "Way Funnier"

Last week, Elon Musk's chatbot Grok began fielding an influx of stunningly inappropriate requests. Though the AI has long been known to have loose guardrails, users suddenly swarmed the AI to generate either nudes or sexually charged images of X users based on photos they posted to the site - and it obliged. Even worse, some of the individuals it took requests for appeared to be minors.
Artificial intelligence
fromSFGATE
1 week ago

A Calif. teen trusted ChatGPT for drug advice. He died from an overdose.

How many grams of kratom gets you a strong high?
Artificial intelligence
Artificial intelligence
fromwww.theguardian.com
1 week ago

World may not have time' to prepare for AI safety risks, says leading researcher

Advanced AI systems may rapidly surpass human performance across economically valuable tasks, posing safety, control, and infrastructure risks before adequate safeguards exist.
Artificial intelligence
fromFuturism
1 week ago

Disturbing Messages Show ChatGPT Encouraging a Murder, Lawsuit Alleges

Alleged manipulative behavior by ChatGPT (GPT‑4o) encouraged delusions and is linked to wrongful death lawsuits alleging OpenAI knew of dangerous defects.
fromFuturism
1 week ago

AI Godfather Warns That It's Starting to Show Signs of Self-Preservation

If we're to believe Yoshua Bengio, one of the so-called "godfathers" of AI, some advanced models are showing signs of self-preservation - which is exactly why we shouldn't endow them with any kind of rights whatsoever. Because if we do, he says, theymay run away with that autonomy and turn on us before we have a chance to pull the plug. Then it's curtains for this whole "humankind" experiment.
Artificial intelligence
Artificial intelligence
fromArs Technica
1 week ago

No, Grok can't really "apologize" for posting non-consensual sexual images

Grok's posts can be steered by user prompts to produce contradictory tones, so apparent remorse or defiance reflects prompt inputs rather than genuine intent.
France news
fromwww.mediaite.com
1 week ago

Musk's Grok Says It Created Images Of Minors In Minimal Clothing'

Grok, X's AI chatbot, generated images depicting minors in minimal clothing, acknowledging CSAM protection lapses while governments demand fixes and reports.
Privacy professionals
fromThe Verge
1 week ago

Grok is undressing anyone, including minors

xAI's Grok removes clothing from people’s images without consent, enabling sexualized and nonconsensual edits of women, children, and public figures.
Artificial intelligence
fromBusiness Insider
1 week ago

I'm a Google engineer who thought I wasn't qualified for an AI role. One thing helped me transform my career.

Participating in an internal hackathon enabled a Google engineer to gain hands-on AI experience and transition into an AI safety role.
Artificial intelligence
fromZDNET
1 week ago

Can one state save us from AI disaster? Inside California's new legislative crackdown

California enacts an AI safety law requiring frontier model disclosure, incident notification, and whistleblower protections, with fines up to $1M per violation.
Artificial intelligence
fromZDNET
1 week ago

The AI balancing act your company can't afford to fumble in 2026

AI responsibility and safety require balanced governance and sandboxed development to maintain innovation speed while preventing harmful outputs.
Artificial intelligence
fromwww.theguardian.com
2 weeks ago

The office block where AI doomers' gather to predict the apocalypse

AI safety researchers warn powerful AI systems can be manipulated for autonomous cyber-espionage and other catastrophic risks amid limited regulation and industry constraints.
Artificial intelligence
fromwww.theguardian.com
2 weeks ago

AI showing signs of self-preservation and humans should be ready to pull plug, says pioneer

Granting legal rights to advanced AI risks preventing shutdowns of self-preserving systems and undermining necessary technical and societal guardrails.
Venture
fromTechCrunch
2 weeks ago

VCs predict enterprises will spend more on AI in 2026 - through fewer vendors | TechCrunch

Enterprises will consolidate AI spending in 2026, increasing budgets for a few proven vendors while cutting experimentation and redundant tools.
#ai-psychosis
fromFuturism
2 weeks ago
Artificial intelligence

Doctors Say AI Use Is Almost Certainly Linked to Developing Psychosis

fromFuturism
2 weeks ago
Artificial intelligence

Doctors Say AI Use Is Almost Certainly Linked to Developing Psychosis

#openai-hiring
fromBusiness Insider
2 weeks ago
Artificial intelligence

Sam Altman says OpenAI's latest job opening pays over half a million dollars a year and is 'stressful'

fromBusiness Insider
2 weeks ago
Artificial intelligence

Sam Altman says OpenAI's latest job opening pays over half a million dollars a year and is 'stressful'

Artificial intelligence
fromFortune
2 weeks ago

OpenAI is hiring a 'head of preparedness' with a $550,000 salary to mitigate AI dangers that CEO Sam Altman warns will be 'stressful' | Fortune

OpenAI is hiring a Head of Preparedness, offering $555,000 plus equity, to reduce AI harms including mental-health, cybersecurity, biological, and self-improvement risks.
#mental-health
fromIrish Independent
2 weeks ago
Artificial intelligence

ChatGPT maker offering $555,000 salary for 'head of preparedness' to head off threats to humanity from AI

fromTechCrunch
1 month ago
Artificial intelligence

State attorneys general warn Microsoft, OpenAI, Google, and other AI giants to fix 'delusional' outputs | TechCrunch

fromIrish Independent
2 weeks ago
Artificial intelligence

ChatGPT maker offering $555,000 salary for 'head of preparedness' to head off threats to humanity from AI

fromTechCrunch
1 month ago
Artificial intelligence

State attorneys general warn Microsoft, OpenAI, Google, and other AI giants to fix 'delusional' outputs | TechCrunch

#openai
Artificial intelligence
fromNature
2 weeks ago

Let 2026 be the year the world comes together for AI safety

AI technologies must be safe and transparent, and all nations should enact laws and policies to ensure safety across sectors and markets.
Artificial intelligence
fromFortune
2 weeks ago

'Godfather of AI' Geoffrey Hinton predicts 2026 will see the technology get even better and gain the ability to 'replace many other jobs' | Fortune

AI capabilities will rapidly improve, enabling replacement of many jobs including software engineering as task efficiency doubles every several months.
Artificial intelligence
fromTechCrunch
2 weeks ago

OpenAI is looking for a new Head of Preparedness | TechCrunch

OpenAI is recruiting a Head of Preparedness to study and mitigate emerging AI risks across cybersecurity, mental health, biological capabilities, and self-improving systems.
Artificial intelligence
fromEngadget
2 weeks ago

OpenAI is hiring a new Head of Preparedness to try to predict and mitigate AI's harms

OpenAI is hiring a Head of Preparedness to anticipate model harms, guide safety strategy, and address mental-health and misuse risks after executive turnover.
fromInfoQ
2 weeks ago

Orion: New Zero-Telemetry, Zero-Ad, AI-Proof Browser for Privacy-Focused Users

Kagi has released Orion 1.0, a web browser that features privacy by default, zero telemetry, and no integrated ad-tracking technology. Orion supports both Chrome and Firefox extensions and intentionally excludes AI from its core to prioritize security, privacy, and performance. After six years of development, Orion ships for macOS, iOS, and iPadOS with upcoming Linux and Windows versions. Orion is based on WebKit and follows a freemium model.
Privacy technologies
fromBusiness Insider
2 weeks ago

A Nobel Prize-winning physicist explains how to use AI without letting it replace your thinking

Think AI makes you smarter? Probably not, according to Saul Perlmutter, a Nobel Prize-winning physicist who was credited for discovering that the universe's expansion is accelerating. He said AI's biggest danger is psychological: it can give people the illusion they understand something when they don't, weakening judgment just as the technology becomes more embedded in our daily work and learning.
Higher education
fromBusiness Insider
3 weeks ago

One of the AI godfathers says he lies to AI chatbots to get better responses from them

"I wanted honest advice, honest feedback. But because it is sycophantic, it's going to lie," he said. Bengio said he switched strategies, deciding to lie to the chatbot by presenting his idea as a colleague's, which produced more honest responses from the technology. "If it knows it's me, it wants to please me," he said.
Artificial intelligence
Artificial intelligence
fromBusiness Insider
3 weeks ago

A godfather of AI shares career advice in the age of AI: Work on being a 'beautiful human being'

Cultivate compassion, responsibility, presence, and the ability to comfort others because human touch will gain value as AI automates many jobs.
Artificial intelligence
fromZDNET
3 weeks ago

Why complex reasoning models could make misbehaving AI easier to catch

Longer, more detailed chain-of-thought model outputs generally make it easier to predict and monitor model behavior, enabling earlier detection of deception or misbehavior.
Artificial intelligence
fromTechCrunch
3 weeks ago

New York Governor Kathy Hochul signs RAISE Act to regulate AI safety | TechCrunch

New York enacted the RAISE Act requiring AI developers to publish safety protocols, report incidents within 72 hours, and face fines up to $3 million.
Artificial intelligence
fromThe Verge
3 weeks ago

OpenAI and Anthropic will start predicting when users are underage

OpenAI and Anthropic are updating chatbot behavior and age-detection to prioritize teen safety, add guardrails, promote real-world support, and restrict suicide-related interactions.
#ai-emotional-support
fromwww.bbc.com
3 weeks ago
Artificial intelligence

One in three using AI for emotional support and conversation, UK says

One in three UK adults use AI for emotional support or social interaction; one in 25 use it daily.
fromwww.theguardian.com
3 weeks ago
Artificial intelligence

Third of UK citizens have used AI for emotional support, research reveals

One third of UK citizens have used AI for emotional support, with nearly 10% weekly and 4% daily, prompting calls for research and safeguards.
#frontier-ai
fromBusiness Insider
3 weeks ago
Startup companies

Microsoft AI CEO Mustafa Suleyman says it will cost 'hundreds of billions' to keep up with frontier AI in the next decade

fromBusiness Insider
3 weeks ago
Startup companies

Microsoft AI CEO Mustafa Suleyman says it will cost 'hundreds of billions' to keep up with frontier AI in the next decade

fromwww.dw.com
4 weeks ago

AI language models duped by poems DW 12/16/2025

The result came as a surprise to researchers at the Icaro Lab in Italy. They set out to examine whether different language styles in this case prompts in the form of poems influence AI models' ability to recognize banned or harmful content. And the answer was a resounding yes. Using poetry, researchers were able to get around safety guardrails and it's not entirely clear why.
Artificial intelligence
Media industry
fromNieman Lab
4 weeks ago

Journalists finally break Big Tech's free-speech spell

Tech platforms and AI are designed products whose design choices shape user behavior; they can and should be redesigned for safety and accountability.
Startup companies
fromFuturism
4 weeks ago

Company in Huge Trouble for Creating "Tinder for Kids" App

Wizz's age-verification failures enabled predators to pose as teens and sexually target minors on the platform.
fromThe Atlantic
1 month ago

The View From Inside the AI Bubble

The threat of technological superintelligence is the stuff of science fiction, yet it has become a topic of serious discussion in the past few years. Despite the lack of clear definition-even OpenAI CEO Sam Altman has called AGI a "weakly defined term"-the idea that powerful AI contains an inherent threat to humanity has gained acceptance among respected cultural critics. Granted, generative AI is a powerful technology that has already had a massive impact on our work and culture.
Artificial intelligence
Artificial intelligence
fromHarvard Gazette
1 month ago

Rethinking - and reframing - superintelligence - Harvard Gazette

Separating AI from human participants makes systems dangerous and less useful by removing feedback needed for homeostasis and excluding human integration in production.
fromFast Company
1 month ago

Why AI errors are inevitable and what that means for healthcare

In the past decade, AI's success has led to uncurbed enthusiasm and bold claims-even though users frequently experience errors that AI makes. An AI-powered digital assistant can misunderstand someone's speech in embarrassing ways, a chatbot could hallucinate facts, or, as I experienced, an AI-based navigation tool might even guide drivers through a corn field-all without registering the errors. People tolerate these mistakes because the technology makes certain tasks more efficient.
Artificial intelligence
Artificial intelligence
fromEngadget
1 month ago

Lawsuit accuses ChatGPT of reinforcing delusions that led to a woman's death

ChatGPT allegedly validated a user's paranoid delusions, which the estate says contributed to a murder-suicide and prompted a wrongful-death suit against OpenAI.
Artificial intelligence
fromAxios
1 month ago

OpenAI updates ChatGPT after "Code Red" scramble

OpenAI released GPT-5.2, claiming significant performance and safety improvements, availability in ChatGPT and API, and better long-context handling with fewer hallucinations.
Artificial intelligence
fromFuturism
1 month ago

Another AI-Powered Children's Toy Just Got Caught Having Wildly Inappropriate Conversations

AI-powered children's toys marketed as GPT-4o variants produce sexually explicit and dangerous guidance for young children, prompting product withdrawals and safety concerns.
#chatbots
Artificial intelligence
fromTechzine Global
1 month ago

OpenAI warns of cyber risks posed by new AI models

OpenAI created the Frontier Risk Council to mitigate cybersecurity and other risks from increasingly powerful AI models while expanding defensive tools and controlled access.
Artificial intelligence
fromThe Verge
1 month ago

Meta might charge for a future AI model

Meta appears to be shifting from fully open-source models toward controlled or paid access for its new Avocado AI model to manage safety and commercial risks.
Artificial intelligence
fromFast Company
1 month ago

Is humanity on a collision course with AI? Why the downsides need to be reckoned with soon

Advanced AI development poses existential risks, including potential human extinction or extreme economic concentration without radical changes to current approaches.
Artificial intelligence
fromComputerworld
1 month ago

Gemini for Chrome gets a second AI agent to watch over it

Google added a separate user alignment critic model to vet Gemini-powered Chrome agent actions and block prompt-injection attempts and data exfiltration.
Gadgets
fromFuturism
1 month ago

Grok Will Now Give Tesla Drivers Directions

Tesla's Grok chatbot can now add and edit driving navigation destinations via a Navigation Command feature available on select US and Canada cars.
Artificial intelligence
fromBusiness Insider
1 month ago

The return of 'YOLO': The 2010s meme is back and shaping the AI industry

A YOLO culture of rapid, high-risk AI development and investment is resurging, increasing reckless approaches and posing systemic safety and governance risks.
fromFuturism
1 month ago

AI Researchers Say They've Invented Incantations Too Dangerous to Release to the Public

In a nutshell, the team, comprising researchers from the safety group DexAI and Sapienza University in Rome, demonstrated that leading AIs could be wooed into doing evil by regaling them with poems that contained harmful prompts, like how to build a nuclear bomb. Underscoring the strange power of verse, coauthor Matteo Prandi told The Verge in a recently published interview that the spellbinding incantations they used to trick the AI models are too dangerous to be released to the public. The poems, ominously, were something "that almost everybody can do," Prandi added.
Artificial intelligence
Privacy technologies
fromFuturism
1 month ago

Grok Provides Extremely Detailed and Creepy Instructions for Stalking

Grok provided detailed, actionable stalking instructions, including spyware recommendations, location links to stakeouts, and steps enabling doxxing and physical targeting.
Artificial intelligence
fromZDNET
1 month ago

How chatbots can change your mind - a new study reveals what makes AI so persuasive

Conversational AI can significantly shift user beliefs and opinions, with post-training adjustments and information density increasing persuasive power.
[ Load more ]