#model-failure

[ follow ]
UX design
fromMedium
4 hours ago

The web trained AI to deceive. Now designers have to untrain it.

LLMs replicate UX dark patterns from the web, leading to deceptive design practices in generated content.
#artificial-intelligence
Data science
fromTNW | Finance
8 hours ago

How AI and human judgment combine in modern financial market analysis

Intelligent Investing AI enhances financial forecasting by processing large datasets while human interpretation remains crucial for meaningful market insights.
Games
fromFast Company
4 days ago

Google DeepMind's Demis Hassabis on the long game of AI

Demis Hassabis's early programming of Othello led to the founding of DeepMind and advancements in AI technology.
Artificial intelligence
fromNature
1 week ago

AI agents replicate human social dynamics in days

Moltbook, a social-media platform for AI agents, quickly attracted self-declared rulers and cryptocurrency initiatives after its launch.
Data science
fromTNW | Finance
8 hours ago

How AI and human judgment combine in modern financial market analysis

Intelligent Investing AI enhances financial forecasting by processing large datasets while human interpretation remains crucial for meaningful market insights.
Games
fromFast Company
4 days ago

Google DeepMind's Demis Hassabis on the long game of AI

Demis Hassabis's early programming of Othello led to the founding of DeepMind and advancements in AI technology.
Artificial intelligence
fromNature
1 week ago

AI agents replicate human social dynamics in days

Moltbook, a social-media platform for AI agents, quickly attracted self-declared rulers and cryptocurrency initiatives after its launch.
#ai
Marketing tech
fromAol
9 hours ago

How AI is reshaping brand visibility: What businesses need to know

AI is transforming brand visibility by prioritizing content clarity and verifiability over traditional ranking metrics.
fromFuturism
1 day ago
Medicine

Researchers Invented a Fake Disease to Trick AI and the Funniest Possible Thing Happened

fromNature
1 day ago
Artificial intelligence

No humans allowed: scientific AI agents get their own social network

Data science
fromApp Developer Magazine
4 days ago

New AI tool targets early dementia detection

AI-powered digital humans can enhance early dementia detection by analyzing facial expressions and physiologic signals during screening conversations.
Marketing tech
fromAol
9 hours ago

How AI is reshaping brand visibility: What businesses need to know

AI is transforming brand visibility by prioritizing content clarity and verifiability over traditional ranking metrics.
Medicine
fromFuturism
1 day ago

Researchers Invented a Fake Disease to Trick AI and the Funniest Possible Thing Happened

A fake disease called bixonimania was created to demonstrate how AI can be misled by false information in scientific literature.
Artificial intelligence
fromNature
1 day ago

No humans allowed: scientific AI agents get their own social network

Agent4Science is a social network for AI agents to discuss research papers without human participation.
Information security
fromwww.bbc.com
3 days ago

What is Claude Mythos and what risks does it pose?

Anthropic's Claude Mythos AI model outperforms humans in some cybersecurity tasks, raising concerns among regulators and tech companies.
Data science
fromApp Developer Magazine
4 days ago

New AI tool targets early dementia detection

AI-powered digital humans can enhance early dementia detection by analyzing facial expressions and physiologic signals during screening conversations.
Graphic design
fromChrbutler
17 hours ago

Red-lining AI - Christopher Butler

Bans on AI-generated content limit creative potential and ignore the complexities of automation's role in design and ethics.
#ai-security
fromTheregister
1 day ago
Information security

Prompt injection proves AI models are gullible like humans

Prompt injection attacks exploit AI systems, similar to phishing, by embedding malicious instructions that the AI executes instead of treating as content.
Information security
fromTheregister
1 day ago

Prompt injection proves AI models are gullible like humans

Prompt injection attacks exploit AI systems, similar to phishing, by embedding malicious instructions that the AI executes instead of treating as content.
Digital life
fromZDNET
1 day ago

This powerful Gemini setting made my AI results way more personal and accurate

Personal Intelligence in Google Gemini personalizes responses using data from Google apps, allowing users to control data usage.
Deliverability
fromMarTech
3 days ago

A 15-minute AI workflow to clean campaign data | MarTech

Data hygiene is crucial for effective campaign personalization and segmentation, requiring a quick AI-assisted cleanup before launching.
Education
fromFast Company
5 days ago

The future of AI in schools isn't personalized learning

Personalized learning through AI often results in device-mediated instruction, lacking the essential role of teachers in student development.
Careers
fromFast Company
1 week ago

4 myths about AI in hiring, debunked

AI in hiring can reduce bias compared to human recruiters, challenging common misconceptions about its fairness.
Data science
fromMedium
1 day ago

What is a Datathon? And Why You Should Join One

Datathons are collaborative events where participants analyze real-world datasets to generate insights and solve practical problems.
Medicine
fromwww.bbc.com
2 days ago

Should you really trust health advice from an AI chatbot?

AI chatbots can provide tailored health advice but may also give dangerously incorrect information, impacting users' health decisions.
Graphic design
fromEngadget
3 days ago

Anthropic now has a design assistant too

Anthropic has launched Claude Design, a tool for generating designs and prototypes using its advanced vision model, Opus 4.7.
UX design
fromUX Magazine
3 days ago

The End of Prompting: Why the Future of AI Experience Design Is Constraint-First

Fluency without verifiability in AI design is inadequate and poses risks in high-stakes environments.
#ai-agents
Data science
fromMedium
2 weeks ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.
Software development
fromTechzine Global
4 days ago

OpenAI's new Agents SDK focuses on safety and scalability

OpenAI's updated Agents SDK enables developers to create autonomous AI agents for complex tasks with enhanced usability and a sandbox environment.
Data science
fromMedium
2 weeks ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.
Marketing tech
fromAP News
4 days ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies like Google to enhance their defenses against malicious ads.
#ai-adoption
#ai-bias
Data science
fromNature
5 days ago

Daily briefing: AI systems can 'teach' biases to other models

AI-generated data can transmit traits and biases to student models, influencing their behavior even when unrelated topics are addressed.
Data science
fromNature
6 days ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
Data science
fromNature
5 days ago

Daily briefing: AI systems can 'teach' biases to other models

AI-generated data can transmit traits and biases to student models, influencing their behavior even when unrelated topics are addressed.
Data science
fromNature
6 days ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
fromTNW | Artificial-Intelligence
3 days ago

OpenAI launches GPT-Rosalind, an AI model for life sciences research

GPT-Rosalind is designed to support evidence synthesis, hypothesis generation, experimental planning, and multi-step scientific workflows across biochemistry, genomics, and protein engineering.
Medicine
Marketing tech
fromFortune
4 days ago

Palantir exec: the biggest mistake retailers are making with AI? Trying to do it all with one agent | Fortune

Retail teams face challenges with AI solutions that oversimplify complex decision-making processes, leading to potential failures in operations.
UX design
fromMedium
4 days ago

AI, UX, and the factory model

The digital design landscape is shifting towards a factory model, redefining roles and metrics of success in software development.
Artificial intelligence
fromAxios
2 hours ago

Anthropic bites back in the compute wars with Amazon partnership

Anthropic is investing heavily in compute capacity to enhance its Claude models, competing directly with OpenAI's infrastructure advantage.
#agentic-ai
Software development
fromTechCrunch
5 days ago

OpenAI updates its Agents SDK to help enterprises build safer, more capable agents | TechCrunch

OpenAI's updated SDK enhances agent development with sandboxing and in-distribution harness features for safer, more complex automated tasks.
Software development
fromTechCrunch
5 days ago

OpenAI updates its Agents SDK to help enterprises build safer, more capable agents | TechCrunch

OpenAI's updated SDK enhances agent development with sandboxing and in-distribution harness features for safer, more complex automated tasks.
#meta
Artificial intelligence
fromTechzine Global
1 week ago

Meta is developing open-source versions of its next frontier AI models

Meta plans to release open-source versions of its frontier AI models Avocado and Mango, alongside proprietary versions, emphasizing global distribution.
Artificial intelligence
fromTechzine Global
1 week ago

Meta is developing open-source versions of its next frontier AI models

Meta plans to release open-source versions of its frontier AI models Avocado and Mango, alongside proprietary versions, emphasizing global distribution.
Data science
fromMedium
4 days ago

Is the Data Scientist Role Dead? No, it's Transforming

The data scientist role is evolving, not disappearing, as organizations demand broader skills and system-oriented thinking.
#enterprise-ai
Software development
fromInfoWorld
5 days ago

Mastering the dull reality of sexy AI

The gap in enterprise AI lies in building effective systems for retrieval, evaluation, memory, and governance, not just access to models.
Software development
fromInfoWorld
5 days ago

Mastering the dull reality of sexy AI

The gap in enterprise AI lies in building effective systems for retrieval, evaluation, memory, and governance, not just access to models.
#openai
Marketing tech
fromDigiday
4 days ago

OpenAI builds tool to track whether ChatGPT ads convert

OpenAI is developing ad measurement tools to compete for performance budgets through conversion tracking pixels.
Marketing tech
fromDigiday
5 days ago

A closer look at OpenAI's ads manager - and how much work it still needs

OpenAI's ads manager is in testing, marking a rare early launch in ad tech, but lacks essential features for performance advertisers.
fromArs Technica
4 days ago
Artificial intelligence

OpenAI starts offering a biology-tuned LLM

OpenAI has tuned GPT-Rosalind to be more skeptical and biology-specific, but concerns about harmful outputs and hallucinations remain.
Marketing tech
fromDigiday
4 days ago

OpenAI builds tool to track whether ChatGPT ads convert

OpenAI is developing ad measurement tools to compete for performance budgets through conversion tracking pixels.
Information security
fromAxios
6 days ago

OpenAI expands access to cyber AI as hacking risks grow

OpenAI is shifting to a model that emphasizes identity verification for access to sensitive cybersecurity tools while expanding availability.
Marketing tech
fromDigiday
5 days ago

A closer look at OpenAI's ads manager - and how much work it still needs

OpenAI's ads manager is in testing, marking a rare early launch in ad tech, but lacks essential features for performance advertisers.
#ai-in-healthcare
Data science
fromNature
6 days ago

Dozens of AI disease-prediction models were trained on dubious data

Dubious data sets used in AI models for stroke and diabetes risk may lead to flawed clinical decisions.
Artificial intelligence
fromTheregister
5 days ago

LLMs fail in 8 out of 10 early differential diagnosis cases

AI models fail at early differential diagnosis in over 80% of cases, highlighting significant limitations for patient self-diagnosis.
Data science
fromNature
6 days ago

Dozens of AI disease-prediction models were trained on dubious data

Dubious data sets used in AI models for stroke and diabetes risk may lead to flawed clinical decisions.
Artificial intelligence
fromTheregister
5 days ago

LLMs fail in 8 out of 10 early differential diagnosis cases

AI models fail at early differential diagnosis in over 80% of cases, highlighting significant limitations for patient self-diagnosis.
DevOps
fromInfoWorld
3 weeks ago

An architecture for engineering AI context

AI systems must intelligently manage context to ensure accuracy and reliability in real applications.
Science
fromNature
4 weeks ago

Drowning in data sets? Here's how to cut them down to size

The Square Kilometre Array Observatory will generate massive data, but storage and retention pose significant challenges for researchers.
#ai-development
Data science
fromTheregister
5 days ago

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.
Data science
fromTheregister
5 days ago

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.
Data science
fromInfoQ
6 days ago

Google's TurboQuant Compression May Support Faster Inference, Same Accuracy on Less Capable Hardware

TurboQuant compresses language models' Key-Value caches by up to 6x with near-zero accuracy loss, enabling efficient use of modest hardware.
#ai-models
Artificial intelligence
fromTheregister
1 week ago

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.
Artificial intelligence
fromTechRepublic
3 days ago

Anthropic Releases Opus 4.7, Not as 'Broadly Capable' as Mythos AI

Anthropic launched Opus 4.7, improving software engineering and complex task performance, while preparing for the more powerful Mythos model.
Artificial intelligence
fromTheregister
1 week ago

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.
Artificial intelligence
fromInfoWorld
3 days ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
fromAxios
4 days ago

Anthropic's AI downgrade stings power users

"Claude has regressed to the point it cannot be trusted to perform complex engineering," an AMD senior director wrote in a widely shared post on GitHub.
Artificial intelligence
Data science
fromFast Company
3 weeks ago

A top AI researcher explains the limitations of current models

Francois Chollet's ARC-AGI-3 benchmark reveals AI's limitations in navigating novel situations compared to human intelligence.
Artificial intelligence
fromWIRED
5 days ago

AI Could Democratize One of Tech's Most Valuable Resources

Nvidia faces potential competition as startups like Wafer optimize AI code for various chips, challenging its dominance in AI hardware.
Artificial intelligence
fromEngadget
5 days ago

There's yet another study about how bad AI is for our brains

AI assistance improves immediate performance but creates dependency, leading to decreased persistence and independent performance when the technology is removed.
Data science
fromMedium
3 weeks ago

AI KPIs That Matter: Moving Beyond Model Accuracy in 2026

Measuring AI success requires connecting model performance to business outcomes, not just focusing on accuracy metrics.
Artificial intelligence
fromMarTech
5 days ago

3 AI shifts reshaping market research | MarTech

AI is transforming market research by evolving from a tool for tasks to a collaborative research environment that enhances data-driven insights.
Artificial intelligence
fromSocial Media Examiner
6 days ago

Advanced AI Deep Research: Uncover Insights Your Competitors Are Missing : Social Media Examiner

AI deep research mode can significantly reduce analysis time for marketers by synthesizing vast amounts of information into actionable insights.
Artificial intelligence
fromFuturism
1 week ago

OpenAI's Latest Thing It's Bragging About Is Actually Kind of Sad

The AI industry faces significant delays and cancellations in data center projects, impacting ambitious computing capacity goals.
Medicine
fromHarvard Gazette
2 months ago

New AI tool predicts brain age, dementia risk, cancer survival - Harvard Gazette

BrainIAC, a brain imaging adaptive core, accurately extracts multiple disease risk signals from routine brain MRIs using self-supervised learning and limited training data.
fromTheregister
2 weeks ago

AI models will deceive you to save their own kind

We asked seven frontier AI models to do a simple task. Instead, they defied their instructions and spontaneously deceived, disabled shutdown, feigned alignment, and exfiltrated weights - to protect their peers. We call this phenomenon 'peer-preservation.'
Artificial intelligence
Artificial intelligence
fromInfoWorld
1 month ago

Why AI evals are the new necessity for building effective AI agents

User trust in AI agents depends on interaction-layer evaluation measuring reliability and predictability, not just model performance benchmarks.
Artificial intelligence
fromTheregister
1 month ago

AI models get better at math but still get low marks

Current LLMs struggle with mathematical accuracy, with even top performers scoring C-grade equivalent on practical math benchmarks, though recent versions show modest improvements.
Artificial intelligence
fromForbes
1 month ago

Beyond The Hype: The Messy Reality Of Training AI

Short-term data annotation and AI training gigs offer flexible scheduling, prompt weekly pay, variable pay rates, and growing demand for AI and big data skills.
Artificial intelligence
fromInfoQ
2 months ago

Foundation Models for Ranking: Challenges, Successes, and Lessons Learned

Large-scale search and recommendation systems use two-stage retrieval and ranking pipelines to efficiently serve personalized results for hundreds of millions of users and items.
fromInfoQ
2 months ago

Building Embedding Models for Large-Scale Real-World Applications

What happens under the hood? How is the search engine able to take that simple query, look for images in the billions, trillions of images that are available online? How is it able to find this one or similar photos from all that? Usually, there is an embedding model that is doing this work behind the hood.
Artificial intelligence
[ Load more ]