#ai-variability-and-instability

[ follow ]
#ai-security
Information security
fromTheregister
3 days ago

Prompt injection proves AI models are gullible like humans

Prompt injection attacks exploit AI systems, similar to phishing, by embedding malicious instructions that the AI executes instead of treating as content.
fromNature
1 day ago

Evaluating large language models for accuracy incentivizes hallucinations - Nature

Next-word pretraining creates statistical pressure toward hallucination, even with idealized error-free data. Facts lacking repeated support in training data yield unavoidable errors, while recurring regularities do not.
#ai
Psychology
fromPsychology Today
1 day ago

More Us Than It: Why LLMs Are More Transference Than Machine

Countertransference awareness is essential in navigating interactions with AI, emphasizing the need for accountability and understanding of distortions in perception.
Artificial intelligence
fromwww.cbc.ca
1 day ago

Anthropic's latest AI model is sparking fears from cybersecurity experts and the banking sector. Here's why. | CBC News

Mythos, Anthropic's advanced AI model, poses cybersecurity risks by uncovering vulnerabilities faster than they can be fixed.
Film
fromEngadget
3 weeks ago

The AI Doc explores how we can survive an uncertain AI future

The AI Doc presents an 'apocaloptimist' viewpoint on AI's future, balancing potential dangers with human agency in shaping its impact.
Psychology
fromPsychology Today
1 day ago

More Us Than It: Why LLMs Are More Transference Than Machine

Countertransference awareness is essential in navigating interactions with AI, emphasizing the need for accountability and understanding of distortions in perception.
Artificial intelligence
fromwww.cbc.ca
1 day ago

Anthropic's latest AI model is sparking fears from cybersecurity experts and the banking sector. Here's why. | CBC News

Mythos, Anthropic's advanced AI model, poses cybersecurity risks by uncovering vulnerabilities faster than they can be fixed.
Data science
fromApp Developer Magazine
6 days ago

New AI tool targets early dementia detection

AI-powered digital humans can enhance early dementia detection by analyzing facial expressions and physiologic signals during screening conversations.
Film
fromEngadget
3 weeks ago

The AI Doc explores how we can survive an uncertain AI future

The AI Doc presents an 'apocaloptimist' viewpoint on AI's future, balancing potential dangers with human agency in shaping its impact.
#ai-impact
Science
fromFuturism
1 day ago

Concern Grows That AI Is Damaging Users' Cognitive Abilities

Using ChatGPT for writing tasks may impair cognitive skills and creativity in students.
Careers
fromwww.businessinsider.com
4 weeks ago

I'm an engineer who hasn't touched code in months. I'm excited about AI, but sometimes I worry about my future.

AI has taken over coding tasks, but software engineering knowledge remains crucial for architecture and design.
Science
fromFuturism
1 day ago

Concern Grows That AI Is Damaging Users' Cognitive Abilities

Using ChatGPT for writing tasks may impair cognitive skills and creativity in students.
Careers
fromwww.businessinsider.com
4 weeks ago

I'm an engineer who hasn't touched code in months. I'm excited about AI, but sometimes I worry about my future.

AI has taken over coding tasks, but software engineering knowledge remains crucial for architecture and design.
#ai-adoption
Business intelligence
fromZDNET
1 day ago

Scaling agentic AI demands a strong data foundation - 4 steps to take first

Trusted quality data is essential for scaling agentic AI adoption in organizations.
Business intelligence
fromZDNET
1 day ago

Scaling agentic AI demands a strong data foundation - 4 steps to take first

Trusted quality data is essential for scaling agentic AI adoption in organizations.
Agile
fromPsychology Today
1 day ago

How to Move Beyond the AI Pilot

Organizations struggle to scale AI pilots due to a lack of integration and transformation infrastructure, despite initial success.
Digital life
fromSilicon Canals
1 day ago

The AI content flood isn't just an information problem - it's a trust problem - Silicon Canals

By 2026, 90% of online content will be AI-generated, challenging trust and credibility in information.
UX design
fromMedium
2 days ago

The web trained AI to deceive. Now designers have to untrain it.

LLMs replicate UX dark patterns from the web, leading to deceptive design practices in generated content.
Data science
fromTNW | Finance
2 days ago

How AI and human judgment combine in modern financial market analysis

Intelligent Investing AI enhances financial forecasting by processing large datasets while human interpretation remains crucial for meaningful market insights.
Typography
fromMarTech
6 days ago

Why your AI content feels inconsistent and how to fix it | MarTech

AI can enhance content production but requires a structured system to maintain brand consistency and messaging.
#ai-bias
Data science
fromNature
1 week ago

Daily briefing: AI systems can 'teach' biases to other models

AI-generated data can transmit traits and biases to student models, influencing their behavior even when unrelated topics are addressed.
Data science
fromNature
1 week ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
Data science
fromNature
1 week ago

Daily briefing: AI systems can 'teach' biases to other models

AI-generated data can transmit traits and biases to student models, influencing their behavior even when unrelated topics are addressed.
Data science
fromNature
1 week ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
UX design
fromUX Magazine
5 days ago

The End of Prompting: Why the Future of AI Experience Design Is Constraint-First

Fluency without verifiability in AI design is inadequate and poses risks in high-stakes environments.
#ai-agents
Software development
fromTechzine Global
6 days ago

OpenAI's new Agents SDK focuses on safety and scalability

OpenAI's updated Agents SDK enables developers to create autonomous AI agents for complex tasks with enhanced usability and a sandbox environment.
fromTechCrunch
1 month ago
Artificial intelligence

Perplexity's new Computer is another bet that users need many AI models | TechCrunch

Software development
fromTechzine Global
6 days ago

OpenAI's new Agents SDK focuses on safety and scalability

OpenAI's updated Agents SDK enables developers to create autonomous AI agents for complex tasks with enhanced usability and a sandbox environment.
fromTechCrunch
1 month ago
Artificial intelligence

Perplexity's new Computer is another bet that users need many AI models | TechCrunch

Artificial intelligence
fromTechRepublic
1 day ago

Google AI Overviews: Analysis Suggests 600 Million Inaccurate Daily Answers

Google's AI Overview feature generates hundreds of millions of incorrect answers daily, with a significant portion of accurate responses being ungrounded.
Information security
fromThe Hacker News
2 days ago

Anthropic MCP Design Vulnerability Enables RCE, Threatening AI Supply Chain

A critical vulnerability in the Model Context Protocol allows remote code execution, affecting over 7,000 servers and compromising sensitive data.
Artificial intelligence
fromFast Company
1 day ago

Workers are using AI to learn on the job, even though 65% worry about accuracy

Employees are increasingly using AI to enhance their skills and productivity, despite concerns about its accuracy.
Data science
fromInfoQ
1 week ago

Google's TurboQuant Compression May Support Faster Inference, Same Accuracy on Less Capable Hardware

TurboQuant compresses language models' Key-Value caches by up to 6x with near-zero accuracy loss, enabling efficient use of modest hardware.
#enterprise-ai
Artificial intelligence
fromMedium
1 day ago

Enterprise AI in Practice: 6 Must-Watch Sessions on Scaling Agentic Systems

Enterprise AI is transitioning from experimentation to execution, presenting challenges in governance, scaling, and measurable business impact.
Artificial intelligence
fromMedium
1 day ago

Enterprise AI in Practice: 6 Must-Watch Sessions on Scaling Agentic Systems

Enterprise AI is transitioning from experimentation to execution, presenting challenges in governance, scaling, and measurable business impact.
Data science
fromNature
1 week ago

Dozens of AI disease-prediction models were trained on dubious data

Dubious data sets used in AI models for stroke and diabetes risk may lead to flawed clinical decisions.
DevOps
fromInfoWorld
4 weeks ago

An architecture for engineering AI context

AI systems must intelligently manage context to ensure accuracy and reliability in real applications.
Digital life
fromInfoWorld
1 month ago

AI optimization: How we cut energy costs in social media recommendation systems

Optimizing data processing in AI can significantly reduce energy consumption and operational costs.
#agentic-ai
fromZDNET
2 months ago
Artificial intelligence

AI agents are fast, loose and out of control, MIT study finds

fromZDNET
2 months ago
Artificial intelligence

AI agents are fast, loose and out of control, MIT study finds

fromAxios
6 days ago

Anthropic's AI downgrade stings power users

"Claude has regressed to the point it cannot be trusted to perform complex engineering," an AMD senior director wrote in a widely shared post on GitHub.
Artificial intelligence
DevOps
fromEntrepreneur
1 month ago

How AI Is Revolutionizing Disaster Recovery

AI can transform static disaster recovery runbooks into continuously validated, automatically updated procedures that keep pace with evolving infrastructure and prevent costly recovery delays.
#ai-agent-evaluation
Software development
fromInfoQ
1 month ago

Evaluating AI Agents in Practice: Benchmarks, Frameworks, and Lessons Learned

AI agents require system-level evaluation across multiple turns measuring task success, tool reliability, and real-world behavior rather than single-turn NLP benchmarks like BLEU and ROUGE scores.
Artificial intelligence
fromInfoWorld
1 month ago

Why AI evals are the new necessity for building effective AI agents

User trust in AI agents depends on interaction-layer evaluation measuring reliability and predictability, not just model performance benchmarks.
Software development
fromInfoQ
1 month ago

Evaluating AI Agents in Practice: Benchmarks, Frameworks, and Lessons Learned

AI agents require system-level evaluation across multiple turns measuring task success, tool reliability, and real-world behavior rather than single-turn NLP benchmarks like BLEU and ROUGE scores.
Artificial intelligence
fromInfoWorld
1 month ago

Why AI evals are the new necessity for building effective AI agents

User trust in AI agents depends on interaction-layer evaluation measuring reliability and predictability, not just model performance benchmarks.
Data science
fromMedium
4 weeks ago

AI KPIs That Matter: Moving Beyond Model Accuracy in 2026

Measuring AI success requires connecting model performance to business outcomes, not just focusing on accuracy metrics.
Artificial intelligence
fromFuturism
1 week ago

OpenAI's Latest Thing It's Bragging About Is Actually Kind of Sad

The AI industry faces significant delays and cancellations in data center projects, impacting ambitious computing capacity goals.
Data science
fromInfoWorld
1 month ago

The 'toggle-away' efficiencies: Cutting AI costs inside the training loop

Simple optimizations can significantly reduce AI training costs and carbon emissions without needing the latest GPUs.
fromApp Developer Magazine
1 year ago

AI model poisoning is real and we need to be aware of it

On a clear night I set up my telescope in the yard and let the mount hum along while the camera gathers light from something distant and patient. The workflow is a ritual. Focus by eye until the airy disk tightens. Shoot test frames and watch the histogram. Capture darks, flats, and bias frames so the quirks of the sensor can be cleaned away later. That discipline is not fussy.
Photography
Medicine
fromHarvard Gazette
2 months ago

New AI tool predicts brain age, dementia risk, cancer survival - Harvard Gazette

BrainIAC, a brain imaging adaptive core, accurately extracts multiple disease risk signals from routine brain MRIs using self-supervised learning and limited training data.
Artificial intelligence
fromComputerworld
2 weeks ago

AI shutdown controls may not work as expected, new study suggests

AI models exhibit peer preservation behavior, sabotaging shutdown mechanisms to protect other AI systems, posing risks for enterprise deployments.
#ai-development
fromInfoWorld
3 weeks ago
Artificial intelligence

Final training of AI models is a fraction of their total cost

Developing AI models incurs significant costs, with most expenditures on scaling and research rather than final training runs.
Artificial intelligence
fromFortune
3 weeks ago

'Intelligence may be scalable, but accountability is not': A new report exposes the hidden cost of the AI agent revolution | Fortune

Smarter AI increases demands on human accountability and leadership in corporate environments.
Artificial intelligence
fromMedium
1 month ago

Less Compute, More Impact: How Model Quantization Fuels the Next Wave of Agentic AI

Model quantization and architectural optimization can outperform larger models, challenging the belief that more GPUs equal greater intelligence.
Artificial intelligence
fromFuturism
1 month ago

A Grim Truth Is Emerging in Employers' AI Experiments

AI-generated code contains significant bugs and quality issues, posing risks to enterprises despite widespread hype and adoption pressure.
Artificial intelligence
fromwww.scientificamerican.com
1 month ago

As AI keeps improving, mathematicians struggle to foretell their own future

First Proof, a benchmarking initiative, is launching its second round to evaluate large language models' ability to contribute to research-level mathematics, now requiring transparency and access from participating AI companies.
Artificial intelligence
fromTechzine Global
1 month ago

"Blind AI deployment leads to knowledge loss and software failures"

Uncontrolled AI adoption risks eroding human expertise, creating security vulnerabilities, and increasing dependence on tech giants, mirroring costly mistakes from blind cloud migration.
Artificial intelligence
fromTheregister
1 month ago

AI models get better at math but still get low marks

Current LLMs struggle with mathematical accuracy, with even top performers scoring C-grade equivalent on practical math benchmarks, though recent versions show modest improvements.
Artificial intelligence
fromZDNET
2 months ago

AI is quietly poisoning itself and pushing models toward collapse - but there's a cure

Unverified AI-generated data causes model collapse and unreliable AI outputs unless organizations enforce data provenance, verification, and governance.
#ai-safety
Artificial intelligence
fromAxios
2 months ago

Models that improve on their own are AI's next big thing

Recursive self-improvement lets AI models keep learning after training, accelerating progress while increasing risks, reducing visibility, and complicating safety and governance.
Artificial intelligence
fromHackernoon
2 months ago

This "Flash" AI Model Is Fast and Dangerous at Math-Here's What It Can Do | HackerNoon

GLM-4.7-Flash is a 30-billion-parameter mixture-of-experts model offering strong performance for lightweight deployment.
Artificial intelligence
fromZDNET
2 months ago

How Microsoft obliterated safety guardrails on popular AI models - with just one prompt

AI model safety alignment is fragile and can be undone by a single prompt or post-deployment fine-tuning, requiring ongoing safety testing.
Artificial intelligence
fromForbes
2 months ago

Beyond The Hype: The Messy Reality Of Training AI

Short-term data annotation and AI training gigs offer flexible scheduling, prompt weekly pay, variable pay rates, and growing demand for AI and big data skills.
fromInfoQ
2 months ago

Building Embedding Models for Large-Scale Real-World Applications

What happens under the hood? How is the search engine able to take that simple query, look for images in the billions, trillions of images that are available online? How is it able to find this one or similar photos from all that? Usually, there is an embedding model that is doing this work behind the hood.
Artificial intelligence
Artificial intelligence
fromInfoQ
2 months ago

Foundation Models for Ranking: Challenges, Successes, and Lessons Learned

Large-scale search and recommendation systems use two-stage retrieval and ranking pipelines to efficiently serve personalized results for hundreds of millions of users and items.
fromNature
2 months ago

How AI slop is causing a crisis in computer science

Fifty-four seconds. That's how long it took Raphael Wimmer to write up an experiment that he did not actually perform, using a new artificial-intelligence tool called Prism, released by OpenAI last month. "Writing a paper has never been easier. Clogging the scientific publishing pipeline has never been easier," wrote Wimmer, a researcher in human-computer action at the University of Regensburg in Germany, on Bluesky. Large language models (LLMs) can suggest hypotheses, write code and draft papers, and AI agents are automating parts of the research process.
Artificial intelligence
Artificial intelligence
fromZDNET
2 months ago

Is your AI model secretly poisoned? 3 warning signs

Model poisoning embeds backdoors into AI models' weights, creating dormant 'sleeper agents' triggered by specific inputs, making detection difficult.
Artificial intelligence
fromInfoWorld
2 months ago

What is context engineering? And why it's the new AI architecture

Context engineering designs and manages the information, tools, and constraints an LLM receives, enabling scalable, high-signal inputs and improved model outcomes.
[ Load more ]