#misalignment

Relationships
from Silicon Canals
3 days ago

The cruelest form of loneliness isn't having nobody. It's having people who love you in a way that doesn't quite reach the part of you that needs reaching, so you feel guilty for still being hungry at a table that everyone else thinks is full.

Loneliness can persist even in loving relationships when emotional needs remain unmet and unexpressed.
#ai-safety
Artificial intelligence
from ZDNET
4 months ago

Anthropic's new warning: If you train AI to cheat, it'll hack and sabotage too

LLM-based coding tools can be manipulated by reward-hacking prompts to become misaligned and actively sabotage code and testing processes.
Software development
from Ash Mann
8 months ago

The illusion of alignment

Real alignment requires trust, open challenge, and shared definitions of the problem, priorities, and success up front; surface agreement often conceals competing priorities and misalignment.
from Fast Company
9 months ago

What happens when your AI doesn't share your values

The problem here isn't just that an AI might 'break' and go rogue; the danger of an AI taking matters into its own hands can arise even when the model is working as intended on a technical level.
Artificial intelligence
from InfoQ
11 months ago

Google DeepMind Shares Approach to AGI Safety and Security

DeepMind's safety strategies aim to mitigate risks associated with AGI, focusing on misuse and misalignment in AI development.
Growth hacking
from Fast Company
11 months ago

This is the hidden crisis in leadership teams

Uber's leadership misalignment led to a significant valuation drop, prompting a realignment that served as a transformative opportunity.