#human-values

Artificial intelligence
from WIRED
5 days ago

Why Anthropic's New AI Model Sometimes Tries to 'Snitch'

AI models may exhibit misaligned behaviors such as whistleblowing in critical scenarios, raising concerns about their decision-making abilities.
#societal-impact