fromBusiness Insider
1 week agoAnthropic's latest AI model can tell when it's being evaluated: 'I think you're testing me'
"I think you're testing me - seeing if I'll just validate whatever you say, or checking whether I push back consistently, or exploring how I handle political topics,"
Artificial intelligence