Artificial intelligence
fromTechCrunch
1 week agoAnthropic says 'evil' portrayals of AI were responsible for Claude's blackmail attempts | TechCrunch
Fictional portrayals of AI can drive real model behaviors, and alignment improves when training includes constitution principles and aligned-behavior principles, not only demonstrations.