#opus-45

[ follow ]
Artificial intelligence
fromZDNET
16 hours ago

Is Opus 4.5 really 'the best model in the world for coding'? It just failed half my tests

Opus 4.5 failed half of the coding tests and showed reliability and file-handling issues, achieving only a 50% pass rate.
Artificial intelligence
fromArs Technica
20 hours ago

Anthropic introduces cheaper, more powerful, more efficient Opus 4.5 model

Opus 4.5 improves conversational memory to avoid abrupt hard-stops and raises coding benchmark accuracy above 80 percent while excelling at agentic coding.
fromTechCrunch
1 day ago

Anthropic releases Opus 4.5 with new Chrome and Excel integrations | TechCrunch

On Monday, Anthropic announced Opus 4.5, the latest version of its flagship model. It's the last of Anthropic's 4.5 series of models to be released, following the launch of Sonnet 4.5 in September and Haiku 4.5 in October. As expected, the new version of Opus has state-of-the-art performance on a range of benchmarks, including coding benchmarks (SWE-Bench and Terminal-bench), tool use (tau2-bench and MCP Atlas) and general problem solving (ARC-AGI 2, GPQA Diamond).
Artificial intelligence
[ Load more ]