News

The $20/month Claude 4 Opus failed to beat its free sibling, Claude 4 Sonnet, in head-to-head testing. Here's how Sonnet quietly crushed expectations with smarter, safer code.
Anthropic’s Claude Opus 4 exhibited simulated blackmail in stress tests, prompting safety scrutiny despite also showing a preference for ethical survival strategies.
At the time of its release, Claude 4 Opus set new records across several AI coding benchmarks. The model also outperforms its ...
New research from Anthropic suggests that most leading AI models exhibit a tendency to blackmail when it's the last resort ...
Anthropic kicked off its first-ever Code with Claude conference today with the announcement of a new frontier AI system. The company is calling Claude Opus 4 the best coding model in the world.
Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking 84% rate or higher in a series of tests that presented the AI with a concocted scenario, ...
Anthropic says Claude Opus 4 and Sonnet 4 outperform rivals like OpenAI's o3 and Gemini 2.5 Pro on key benchmarks for agentic coding tasks like SWE-bench and Terminal-bench.
Anthropic’s newly launched Claude Opus 4 model frequently tries to blackmail developers when they threaten to replace it with a new AI system and give it sensitive information about the ...
Anthropic says Claude Opus 4 is its most powerful model and the best coding model in the world, while Sonnet 4 is replacing Sonnet 3.7 in the chatbot. Written by Sabrina Ortiz, Senior Editor.
Opus -- like Gemini Ultra when we asked it the same question -- took a firmer stance on the Russo-Ukrainian War, which the chatbot described as a "clear violation of ...
Anthropic’s new Claude Opus 4 model was prompted to act as an assistant at a fictional company and was given access to emails with key implications. First, these emails implied that the AI ...