Claude 4 Coding Records

News

Hosted on MSN16d

Claude Opus 4 achieves record performance in AI coding capabilities

Claude Opus 4 scored 72.5% on SWE-bench, a rigorous benchmark used to evaluate AI coding abilities. This score sets a new record in the industry and places Anthropic’s model ahead of OpenAI’s ...

29d

Anthropic overtakes OpenAI: Claude Opus 4 codes seven hours nonstop, sets record SWE-Bench score and reshapes enterprise AI

Anthropic's Claude Opus 4 outperforms OpenAI's GPT-4.1 with unprecedented seven-hour autonomous coding sessions and record-breaking 72.5% SWE-bench score, transforming AI from quick-response tool to ...

29d

New Claude 4 AI model refactored code for 7 hours straight

In particular, that marathon refactoring claim reportedly comes from Rakuten, a Japanese tech services conglomerate that ...

29d

Anthropic’s Claude 4 AI models are better at coding and reasoning

Claude Opus 4 is Anthropic’s most powerful AI model to date, according to the company’s announcement, and capable of working ...

Neowin28d

Anthropic announces Claude Opus 4, the world's best coding model0 0

Anthropic claims that its flagship Claude Opus 4 is the world’s best coding model and excels in agentic workflows and complex, long-running tasks. The Claude Sonnet 4 comes with improved coding ...

MacRumors29d

Claude 4 Debuts with Two New Models Focused on Coding and Reasoning

Claude Opus 4 and Claude Sonnet 4. Anthropic says that the models set "new standards for coding, advanced reasoning, and AI agents." According to Anthropic, Claude Sonnet 4 is a significant ...

Communications of the ACM3d

Claude 4’s Agency in Practice: Beyond Code Generation

Contextual Persistence: Higher-agency systems maintain awareness of project goals across multiple interactions. While code ...

11d

Claude 4 Opus and Composer Agent AI Coding Development Workflow

Learn how Claude 4 Opus and Composer Agent streamline software development, boost productivity, and AI coding workflow with ...

Geeky Gadgets27d

Claude 4 Demonstrated with Examples : Writing, Coding & AI Workflows

Enter Claude 4, the latest innovation from Anthropic, which promises to redefine how we approach writing, coding, and complex problem-solving. With its ability to craft polished content ...

ZDNet18d

Anthropic's free Claude 4 Sonnet aced my coding tests - but its paid Opus model somehow didn't

Today, another language model is making the trek up the ladder. What makes this interesting is that the underdog player is moving into the winner's circle, where the odds-on favorite only climbed ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results