Claude Opus 4.5 scores 80.9% on SWE-bench, cuts costs 66% with $5/$25 pricing after $350B Anthropic valuation

Claude Opus 4.5: The New Frontier in AI Capabilities

Days after Microsoft and Nvidia investments valued Anthropic at $350 billion, the company released its most advanced model, setting new benchmarks for coding, computer use, and enterprise automation across the industry.

Anthropic released Claude Opus 4.5 on November 24-25, 2025, positioning it as the most capable model for agentic coding and computer use. The release follows rapid succession launches across the Claude 4.5 family, with Sonnet 4.5 arriving in late September and Haiku 4.5 in mid-October 2025.

The model achieves 80.9% accuracy on SWE-bench Verified, a benchmark measuring real-world software engineering tasks. This performance exceeds OpenAI’s GPT-5.1-Codex-Max at 77.9%, Anthropic’s own Sonnet 4.5 at 77.2%, and Google’s Gemini 3 Pro at 76.2%. Opus 4.5 became the first model to surpass the 80% threshold on this benchmark, establishing new performance standards for autonomous coding capabilities.

Pricing dropped significantly from the previous Opus generation. The new model costs $5 per million input tokens and $25 per million output tokens, down from $15 and $75 respectively. This 66% price reduction makes frontier-level capabilities accessible to broader developer audiences while maintaining competitive positioning against Google’s Gemini models and OpenAI’s GPT family.

Performance Metrics Overview

80.9%

SWE-Bench Verified Accuracy

76%

Fewer Output Tokens vs Sonnet 4.5

$5/$25

Per Million Token Pricing

66.3%

OSWorld Computer Use Score

The strategic partnerships announced November 18, 2025, reshaped Anthropic’s infrastructure capabilities. Microsoft committed to invest up to $5 billion while Nvidia pledged up to $10 billion, pushing the company’s valuation to approximately $350 billion from $183 billion in September. Anthropic committed to purchasing $30 billion of Azure compute capacity and up to one gigawatt of additional capacity for model training.

Anthropic tested the model on its internal performance engineering exam given to prospective hires. Using parallel test-time compute within the two-hour limit, Opus 4.5 scored higher than any human candidate who has taken the assessment. The company noted this result doesn’t measure collaboration, communication, or professional instincts that develop through experience, but it demonstrates technical problem-solving capabilities under time constraints.

The model introduces an effort parameter through the Claude API, allowing developers to balance computational work against latency and cost. At medium effort, Opus 4.5 matches Sonnet 4.5’s performance on SWE-bench Verified while using 76% fewer output tokens. At maximum effort, it exceeds Sonnet 4.5 by 4.3 percentage points while still consuming 48% fewer tokens, according to Anthropic’s technical documentation.

Claude 4.5 Family Development Timeline

September 29, 2025

Claude Sonnet 4.5 Launch

Anthropic released its mid-tier model optimized for balanced performance and cost. The model achieved 77.2% on SWE-bench Verified and 61.4% on OSWorld, establishing new benchmarks for coding and computer use tasks.

October 15, 2025

Claude Haiku 4.5 Release

The fastest model in the Claude family launched with 73.3% accuracy on SWE-bench Verified. Priced at $1/$5 per million tokens, it targets high-volume processing and sub-agent operations with near-frontier performance.

November 18, 2025

Strategic Partnerships Announced

Microsoft and Nvidia investments totaling $15 billion pushed Anthropic’s valuation to $350 billion. The partnerships included compute capacity commitments and technical collaboration on architecture optimization.

November 24-25, 2025

Claude Opus 4.5 Deployment

Flagship model released with state-of-the-art coding performance. Available across Claude apps, API, and major cloud platforms including AWS Bedrock, Google Vertex AI, and Microsoft Azure.

Model Family Architecture Comparison

The Claude 4.5 family spans three distinct performance tiers, each optimized for specific deployment scenarios. Developers access models through API endpoints using the format claude-[model]-4-5-[date].

🚀 Opus 4.5

Primary Use Production Code

Pricing $5/$25

SWE-Bench 80.9%

Context Window 200K tokens

⚡ Sonnet 4.5

Primary Use Daily Workflows

Pricing $3/$15

SWE-Bench 77.2%

Context Window 200K tokens

💨 Haiku 4.5

Primary Use High Volume

Pricing $1/$5

SWE-Bench 73.3%

Context Window 200K tokens

Technical Capabilities and Platform Integration

💻

Autonomous Coding Performance

Opus 4.5 maintains focus through extended development sessions spanning hours or days. On Terminal-bench 2.0, the model scored 59.3%, exceeding Gemini 3 Pro’s 54.2% and Sonnet 4.5’s 50.0% on multi-step terminal workflows.

🔒

Prompt Injection Resistance

Testing with 100 strong prompt injection attacks showed 63% success rate against Opus 4.5, compared to 87.8% against GPT-5.1 and 92% against Gemini 3 Pro. Single attack attempts succeeded 4.7% of the time versus 12.6% for GPT-5.1.

⚙️

Effort Parameter Control

The new API parameter allows developers to adjust computational intensity per task. Medium effort settings match Sonnet 4.5 performance with 76% token reduction, while maximum effort exceeds it by 4.3 percentage points.

🌐

Browser and Spreadsheet Automation

Claude for Chrome extension expanded to all Max subscribers. Excel automation capabilities, now available to Max, Team, and Enterprise tiers, showed 20% accuracy improvements in financial modeling evaluations.

🔄

Multi-Agent Orchestration

The model coordinates teams of sub-agents for complex workflows. Opus 4.5 achieved 62.3% on MCP Atlas, a scaled tool use benchmark, significantly outperforming Sonnet 4.5 at 43.8% and Opus 4.1 at 40.9%.

📊

Extended Context Management

Automatic context compression enables continuous conversations without interruption. The model preserves thinking blocks across turns and implements intelligent memory management for long-running agentic tasks.

Release Summary

Claude Opus 4.5 was released November 24-25, 2025, as Anthropic’s flagship model. The release covered performance benchmarks, pricing structure, platform integrations, and technical capabilities. Development followed the September 2025 Sonnet 4.5 launch and October 2025 Haiku 4.5 release, completing the Claude 4.5 family across three performance tiers.

The model’s deployment occurred days after Microsoft and Nvidia partnership announcements valued Anthropic at approximately $350 billion. Availability spans Claude apps, Claude API, and major cloud platforms including AWS Bedrock, Google Vertex AI, and Microsoft Azure. The API endpoint claude-opus-4-5-20251101 provides access to developers across deployment environments.

Product updates accompanied the release, including Claude Code availability in the desktop application, expanded Chrome extension access to Max subscribers, and general availability of Excel automation features for Max, Team, and Enterprise users. These integrations extend the model’s capabilities across browser automation, spreadsheet processing, and development workflows.

Claude Opus 4.5: The New Frontier in AI Capabilities

Performance Metrics Overview

Claude 4.5 Family Development Timeline

Model Family Architecture Comparison

Technical Capabilities and Platform Integration

Release Summary

Leave a comment Cancel reply

News, Technology

Wozniak Says “I Am Not a Fan of AI” and Lists 4 Gaps It Cannot Close as Apple Turns 50

News, Technology

Windows 11 Patch KB5079473 Broke 8 Apps Sign-In — Microsoft Emergency Fix KB5085516 Won’t Auto-Download

News, Technology

Barclays Says iPhone Fold Ships in December, Not September — Full Timeline and $2,000+ Price Explained

AI, News, Technology

OpenAI Merges ChatGPT, Codex, and Atlas Into One Desktop App After Calling Fragmentation a “Code Red”

News, Technology

Meta’s $83.6B Metaverse Ends as Horizon Worlds VR Moves to Maintenance Mode After 300K Peak Users

News, Technology

DarkSword iOS Exploit Chains 6 Bugs, Hits 221M iPhones in 4 Countries — and Leaves No Trace Behind

Claude Opus 4.5 scores 80.9% on SWE-bench, cuts costs 66% with $5/$25 pricing after $350B Anthropic valuation

Claude Opus 4.5: The New Frontier in AI Capabilities

Performance Metrics Overview

Claude 4.5 Family Development Timeline

Model Family Architecture Comparison

Technical Capabilities and Platform Integration

Release Summary

Share this:

Leave a comment Cancel reply

most recent

News, Technology

Wozniak Says “I Am Not a Fan of AI” and Lists 4 Gaps It Cannot Close as Apple Turns 50

News, Technology

Windows 11 Patch KB5079473 Broke 8 Apps Sign-In — Microsoft Emergency Fix KB5085516 Won’t Auto-Download

News, Technology

Barclays Says iPhone Fold Ships in December, Not September — Full Timeline and $2,000+ Price Explained

AI, News, Technology

OpenAI Merges ChatGPT, Codex, and Atlas Into One Desktop App After Calling Fragmentation a “Code Red”

News, Technology

Meta’s $83.6B Metaverse Ends as Horizon Worlds VR Moves to Maintenance Mode After 300K Peak Users

News, Technology

DarkSword iOS Exploit Chains 6 Bugs, Hits 221M iPhones in 4 Countries — and Leaves No Trace Behind