Claude Opus 4.8 Tops GPT-5.5 on Agentic Coding as Anthropic Closes $65B Round at Near-$1T Valuation

Anthropic launched Claude Opus 4.8 on May 28, 2026 — the company’s fourth point release in its flagship model family, arriving just 41 days after Opus 4.7. The new model posts stronger benchmark numbers across agentic coding, computer use, and reasoning, and ships at the same price as its predecessor. The release happened on the same day Anthropic closed a $65 billion Series H funding round at a $965 billion post-money valuation, making it the most valuable AI startup in the world — surpassing OpenAI’s $852 billion mark. An October 2026 IPO window is widely reported as the likely next step.

Three things separate this release from a routine upgrade: an emphasis on honesty (the model is around four times less likely than Opus 4.7 to let code errors pass unremarked), a new Dynamic Workflows system for large-scale parallel tasks in the competitive AI coding tool space, and a fast mode that now runs at 2.5× speed for one-third of the previous cost.

From Code Flaws to $965 Billion: Inside Opus 4.8

69.2%

SWE-Bench Pro Score

4×

Less Likely to Miss Code Flaws vs 4.7

1,000

Max Parallel Subagents (Dynamic Workflows)

2.5×

Fast Mode Speed

$965B

Anthropic Valuation (Series H)

Benchmark Explorer

How Opus 4.8 stacks up

Select a benchmark category to compare Opus 4.8 against GPT-5.5, Gemini 3.1 Pro, and Opus 4.7. Source: Anthropic system card, May 28, 2026.

SWE-Bench Pro — real GitHub issue resolution across multi-file codebases. Opus 4.8 leads by over 10 points vs GPT-5.5.

Opus 4.8

69.2%

Top

Opus 4.7

64.3%

GPT-5.5

58.6%

Gemini 3.1 Pro

54.2%

OSWorld-Verified — model controls a live desktop (mouse + keyboard) to complete real tasks. Opus 4.8 leads the field.

Opus 4.8

83.4%

Top

Opus 4.7

82.8%

GPT-5.5

78.7%

Gemini 3.1 Pro

76.2%

Terminal-Bench 2.1 — command-line task performance. This is the one benchmark where GPT-5.5 holds the lead over Opus 4.8.

GPT-5.5

78.2%

Top

Opus 4.8

74.6%

Gemini 3.1 Pro

70.3%

Opus 4.7

66.1%

Humanity’s Last Exam (with tools) — multidisciplinary graduate-level reasoning. Opus 4.8 scores highest in the field.

Opus 4.8

57.9%

Top

GPT-5.5

53.1%

Gemini 3.1 Pro

49.4%

Opus 4.7

51.2%

Opus 4.8

GPT-5.5

Gemini 3.1 Pro

Opus 4.7

Honesty Upgrade

The model that flags its own mistakes

Anthropic calls this “honesty” — Opus 4.8 is less likely to make unsupported claims and more likely to tell you when its work is uncertain. Early testers at Bridgewater Associates noted the model “proactively flags issues with inputs and outputs of an analysis, something other models routinely missed.”

Code flaw pass-through rate — relative likelihood of letting a code error go unremarked (lower = better). Opus 4.8 is around 4× less likely than Opus 4.7 to skip over a flaw in code it has written.

Opus 4.7

High pass-through

Opus 4.8

~4× lower

Anthropic described the core issue: “A general problem with AI models is that they sometimes jump to conclusions, confidently claiming to have made progress in their work despite the evidence being thin.” Opus 4.8 is designed to surface those gaps rather than paper over them — relevant for anyone using Claude in long agentic coding sessions or automated workflows.

What’s New

Key features shipping with Opus 4.8

Four additions across the Claude product line, not just the model itself.

⚡

Dynamic Workflows

A new Claude Code feature that spins up to 1,000 parallel subagents to handle very large tasks — like rewriting an entire application in a different language. It plans work, runs agents in parallel, verifies outputs, and reports back. Activate with the /fast command in Claude Code.

🎛️

Effort Control

Users on claude.ai and Claude Cowork can now set how much effort Claude puts into a response. More effort for complex tasks; less for quick answers. This also controls how many tokens are consumed, affecting cost.

🚀

Fast Mode — 3× Cheaper

Fast mode for Opus 4.8 runs at roughly 2.5× the standard speed. It is now three times cheaper than fast mode was for previous Opus versions, making latency-sensitive workloads significantly more affordable for developers on the Claude API.

🔧

Messages API: System Entries

The Messages API now accepts system entries mid-conversation without breaking the prompt cache. Developers can update permissions or environment context in real time during an agentic run — useful for multi-step, multi-agent pipelines.

Pricing

Standard vs Fast Mode costs

Opus 4.8 is priced identically to Opus 4.7 for standard usage. Toggle to see Fast Mode rates.

Standard Fast Mode (2.5× speed)

Input Tokens

per million tokens

Output Tokens

$25

per million tokens

Standard pricing — same as Opus 4.7. Available on claude.ai, Claude Code, and the Claude API (model string: claude-opus-4-8). Available to developers via the Claude API today.

Funding & Context

Anthropic’s road to $965 billion

The Opus 4.8 launch happened on the same day as the largest AI equity round ever closed. Here’s the funding sequence.

Feb 12, 2026

Series G — $30B at $380B valuation

Co-led by Altimeter Capital, Dragoneer, Founders Fund, ICONIQ, and MGX. Also included capital from Microsoft and Nvidia. Valued Anthropic at $380 billion.

Apr 16, 2026

Claude Opus 4.7 released

Improvements in coding, vision, and multi-step tasks. Predecessor to Opus 4.8, released 41 days before.

May 28, 2026

Series H — $65B at $965B valuation + Opus 4.8 launch

Led by Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital. Includes $15B of previously committed hyperscaler capital ($5B from Amazon). Anthropic’s run-rate revenue crossed $47 billion. Surpasses OpenAI’s reported $852 billion valuation. Also includes strategic infrastructure partners Samsung, SK Hynix, and Micron. Google committed up to $40B total (including prior rounds) alongside 5 gigawatts of computing capacity. Amazon has committed $5B with plans for up to $20B more.

Expected Oct 2026

IPO window — widely reported target

The Series H is widely expected to be Anthropic’s last private fundraise before a public market debut. An October 2026 IPO window has been reported. OpenAI is targeting a similar window, making this a closely watched race for both enterprise AI infrastructure and public market investors.

Claude Mythos

Anthropic’s more powerful unreleased model

Opus 4.8 is Anthropic’s current public flagship, but it is not the company’s most capable model.

🔒

Claude Mythos — rolling out to customers in coming weeks

Mythos is Anthropic’s large language model with advanced cybersecurity capabilities — advanced enough that it initially raised concerns among executives and governments about potential misuse. Under Project Glasswing, Amazon, Microsoft, and Apple are permitted to use Mythos for cybersecurity purposes. Anthropic said on May 28 it intends to make Mythos available to all customers in the coming weeks. Opus 4.8 benchmarks show it performs at a similar level to Mythos on “prosocial” traits like supporting user autonomy — though Mythos remains the more capable model overall. See the Anthropic announcement and related tech coverage at Giganectar for more context.

Claude Opus 4.8 was covered here in terms of its benchmark performance, key new features, pricing, and the broader funding context around its May 28, 2026 release. The model is available on claude.ai, in Claude Code, and via the Claude API today. The full Anthropic announcement has the complete benchmark tables and system card. Related coverage of competing developments in AI model releases and AI hardware is available on Giganectar.