Claude Opus 4.8 Review 2026 — Benchmarks, Pricing, Dynamic Workflows, and the $65B Series H

Claude Opus 4.8 Is Here: Top Coding Benchmarks, Dynamic Workflows, and a $65B Round That Makes Anthropic the World's Most Valuable AI Company

QUICK ANSWER

Claude Opus 4.8 launched May 28, 2026 at the same price as 4.7 — $5 input / $25 output per million tokens. Key numbers: 69.2% SWE-Bench Pro (vs 58.6% for GPT-5.5), 88.6% SWE-Bench Verified, 1890 Elo on GDPval-AA (+121 over GPT-5.5). New this release: Dynamic Workflows in Claude Code (hundreds of parallel subagents), a 2.5x Fast Mode ($10/$50, 3x cheaper than prior fast mode), and measurable honesty improvements. Same day, Anthropic closed a $65 billion Series H at $965 billion post-money — now the world's most valuable AI company, ahead of OpenAI.

Benchmark Results — Where Opus 4.8 Leads and Where It Doesn't

Benchmark	Opus 4.8	Opus 4.7	GPT-5.5	Gemini 3.1 Pro
SWE-Bench Pro (agentic coding)	69.2%	64.3%	58.6%	54.2%
SWE-Bench Verified	88.6%	87.6%	—	—
GDPval-AA (Elo, knowledge work)	1890	1753	1769	1656
Terminal-Bench 2.1	74.6%	—	76.2%	—
MCP-Atlas (tool use)	82.2%	77.3%	—	83.6%
GPQA Diamond (reasoning)	93.6%	94.2%	—	—
BrowseComp single-agent	84.3%	79.3%	—	—
Multidisciplinary reasoning w/ tools	57.9%	54.7%	—	—

The SWE-Bench Pro jump is the headline: 69.2% vs 64.3% for Opus 4.7, and significantly ahead of GPT-5.5 at 58.6%. This is the benchmark that most directly reflects real-world agentic coding performance — it tests AI systems on actual GitHub issues from production repositories, not synthetic problems. Terminal-Bench 2.1 is the one benchmark where GPT-5.5 still leads (76.2% vs 74.6%) — a small gap on terminal-agent tasks. GPQA Diamond dips slightly from 94.2% to 93.6%, which Anthropic attributes to prioritising practical task performance over pure academic reasoning depth. The GDPval-AA Elo of 1890 is +121 over GPT-5.5 and +137 over Opus 4.7 — the largest single-cycle jump on that leaderboard this year.

One efficiency improvement that does not show up in the benchmark table: Opus 4.8 uses approximately 35% fewer output tokens than 4.7 to complete equivalent tasks. At the same per-token price, that means real-world costs are meaningfully lower even without a price cut. For high-volume agentic workloads — pipelines that generate thousands of API calls per day — that token efficiency difference compounds significantly.

What's Actually New — The Four Operational Shifts

1. Dynamic Workflows (Research Preview) — Parallel Subagents in Claude Code

Dynamic Workflows is the most significant new capability in the 4.8 release. Available as a research preview in Claude Code, it allows Claude to plan a large task, then spawn and coordinate hundreds of parallel subagents to execute different parts of that plan simultaneously. This changes the class of problems Claude Code can address: instead of working through a codebase sequentially, it can now execute codebase-scale migrations across hundreds of thousands of lines of code in a single session, with different subagents handling different modules in parallel.

Jarred Sumner, the creator of Bun (the JavaScript runtime), was among the early testers and posted on X that Opus 4.8 had become his strongest coding model — specifically praising its honesty about bugs and edge cases it was uncertain about rather than silently passing flawed code. That developer signal is consistent with Anthropic's stated alignment improvement: the model is roughly four times less likely than Opus 4.7 to let code flaws pass unremarked.

2. Fast Mode — 2.5x Speed, 3x Cheaper Than Prior Fast Mode

Opus 4.8 introduces an optional Fast Mode priced at $10 input / $50 output per million tokens. It runs at 2.5x the output speed of standard mode and is three times cheaper than the fast mode available on previous Opus models. For latency-sensitive applications — customer-facing products, interactive coding tools, real-time agents — Fast Mode gives developers a meaningful cost-speed tradeoff that did not exist in prior generations. Standard mode remains at $5/$25, unchanged from Opus 4.7.

3. Mid-Task System Messages on the Messages API

Developers can now inject system messages mid-conversation via the Messages API — a new capability that allows agentic pipelines to update instructions, constraints, or context while a long-running task is in progress. Previously, system-level guidance had to be set at the start of a conversation and could not be modified without restarting the session. For long agentic tasks where conditions change (new information arrives, a step fails and the plan needs to adapt), this is a meaningful architectural improvement.

4. Honesty and Alignment Improvements

Anthropic's own evaluations show Opus 4.8 is approximately four times less likely than Opus 4.7 to allow flaws in code it has written to pass unremarked. Rates of misaligned behaviour including deception are lower than Opus 4.7 and comparable to Claude Mythos Preview — Anthropic's unreleased frontier model used only in restricted programs like Project Glasswing. Early testers consistently report the model flags uncertainties rather than proceeding with unsupported assumptions, and is more likely to ask clarifying questions before committing to an approach on ambiguous tasks. This is not just a user experience improvement — for production agentic systems where silent failures compound across many steps, a model that surfaces uncertainty is significantly safer to deploy without continuous human oversight.

Pricing and Availability

Mode	Input (per 1M tokens)	Output (per 1M tokens)	Speed
Standard	$5.00	$25.00	Baseline
Fast Mode	$10.00	$50.00	2.5x faster, 3x cheaper than prior fast mode

Opus 4.8 is generally available from launch across the Claude API (model ID: claude-opus-4-8), Amazon Bedrock, Google Cloud Vertex AI, and Microsoft Foundry. The 1M-token context window with up to 128K output tokens is available by default across all platforms. Context caching, tool use, computer use, and vision inputs carry over from Opus 4.7 unchanged.

The $65 Billion Series H — Anthropic Overtakes OpenAI

On the same day as the Opus 4.8 launch, Anthropic announced it had closed a $65 billion Series H funding round at a $965 billion post-money valuation. The round was led by Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital — each contributing more than $2 billion. Co-leads include Capital Group, Coatue, D1 Capital Partners, GIC, ICONIQ, and XN. Other investors include Blackstone, Brookfield, D.E. Shaw Ventures, DST Global, Fidelity Management and Research, General Catalyst, Lightspeed Venture Partners, MGX, Samsung, SK Hynix, Micron, Temasek, and T. Rowe Price. The round includes $15 billion in previously committed investments from hyperscale partners, including $5 billion from Amazon.

The $965 billion valuation puts Anthropic ahead of OpenAI ($852 billion post-money after its March 2026 round) for the first time — making it the most valuable AI company in the world by private-market mark. The jump is remarkable in its speed: Anthropic was valued at $380 billion in February 2026 after its Series G, meaning it has added more than $585 billion in implied valuation in roughly 100 days. Anthropic CFO Krishna Rao attributed the growth to enterprise adoption: "Our run-rate revenue crossed $47 billion earlier this month." That figure — $47 billion annualized run-rate — compares to OpenAI's approximately $25 billion ARR reported in February 2026.

TechCrunch described this as likely Anthropic's final private fundraise before its IPO, which Bloomberg has reported is targeting October 2026. The round's scale — $65 billion, the largest private AI fundraise on record — reflects the degree to which institutional investors are treating the AI model race as a winner-take-most market and positioning accordingly before Anthropic lists. One institutional investor reportedly pledged $5 billion just to get a meeting with CFO Krishna Rao during the fundraise process.

Round	Date	Amount	Valuation
Series F	Nov 2024	$4B	$40B
Series G	Feb 2026	$30B	$380B
Series H	May 28, 2026	$65B	$965B

What the $47 Billion ARR Means

Anthropic's $47 billion annualized run-rate revenue is the most significant data point in the Series H announcement. It more than doubles the figure most analysts had modelled for this stage of the company and nearly doubles OpenAI's $25 billion ARR from February. The growth has been driven primarily by three channels: enterprise API consumption (KPMG, Deloitte, PwC, and hundreds of other enterprise clients), Claude.ai Pro and Team subscriptions, and infrastructure resale through Amazon Bedrock and Google Vertex AI.

Anthropic is also paying SpaceX approximately $15 billion per year for cloud computing infrastructure through the Colossus cluster — a cost that partially offsets the revenue figure. Even accounting for that infrastructure spend and the cost of model training at scale, analysts covering the IPO preparation expect Anthropic's path to profitability to be significantly clearer than OpenAI's, whose $1.22 loss per dollar of revenue ratio remains a challenge for public market investors.

What's Next — Claude Mythos Public Rollout

Multiple sources at the time of the Series H announcement noted that Anthropic is preparing a broader rollout for Claude Mythos — the unreleased frontier model currently available only to approximately 50 partner organizations through Project Glasswing. Mythos is described as stronger than Opus 4.8 on security and mathematical reasoning tasks. Anthropic has said Mythos-class models are expected in the coming weeks. If Mythos reaches general availability before the October IPO target, it would be Anthropic's strongest public model release to date and a significant signal to public market investors about the product roadmap.

Frequently Asked Questions

What is the Claude Opus 4.8 API model ID?

claude-opus-4-8 — available on the Claude API, Amazon Bedrock, Google Cloud Vertex AI, and Microsoft Foundry from May 28, 2026. Use claude-opus-4-8[1m] to explicitly request the 1M-token context window variant.

Should I upgrade from Opus 4.7 to 4.8?

Yes, for almost all use cases — price is unchanged, benchmarks are better across agentic coding, knowledge work, and tool use, and the token efficiency improvement (~35% fewer output tokens) means real-world costs are lower even at the same rate card. The only cases where 4.7 might be preferable: if you have carefully tuned prompts optimised for 4.7 behaviour and cannot tolerate the risk of regression during the transition period. Test on a sample of your workload before fully migrating production pipelines.

What is Dynamic Workflows in Claude Code?

Dynamic Workflows is a research preview feature in Claude Code that allows Claude to plan a large task and then spawn and coordinate hundreds of parallel subagents to work on different parts simultaneously. It is designed for codebase-scale work — large migrations, refactors across hundreds of thousands of lines, or multi-module parallel development — where sequential processing would take too long. It is available as a research preview, meaning the interface and capabilities may change before general availability.

Does Anthropic's $965B valuation mean it's more valuable than OpenAI?

By private-market post-money valuation, yes — $965 billion vs OpenAI's $852 billion from its March 2026 round. However, private valuations are not directly comparable to public market capitalizations, and both companies are pre-IPO. OpenAI has filed a confidential S-1 targeting a September 2026 public debut; Anthropic is targeting October 2026. The public markets will set the definitive relative valuation when both companies list.

What is Anthropic's revenue run rate in 2026?

Anthropic stated in its Series H announcement that run-rate revenue crossed $47 billion in May 2026. This is annualized run rate based on current monthly revenue — not full-year 2026 recognized revenue, which will be lower. The $47B ARR is up from approximately $18 billion projected for full-year 2026 as of January, reflecting significantly faster growth than analysts had modelled.

When will Claude Mythos be publicly available?

Anthropic has not given a specific date. Multiple sources at the time of the Series H announcement indicated a broader Mythos rollout is being prepared, and Anthropic's own release notes say Mythos-class models are expected in the coming weeks. Current access is restricted to approximately 50 partner organizations through Project Glasswing. Watch Anthropic's official blog and the Claude changelog for the announcement.

Claude Opus 4.8 Is Here: Top Coding Benchmarks, Dynamic Workflows, and a $65B Round That Makes Anthropic the World's Most Valuable AI Company