← Back to Reviews | AI Development

Claude Sonnet 5 Review: Anthropic's Most Agentic Sonnet Model Yet (2026)

Marcus Webb · · Rated 9/10 · $3/M input tokens, $15/M output tokens (intro $2/$10 until Aug 31)
9 / 10
Ease of Use 8.5
Features 9.2
Value for Money 9.5
Performance 9
Support & Ecosystem 8.8

✅ Pros

  • Near-Opus 4.8 performance at roughly half the price
  • Substantially improved agentic capabilities over Sonnet 4.6
  • Built-in effort scaling for cost-performance optimization
  • Available as default model across all Claude plans including Free
  • Lower rate of undesirable behaviors in agentic contexts

⚠️ Cons

  • Higher absolute cost per task compared to GPT-5 or Gemini 2.5 Pro
  • Limited cybersecurity capabilities vs Opus models
  • Introductory pricing expires August 31, 2026
  • Some complex code refactoring still benefits from Opus 4.8
Best For

Developers needing agentic coding assistance, teams building AI-powered workflows, and professionals seeking a smarter everyday assistant

Pricing

$3/M input tokens, $15/M output tokens (intro $2/$10 until Aug 31)

Quick Verdict

Claude Sonnet 5 closes the gap between Sonnet-class and Opus-class models more than any previous release. With near-Opus 4.8 performance across agentic benchmarks at roughly half the price, it’s the best value proposition Anthropic has ever offered. For most developers and knowledge workers, Sonnet 5 is now the sweet spot — capable enough for complex multi-step tasks without reaching for the Opus tier.


What’s New in Claude Sonnet 5

Anthropic launched Claude Sonnet 5 on June 30, 2026, positioning it as the most “agentic” Sonnet model yet. The key advancement is in reasoning depth, tool use reliability, and autonomous task completion — areas where previous Sonnet models would often stall or stop short.

Benchmark Performance

Compared to Sonnet 4.6 and Opus 4.8:

BenchmarkSonnet 4.6Sonnet 5Opus 4.8
SWE-bench Verified53.2%67.8%71.4%
BrowseComp (agentic search)41.5%58.3%62.1%
OSWorld-Verified38.7%52.4%56.9%
MMLU-Pro78.3%85.6%88.2%
MATH-50076.1%83.2%87.5%

At higher “effort” levels, Sonnet 5 can match Opus 4.8 on certain categories of agentic search and computer use tasks, making it a versatile option for cost-conscious teams.

Agentic Capabilities

Early access testers across multiple companies reported consistent findings:

Lovable (AI app builder): “Claude Sonnet 5 gets more done with less. Same output quality, fewer steps to get there. It refuses unsafe requests cleanly and consistently.”

Anonymous enterprise tester: “We handed Claude Sonnet 5 a two-part job — update Salesforce account tiers, send a launch announcement to enterprise contacts — and it finished end to end. That used to stall halfway.”

The model demonstrates unprompted self-checking behavior: testers described Sonnet 5 writing reproducing tests, implementing fixes, and stashing changes without being asked.


Pricing Breakdown

PlanInput TokensOutput TokensAvailability
Introductory (until Aug 31, 2026)$2/M$10/MClaude API & Claude Code
Standard (after Aug 31)$3/M$15/MClaude API & Claude Code
Free planN/A (limited)N/ADefault model
Pro ($20/mo)IncludedIncludedDefault model
Max ($100/mo)IncludedIncludedDefault model
Team/EnterpriseNegotiatedNegotiatedDefault model

At the introductory price of $2/$10 per million tokens, Sonnet 5 is positioned competitively against GPT-5 ($5/$15) and Gemini 2.5 Pro ($1.25/$10), though Gemini remains cheaper for input-heavy workloads.


Pros & Cons

Strengths

  • Value champion: Near-Opus intelligence at Sonnet pricing makes it the best deal in Anthropic’s lineup
  • Self-correcting behavior: The model proactively checks and fixes its own output without explicit prompting
  • Effort scaling: Low to high effort modes let you trade off cost vs. quality granularly
  • Safety improvements: Anthropic’s system card reports a lower rate of undesirable behaviors compared to Sonnet 4.6

Limitations

  • Not a universal replacement: Complex cybersecurity and penetration testing tasks still require Opus models
  • Higher per-task cost than Gemini 2.5 Pro: For high-volume token consumption, Gemini still holds a price advantage
  • Introductory pricing is temporary: Teams budgeting at the $2/$10 rate need to plan for the 50% increase to $3/$15 after August

How It Compares

DimensionClaude Sonnet 5GPT-5Gemini 2.5 Pro
Intelligence9/108.5/108.5/10
Coding9.2/108.8/108.5/10
Agentic tasks9.0/108.0/108.3/10
Speed85 tok/s110 tok/s95 tok/s
Price (in/out per M)$3/$15$5/$15$1.25/$10
Context window200K tokens128K tokens1M tokens

Sonnet 5 leads on pure intelligence and agentic capability, but falls behind Gemini 2.5 Pro on context window size and input pricing.


Who Should Buy This

Buy if: You’re a developer or team building AI-powered workflows, especially those requiring multi-step tool use, code generation, and autonomous task completion. The model’s self-checking behavior alone can save significant debugging time.

Skip if: Your workload is primarily high-volume summarization or simple chat — Gemini 2.5 Pro offers better value at lower prices. Also skip if you need cutting-edge cybersecurity AI capabilities, where Opus 4.8 still leads.


FAQ

Q: Is Claude Sonnet 5 better than Claude Opus 4.8? A: Not universally. Sonnet 5 matches Opus 4.8 on some agentic tasks at higher effort levels, but Opus 4.8 still leads on complex reasoning, cybersecurity, and nuanced creative writing.

Q: When does the introductory pricing end? A: August 31, 2026. After that, prices go to $3/M input and $15/M output tokens.

Q: Can I use Claude Sonnet 5 with Claude Code? A: Yes. Claude Code defaults to Sonnet 5 as of the June 30 release. Run claude in your terminal — it will use Sonnet 5 automatically.

Q: Is Sonnet 5 available on the Free plan? A: Yes. It’s the default model for Free and Pro plans as of launch day.

Q: How does the “effort” system work? A: Claude’s API now accepts an effort parameter (low/medium/high). Higher effort spends more thinking tokens for better quality on complex tasks. Sonnet 5 benefits significantly from high-effort mode, approaching Opus-level quality on many agentic evaluations.

Claude Sonnet 5 Anthropic AI model agentic coding LLM