
Live Vibe Check: Testing Claude Sonnet 4.6 — Opus-Level Performance at Sonnet Pricing

8 products mentioned
Host: Every
Tags: ai models, claude sonnet, api pricing, coding assistants, ai agents, model evaluation, developer tools

The hosts conduct a live "vibe check" of Claude Sonnet 4.6, Anthropic's newly released model, which Anthropic says delivers Opus-level performance at Sonnet pricing. Without early access, they test the model across real-world tasks (code review, creative problem-solving, spreadsheet analysis, and design work) to determine whether it truly matches Opus's capabilities while costing roughly 40% less.

Key takeaways
  • Claude Sonnet 4.6 appears to match Opus 4.6 in reasoning quality and creative problem-solving, making it viable for production applications where cost was previously prohibitive.
  • The model runs notably slower than expected for a Sonnet-tier product, with response times that feel closer to Opus than to a faster, cheaper tier, though speed may improve as server load decreases post-launch.
  • Sonnet 4.6 is positioned to unlock new use cases by reducing API costs from $5/$25 per million input/output tokens (Opus) to $3/$15 (Sonnet), making it affordable for developers to deploy agents in production.
  • The model shows occasional signs of over-eagerness or excess caution (jumping into tasks prematurely or asking unnecessary clarifying questions), suggesting users need to monitor its behavior more closely than Opus in some contexts.
  • Anthropic's strategy of pushing equivalent performance down the pricing tier each release cycle provides predictable costs for developers while maintaining a consistent Opus-Sonnet-Haiku hierarchy.
  • For hard debugging and meta-level reasoning tasks, Opus 4.6 still outperforms Sonnet 4.6 with faster, more direct solutions, though Sonnet excels at task execution and context management.
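
The pricing comparison in the takeaways can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the $5/$25 and $3/$15 figures are USD per million input and output tokens respectively, and the workload (2M input tokens, 500k output tokens) is a made-up example, not something measured in the episode. The function names are hypothetical.

```python
# Prices in USD per million tokens, as quoted in the takeaways.
# Assumption: the first figure is the input rate, the second the output rate.
PRICES = {
    "opus": {"input": 5.00, "output": 25.00},
    "sonnet": {"input": 3.00, "output": 15.00},
}


def cost_usd(tier: str, input_tokens: int, output_tokens: int) -> float:
    """Return the API cost in USD for one workload on the given tier."""
    rates = PRICES[tier]
    total = input_tokens * rates["input"] + output_tokens * rates["output"]
    return total / 1_000_000


def savings_fraction(input_tokens: int, output_tokens: int) -> float:
    """Return the fraction saved by running the workload on Sonnet instead of Opus."""
    opus = cost_usd("opus", input_tokens, output_tokens)
    sonnet = cost_usd("sonnet", input_tokens, output_tokens)
    return (opus - sonnet) / opus


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500k output tokens.
    inp, out = 2_000_000, 500_000
    print(f"Opus:    ${cost_usd('opus', inp, out):.2f}")
    print(f"Sonnet:  ${cost_usd('sonnet', inp, out):.2f}")
    print(f"Savings: {savings_fraction(inp, out):.0%}")
```

Under these assumed rates, both the input and output prices drop by the same proportion, so the saving works out to 40% regardless of how a workload splits between input and output tokens.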

Recommendations (3)

Claude (uses)

"I have it currently in my model picker in my Claude app. This is my Claude. I'm going to say, can you upgrade me to Sonnet 4.6?"

Every · ▶ 1:05

"I have the compound engineering plugin. It's very active. Lots of people adding stuff. I want to review the pull requests, see if they're all good."

Live Vibe Check · ▶ 7:06

Claude Code

"I'm trying in Claude Code. It's not there yet. I just handcoded the model and it's working. CC is my Claude Code."

Live Vibe Check · ▶ 1:14

Mentioned (5)

Claude Sonnet 4.6
"We're going to do a live vibe check on Sonnet 4.6. Anthropic told me it's sort of like Opus but at..." ▶ 0:25

Claude Opus
"Opus had one of the more interesting answers to this that I've seen so far. Each generation of mo..." ▶ 3:16

Cursor
"Copilot has added Kro and I think Cursor also. There's a Cursor plugin as well by Eric. So we mer..." ▶ 7:24

GitHub Copilot
"Copilot has added Kro and I think Cursor also. We merged this so native Cursor get a copilot whic..." ▶ 7:24

ChatGPT
"It's not mindblowing. It's not like the OpenAI Spark that we saw the Codex." ▶ 9:43