Vibe Check

Taste-testing new models.

Popular Newest Oldest

Mar 8, 2025

Vibe Check: Claude 3.7 Sonnet and Claude Code

All about the newest tools from Anthropic

Vivian Meng

Sep 29, 2025

Vibe Check: Claude Sonnet 4.5

Faster than GPT-5 Codex, smarter and more steerable than Opus 4.1

Dan Shipper

Nov 19, 2025

Vibe Check: Gemini 3 Pro, A Reliable Workhorse With Surprising Flair

After 24 hours of hands-on testing, we found a model that’s fast, reliable, and surprisingly funny—but still prone to overreaching and not yet a writing champ

Rhea Purohit

Oct 21, 2025

Vibe Check: OpenAI’s New AI Browser, Atlas

It feels less like learning something new than a browser that has caught up to how we already want to work with AI

Dan Shipper

Oct 23, 2025

We Tested Claude Sonnet 4.5 for Writing and Editing

Five tests across blind comparisons, editorial standards, and deadlines—here's what changed our setup

Katie Parrott

Nov 25, 2025

The AI Browsers That Made It Into Our Daily Workflow

Switching browsers is a pain. Here are the ones that our team deemed worth it.

Rhea Purohit

Sep 15, 2025

Vibe Check: GPT-5 Codex Can Code for 35 Minutes Straight—If You Ask Nicely

It launches today—here’s our day-zero vibe check

Dan Shipper

Jul 18, 2025

Vibe Check: Grok 4 Aced Its Exams. The Real World Is a Different Story.

The smartest model isn’t always the most useful one

Rhea Purohit

May 9, 2025

Vibe Check: Gemini 2.5 Pro and Gemini 2.5 Flash

Why Google might quietly win the race to be AI’s top backend provider

Katie Parrott

Oct 15, 2025

Vibe Check: Anthropic Cooked on Claude Haiku 4.5

This one’s for the developers

Dan Shipper

Dec 11, 2025

Vibe Check: GPT-5.2 Is an Incremental Upgrade

OpenAI's latest model update excels at instruction-following and extended tasks, but don't expect it to surprise you

Katie Parrott

Jan 13, 2026

Vibe Check: Claude Cowork Is Claude Code for the Rest of Us

The asynchronous, agentic workflow developers love is finally accessible to everyone—but the polish isn't there yet

Katie Parrott

Feb 2, 2026

Vibe Check: OpenAI's Codex App Gains Ground on Claude Code

OpenAI nailed the interface. But it's built for hardcore engineering.

Dan Shipper

Apr 17, 2026

Vibe Check: Opus 4.7 Stopped Reading Between the Lines

Anthropic's latest Opus is more precise, more literal, and the best coding model we've tested on well-specified tasks—but it won't fill in the gaps for you anymore

Katie Parrott

Oct 16, 2025

OpenAI Made Video Creation Effortless—Here's What Happened Next

Sora 2 removed every creative barrier, but our feeds tell a different story about human imagination

Rhea Purohit

Feb 18, 2026

Vibe Check: Anthropic Just Made Opus Cheaper Without Calling It That

Sonnet 4.6 delivers Opus-close performance at half the price—but speed didn't come along for the ride

Katie Parrott

May 28, 2026

Vibe Check: Opus 4.8—Anthropic Should’ve Rounded Up to 5

Opus 4.8 tops both our Senior Engineer benchmark and our writing tests. It’s the most complete model we’ve tested. We just wish it had an app to match.

Dan Shipper