Vibe Check

Taste-testing new models.

Oct 21, 2025

Vibe Check: OpenAI’s New AI Browser, Atlas

It feels less like learning something new than like using a browser that has caught up to how we already want to work with AI

Oct 23, 2025

We Tested Claude Sonnet 4.5 for Writing and Editing

Five tests across blind comparisons, editorial standards, and deadlines—here's what changed our setup

Oct 29, 2025

Vibe Check: Cursor 2.0 and Composer 1 Alpha

Two new things: A code editor designed to manage agents and a lightning-fast model

Oct 30, 2025

Vibe Check: I Canceled Two AI Max Plans for Factory’s Coding Agent Droid

The one that keeps me in flow across Anthropic and OpenAI’s models—without switching tools

Nov 3, 2025

Vibe Check: Claude Skills Need a ‘Share’ Button

The feature is powerful for individuals and tricky for teams—but it does lighten the cognitive load

Nov 19, 2025

Vibe Check: Gemini 3 Pro, A Reliable Workhorse With Surprising Flair

After 24 hours of hands-on testing, we found a model that’s fast, reliable, and surprisingly funny—but still prone to overreaching and not yet a writing champ

Nov 24, 2025

Vibe Check: Opus 4.5 Is the Coding Model We’ve Been Waiting For

But it’s not perfect—it failed our editing test

Nov 25, 2025

The AI Browsers That Made It Into Our Daily Workflow

Switching browsers is a pain. Here are the ones that our team deemed worth it.

Dec 11, 2025

Vibe Check: GPT-5.2 Is an Incremental Upgrade

OpenAI's latest model update excels at instruction-following and extended tasks, but don't expect it to surprise you

Jan 13, 2026

Vibe Check: Claude Cowork Is Claude Code for the Rest of Us

The asynchronous, agentic workflow developers love is finally accessible to everyone—but the polish isn't there yet

Feb 2, 2026

Vibe Check: OpenAI’s Codex App Gains Ground on Claude Code

OpenAI nailed the interface. But it's built for hardcore engineering.

Feb 5, 2026

Vibe Check: GPT-5.3 Codex—The 10x Engineer, Now More Fun at Parties

The autonomy we wanted is here—but the model still does what you say, not what you mean

Feb 5, 2026

Vibe Check: Opus 4.6—The Best Coding Model We’ve Tested (With Some Maddening Habits)

It one-shotted a problem other models missed—and brings agentic, parallel work to non-coding tasks

Feb 5, 2026

GPT-5.3 Codex vs. Opus 4.6: The Great Convergence

We’ve tested both models thoroughly—here’s our head-to-head Vibe Check

Feb 18, 2026

Vibe Check: Anthropic Just Made Opus Cheaper Without Calling It That

Sonnet 4.6 delivers Opus-close performance at half the price—but speed didn't come along for the ride

Mar 5, 2026

Vibe Check: GPT-5.4—OpenAI Is Back

GPT-5.4 is fast, opinionated, and good enough to tempt our Opus loyalist

Apr 2, 2026

Vibe Check: Cursor 3.0 Bets Big on Agent Orchestration

The AI-native IDE is now becoming an agent-orchestration tool. Will it work?

Apr 17, 2026

Vibe Check: Opus 4.7 Stopped Reading Between the Lines

Anthropic's latest Opus is more precise, more literal, and the best coding model we've tested on well-specified tasks—but it won't fill in the gaps for you anymore

Apr 23, 2026

Vibe Check: GPT-5.5 Has It All

OpenAI’s new model is a top-end senior engineer—and easy to talk to
