Vibe Check

Taste-testing new models.

Dec 9, 2024

Vibe Check: OpenAI’s Sora

The text-to-video model is finally available

Apr 16, 2025

Vibe Check: o3 Is Here—And It’s Great

The highest praise I can give is that I’m already using it all the time

Aug 12, 2025

Vibe Check: Claude Sonnet 4 Now Has a 1-million Token Context Window

Fast, reliable long-context responses—for a price

Jan 22, 2025

We Tried OpenAI’s New Agent—Here’s What We Found

Operator (Could you help me do this task?)

May 16, 2025

Vibe Check: Codex—OpenAI’s New Coding Agent

Our hands-on day-0 review of the new autonomous software engineer

Jul 17, 2025

Vibe Check: OpenAI Enters the Browser Wars With ChatGPT Agent

It’s launching today! Here’s our day-zero, hands-on report.

Aug 5, 2025

Vibe Check: OpenAI Drops Two New Open-weight Models

OpenAI President Greg Brockman: ‘The team cooked with this one’

Jul 31, 2025

Vibe Check: Claude’s New Agents Are Confusing as Hell—And We Love Them

We spawned AI agents like crazy. Then we tried to work with them.

Mar 26, 2025

Vibe Check: OpenAI’s GPT-4o Image Generation

'Finally, native images in ChatGPT!'

Nov 3, 2025

Vibe Check: Claude Skills Need a ‘Share’ Button

The feature is powerful for individuals and tricky for teams—but it does lighten the cognitive load

Apr 18, 2025

Vibe Check: OpenAI’s o3, GPT-4.1, and o4-mini

Our take on what’s powerful, what’s practical, and what’s still TBD

May 22, 2025

Vibe Check: Claude 4 Opus

Anthropic’s new model crushes pull requests, research deep dives, and honest editing—yet o3 keeps the daily-driver crown

Feb 2, 2025

We Tried OpenAI’s New Deep Research—Here’s What We Found

Vibe check: It’s awesome.

Oct 20, 2025

Vibe Check: Claude Code Now Works on Mobile and the Web

Anthropic’s coding agent promises work from anywhere. After a weekend of testing, it still feels very beta.

Oct 6, 2025

Vibe Check: OpenAI DevDay 2025

Apps, agents, and API updates—but where's the vision that makes you dream?

Oct 30, 2025

Vibe Check: I Canceled Two AI Max Plans for Factory’s Coding Agent Droid

The one that keeps me in flow across Anthropic and OpenAI’s models—without switching tools

Aug 8, 2025

Vibe Check: Genie 3, Claude 4.1, GPT-oss, and GPT-5

Four model launches, four ideas about where AI goes next

Jun 23, 2025

o3-pro Vibe Check—A Slow, Steady Last Resort

OpenAI’s latest model trades speed for occasional brilliance—when nothing else works, it might

Oct 29, 2025

Vibe Check: Cursor 2.0 and Composer 1 Alpha

Two new things: A code editor designed to manage agents and a lightning-fast model

Nov 24, 2025

Vibe Check: Opus 4.5 Is the Coding Model We’ve Been Waiting For

But it’s not perfect—it failed our editing test

Oct 21, 2025

Vibe Check: OpenAI’s New AI Browser, Atlas

It feels less like learning something new than a browser that has caught up to how we already want to work with AI

Mar 8, 2025

Vibe Check: Claude 3.7 Sonnet and Claude Code

All about the newest tools from Anthropic

Sep 29, 2025

Vibe Check: Claude Sonnet 4.5

Faster than GPT-5 Codex, smarter and more steerable than Opus 4.1

Nov 19, 2025

Vibe Check: Gemini 3 Pro, A Reliable Workhorse With Surprising Flair

After 24 hours of hands-on testing, we found a model that’s fast, reliable, and surprisingly funny—but still prone to overreaching and not yet a writing champ