Vibe Check

Taste-testing new models.

Apr 16, 2025

Vibe Check: o3 Is Here—And It’s Great

The highest praise I can give is that I’m already using it all the time

Dec 10, 2024

Vibe Check: OpenAI’s Sora

The text-to-video model is finally available

Jan 23, 2025

We Tried OpenAI’s New Agent—Here’s What We Found

Operator (Could you help me do this task?)

Aug 12, 2025

Vibe Check: Claude Sonnet 4 Now Has a 1-million Token Context Window

Fast, reliable long-context responses—for a price

Aug 5, 2025

Vibe Check: OpenAI Drops Two New Open-weight Models

OpenAI President Greg Brockman: ‘The team cooked with this one’

Apr 18, 2025

Vibe Check: OpenAI’s o3, GPT-4.1, and o4-mini

Our take on what’s powerful, what’s practical, and what’s still TBD

May 16, 2025

Vibe Check: Codex—OpenAI’s New Coding Agent

Our hands-on day-0 review of the new autonomous software engineer

May 22, 2025

Vibe Check: Claude 4 Opus

Anthropic’s new model crushes pull requests, research deep dives, and honest editing—yet o3 keeps the daily-driver crown

Jul 17, 2025

Vibe Check: OpenAI Enters the Browser Wars With ChatGPT Agent

It’s launching today! Here’s our day-zero, hands-on report.

Mar 26, 2025

Vibe Check: OpenAI’s GPT-4o Image Generation

‘Finally, native images in ChatGPT!’

May 9, 2025

Vibe Check: Gemini 2.5 Pro and Gemini 2.5 Flash

Why Google might quietly win the race to be AI’s top backend provider

Jul 31, 2025

Vibe Check: Claude’s New Agents Are Confusing as Hell—And We Love Them

We spawned AI agents like crazy. Then we tried to work with them.

Sep 29, 2025

Vibe Check: Claude Sonnet 4.5

Faster than GPT-5 Codex, smarter and more steerable than Opus 4.1

Feb 3, 2025

We Tried OpenAI’s New Deep Research—Here’s What We Found

Vibe check: It’s awesome.

Mar 8, 2025

Vibe Check: Claude 3.7 Sonnet and Claude Code

All about the newest tools from Anthropic

Oct 20, 2025

Vibe Check: Claude Code Now Works on Mobile and the Web

Anthropic’s coding agent promises work from anywhere. After a weekend of testing, it still feels very beta.

Oct 6, 2025

Vibe Check: OpenAI DevDay 2025

Apps, agents, and API updates—but where’s the vision that makes you dream?

Aug 8, 2025

Vibe Check: Genie 3, Claude 4.1, GPT-oss, and GPT-5

Four model launches, four ideas about where AI goes next

Jun 23, 2025

o3-pro Vibe Check—A Slow, Steady Last Resort

OpenAI’s latest model trades speed for occasional brilliance—when nothing else works, it might

Oct 21, 2025

Vibe Check: OpenAI’s New AI Browser, Atlas

It feels less like learning something new than a browser that has caught up to how we already want to work with AI

Jul 18, 2025

Vibe Check: Grok 4 Aced Its Exams. The Real World Is a Different Story.

The smartest model isn’t always the most useful one

Sep 15, 2025

Vibe Check: GPT-5 Codex Can Code for 35 Minutes Straight—If You Ask Nicely

It launches today—here’s our day-zero vibe check

Nov 3, 2025

Vibe Check: Claude Skills Need a ‘Share’ Button

The feature is powerful for individuals and tricky for teams—but it does lighten the cognitive load

Nov 24, 2025

Vibe Check: Opus 4.5 Is the Coding Model We’ve Been Waiting For

But it’s not perfect—it failed our editing test