Vibe Check

Taste-testing new models.

Apr 16, 2025

Vibe Check: o3 Is Here—And It’s Great

The highest praise I can give is that I’m already using it all the time

Aug 12, 2025

Vibe Check: Claude Sonnet 4 Now Has a 1-million Token Context Window

Fast, reliable long-context responses—for a price

Jan 23, 2025

We Tried OpenAI’s New Agent—Here’s What We Found

Operator (Could you help me do this task?)

Dec 10, 2024

Vibe Check: OpenAI’s Sora

The text-to-video model is finally available

Aug 5, 2025

Vibe Check: OpenAI Drops Two New Open-weight Models

OpenAI President Greg Brockman: ‘The team cooked with this one’

May 16, 2025

Vibe Check: Codex—OpenAI’s New Coding Agent

Our hands-on day-0 review of the new autonomous software engineer

May 22, 2025

Vibe Check: Claude 4 Opus

Anthropic’s new model crushes pull requests, research deep dives, and honest editing—yet o3 keeps the daily-driver crown

Apr 18, 2025

Vibe Check: OpenAI’s o3, GPT-4.1, and o4-mini

Our take on what’s powerful, what’s practical, and what’s still TBD

Jul 17, 2025

Vibe Check: OpenAI Enters the Browser Wars With ChatGPT Agent

It’s launching today! Here’s our day-zero, hands-on report.

Mar 26, 2025

Vibe Check: OpenAI’s GPT-4o Image Generation

‘Finally, native images in ChatGPT!’

May 9, 2025

Vibe Check: Gemini 2.5 Pro and Gemini 2.5 Flash

Why Google might quietly win the race to be AI’s top backend provider

Mar 8, 2025

Vibe Check: Claude 3.7 Sonnet and Claude Code

All about the newest tools from Anthropic

Feb 3, 2025

We Tried OpenAI’s New Deep Research—Here’s What We Found

Vibe check: It’s awesome.

Jul 31, 2025

Vibe Check: Claude’s New Agents Are Confusing as Hell—And We Love Them

We spawned AI agents like crazy. Then we tried to work with them.

Aug 8, 2025

Vibe Check: Genie 3, Claude 4.1, GPT-oss, and GPT-5

Four model launches, four ideas about where AI goes next

Jun 23, 2025

o3-pro Vibe Check—A Slow, Steady Last Resort

OpenAI’s latest model trades speed for occasional brilliance—when nothing else works, it might

Jul 18, 2025

Vibe Check: Grok 4 Aced Its Exams. The Real World Is a Different Story.

The smartest model isn’t always the most useful one

Aug 7, 2025

GPT-5

Our hands-on review of OpenAI’s newest model based on weeks of testing