After 24 hours of hands-on testing, we found a model that’s fast, reliable, and surprisingly funny—but still prone to overreaching and not yet a writing champ
The smartest model isn’t always the most useful one
It launches today—here’s our day-zero vibe check
Switching browsers is a pain. Here are the ones that our team deemed worth it.
Why Google might quietly win the race to be AI’s top backend provider
This one’s for the developers
OpenAI's latest model update excels at instruction-following and extended tasks, but don't expect it to surprise you
Anthropic's latest Opus is more precise, more literal, and the best coding model we've tested on well-specified tasks—but it won't fill in the gaps for you anymore
OpenAI nailed the interface. But it's built for hardcore engineering.
The asynchronous, agentic workflow developers love is finally accessible to everyone—but the polish isn't there yet
Sonnet 4.6 delivers Opus-close performance at half the price—but speed didn't come along for the ride
Sora 2 removed every creative barrier, but our feeds tell a different story about human imagination
Opus 4.8 tops both our Senior Engineer benchmark and our writing tests. It’s the most complete model we’ve tested. We just wish it had an app to match.
OpenAI’s new model is a top-end senior engineer—and easy to talk to
We’ve tested both models thoroughly—here’s our head-to-head Vibe Check
GPT-5.4 is fast, opinionated, and good enough to tempt our Opus loyalist
Our hands-on review of OpenAI’s newest model based on weeks of testing
The AI-native IDE is now becoming an agent-orchestration tool. Will it work?
It one-shotted a problem other models missed—and brings agentic, parallel work to non-coding tasks
The autonomy we wanted is here—but the model still does what you say, not what you mean
We use analytics and advertising tools by default. You can update this anytime.
Manage optional tracking categories. Necessary cookies stay on so the site can function.
Global Privacy Control is active, so advertising and sharing is turned off.