Midjourney/Every illustration.

Vibe Check: Grok 4 Aced Its Exams. The Real World Is a Different Story.

The smartest model isn’t always the most useful one

Comments

@federicoescobarcordoba 40 minutes ago

I jumped on the Grok 4 ship by subscribing as soon as it launched—and then canceled within two days. It was obvious the writing was off (try asking it to complete a paragraph, and it’ll produce something that sounds like it’s from another century). Its analysis wasn’t as sharp as Grok 3’s, either. I’ve seen some improvement over the past few days—at least in analysis, which now feels on par with Grok 3. But for my use cases, Gemini 2.5 Pro is still leagues ahead of the pack. Thanks for the insights from this post. Ethan Mollick posted about the McNamara Fallacy recently, and my mind went to Grok 4 right away.