Midjourney/Every illustration.

The Model Got Stranger

Plus: Using NotebookLM to run the Pyramid Principle in reverse

Like 1 Comments

Hello, and happy Sunday! Was this newsletter forwarded to you? Sign up to get it in your inbox.


Knowledge base

“Vibe Check: Opus 4.7 Stopped Reading Between the Lines” by Katie Parrott/Vibe Check: Opus 4.7 is the best coding model Every has tested on well-specified tasks—Kieran Klaassen called his Rubber Duck benchmark run “best model ever”—but it won’t infer what you want the way 4.6 did, and the prompts you’ve tuned for the last two months will likely disappoint you at first. The gap between a tight brief and a loose one is wider than in any prior Opus. Read this for the full breakdown of where to switch to 4.7 now and where to stay on 4.6.

“The Folder Is the Agent” by Kieran Klaassen/Source Code: After three months trying to make AI agent swarms work in his coding flow, Kieran Klaassen realized that what was doing the work was a folder. A project directory with a CLAUDE.md, accumulated context, and specialized sub-agents is all you need to turn a general model into a domain expert. He’s now running 44 of them, connected by a Ruby dispatch layer that routes work while he sleeps. Read this to learn how to build the dispatch layer yourself.

“(Re(Re))Introducing Sparkle: Marie Kondo Your Mac” by Yash Poojary/On Every: Yash Poojary rebuilt Sparkle to purge the 80% percent of files on the average Mac that are screenshots, installer packages, and duplicates you’ll never open again before it organizes. The new version runs a cleanup pass first, then proposes a custom folder structure you can reshape through chat until it matches you like to work. Download the app and try it yourself.

🎧 🖥 “Mini-Vibe Check: Claude Managed Agents Handle the Infrastructure Work” by Laura Entis/Context Window: Dan Shipper sits down with Eve Bodnia, founder and CEO of Logical Intelligence, who argues that LLMs have a ceiling—and that energy-based models, which scan the full landscape of possible answers rather than predicting one token at a time, are what comes next. Plus: A Mini-Vibe Check on Anthropic’s Claude Managed Agents; Willie Williams proposes new vocabulary for the AI age. 🎧 🖥 Listen on Spotify or Apple Podcasts, or watch on X or YouTube.

“You’re the Manager Now” by Laura Entis/Context Window: The Claude Code desktop app gets a redesign built for managing parallel agent work—and Kieran Klaassen was already living in it. Plus: Dan Shipper explains why you should ignore the viral claim that smaller models can match Anthropic’s Mythos, Austin Tedesco shares the one question he asks Claude Code before shipping anything, and Eleanor Warnock on why the Dia browser’s bet on beauty might be the right one.

“Living Software” by Jack Cheng: AI-accelerated development has made software feel zombieish—tools that shouldn’t be alive suddenly sprouting chat boxes and AI sidebars. Jack Cheng proposes a distinction: “tool-like software,” which users expect to be stable, versus “living software,” which users expect to adapt and grow. The two categories carry different expectations, and confusing them causes disorientation. Read this for his practical advice on how builders of both should design, ship, and communicate with their users.



Log on

Create a free account to continue reading

The Only Subscription
You Need to Stay at the
Edge of AI

The essential toolkit for those shaping the future

"This might be the best value you
can get from an AI subscription."

- Jay S.

Mail Every Content
AI&I Podcast AI&I Podcast
Monologue Monologue
Cora Cora
Sparkle Sparkle
Spiral Spiral

Join 100,000+ leaders, builders, and innovators

Community members

Already have an account? Sign in

What is included in a subscription?

Daily insights from AI pioneers + early access to powerful AI tools

Pencil Front-row access to the future of AI
Check In-depth reviews of new models on release day
Check Playbooks and guides for putting AI to work
Check Prompts and use cases for builders

Comments

You need to login before you can comment.
Don't have an account? Sign up!

We use analytics and advertising tools by default. You can update this anytime.