The Model Got Stranger

Hello, and happy Sunday! Was this newsletter forwarded to you? Sign up to get it in your inbox.

Knowledge base

“Vibe Check: Opus 4.7 Stopped Reading Between the Lines” by Katie Parrott/Vibe Check: Opus 4.7 is the best coding model Every has tested on well-specified tasks—Kieran Klaassen called his Rubber Duck benchmark run “best model ever”—but it won’t infer what you want the way 4.6 did, and the prompts you’ve tuned for the last two months will likely disappoint you at first. The gap between a tight brief and a loose one is wider than in any prior Opus. Read this for the full breakdown of where to switch to 4.7 now and where to stay on 4.6.

“The Folder Is the Agent” by Kieran Klaassen/Source Code: After three months trying to make AI agent swarms work in his coding flow, Kieran Klaassen realized that what was doing the work was a folder. A project directory with a CLAUDE.md, accumulated context, and specialized sub-agents is all you need to turn a general model into a domain expert. He’s now running 44 of them, connected by a Ruby dispatch layer that routes work while he sleeps. Read this to learn how to build the dispatch layer yourself.

“(Re(Re))Introducing Sparkle: Marie Kondo Your Mac” by Yash Poojary/On Every: Yash Poojary rebuilt Sparkle to purge the 80% percent of files on the average Mac that are screenshots, installer packages, and duplicates you’ll never open again before it organizes. The new version runs a cleanup pass first, then proposes a custom folder structure you can reshape through chat until it matches you like to work. Download the app and try it yourself.

🎧 🖥 “Mini-Vibe Check: Claude Managed Agents Handle the Infrastructure Work” by Laura Entis/Context Window: Dan Shipper sits down with Eve Bodnia, founder and CEO of Logical Intelligence, who argues that LLMs have a ceiling—and that energy-based models, which scan the full landscape of possible answers rather than predicting one token at a time, are what comes next. Plus: A Mini-Vibe Check on Anthropic’s Claude Managed Agents; Willie Williams proposes new vocabulary for the AI age. 🎧 🖥 Listen on Spotify or Apple Podcasts, or watch on X or YouTube.

“You’re the Manager Now” by Laura Entis/Context Window: The Claude Code desktop app gets a redesign built for managing parallel agent work—and Kieran Klaassen was already living in it. Plus: Dan Shipper explains why you should ignore the viral claim that smaller models can match Anthropic’s Mythos, Austin Tedesco shares the one question he asks Claude Code before shipping anything, and Eleanor Warnock on why the Dia browser’s bet on beauty might be the right one.

“Living Software” by Jack Cheng: AI-accelerated development has made software feel zombieish—tools that shouldn’t be alive suddenly sprouting chat boxes and AI sidebars. Jack Cheng proposes a distinction: “tool-like software,” which users expect to be stable, versus “living software,” which users expect to adapt and grow. The two categories carry different expectations, and confusing them causes disorientation. Read this for his practical advice on how builders of both should design, ship, and communicate with their users.