Every illustration/Anthropic.

Claude 3 Is The Most Human AI Yet

But that doesn’t mean it's going to beat ChatGPT

201 5

Sponsored By: Composer

Wall Street legend Jim Simons has generated 66 percent returns annually for over 30 years. His secret? Algorithmic trading.

With Composer, you can create algorithmic trading strategies that automatically trade for you (no coding required).

  • Build the strategy using AI, a no-code editor, or use one of 1,000-plus community strategies
  • Test the performance
  • Invest in a click

Every technology is described by the same words when it comes to market: faster, cheaper, smarter. With the release of Claude 3—an AI model comparable to GPT-4—its creator, Anthropic, said the typical stuff. The company published evaluations that showed its newest model was on par or slightly more powerful than its peers. It discussed how its models could be run more cheaply. 

All of this is cool, but the differences in benchmark testing or price weren’t so drastic as to warrant an essay from this column. 

The correct word, the unusual one, the word that Anthropic didn’t use and I feel is most accurate, is something weird. 

The right way to describe Claude3 is warm

It is the most human-feeling, creative, and naturalistic AI that I have ever interacted with. I’m aware that this is not a scientific metric. But frankly, we don’t have the right tests to understand this dimension of AI. Shoot, we might not even have the right language. 

Our team at Every includes some of the few people on the planet that have access to OpenAI’s latest public GPT models, Google’s newest Gemini 1.5 models, and Anthropic’s Claude 3. We’ve also been using LLMs for years—testing, evaluating, scrutinizing, and inventing new ways to work

In all that labor, we’ve never found an AI able to act as a robust, independent writing companion that can take on large portions of the creative burden—until now. Claude 3 seems to have finally done it. Anthropic’s team has made an AI that, when paired with a smart writer, results in a dramatically better creative product. 

This has profound implications. Great writing isn’t just about the prose, or about commas or sentences. Instead, great writing is deep thought made enjoyable. Until Claude 3, most AI models were good thought made presentable. Claude 3 frequently crosses the great/enjoyable rubicon. 

To determine that I had it work on the hardest pitch of all time: convincing someone to follow Jesus. 

Unlock the power of algorithmic trading with Composer. Our platform is designed for both novices and seasoned traders, offering tools to create automatic trading strategies without any coding knowledge.

With more than $1 billion in trading volume already, Composer is the platform of choice. Create your own trading strategies using AI or choose from over 1,000 community-shared strategies. Test their performance and invest with a single click.

Jesus, automated

Every Monday at 2 p.m. ET the entire brood of my in-laws jumps on a Facebook Messenger call. The purpose of our weekly chat is to connect with my youngest brother-in-law, Gage, who is on a Mormon mission to Brazil. We tell him about how his favorite sports teams are doing, he tells us about who he is teaching about Jesus that week. Many of us on the call donned the necktie-and-white-shirt combo and served missions ourselves, so we share pointers on what worked for us. 

This Monday, I had an idea. Rather than rely on our experience for guidance, we could summon the collective wisdom of mankind—i.e., we could ask the LLMs for advice on missionary work. So I pulled up Claude 3, GPT-4, and Gemini 1.5, and prompted them with a question about how to help people learning about the Church understand the long list of rules that members are asked to follow. 

First, I asked GPT-4 to help my brother-in-law teach in a more compelling way. It outputted a long list of generic advice that felt cold. 

Source: Screenshot by the author.(This went on for another 11 bullets.) Then I tried Gemini. The result was similarly impersonal writing, with comparable substance—and it took an extra 20 seconds. 

Source: Screenshot by the author.Finally I tried Claude 3. It was remarkably gentle, kind, and specific. I was astonished.

Source: Screenshot by the author.While this test was interesting, it wasn’t possible to draw general conclusions from it. The more compelling example was when we deployed it in our own products at work.

Find Out What
Comes Next in Tech.

Start your free trial.

New ideas to help you build the future—in your inbox, every day. Trusted by over 75,000 readers.

Subscribe

Already have an account? Sign in

What's included?

  • Unlimited access to our daily essays by Dan Shipper, Evan Armstrong, and a roster of the best tech writers on the internet
  • Full access to an archive of hundreds of in-depth articles
  • Unlimited software access to Spiral, Sparkle, and Lex

  • Priority access and subscriber-only discounts to courses, events, and more
  • Ad-free experience
  • Access to our Discord community

Comments

You need to login before you can comment.
Don't have an account? Sign up!
Tintin 8 months ago

Good read regarding Claude 3: https://twitter.com/hahahahohohe/status/1765088860592394250

@mark_1679 8 months ago

Biggest takeaway for me was ‘Platforms like Redit that were relying on AI data licensing revenue should also be worried. There is no need to pay them when LLMs can generate training data for you.’
Try explaining synthetic data to folks that are just beginning to understand how the heck these models work.
Everyday I have ‘a moment’ with this stuff.
Super read and one I’ll be sharing with colleagues and friends.

Mark Modesti 8 months ago

"To determine that I had it work on the hardest pitch of all time: convincing someone to follow Jesus." Curious why you have this bias. Christianity has fared quite well for the ppast couple thousand years.

Tom Parish 8 months ago

Thought, insightful. Thank you. I've come to similar conclusions with my use of Claude 3. How fascinating it is to think what comes next.

Steve Selzer 8 months ago

How does it compare to Pi? Curious because I’ve felt this “warmth” and EQ from day one with Inflection AI’s product and curious whether you’re testing it out alongside these LLM competitors. Thanks!

Every

What Comes Next in Tech

Subscribe to get new ideas about the future of business, technology, and the self—every day