Anthropic’s new Claude AI model can decide between speed and deep thinking

Anthropic released on Monday its Claude 3.7 Sonnet model, which it says returns results faster and can show the user the “chain of thought” it follows to reach an answer. This latest model also powers a new coding tool called Claude Code that can perform some development tasks autonomously.

Claude 3.7 Sonnet offers an “extended thinking” mode that engages in a more detailed “chain of thought” reasoning but takes longer to generate a response. For simpler questions it eschews this mode and instead focuses on speed. Other models offer their own versions of “thinking” mode, but typically the user has to select that feature for harder problems;  Anthropic says Claude 3.7 Sonnet is the first publicly available model with the capability to choose the best mode based on the user’s question. If Grok 3 and DeepSeek-R1 are stick shifts, then Anthropic’s new model is an automatic.

“Just as humans use a single brain for both quick responses and deep reflection, we believe reasoning should be an integrated capability of frontier models rather than a separate model entirely,” Anthropic says in a blog post.

Claude 3.7 Sonnet outperforms other “thinking” models in some important benchmark tests. On SWE-bench, which evaluates AI models’ ability to solve real-world software issues, the model beat OpenAI’s o1 and o3-mini and DeepSeek-R1 by a comfortable margin. It was the same story on TAU-bench, which tests AI agents on complex real-world tasks with user and tool interactions. However, OpenAI’s o1 model still edges out Claude 3.7 Sonnet in math problem solving, visual reasoning, multilingual Q&A, and graduate-level reasoning benchmarks.

Anthropic describes the Claude Code tool as an active collaborator that can search and read code, edit files, write and run tests, and commit and push code to GitHub. The company says the tool has already become “indispensable” for its own coders, completing tasks in a single pass that would normally take 45 minutes or more of manual work. 

Claude 3.7 Sonnet is now available on all Claude subscription plans—Free, Pro, Team, and Enterprise–but the extended thinking mode isn’t available to users of the free tier. Claude 3.7 Sonnet is also available to developers as an API for the same price as earlier Claude models.

https://www.fastcompany.com/91283751/anthropic-new-claude-3-7-sonnet-ai-chain-of-thought?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Létrehozva 6mo | 2025. febr. 24. 20:20:05


Jelentkezéshez jelentkezzen be

EGYÉB POSTS Ebben a csoportban

Crypto.com bets big on sports prediction markets

Legal sports betting is still not offered in California or Texas, the country’s two most populous states—and in Florida, the third most-populous, it’s largely controlled by the Seminole Tribe. But

2025. szept. 2. 18:30:03 | Fast company - tech
Pinterest bets on ‘additive AI’ as it reimagines personalization

For more than a decade, social platforms have faced criticism for embedding algorithms that fuel compulsive behaviors, encourage doomscrolling, and measure success by time spent glued to screens.

2025. szept. 2. 13:50:05 | Fast company - tech
This startup is using AI to take on high real estate commissions

A new startup called Ridley wants to make it cheaper to sell a home by challenging the traditional real estate commission model.

Founder and CEO

2025. szept. 2. 13:50:03 | Fast company - tech
5 ways to write better AI prompts

This article is republished with permission from Wonder Tools, a newsletter that helps you discover the most useful sites and apps. 

2025. szept. 2. 11:20:06 | Fast company - tech
This scrappy developer is bringing back what millions loved about Trello

For all the many features it’s been lobbing into the world lately, Trello hasn’t given its most dedicated fans the one thing many of them crave most—and that’s a ticket back in t

2025. szept. 2. 6:40:07 | Fast company - tech
How I took control of my email address with a custom domain

Over the past three years, I’ve changed email providers three times without ever changing email addresses.

That’s because my address is entirely under my control. Instead of relying on a

2025. szept. 1. 14:30:04 | Fast company - tech
This viral grocery hack will help you save money and reduce waste

If you dread the weekly grocery shop, or get sidetracked by fun snacks only to end up with no real meals, this might be the hack for you.

The 5-4-3-2-1 method gives shoppers like you a s

2025. aug. 31. 13:10:02 | Fast company - tech