Anthropic’s new Claude AI model can decide between speed and deep thinking

Anthropic released on Monday its Claude 3.7 Sonnet model, which it says returns results faster and can show the user the “chain of thought” it follows to reach an answer. This latest model also powers a new coding tool called Claude Code that can perform some development tasks autonomously.

Claude 3.7 Sonnet offers an “extended thinking” mode that engages in a more detailed “chain of thought” reasoning but takes longer to generate a response. For simpler questions it eschews this mode and instead focuses on speed. Other models offer their own versions of “thinking” mode, but typically the user has to select that feature for harder problems;  Anthropic says Claude 3.7 Sonnet is the first publicly available model with the capability to choose the best mode based on the user’s question. If Grok 3 and DeepSeek-R1 are stick shifts, then Anthropic’s new model is an automatic.

“Just as humans use a single brain for both quick responses and deep reflection, we believe reasoning should be an integrated capability of frontier models rather than a separate model entirely,” Anthropic says in a blog post.

Claude 3.7 Sonnet outperforms other “thinking” models in some important benchmark tests. On SWE-bench, which evaluates AI models’ ability to solve real-world software issues, the model beat OpenAI’s o1 and o3-mini and DeepSeek-R1 by a comfortable margin. It was the same story on TAU-bench, which tests AI agents on complex real-world tasks with user and tool interactions. However, OpenAI’s o1 model still edges out Claude 3.7 Sonnet in math problem solving, visual reasoning, multilingual Q&A, and graduate-level reasoning benchmarks.

Anthropic describes the Claude Code tool as an active collaborator that can search and read code, edit files, write and run tests, and commit and push code to GitHub. The company says the tool has already become “indispensable” for its own coders, completing tasks in a single pass that would normally take 45 minutes or more of manual work. 

Claude 3.7 Sonnet is now available on all Claude subscription plans—Free, Pro, Team, and Enterprise–but the extended thinking mode isn’t available to users of the free tier. Claude 3.7 Sonnet is also available to developers as an API for the same price as earlier Claude models.

https://www.fastcompany.com/91283751/anthropic-new-claude-3-7-sonnet-ai-chain-of-thought?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Établi 5mo | 24 févr. 2025, 20:20:05


Connectez-vous pour ajouter un commentaire

Autres messages de ce groupe

A newly discovered exoplanet rekindles humanity’s oldest question: Are we alone?

Child psychologists tell us that around the age of five or six, children begin to seriously contemplate the world around them. It’s a glorious moment every parent recognizes—when young minds start

13 juil. 2025, 11:10:06 | Fast company - tech
How Watch Duty became a go-to app during natural disasters

During January’s unprecedented wildfires in Los Angeles, Watch Duty—a digital platform providing real-time fire data—became the go-to app for tracking the unfolding disaster and is credit

13 juil. 2025, 06:30:05 | Fast company - tech
Why the AI pin won’t be the next iPhone

One of the most frequent questions I’ve been getting from business execs lately is whether the

12 juil. 2025, 12:10:02 | Fast company - tech
Microsoft will soon delete your Authenticator passwords. Here are 3 password manager alternatives

Users of Microsoft apps are having a rough year. First, in May, the Windows maker

12 juil. 2025, 09:40:03 | Fast company - tech
Yahoo Creators platform hits record revenue as publisher bets big on influencer-led content

Yahoo’s bet on creator-led content appears to be paying off. Yahoo Creators, the media company’s publishing platform for creators, had its most lucrative month yet in June.

Launched in M

11 juil. 2025, 17:30:04 | Fast company - tech
GameStop’s Nintendo Switch 2 stapler sells for more than $100,000 on eBay after viral mishap

From being the face of memestock mania to going viral for inadvertently stapling the screens of brand-new video game consoles, GameStop is no stranger to infamy.

Last month, during the m

11 juil. 2025, 12:50:04 | Fast company - tech
Don’t take the race for ‘superintelligence’ too seriously

The technology industry has always adored its improbably audacious goals and their associated buzzwords. Meta CEO Mark Zuckerberg is among the most enamored. After all, the name “Meta” is the resi

11 juil. 2025, 12:50:02 | Fast company - tech