Anthropic’s new Claude AI model can decide between speed and deep thinking

Anthropic released on Monday its Claude 3.7 Sonnet model, which it says returns results faster and can show the user the “chain of thought” it follows to reach an answer. This latest model also powers a new coding tool called Claude Code that can perform some development tasks autonomously.

Claude 3.7 Sonnet offers an “extended thinking” mode that engages in a more detailed “chain of thought” reasoning but takes longer to generate a response. For simpler questions it eschews this mode and instead focuses on speed. Other models offer their own versions of “thinking” mode, but typically the user has to select that feature for harder problems; Anthropic says Claude 3.7 Sonnet is the first publicly available model with the capability to choose the best mode based on the user’s question. If Grok 3 and DeepSeek-R1 are stick shifts, then Anthropic’s new model is an automatic.

“Just as humans use a single brain for both quick responses and deep reflection, we believe reasoning should be an integrated capability of frontier models rather than a separate model entirely,” Anthropic says in a blog post.

Claude 3.7 Sonnet outperforms other “thinking” models in some important benchmark tests. On SWE-bench, which evaluates AI models’ ability to solve real-world software issues, the model beat OpenAI’s o1 and o3-mini and DeepSeek-R1 by a comfortable margin. It was the same story on TAU-bench, which tests AI agents on complex real-world tasks with user and tool interactions. However, OpenAI’s o1 model still edges out Claude 3.7 Sonnet in math problem solving, visual reasoning, multilingual Q&A, and graduate-level reasoning benchmarks.

Anthropic describes the Claude Code tool as an active collaborator that can search and read code, edit files, write and run tests, and commit and push code to GitHub. The company says the tool has already become “indispensable” for its own coders, completing tasks in a single pass that would normally take 45 minutes or more of manual work.

Claude 3.7 Sonnet is now available on all Claude subscription plans—Free, Pro, Team, and Enterprise–but the extended thinking mode isn’t available to users of the free tier. Claude 3.7 Sonnet is also available to developers as an API for the same price as earlier Claude models.

https://www.fastcompany.com/91283751/anthropic-new-claude-3-7-sonnet-ai-chain-of-thought?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Erstellt 2mo | 24.02.2025, 20:20:05

Melden Sie sich an, um einen Kommentar hinzuzufügen

Andere Beiträge in dieser Gruppe

What ChatGPT’s ‘sycophancy’ failure teaches publishers about AI and trust

When OpenAI pulled back its latest ChatGPT release—one that apparently turned the helpful

09.05.2025, 08:50:02 | Fast company - tech

‘His views better have changed since 2012’: How a viral meme account beat the Vatican to the Pope Leo XIV news

White smoke poured from the Sistine Chapel chimney Thursday at 6:07 p.m. local time, signaling the end of the conclave and the election of a new pope to lead the Catholic Church. Cardinal Robert F

08.05.2025, 23:30:07 | Fast company - tech

X blocks access to jailed Istanbul mayor’s account per Turkey’s request

The social media platform X said Thursday it has blocked access to ja

08.05.2025, 18:50:08 | Fast company - tech

Coding emerges as generative AI’s breakout star

Welcome to AI Decoded, Fast Company’s weekly newsletter that breaks down the most important news in the world of AI. You can sign up to receive this newsletter every week

08.05.2025, 18:50:06 | Fast company - tech

Instacart CEO Fidji Simo is heading to OpenAI: Here’s what you need to know

The woman behind Instacart’s successful IPO, Fidji Simo, is joining OpenAI’s C-suite.

On Wednesda

08.05.2025, 18:50:05 | Fast company - tech

Nintendo profits tanked 43% in Q1 but hopes to bounce back with the Switch 2 release

Japanese video-game maker Nintendo on Thursday reported a 43% decline in profit for the fiscal year through March, but promised a turnaro

08.05.2025, 16:30:11 | Fast company - tech

Thanks to DOGE, Gumroad’s founder has a second job with the VA

Sahil Lavingia has had just three jobs over a 15-year career in tech.

The first was as the second employee of Pinterest. The second was by founding the startup

08.05.2025, 11:50:06 | Fast company - tech

Tomas_r2