Anthropic’s new Claude AI model can decide between speed and deep thinking

Anthropic released on Monday its Claude 3.7 Sonnet model, which it says returns results faster and can show the user the “chain of thought” it follows to reach an answer. This latest model also powers a new coding tool called Claude Code that can perform some development tasks autonomously.

Claude 3.7 Sonnet offers an “extended thinking” mode that engages in a more detailed “chain of thought” reasoning but takes longer to generate a response. For simpler questions it eschews this mode and instead focuses on speed. Other models offer their own versions of “thinking” mode, but typically the user has to select that feature for harder problems;  Anthropic says Claude 3.7 Sonnet is the first publicly available model with the capability to choose the best mode based on the user’s question. If Grok 3 and DeepSeek-R1 are stick shifts, then Anthropic’s new model is an automatic.

“Just as humans use a single brain for both quick responses and deep reflection, we believe reasoning should be an integrated capability of frontier models rather than a separate model entirely,” Anthropic says in a blog post.

Claude 3.7 Sonnet outperforms other “thinking” models in some important benchmark tests. On SWE-bench, which evaluates AI models’ ability to solve real-world software issues, the model beat OpenAI’s o1 and o3-mini and DeepSeek-R1 by a comfortable margin. It was the same story on TAU-bench, which tests AI agents on complex real-world tasks with user and tool interactions. However, OpenAI’s o1 model still edges out Claude 3.7 Sonnet in math problem solving, visual reasoning, multilingual Q&A, and graduate-level reasoning benchmarks.

Anthropic describes the Claude Code tool as an active collaborator that can search and read code, edit files, write and run tests, and commit and push code to GitHub. The company says the tool has already become “indispensable” for its own coders, completing tasks in a single pass that would normally take 45 minutes or more of manual work. 

Claude 3.7 Sonnet is now available on all Claude subscription plans—Free, Pro, Team, and Enterprise–but the extended thinking mode isn’t available to users of the free tier. Claude 3.7 Sonnet is also available to developers as an API for the same price as earlier Claude models.

https://www.fastcompany.com/91283751/anthropic-new-claude-3-7-sonnet-ai-chain-of-thought?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Creado 4mo | 24 feb 2025, 20:20:05


Inicia sesión para agregar comentarios

Otros mensajes en este grupo.

Astroworld is back in the spotlight and survivors are sharing haunting stories on TikTok

Astroworld is back in the news, and social media has some thoughts.

In November 2021, a

20 jun 2025, 23:10:03 | Fast company - tech
Your reliance on ChatGPT might be really bad for your brain

If you value critical thinking, you may want to rethink your use of ChatGPT.

As graduates

20 jun 2025, 18:30:02 | Fast company - tech
What is ‘office chair butt’? TikTok’s viral term for a real health problem

Rather than the Sunday scaries or toxic bosses, employees have unlocked a new workplace fear: office chair butt.

While not a new concern, the term has resurfaced on TikTok to describe ho

20 jun 2025, 16:10:07 | Fast company - tech
How this Parisian music streaming service is fighting AI fraud

Music streaming service Deezer said Friday that it will start flagging albums with AI-generated songs, part of its fight against

20 jun 2025, 16:10:06 | Fast company - tech
Nvidia and Hexagon’s Aeon humanoid robot brings AI-powered automation to factories

Artificial intelligence is evolving at an unprecedented pace, advancing from simple generative tasks to autonomous decision-making through

20 jun 2025, 16:10:05 | Fast company - tech
VisionOS 26 proves Apple isn’t treating the Vision Pro like a hobby

In 2023, the flagship reveal at Apple’s WWDC keynote was unquestionably the debut of

20 jun 2025, 13:40:08 | Fast company - tech
What the Wright Brothers can teach science entrepreneurs about how to survive a funding pullback

What happens when venture capital and government pull back from science entrepreneurs at the same time? Many scientists think we’re about to find out, and are looking at how we can preserve our co

20 jun 2025, 11:30:03 | Fast company - tech