Anthropic’s new Claude AI model can decide between speed and deep thinking

Anthropic released on Monday its Claude 3.7 Sonnet model, which it says returns results faster and can show the user the “chain of thought” it follows to reach an answer. This latest model also powers a new coding tool called Claude Code that can perform some development tasks autonomously.

Claude 3.7 Sonnet offers an “extended thinking” mode that engages in a more detailed “chain of thought” reasoning but takes longer to generate a response. For simpler questions it eschews this mode and instead focuses on speed. Other models offer their own versions of “thinking” mode, but typically the user has to select that feature for harder problems; Anthropic says Claude 3.7 Sonnet is the first publicly available model with the capability to choose the best mode based on the user’s question. If Grok 3 and DeepSeek-R1 are stick shifts, then Anthropic’s new model is an automatic.

“Just as humans use a single brain for both quick responses and deep reflection, we believe reasoning should be an integrated capability of frontier models rather than a separate model entirely,” Anthropic says in a blog post.

Claude 3.7 Sonnet outperforms other “thinking” models in some important benchmark tests. On SWE-bench, which evaluates AI models’ ability to solve real-world software issues, the model beat OpenAI’s o1 and o3-mini and DeepSeek-R1 by a comfortable margin. It was the same story on TAU-bench, which tests AI agents on complex real-world tasks with user and tool interactions. However, OpenAI’s o1 model still edges out Claude 3.7 Sonnet in math problem solving, visual reasoning, multilingual Q&A, and graduate-level reasoning benchmarks.

Anthropic describes the Claude Code tool as an active collaborator that can search and read code, edit files, write and run tests, and commit and push code to GitHub. The company says the tool has already become “indispensable” for its own coders, completing tasks in a single pass that would normally take 45 minutes or more of manual work.

Claude 3.7 Sonnet is now available on all Claude subscription plans—Free, Pro, Team, and Enterprise–but the extended thinking mode isn’t available to users of the free tier. Claude 3.7 Sonnet is also available to developers as an API for the same price as earlier Claude models.

https://www.fastcompany.com/91283751/anthropic-new-claude-3-7-sonnet-ai-chain-of-thought?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Établi 6mo | 24 févr. 2025, 20:20:05

Connectez-vous pour ajouter un commentaire

Autres messages de ce groupe

Sony to raise PlayStation 5 prices in the U.S. amid tariff uncertainty

Sony will raise prices of its PlayStation 5 consoles in the United States b

20 août 2025, 19:30:14 | Fast company - tech

In Uvalde massacre lawsuit, Meta lawyer argues Instagram isn’t responsible for gunmaker’s posts

A lawsuit filed by families of the

20 août 2025, 17:20:04 | Fast company - tech

How AI will radically change military command structures

Despite two centuries of evolution, the structure of a modern military staff would be recognizable to Napoleon. At the same time, m

20 août 2025, 17:20:02 | Fast company - tech

This startup knows what AI is saying about your brand

Internet users are increasingly turning to AI tools like ChatGPT, rath

20 août 2025, 14:50:08 | Fast company - tech

OpenAI gave GPT-5 an emotional lobotomy, and it crippled the model

It’s rare for a tech titan to show any weakness or humanity. Yet even OpenAI’s notoriously understated CEO Sam Altman had to admit this week that the rollout of the company’s

20 août 2025, 14:50:06 | Fast company - tech

An engineer explains how AI can prevent satellite disasters in space

With satellite mega-constellations like SpaceX’s Starlink deploying thousands of spacecraft, monitoring their health has become an enormous challenge. Traditional methods can’t easily scale

20 août 2025, 12:30:15 | Fast company - tech

Landline phones are back—and they’re helping kids connect safely with friends

In today’s world, communication is largely done through one of two methods: smartphones or social media. Young children, however, rarely have access to either—and experts say they shouldn’t have a

20 août 2025, 12:30:12 | Fast company - tech

Tomas_r2