Anthropic’s new Claude AI model can decide between speed and deep thinking

Anthropic released on Monday its Claude 3.7 Sonnet model, which it says returns results faster and can show the user the “chain of thought” it follows to reach an answer. This latest model also powers a new coding tool called Claude Code that can perform some development tasks autonomously.

Claude 3.7 Sonnet offers an “extended thinking” mode that engages in more detailed “chain of thought” reasoning but takes longer to generate a response. For simpler questions it skips this mode and focuses on speed instead. Other models offer their own versions of a “thinking” mode, but typically the user has to select that feature for harder problems; Anthropic says Claude 3.7 Sonnet is the first publicly available model with the capability to choose the best mode based on the user’s question. If Grok 3 and DeepSeek-R1 are stick shifts, then Anthropic’s new model is an automatic.

“Just as humans use a single brain for both quick responses and deep reflection, we believe reasoning should be an integrated capability of frontier models rather than a separate model entirely,” Anthropic says in a blog post.

Claude 3.7 Sonnet outperforms other “thinking” models in some important benchmark tests. On SWE-bench, which evaluates AI models’ ability to solve real-world software issues, the model beat OpenAI’s o1 and o3-mini and DeepSeek-R1 by a comfortable margin. It was the same story on TAU-bench, which tests AI agents on complex real-world tasks with user and tool interactions. However, OpenAI’s o1 model still edges out Claude 3.7 Sonnet in math problem solving, visual reasoning, multilingual Q&A, and graduate-level reasoning benchmarks.

Anthropic describes the Claude Code tool as an active collaborator that can search and read code, edit files, write and run tests, and commit and push code to GitHub. The company says the tool has already become “indispensable” for its own coders, completing tasks in a single pass that would normally take 45 minutes or more of manual work.

Claude 3.7 Sonnet is now available on all Claude subscription plans (Free, Pro, Team, and Enterprise), but the extended thinking mode isn’t available to users of the free tier. Claude 3.7 Sonnet is also available to developers through the API for the same price as earlier Claude models.

https://www.fastcompany.com/91283751/anthropic-new-claude-3-7-sonnet-ai-chain-of-thought?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Created 3mo | 24. 2. 2025 20:20:05

Tomas_r2