Anthropic’s new Claude AI model can decide between speed and deep thinking

Anthropic released on Monday its Claude 3.7 Sonnet model, which it says returns results faster and can show the user the “chain of thought” it follows to reach an answer. This latest model also powers a new coding tool called Claude Code that can perform some development tasks autonomously.

Claude 3.7 Sonnet offers an “extended thinking” mode that engages in a more detailed “chain of thought” reasoning but takes longer to generate a response. For simpler questions it eschews this mode and instead focuses on speed. Other models offer their own versions of “thinking” mode, but typically the user has to select that feature for harder problems;  Anthropic says Claude 3.7 Sonnet is the first publicly available model with the capability to choose the best mode based on the user’s question. If Grok 3 and DeepSeek-R1 are stick shifts, then Anthropic’s new model is an automatic.

“Just as humans use a single brain for both quick responses and deep reflection, we believe reasoning should be an integrated capability of frontier models rather than a separate model entirely,” Anthropic says in a blog post.

Claude 3.7 Sonnet outperforms other “thinking” models in some important benchmark tests. On SWE-bench, which evaluates AI models’ ability to solve real-world software issues, the model beat OpenAI’s o1 and o3-mini and DeepSeek-R1 by a comfortable margin. It was the same story on TAU-bench, which tests AI agents on complex real-world tasks with user and tool interactions. However, OpenAI’s o1 model still edges out Claude 3.7 Sonnet in math problem solving, visual reasoning, multilingual Q&A, and graduate-level reasoning benchmarks.

Anthropic describes the Claude Code tool as an active collaborator that can search and read code, edit files, write and run tests, and commit and push code to GitHub. The company says the tool has already become “indispensable” for its own coders, completing tasks in a single pass that would normally take 45 minutes or more of manual work. 

Claude 3.7 Sonnet is now available on all Claude subscription plans—Free, Pro, Team, and Enterprise–but the extended thinking mode isn’t available to users of the free tier. Claude 3.7 Sonnet is also available to developers as an API for the same price as earlier Claude models.

https://www.fastcompany.com/91283751/anthropic-new-claude-3-7-sonnet-ai-chain-of-thought?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

созданный 3mo | 24 февр. 2025 г., 20:20:05


Войдите, чтобы добавить комментарий

Другие сообщения в этой группе

Apple partners with a brain-computer startup to turn thoughts into device control

Apple is partnering with brain-computer interface company Synchron to develop technology that lets users control devices using neural signals.

Still in the early stages, the technology c

13 мая 2025 г., 19:20:07 | Fast company - tech
Couples are saying ‘I do’ in ‘Minecraft’ as virtual weddings become more popular

Destination weddings are out, and virtual weddings are in.

Rather than traveling to the Amalfi Coast or Provence, Wired

13 мая 2025 г., 19:20:06 | Fast company - tech
Sal Khan’s new Dialogues program teaches students how to have civil, thoughtful discussions

In recent years, Khan Academy founder Sal Khan has been most visible promoting the organization’s

13 мая 2025 г., 17:10:03 | Fast company - tech
Spotify’s AI-powered DJ now takes song requests

Since it launched two years ago, Spotify’s AI DJ has been a one-way experience. It curates old favorites and helps listeners discover new tracks based on past listening experience and what similar

13 мая 2025 г., 14:40:06 | Fast company - tech
California’s location data privacy bill aims to reshape digital consent

Amid the ongoing evolution of digital privacy laws, one California proposal is drawing heightened attention from legal scholars, technologists, and privacy advocates.

13 мая 2025 г., 12:30:04 | Fast company - tech
Apple’s App Store is getting ‘nutrition labels’ for accessibility

You can learn a lot about an app before you download it from Apple’s App Store, such as what other users think of it, the access it

13 мая 2025 г., 12:30:04 | Fast company - tech
Anaconda launches an AI platform to become the GitHub of enterprise open-source development

AI integration remains a top priority across enterprises worldwide, yet success remains elusive despite widespread enthusiasm and significant investment. An

13 мая 2025 г., 12:30:03 | Fast company - tech