Anthropic’s Claude Opus 4 model can work autonomously for nearly a full workday

Anthropic kicked off its first-ever Code with Claude conference today with the announcement of a new frontier AI system. The company is calling Claude Opus 4 the best coding model in the world. According to Anthropic, Opus 4 is dramatically better at tasks that require it to complete thousands of separate steps, giving it the ability to work continuously for several hours in one go. Additionally, the new model can use multiple software tools in parallel, and it follows instructions more precisely.

In combination, Anthropic says those capabilities make Opus 4 ideal for powering upcoming AI agents. For the unfamiliar, agentic systems are AIs designed to plan and carry out complicated tasks without human supervision. They represent an important step towards the promise of artificial general intelligence (AGI). In customer testing, Anthropic saw Opus 4 work on its own for seven hours, or nearly a full workday. That's an important milestone for the type of agentic systems the company wants to build.

Image: Claude Plays Pokemon (Credit: Anthropic)

Another reason Anthropic thinks Opus 4 is ready to enable the creation of better AI agents is that the model is 65 percent less likely to use a shortcut or loophole when completing tasks. The company says the system also demonstrates significantly better "memory capabilities," particularly when developers grant Claude local file access. To encourage devs to try Opus 4, Anthropic is making Claude Code, its AI coding agent, widely available. It has also added new integrations with Visual Studio Code and JetBrains.

Even if you're not a coder, Anthropic might have something for you. That's because alongside Opus 4, the company announced a new version of its Sonnet model. Like Opus 4 and Claude 3.7 Sonnet before it, the new system is a hybrid reasoning model, meaning it can both execute prompts nearly instantaneously and engage in extended thinking. As a user, this gives you a best-of-both-worlds chatbot that's better equipped to tackle complex problems when needed. It also incorporates many of the same improvements found in Opus 4, including the ability to use tools in parallel and follow instructions more faithfully.

Sonnet 3.7 was so popular among users that Anthropic ended up introducing a Max plan in response, which starts at $100 per month. The good news is you won't need to pay anywhere near that much to use Sonnet 4, as Anthropic is making it available to free users.

Image: Claude 4 benchmarks (Credit: Anthropic)

For those who want to use Sonnet 4 for a project, API pricing is staying at $3 per one million input tokens and $15 per one million output tokens. Notably, beyond the usual places you'll find Anthropic's models, including Amazon Bedrock and Google Vertex AI, Microsoft is making Sonnet 4 the default model for the new coding agent it's offering through GitHub Copilot. Both Opus 4 and Sonnet 4 are available to use today.
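
To put those rates in concrete terms, here is a rough Python sketch of how the per-token math works out for a single request. The only figures taken from Anthropic's announcement are the $3 and $15 rates; the function name and the token counts are hypothetical, and the estimate ignores any discounts or billing details not covered here.

```python
# Sonnet 4 API rates quoted above, in USD per one million tokens.
INPUT_USD_PER_MILLION = 3.00
OUTPUT_USD_PER_MILLION = 15.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the charge in USD for one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each token count to millions, then apply the matching rate.
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


# Example with made-up token counts: 200,000 in, 50,000 out.
print(f"${estimate_cost_usd(200_000, 50_000):.2f}")
```

Under those assumptions, a request with 200,000 input tokens and 50,000 output tokens would come to roughly $1.35, with more than half of that driven by the pricier output tokens.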

Today's announcement comes during what's already been a busy week in the AI industry. On Tuesday, Google kicked off its I/O 2025 conference, announcing, among other things, that it was rolling out AI Mode to all Search users in the US. A day later, OpenAI said it was spending $6.5 billion to buy Jony Ive’s hardware startup.

This article originally appeared on Engadget at https://www.engadget.com/ai/anthropics-claude-opus-4-model-can-work-autonomously-for-nearly-a-full-workday-164526696.html?src=rss
Created 22.05.2025, 17:10:25

