Anthropic’s Claude 3 model outperforms GPT-4 and Gemini Ultra in many tests

Anthropic announced on Monday a new family of AI models, collectively called the Claude 3 model family. As is commonly done, the company released three different sizes of models, each with a varying balance of intelligence, speed, and cost.

The largest of the new models, called “Opus,” outperforms both OpenAI’s and Google’s most advanced models, GPT-4 and Gemini Ultra, respectively, on tests measuring undergraduate level expert knowledge (MMLU), graduate level expert reasoning (GPQA) as well as basic mathematics (GSM8k), Anthropic says.

The middle child in the family, Claude 3 “Sonnet,” is twice as fast as Anthropic’s previous best model, Claude 2.1, and with higher intelligence. Anthropic says Sonnet excels at intelligent tasks demanding rapid responses, like knowledge retrieval or sales automation.

The smallest model, called “Haiku,” beats other comparably sized models in performance, speed and cost, the company says. It can read a dense research paper of roughly 7,500 words with charts and graphs in less than three seconds.

All three models can process visual imagery, which enables them to understand uploaded documents, analyze web interfaces, and generate image catalog metadata. Anthropic says that for many of its enterprise customers, up to half of their knowledgebases consist of documents in image formats such as PDFs, flowcharts, or slides.

The Opus and Sonnet models are available today, while the Haiku model will be available soon.

https://www.fastcompany.com/91046925/anthropic-claude-3-models?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

созданный 1y | 4 мар. 2024 г., 21:10:09


Войдите, чтобы добавить комментарий

Другие сообщения в этой группе

Why the AI pin won’t be the next iPhone

One of the most frequent questions I’ve been getting from business execs lately is whether the

12 июл. 2025 г., 12:10:02 | Fast company - tech
Microsoft will soon delete your Authenticator passwords. Here are 3 password manager alternatives

Users of Microsoft apps are having a rough year. First, in May, the Windows maker

12 июл. 2025 г., 09:40:03 | Fast company - tech
Yahoo Creators platform hits record revenue as publisher bets big on influencer-led content

Yahoo’s bet on creator-led content appears to be paying off. Yahoo Creators, the media company’s publishing platform for creators, had its most lucrative month yet in June.

Launched in M

11 июл. 2025 г., 17:30:04 | Fast company - tech
GameStop’s Nintendo Switch 2 stapler sells for more than $100,000 on eBay after viral mishap

From being the face of memestock mania to going viral for inadvertently stapling the screens of brand-new video game consoles, GameStop is no stranger to infamy.

Last month, during the m

11 июл. 2025 г., 12:50:04 | Fast company - tech
Don’t take the race for ‘superintelligence’ too seriously

The technology industry has always adored its improbably audacious goals and their associated buzzwords. Meta CEO Mark Zuckerberg is among the most enamored. After all, the name “Meta” is the resi

11 июл. 2025 г., 12:50:02 | Fast company - tech
Why AI-powered hiring may create legal headaches

Even as AI becomes a common workplace tool, its use in

11 июл. 2025 г., 12:50:02 | Fast company - tech
Gen Zers are posting their unemployment era on TikTok—and it’s way too real

Finding a job is hard right now. To cope, Gen Zers are documenting the reality of unemployment in 2025.

“You look sadder,” one TikTok po

11 июл. 2025 г., 10:30:04 | Fast company - tech