Anthropic’s Claude 3 model outperforms GPT-4 and Gemini Ultra in many tests

Anthropic announced on Monday a new family of AI models, collectively called the Claude 3 model family. As is commonly done, the company released three different sizes of models, each with a varying balance of intelligence, speed, and cost.

The largest of the new models, called “Opus,” outperforms both OpenAI’s and Google’s most advanced models, GPT-4 and Gemini Ultra, respectively, on tests measuring undergraduate level expert knowledge (MMLU), graduate level expert reasoning (GPQA) as well as basic mathematics (GSM8k), Anthropic says.

The middle child in the family, Claude 3 “Sonnet,” is twice as fast as Anthropic’s previous best model, Claude 2.1, and with higher intelligence. Anthropic says Sonnet excels at intelligent tasks demanding rapid responses, like knowledge retrieval or sales automation.

The smallest model, called “Haiku,” beats other comparably sized models in performance, speed and cost, the company says. It can read a dense research paper of roughly 7,500 words with charts and graphs in less than three seconds.

All three models can process visual imagery, which enables them to understand uploaded documents, analyze web interfaces, and generate image catalog metadata. Anthropic says that for many of its enterprise customers, up to half of their knowledgebases consist of documents in image formats such as PDFs, flowcharts, or slides.

The Opus and Sonnet models are available today, while the Haiku model will be available soon.

https://www.fastcompany.com/91046925/anthropic-claude-3-models?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Creato 1y | 4 mar 2024, 21:10:09


Accedi per aggiungere un commento

Altri post in questo gruppo

This IBM ThinkPad was astounding in 1995—and still is

Closed, it looks pretty much like any other laptop manufactured in 1995.

To be sure, it’s more compact than most—making it, in the parlance of the day, a subnotebook. But it’s still comi

15 lug 2025, 07:20:02 | Fast company - tech
This IBM ThinkPad was astounding in 1995—and still is

Closed, it looks pretty much like any other laptop manufactured in 1995.

To be sure, it’s more compact than most—making it, in the parlance of the day, a subnotebook. But it’s still comi

15 lug 2025, 04:50:04 | Fast company - tech
$100,000, 100 streamers: IShowSpeed and Jynxzi’s Fortnite tournament is already drawing excitement

IShowSpeed and Jynxzi are teaming up to host a $100,000 Fortnite tournament, bringing together 100 top creators for what’s shaping up to be the biggest celebrity Fortnite match to date.

14 lug 2025, 19:40:06 | Fast company - tech
Zuckerberg announces Meta’s new AI data centers for superintelligence

Mark Zuckerberg said on Monday that Meta Platforms would spend hundreds of billions of dollars to build several massive

14 lug 2025, 19:40:05 | Fast company - tech
Meta’s massive data center bet is a direct challenge to OpenAI and Alphabet

Meta may not currently lead the race for AI superintelligence, but it&

14 lug 2025, 19:40:04 | Fast company - tech
Antipasto-gate: How a $40 salad sparked viral small-town drama on TikTok

Southern small-town drama has made its way to TikTok. If you’re not familiar

14 lug 2025, 19:40:03 | Fast company - tech