Anthropic’s Claude 3 model outperforms GPT-4 and Gemini Ultra in many tests

Anthropic announced on Monday a new family of AI models, collectively called the Claude 3 model family. As is commonly done, the company released three different sizes of models, each with a varying balance of intelligence, speed, and cost.

The largest of the new models, called “Opus,” outperforms both OpenAI’s and Google’s most advanced models, GPT-4 and Gemini Ultra, respectively, on tests measuring undergraduate level expert knowledge (MMLU), graduate level expert reasoning (GPQA) as well as basic mathematics (GSM8k), Anthropic says.

The middle child in the family, Claude 3 “Sonnet,” is twice as fast as Anthropic’s previous best model, Claude 2.1, and with higher intelligence. Anthropic says Sonnet excels at intelligent tasks demanding rapid responses, like knowledge retrieval or sales automation.

The smallest model, called “Haiku,” beats other comparably sized models in performance, speed and cost, the company says. It can read a dense research paper of roughly 7,500 words with charts and graphs in less than three seconds.

All three models can process visual imagery, which enables them to understand uploaded documents, analyze web interfaces, and generate image catalog metadata. Anthropic says that for many of its enterprise customers, up to half of their knowledgebases consist of documents in image formats such as PDFs, flowcharts, or slides.

The Opus and Sonnet models are available today, while the Haiku model will be available soon.

https://www.fastcompany.com/91046925/anthropic-claude-3-models?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Létrehozva 1y | 2024. márc. 4. 21:10:09

Jelentkezéshez jelentkezzen be

EGYÉB POSTS Ebben a csoportban

The CEO of Ciena on how AI is fueling a global subsea cable boom

Under the ocean’s surface lies the true backbone of the internet: an estimated

2025. júl. 15. 18:50:04 | Fast company - tech

AI therapy chatbots are unsafe and stigmatizing, a new Stanford study finds

AI chatbot therapists have made plenty of headlines in recent months—s

2025. júl. 15. 18:50:03 | Fast company - tech

Elon Musk’s chatbot Grok searches for his views before answering questions

The latest version of Elon Musk’s artificial intelligence chatbot Grok is echoing the views of its

2025. júl. 15. 16:30:06 | Fast company - tech

How this Florida county is using new 911 technology to save lives

When an emergency happens in Collier County, Florida, the

2025. júl. 15. 16:30:05 | Fast company - tech

How a ‘Shark Tank’-winning neuroscientist invented the bionic hand that stole the show at Comic-Con

A gleaming Belle from Beauty and the Beast glided along the exhibition floor at last year’s San Diego Comic-Con adorned in a yellow corseted gown with cascading satin folds. She could bare

2025. júl. 15. 14:20:03 | Fast company - tech

Why 1995 was the year the internet grew up

The internet wasn’t born whole—it came together from parts. Most know of ARPANET, the internet’s most famous precursor, but it was always limited strictly to government use. It was NSFNET that bro

2025. júl. 15. 11:50:03 | Fast company - tech

What is quantum computing? Here’s everything you need to know right now

Computing revolutions are surprisingly rare. Despite the extraordinary technological progress that separates the first general-purpose digital computer—1945’s

2025. júl. 15. 9:30:04 | Fast company - tech

Tomas_r2