OpenAI’s new o1 models push AI to PhD-level intelligence

OpenAI introduced on Thursday OpenAI o1, a new series of large language models the company says are designed for solving difficult problems and working though complex tasks.

The models were trained to take longer to perform tasks than other AI models, thinking through problems in ways a human might. They can “refine their thinking process, try different strategies, and recognize their mistakes, OpenAI says in a press release. The models perform similarly to PhD students when working on physics, chemistry, and biology problems.

The o1 models scored 83% on a qualifying exam for the International Mathematics Olympiad, OpenAI says, while its earlier GPT-4o model correctly solved only 13% of problems.

OpenAI provided some specific use case examples. The o1 models could be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complicated mathematical formulas needed for quantum optics, and by developers to build and execute multi-step workflows. They also perform well in math and coding.

Within OpenAI the o1 models were first codenamed “Q*” (pronounced “Q-star”), then “Strawberry.”

OpenAI says it’s taking a slow and cautious approach to releasing the new models. It’s releasing a couple of “early previews” of two of the models in the series. People with ChatGPT Plus or Teams accounts can access “o1-preview” by choosing it in a drop down menu within the chatbot. They can also choose “o1-mini,” which is faster and good at STEM questions, OpenAI says.

Developers and researchers can access the models within ChatGPT and via an application programming interface.

OpenAI says the new models won’t initially be able to access the internet. Users won’t be able to upload images or files to the models. OpenAI says it’s beefed up the safety features around the models, and has informed federal authorities about the more capable models.

https://www.fastcompany.com/91189817/openais-new-o1-models-push-ai-to-phd-level-intelligence?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Creado 10mo | 12 sept 2024, 20:30:04

Inicia sesión para agregar comentarios

Otros mensajes en este grupo.

Genesys wants agentic AI to make customer service less robotic

When Tony Bates became chairman and CEO of Genesys in 2019, the company was already a global leader in contact center software. But Bates was determined

25 jun 2025, 14:20:04 | Fast company - tech

How a travel and expense platform is breaking ground on a zero-hallucinations AI workforce

AI hallucinations are one of users’ biggest concerns when utilizing larg

25 jun 2025, 11:50:06 | Fast company - tech

The AI baby boom is here. But can ChatGPT really raise a child?

Sam Altman is “extremely kid-pilled.”

The OpenAI CEO announced the birth of his son in February. Since then, Altman has employ

25 jun 2025, 11:50:05 | Fast company - tech

I’ve become an AI vibecoding convert

A few weeks ago, I finally paid for ChatGPT Plus.

It started with a simple goal: I wanted to create a personal archive of my published articles, but wasn’t sure how to begin. That led to

25 jun 2025, 9:40:03 | Fast company - tech

These are the top 10 emerging technologies of 2025, according to the World Economic Forum

Breakthroughs happen all the time in the tech world, but only a select few manage to make a lasting impact.

Predicting which innovations will shape the future is always a challenge. On T

25 jun 2025, 4:50:06 | Fast company - tech

Anthropic’s AI copyright ‘win’ is more complicated than it looks

Big tech scored a major victory this week in the battle over using copyrighted materials to train AI models. Anthropic

24 jun 2025, 19:40:06 | Fast company - tech

How Roblox handles millions of players on viral games like ‘Grow a Garden’

Just this past weekend, social and gaming platform Roblox saw a peak of 30.6 million concurrently active players, the

24 jun 2025, 17:30:02 | Fast company - tech

Tomas_r2