OpenAI’s new o1 models push AI to PhD-level intelligence

On Thursday, OpenAI introduced o1, a new series of large language models that the company says are designed for solving difficult problems and working through complex tasks.

The models were trained to take longer to perform tasks than other AI models, thinking through problems in ways a human might. They can “refine their thinking process, try different strategies, and recognize their mistakes,” OpenAI says in a press release. According to the company, the models perform similarly to PhD students when working on physics, chemistry, and biology problems.

The o1 models scored 83% on a qualifying exam for the International Mathematical Olympiad, OpenAI says, while its earlier GPT-4o model correctly solved only 13% of the problems.

OpenAI provided some specific use case examples. The o1 models could be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complicated mathematical formulas needed for quantum optics, and by developers to build and execute multi-step workflows. They also perform well in math and coding.

Within OpenAI, the o1 models were first codenamed “Q*” (pronounced “Q-star”), then “Strawberry.”

OpenAI says it’s taking a slow and cautious approach to releasing the new models. It’s releasing “early previews” of two of the models in the series. People with ChatGPT Plus or Team accounts can access “o1-preview” by choosing it in a drop-down menu within the chatbot. They can also choose “o1-mini,” which is faster and good at STEM questions, OpenAI says.

Developers and researchers can access the models within ChatGPT and via an application programming interface.

OpenAI says the new models won’t initially be able to access the internet. Users won’t be able to upload images or files to the models. OpenAI says it’s beefed up the safety features around the models, and has informed federal authorities about the more capable models.

https://www.fastcompany.com/91189817/openais-new-o1-models-push-ai-to-phd-level-intelligence?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Posted 10mo ago | Sept. 12, 2024, 20:30:04

