OpenAI's experimental model achieved gold at the International Math Olympiad

OpenAI has achieved "gold medal-level performance" at the International Math Olympiad, notching another important milestone for AI's fast-paced growth. Alexander Wei, a research scientist at OpenAI working on LLMs and reasoning, posted on X that an experimental research model delivered on this "longstanding grand challenge in AI."

According to Wei, an unreleased OpenAI model solved five of the six problems at one of the world's oldest and most prestigious math competitions, earning 35 of a possible 42 points (each problem is worth seven points). At the International Math Olympiad (IMO), countries send up to six students to solve extremely difficult problems in algebra, combinatorics, geometry and number theory. These exercises are seemingly simple but usually require some creativity to earn full marks. For this year's competition, only 67 of the 630 total contestants received gold medals, or roughly 10 percent.

AI is often tasked with tackling complex datasets and repetitive actions, but it usually falls short on problems that require more creativity or complex decision-making. With this year's IMO problems, however, OpenAI says its model was able to handle complicated math with human-like reasoning.

"By doing so, we've obtained a model that can craft intricate, watertight arguments at the level of human mathematicians," Wei wrote on X. Wei and Sam Altman, CEO of OpenAI, both added that the company doesn't expect to release anything with this level of math capability for several months. That means the upcoming GPT-5 will likely be an improvement over its predecessor, but it won't have the same ability to perform at IMO gold-medal level.

This article originally appeared on Engadget at https://www.engadget.com/ai/openais-experimental-model-achieved-gold-at-the-international-math-olympiad-182719801.html?src=rss
Posted 19 Jul 2025, 20:40:04

