OpenAI's experimental model achieved gold at the International Math Olympiad

OpenAI has achieved "gold medal-level performance" at the International Math Olympiad, notching another important milestone in AI's fast-paced growth. Alexander Wei, a research scientist at OpenAI working on LLMs and reasoning, posted on X that an experimental research model delivered on this "longstanding grand challenge in AI."

According to Wei, an unreleased model from OpenAI solved five of the six problems at one of the world's longest-running and most prestigious math competitions, earning 35 of a possible 42 points. The International Math Olympiad (IMO) sees countries send up to six students each to solve extremely difficult pre-university math problems in areas such as algebra, combinatorics, geometry and number theory. The problems can look simple, but they usually demand creativity to earn full marks. At this year's competition, only 67 of the 630 contestants received gold medals, or roughly 11 percent.

AI is often tasked with tackling complex datasets and repetitive actions, but it usually falls short when it comes to solving problems that require more creativity or complex decision-making. However, with the latest IMO competition, OpenAI says its model was able to handle complicated math problems with human-like reasoning.

"By doing so, we've obtained a model that can craft intricate, watertight arguments at the level of human mathematicians," Wei wrote on X. Wei and Sam Altman, CEO of OpenAI, both added that the company doesn't expect to release anything with this level of math capability for several months. That means the upcoming GPT-5 will likely be an improvement over its predecessor, but it won't have the capability that let the experimental model compete at the IMO.

This article originally appeared on Engadget at https://www.engadget.com/ai/openais-experimental-model-achieved-gold-at-the-international-math-olympiad-182719801.html?src=rss
Posted Jul 19, 2025, 20:40:04

