OpenAI's new o3 and o4-mini models are all about 'thinking with images'

A mere two days after announcing GPT-4.1, OpenAI is releasing not one but two new models. The company today announced the public availability of o3 and o4-mini. Of the former, OpenAI says o3 is its most advanced reasoning model yet, with it showing "strong performance" in coding, math and science tasks. As for o4-mini, OpenAI is billing it as a lower cost alternative that still delivers "impressive results" across those same fields.

More notably, both models offer novel capabilities not found in OpenAI's past systems. For first time, the company's reasoning models can use and combine all of the tools available in ChatGPT, including web browsing and image generation. The company says this capability allows o3 and o4-mini solve challenging, multi-step problems more effectively, and "take real steps toward acting independently." 

At the same time, o3 and o4-mini can not just see images, but also interpret and "think" about them in a way that significantly extends their visual processing capabilities. For instance, you can upload images of whiteboards, diagrams or sketches — even poor quality ones — and the new models will understand them. They can also adjust the images as part of how they reason.  

"The combined power of state-of-the-art reasoning with full tool access translates into significantly stronger performance across academic benchmarks and real-world tasks, setting a new standard in both intelligence and usefulness," says OpenAI. 

Separately, OpenAI is releasing a new coding agent (à la Claude Code) named Codex CLI. It's designed to give developers a minimal interface they can use to link OpenAI's models with their local code. Out of the box, it works with o3 and o4-mini, with support for GPT-4.1 on the way. 

Today's announcement comes after OpenAI CEO Sam Altman said the company was changing course on the roadmap he detailed in February. At the time, Altman indicated OpenAI would not release o3, which the company first previewed late last year, as a standalone product. However, at the start of April, he announced a "change of plans," noting OpenAI was moving forward with the release of o3 and o4-mini.  

"There are a bunch of reasons for this, but the most exciting one is that we are going to be able to make GPT-5 much better than we originally though," he wrote on X. "We also found it harder than we thought it was going to be to smoothly integrate everything. and we want to make sure we have enough capacity to support what we expect to be unprecedented demand."

That means the streamlining Altman promised in February will likely need to wait until at least the release of GPT-5, which he said would arrive sometime in the next "few months." 

In the meantime, ChatGPT Plus, Pro and Team users can begin using o3 and o4-mini starting today. Sometime in the next few weeks, OpenAI will bring online o3-pro, an even more powerful version of its flagship reasoning model, and make it available to Pro subscribers. For the time being, those users can continue to use o1-pro. 

This article originally appeared on Engadget at https://www.engadget.com/ai/openais-new-o3-and-o4-mini-models-are-all-about-thinking-with-images-170043465.html?src=rss https://www.engadget.com/ai/openais-new-o3-and-o4-mini-models-are-all-about-thinking-with-images-170043465.html?src=rss
Created 3mo | Apr 16, 2025, 6:40:16 PM


Login to add comment

Other posts in this group

Opera takes its browser beef with Microsoft to Brazil in antitrust complaint

Opera is filing an antitrust complaint against Microsoft in Brazil,

Jul 29, 2025, 11:50:15 PM | Engadget
Home Depot has a new animatronic version of Skelly the skeleton

The Home Depot is well on its way to becoming a Spirit Halloween that also sells weed whackers. Here we are in July, and the retailer is already

Jul 29, 2025, 7:20:45 PM | Engadget
ChatGPT's Study Mode will guide students to an answer stey by step

OpenAI is rolling out a new Study Mode the company says is designed to give students a better understa

Jul 29, 2025, 7:20:42 PM | Engadget
Google adds Video Overviews to NotebookLM

NotebookLM, the Google research tool that gained notoriety for its

Jul 29, 2025, 7:20:40 PM | Engadget
YouTube is turning over age verification to AI

YouTube will start using machine learning to determine whether viewers should be on a teen account. The company

Jul 29, 2025, 7:20:39 PM | Engadget
Prime members can get the DJI Mini 4K drone on sale for $249

Amazon is selling the

Jul 29, 2025, 5:10:23 PM | Engadget
Our favorite Logitech mouse is $40 off right now

If you're in the market for a new mouse that won't totally break the bank then today is your lucky day. Right now,

Jul 29, 2025, 5:10:22 PM | Engadget