OpenAI’s new Operator is a step into AI’s agentic future

OpenAI announced on Thursday a research preview of Operator, an AI agent that can browse the web and perform tasks for the user. Operator is powered by the Computer-Using Agent (CUA), an AI model that combines GPT-4o’s vision capabilities with advanced reasoning.

OpenAI trained CUA to let Operator complete digital tasks by interacting with the buttons, menus, and text fields within the graphical user interfaces of the user’s computer and the websites they visit. Add the reasoning and self-checking capabilities seen in OpenAI’s o1 model, and Operator can break down tasks into steps and adaptively self-correct when it runs into problems. 

Operator is OpenAI’s answer to Anthropic’s computer use model, which was unveiled last October and marks a step toward generative AI models gaining more autonomy and the ability to control outside tools.

OpenAI says the tool is still a work in progress, but that it has already set records in a number of benchmark tests that measure success with computer-based and web-based tasks. 

The tool is available as a “research preview” only to subscribers to OpenAI’s “Pro” tier, which costs $200 a month. The company intends to roll out Operator to its Plus, Team, and Enterprise subscribers, and eventually build the features into ChatGPT. OpenAI told TechCrunch that it’s working with companies including DoorDash and Instacart to make sure Operator doesn’t breach any terms of service agreements.

“The CUA model is trained to ask for user confirmation before finalizing tasks with external side effects; for example, before submitting an order, sending an email, etc.,” OpenAI’s blog post explains, “so that the user can double-check the model’s work before it becomes permanent.”
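
For readers curious what "ask before acting" can look like in software, here is a minimal sketch of the general pattern. It is not OpenAI's implementation: according to the blog post, the behavior is trained into the CUA model itself, whereas this example uses an explicit check in code. The names `Action`, `has_side_effects`, `run_actions`, and `confirm` are all hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Action:
    """One step an agent wants to take (hypothetical structure)."""
    description: str
    has_side_effects: bool  # True for steps like placing an order or sending email


def run_actions(actions: list[Action], confirm: Callable[[Action], bool]) -> list[str]:
    """Run each action, pausing for user confirmation before side effects.

    `confirm` stands in for whatever prompt the user would see; it returns
    True to approve the action and False to decline it.
    """
    log = []
    for action in actions:
        # Only actions flagged as having external side effects need approval.
        if action.has_side_effects and not confirm(action):
            log.append(f"skipped: {action.description}")
            continue
        # A real agent would click, type, or submit here; this sketch only records it.
        log.append(f"done: {action.description}")
    return log
```

In this sketch, a step such as adding an item to a cart would run without interruption, while a step flagged as having side effects, such as submitting the order, would go ahead only if the `confirm` callback approves it.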

https://www.fastcompany.com/91266338/openais-new-operator-is-a-step-into-ais-agentic-future?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Created: Jan. 23, 2025, 21:10:02

