OpenAI on Thursday announced a research preview of Operator, an AI agent that can browse the web and perform tasks for the user. Operator is powered by the Computer-Using Agent (CUA), an AI model that combines GPT-4o’s vision capabilities with reasoning abilities.
OpenAI trained CUA to let Operator complete digital tasks by interacting with the buttons, menus, and text fields within the graphical user interfaces of the user’s computer and the websites they visit. Add the reasoning and self-checking capabilities seen in OpenAI’s o1 model, and Operator can break down tasks into steps and adaptively self-correct when it runs into problems.
Operator is OpenAI’s answer to Anthropic’s computer use feature, unveiled last October, and marks a step toward generative AI models gaining more autonomy and the ability to control outside tools.
OpenAI says the tool is still a work in progress, but it has already set records on several benchmarks that measure success with computer-based and web-based tasks.
The tool is available as a “research preview” only to subscribers to OpenAI’s “Pro” tier, which costs $200 a month. The company intends to roll out Operator to its Plus, Team, and Enterprise subscribers, and eventually build the features into ChatGPT.
OpenAI told TechCrunch that it’s working with companies including DoorDash and Instacart to make sure Operator doesn’t breach any terms of service agreements.
“The CUA model is trained to ask for user confirmation before finalizing tasks with external side effects; for example, before submitting an order, sending an email, etc.,” OpenAI’s blog post explains, “so that the user can double-check the model’s work before it becomes permanent.”