Description

OpenAI Operator is a ChatGPT agent that can complete tasks in a web browser on your behalf. It “sees” pages via screenshots, then clicks, types, and scrolls to finish multi-step workflows.

What Operator can do

  • Open websites and interact with them without special integrations
  • Handle repetitive tasks like filling out forms, ordering groceries, or creating memes
  • Ask for your help when needed (sign-in, passwords, payment confirmation)

How it works

  • Built on the Computer-Using Agent (CUA) model, trained to operate common graphical interfaces
  • Combines GPT-4o's vision capabilities (reading screenshots) with reasoning trained through reinforcement learning to decide on the next action
  • Tries to recover from mistakes; if it can’t, it hands control back to you
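The loop described above (look at a screenshot, pick an action, act, retry on failure, hand back control if stuck) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not OpenAI's implementation: `FakeBrowser`, `Observation`, `Action`, `scripted_policy`, and `run_task` are hypothetical names, the "screenshot" is a simple step counter rather than pixels, and the policy is a fixed script standing in for the CUA model.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set


@dataclass
class Action:
    kind: str          # "click", "type", "scroll", or "done"
    target: str = ""   # hypothetical element label
    text: str = ""     # text to type, if any


@dataclass
class Observation:
    # Stand-in for a screenshot: a real agent would receive pixels.
    steps_done: int


@dataclass
class FakeBrowser:
    """Hypothetical browser that records actions instead of performing them."""
    broken_targets: Set[str] = field(default_factory=set)
    log: List[str] = field(default_factory=list)

    def screenshot(self) -> Observation:
        return Observation(steps_done=len(self.log))

    def execute(self, action: Action) -> bool:
        # Simulate a UI element the agent cannot operate.
        if action.target in self.broken_targets:
            return False
        self.log.append(f"{action.kind}:{action.target}")
        return True


Policy = Callable[[Observation], Action]


def scripted_policy(plan: List[Action]) -> Policy:
    """Fixed plan standing in for the model's decision-making."""
    def policy(obs: Observation) -> Action:
        if obs.steps_done >= len(plan):
            return Action("done")
        return plan[obs.steps_done]
    return policy


def run_task(browser: FakeBrowser, policy: Policy,
             max_steps: int = 20, max_failures: int = 3) -> str:
    failures = 0
    for _ in range(max_steps):
        obs = browser.screenshot()      # perceive
        action = policy(obs)            # decide
        if action.kind == "done":
            return "completed"
        if browser.execute(action):     # act
            failures = 0
        else:
            failures += 1               # retry the same step
            if failures >= max_failures:
                return "handed_back_to_user"
    return "handed_back_to_user"


plan = [
    Action("click", "search box"),
    Action("type", "search box", "milk"),
    Action("click", "add to cart"),
]
ok_browser = FakeBrowser()
ok_result = run_task(ok_browser, scripted_policy(plan))

stuck_browser = FakeBrowser(broken_targets={"add to cart"})
stuck_result = run_task(stuck_browser, scripted_policy(plan))
```

In the second run the simulated "add to cart" button never responds, so the loop gives up after three consecutive failures and returns control, mirroring the hand-back behavior in the last bullet.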

Availability, privacy, and limits

  • Launched in January 2025 as a Research Preview for US-based ChatGPT Pro users
  • For passwords and payment details, Operator yields control and does not retain what you enter
  • Requests confirmation before purchases or sending emails, and refuses certain high-risk requests (for example, banking actions or employment decisions)
  • Can struggle with complex UI elements (slideshows, calendars) and sites heavy on pop-ups or CAPTCHAs
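The safeguards listed above amount to a gate that runs before each sensitive step. The sketch below is one hypothetical way to express that policy. The category names and the `gate` function are assumptions made for illustration, and Operator's real checks are not public in this form.

```python
from enum import Enum


class Decision(Enum):
    PROCEED = "proceed"
    ASK_CONFIRMATION = "ask_confirmation"   # user must approve first
    HAND_OVER = "hand_over"                 # user performs the step directly
    REFUSE = "refuse"


# Hypothetical action categories, grouped as the bullets above describe.
REFUSED = {"bank_transfer", "hiring_decision"}
HAND_OVER = {"enter_password", "enter_payment_details"}
CONFIRM = {"purchase", "send_email"}


def gate(action_type: str) -> Decision:
    """Map an action category to how the agent should treat it."""
    if action_type in REFUSED:
        return Decision.REFUSE
    if action_type in HAND_OVER:
        return Decision.HAND_OVER
    if action_type in CONFIRM:
        return Decision.ASK_CONFIRMATION
    return Decision.PROCEED
```

For example, `gate("purchase")` returns `Decision.ASK_CONFIRMATION`, while an ordinary step such as `gate("scroll")` falls through to `Decision.PROCEED`.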

OpenAI plans to make this “computer-using” agent available to developers for use in their own services over time, and to extend availability to other ChatGPT plans later.
