It's OpenAI's new AI agent that autonomously takes action across the web on your behalf.
The 9 most impressive use cases I’ve tried (videos sped up):
1. Ordering dinner ingredients based on a picture and a recipe
2. Planning a weekend trip based on hidden gems off Reddit, my budget and interests
Notice how at 0:06, ChatGPT Operator was blocked from Reddit but then decided to just do a Bing search with "Reddit" at the end
Very impressive decision-making
3. Crypto investment research based on tokens that are actually worth looking into
Notice how ChatGPT Operator got hit with an "Are you human" CAPTCHA, then pinged me to take control to confirm
Wild workaround
4. Booking a one-way flight from Zurich to Vienna using the Booking integration
This one required a bit of back and forth: ChatGPT Operator pinged me to ask for my flight preference, then had me take control to enter payment details
5. Scheduling an appointment with my barber after looking at my Google Calendar schedule/availability
Note that in this demo, ChatGPT Operator pinged me that I needed to sign in to Google to check my calendar
I tried a second time, and my login was saved session-to-session
6. Researching a good birthday gift for my mom based on what she likes
Similar to the Reddit block, ChatGPT Operator couldn't access NYTimes, so it pivoted and found another site.
Really neat.
Also cool to see it compare and find the best price across the web for me, too
7. Booking a one-time house cleaner for my home through the Thumbtack integration based on my budget
ChatGPT Operator came back to me with four highly rated options within my price range
8. Finding the best/cheapest health insurance coverage in Switzerland
This was interesting since most prices are not publicly available and are gated behind a meeting
ChatGPT Operator did what it could, and pointed me to a good blog for further reading
9. Finding a top-rated dog walker in Vancouver BC
This is no easy task, so I wanted to test how well ChatGPT Operator could handle it
To my surprise, I got 3 really solid options at the end
Overall, I was very impressed by the research preview of Operator.
I loved that it can do tasks for me as I do other work, and simply ping me when it needs me to "take over"
I also really enjoyed the saved tasks tab, and adding Custom Instructions for specific websites.
But it's important to note that Operator is still a research preview and is improving.
I found that:
-Quite a few sites were blocked after they detected the AI
-There's a limited set of partner integrations
-Its true purpose is to take actions across the web (more below)
Operator *operates* within ChatGPT, but it's a completely different tool.
Its output lengths are small, and its true purpose is to take actions across the web (typing, clicking, scrolling).
Meaning it's not like ChatGPT, which can produce essays and write long code
With every new tool comes a new way of using it optimally.
E.g. with GPT-4, CoT prompting produced the best results, but the best way to prompt o1 is completely different.
The exact same thing is happening here with Operator, and I'm 100% just scratching the surface with these tests.
The future of tech work is here. And personally, I'm incredibly excited about it.
Agents can do the boring work, so I can spend more time doing what I love.
I'll be publicly sharing all the ways I automate my work with agents, so follow me @rowancheung for more.
Lastly, big thanks to @OpenAI for granting me early access. I had a ton of fun early testing Operator.
If you want to support my work, like/retweet the first tweet of this thread to share with friends:
AI NEWS: Meta just teamed up with Oakley to expand its family of AI smart glasses.
Plus, more news from Mistral, Moonshot AI, MiniMax, Anthropic, and xAI.
Here's everything you need to know:
Meta launched new performance-focused AI smart glasses with Oakley. Key features include:
—Upgraded camera with up to 3K videos
—2x battery, up to 8 hours of usage & 19 hrs on standby
—PRIZM Lens tech to amplify contrast across changing light and weather
Mistral released Mistral Small 3.2-24B, a minor update to the Small 3.1 model, with:
—Enhanced instruction following capabilities
—More robust function calling
—Fewer infinite generations or repetitive answers
AI NEWS: Sam Altman just revealed Meta is offering "$100M signing bonuses" to poach talent from OpenAI.
Plus, more news from Google, Baidu, MiniMax, Proactor, Krea, and Intelligent Internet.
Here's everything you need to know:
Sam Altman cooked Meta on his brother's podcast. Key takeaways:
— Meta’s offering $100M signing bonuses to poach OAI talent
— None of OAI’s best have taken the offers
— OAI has a better shot at AGI, will eventually be more valuable
— Meta isn't great at innovation
Google just launched Gemini 2.5 Pro and Flash models in general availability
The company also started rolling out a hyper-efficient Flash-Lite model (in preview)
All three feature adjustable "thinking" with Flash-Lite defaulting to thinking off for max speed
AI NEWS: China's Tencent just dropped an AI that generates 3D assets with cinematic quality.
Plus, more news from OpenAI, ElevenLabs, Flowstep, The Alan Turing Institute, and Beijing Academy of Artificial Intelligence.
Here's everything you need to know:
Tencent released Hunyuan 3D 2.1, an open-source model for generating 3D assets, including PBR materials
It can synthesize objects with cinematic textures and realism while covering their light interactions
Fully open-source with model weights and code
OpenAI dropped a few notable ChatGPT updates, including:
—Downloads for Canvas docs and code, including as PDF, docx, or markdown
—A new Projects experience with deep research, voice mode, file uploads via mobile, and improved memory to reference chats
AI NEWS: Apple kicked off WWDC 2025 with a major UI revamp — but was pretty light on Apple Intelligence
Plus, more news from Google DeepMind, OpenAI, Microsoft, Manus, and Skywork AI.
Here's everything you need to know:
At WWDC 2025, Apple showed off only a handful of AI upgrades, including:
—New Live translation for FaceTime, Messages, and calls
—Visual intelligence via screenshots
—AI-powered intelligent actions in Shortcuts
—AI "Workout Buddy" on Apple Watch
Google DeepMind and the UK govt just dropped "Extract," a Gemini-powered tool to expedite infrastructure and housing decisions
It uses multimodal reasoning to turn complex planning docs – including handwritten notes – into digital data in just 40 seconds