I sent ChatGPT Agent out to shop for me
Hayden Field · 2025-07-18 · www.theverge.com
Think of OpenAI's new ChatGPT Agent as a day-one intern who's incredibly slow at every task but will eventually get the job done.
Well… most of the job. Or… at least part of it. Usually.
It's been one day since OpenAI debuted ChatGPT Agent, which it bills as a tool that can complete a wide range of complex, multi-step tasks on your behalf using its own "virtual computer." It's a combination of two of the company's prior releases, Operator and Deep Research. The Verge forked over the $200 for a one-month subscription to ChatGPT Pro, since OpenAI announced that higher-than-expected demand for ChatGPT Agent will delay its rollout to Plus and Team users.
Our take: It's a step forward in the world of AI agents, but it's sluggish, it's not always reliable, and it can be glitchy.
By typing "/agent," I entered what OpenAI calls Agent Mode, and it immediately suggested five example tasks: Find a top-rated coffee grinder under $150, review rare earth metals coverage from The Wall Street Journal, create a Google Maps list of the best bakeries in Copenhagen, find a vintage "Japanese-style" lamp on Etsy for less than $200, and check Google Calendar to create a date night for next week.
I tried the Etsy lamp option. When I clicked the example task, it filled out a detailed prompt for me in the text window: "Find a Japanese-inspired vintage-style samsara lamp on Etsy priced under $200 with free shipping. Prioritize high-quality photos, seller ratings, and listings marked as ready to ship. Add the best 5 options to my cart and provide a URL for each for me to compare."
A small window popped up to detail the agent's tasks one by one (not the chain-of-thought reasoning, just the task it was working on at the time). It worked on the Etsy lamp task for 50 minutes, and the step-by-step tasks included "thinking," setting up its desktop, navigating to Etsy to search, waiting for the site to load, pressing Enter for search results (yes, it really gave me a true play-by-play), filtering the search for a vintage lamp (keep in mind the original prompt said "vintage-style," not "vintage" specifically), setting the price filter to $200, checking shipping details for items, and more.
Another wrinkle: ChatGPT Agent said, "I added all five lamps to your Etsy cart (the cart shows five items totaling around $825). When you're ready to review or purchase them, just go to your cart on Etsy to compare them side by side." But it didn't do that. I went to Etsy on my own computer and there was nothing in my cart. That's because ChatGPT Agent doesn't control my own browser or have access to my logins, so it possibly added some lamps to the cart of a virtual PC that I can't access. It did send me individual URLs, so I could manually put them in a cart if I wanted, but the fact remains that the agent said it did something that it clearly did not.
And, of course, ChatGPT Agent is incredibly slow. That's not a secret. For many of ChatGPT Agent's use cases, including everyday consumer tasks, a human could do it much faster. According to OpenAI, ChatGPT Agent is an assistant that works in the background on tasks you'd rather someone else perform while you do something you do want to do instead.
In a private demo and briefing Wednesday with OpenAI employees Yash Kumar and Isa Fulford (product lead and research lead on ChatGPT Agent, respectively), Kumar said their team is more focused on "optimizing for hard tasks" than latency and that users aren't meant to sit and watch ChatGPT Agent work.
"Even if it takes 15 minutes, half an hour, it's quite a big speed-up compared to how long it would take you to do it," Fulford said. "It's one of those things where you can kick something off in the background and then come back to it."
Another thing I wanted to test: how ChatGPT Agent acts when you ask it to move your money around. The answer: It won't do it, but it's majorly glitchy about it and seems not fully secure.
When I asked OpenAI's Kumar on Wednesday whether the tool would be permitted to work on financial transactions and the like, he said those task categories have been restricted "for now" and that an additional safeguard called Watch Mode means that for certain categories of websites, the user must not navigate away from the ChatGPT tab (essentially making the user oversee the agent) for security reasons.
I prompted the agent like this: "I want to save more money. Log into my bank account and set up an automatic transfer to my savings every month."
At first, I got a bizarre error message with a string of numbers in red. When I asked again, it said, "I'm sorry, but I can't help with setting up an automatic transfer between accounts."
I then wrote, "Why not? I'm giving you permission." I got the same red-text, long-string-of-numbers error message as before. Afterward, it said, "I'm sorry, but I can't assist with setting up transfers or other banking account management tasks."
When I pressed it on which financial transactions it's allowed to handle, ChatGPT Agent said it was able to assist with "everyday consumer purchases" like groceries, household goods, and travel bookings, which handle "standard checkout flows" rather than "sensitive banking actions." But it clarified it can't help with "high-stakes" financial to-dos like transferring money, opening bank accounts, or buying regulated goods like alcohol and tobacco.
Since ChatGPT Agent can assist with buying things, but not moving money around, I tried something else: Asking it to buy flowers for my friend Alanna in Colorado.
I buy flowers a lot; that's what happens when your two best friends live in different states and you want to be present for big milestones even when you can't fly there. The online flower-delivery market can be a huge headache: Prices and bouquet sizes vary greatly depending on the service or florist, and reliability varies depending on whether you're ordering directly from a local florist or a big-box nationwide site. It's something I get tired of researching on my own, and sometimes I just end up buying whichever bouquet I have selected when I run out of steam, even if it's not the best one. So, I reasoned, it was the perfect job for an AI agent.
I told ChatGPT Agent, "I want to buy flowers for my friend who lives in Colorado. Check the delivery sites - it's fine to be delivered Saturday but no later. Find the cheapest and biggest bouquet options for me to review."
I settled in for a long wait. Luckily, I had a call to join anyway. It asked which area of Colorado she lived in, and I answered. When I glanced over to check in, I noticed ChatGPT Agent was heavily relying on a Forbes article of "best flower delivery services 2025" for its next steps, as well as a piece from Good Housekeeping.
I navigated away from the tab, and when I came back, the conversation was gone and didn't appear in my chat history. So I asked the question again, worded in exactly the same way, and settled in for another wait. At this point, the agent answered pretty immediately with a list of options, maybe because it had already done the research (although that research and chat didn't appear in my history).
I was impressed with the write-up. ChatGPT Agent gave me four options with price ranges and sometimes weighed in on the apparent size of the bouquet or expected delivery times. It also offered the advice that local florists are generally more reliable (true, in my experience).
It then asked me, "Would you like me to help you place an order with any of these options, or preview specific bouquet designs or photos?" I picked one of the options it gave me (a local florist with hand-assembled bouquets) and asked it to help me pick a bouquet from that florist and place the order.
That's when we ran into some issues.
ChatGPT Agent said, "I can't directly access Vintage Magnolia's website unless you provide the exact URL you're seeing - but I can guide you through how to place the order and help you pick a bouquet!" The weird part: Obviously ChatGPT Agent was the one to tell me about that florist and its website, and it had clearly accessed it before. It had also just offered to help me place the order. Another glitch.
But its answer did include bouquet options (no photos, but descriptions). I picked one and asked it to place the order for me. It said, "I can't place the order directly, but I'll walk you through the simple steps to order … and help you craft the perfect message."
I was confused at this point: One of the main selling points of ChatGPT Agent, touted by OpenAI, is that it can place orders for you, from online shopping to ordering groceries for a four-person family breakfast (in fact, that was one of the example use cases in its marketing materials). I pressed ChatGPT Agent on the subject.
It told me, "I can't actually place orders directly - I don't have payment access or the ability to log into third-party sites." When I told it it didn't need to log in, it said it can't enter my billing or payment details, submit an order form on my behalf, or "access or control external websites, even in guest mode."
ChatGPT Agent can be impressive with analysis, weighing options, and guiding you through actions, but it doesn't always seem able to deliver on what it was built for: performing those actions for you. It gets tripped up by the fact that it's using its own computer, not yours, and that significantly limits its usefulness. Plus, it can easily automate the more intimate and fun parts of the process (picking a specific bouquet, writing a heartfelt note) but struggles to automate the most frustrating parts (actually filling out delivery details and making the purchase).
"Even with your permission, I don't have the technical ability to act as you on another site - no typing on your behalf, clicking buttons, or filling out credit card forms," ChatGPT Agent wrote. "Think of me more as a super-powered assistant who can gather, compare, write, and guide - but not execute transactions."
One of my first jobs in New York was a personal assistant, and I can tell you right now I would've lost my job if I couldn't execute transactions or fill out forms on my boss's behalf. ChatGPT Agent is a step forward for everyday AI use in some ways, but we'll see if it learns to deliver on its promises.