AI agents are science fiction not yet ready for primetime

That is The Stepback, a weekly publication breaking down one important story from the tech world. For extra on all issues AI, comply with Hayden Discipline. The Stepback arrives in our subscribers’ inboxes at 8AM ET. Decide in for The Stepback right here.

It began with J.A.R.V.I.S. Sure, that J.A.R.V.I.S. The one from the Marvel motion pictures.

Properly, perhaps it didn’t begin with Iron Man’s AI assistant, however the fictional system positively helped the idea of an AI agent alongside. Every time I’ve interviewed AI trade people about agentic AI, they typically level to J.A.R.V.I.S. for instance of the best AI device in some ways — one which is aware of what you want completed earlier than you even ask, can analyze and discover insights in massive swaths of knowledge, and may supply strategic recommendation or run level on sure elements of your online business. Folks generally disagree on the precise definition of an AI agent, however at its core, it’s a step past chatbots in that it’s a system that may carry out multistep, complicated duties in your behalf with out continually needing back-and-forth communication with you. It basically makes its personal to-do checklist of subtasks it wants to finish in an effort to get to your most popular finish objective. That fantasy is nearer to being a actuality in some ways, however in the case of precise usefulness for the on a regular basis person, there are a variety of issues that don’t work — and perhaps won’t ever work.

The time period “AI agent” has been round for a very long time, nevertheless it particularly began trending within the tech trade in 2023. That was the yr of the idea of AI brokers; the time period was on everybody’s lips as individuals tried to suss out the concept and make it a actuality, however you didn’t see many profitable use instances. The following yr, 2024, was the yr of deployment — individuals have been actually placing the code out into the sphere and seeing what it may do. (The reply, on the time, was… not a lot. And stuffed with a bunch of error messages.)

I can pinpoint the hype round AI brokers changing into widespread to at least one particular announcement: In February 2024, Klarna, a fintech firm, stated that after one month, its AI assistant (powered by OpenAI’s tech) had efficiently completed the work of 700 full-time customer support brokers and automatic two-thirds of the corporate’s customer support chats. For months, these statistics got here up in virtually each AI trade dialog I had.

The hype by no means died down, and within the following months, each Massive Tech CEO appeared to harp on the time period in each earnings name. Executives at Amazon, Meta, Google, Microsoft, and an entire host of different corporations started to speak about their dedication to constructing helpful and profitable AI brokers — and tried to place their cash the place their mouths are to make it occur.

The imaginative and prescient was that at some point, an AI agent may do all the things from e-book your journey to generate visuals for your online business shows. The perfect device may even, say, discover a good time and place to hang around with a bunch of your folks that works with all your calendars, meals preferences, and dietary restrictions — after which e-book the dinner reservation and create a calendar occasion for everybody.

Now let’s speak in regards to the “AI coding” of all of it: For years, AI coding has been carrying the agentic AI trade. When you requested anybody about real-life, profitable, not-annoying use instances for AI brokers occurring proper now and never conceptually in a not-too-distant future, they’d level to AI coding — and that was just about the one concrete factor they may level to. Many engineers use AI brokers for coding, they usually’re seen as objectively fairly good. Ok, the truth is, that at Microsoft and Google, as much as 30 p.c of the code is now being written by AI brokers. And for startups like OpenAI and Anthropic, which burn by money at excessive charges, one in all their largest income turbines is AI coding instruments for enterprise shoppers.

So till lately, AI coding has been the primary real-life use case of AI brokers, however clearly, that’s not pandering to the on a regular basis shopper. The imaginative and prescient, keep in mind, was all the time a jack-of-all-trades kind of AI agent for the “everyman.” And we’re not fairly there but — however in 2025, we’ve gotten nearer than we’ve ever been earlier than.

Final October, Anthropic kicked issues off by introducing “Pc Use,” a device that allowed Claude to make use of a pc like a human would possibly — looking, looking out, accessing completely different platforms, and finishing complicated duties on a person’s behalf. The final consensus was that the device was a step ahead for expertise, however evaluations stated that in apply, it left lots to be desired. Quick-forward to January 2025, and OpenAI launched Operator, its model of the identical factor, and billed it as a device for filling out varieties, ordering groceries, reserving journey, and creating memes. As soon as once more, in apply, many customers agreed that the device was buggy, sluggish, and never all the time environment friendly. However once more, it was a big step. The following month, OpenAI launched Deep Analysis, an agentic AI device that would compile lengthy analysis experiences on any matter for a person, and that spun issues ahead, too. Some individuals stated the analysis experiences have been extra spectacular in size than content material, however others have been critically impressed. After which in July, OpenAI mixed Deep Analysis and Operator into one AI agent product: ChatGPT Agent. Was it higher than most consumer-facing agentic AI instruments that got here earlier than? Completely. Was it nonetheless robust to make work efficiently in apply? Completely.

So there’s a protracted technique to go to achieve that imaginative and prescient of an excellent AI agent, however on the similar time, we’re technically nearer than we’ve ever been earlier than. That’s why tech corporations are placing increasingly more cash into agentic AI, by means of investing in further compute, analysis and growth, or expertise. Google lately employed Windsurf’s CEO, cofounder, and a few R&D staff members, particularly to assist Google push its AI agent tasks ahead. And corporations like Anthropic and OpenAI are racing one another up the ladder, rung by rung, to introduce incremental options to place these brokers within the palms of customers. (Anthropic, as an illustration, simply introduced a Chrome extension for Claude that enables it to work in your browser.)

So actually, what occurs subsequent is that we’ll see AI coding proceed to enhance (and, sadly, probably change the roles of many entry-level software program engineers). We’ll additionally see the consumer-facing agent merchandise enhance, probably slowly however absolutely. And we’ll see brokers used more and more for enterprise and authorities functions, particularly since Anthropic, OpenAI, and xAI have all debuted government-specific AI platforms in latest months.

Total, count on to see extra false begins, begins and stops, and mergers and acquisitions because the AI agent competitors picks up (and the hype bubble continues to balloon). One query we’ll all must ask ourselves because the months go on: What will we really need a conceptual “AI agent” to have the ability to do for us? Do we would like them to interchange simply the logistics or additionally the extra private, human elements of life (i.e., serving to write a marriage toast or a notice for a flower supply)? And the way good are they at serving to with the logistics vs. the private stuff? (Reply for that final one: not superb for the time being.)

Apart from the astronomical environmental value of AI — particularly for giant fashions, that are those powering AI agent efforts — there’s an elephant within the room. And that’s the concept that “smarter AI that may do something for you” isn’t all the time good, particularly when individuals need to use it to do… unhealthy issues. Issues like creating chemical, organic, radiological, and nuclear (CBRN) weapons. Prime AI corporations say they’re more and more fearful in regards to the dangers of that. (After all, they’re not fearful sufficient to cease constructing.)
Let’s speak in regards to the regulation of all of it. Lots of people have fears in regards to the implications of AI, however many aren’t totally conscious of the potential risks posed by uber-helpful, aiming-to-please AI brokers within the palms of unhealthy actors, each stateside and overseas (suppose: “vibe-hacking,” romance scams, and extra). AI corporations say they’re forward of the danger with the voluntary safeguards they’ve applied. However many others say this can be a case for an exterior gut-check.

0 FeedbackComply with subjects and authors from this story to see extra like this in your personalised homepage feed and to obtain e-mail updates.

Hayden DisciplineShut
Hayden Discipline
Posts from this writer might be added to your day by day e-mail digest and your homepage feed.
PlusComply with
See All by Hayden Discipline
AIShut
AI
Posts from this matter might be added to your day by day e-mail digest and your homepage feed.
PlusComply with
See All AI
ColumnShut
Column
Posts from this matter might be added to your day by day e-mail digest and your homepage feed.
PlusComply with
See All Column
TechShut
Tech
Posts from this matter might be added to your day by day e-mail digest and your homepage feed.
PlusComply with
See All Tech
The StepbackShut
The Stepback
Posts from this matter might be added to your day by day e-mail digest and your homepage feed.
PlusComply with
See All The Stepback

What's Hot

How Tariffs Are Playing Out So Far

Why brands are delaying creator holiday deals until the last minute

Tamron launches All-in-One 8x zoom for Sony E-mount by Jose Antunes

AI agents are science fiction not yet ready for primetime

This Online Account Pays an Eye-Popping 6.75% — Just as the Fed Gets Ready To Cut

Unified and Augmented by AI Agents

China announces a 5-year plan focused on science, tech self-reliance

The Wolf’s Goodbye in ‘Pulp Fiction’: One of The Coolest Exits Ever Written

Unpacking the Dark Legacy Behind the Gold Watch in ‘Pulp Fiction’

Ready to File? Test Your Social Security Application IQ

How Tariffs Are Playing Out So Far

Why brands are delaying creator holiday deals until the last minute

Tamron launches All-in-One 8x zoom for Sony E-mount by Jose Antunes

Southwest’s new cabin design has more legroom—for some people

Four ways to be more selfish at work

How to Create a Seamless Instagram Carousel Post

Up First from NPR : NPR

Meta Plans to Release New Oakley, Prada AI Smart Glasses

Our Picks

How Tariffs Are Playing Out So Far

Why brands are delaying creator holiday deals until the last minute

Subscribe to Updates

What's Hot

AI agents are science fiction not yet ready for primetime

Hayden Discipline

AI

Column

Tech

The Stepback

Related Posts