Why 40% of Agentic AI Projects Get Cancelled

The vote took four minutes. Eighteen months of work, a dedicated squad, a budget line half of finance could recite from memory, and the steering committee killed the agentic AI pilot in roughly the time it takes to reheat a coffee. Not because it broke. The demos ran clean. The agent booked the meetings, drafted the summaries, even flagged a duplicate invoice once in a way that made the room go quiet for a second. What killed it was a question nobody on the build team could answer without hedging: what, exactly, was this thing allowed to decide, and who signed off on that? Nobody knew. The room moved to the next agenda item, and a year and a half of effort became a line in the postmortem.

That scene is about to play out across boardrooms everywhere, and there is now a number attached to it. In June 2025, Gartner predicted that more than 40% of agentic AI projects will be cancelled by the end of 2027, undone by escalating costs, unclear business value, or inadequate risk controls (Gartner, June 2025). Read those three causes one more time. Not one of them says the model was too weak, hallucinated too often, or could not reason. The agents mostly work. The programs around them do not, and they fail for reasons that were decided long before a single token was generated.

>40%

of agentic AI projects will be cancelled by the end of 2027, driven by escalating costs, unclear business value, or inadequate risk controls

Source: Gartner, "Over 40% of Agentic AI Projects Will Be Canceled by End of 2027," June 25, 2025

What is actually driving agentic AI project cancellations?

Hype, mostly, meeting reality at the production door. Gartner is blunt about the mechanism: most agentic projects today are early-stage experiments or proofs of concept, often driven by buzz and just as often misapplied, which blinds organizations to the real cost and complexity of running agents at scale. Then the bill arrives. The same report calls out "agent washing," the rebranding of old chatbots, assistants, and robotic process automation as agentic, and estimates that of the thousands of vendors claiming agentic capabilities, only around 130 are the real thing.

The money makes the stakes obvious. Worldwide AI spending is forecast to hit $2.59 trillion in 2026, a 47% jump year over year, according to Gartner (Gartner via CIO Dive, May 2026). When that much capital pours into a category, finance starts asking harder questions sooner. A January 2025 Gartner poll of 3,412 webinar attendees found only 19% had made significant investments in agentic AI, with 42% taking a conservative position and the rest waiting or unsure. Translation: most of the market is still cautious, still in pilot, still deciding whether the thing pays back. Those pilots are exactly the ones a CFO can cancel without much resistance, because nobody armed them with proof.

Here is the part I want to sit on, because it is the whole argument. Cost, unclear value, weak risk controls. Those are not three separate problems. They are three symptoms of one upstream failure: the agent was pointed at a goal nobody bothered to define, bound, or own. Cost runs away because there was no success criterion to stop it. Value stays unclear because "clear value" was never specified. Risk controls stay weak because the behavior they were supposed to constrain was never written down. The model is fine. The brief behind it never existed.

Why does "human-on-the-loop" still need governed requirements?

Because you cannot supervise a goal that was never written down. The governance world spent the last two years moving from "human-in-the-loop," where a person approves every agent action before it fires, to "human-on-the-loop," where the agent runs autonomously and a person watches, ready to intervene. That shift is necessary; you cannot scale agents if a human has to bless each step. But it quietly raises the stakes on the upstream work, because supervision only works against a defined target.

Think about what on-the-loop oversight actually requires. A supervisor has to notice when the agent drifts from its intended behavior. Drifts from what, though? If "intended behavior" lives only in someone's head, or worse, in three people's heads with three different versions, the supervisor has nothing to measure drift against. They are watching a process with no spec, which is just watching. Gartner made a related point in May 2026, warning that applying one uniform governance policy across very different agents will itself cause enterprise AI agent failure (Gartner, May 2026). Governance has to be specific to what each agent is for. And "what it is for" is a requirements question, decided before the build, not a policy you bolt on after.

An agent without governed requirements is not autonomous. It is unsupervised, which is a very different and far more expensive thing. Autonomy is freedom inside boundaries you defined. Without the boundaries, you do not have an agent. You have a liability that ships fast.

How do ambiguous goals become unprovable ROI?

Simple chain. You cannot prove value you never defined. When a pilot launches against "improve customer operations" or "make the team more productive," it has set itself up to fail the only review that matters, the one eighteen months later where someone asks for the number. There is no number, because nobody agreed on what to count. So the conversation defaults to vibes, and vibes lose to a hard budget line every single time.

I have watched this exact failure from the other side of the table, and it is rarely the technology that disappoints. The agent did real work. It just did work nobody had tied to a metric a CFO recognizes. "We summarized 40,000 tickets" is an activity, not a value. Did it cut handle time? By how much, against what baseline, measured how? If those questions were not answered during discovery, they will not get answered during the postmortem, and the project dies with a working demo still running on someone's laptop. Provable value is a requirements artifact. You define it up front or you forfeit it.

And the counterargument is fair, so I will give it its sentence: yes, some agentic value is genuinely hard to quantify early, and forcing a fake number can be worse than admitting uncertainty. True. But there is a wide gap between "we are honestly unsure and here is how we will find out" and "we never defined success at all," and almost every cancelled pilot I have seen lived in the second camp, not the first. This is the through-line of everything we have written on AI governance and the audit trail, and it starts upstream of the model.

What does an audit-ready requirements trail give an agentic program?

It gives you an answer to the question that killed the pilot in the opening. An audit-ready requirements trail records, for every decision the agent is allowed to make, what the objective was, what the boundaries are, what trade-off was chosen, and who signed off. When the steering committee asks "what was this allowed to decide, and who decided that," the answer is a document, not a shrug. That single difference is often what separates a program that graduates from one that gets voted down.

The market is already pricing this in. Gartner expects spending on AI governance platforms to reach $492 million in 2026 and pass $1 billion by 2030 (Gartner, February 2026). Money is moving toward governance because the cancellations taught a lesson the hard way. But here is the nuance I keep coming back to: a governance platform records and enforces decisions. It does not make the decisions correct or complete in the first place. If the requirements feeding it are ambiguous, you now have an audit trail of ambiguity, beautifully logged. The trail is only as good as the discovery behind it.

From the field

EY offers a useful counterexample to the cancellation trend, and it is worth crediting plainly. As the firm scaled agentic AI, it did not start by chasing autonomy. It started by building the governed foundation underneath it. Its EY.ai EYQ platform was deployed to more than 300,000 professionals, with what EY describes as governed prompt tooling, a centralized model catalog, guardrails, and a permissioned, lineage-rich, compliant data layer through the EY.ai Data Marketplace (EY, 2026).

The outcome is the tell. Within nine months the platform supported the development of more than 50,000 agents, and workforce adoption crossed 80%. That is not a pilot quietly dying on a laptop. That is agents scaling, because the objectives, the data lineage, and the controls were defined before the fleet was let loose, not bolted on after a committee got nervous.

The lesson is not "be EY," with EY's budget. The lesson is the order of operations. Govern the requirements first, then scale the agents. Do it backwards and you join the 40%.

The agent is identical on both sides. The only difference is whether the requirements were governed before launch. That difference decides the review.

How does this connect to the August 2 EU AI Act deadline?

It connects directly, and the timing is tight. On August 2, 2026, the high-risk obligations of the EU AI Act become enforceable, including Article 12's automatic logging and record-keeping requirements and the law's human-oversight provisions. An agentic system that cannot show what it was permitted to decide, who approved it, and what it actually did is now exposed twice over: it fails the internal value review and the external compliance bar with the same missing artifact. We covered the mechanics in detail in the EU AI Act compliance guide, and the deadline is no longer theoretical.

So notice the convergence. The internal reason agentic pilots get cancelled (no defined objective, no provable value, no auditable trail) is the same shape as the external reason they fail compliance (no record of decisions, no human-oversight evidence, no logged behavior). One substrate satisfies both. A validated, traceable requirements set says what the agent is for, what it may decide, who owns that decision, and it does so in a form an auditor and a CFO can both read. Build that once and you answer two committees at the same time. Skip it and you are exposed on both fronts, which is a genuinely bad place to be forty days before a regulatory deadline.

The model was never the risk

Agentic AI projects do not mostly die of weak models. They die of ambiguous goals, unprovable value, and ungoverned behavior, which are all requirements failures wearing a technical costume. Gartner's three cancellation causes share one root, and it sits upstream of the first prompt.

Govern the requirements before you scale the agents. Define the objective in numbers, bound what the agent may decide, name the owner of every trade-off, and record it in a trail an auditor and a CFO can both read. Do that and your pilot survives the steering review and the August 2 deadline at once. Skip it and you ship fast, prove nothing, and join the 40%. This is the same discipline we argue for in why your specs, not your agent, are the problem.

What are the most common questions about governing agentic AI projects?

Gartner predicts that more than 40% of agentic AI projects will be cancelled by the end of 2027, driven by escalating costs, unclear business value, and inadequate risk controls (Gartner, June 2025). Gartner notes that most current projects are early-stage experiments driven by hype, and that vendor "agent washing" inflates expectations, with only about 130 of thousands of claimed agentic vendors considered real. None of the three failure causes is the model's raw capability.

They get cancelled when a steering committee cannot answer two questions: what was this agent supposed to do, and who decided that. Cost overruns and weak risk controls are usually downstream symptoms of an upstream gap. The agent was launched against an ambiguous goal and ungoverned requirements, so its value cannot be proven and its behavior cannot be controlled. The model rarely fails on its own. The brief behind it was never validated.

Human-in-the-loop means a person approves each agent action before it executes. Human-on-the-loop means the agent acts autonomously while a person supervises and can intervene. The shift toward human-on-the-loop is what makes governed requirements non-negotiable, because a supervisor can only catch an agent drifting from intent if that intent was defined, bounded, and assigned an owner in the first place. You cannot oversee a goal nobody wrote down.

Governed requirements give an agentic program the three things its cancellation causes attack: a defined objective that makes business value provable, explicit risk boundaries that make behavior controllable, and a traceable record of who decided what that makes the program auditable. Requirements intelligence runs structured discovery before the agent is built, so success is specified before launch rather than argued about after. See what requirements intelligence is for the full method.

High-risk obligations under the EU AI Act become enforceable on August 2, 2026, including record-keeping and automatic logging under Article 12 and human-oversight requirements. An agentic system that cannot show what it was allowed to decide, who approved it, and what it actually did is exposed on both fronts at once: it fails the internal value review and the external compliance bar. A validated, traceable requirements trail is the same substrate that satisfies both. See the EU AI Act compliance guide.

Nicolas Payette

CEO and Founder, Specira AI

Nicolas Payette has spent 25 years in enterprise software delivery, leading digital transformations at companies like Technology Evaluation Centers and Optimal Solutions. He founded Specira AI to solve the root cause of project failure: unclear requirements, not slow code.

Why 40% of Agentic AI Projects Will Be Cancelled by 2027

What is actually driving agentic AI project cancellations?

Why does "human-on-the-loop" still need governed requirements?

How do ambiguous goals become unprovable ROI?

What does an audit-ready requirements trail give an agentic program?

How does this connect to the August 2 EU AI Act deadline?

The model was never the risk

What are the most common questions about governing agentic AI projects?

Keep your agents off the cancellation list.

What is actually driving agentic AI project cancellations?

Why does "human-on-the-loop" still need governed requirements?

How do ambiguous goals become unprovable ROI?

What does an audit-ready requirements trail give an agentic program?

How does this connect to the August 2 EU AI Act deadline?

The model was never the risk

What are the most common questions about governing agentic AI projects?

Related reading

AI Governance Starts With the Requirements, Not the Model

EU AI Act Compliance: What August 2, 2026 Actually Requires

Keep your agents off the cancellation list.