Software Still Fails the Same Way It Did in 1995. A 2026 Index Just Named the Patterns.

I read the new report on a Wednesday morning, coffee going cold, and I started laughing. Not because it was wrong. Because it was so exactly right that it could have been written about a project I watched go sideways before some of its authors finished high school. The 2026 Software Delivery Failure Index from Sonatafy Technology had crisp new names for four ways software dies, and every single one of them was a story I already knew by heart. Same disease. New branding. I closed the laptop, opened it again, and read the whole thing twice.

Here is the uncomfortable part. We have been measuring this failure for more than thirty years, and the rate has barely moved. Standish started counting in the 1990s. The methodologies changed, the tooling changed, the job titles changed, and the projects kept failing in the same shape. That is not bad luck. That is a root cause nobody is treating.

37%

of organizations name inaccurate requirements as the primary reason their projects fail. Not slow code. Not weak talent. The requirements

Source: Project Management Institute, Requirements Management Pulse Report

What did the 2026 Sonatafy index actually find?

It found four patterns, and gave each one a name that sticks. Sonatafy distilled 195 episodes of a software-leadership podcast into 48 detailed failure narratives, then looked for what kept repeating across them (Sonatafy Technology, 2026 Software Delivery Failure Index). The result is the most quotable taxonomy of project death I have seen in a while. Read it once and you will recognize your last bad quarter in it.

The four: the Backlog Illusion, where a team looks productive on every velocity chart while shipping nothing a customer would pay for. The Coordination Tax, where product, engineering, and leadership quietly mean different things, and the rework from that mismatch surfaces sprints later at exponentially higher cost. The Ownership Gap, where you add engineers and somehow get slower, because nobody owns the outcome. And the AI Validation Gap, where AI accelerates the building and the checking never catches up. One report buried a single brutal number: a company in the dataset overspent by 42 million dollars scaling into 20 geographies before it had validated the product even worked. Forty-two million. To learn something a hard conversation could have surfaced for free.

Four patterns, dressed in 2026 language. I could match every one of them to a project from a decade or two ago without trying. The names are new. The graveyard is old.

Why does the failure rate barely move no matter the methodology?

Because almost every methodology we have ever shipped improves the build, not the decision about what to build. Think about the parade. Waterfall promised discipline. Agile promised responsiveness. DevOps promised flow. Each one genuinely improved some slice of how teams execute, faster feedback here, cleaner pipelines there, and each one left the hardest, oldest part of the work completely untouched: figuring out, before you build anything, what the system actually needs to do and why it needs to do it that way.

That part has a name. Discovery. It is unglamorous, it does not demo well, and it is where the failures are seeded. I used to think the methodology wars were mostly about ego and conference talks. I changed my mind watching enough projects fail across every framework. The framework was never the variable. A team can run flawless Agile ceremonies and still build the wrong thing beautifully, on time, under budget, and watch it die in production. The ceremony was perfect. The requirement underneath it was a guess nobody checked.

$2.24M

more per strategic project, on average, for companies with weak requirements practices versus companies with strong ones. The tax is structural, and it is paid every project

Source: IAG Consulting, Business Analysis Benchmark

So the rate holds steady because we keep optimizing the part that was never the bottleneck. It is a little like buying faster ovens to fix a restaurant that keeps cooking the wrong order. The kitchen gets quicker. The complaints do not stop. For the full cost mechanics of a missed requirement as it travels downstream, we walked through the numbers in the hidden cost of missed requirements, and the cost-of-delay curve in the 29x rule. The short version: the later you catch it, the more it costs, and a guessed requirement is a mistake you have agreed not to catch until it is expensive.

Where do all four 2026 failure patterns actually trace back to?

To one place. Requirements that nobody validated against the real work. Walk the four patterns down to the bedrock and they converge. The Backlog Illusion is a team busy delivering features no validated requirement asked for. The Coordination Tax is, almost by definition, a requirements problem: product, engineering, and leadership holding three incompatible mental models of the same spec and not discovering it until the code makes the disagreement concrete. Misalignment is just unvalidated requirements wearing a calendar.

The Ownership Gap looks like a management issue, and partly it is. But ask why nobody owns the outcome and you usually find there was no shared, validated definition of the outcome to own in the first place. You cannot hold someone accountable to a target that was never pinned down. And the AI Validation Gap is the newest face of the oldest problem: a machine confidently producing what it was told, when what it was told was never checked against reality. I want to be fair here, because it would be lazy to claim requirements is the only variable. Talent matters. Process discipline matters. Funding and politics absolutely matter. But across thirty years of data, the requirements layer is the one that shows up first, costs the most, and repeats the hardest.

From the field

The clearest proof at national scale is the United Kingdom's National Programme for IT, launched for the NHS in 2002 as the largest civilian software programme in the world. The original estimate was roughly 2.3 billion pounds. By the time it was dismantled in 2011, spending had passed 10 billion (UK Parliament, Public Accounts Committee report).

The Public Accounts Committee did not trace the collapse to bad engineers or weak code. It traced it to requirements that were never anchored in the real work. Suppliers built against centralized specifications that did not capture how clinical environments actually operated, and the people who would use the system every day were not engaged to define what they genuinely needed. The spec was top-down. The reality was bottom-up. The two never met.

Ten billion pounds. To relearn the lesson that a documented requirement is not the same thing as a validated one. The technology was buildable. The requirements were a guess wearing the authority of a government mandate.

Did AI change the failure pattern, or just the speed?

Just the speed. And that is the whole point of the AI Validation Gap. The failure pattern is unchanged: build the wrong thing, discover it late, pay to undo it. What AI changed is the clock. When generation was slow, a bad requirement took weeks to become a broken feature, and those weeks were accidental insurance, time in which a sharp reviewer or a nervous tester might catch the gap before it shipped. That buffer is gone.

Now an agent turns a one-line prompt into running, plausible, passing code in minutes. Feed it a wrong requirement and it builds the wrong thing flawlessly, faster than anyone can review it, with the unearned confidence of clean syntax. The mistake used to crawl downstream. It sprints now. We have written before about how this plays out inside the editor and the spec, in what requirements intelligence actually is, and the lesson holds: a faster builder is a force multiplier on whatever you point it at, including the wrong target. AI is not making projects fail in new ways. It is making them fail sooner, and at higher volume, which from a distance can look like the same old failure rate holding steady while the underlying tempo quietly doubles.

AI did not invent the failure. It deleted the slack that used to let teams catch a wrong requirement before it shipped.

What does fixing requirements first actually change?

It changes where you pay. Every project pays for its requirement mistakes; the only question is whether you pay in a cheap discovery conversation or an expensive production incident. Fixing requirements first moves the catch upstream, to the moment when a mistake is a sentence you can rewrite instead of a system you have to rebuild. That is the entire economic argument, and it has been true since long before anyone said the word "agentic" out loud.

Here is what that looks like in practice. You run discovery on purpose instead of hoping a curious analyst stumbles onto the missing exception in minute 55 of an interview. You surface the conflicting assumption between two teams before the code makes it permanent. You capture the reason behind each requirement, the why, as a first-class artifact, so the next person and the next agent inherit it instead of guessing. That is what requirements intelligence is for. It treats discovery as a structured, multi-expert process with an output worth building from, rather than a meeting people skip when the sprint is tight. The 2026 index gave four failure patterns memorable names. Good. Naming a disease is progress. But you do not cure it by renaming it every five years. You cure it by treating the root, and the root has been the same since 1995.

The names keep changing. The root cause does not.

The 2026 Sonatafy index named four ways software dies: the Backlog Illusion, the Coordination Tax, the Ownership Gap, and the AI Validation Gap. Useful names. Underneath all four sits the same root the industry has documented for decades: requirements that were wrong, incomplete, or never validated against the real work.

AI did not change that pattern. It changed the speed, deleting the accidental delay that used to let teams catch a bad requirement before it shipped. The failure rate moves when the requirements move, not when the methodology gets a new name. Catch the mistake in a discovery conversation, where it costs a sentence, or catch it in production, where it costs a rebuild. That choice, made before any code exists, is the one that has decided projects for thirty years and still does.

I went back to that report a third time before writing this. Same four patterns. Same recognition. The difference between the projects I watched survive and the ones I watched sink was never the methodology and, lately, it is never the model. It was whether someone did the unglamorous work of validating the requirements before the building started. That work is cheap. Skipping it is what costs ten billion pounds.

What are the most common questions about why software projects fail?

Because almost every new methodology improves how teams build, not what they decide to build. Waterfall, Agile, DevOps, and now AI-assisted delivery each made execution faster or cleaner. None of them fixed the discovery layer, where someone has to figure out what the system actually needs to do and why. Study after study, from the 1990s to the 2026 Sonatafy Software Delivery Failure Index, keeps landing on the same root: requirements that were wrong, incomplete, or never validated against the real work. New tools, same root cause.

The Backlog Illusion (teams look busy hitting sprint velocity while shipping little real value), the Coordination Tax (misalignment between product, engineering, and leadership creates compounding rework that surfaces sprints later at higher cost), the Ownership Gap (adding engineers without clear accountability slows execution instead of speeding it), and the AI Validation Gap (AI accelerates delivery while validation fails to keep pace). Sonatafy built the index from 195 leadership-podcast episodes distilled into 48 failure narratives. The patterns describe symptoms. Underneath the Coordination Tax in particular sits unvalidated requirements.

It is the most consistently documented one. The Project Management Institute found that 37% of organizations name inaccurate requirements as the primary reason their projects fail. IAG Consulting's benchmark found companies with weak requirements practices spend on average 2.24 million dollars more per strategic project than companies with strong ones. The Standish CHAOS reports have listed incomplete requirements and weak user involvement among the top failure factors for decades. Code quality matters, talent matters, process matters, but the requirements layer is where the largest and most repeatable losses originate.

No. AI changed how fast teams reach the failure, not the shape of it. When generation was slow, a bad requirement took weeks to become a broken feature, and there was time to catch it. Now an agent turns a one-line prompt into running code in minutes, so a gap in the requirements becomes shipped, confident, wrong software faster than anyone can review it. The 2026 Sonatafy index named this directly as the AI Validation Gap. Faster building does not help if you are building the wrong thing. It just gets you there sooner.

It moves the cheapest place to catch a mistake, the discovery conversation, ahead of the most expensive place to catch it, production. Structured discovery surfaces the hidden exception, the conflicting assumption, and the reason behind a rule before a line of code exists. That is what requirements intelligence does: it runs multi-expert discovery on purpose, captures the why behind each requirement, and hands builders and AI agents a brief that already contains the edge cases. The failure rate moves when the requirements move, because that is where the failures were being seeded all along. See what requirements intelligence is for the full method.

Nicolas Payette

CEO and Founder, Specira AI

Nicolas Payette has spent 25 years in enterprise software delivery, leading digital transformations at companies like Technology Evaluation Centers and Optimal Solutions. He founded Specira AI to solve the root cause of project failure: unclear requirements, not slow code.

What did the 2026 Sonatafy index actually find?

Why does the failure rate barely move no matter the methodology?

Where do all four 2026 failure patterns actually trace back to?

Did AI change the failure pattern, or just the speed?

What does fixing requirements first actually change?

The names keep changing. The root cause does not.

What are the most common questions about why software projects fail?

Related reading

The Hidden Cost of Missed Requirements

The 29x Rule

Treat the root cause, not the new name for it.