Amit Kothari
Amit Kothari CEO of Tallyfy, AI advisor at Blue Sheen

The AI failure post-mortem template

In brief

MIT research shows 95% of generative AI pilots fail to achieve results. When they do, most companies bury failures instead of extracting lessons. A structured post-mortem process paired with proper iteration budgeting turns project failure into organizational knowledge that prevents repeating mistakes.

MIT’s State of AI in Business 2025 report found that almost all generative AI pilots fail to achieve rapid revenue acceleration. CIO Dive reported that 88% of AI pilots fail to reach production, and that played out exactly as expected. Thousands of failed projects.

What frustrates me is the pattern that follows every single one of them. Teams blame data quality, vendor hype, or “unclear requirements.” Then they move on. Nothing gets documented. The budget for next time looks identical. The abandonment rate tells the whole story: 42% of companies walked away from most AI initiatives in 2025, up from 17% the year prior.

The problem isn’t that AI projects fail. It’s that we fail to learn from them. Understanding the common patterns of why AI projects fail makes post-mortems more targeted.

Why post-mortems usually miss the point

Post-mortems routinely read like legal briefs. Fifty pages of defensive explanations about why nobody could have predicted the collapse. These rubbish documents exist to protect careers.

Not extract knowledge.

Research from RAND Corporation interviewed 65 data scientists and engineers and identified five leading root causes of AI project failure. The first one: industry stakeholders often misunderstand or miscommunicate what problem needs to be solved using AI.

That’s not a data science problem. That’s a listening problem.

I might be wrong, but I’d argue that’s where most enterprise AI projects actually break down first. When post-mortems focus on technical debugging rather than communication breakdowns, they miss the real issue. The thing is, the code worked fine. The humans didn’t agree on what it should do.

The budget structure that dooms learning before it starts

Most AI budget templates have the same flaw. They allocate funds for building the thing, not for learning how to build it better.

Most AI projects never reach production at all, and the ones that do crawl through months of iteration to get there. Meanwhile, about 85% of companies miss their AI cost forecasts by more than 10%, and nearly a quarter are off by 50% or more. When the project fails, teams blame the estimate. The estimate wasn’t the problem. The painful lack of iteration budget was.

AI projects aren’t software deployments. They’re experimental cycles. Each round teaches you something about your data, your problem, or your organization’s readiness. If your AI budget template only covers “Phase 1: Build, Phase 2: Deploy,” you’ve already lost.

Organizations that learn fast budget differently. They plan for three to five experimental iterations with post-mortem analysis built into each cycle. Not as an afterthought when things collapse, but as a scheduled learning checkpoint.

This means allocating real time and money for:

  • Documenting what you tried and why it didn’t work
  • Analyzing root causes with people who weren’t on the project team
  • Updating your approach based on what you learned
  • Sharing what we learned across the organization so others don’t repeat the same mistakes

S&P Global’s AI adoption survey drives this home: companies are walking away from AI initiatives at the proof-of-concept stage in record numbers, and the ones that break through are the ones that put real budget into adoption rather than just the model. The companies that fail spend everything on building and nothing on the learning. You would think that is obvious. Apparently not.

Need a thinking partner on this? Blue Sheen takes on this kind of advisory work.

What a useful post-mortem actually tracks

AI post-mortem loop from failure through root cause and learning capture to prevention

Google’s Site Reliability Engineering team, built by Ben Treynor Sloss, has refined the post-mortem into something worth doing. Their approach is blameless: understand how something happened, not who is responsible. Borrow from established incident response practices instead of inventing your own. Consistent structure: problem, trigger, root cause, correlating problems, action items. Two to three pages maximum. Not a dissertation. A learning tool.

For AI projects, I’d add specific failure categories based on the RAND research:

Problem misalignment: Did stakeholders agree on what problem they were solving? If not, where did the communication break down? Who needed to be in earlier conversations but wasn’t?

Data quality gaps: What specific data issues prevented the model from performing? Where were they discovered: before training, during testing, or after deployment? Could they have been caught earlier?

Infrastructure limitations: Did the team have the technical foundation to support this application? What capabilities were missing? How much would it cost to build them versus buy them?

Expectation management: Who oversold what the AI could do? Where did unrealistic expectations come from: vendor promises, internal pressure, real misunderstanding?

Wrong problem selection: Was this problem actually solvable with current AI capabilities? Should the team have started with something simpler?

These aren’t yes/no questions. They’re diagnostic tools. The deeper you dig, the more useful the post-mortem becomes.

Root causes at the organizational level

Taiichi Ohno’s five whys technique works for technical failures. “Why did the model underperform?” Training data was incomplete. “Why was it incomplete?” The team couldn’t access the production database. “Why not?” IT security protocols blocked the connection. You get the idea.

But AI project failures often have organizational root causes that five whys won’t reach.

By some estimates, more than 80 percent of AI projects fail, at twice the rate of non-AI IT projects. Turns out, that’s probably not because the technology is harder. It’s because organizations haven’t adapted their processes to handle experimental work. RAND’s interviews with 65 data scientists and engineers confirm that the majority of challenges in AI rollout relate to people and processes, not technical issues.

When a post-mortem reveals that the project failed because “we needed three more months for data preparation,” that’s not the root cause. The root cause is this: the team estimated AI implementation like software development, using fixed timelines for experimental work.

Can better project management fix this? No. The fix isn’t padding the schedule. It’s changing how you fund and manage AI projects. An AI budget template designed for iterative learning, not linear delivery. A tough sell, but the right one.

This matters because the next project will fail the same way unless you change the funding model. The post-mortem document means nothing if it doesn’t change how you allocate resources.

Making post-mortems into something that lasts

The best post-mortems become organizational assets. Not PDFs buried in SharePoint. Living documents that shaped every project that followed.

One approach: maintain a central repository of AI project learnings tagged by failure pattern. When someone proposes a new AI initiative, they review relevant post-mortems first. Prevents repeating known mistakes.

Another: quarterly cross-team sessions where teams share recent failures and learnings. Not formal presentations. Working sessions where people troubleshoot each other’s problems. Atlassian treats these as incident management processes, treating project failures like system outages: learning opportunities rather than career enders. Amy Edmondson’s research at Harvard Business School calls this psychological safety. People need to feel safe reporting failures, or they will not report them.

The shift that matters is treating AI project failures as data collection rather than performance failures. Good luck getting most boards to see it that way. You’re gathering information about how AI works in your specific organizational context. Each failure teaches you something about your data, your processes, or your readiness. But only if you budget for that learning.

Poor data quality and readiness ranks as the top obstacle to AI success, cited by 43% of organizations in Informatica’s CDO Insights 2025 survey. Known problem. Documented clearly. How many AI budgets include real funds for data quality assessment and remediation before model development even starts?

Very few. Because they’re built on implementation assumptions rather than learning assumptions.

Learning budgets matter more than building budgets. Post-mortems accelerate that learning, but only if you fund them properly and take what we learn seriously.

I said accelerate. More accurately, they make learning possible at all.

When your next AI project fails, and the statistics say it probably will, the question isn’t whether to document it. It’s whether you’ve budgeted enough time and money to extract real value from that failure. Most organizations haven’t.

The failed project isn’t the real waste. The lost learning is.

About the Author

Amit Kothari is an experienced consultant, advisor, coach, and educator specializing in AI and operations for executives and their companies. With 25+ years of experience, he is the Co-Founder & CEO of Tallyfy® (raised $3.6m, the Workflow Made Easy® platform) and Partner at Blue Sheen, an AI advisory firm for mid-size companies. He helps companies identify, plan, and implement practical AI solutions that actually work. Originally British and now based in St. Louis, MO, Amit combines deep technical expertise with real-world business understanding. Read Amit's full bio →

Disclaimer: The content in this article represents personal opinions based on extensive research and practical experience. While every effort has been made to ensure accuracy through data analysis and source verification, this should not be considered professional advice. Always consult with qualified professionals for decisions specific to your situation.

Related Posts

View All Posts »
AI budget template - plan for iteration, not implementation

AI budget template - plan for iteration, not implementation

Traditional project budgeting assumes you know the outcome before you start. AI budgeting assumes you will discover the outcome through iteration. RAND research shows more than 80 percent of AI projects fail because of this mismatch. Here is a practical framework mid-size companies can use to budget for AI projects without setting money on fire.

The consultant who fought to keep his client off AI

The consultant who fought to keep his client off AI

Some advisors resist letting a company connect AI to its own systems, dressed up as too risky. The Everlaw survey found 90% of legal professionals expect AI to change billing within two years. The real driver is an AI consultant protecting the gatekeeper role.

Good-enough AI will eat the premium-model business

Good-enough AI will eat the premium-model business

Good-enough AI is driving commoditization from below. Stanford HAI clocked a 280-fold drop in the cost of running a GPT-3.5-level model. Once a cheaper model clears the bar for a job, the frontier model stops earning its premium for that job.

How I run my whole consulting practice with Claude

How I run my whole consulting practice with Claude

I run Blue Sheen, my AI advisory firm, through Claude and Claude Code. The practice lives in a version-controlled folder that Claude reads at the start of every session, with Close CRM as the source of truth. This is the real workflow stage by stage: prospecting, proposals, delivery, and the judgment a human still has to own.

When to use a dynamic workflow

When to use a dynamic workflow

A dynamic workflow in Claude Code runs up to sixteen subagents at once and a thousand across a job. That power is wasted on most tasks. This is the decision I use before reaching for one: when a single agent wins, when a dynamic workflow earns its cost, and when the answer is to not automate at all.

AI does tasks. It does not do jobs.

AI does tasks. It does not do jobs.

Ten years building Tallyfy, and a year pointing AI agents at it, taught me one blunt thing. A job is a chain of tasks, and AI reliability multiplies down that chain until the whole thing is a coin flip. The fix is not a smarter model.

AI advisory services via Blue Sheen.
Contact me Follow 10k+