June 15, 2026

How to measure AI ROI: what to define before turning on the model

AI return is not measured at the end of the project. It is set at the start, by choosing the right metric and recording the baseline before anything runs.

AI for businessROIMetrics

The question about return almost always arrives late. The project runs, someone asks for the ROI, and the team discovers there is no way to calculate it, because nobody measured the prior state. AI return is not something you compute at the end. It is something you design at the start, with three simple decisions.

1. Choose one main metric

Resist the urge to measure ten things. Pick a metric that represents the value of the process and anchor the project to it. It usually falls into one of four families:

  • Time: how much faster a case is resolved.
  • Unit cost: how much it costs to process each item.
  • Error rate: how many cases come out wrong and need rework.
  • Revenue or conversion: how many cases turn into a financial result.

There can be secondary metrics, but one has to be the main one. That is the one that tells you whether it was worth it.

2. Record the baseline

Before turning on the model, measure the current state. How long it takes today, how much it costs today, the error rate today. That number is your point of comparison. Without it, any result becomes opinion. With it, the conversation becomes objective: we went from X to Y.

The baseline does not need to be perfect. It needs to exist and be honest. Two weeks of real data are worth more than a meeting-room estimate.

3. Count the full cost, not just the tool

ROI is return on investment, and the investment is not just the tool subscription. Add the time to set it up and maintain it, the integration with current systems, the human review of exceptions, and the cost of mistakes early on. Projects look cheap when you forget half the cost. Compare the real gain against the real cost.

A concrete example

Imagine an order triage that takes eight minutes per case today, with two full-time reviewers. The main metric is time per case. The baseline is eight minutes. The pilot puts AI on the first pass and keeps a human reviewing only the exceptions.

After a month, the average time drops to three minutes and the error rate stays stable. The gain is not "AI is amazing". The gain is five minutes per case, multiplied by the monthly volume, minus the cost of running the system. That is a number you take to leadership without needing an adjective.

The most common mistake

Measuring only what is easy to see and ignoring what matters. Speed goes up, but quality dropped and nobody noticed. That is why it helps to always have a guard metric: the constraint that cannot get worse. In the support example, speed is the target and customer satisfaction is the guard. Improving the target while destroying the guard is not a gain. It is debt in disguise.

Summary

Choose one main metric, record the baseline, count the full cost, and protect the constraint that cannot get worse. Do that, and ROI stops being a debate and becomes arithmetic.

I cover more of this on YouTube and X.