CAPA Effectiveness Metrics: The KPIs That Tell You Corrective Action Is Working
Verifying that one corrective action worked is a per-CAPA question. You set acceptance criteria, run the observation window, and confirm the specific failure mode stopped. CAPA effectiveness verification covers exactly how to do that for a single record.
This post is about the other question, the one management review is supposed to answer and usually does not: is your CAPA system working at all? You can verify every individual CAPA correctly and still run a program that is quietly failing, because the same classes of problems keep generating new CAPAs and nobody is watching the pattern. The metrics that catch that are different from the ones most quality dashboards show, and most dashboards show the wrong ones.
Activity Is Not Effectiveness
Walk into a typical quality review and the CAPA slide shows two numbers: how many CAPAs are open, and what percentage closed on time. Both are activity metrics. They measure how busy the process is, not whether it is removing causes.
A team can close 100 percent of its CAPAs on time and have a broken corrective action system. On-time closure rewards speed, and the fastest way to close a CAPA is to skip the verification window and write a containment action in the permanent-action box. Count of open CAPAs is worse, because it moves for reasons that have nothing to do with effectiveness. A spike in open CAPAs can mean the process is getting worse or it can mean detection got better and you are finally catching defects you used to ship. The number alone cannot tell you which.
Effectiveness has one definition that survives scrutiny: the same problem does not come back. Every metric below is a different way of measuring that, and the discipline is to track effectiveness metrics on the management slide and demote activity metrics to the working level where they belong.
Recurrence Rate: The Metric the Whole System Answers To
Recurrence rate is the headline. It is the percentage of closed CAPAs where the same failure mode returned within a defined window after closure.
Recurrence rate = (CAPAs whose failure mode recurred post-closure) / (CAPAs closed in the period)
The two design decisions that make or break it are the window and the definition of "same." Pick a window long enough for the failure mode to actually reappear if the fix did not hold. For a defect that shows up once every few thousand parts, a 30-day window is meaningless because the process may not have run enough volume to expose recurrence. Tie the window to production volume or number of runs, not just calendar time, when the failure mode is rare.
The target most customer-specific requirements drive toward is zero recurrence, and that is the right direction to trend even if you never sit on zero in a given quarter. What matters more than the absolute number is the slope. A recurrence rate trending down quarter over quarter says the system is learning. A flat or rising rate says you are closing records, not solving problems, no matter how good the on-time closure number looks next to it.
This is the same idea the 8D formalizes at D7, where you confirm recurrence has been prevented across the broader system, not just on the one line. If your discipline runs 8Ds, the recurrence rate is the rolled-up D7 result for the whole portfolio. 8D vs CAPA covers where the two formats diverge and why D7 so often gets collapsed into the same afternoon as D5.
Define "Same Failure Mode" Before It Becomes an Argument
Recurrence rate lives or dies on how you decide two events are the "same" problem, and if you leave that undefined, every recurrence becomes a debate that the CAPA owner wins. "That was a different lot, different cause" is the sentence that drives a recurrence rate to a fake zero.
Set the rule in advance and write it down. A workable definition keys on the failure mode plus the affected element, not on the convenience of the person who owns the record:
- Same part number and same defect code is the strictest and cleanest. If part 48217 fails for porosity twice, that is a recurrence, full stop.
- Same process and same failure mode across part numbers catches systemic causes that a part-level view misses. If three different machined parts drift out of tolerance on the same operation, the operation is the problem, and counting those as unrelated hides it.
- Same root cause category is the broadest and is useful for management review trending, but it is too loose to grade an individual CAPA.
Pick the level deliberately and apply it consistently. The point is not to find the definition that produces the lowest number. It is to find the one that surfaces the patterns you actually need to act on.
Repeat Findings: The Audit and Customer View of Recurrence
Recurrence does not only show up on the production floor. It shows up in audit findings and in customer complaints, and those two views are often the first place a failing CAPA system becomes visible to someone with authority to act.
Track repeat audit findings against the same clause. If your internal audit or a customer audit writes a finding against, say, control of monitoring and measuring resources two cycles in a row, the corrective action from the first finding did not hold, regardless of what the closed CAPA says. Internal audit checklists are where these repeats get caught, so the finding history is worth trending as its own metric.
Track repeat customer complaints and repeat supplier corrective action requests the same way. A second SCAR to the same supplier for the same issue is a recurrence with a dollar figure attached. These external recurrences carry more weight than internal ones in management review because they represent escapes that a customer felt, which is the next metric.
Aging and Cycle Time, and Why On-Time Closure Lies
Cycle time and aging are worth tracking, but the standard way of presenting them, percent closed on time, actively misleads, because it treats a CAPA held open for a legitimate verification window the same as one that is overdue because nobody touched it.
Separate the two states. A CAPA that is open past its target date because the effectiveness verification window has not closed yet is doing exactly what it should. A CAPA that is open past its target date with no activity for three weeks is rotting. A single "overdue" bucket lumps them together and pressures owners to close the first kind early, which is precisely how you manufacture a recurrence.
Better to track cycle time by phase: time to containment, time to root cause, time in verification. Containment that takes two weeks is a problem. A verification phase that takes 30 days is usually correct and should not count against the owner. When you measure the phases separately, you can compress the parts that should be fast without crushing the part that needs to be slow. The goal of a tighter CAPA workflow is to shorten the cycle from weeks to days everywhere except the observation window, which has to run as long as the failure mode needs to reappear.
Reopen Rate and Escape Rate
Two more metrics close the loop on effectiveness.
Reopen rate is the percentage of closed CAPAs that had to be reopened. A reopened CAPA is the cleanest possible proof that the original verification was not real, because the system itself overturned the closure. If your reopen rate is climbing, the problem is upstream in how verification is being signed off, and the fix is objective acceptance criteria defined before the window opens, not more pressure on closure dates.
Escape rate is the percentage of failure modes that reached the customer before they were caught. It answers a question the internal metrics cannot: when corrective action fails, who feels it? An escape is the most expensive form of recurrence, and a CAPA system that is genuinely effective drives the escape rate toward zero even while internal detection stays busy.
Wiring the Metrics Into Management Review
None of this is optional under the standards, which is the part most teams miss. ISO 9001:2015 clause 9.3.2 lists the required inputs to management review, and item (c) explicitly includes nonconformities and corrective actions and the extent to which objectives have been met. IATF 16949 sharpens this in clause 9.3.2.1, which adds inputs covering process effectiveness, product conformance, and the cost of poor quality, and the management review is expected to act on the trend, not just receive it.
In plain terms: top management is required to look at whether corrective action is working, and "we closed 94 percent on time" does not satisfy that requirement. The slide that does carries a small, stable set of numbers:
- Recurrence rate, trended over at least four periods.
- Repeat findings, audit and customer, by clause or category.
- Aging distribution, with the verification-window holds separated from true overdue.
- Reopen rate.
- Escape rate.
Five metrics, each one a measure of whether problems stay solved. A review that sees these can make a decision. A review that sees open count and on-time percentage can only ask why the count went up.
Segment the Numbers or They Mislead
A single company-wide recurrence rate is almost useless because it averages away the signal. Segment every one of these metrics by source and by area before you trust it.
Split by source first: internal nonconformities, customer complaints, supplier SCARs, and audit findings behave differently and need different owners. A low internal recurrence rate sitting next to a high supplier SCAR recurrence rate tells you exactly where to spend the next quarter, and the blended average would have hidden that.
Then split by process area or product line. The recurrence rate for a mature, stable cell should be near zero, and if it is not, that cell has a specific problem worth a focused look. A new launch will run hotter, and that is expected. Mixing the two into one number lets a launch mask a chronic problem in a line that should be solved by now.
Where the Data Has to Come From
The reason most teams track activity instead of effectiveness is not that they prefer the wrong metrics. It is that the right ones require connecting data that lives in separate systems. Recurrence rate needs the CAPA records joined to the defect history. Repeat findings need the audit log joined to the CAPA log. Aging by phase needs timestamps the spreadsheet never captured. Pulling all of that together by hand every quarter is enough work that it does not happen, so the review gets the two numbers the system can produce on its own.
The fix is to run the corrective action workflow where the data already connects. The Correct module runs the 5 Why, Fishbone, and 8D analyses and keeps each CAPA linked to the affected control plan, PFMEA, and the defect that triggered it, so recurrence is a query rather than a memory. The Monitor dashboards trend the effectiveness metrics and the audit readiness view continuously instead of being rebuilt the week before management review. The capability evidence behind a verification comes from the statistical tools, so the Cpk that closes a CAPA is the same number the process is actually running. When the CAPA, the document cascade, and the SPC data sit in one workflow, the five metrics above stop being a quarterly assembly project and become a report you can open on any given day.
The One Thing to Change This Week
If your CAPA slide currently shows open count and on-time closure, add one metric: recurrence rate, with a written definition of what counts as the "same" failure mode and a window long enough for it to matter. Trend it for a quarter.
That single number will do something the activity metrics never could. It will tell you, with evidence, whether your corrective action system is removing causes or just closing records. Everything else on the effectiveness slide builds from there, but the recurrence rate alone is enough to change what management review is actually reviewing, from how busy the process is to whether it works.




