Are You Measuring the Wrong Success?

Introduction

A good question does not merely ask whether something is “better”. It defines what better means before evidence is collected. If the outcome is left vague, weak answers can appear convincing because they measure an easier, more flattering result than the one that actually matters. A programme that participants enjoy, a search engine that feels helpful, or a treatment that improves a laboratory measurement may all look successful while failing to improve real-world performance.

Outcomes illustration 1 Choosing the right outcome measure is therefore one of the most effective ways to improve analytical thinking. It prevents evidence from drifting towards convenient proxies, ensures that competing claims are judged against the same standard, and makes it much harder for impressive-looking but irrelevant evidence to win the argument.

Why “better” needs a measurable meaning

Many disputes are not really about the evidence. They are about which outcome counts as success.

Consider the question, “Does this writing course work?” Depending on the chosen outcome, the answer could legitimately be:

Yes, because learners rated it highly.
Yes, because test scores increased immediately afterwards.
No, because workplace writing quality did not improve six months later.
No, because productivity stayed unchanged despite improved knowledge.

All of these findings could be true simultaneously because they measure different outcomes.

This is why evidence-based disciplines ask researchers to specify a primary outcome before collecting data. In clinical research, for example, studies distinguish between primary outcomes (the main question the trial is designed to answer) and secondary outcomes that provide supporting information. Changing emphasis after seeing the results creates opportunities for selective reporting and exaggerated claims. [U.S. Food and Drug Administration]fda.govFood and Drug AdministrationGuidance for Industry:Patient-Reported Outcome MeasuresDecember 8, 2009 — For example, a clinical trial may i…Published: December 8, 2009

The practical lesson extends well beyond science: define the outcome before looking for evidence, not afterwards.

Satisfaction versus performance outcomes

One of the most common mistakes is treating satisfaction as though it proves effectiveness.

A positive experience certainly matters. Happy customers return. Engaged students are more likely to continue learning. Patients deserve respectful care. However, satisfaction measures something different from performance.

Training evaluation illustrates this distinction particularly well. The widely used Kirkpatrick framework separates four increasingly demanding levels:

Reaction: Did participants like the experience?
Learning: Did they acquire knowledge or skills?
Behaviour: Did they actually change what they do?
Results: Did those behavioural changes improve organisational or practical outcomes? [Kirkpatrick Partners, LLC.+2PMC]kirkpatrickpartners.comKirkpatrick Partners, LLC.The Kirkpatrick ModelThe Kirkpatrick Model, or four levels of training evaluation, consists of Reaction, Learni…

The important insight is that success at one level does not guarantee success at the next.

For example:

A workshop may receive excellent feedback while producing no lasting behaviour change.
Employees may pass a knowledge test yet continue using old working habits.
New procedures may be adopted but fail to improve customer outcomes.

Reaction measures are valuable because they can identify delivery problems or barriers to engagement. They become misleading only when they are presented as evidence that the intervention itself achieved its intended purpose.

When proxy outcomes replace the real outcome

Weak answers often rely on proxy or surrogate outcomes—measurements that are easier or quicker to collect than the result people actually care about.

Sometimes proxies are necessary. Waiting years for the ultimate outcome may be impractical. However, proxies deserve confidence only when there is strong evidence that improving the proxy reliably predicts improvement in the real outcome.

Healthcare provides a clear example.

Researchers sometimes measure blood pressure, cholesterol or tumour shrinkage because waiting for differences in survival or quality of life would take much longer. These surrogate endpoints are useful only when evidence shows that changes in the surrogate consistently predict meaningful patient benefits. Regulators and health technology assessment organisations therefore treat surrogate outcomes cautiously rather than assuming they always represent genuine clinical improvement. [Wikipedia+2Value in Health]WikipediaSurrogate endpointSurrogate endpoint

The same reasoning applies outside medicine.

Suppose an organisation claims that introducing an AI search assistant improved decision quality because staff completed searches more quickly. Faster searching is a legitimate measure, but it is only a proxy. If users now make more errors, overlook contradictory evidence, or become overconfident, the faster searches may represent worse decisions rather than better ones.

The closer an outcome is to the decision-maker’s real objective, the harder it becomes to create misleading success stories.

Outcomes illustration 2

Measuring what matters to the people affected

Another protection against weak evidence is ensuring that outcomes reflect the perspective of the people experiencing the intervention rather than only the organisation delivering it.

Healthcare increasingly incorporates patient-reported outcome measures (PROMs) because laboratory values alone cannot fully describe whether patients actually feel better, function better or experience improved quality of life. Regulators and researchers increasingly recognise that patient-centred outcomes provide information unavailable from clinical measurements alone. [PMC+2dcricollab.dcri.duke.edu]pmc.ncbi.nlm.nih.govThe importance of patient-reported outcomes in clinical trials…by R Mercieca-Bebber · 2018 · Cited by 759 — Patient-reported outcom…

This principle generalises to many decisions.

A transport project should not be judged only by construction speed if travellers ultimately care about journey reliability.

An education programme should not be judged only by attendance if learners ultimately need durable competence.

A customer service system should not be judged only by shorter call durations if customers still require repeat contacts.

The best primary outcome reflects the benefit that motivated the decision in the first place.

How to choose the main success measure

Before searching for evidence, ask a small set of disciplined questions.

What final outcome actually matters? [facebook.com]facebook.comn what patients actually experience during and after…

Name the result you ultimately want, not merely something associated with it.

Who experiences that outcome?

Different stakeholders often value different results. Decide whose perspective is primary.

Can the outcome be measured objectively?

Vague ideas such as “better quality” become more useful when translated into observable measures.

Is this a direct outcome or only a proxy?

If using a proxy, ask what evidence links it to the real objective.

When should success be measured?

Immediate improvements may disappear over time, while delayed benefits may not appear in short evaluations.

Answering these questions before collecting evidence sharply reduces the chance of accepting persuasive but irrelevant findings.

Outcomes illustration 3

Practical warning signs that the wrong outcome is being used

Several patterns frequently indicate that evidence is stronger in appearance than in substance.

The measured outcome is easier than the real goal. Counting clicks instead of completed tasks, or attendance instead of learning.
Only favourable secondary outcomes are highlighted. The main intended outcome receives little attention or is omitted.
The outcome changes during the discussion. Claims quietly shift from effectiveness to popularity, from quality to speed, or from long-term results to short-term gains.
The reported improvement lacks practical significance. A statistically detectable change may still be too small to matter in everyday use.
The measure rewards the metric rather than the purpose. Once people optimise the score itself, the score may stop representing genuine success.

These warning signs do not automatically invalidate evidence, but they should prompt careful examination of whether the reported outcome genuinely answers the original question.

Better outcome measures produce better thinking

Strong analytical thinking depends as much on selecting the right outcome as on evaluating the evidence itself. When the desired result is clearly defined before searching begins, impressive anecdotes, satisfaction surveys and convenient proxy measures lose much of their persuasive power.

The most reliable evidence answers the question that actually matters—not merely the question that happened to be easiest to measure.

Amazon book picks

Marketplace Samples

Live-tested eBay searches with available results related to this page.

Example eBay listing

Motivational Quote Poster Wall Art Gym Office Bedroom Home Decor A4 A3 A5

Search eBay.co.uk: are poster

Browse similar on eBay.co.uk

Example eBay listing

Motivational Quote Poster Wall Art Gym Office Bedroom Home Decor A4 A3 A5

Search eBay.co.uk: are poster

Browse similar on eBay.co.uk

Example eBay listing

Classic Great Movie Posters Wall Art Poster Prints Collection 2

Search eBay.co.uk: are poster

Browse similar on eBay.co.uk

Example eBay listing

Music These Are Difficults Times Fr Framed Wall Art Poster Canvas Print Picture

Search eBay.co.uk: are poster

Browse similar on eBay.co.uk

Browse more on eBay.co.uk

Example items shown for inspiration; availability and pricing can change. Branchoria may earn a commission if you purchase through outbound eBay links.

Endnotes

Source: fda.gov
Link: https://www.fda.gov/media/77832/download
Source snippet
Food and Drug AdministrationGuidance for Industry:Patient-Reported Outcome MeasuresDecember 8, 2009 — For example, a clinical trial may i...

Published: December 8, 2009
Source: Wikipedia
Title: Outcome measure
Link: https://en.wikipedia.org/wiki/Outcome_measure
Source: pmc.ncbi.nlm.nih.gov
Title: This level seeks to determine if an increase in company
Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC3070232/
Source snippet
Employing Kirkpatrick's Evaluation Framework to Determine...by DN Rouse · 2011 · Cited by 274 — Kirkpatrick's model stresses evaluati...
Source: Wikipedia
Title: Surrogate endpoint
Link: https://en.wikipedia.org/wiki/Surrogate_endpoint
Source: pmc.ncbi.nlm.nih.gov
Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC6219423/
Source snippet
The importance of patient-reported outcomes in clinical trials...by R Mercieca-Bebber · 2018 · Cited by 759 — Patient-reported outcom...
Source: dcricollab.dcri.duke.edu
Title: Patient-Reported Outcomes
Link: https://dcricollab.dcri.duke.edu/sites/NIHKR/KR/PRO%20Resource%20Chapter.pdf
Source snippet
Introduction. This resource white paper was developed in 2014 to introduce clinical researchers to the definition and use of patient-repo...
Source: hspr.cuhk.edu.hk
Title: Concept of Patient Reported Outcomes for Attendees update
Link: https://hspr.cuhk.edu.hk/wp-content/uploads/Concept-of-Patient-Reported-Outcomes_for-Attendees_update.pdf
Source snippet
Concept of Patient Reported Outcomes13 Jul 2025 — Clinical outcomes are cornerstone evidence for evaluating therapeutic benefits and risk...
Source: kirkpatrickpartners.com
Link: https://www.kirkpatrickpartners.com/the-kirkpatrick-model/
Source snippet
Kirkpatrick Partners, LLC.The Kirkpatrick ModelThe Kirkpatrick Model, or four levels of training evaluation, consists of Reaction, Learni...
Source: valueinhealthjournal.com
Link: https://www.valueinhealthjournal.com/article/S1098-3015%2826%2900050-1/fulltext
Source snippet
Methods for Evaluation of Surrogate Endpoints...by S Bujkiewicz · 2026 · Cited by 1 — This ISPOR Task Force report addresses how surroga...

Additional References

Source: klarahr.com
Link: https://klarahr.com/en/resources/blog/the-kirk-patrick-modele-valuating-the-impact-of-your-training-programmes-across-four-levels
Source snippet
The Kirkpatrick Model: Evaluating Your Training Programmes4 Oct 2022 — The Kirkpatrick model is a professional training evaluation framew...
Source: employment-studies.co.uk
Link: https://www.employment-studies.co.uk/system/files/resources/files/392.pdf
Source snippet
might imply about the process of learning and changing behaviour, and we also review some of the research evidence on training...Read more...
Source: educationaltechnology.net
Link: https://educationaltechnology.net/kirkpatrick-model-four-levels-learning-evaluation/
Source: ispor.org
Title: task force addresses critical gap in surrogate endpoint guidance
Link: https://www.ispor.org/heor-resources/news-top/news/2026/05/28/ispor-task-force-addresses-critical-gap-in-surrogate-endpoint-guidance
Source snippet
ISPOR Task Force Addresses Critical Gap in Surrogate...28 May 2026 — The report will outline different methods to deal with the use of s...

Published: May 2026
Source: europepmc.org
Link: https://europepmc.org/article/pmc/3070232
Source snippet
Employing Kirkpatrick's evaluation framework to determine...by DN Rouse · 2011 · Cited by 274 — Kirkpatrick's model stresses evaluation...
Source: esmed.org
Title: evaluating medical curriculum kirkpatricks approach
Link: https://esmed.org/evaluating-medical-curriculum-kirkpatricks-approach/
Source snippet
Evaluating Medical Curriculum: Kirkpatrick's Approachby M Kusmiati · 2025 · Cited by 1 — It outlines the four levels of evaluation define...
Source: wecanadvocate.eu
Link: https://wecanadvocate.eu/academy/patient-reported-outcomes-and-other-patient-relevant-measures-and-endpoints/
Source snippet
mes to record the patient's lived experience and burden of a disease.Read more...
Source: unife.it
Title: yardley kirkpatricks levels and education evidence
Link: https://www.unife.it/medicina/lm.riabilitazione/studiare/minisiti/pianificazione_gestione_proc-edu/modulo-di-processi-educativi-delladulto/2016-17/yardley_kirkpatricks-levels-and-education-evidence.pdf
Source snippet
Kirkpatricks levels and education evidenceby S Yardley · 2011 · Cited by 636 — This study aims to review, criti- cally, the suitability o...
Source: facebook.com
Link: https://www.facebook.com/EuropeanCancer/posts/patient-reported-outcome-measures-proms-can-help-cancer-professionals-reflect-on/1461357926004588/
Source snippet
n what patients actually experience during and after...
Source: youtube.com
Title: Primary vs Secondary outcomes
Link: https://www.youtube.com/watch?v=pXesXYmScgI
Source snippet
Hard Facts, Dangerous Half-Truths, and Total Nonsense by Jeffrey Pfeffer: 6 Minute Summary...

Are You Measuring the Wrong Success?

Introduction

Why “better” needs a measurable meaning

Satisfaction versus performance outcomes

When proxy outcomes replace the real outcome

Measuring what matters to the people affected

How to choose the main success measure

Practical warning signs that the wrong outcome is being used

Better outcome measures produce better thinking

Further Reading

How to Measure Anything

The Tyranny of Metrics

Thinking, Fast and Slow

The Book of Why

Marketplace Samples

Motivational Quote Poster Wall Art Gym Office Bedroom Home Decor A4 A3 A5

Motivational Quote Poster Wall Art Gym Office Bedroom Home Decor A4 A3 A5

Classic Great Movie Posters Wall Art Poster Prints Collection 2

Music These Are Difficults Times Fr Framed Wall Art Poster Canvas Print Picture

Endnotes

Additional References

Follow this branch

Parent topic

Related pages 5