What empathy assessment still isn't capturing — and why the gap between what we measure and what actually changes behaviour costs organisations their entire empathy training investment.
Organisations that commission empathy assessments, design training programmes, and measure outcomes are only measuring what their instruments are capable of measuring. If those instruments have significant gaps — and the evidence reviewed in this paper demonstrates that they do — then the investment may be considerably less effective than the data suggests. Worse, it may be optimising for the wrong thing entirely.
When an organisation uses a pre/post empathy assessment to evaluate a training programme, it is asking: did this intervention produce a measurable change in empathy capability? If the instrument does not measure the dimension of empathy that the training was designed to develop, the answer it returns is meaningless. Not wrong — meaningless. It is measuring something real, just not the right thing.
The absence of a gold standard for empathy measurement is not a minor technical inconvenience. It means that every empathy training programme built on a self-report baseline is measuring the dimension of empathy that people can most easily misrepresent — and ignoring the dimension that most directly determines whether they behave differently when it matters.
Systematic reviews consistently find no gold standard. Researchers consistently identify the gap between what instruments measure and what empathy requires. The field knows the instruments are incomplete. It has continued to use them because the alternatives are harder, slower, and more expensive to develop. That situation is now commercially costly in a way it was not before.
A fair account of these tools requires acknowledging both. They are not bad instruments. They are incomplete ones — and that incompleteness has a direct and costly consequence for organisations that use them as the basis for investment decisions.
| Instrument | What It Measures | Dimension Coverage | Key Limitation |
|---|---|---|---|
| IRI (1980) | Self-reported cognitive and emotional empathic tendencies across four subscales |
✔ Cognitive
✔ Emotional
✘ Physical
|
Self-report only; no significant correlation with actual empathic accuracy; physical dimension entirely absent |
| EQ (2004) | Self-reported cognitive drive and affective response to others' emotional states |
✔ Cognitive
✔ Emotional
✘ Physical
|
Designed for clinical floors; self-report bias; poor sensitivity at development range; physical dimension absent |
| RMET (2001) | Ability to identify mental states from photographs of the eye region of faces |
✔ Cognitive (recognition)
✘ Emotional
✘ Physical
|
Static images; poor internal consistency; measures recognition not response; no ecological validity for real interaction |
| Perth Empathy Scale (2023) | Cognitive and affective empathy across positive and negative emotional valence |
✔ Cognitive
✔ Emotional
✘ Physical
|
Most recent and rigorous self-report scale; still self-report; physical dimension not addressed |
"Most existing measures of empathy rely on self-reports of dispositional tendencies or assess subjective or physiological responses to static images; consequently, they fail to assess the ability to monitor rapidly changing social cues, a skill that is very important in navigating real-life social interactions."Zaki & Ochsner, 2011 · cited in multiple systematic reviews
When organisations design empathy development programmes, they design them to produce measurable outcomes. What is measurable determines what is prioritised. What is prioritised determines what gets developed. If the available instruments do not measure the physical dimension, the physical dimension does not become a programme objective.
This gap became evident through the training work itself. Participants in physical empathy programmes consistently report changes they cannot fully articulate — a shift in how they are with other people, a new quality of attunement, a change in what they notice and how they respond. These are real changes. The neural circuits underlying physical attunement are plastic. The problem is not that the training doesn't work. The problem is that no instrument currently exists to measure what it produces.
The design principles for a complete empathy audit are clear from the literature, even if a fully validated instrument does not yet exist. A valid empathy assessment needs to meet four psychometric criteria that current tools repeatedly fail to satisfy.
"The question is not whether empathy can be measured. The question is whether we are measuring what empathy actually does."Stuart Nolan · Stuart Nolan Consulting
Founding Partner organisations receive early access to the proprietary Empathy Audit methodology, genuine influence over its development, and permanently preferred pricing. Download the full white paper for the complete review of measurement tools, the systematic review evidence, and the development roadmap. Or book a 30-minute discovery call.