Section 4.67 min read

Interpreting Effect Sizes in Context

Core summary

A number is not an interpretation. Whether an effect is small, large, or meaningful depends on the baseline risk, the seriousness of the outcome, the cost and harms, and the smallest difference that matters to patients.

Detailed explanation

This lesson ties the module together by asking how to judge whether an effect actually matters. The first move is to read relative and absolute measures together. A large relative effect (a risk ratio, odds ratio, or hazard ratio far from 1) on a rare outcome can be clinically trivial, while a modest relative effect on a common, serious outcome can be hugely valuable. The absolute measures, the absolute risk reduction, number needed to treat, and mean difference, anchor the relative ones in real-world stakes. Second, compare the effect to the minimal clinically important difference (MCID), the smallest change patients actually notice or that would change management. A statistically significant 1 mmHg drop, or a 1-point shift on a 100-point scale, may fall below the MCID and simply not matter. Third, weigh the effect against harms and costs: a small benefit can still be worthwhile for a cheap, safe drug, while a 'large' benefit may not justify a toxic or expensive one. Fourth, consider the outcome's importance and durability, since an effect on a hard outcome such as death or stroke outweighs the same-sized effect on a surrogate marker, and a lasting benefit matters more than a transient one. Finally, never read an effect size without its confidence interval. A wide interval means the true effect could range from trivial to large, so even an impressive point estimate may be uncertain, while a narrow interval around a meaningful effect is genuinely persuasive. Putting these together, relative plus absolute, measured against the MCID, weighed against harms and costs, for an outcome that matters, and read with its confidence interval, turns a statistical result back into a clinical decision. This judgment, not the p-value, is what should determine whether evidence changes practice.

Clinical example

A drug shows a statistically significant 0.3 percentage-point absolute reduction in a mild outcome but causes frequent side effects; despite significance, the tiny ARR and the harms make it clinically unattractive.

Research example

Reviewers debate a trial with a hazard ratio of 0.8 (significant) for a serious outcome; the absolute survival gain and a narrow confidence interval tip them toward clinical importance despite the 'modest' relative effect.

Knowledge check

Q1. A large relative risk increase is reported for a very rare outcome. The clinical impact is likely:

Q2. What does the minimal clinically important difference (MCID) help you judge?

Q3. Why must an effect size be read with its confidence interval?