Section 4.27 min read

Descriptive vs Inferential Statistics

Core summary

Descriptive statistics summarize your data (means, percentages, tables). Inferential statistics draw conclusions about a larger population from your sample (confidence intervals, p-values, hypothesis tests).

Detailed explanation

Statistics in research serves two fundamental purposes. Descriptive statistics describe what your data look like. Inferential statistics use your data to make claims about a broader population. Descriptive statistics include measures of central tendency (mean, median, mode), measures of spread (standard deviation, interquartile range, range), frequencies and percentages, and visual displays (histograms, box plots, bar charts). Table 1 in most clinical papers is pure descriptive statistics — it summarizes the study population. Inferential statistics go beyond description. They use sample data to estimate what is likely true in the larger population, accounting for the uncertainty inherent in sampling. Key inferential tools include confidence intervals (estimating a range for the true population value), p-values (testing whether an observed difference is likely due to chance), and hypothesis tests (formal decisions about the null hypothesis). The distinction matters because descriptive statistics alone cannot answer research questions. Saying 'the treatment group had a lower infection rate' is descriptive. Saying 'the treatment group had a significantly lower infection rate (RR 0.6, 95% CI 0.4-0.8, p=0.002)' is inferential — it claims the difference is unlikely due to chance and estimates the true effect. Every research paper uses both types. You describe your sample (Table 1) and then infer conclusions about the population (Results section). Understanding the distinction helps you read papers critically and plan your own analyses.

Clinical example

Table 1 of an RCT shows: mean age 55 years, 60% male, mean BMI 28. These are descriptive statistics — they describe who was in the study. The Results section shows: treatment reduced mortality by 25% (HR 0.75, 95% CI 0.62-0.90, p=0.003). This is inferential — it estimates the treatment effect in the broader population.

Research example

In the RECOVERY trial, descriptive statistics showed that 2,104 patients received dexamethasone and 4,321 received usual care. Inferential statistics showed that dexamethasone reduced 28-day mortality (rate ratio 0.83, 95% CI 0.75-0.93, p<0.001) — a conclusion about the effect in the broader population of COVID-19 patients.

Knowledge check

Q1. Which of the following is an example of INFERENTIAL statistics?

Q2. Table 1 in a clinical paper typically contains inferential statistics.

Q3. Descriptive statistics are used to: