Standard deviation is a measure of how spread out a set of values is around its mean. It expresses, in the original units of the data, the typical distance of an observation from the average. A small standard deviation means values cluster tightly around the mean; a large standard deviation means they are widely dispersed. It is one of the most widely reported summary statistics in quantitative research because it captures variability that a mean alone conceals.
Standard deviation and the mean
Two datasets can share an identical mean yet behave very differently. Consider two classes whose mean test score is 70. In the first, scores fall between 68 and 72; in the second, they range from 40 to 100. Both means are 70, but the second class is far more variable. The standard deviation quantifies that difference, which is why reporting a mean without a measure of spread is incomplete.
Standard deviation is the square root of the variance. Variance is the average of the squared deviations of each value from the mean. Squaring removes negative signs and emphasises larger departures, but it also leaves variance in squared units. Taking the square root returns the figure to the original units, making standard deviation the more interpretable companion to the mean.
Population versus sample
The formula differs depending on whether the data represent an entire population or a sample drawn from one. The population standard deviation divides the sum of squared deviations by N, the number of values. The sample standard deviation divides by n minus 1 rather than n. This adjustment, known as Bessel’s correction, compensates for the tendency of a sample to underestimate the spread of the population it came from. Because most research analyses a sample and infers something about a wider population, the sample formula with n minus 1 is the one most often applied.
| Quantity | Divisor | Used when |
|---|---|---|
| Population standard deviation | N | Every member of the population is measured |
| Sample standard deviation | n − 1 | A sample is used to estimate the population |
The 68-95-99.7 rule
When data follow a normal (bell-shaped) distribution, standard deviation maps onto predictable proportions of the data. This is the empirical rule, often called the 68-95-99.7 rule. Approximately 68% of values fall within one standard deviation of the mean, about 95% fall within two standard deviations, and roughly 99.7% fall within three. These figures hold only for a normal distribution and are approximations for real data that merely resemble one; skewed or heavy-tailed distributions will not obey them.
| Range from the mean | Approximate share of data (normal distribution) |
|---|---|
| ±1 standard deviation | 68% |
| ±2 standard deviations | 95% |
| ±3 standard deviations | 99.7% |
A worked conceptual example
Suppose adult resting heart rates in a sample have a mean of 70 beats per minute and a standard deviation of 8. If the distribution is roughly normal, then about 68% of people in that sample have a resting rate between 62 and 78 (the mean plus or minus one standard deviation). About 95% fall between 54 and 86 (two standard deviations), and almost everyone, around 99.7%, falls between 46 and 94 (three standard deviations). A reading of 100 would lie more than three standard deviations above the mean and would therefore be unusual relative to this sample. Examining such extreme values links directly to outlier detection, a related step in data quality assessment.
Standard deviation versus standard error
A frequent source of confusion is the difference between standard deviation and standard error. Standard deviation describes the variability of individual observations in the data. The standard error of the mean describes the variability of the sample mean itself as an estimate of the population mean, and it equals the standard deviation divided by the square root of the sample size. Because dividing by the root of n shrinks it, the standard error is always smaller than the standard deviation and grows narrower as the sample grows.
The choice between them depends on what is being communicated. To describe how much individuals differ from one another, report the standard deviation. To express how precisely the mean has been estimated, report the standard error or, more informatively, a confidence interval. Reporting a standard error where a standard deviation is meant can mislead readers into thinking data are far less variable than they are. For practical reporting conventions, see the CASRAI author guidance and the CASRAI dictionary.
Frequently asked questions
Why divide by n minus 1 for a sample?
Dividing by n minus 1 corrects a bias: using the sample mean to centre the data slightly reduces the spread, so dividing by the smaller divisor produces an unbiased estimate of the population variance. This is Bessel’s correction.
Can standard deviation be negative?
No. It is a square root of an average of squared quantities, so it is always zero or positive. A standard deviation of zero means every value is identical to the mean.
Should I report standard deviation or standard error?
Report the standard deviation to describe variability among observations, and the standard error or a confidence interval to describe the precision of the mean. For wider context on variability and uncertainty, see our guide to confidence intervals and the reproducibility news category.