Measure how spread out data is from the mean—both sample and population calculations.
Last updated: March 2026
(n−1 divisor)
(n divisor)
Standard deviation (σ for population, s for sample) measures how spread out data is from the average (mean). A small standard deviation means data clusters tightly around the mean. A large standard deviation means data is scattered far from the mean. It's the square root of variance and always expressed in the same units as the original data.
There are two formulas: population SD (if you have all data) uses divisor n; sample SD (if you have a subset) uses divisor n−1 (Bessel's correction). The n−1 adjustment in sample SD corrects for bias and provides an unbiased estimate of population SD. Use sample SD when analyzing a subset and want to estimate the population's spread.
Standard deviation is fundamental to statistics: it's used in confidence intervals (±1.96σ for 95% CI in normal distribution), hypothesis testing, and understanding data quality. In the 68-95-99.7 rule, approximately 68% of data falls within ±1σ, 95% within ±2σ, and 99.7% within ±3σ of the mean (for normal distributions).
Test Scores: Comparing Two Classes
Using n underestimates true population SD (bias). Using n−1 (Bessel's correction) corrects this bias. As n increases, the difference shrinks. For n > 30, sample and population SD are very similar.
SD = √(Variance). Variance is SD squared. Both measure spread, but SD is in original units (easier to interpret). Example: if scores have SD = 10, variance = 100.
Yes! SD is always positive or zero (only zero if all values are identical). This is because we square deviations, making them all non-negative, then take the square root.
For normal distributions: 68% of data is within ±1SD, 95% within ±2SD, 99.7% within ±3SD. This rule breaks down for non-normal data, but works well for approximately normal distributions.
Not directly if they have different means or units. Use Coefficient of Variation (CV = SD/mean × 100%) to compare spread of datasets with different averages or units.
Large outliers inflate SD significantly. If outliers are errors, remove them. If real, report SD but also mention median and interquartile range (IQR) as robust alternatives to mean and SD.