Standard Deviation (represented by the Greek letter sigma, σ) is one of the most important concepts in statistics. While the Mean (average) tells you where the center of the data is, the Standard Deviation tells you how spread out the numbers are from that center.
[Image of low vs high standard deviation graph]- Low Standard Deviation: The data points are clustered closely around the mean (Consistent).
- High Standard Deviation: The data points are spread out over a wider range (Volatile).
1. Why is it useful?
Imagine two classes took a test. Both classes had an average score of 80%.
- Class A (Low SD): Almost everyone scored between 78% and 82%.
- Class B (High SD): Some scored 100%, some scored 60%.
The average is the same, but the reality is very different. Standard Deviation gives you that reality.
2. The Formula
The calculation differs slightly depending on whether you have data for an entire Population or just a Sample.
[Image of standard deviation formula]Where:
- x = Each value in the dataset.
- μ = The Mean of the values.
- N = Total number of values.
- Σ = Summation (Add them up).
Note: If calculating for a Sample (a small part of a larger group), divide by N-1 instead of N.
3. Step-by-Step Calculation
Let's calculate the Standard Deviation for this simple dataset: 3, 4, 5, 6, 7.
[Image of standard deviation calculation table]Step 1: Find the Mean (μ)
(3 + 4 + 5 + 6 + 7) / 5 = 25 / 5 = 5.
Step 2: Find the Deviation (Difference) for each number
- 3 - 5 = -2
- 4 - 5 = -1
- 5 - 5 = 0
- 6 - 5 = 1
- 7 - 5 = 2
Step 3: Square each Deviation
We square them to get rid of negative numbers.
- (-2)² = 4
- (-1)² = 1
- 0² = 0
- 1² = 1
- 2² = 4
Step 4: Find the Mean of these Squares (Variance)
Sum = 4 + 1 + 0 + 1 + 4 = 10.
Variance = 10 / 5 = 2.
Step 5: Take the Square Root
Standard Deviation (σ) = √2 ≈ 1.41.
4. The Bell Curve (Normal Distribution)
Standard deviation is the ruler for the famous "Bell Curve." In a normal distribution:
[Image of normal distribution empirical rule]- 68% of values fall within 1 SD of the mean.
- 95% of values fall within 2 SDs of the mean.
- 99.7% of values fall within 3 SDs of the mean.
Conclusion
Standard Deviation is the ultimate tool for measuring consistency. Whether checking the volatility of a stock price or the reliability of a machine part, knowing the "sigma" tells you exactly how much variation to expect.