Probability
5. Inequalities & Limit Theorems
Chebyshev's Theorem

Chebyshev's Theorem (Inequality)

🛡️

Chebyshev's Theorem is the ultimate mathematical safety net. While the Empirical Rule (68-95-99.7) only works for beautiful, perfectly symmetrical Normal distributions, Chebyshev's Theorem works for literally any distribution in the universe (as long as it has a defined mean and variance).

The Problem with the Empirical Rule

We love the Normal distribution because it gives us precise guarantees: "95% of data will fall within 2 standard deviations of the mean."

But what if your data is severely right-skewed? What if it has two massive peaks (bimodal)? What if you don't even know what the shape is? You can no longer rely on the 68-95-99.7 rule.

This is where Russian mathematician Pafnuty Chebyshev steps in.

The Mathematical Guarantee

Chebyshev's Inequality guarantees a minimum bound on the proportion of data that must fall within kk standard deviations (σ\sigma) of the mean (μ\mu), for any k>1k > 1.

The formula states that the proportion of data falling within kk standard deviations is at least:

P(Xμ<kσ)11k2P(|X - \mu| < k\sigma) \ge 1 - \frac{1}{k^2}

Let's plug in some numbers to see what this actually means:

Standard Deviations (kk)Chebyshev's Minimum GuaranteeNormal Curve Guarantee (For comparison)
k=1k = 11(1/12)=1 - (1/1^2) = 0% (Not helpful)68.27%
k=2k = 21(1/22)=1 - (1/2^2) = 75%95.45%
k=3k = 31(1/32)=1 - (1/3^2) = 88.89%99.73%
k=4k = 41(1/42)=1 - (1/4^2) = 93.75%99.99%

What does this mean in plain English?

No matter how weird, distorted, or skewed your dataset is, at least 75% of all data points must live within 2 standard deviations of the mean. At least 88.8% must live within 3 standard deviations.

⚠️

The Trade-off: Because Chebyshev's theorem makes a universal guarantee for every shape of data, its bounds are very conservative. If you know your data is Normally distributed, use the Empirical Rule instead because it's much more precise.

Interactive Visualization

Below is a highly unnatural, "Bimodal" (two-peaked) distribution. Even for this bizarre shape, watch how Chebyshev's minimum guarantees hold true as you adjust the bounds!

Chebyshev’s inequality

A distribution-free bound: a guaranteed minimum fraction lies within k standard deviations.

Guaranteed minimum within ±2.0σ
75.0%
The shaded band shows the ±kσ region.
Standard deviations (k)
2.0
Chebyshev: ≥ 1 - 1/k²

Test Your Knowledge

Example: Applying Chebyshev's Bounds

A heavily skewed dataset of salaries has a mean of 50,000andastandarddeviationof50,000 and a standard deviation of 10,000. What is the minimum percentage of salaries that must fall between 20,000and20,000 and 80,000?

View Step-by-Step Solution

First, find how many standard deviations (kk) the bounds are from the mean.

  • 80,00050,000=30,00080,000 - 50,000 = 30,000, which is 3×σ3 \times \sigma.
  • 50,00020,000=30,00050,000 - 20,000 = 30,000, which is 3×σ3 \times \sigma. So k=3k = 3.

Chebyshev's formula: 11k21 - \frac{1}{k^2} 1132=119=890.88891 - \frac{1}{3^2} = 1 - \frac{1}{9} = \frac{8}{9} \approx 0.8889

No matter how skewed the salaries are, at least 88.89% of them must fall in this range.