Probability
3. Continuous Distributions
Normal Distribution

Normal (Gaussian) Distribution

🔔

The Normal distribution, often called the "bell curve," is the single most important probability distribution in statistics. It appears everywhere in nature and forms the backbone of machine learning, hypothesis testing, and inferential statistics.

Core Concepts

If a random variable XX follows a Normal distribution, we write:

XN(μ,σ2)X \sim \mathcal{N}(\mu, \sigma^2)

Where:

  • μ\mu (mu) is the mean (which dictates the center of the bell).
  • σ2\sigma^2 (sigma squared) is the variance (which dictates the spread). The square root, σ\sigma, is the standard deviation.

Probability Density Function (PDF)

🤯

Don't panic! The equation below looks terrifying, but it perfectly defines the elegant, smooth shape of the bell curve.

f(x)=1σ2πe12(xμσ)2f(x) = \frac{1}{\sigma\sqrt{2\pi}} e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2}

While complex, let's break down why this works mathematically:

  • The e()2e^{-(\dots)^2} part creates the exponential decay that forms the "bell" shape.
  • The xμσ\frac{x-\mu}{\sigma} part centers the curve at μ\mu and scales its width by σ\sigma.
  • The 1σ2π\frac{1}{\sigma\sqrt{2\pi}} part is a normalizing constant. It ensures that the total area under the entire curve equals exactly 1.01.0.

The Empirical Rule (68-95-99.7)

One of the most useful heuristics in statistics applies to all Normal distributions:

  • 68.27%\approx 68.27\% of data falls within 1 standard deviation (±1σ\pm 1\sigma) of the mean.
  • 95.45%\approx 95.45\% of data falls within 2 standard deviations (±2σ\pm 2\sigma) of the mean.
  • 99.73%\approx 99.73\% of data falls within 3 standard deviations (±3σ\pm 3\sigma) of the mean.

Real-World Examples

The Normal distribution is everywhere because of the Central Limit Theorem (covered later).

  1. Human Heights: The heights of adult males in a country cluster around a mean and taper off symmetrically.
  2. Measurement Error: The tiny errors a machine makes when precisely measuring weights or lengths.
  3. Test Scores: Standardized tests like the SAT or GRE are explicitly designed to force scores into a normal distribution.

Interactive Visualization

Below is an interactive graph.

  • Drag the Mean (μ\mu) to shift the entire curve left or right along the x-axis. The shape does not change.
  • Drag the Standard Deviation (σ\sigma) to see how variance impacts the curve. A smaller σ\sigma makes the curve tall and spiked (data is highly predictable). A larger σ\sigma makes the curve flat and wide (data is highly spread out).

Normal Distribution

Drag μ and σ to shift the curve and change its spread.

Mean (μ)
0.0
Standard Deviation (σ)
1.0

Test Your Knowledge

Example: Normal Curve Empirical Rule

A population of adult male heights is normally distributed with a mean μ=70\mu = 70 inches and a standard deviation σ=3\sigma = 3 inches. What percentage of men are between 64 inches and 76 inches tall?

View Step-by-Step Solution

Notice that 64 inches is exactly μ2σ\mu - 2\sigma (702(3)=6470 - 2(3) = 64). Notice that 76 inches is exactly μ+2σ\mu + 2\sigma (70+2(3)=7670 + 2(3) = 76).

According to the Empirical Rule (68-95-99.7), approximately 95% of normally distributed data falls within 2 standard deviations of the mean.