Probability
1. Foundations
Key Metrics

Key Metrics: The "Shape" of a Distribution

🧬

Visualizing a distribution curve is helpful, but statisticians rely on four mathematical "moments" to precisely describe the shape, center, and spread of any distribution without needing to draw it. Think of these as the statistical fingerprints of a random variable.


1. Expected Value (The Mean)

The Expected Value, denoted as E[X]E[X] or μ\mu (mu), is the theoretical long-run average of the random variable. It represents the "center of mass" or the balancing point of the distribution.

  • For Discrete variables: Multiply every possible outcome by its probability, and sum them up. E[X]=xp(x)E[X] = \sum x \cdot p(x)
  • For Continuous variables: Multiply every value by its probability density, and integrate across the entire range. E[X]=xf(x)dxE[X] = \int x \cdot f(x) dx
🎲

Intuition Note: The expected value does not have to be a value that can actually occur. The expected value of a 6-sided die roll is 3.53.5, even though you can never actually roll a 3.5!


2. Variance and Standard Deviation (The Spread)

While the mean tells you where the center is, the Variance (σ2\sigma^2) tells you how spread out or clustered the data is around that center.

Variance is the average of the squared differences from the Mean. We square the differences so that negative errors don't cancel out positive errors.

Var(X)=E[(Xμ)2]\text{Var}(X) = E[(X - \mu)^2]

Because variance is measured in squared units (e.g., "squared dollars" or "squared inches"), it is mathematically awkward to interpret. Therefore, we almost always take the square root to get the Standard Deviation (σ\sigma), which brings the spread back into the original units.

σ=Var(X)\sigma = \sqrt{\text{Var}(X)}

3. Skewness (The Lean)

Skewness measures the asymmetry of a distribution.

  • Zero Skew: The distribution is perfectly symmetrical (like the Normal distribution). The mean and median are perfectly identical.
  • Positive (Right) Skew: The distribution has a long "tail" stretching off to the right. The mass of the data is concentrated on the left. (e.g., Human income—most people earn an average amount, but a few billionaires stretch the tail infinitely to the right). Mean >> Median.
  • Negative (Left) Skew: The distribution has a long tail stretching to the left. (e.g., Human lifespan—most people live to old age, with a tail stretching down towards zero). Mean << Median.

Interactive Visualization: Skewness vs The Mean

Notice how the tail of a skewed distribution physically "drags" the Mean away from the Median!

Skewness: mean vs median

Drag the slider to skew a Beta distribution and compare mean vs median.

Positive (right) skew
Mean 0.20 • Median 0.18
Skew control
2.0
Right-skew → symmetric → left-skew
RightSymmetricLeft

4. Kurtosis (The Tails)

Kurtosis measures the "tailedness" of a distribution relative to a standard Normal distribution. It tells you about the frequency of extreme outliers.

  • Mesokurtic (Kurtosis \approx 3): Standard Normal distribution tails.
  • Leptokurtic (High Kurtosis): "Fat tails." The distribution produces extreme outliers far more frequently than a Normal distribution. Financial market returns are famously leptokurtic.
  • Platykurtic (Low Kurtosis): "Thin tails." The distribution produces fewer outliers. A Uniform distribution is highly platykurtic because it has absolutely zero probability of an outlier beyond its bounds.

Test Your Knowledge

Example: Expected Value

You play a game where you roll a fair 6-sided die. If you roll a 6, you win 10.Ifyourollanythingelse,youlose10. If you roll anything else, you lose 2. What is the Expected Value (Mean) of this game?

View Step-by-Step Solution

Expected Value is the sum of (Outcome ×\times Probability).

  • Outcome 1: Win 10withprobability10 with probability 1/6$
  • Outcome 2: Lose 2(whichis2 (which is -2) with probability 5/65/6

E[X]=(10×16)+(2×56)=106106=0E[X] = (10 \times \frac{1}{6}) + (-2 \times \frac{5}{6}) = \frac{10}{6} - \frac{10}{6} = 0

The Expected Value is $0. This is a perfectly "fair" game. Over the long run, you will neither win nor lose money.