Probability
Continuous Distributions
Exponential Distribution

Exponential Distribution

The Exponential distribution is the continuous counterpart to the discrete Poisson distribution. While the Poisson distribution models the number of events in a fixed timeframe, the Exponential distribution models the time between those events.

Core Concepts

If a random variable XX follows an Exponential distribution, we write:

XExp(λ)X \sim \text{Exp}(\lambda)

Where:

  • λ\lambda (lambda) is the rate parameter (the exact same rate parameter used in the Poisson distribution!).

Probability Density Function (PDF)

The probability density function creates a rapidly decaying curve:

f(x;λ)={λeλxif x00if x<0f(x; \lambda) = \begin{cases} \lambda e^{-\lambda x} & \text{if } x \ge 0 \\ 0 & \text{if } x < 0 \end{cases}

Cumulative Distribution Function (CDF)

For continuous distributions, the CDF is often more useful than the PDF because it directly gives us the probability that the time between events will be less than or equal to xx.

F(x;λ)=1eλxF(x; \lambda) = 1 - e^{-\lambda x}

The "Memoryless" Property: The "Reset" Button

🧠

The Exponential distribution is the only continuous distribution that is memoryless. This is its most famous, and often most counter-intuitive, feature.

The Intuition

Imagine you are waiting at a bus stop where buses arrive according to an exponential distribution (a Poisson Process). You have already been waiting for 20 minutes, and no bus has come.

  • Human Intuition: "I've been waiting 20 minutes! Surely the bus is more likely to arrive in the next 5 minutes than it was when I first got here."
  • The Reality: The probability that the bus arrives in the next 5 minutes is exactly the same as it was when you first arrived. The system has "no memory" of your 20-minute wait.

This property implies there is no wear and tear. If a lightbulb's lifespan is exponential, a 10-year-old bulb is just as "healthy" as a brand new one.


The Mathematical Proof

To prove this, we use the Survival Function, which is P(X>t)P(X > t). For the exponential distribution:

P(X>t)=1F(t)=1(1eλt)=eλtP(X > t) = 1 - F(t) = 1 - (1 - e^{-\lambda t}) = e^{-\lambda t}

We want to show that the probability of waiting an additional time tt, given that we have already waited time ss, is the same as the original probability of waiting time tt.

Step 1: Set up the Conditional Probability
By the definition of conditional probability:

P(X>s+tX>s)=P(X>s+t and X>s)P(X>s)P(X > s + t \mid X > s) = \frac{P(X > s + t \text{ and } X > s)}{P(X > s)}

Step 2: Simplify the Numerator
If we know that X>s+tX > s + t, then XX must already be greater than ss. Therefore, the intersection of these two events is simply X>s+tX > s + t:

P(X>s+tX>s)=P(X>s+t)P(X>s)P(X > s + t \mid X > s) = \frac{P(X > s + t)}{P(X > s)}

Step 3: Plug in the Survival Function

eλ(s+t)eλs\frac{e^{-\lambda(s+t)}}{e^{-\lambda s}}

Step 4: Use Exponent Rules

eλseλteλs=eλt\frac{e^{-\lambda s} \cdot e^{-\lambda t}}{e^{-\lambda s}} = e^{-\lambda t}

Conclusion:

P(X>s+tX>s)=P(X>t)P(X > s + t \mid X > s) = P(X > t)

The "already waited" time ss has completely vanished from the equation.


The "Discrete" Cousin

The only other distribution with this property is the Geometric Distribution, which is essentially the "discrete version" of the Exponential (counting trials instead of measuring time).

Key Metrics

  • Expected Value (Mean): E[X]=1λE[X] = \frac{1}{\lambda}
    • (Intuition: If a bus arrives at a rate of λ=4\lambda = 4 times per hour, you expect to wait 14\frac{1}{4} of an hour (15 minutes) for the next bus).
  • Variance: Var(X)=1λ2\text{Var}(X) = \frac{1}{\lambda^2}

Real-World Examples

  1. Waiting Times: The time you wait until the next customer calls the support desk.
  2. Reliability Engineering: The amount of time until a mechanical part fails (assuming a constant failure rate).
  3. Radioactive Decay: The time until a radioactive atom decays.

Interactive Visualization

Use the slider below to adjust the rate parameter (λ\lambda). Notice how the density is highly concentrated near zero, meaning shorter wait times are always more probable than longer wait times. As the rate λ\lambda increases, the curve decays even faster.

Exponential Distribution

Adjust λ (rate). Higher λ concentrates probability near 0.

Rate Parameter (λ)
1.0

Test Your Knowledge

Example: Exponential Lifespan

The lifespan of a certain electronic component follows an exponential distribution with an average life of 4 years. What is the probability that a component fails within the first year?

View Step-by-Step Solution

First, find the rate parameter λ\lambda. Since the mean is 1/λ=41/\lambda = 4, then λ=0.25\lambda = 0.25.

The CDF of the exponential distribution is P(Xx)=1eλxP(X \le x) = 1 - e^{-\lambda x}.

We want P(X1)P(X \le 1): 1e0.25(1)=1e0.2510.7788=0.22121 - e^{-0.25(1)} = 1 - e^{-0.25} \approx 1 - 0.7788 = 0.2212

There is a 22.12% probability the component fails in the first year.