Continuous Probability Distributions: A Note

▶ Table of Contents

I. Foundations: Continuous vs. Discrete
II. Continuous Distributions: Explained
III. A Quick Reference
IV. Example
V. The Math Behind the Distributions

Continuous probability distributions describe variables that can take on any value within a given range, unlike discrete distributions which deal with countable values.

I. Foundations: Continuous vs. Discrete

The key difference: continuous distributions deal with things you measure (like height, weight, or temperature), while discrete distributions deal with things you count (like the number of heads in coin flips). A continuous random variable can take on any value within a given range (possibly infinite).

II. Continuous Distributions: Explained

Let’s explore some common continuous distributions, focusing on what they describe, when to use them, and how to identify them.

A. Normal Distribution

What it describes: The "bell curve" – a symmetrical distribution where most values cluster around the mean (average), and values farther from the mean become less likely. Key parameters: μ (mean), σ (standard deviation).

When to use it:

Heights or weights of people.
Test scores (often, though not always perfectly).
Errors in measurements.
Many natural phenomena.

How to identify: Look for a bell-shaped, symmetrical distribution. The mean, median, and mode are all equal. The distribution is described entirely by its mean (μ) and standard deviation (σ).

B. Standard Normal Distribution

What it describes: A special case of the normal distribution where the mean (μ) is 0 and the standard deviation (σ) is 1. Often denoted by the variable Z.

When to use it: Crucial for calculating probabilities related to *any* normal distribution. By converting a normal random variable to a standard normal variable (a process called "standardization"), we can use standard normal tables (or software) to find probabilities.

How to identify: It's a normal distribution, but specifically with μ = 0 and σ = 1. Often, problems will involve finding probabilities for "Z-scores."

C. Normal Approximations

What it describes: The normal distribution can be used to approximate other probability distributions (Binomial and Poisson) under certain conditions, simplifying calculations.

C1. Normal Approximation to the Binomial

When to use it: When the number of trials (n) in a binomial distribution is large, and the probability of success (p) is not too close to 0 or 1. A common rule of thumb is that both np > 5 and n(1-p) > 5. A *continuity correction* is typically applied to improve the accuracy.

How to identify: You have a binomial situation (fixed trials, two outcomes, independent trials, constant probability), but calculating the exact binomial probabilities would be cumbersome (due to large n).

Continuity Correction: Since the binomial distribution is discrete and we are approximating with the continuous normal distribution, we improve the approximation via the continuity correction. For example:

$P(X = x)$ becomes $P(x - 0.5 < X < x + 0.5)$
$P(X \le x)$ becomes $P(X < x + 0.5)$
$P(X < x)$ becomes $P(X < x - 0.5)$
$P(X \ge x)$ becomes $P(X > x - 0.5)$
$P(X > x)$ becomes $P(X > x + 0.5)$

C2. Normal Approximation to the Poisson

When to use it: When the average rate (λ) of events in a Poisson distribution is relatively large (typically λ > 5). A *continuity correction* is typically applied to improve the accuracy.

How to identify: You have a Poisson situation (events occurring in an interval, independent events, constant average rate), but calculating exact Poisson probabilities would be difficult (due to large λ).

Continuity Correction: Similar to the binomial approximation, use these adjustments:

$P(X = x)$ becomes $P(x - 0.5 < X < x + 0.5)$
$P(X \le x)$ becomes $P(X < x + 0.5)$
$P(X < x)$ becomes $P(X < x - 0.5)$
$P(X \ge x)$ becomes $P(X > x - 0.5)$
$P(X > x)$ becomes $P(X > x + 0.5)$

D. Exponential Distribution

What it describes: The time (or distance) *between* events in a Poisson process (where events occur at a constant average rate). Key parameter: λ (the same rate parameter as in the corresponding Poisson distribution).

When to use it:

Time until the next customer arrives at a store.
Time until a light bulb fails.
Time between calls to a call center.
Distance between defects on a long wire.

How to identify: You're measuring time (or distance) *between* events, and those events follow a Poisson process (independent, constant average rate). The distribution is skewed to the right (not symmetrical).

Lack of Memory Property: A key characteristic of the exponential distribution. It means that the probability of an event occurring in the future doesn't depend on how much time has already passed. For example, if a light bulb has an exponential lifetime, the probability it burns out in the next hour is the same whether the bulb is brand new or has been used for 1000 hours.

III. A Quick Reference

Distribution	What are you measuring?	Symmetrical?	Key Parameters	Approximates…
Normal	Any continuous variable with a bell-shaped distribution	Yes	`μ`, `σ`	Binomial, Poisson
Standard Normal	Z-scores (standardized normal values)	Yes	`μ = 0`, `σ = 1`
Exponential	Time (or distance) between Poisson events	No	`λ`

IV. Example

Let’s imagine customers arriving at a coffee shop. We can ask several questions that relate to different continuous distributions:

Normal: The time a barista spends making a single latte is normally distributed with a mean of 2 minutes and a standard deviation of 0.5 minutes. What’s the probability a latte takes more than 2.5 minutes to make? This is a normal distribution problem because we are told the variable (time to make a latte) is normally distributed.
Standard Normal: Continuing from the previous example, what’s the Z-score for a latte that takes 2.75 minutes to make? We’re working with a standardized value derived from a normal distribution.
Normal Approximation to Binomial: In a busy hour, the shop sells 300 lattes. The probability a customer asks for soy milk is 0.2. What’s the probability that more than 70 lattes will be made with soy milk? While this is technically a binomial problem (fixed number of trials, two outcomes), the large number of trials makes the normal approximation convenient.
Normal Approximation to Poisson: On average, 60 customers enter the coffe shop per hour. What is the probability less than 50 customers enter the shop during the next hour. While this is technically a poisson problem (fixed number of event occuring, fixed rate), the large number of rate makes normal approximation convenient.
Exponential: The time between customer arrivals follows an exponential distribution with an average of 1 minute between arrivals (λ = 1 arrival per minute). What’s the probability that more than 2 minutes pass between two consecutive customer arrivals? This is an exponential problem because we’re measuring time between events in a Poisson process.

V. The Math Behind the Distributions

Let’s delve into the mathematical formulas for each distribution – Probability Density Function (PDF), Mean, and Variance – along with illustrative examples.

A. Normal Distribution

Notation: $X \sim N(\mu, \sigma^2)$
Probability Density Function (PDF): $$f(x) = \frac{1}{\sigma\sqrt{2\pi}} e^{-\frac{1}{2}(\frac{x - \mu}{\sigma})^2}, \quad -\infty < x < \infty$$ The PDF defines the shape of the curve. Unlike discrete distributions, you don’t use the PDF to directly calculate the probability of a specific value (since the probability of any single point in a continuous distribution is 0). Instead, you find probabilities for ranges of values by calculating the area under the curve (using integration or, more commonly, tables or software).
Mean (Expected Value): $$E(X) = \mu$$
Variance: $$Var(X) = \sigma^2$$
Example: Heights of adult women are normally distributed with a mean of 65 inches and a standard deviation of 3.5 inches. You can’t use the PDF to find P(height = 68 inches), but you can find P(67 < height < 69 inches) using standardization and a Z-table (or software).

B. Standard Normal Distribution

Notation: $Z \sim N(0, 1)$
Probability Density Function (PDF): $$f(z) = \frac{1}{\sqrt{2\pi}} e^{-\frac{1}{2}z^2}, \quad -\infty < z < \infty$$ This is a simplified version of the normal PDF, with μ = 0 and σ = 1.
Mean (Expected Value): $$E(Z) = 0$$
Variance: $$Var(Z) = 1$$
Standardization: To convert any normal random variable X (with mean μ and standard deviation σ) to a standard normal variable Z, use the formula: $$Z = \frac{X - \mu}{\sigma}$$
Example: Using the latte example (mean = 2 minutes, standard deviation = 0.5 minutes), a latte that takes 2.75 minutes has a Z-score of: $Z = \frac{2.75 - 2}{0.5} = 1.5$. You can then use a Z-table to find the probability of a Z-score being less than 1.5, which corresponds to the probability of a latte taking less than 2.75 minutes to make.

C. Normal Approximation to the Binomial

Notation: $X \sim Bin(n, p)$ and $X$ can be approximated as $Y \sim N(np, np(1-p))$
Condition: $np > 5$ and $n(1-p) > 5$
Continuity Correction $P(X=x) \approx P(x-0.5 < Y < x+0.5)$
Z-score Calculation (with Continuity Correction):
To find probabilities, we use the Z-score formula, incorporating the continuity correction. For example, to find $P(X \le x)$:
$$Z = \frac{(x + 0.5) - np}{\sqrt{np(1-p)}}$$
You would then find $P(Z \le z)$ using a standard normal table.
Example: 300 lattes (n=300), probability of soy milk request = 0.2 (p=0.2). Find P(more than 70 soy lattes). This is P(X > 70).
1. Check conditions: np = 300 * 0.2 = 60 > 5. n(1-p) = 300 * 0.8 = 240 > 5. The normal approximation is appropriate.
2. Apply continuity correction: P(X > 70) becomes P(X > 70.5).
3. Calculate Z-score: $Z = \frac{70.5 - (300 \times 0.2)}{\sqrt{300 \times 0.2 \times 0.8}} = \frac{10.5}{\sqrt{48}} \approx 1.52$
4. Find probability: Use a Z-table (or software) to find P(Z > 1.52) = 1 - P(Z ≤ 1.52) ≈ 1 - 0.9357 = 0.0643. So, the approximate probability of more than 70 soy lattes is 6.43%.

D. Normal Approximation to the Poisson

Notation: $X \sim Poisson(\lambda)$ can be approximated as $Y\sim N(\lambda, \lambda)$
Mean (μ): λ
Variance (σ²): λ
Condition: $\lambda$ > 5 (The larger, the better)
Z-score Calculation (with Continuity Correction): To find $P(X = x)$, $$Z = \frac{(x + 0.5) - \lambda}{\sqrt{\lambda}}$$
Continuity Correction $P(X=x) \approx P(x-0.5 < Y < x+0.5)$
Example: 60 customers per hour (λ = 60). What is P(X < 50)?
1. Check condition: λ = 60 > 5. The normal approximation is appropriate.
2. Apply continuity correction: $P(X < 50)$ becomes $P(X < 49.5)$
3. Calculate the Z-score: $Z = \frac{(49.5 - 60)}{\sqrt{60}} \approx -1.36$
4. Find probability: Use a Z-table to find P(Z < -1.36) ≈ 0.0869.

E. Exponential Distribution

Notation: $X \sim Exp(\lambda)$
Probability Density Function (PDF): $$f(x) = \lambda e^{-\lambda x}, \quad x \ge 0, \lambda > 0$$
Cumulative Distribution Function (CDF): $$F(x) = P(X \le x) = 1 - e^{-\lambda x}$$ This is often more useful than the PDF for direct probability calculations.
Mean (Expected Value): $$E(X) = \frac{1}{\lambda}$$
Variance: $$Var(X) = \frac{1}{\lambda^2}$$
Example: Time between customer arrivals is exponential with λ = 1 arrival per minute. What’s the probability that more than 2 minutes pass between arrivals? This is P(X > 2).
- We use the complement of the CDF: $P(X > 2) = 1 - P(X \le 2) = 1 - (1 - e^{-1 \times 2}) = e^{-2} \approx 0.135$