4.9 Discrete Distributions
1. Discrete vs. Continuous Variables
A random variable $X$ takes values in a domain. It can be:
- Discrete: e.g., $X \in \{10, 20, 30\}$ or $X \in \{0, 1, 2, 3, \dots\}$. A discrete variable takes values in a finite or countable set.
- Continuous: e.g., $X \in [10, 20]$ or $X \in \mathbb{R}$. A continuous variable takes values in an interval or intervals.
This section covers discrete random variables.
Discrete Random Variable
Let $X$ be a variable that takes the values 10, 20, and 30 with probabilities 0.2, 0.3, and 0.5. This can be shown in a table:
| $X$ | 10 | 20 | 30 |
|---|---|---|---|
| $P(X=x)$ | 0.2 | 0.3 | 0.5 |
For any discrete random variable $X$ with values $x_1, x_2, x_3, \dots$ and probabilities $p_1, p_2, p_3, \dots$:
- (i) $p_i \ge 0$ for all $i$ (probabilities are non-negative)
- (ii) $\sum p_i = 1$, meaning $p_1 + p_2 + p_3 + \dots = 1$ (the sum of probabilities is 1)
2. The Expected Value $\mu = E(X)$
The mean $\mu$, or expected value $E(X)$, is defined as:
For the previous example, the expected value is:
This mean is identical to the standard statistical mean. Consider these ten numbers:
EXAMPLE 1
Consider the following probability distribution:
| $X$ | 10 | 20 | 30 |
|---|---|---|---|
| $P(X=x)$ | $a$ | $b$ | 0.5 |
Solution: Use two equations based on the properties of distributions:
- Sum of probabilities is 1: $a + b + 0.5 = 1 \implies a + b = 0.5$
- Expected value formula: $10a + 20b + 30(0.5) = 23 \implies 10a + 20b = 8$
Probability distributions apply to betting games:
EXAMPLE 2
Using the previous table, select one number from 10, 20, 30 at random.
- Selecting 10 earns 6 points
- Selecting 20 earns 1 point
- Selecting 30 loses 2 points
What is the expected number of points per game?
| $X$ | 10 | 20 | 30 |
|---|---|---|---|
| Profit | 6 points | 1 point | -2 points |
| Prob | 0.2 | 0.3 | 0.5 |
$\text{Expected profit} = 6 \times 0.2 + 1 \times 0.3 - 2 \times 0.5 = 1.2 + 0.3 - 1.0 = \mathbf{0.5}$
We earn an average of 0.5 points per game.
- 2 times the number 10 $\implies 2 \times 6 = 12$ points
- 3 times the number 20 $\implies 3 \times 1 = 3$ points
- 5 times the number 30 $\implies 5 \times (-2) = -10$ points
EXAMPLE 3
We throw two dice.
- TWO SIXES earns 15€
- ONE SIX earns 1€
- NO SIX loses 1€
Find the expected profit per game.
| Result | TWO SIXES | ONE SIX | NO SIX |
|---|---|---|---|
| Profit | 15€ | 1€ | -1€ |
| Prob | $\dfrac{1}{36}$ | $\dfrac{10}{36}$ | $\dfrac{25}{36}$ |
3. Median and Mode
These measures are defined for probability distributions as follows:
- MODE = The value $X=a$ with the highest probability.
- MEDIAN = The value $X=m$ where the cumulative probability splits the distribution (reaches 0.5).
EXAMPLE 4 (Finding Mode and Median)
| $X$ | 10 | 20 | 30 |
|---|---|---|---|
| $P(X=x)$ | 0.4 | 0.3 | 0.3 |
- MODE = 10 (Highest probability)
- MEDIAN = 20 (Cumulative probability reaches 0.5 at 20)
| $X$ | 10 | 20 | 30 |
|---|---|---|---|
| $P(X=x)$ | 0.2 | 0.3 | 0.5 |
- MODE = 30 (Highest probability)
- MEDIAN = 25 (At $X=20$, the cumulative sum is 0.5. The midpoint between 20 and 30 divides the distribution: $\dfrac{20+30}{2} = 25$).
4. VarianceHL ONLY
The variance of a discrete random variable is:
Expanded, this is:
An alternative formula is:
where
EXAMPLE 5
Using the initial probability distribution:
| $X$ | 10 | 20 | 30 |
|---|---|---|---|
| $P(X=x)$ | 0.2 | 0.3 | 0.5 |