Probability Distribution and CDF

Scope Label

Core 9758. This note covers the distribution-table and cumulative-probability language needed for discrete random variables.

Use it with the hub Discrete Random Variables.

Notation Recap

This note assumes that $X$ is a discrete random variable.

Keep three objects separate:

Symbol	Meaning
$X$	the random variable
$x$	one possible value of the random variable
${X = x}$	the event that $X$ takes the value $x$

So $P (X = x)$ is the probability of an event, even though it is written using numerical notation.

From Experiment to Distribution

There is a natural chain:

start with a random experiment
define a random variable from that experiment
identify the possible values of the variable
attach probabilities to those values
use the distribution to calculate probabilities and summaries

The experiment comes first. The random variable is built from the experiment. The distribution is built from the random variable.

For example, suppose two fair coins are tossed and $Y$ is the number of heads.

Outcome	Value of $Y$
$HH$	$2$
$H T$	$1$
$T H$	$1$
$TT$	$0$

The possible values of $Y$ are $0, 1, 2$ , and the probability distribution is:

$y$	$0$	$1$	$2$
$P (Y = y)$	$\frac{1}{4}$	$\frac{1}{2}$	$\frac{1}{4}$

The distribution no longer lists the original outcomes. It lists the possible numerical values of $Y$ and their probabilities.

Probability Distribution Function

The probability distribution function of a discrete random variable tells us the probability attached to each possible value.

If $X$ can take values $x_{1}, x_{2}, \dots$ , then the distribution gives:

P (X = x)

for each possible value $x$ .

In some textbooks this exact-value function is called the probability mass function. In this wiki, “probability distribution function” means the same discrete table of probabilities.

For a discrete random variable, pdf-style thinking answers “exactly this value”.

For example:

exactly $2$ heads
exactly $4$ defective items
exactly $0$ arrivals
exactly $3$ successes

Conceptually, the distribution function is the full probability model for the variable. It tells us:

which values are possible
how likely each value is
how later quantities such as expectation and variance can be calculated

Two structural facts must always hold:

0 \leq P (X = x) \leq 1

for every possible value, and

\sum P (X = x) = 1

across all possible values.

These facts encode that the possible values are exhaustive and mutually exclusive.

Caption: A discrete probability distribution assigns a valid probability to each possible value and the total must sum to $1$ .

Checking and Constructing a Valid Distribution

A proposed distribution must be checked before it is used.

Ask two questions:

Are all probabilities valid?
Do all probabilities add to $1$ ?

For example, suppose $X$ has distribution:

$x$	$0$	$1$	$2$
$P (X = x)$	$k$	$2 k$	$3 k$

To make this a valid distribution:

k + 2 k + 3 k = 1.

So:

6 k = 1,

and hence:

k = \frac{1}{6} .

Therefore:

$x$	$0$	$1$	$2$
$P (X = x)$	$\frac{1}{6}$	$\frac{1}{3}$	$\frac{1}{2}$

The important idea is not just solving for $k$ . It is using the fact that all possible values of $X$ must account for total probability $1$ .

Cumulative Distribution Function

The cumulative distribution function records running probability totals.

For a discrete random variable $X$ ,

P (X \leq x)

is the probability that the value of $X$ does not exceed $x$ .

The pdf and cdf answer different questions:

the pdf gives probability at a single value
the cdf gives total probability up to a value

That is why the cdf is naturally:

between $0$ and $1$
non-decreasing

The cdf is useful when the wording includes:

at most
not more than
up to
no greater than

For example, if $X$ represents the number of defective items in a sample, then:

P (X \leq 3)

means the probability of at most $3$ defective items:

P (X \leq 3) = P (X = 0) + P (X = 1) + P (X = 2) + P (X = 3) .

Caption: The cdf is built by adding probabilities from the distribution up to the chosen value.

Translating Inequality Language

Many discrete-random-variable questions are about translating words into precise inequality notation.

For integer-valued $X$ :

P (X < 3) = P (X \leq 2) .

This is because if $X$ only takes integer values, being less than $3$ means being at most $2$ .

Similarly:

P (X \geq 3) = 1 - P (X \leq 2) .

This uses the complement.

For an interval:

P (2 < X \leq 5) = P (X \leq 5) - P (X \leq 2) .

This removes everything up to $2$ , leaving $X = 3, 4, 5$ .

Translation Guide

Wording	Mathematical form	Common method
exactly $r$	$P (X = r)$	use pdf
at most $r$	$P (X \leq r)$	use cdf
not more than $r$	$P (X \leq r)$	use cdf
fewer than $r$	$P (X < r)$	convert if $X$ is integer-valued
at least $r$	$P (X \geq r)$	often use complement
more than $r$	$P (X > r)$	often use complement
between $a$ and $b$ inclusive	$P (a \leq X \leq b)$	subtract cumulative probabilities carefully

The main habit is to convert words into precise probability notation before calculating.

Worked Example: Distribution and CDF

Let $Y$ be the number of heads when two fair coins are tossed.

The probability distribution is:

$y$	$0$	$1$	$2$
$P (Y = y)$	$\frac{1}{4}$	$\frac{1}{2}$	$\frac{1}{4}$

The cdf is:

$y$	$0$	$1$	$2$
$P (Y \leq y)$	$\frac{1}{4}$	$\frac{3}{4}$	$1$

For example:

P (Y < 2) = P (Y \leq 1) = \frac{3}{4} .

Also:

P (Y \geq 1) = 1 - P (Y \leq 0) = 1 - \frac{1}{4} = \frac{3}{4} .

The values are simple here, but the method is the same for larger distribution tables.

Common Pitfalls

Mistake	Better thinking
Using a table before checking that probabilities sum to $1$	First verify that the proposed distribution is valid
Treating $P (X = x)$ and $P (X \leq x)$ as the same object	The pdf is exact-value probability; the cdf is accumulated probability
Translating $P (X < r)$ as $P (X \leq r)$ for integer-valued $X$	If $X$ is integer-valued, $X < r$ means $X \leq r - 1$
Forgetting endpoint inclusion in intervals	Read $<$ , $\leq$ , $>$ , and $\geq$ carefully
Using complement without checking the boundary	For integer-valued variables, the complement boundary shifts by one

Revision Checklist

Can you build a distribution table from a simple experiment?
Can you check that all probabilities are valid?
Can you use $\sum P (X = x) = 1$ to find an unknown constant?
Can you form a cdf table from a pdf table?
Can you explain why a cdf is non-decreasing?
Can you translate “at most”, “fewer than”, “at least”, and “between” correctly?

Singapore H2 Math Wiki

Start Here

Probability Distribution and CDF

Probability Distribution and CDF

Scope Label

Notation Recap

From Experiment to Distribution

Probability Distribution Function

Checking and Constructing a Valid Distribution

Cumulative Distribution Function

Translating Inequality Language

Translation Guide

Worked Example: Distribution and CDF

Common Pitfalls

Revision Checklist

Graph View

Table of Contents

Backlinks