Binomial Distribution
Scope Label
Core 9758. This branch is the main student-facing note for the current core special discrete model.
Use it with the hub Special Discrete Random Variables.
What This Note Assumes
This note assumes that you already know the general discrete-random-variable language from Discrete Random Variables.
In particular:
- is the random variable
- is one possible integer value of
- is the probability of the event that exactly successes occur
The Binomial distribution is a special case of this general language. It should only be used after the trial structure has been checked.
The Binomial Situation
A binomial random variable counts the number of successes in a fixed number of repeated trials.
A binomial model is appropriate when all of the following conditions are satisfied:
- there is a fixed number of trials
- each trial has exactly two possible outcomes, usually called success and failure
- the probability of success is the same for each trial
- the trials are independent
If these conditions are satisfied, write:
Here, counts the number of successes in the trials.
The word “success” does not necessarily mean something good. It simply means the outcome being counted.
For example:
- if we count defective items, then “success” may mean “the item is defective”
- if we count sixes in die throws, then “success” means “a six is obtained”
- if we count correct answers in a guessed MCQ test, then “success” means “the answer is correct”
Caption: A binomial model requires a fixed number of independent yes/no trials with constant success probability.
Checking the Conditions in Context
The four binomial conditions should be checked before calculation.
| Condition | Meaning | Example |
|---|---|---|
| Fixed number of trials | The number of attempts is decided before the experiment starts. | A die is thrown 7 times. |
| Two outcomes per trial | Each trial is classified as success or failure. | Six or not six. |
| Constant success probability | The value of does not change from trial to trial. | Each die throw has probability of giving a six. |
| Independence | One trial does not affect another trial. | One die throw does not affect the next. |
If balls are drawn from a small bag without replacement, the success probability may change after each draw. In that case, the binomial model may not be suitable.
Stating the Random Variable and Distribution
A good solution should state the random variable clearly before using formulas.
A useful writing template is:
Let be the number of [successes] in [number of trials]. Then .
Example:
Let be the number of sixes obtained in 7 throws of a fair die. Then:
Another example:
Let be the number of correct answers obtained when a student guesses all 20 questions in a multiple-choice test with 4 options per question. Then:
This model statement confirms:
- what is being counted
- what one trial means
- what success means
- what is
- what is
Parameters and Possible Values
For
- is the number of trials
- is the probability of success in each trial
- is the probability of failure in each trial
- is the number of successes
The possible values of are:
This upper bound matters. A binomial random variable cannot exceed the number of trials.
For example, if
then is impossible because only 7 trials are performed.
Probability Function
For
the probability of exactly successes is:
The formula has a structure:
- chooses the positions of the successes
- gives the probability of those successes
- gives the probability of the failures
So the formula is:
Core Example 1: Exact Binomial Probability
A fair die is thrown 7 times. Find the probability of obtaining exactly 3 sixes.
Let be the number of sixes obtained in 7 throws.
Then:
We want:
Using the binomial formula:
This gives approximately:
The important modelling steps are:
- define
- state
- translate the question into
- calculate
Cumulative Binomial Probabilities
The cumulative binomial probability is:
It is useful for wording such as:
- at most
- not more than
- fewer than
- at least
- more than
For integer-valued :
| Wording | Probability statement |
|---|---|
| exactly | |
| at most | |
| fewer than | |
| at least | |
| more than |
The difference between and is one integer step.
Core Example 2: Cumulative Binomial Probability
Suppose
Find .
It is usually easier to use the complement:
Since:
we get:
This gives approximately:
The key step is translating as .
Mean and Variance
For
the mean is:
The variance is:
The mean represents the expected number of successes.
For example, if a student guesses 20 multiple-choice questions with 4 options each, then:
The expected number of correct answers is:
The variance is:
The standard deviation is:
Shape and Interpretation
A binomial distribution can be visualised using a probability histogram.
For fixed :
- if is small, the distribution is skewed to the right
- if , the distribution is symmetric
- if is large, the distribution is skewed to the left
- the centre of the distribution is near
This is important because the binomial distribution is not just a formula. It is a full probability distribution with a shape.
Caption: For fixed , changing shifts the centre and changes the skewness of the binomial distribution.
Most Probable Value
The most probable value of a discrete random variable is the value with the largest probability.
It is the tallest bar in the probability histogram.
This value is not necessarily equal to .
For example, if
then:
In this case, the most probable value is also . But this agreement does not always happen.
To find the most probable value, one may:
- inspect a probability table
- inspect a histogram
- compare neighbouring probabilities
- use calculator table values after the distribution has been correctly identified
Core Example 3: MCQ Guessing and Thresholds
A multiple-choice test has 20 questions with 4 options each. A student guesses all answers randomly.
Let be the number of correct answers.
Then:
The mean is:
The standard deviation is:
Suppose we want the least value of such that:
The anchor note checks cumulative probabilities and finds:
while:
So the least value is:
This example illustrates that cumulative probability questions often require a threshold search rather than one direct exact-value probability.
Combining Independent Binomial Samples
If two independent samples have the same binomial structure, it may be possible to combine them.
For example, suppose a sample of 10 items is drawn from a very large population where 10% of items are defective. Let be the number of defective items.
Then:
If a second independent sample of 10 items is drawn under the same conditions, and is its number of defectives, then:
The total number of defectives in the two samples is:
This works because the two samples together form 20 independent trials with the same success probability.
But be careful: if the question asks for a specific split, such as exactly one defective in each sample, then calculate the two sample events separately and multiply because of independence.
Calculator Workflow
A calculator is useful after the modelling has been done.
Use exact binomial probability for:
Use cumulative binomial probability for:
Before using the calculator, write:
- Let be the number of …
- Required probability
The calculator should not replace model checking or probability-language translation.
For example, a calculator can evaluate quickly, but it cannot decide whether the problem should be modelled as in the first place.
Common Pitfalls
| Mistake | Better thinking |
|---|---|
| Forgetting that trials must be independent | Check whether one trial affects another |
| Forgetting that must remain constant | If the probability changes, the binomial model may fail |
| Confusing success with a desirable outcome | Success means the outcome being counted |
| Forgetting that only takes values | A binomial count cannot exceed the number of trials |
| Using when the question says | For integer , means |
| Starting with calculator commands | First define , state the distribution, and translate the probability |
Revision Checklist
- Can you state the four binomial conditions?
- Can you define success clearly in context?
- Can you write with correct and ?
- Can you explain why the possible values are ?
- Can you calculate ?
- Can you use complements for “at least” and “more than” questions?
- Can you interpret and ?
- Can you find a most probable value from a table or histogram?