Sampling Distribution of the Sample Mean

Scope Label

Core 9758. This branch develops the distribution of the sample mean $\overset{ˉ}{X}$ , including exact normal cases, the central limit theorem, and standardising with the standard error.

Use it with the hub Sampling and Estimation and the normal-distribution topic Special Continuous Random Variables.

From One Sample Mean to a Sampling Distribution

One observed sample gives one value of $\overset{x}{ˉ}$ .

Before the sample is taken, however, many possible random samples could occur. Each sample may give a different sample mean. Therefore the sample mean has its own distribution.

The notation is:

$X$ is one randomly chosen value from the population;
$\overset{ˉ}{X}$ is the random variable representing the mean of a random sample;
$\overset{x}{ˉ}$ is one observed value of $\overset{ˉ}{X}$ after sampling.

This distinction is essential. A question about $X$ is about one item. A question about $\overset{ˉ}{X}$ is about an average of $n$ items.

Caption: Repeated random samples produce different values of $\overset{x}{ˉ}$ ; these possible values form the sampling distribution of $\overset{ˉ}{X}$ .

Mean and Variance of $\overset{ˉ}{X}$

The standard H2 formulas below assume a proper random sample whose observations can be treated as independent and identically distributed. This is automatic for a sample from an infinite population, and it is also the usual model for sampling from a finite population with replacement.

For finite sampling without replacement, the exact variance can need a finite-population adjustment. H2 questions using the formula below normally signal the standard random-sample model, so do not add an extra correction unless the question explicitly requires it.

For a random sample of size $n$ from a population with mean $μ$ and variance $σ^{2}$ ,

E (\overset{ˉ}{X}) = μ

and

Var (\overset{ˉ}{X}) = \frac{σ ^{2}}{n} .

Hence

SD (\overset{ˉ}{X}) = \frac{σ}{n} .

This standard deviation is called the standard error of the sample mean.

Interpret the two results separately:

$E (\overset{ˉ}{X}) = μ$ says the sample mean is centred correctly;
$Var (\overset{ˉ}{X}) = σ^{2} / n$ says sample means become less variable as $n$ increases.

Caption: As sample size increases, the distribution of the sample mean stays centred at $μ$ but becomes less spread out.

Distribution of $\overset{ˉ}{X}$ : The Exact Normal Case

If the population itself is normal, then the sample mean is exactly normal.

X \sim N (μ, σ^{2}),

then for a random sample of size $n$ ,

\overset{ˉ}{X} \sim N (μ, \frac{σ ^{2}}{n}) .

This is exact for any sample size $n$ .

For example, if

X \sim N (50, 1 2^{2})

and $n = 36$ , then

\overset{ˉ}{X} \sim N (50, \frac{1 2 ^{2}}{36}) = N (50, 4) .

The variance of $\overset{ˉ}{X}$ is $4$ , so the standard deviation of $\overset{ˉ}{X}$ is $2$ .

Distribution of $\overset{ˉ}{X}$ : The Central Limit Theorem

If the population is not necessarily normal, the sample mean may still be approximately normal when the sample size is large.

If the population has mean $μ$ and variance $σ^{2}$ , then for large $n$ ,

\overset{ˉ}{X} \approx N (μ, \frac{σ ^{2}}{n}) .

This is the central limit theorem.

In H2 work, the usual guide is

n \geq 30.

The important distinction is:

$X$ itself may be skewed, discrete, or non-normal;
$\overset{ˉ}{X}$ may still be approximately normal when $n$ is large.

Caption: The central limit theorem connects many population shapes to an approximately normal distribution for the sample mean when $n$ is large.

Choosing the Distribution of $\overset{ˉ}{X}$

Use this decision process:

Is the population normal?
If yes, use the exact result $\overset{ˉ}{X} \sim N (μ, σ^{2} / n)$ .
If no, is $n$ large enough?
If yes, use the approximation $\overset{ˉ}{X} \approx N (μ, σ^{2} / n)$ .
If no, the usual normal model for $\overset{ˉ}{X}$ is not justified from the given information.

Caption: To choose the distribution of $\overset{ˉ}{X}$ , first ask whether the population is normal; if not, check whether the sample size is large enough for the central limit theorem.

Standardising $\overset{ˉ}{X}$

Once the distribution of $\overset{ˉ}{X}$ is known or approximated, probability statements can be standardised.

For one observation,

Z = \frac{X - μ}{σ} .

For a sample mean,

Z = \frac{X ˉ - μ}{σ / n} .

The denominator is the standard error, not the population standard deviation. This is the most common procedural error in this branch.

Caption: Standardising one observation uses $σ$ ; standardising a sample mean uses $σ / n$ .

Worked Example 1: Exact Normal Distribution of $\overset{ˉ}{X}$

The diameter of a metal rod is normally distributed with mean $1.00$ cm and standard deviation $0.01$ cm. A random sample of $4$ rods is selected.

Find

P (\overset{ˉ}{X} > 0.99) .

Let $X$ be the diameter of one rod. Then

X \sim N (1.00, 0.0 1^{2}) .

Since the population is normal,

\overset{ˉ}{X} \sim N (1.00, \frac{0.0 1 ^{2}}{4}) .

The standard deviation of $\overset{ˉ}{X}$ is

\frac{0.01}{4} = 0.005.

Therefore

P (\overset{ˉ}{X} > 0.99) = P (Z > \frac{0.99 - 1.00}{0.005}) = P (Z > - 2) .

Hence

P (\overset{ˉ}{X} > 0.99) \approx 0.977.

Worked Example 2: Central Limit Theorem

Suppose

X \sim B (10, 0.6) .

A random sample of size $40$ is taken. Find approximately

P (\overset{ˉ}{X} > 5.5) .

First find the population mean and variance of $X$ :

E (X) = 10 (0.6) = 6,

and

Var (X) = 10 (0.6) (0.4) = 2.4.

Since $n = 40$ is large, by the central limit theorem,

\overset{ˉ}{X} \approx N (6, \frac{2.4}{40}) .

The standard deviation of $\overset{ˉ}{X}$ is

\frac{2.4}{40} = 0.06 .

Therefore

P (\overset{ˉ}{X} > 5.5) = P (Z > \frac{5.5 - 6}{0.06}) .

Since

\frac{5.5 - 6}{0.06} \approx - 2.041,

we get

P (\overset{ˉ}{X} > 5.5) = P (Z > - 2.041) \approx 0.979.

No continuity correction is needed here because the central limit theorem is being applied to the sample mean $\overset{ˉ}{X}$ , not directly to a discrete count.

Worked Example 3: Symmetric Interval for $\overset{ˉ}{X}$

Suppose $X$ has mean $80$ and standard deviation $15$ . A random sample of size $64$ is taken, and the central limit theorem is applicable.

Find

P (77 < \overset{ˉ}{X} < 83) .

The sample mean has approximate distribution

\overset{ˉ}{X} \approx N (80, \frac{1 5 ^{2}}{64}) .

The standard error is

\frac{15}{64} = 1.875.

Therefore

P (77 < \overset{ˉ}{X} < 83) = P (\frac{77 - 80}{1.875} < Z < \frac{83 - 80}{1.875}) .

P (77 < \overset{ˉ}{X} < 83) = P (- 1.6 < Z < 1.6) \approx 0.890.

Link to Hypothesis Testing

Hypothesis testing uses this branch directly. A test for a population mean assumes a value of $μ$ under $H_{0}$ , uses the distribution of $\overset{ˉ}{X}$ under that assumption, and checks whether the observed $\overset{x}{ˉ}$ is unusually extreme.

So the sampling distribution is not just another probability calculation. It is the foundation of inference about a population mean.

Common Pitfalls

Treating $\overset{ˉ}{X}$ and $\overset{x}{ˉ}$ as the same object.
Using $σ$ instead of $σ / n$ when standardising $\overset{ˉ}{X}$ .
Forgetting that the exact normal result requires the population to be normal.
Applying the central limit theorem when $n$ is small and no normal population is given.
Thinking the central limit theorem makes $X$ normal. It applies to $\overset{ˉ}{X}$ .
Adding a continuity correction when the question is about the sample mean rather than a discrete count.

Revision Checklist

Can you explain why $\overset{ˉ}{X}$ is random before sampling?
Can you state and interpret $E (\overset{ˉ}{X})$ ?
Can you state and interpret $Var (\overset{ˉ}{X})$ ?
Can you identify the standard error $σ / n$ ?
Can you decide whether the distribution of $\overset{ˉ}{X}$ is exact normal or approximate normal?
Can you standardise probability statements involving $\overset{ˉ}{X}$ correctly?
Can you explain how this branch prepares for hypothesis testing?

Singapore H2 Math Wiki

Start Here

Sampling Distribution of the Sample Mean

Sampling Distribution of the Sample Mean

Scope Label

From One Sample Mean to a Sampling Distribution

Mean and Variance of $\overset{ˉ}{X}$

Distribution of $\overset{ˉ}{X}$ : The Exact Normal Case

Distribution of $\overset{ˉ}{X}$ : The Central Limit Theorem

Choosing the Distribution of $\overset{ˉ}{X}$

Standardising $\overset{ˉ}{X}$

Worked Example 1: Exact Normal Distribution of $\overset{ˉ}{X}$

Worked Example 2: Central Limit Theorem

Worked Example 3: Symmetric Interval for $\overset{ˉ}{X}$

Link to Hypothesis Testing

Common Pitfalls

Revision Checklist

Graph View

Table of Contents

Backlinks

Singapore H2 Math Wiki

Start Here

Sampling Distribution of the Sample Mean

Sampling Distribution of the Sample Mean

Scope Label

From One Sample Mean to a Sampling Distribution

Mean and Variance of Xˉ

Distribution of Xˉ: The Exact Normal Case

Distribution of Xˉ: The Central Limit Theorem

Choosing the Distribution of Xˉ

Standardising Xˉ

Worked Example 1: Exact Normal Distribution of Xˉ

Worked Example 2: Central Limit Theorem

Worked Example 3: Symmetric Interval for Xˉ

Link to Hypothesis Testing

Common Pitfalls

Revision Checklist

Graph View

Table of Contents

Backlinks

Mean and Variance of $\overset{ˉ}{X}$

Distribution of $\overset{ˉ}{X}$ : The Exact Normal Case

Distribution of $\overset{ˉ}{X}$ : The Central Limit Theorem

Choosing the Distribution of $\overset{ˉ}{X}$

Standardising $\overset{ˉ}{X}$

Worked Example 1: Exact Normal Distribution of $\overset{ˉ}{X}$

Worked Example 3: Symmetric Interval for $\overset{ˉ}{X}$