Z-Tests for a Population Mean

Scope Label

Core 9758. This branch covers one-tailed and two-tailed -tests for a single population mean. It is the main procedural branch for Hypothesis Testing.

Use it with Hypothesis Testing Logic and Decision Methods for the conceptual language.

When a -Test Is Used

The core -test applies when testing a claim about one population mean and the population standard deviation is known.

The sample statistic is .

Under , the test statistic is

For observed data, use

The denominator is the standard error of the sample mean. This links directly to Sampling Distribution of the Sample Mean.

Distributional Justification

The -test is justified in two common cases:

  • if the population is normal, then is exactly normal;
  • if the sample size is large, then is approximately normal by the central limit theorem.

In both cases, under ,

exactly or approximately.

Standard Workflow

For a -test:

  1. Define clearly in context.
  2. State and .
  3. Identify the tail direction.
  4. State or compute the test statistic.
  5. Use either the critical-region method or the p-value method.
  6. Write a contextual conclusion.

Caption: A core -test follows a fixed chain from context and hypotheses to a statistic, decision method, and conclusion.

Two Routes to the Same Statistic

The formula for the statistic is the same in the two common justifications, but the reason it is valid is different.

SituationDistribution of Comment
Population normal, knownexact normalworks for any
Population not necessarily normal, known, largeapproximately normaluses the central limit theorem

In both cases, under ,

is standard normal exactly or approximately.

Do not use this core -test formula if the population standard deviation is not given and the question is not clearly using a large-sample approximation.

Critical-Region Method

The critical-region method compares the observed statistic with critical values.

For common -tests:

TestAlternativeCritical-region idea
Lower-tailedreject for sufficiently negative
Upper-tailedreject for sufficiently positive
Two-tailedreject for sufficiently large $

The critical values come from the standard normal distribution.

p-Value Method

The p-value is computed under .

For :

  • lower-tailed test: ;
  • upper-tailed test: ;
  • two-tailed test: probability of being at least as extreme in either direction.

Reject if

Caption: The critical-region and p-value methods lead to the same decision when applied to the same test.

Worked Example 1: Lower-Tailed -Test

A manufacturer claims that the mean mass of a component is g. The population standard deviation is known to be g. A random sample of components has mean mass g.

Test at the level whether the mean mass has decreased.

Step 1: Define the parameter

Let be the true mean mass of the components.

Step 2: State the hypotheses

This is a lower-tailed test.

Step 3: Compute the statistic

Critical-region method

For a lower-tailed test, the critical value is approximately .

Since

the observed statistic lies in the critical region. Reject .

Conclusion: there is sufficient evidence at the level that the mean mass has decreased.

p-Value method

The p-value is

Since

reject .

The conclusion is the same.

Worked Example 2: Two-Tailed -Test

A machine is claimed to fill packets with mean mass g. The population standard deviation is known to be g. A random sample of packets has mean mass g.

Test at the level whether the machine is incorrectly calibrated.

Let be the true mean fill mass.

This is a two-tailed test because “incorrectly calibrated” means too low or too high.

The observed statistic is

For a two-tailed test, the critical values are approximately

Since

reject .

Conclusion: there is sufficient evidence at the level that the machine is incorrectly calibrated.

Worked Example 3: Insufficient Evidence

A company claims that a process has mean completion time minutes. The population standard deviation is minutes. A sample of jobs has mean time minutes.

Test at the level whether the mean completion time is less than minutes.

The observed statistic is

For a lower-tailed test, the critical value is approximately .

Since

the statistic is not in the critical region. Do not reject .

Conclusion: there is insufficient evidence at the level that the mean completion time is less than minutes.

Calculator Discipline

A graphing calculator can be used to:

  • compute tail probabilities;
  • compute p-values;
  • find normal critical values.

However, the calculator does not decide:

  • the correct ;
  • the test direction;
  • whether to use or ;
  • how to write the conclusion.

Those decisions must come from the context and the sampling distribution.

When reading calculator output, identify:

  • the test statistic, usually reported as ;
  • the p-value;
  • whether the tail choice entered into the calculator matches .

The calculator may give a number, but it does not know whether the question asks for an increase, decrease, or any change. That decision must be made before using the calculator.

Common Pitfalls

  • Using instead of in the test statistic.
  • Forgetting the denominator .
  • Using a one-tailed critical value for a two-tailed test.
  • Failing to state in context.
  • Writing “accept ” when the correct conclusion is “do not reject ”.
  • Reporting the decision without a contextual conclusion.

Revision Checklist

  • Can you state when a -test for one mean is appropriate?
  • Can you compute correctly?
  • Can you choose the correct critical region from ?
  • Can you calculate and interpret a p-value?
  • Can you write a conclusion in context?