Hypothesis Testing Logic and Decision Methods

Scope Label

Core 9758. This branch develops the reasoning language behind one-tailed and two-tailed $Z$ -tests for a single population mean.

Use it with the hub Hypothesis Testing and the procedural branch Z-Tests for a Population Mean.

A Test Measures Evidence Under $H_{0}$

A hypothesis test starts by temporarily assuming the null hypothesis is true.

It then asks:

Under that assumption, how unusual is the sample result?

This is the reason the sampling distribution is built under $H_{0}$ . The test is not comparing two equally trusted claims. It is asking whether the sample gives enough evidence to reject the baseline claim.

Caption: A hypothesis test starts by assuming $H_{0}$ , then measures how unusual the sample is under that assumption before making a decision.

Statistical Hypotheses

A statistical hypothesis is a claim about a population parameter.

For a population mean, the null hypothesis is usually

H_{0} : μ = μ_{0} .

The equality is essential because it fixes the reference distribution used for the test statistic.

The alternative hypothesis may be

H_{1} : μ < μ_{0}, H_{1} : μ > μ_{0}, H_{1} : μ \neq = μ_{0} .

Choose $H_{1}$ from the wording and purpose of the investigation, not from the sample mean after it is observed.

Caption: The null hypothesis fixes the reference value, while the alternative hypothesis states the kind of departure being tested.

Translating Wording into $H_{1}$

Many testing errors happen before calculation begins.

Wording	Meaning	Alternative
has increased	definite increase	$H_{1} : μ > μ_{0}$
is greater than	definite increase	$H_{1} : μ > μ_{0}$
has decreased	definite decrease	$H_{1} : μ < μ_{0}$
is less than	definite decrease	$H_{1} : μ < μ_{0}$
has changed	either direction	$H_{1} : μ \neq = μ_{0}$
is different	either direction	$H_{1} : μ \neq = μ_{0}$
is affected	either direction	$H_{1} : μ \neq = μ_{0}$
has overstated the mean	true mean is lower	$H_{1} : μ < μ_{0}$
has understated the mean	true mean is higher	$H_{1} : μ > μ_{0}$

For example, “the machine is not correctly calibrated” usually means the mean may be too high or too low, so the test is two-tailed.

Tail Direction

The form of $H_{1}$ determines where the rejection region lies.

Lower-tailed test

H_{1} : μ < μ_{0},

then unusually small test-statistic values support $H_{1}$ .

Upper-tailed test

H_{1} : μ > μ_{0},

then unusually large test-statistic values support $H_{1}$ .

Two-tailed test

H_{1} : μ \neq = μ_{0},

then unusually extreme values in either direction support $H_{1}$ .

Caption: The alternative hypothesis decides which tail or tails contain the rejection region.

Significance Level

The significance level $α$ is the chosen threshold for rejecting $H_{0}$ .

It controls how extreme a result must be before it counts as sufficient evidence against $H_{0}$ .

Typical levels are:

10%, 5%, 1%.

A smaller significance level demands stronger evidence before rejecting $H_{0}$ .

There is a more precise interpretation: $α$ is the probability of rejecting $H_{0}$ when $H_{0}$ is actually true. In words, it is the allowed risk of a false rejection.

So a $5%$ significance level means:

P (reject H_{0} ∣ H_{0} is true) = 0.05.

This does not mean there is a $5%$ probability that $H_{0}$ is true. The probability is calculated under the assumption that $H_{0}$ is true.

Critical-Region Method

The critical region is the set of test-statistic values that lead to rejection of $H_{0}$ .

The critical-region method is:

decide the test direction from $H_{1}$ ;
use $α$ to find the critical value or values;
compute the observed test statistic;
reject $H_{0}$ if the statistic lies in the critical region.

For a $Z$ -test:

an upper-tailed $5%$ test uses the right tail;
a lower-tailed $5%$ test uses the left tail;
a two-tailed $5%$ test uses $2.5%$ in each tail.

p-Value Method

The p-value is the probability, assuming $H_{0}$ is true, of obtaining a result at least as extreme as the observed result.

Decision rule:

reject H_{0} if p-value < α .

A small p-value means the observed result would be unusual if $H_{0}$ were true. It does not mean that the p-value is the probability that $H_{0}$ is true.

Caption: The critical-region and p-value methods express the same testing logic in two different ways.

Critical Region Versus p-Value

The two methods are equivalent when used correctly.

Method	Question asked	Decision
Critical region	Did the statistic enter the rejection region?	Reject $H_{0}$ if yes
p-value	Is the observed result more extreme than $α$ allows?	Reject $H_{0}$ if p-value $< α$

Use whichever method the question asks for. If no method is specified, either is acceptable if the working is clear.

Mini-Examples: Wording Before Calculation

Overstated mean

Suppose a school claims that the mean score is at least $80$ , and the question asks whether the school has overstated the mean.

The suspected direction is lower than the claim, so write

H_{0} : μ = 80, H_{1} : μ < 80.

Correctly calibrated

Suppose a machine is claimed to fill bottles with mean volume $500$ ml, and the question asks whether the machine is correctly calibrated.

Too low and too high both matter, so write

H_{0} : μ = 500, H_{1} : μ \neq = 500.

Do not change this to an upper-tailed test just because the observed sample mean happens to be above $500$ .

Writing Conclusions

A testing conclusion must be cautious and contextual.

If $H_{0}$ is rejected:

There is sufficient evidence at the $α$ level that …

If $H_{0}$ is not rejected:

There is insufficient evidence at the $α$ level that …

Avoid saying:

“ $H_{0}$ is true”;
“ $H_{0}$ is accepted”;
“the alternative has been proved”.

A hypothesis test makes a decision at a chosen level of evidence. It does not deliver absolute proof.

Common Pitfalls

Choosing the tail direction from the observed sample mean instead of the question wording.
Forgetting that $H_{0}$ must contain equality.
Splitting $α$ across two tails in a one-tailed test.
Failing to split $α$ across two tails in a two-tailed test.
Treating the p-value as the probability that $H_{0}$ is true.
Writing a conclusion without context.

Revision Checklist

Can you explain why the test statistic is considered under $H_{0}$ ?
Can you translate wording into $H_{1}$ ?
Can you decide whether a test is lower-tailed, upper-tailed, or two-tailed?
Can you use both critical-region and p-value methods?
Can you write a proper conclusion without overclaiming?

Singapore H2 Math Wiki

Start Here

Hypothesis Testing Logic and Decision Methods

Hypothesis Testing Logic and Decision Methods

Scope Label

A Test Measures Evidence Under $H_{0}$

Statistical Hypotheses

Translating Wording into $H_{1}$

Tail Direction

Lower-tailed test

Upper-tailed test

Two-tailed test

Significance Level

Critical-Region Method

p-Value Method

Critical Region Versus p-Value

Mini-Examples: Wording Before Calculation

Overstated mean

Correctly calibrated

Writing Conclusions

Common Pitfalls

Revision Checklist

Graph View

Table of Contents

Backlinks

Singapore H2 Math Wiki

Start Here

Hypothesis Testing Logic and Decision Methods

Hypothesis Testing Logic and Decision Methods

Scope Label

A Test Measures Evidence Under H0​

Statistical Hypotheses

Translating Wording into H1​

Tail Direction

Lower-tailed test

Upper-tailed test

Two-tailed test

Significance Level

Critical-Region Method

p-Value Method

Critical Region Versus p-Value

Mini-Examples: Wording Before Calculation

Overstated mean

Correctly calibrated

Writing Conclusions

Common Pitfalls

Revision Checklist

Graph View

Table of Contents

Backlinks

A Test Measures Evidence Under $H_{0}$

Translating Wording into $H_{1}$