Probability Density And CDF

Scope Label

Conceptual support / Enrichment. This note explains the general continuous-probability language used by normal-distribution work. It is useful for understanding Special Continuous Random Variables, but the current 9758 scope reference names the normal distribution itself as the core continuous model.

What This Note Assumes

You should already know:

  • probabilities are numbers between and ;
  • total probability is ;
  • discrete random variables attach probability to listed values;
  • integration can represent accumulated area.

The new idea is that continuous probability is attached to intervals, not exact points.

From Discrete Values to Continuous Intervals

For a discrete random variable, we can write probabilities such as

The possible values are separated, so each value may have its own probability.

For a continuous random variable, the possible values vary across intervals. A measured height, time, or mass is not naturally listed value by value. Between two possible values, more possible values lie between them.

Therefore a continuous model does not assign probability directly to each exact value. It spreads probability across the number line using a density.

Probability Density Function

A probability density function, or pdf, is a function whose area gives probability.

For a continuous random variable ,

Read this as:

  • is the height of the density curve;
  • represents a tiny piece of probability near ;
  • the integral accumulates those pieces over an interval;
  • the resulting area is the probability.

Caption: Probability over an interval is the area under the density curve over that interval.

Valid Pdf Conditions

A valid pdf must satisfy two conditions.

First,

for all values in its domain. Negative density would lead to negative probability over small intervals, which is impossible.

Second,

This says the total area under the whole density curve is .

Caption: A valid pdf is non-negative and has total area .

Do not check only whether the formula looks positive. Always check the intended domain. A function may be non-negative on one interval but invalid if the total area is not .

Density Is Not Probability

A density value such as is not the probability that .

The value is a height. It tells us how concentrated the distribution is near , but it has no interval width by itself. Probability needs area.

This is why a density value may be greater than without causing a contradiction. A tall, narrow density can still have total area .

The safe reading is:

Height is density. Area is probability.

Exact-Point Probability

For any continuous random variable,

This follows directly from the area interpretation:

A single point has zero width. Since probability is area, it contributes zero probability.

This does not mean the value is impossible in practical language. It means the model gives probability to intervals such as

not to the infinitely precise point .

Caption: A point has no width, so it contributes no area under a continuous density curve.

Endpoint Conventions

Because exact-point probabilities are zero, endpoint inclusion does not matter for continuous random variables.

For example,

and

This is different from discrete random variables. In a discrete model, adding or removing a possible value can change the probability. In a continuous model, adding or removing finitely many endpoints does not change area.

Cumulative Distribution Function

The cumulative distribution function, or cdf, is

It measures probability accumulated up to .

If the pdf is , then

If the random variable is defined only from a lower endpoint , this is often written as

for values of inside the domain.

The cdf has three useful properties:

  • is non-decreasing;
  • ;
  • approaches once the full domain has been covered.

Caption: The cdf value is the area accumulated under the pdf up to .

Using the CDF for Intervals

The cdf collects probability from the left. Therefore

The subtraction removes the area accumulated before , leaving only the area between and .

Common translations are:

WordingProbability form
less than
at most
more than
at least
between and
outside

For continuous variables, the endpoint versions are equivalent. The important work is identifying the correct interval.

Worked Example: Constant Density

Let

and otherwise.

First check that the total area is :

So is a valid pdf.

Now find

Using area,

This probability is the area of a rectangle of width and height . It is not the density height .

The cdf is

Then

which matches the area calculation.

Common Pitfalls

MistakeCorrection
Reading as is density, not probability.
Forgetting to check total areaA valid pdf must integrate to .
Worrying about open or closed endpointsEndpoints do not affect continuous probabilities.
Using for an intervalUse .
Ignoring the domainOutside the domain, the density may be and the cdf may be constant.

Revision Checklist

  • Can you explain why probability is area in a continuous model?
  • Can you state the two validity conditions for a pdf?
  • Can you explain why a density value may exceed ?
  • Can you explain why ?
  • Can you use endpoint equivalence correctly?
  • Can you define ?
  • Can you use to find an interval probability?