Chapter 5: Continuous Random Variables
5.1 Continuous Probability Functions
Learning Objectives
By the end of this section, students will be able to:
- describe the properties of continuous probability distributions.
Properties of Continuous Probability Distributions
The graph of a continuous probability distribution is a curve. Probability is represented by area under the curve.
The curve is called the probability density function (abbreviated as pdf). We use the symbol f(x) to represent the curve. f(x) is the function that corresponds to the graph; we use the density function f(x) to draw the graph of the probability distribution.
Area under the curve is given by a different function called the cumulative distribution function (abbreviated as cdf). The cumulative distribution function is used to evaluate probability as area.
- The outcomes are measured, not counted.
- The entire area under the curve and above the x-axis is equal to one.
- Probability is found for intervals of x values rather than for individual x values.
- P(c < x < d) is the probability that the random variable X is in the interval between the values c and d. P(c < x < d) is the area under the curve, above the x-axis, to the right of c and the left of d.
- P(x = c) = 0 The probability that x takes on any single individual value is zero. The area below the curve, above the x-axis, and between x = c and x = c has no width, and therefore no area (area = 0). Since the probability is equal to the area, the probability is also zero.
- P(c < x < d) is the same as P(c ≤ x ≤ d) because probability is equal to area.
We will find the area that represents probability by using geometry, formulas, technology, or probability tables. In general, calculus is needed to find the area under the curve for many probability density functions. When we use formulas to find the area in this textbook, the formulas were found by using the techniques of integral calculus. However, because most students taking this course have not studied calculus, we will not be using calculus in this textbook.
There are many continuous probability distributions. When using a continuous probability distribution to model probability, the distribution used is selected to model and fit the particular situation in the best way.
In this chapter and the next, we will study the uniform distribution, the exponential distribution, and the normal distribution. The following graphs illustrate these distributions.
Continuous Probability Functions
We begin by defining a continuous probability density function. We use the function notation f(x). Intermediate algebra may have been your first formal introduction to functions. In the study of probability, the functions we study are special. We define the function f(x) so that the area between it and the x-axis is equal to a probability. Since the maximum probability is one, the maximum area is also one. For continuous probability distributions, PROBABILITY = AREA.
Example
Consider the function
f(x) = [latex]\frac{1}{20}[/latex] for 0 ≤ x ≤ 20. x = a real number. The graph of
f(x) = [latex]\frac{1}{20}[/latex] is a horizontal line. However, since 0 ≤ x ≤ 20, f(x) is restricted to the portion between x = 0 and x = 20, inclusive.
f(x) = [latex]\frac{1}{20}[/latex]for 0 ≤ x ≤ 20.
The graph of f(x) = [latex]\frac{1}{20}[/latex] is a horizontal line segment when 0 ≤ x ≤ 20.
The area between f(x) = [latex]\frac{1}{20}[/latex] where 0 ≤ x ≤ 20 and the x-axis is the area of a rectangle with base = 20 and height = [latex]\frac{1}{20}[/latex].
Suppose we want to find the area between f(x) = [latex]\frac{1}{20}[/latex] and the x-axis where 0 < x < 2.
[latex]\text{AREA }=\text{ }\left(2\text{ }–\text{ }0\right)\left(\frac{1}{20}\right)\text{ }=\text{ }0.1[/latex]
[latex]\left(2\text{}–\text{}0\right)\text{}=\text{}2\text{}=\text{base of a rectangle}[/latex]
area of a rectangle = (base)(height).
The area corresponds to a probability. The probability that x is between zero and two is 0.1, which can be written mathematically as P(0 < x < 2) = P(x < 2) = 0.1.
Suppose we want to find the area between f(x) = [latex]\frac{1}{20}[/latex] and the x-axis where 4 < x < 15.
[latex]\text{AREA }=\text{ }\left(15\text{ }–\text{ }4\right)\left(\frac{1}{20}\right)\text{ }=\text{ }0.55[/latex]
[latex]\text{AREA }=\text{ }\left(15\text{ }–\text{ }4\right)\left(\frac{1}{20}\right)\text{ }=\text{ }0.55[/latex]
[latex]\left(15\text{ }–\text{ }4\right)\text{ }=\text{ }11\text{ }=\text{ the base of a rectangle}[/latex]
The area corresponds to the probability P(4 < x < 15) = 0.55.
Suppose we want to find P(x = 15). On an x-y graph, x = 15 is a vertical line. A vertical line has no width (or zero width). Therefore, P(x = 15) = (base)(height) = (0)[latex]\left(\frac{1}{20}\right)[/latex] = 0
P(X ≤ x) (can be written as P(X < x) for continuous distributions) is called the cumulative distribution function or CDF. Notice the “less than or equal to” symbol. We can use the CDF to calculate P(X > x). The CDF gives “area to the left” and P(X > x) gives “area to the right.” We calculate P(X > x) for continuous distributions as follows: P(X > x) = 1 – P (X < x).
Label the graph with
f(x) and x. Scale the x and y axes with the maximum x and y values. f(x) = [latex]\frac{1}{20}[/latex], 0 ≤ x ≤ 20.
To calculate the probability that x is between two values, look at the following graph. Shade the region between x = 2.3 and x = 12.7. Then calculate the shaded area of a rectangle.
[latex]P\left(2.3
Your Turn!
Consider the function f(x) = [latex]\frac{\text{1}}{8}[/latex]
for 0 ≤ x ≤ 8. Draw the graph of f(x) and find P(2.5 < x < 7.5).
Solution
P (2.5 < x < 7.5) = 0.625
a continuous random variable (RV) that has equally likely outcomes over the domain, a < x < b; it is often referred to as the rectangular distribution because the graph of the pdf has the form of a rectangle. Notation: X ~ U(a,b). The mean is μ = [latex]\frac{a+b}{2}[/latex] and the standard deviation is [latex]\sigma =\sqrt{\frac{{\left(b-a\right)}^{2}}{12}}[/latex]. The probability density function is f(x) = [latex]\frac{1}{b-a}[/latex] for a < x < b or a ≤ x ≤ b. The cumulative distribution is P(X ≤ x) = [latex]\frac{x-a}{b-a}[/latex].
a continuous random variable (RV) that appears when we are interested in the intervals of time between some random events, for example, the length of time between emergency arrivals at a hospital; the notation is X ~ Exp(m). The mean is μ = 1m and the standard deviation is σ = 1m. The probability density function is f(x) = me−mx, x ≥ 0 and the cumulative distribution function is P(X ≤ x) = 1 − e−mx.