"

Chapter 4: Discrete Random Variables

4.3 Binomial Distribution

Learning Objectives

By the end of this section, you should be able to:

  • Identify the components of a binomial experiment
  • Use the formulas for a binomial random variable to compute mean, variance, and standard deviation

Binomial Experiments

There are three characteristics of a binomial experiment.

  1. There are a fixed number of trials. Think of trials as repetitions of an experiment. The letter [latex]n[/latex] denotes the number of trials.
  2. There are only two possible outcomes, called “success” and “failure,” for each trial. The letter [latex]p[/latex] denotes the probability of a success on one trial, and [latex]q[/latex] denotes the probability of a failure on one trial [latex]p + q = 1[/latex].
  3. The [latex]n[/latex] trials are independent and are repeated using identical conditions. Because the [latex]n[/latex] trials are independent, the outcome of one trial does not help in predicting the outcome of another trial. Another way of saying this is that for each individual trial, the probability, [latex]p[/latex], of a success and probability, [latex]q[/latex], of a failure remain the same.

For example, randomly guessing at a true-false statistics question has only two outcomes. If a success is guessing correctly, then a failure is guessing incorrectly. Suppose Taylor consistently guesses correctly on any statistics true-false question with probability [latex]p = 0.6[/latex]. Then, [latex]q = 0.4.[/latex] Consistency in guessing means that for every true-false statistics question Taylor answers, the probability of success ([latex]p = 0.6[/latex]) and the probability of failure ([latex]q = 0.4[/latex]) remain the same, a necessary criteria for a situation to be binomial.

Any experiment that has characteristics two and three and where [latex]n = 1[/latex] is called a Bernoulli Trial (named after Jacob Bernoulli who, in the late 1600s, studied them extensively). A binomial experiment takes place when the number of successes is counted in one or more Bernoulli Trials.

The outcomes of a binomial experiment fit a binomial probability distribution. The random variable [latex]X =[/latex] the number of successes obtained in the [latex]n[/latex] independent trials.

Notation for the Binomial

We use [latex]B[/latex] to represent the binomial probability distribution, and when [latex]X[/latex] fits the binomial distribution, we write [latex]X \sim B(n,p)[/latex]. Read this as “[latex]X[/latex] is a random variable with a binomial distribution with parameters [latex]n[/latex] and [latex]p[/latex],” which again represent the number of trials and the probability of “success.”

Example

At ABC College, the withdrawal rate from an elementary physics course is 30% for any given term. This implies that, for any given term, 70% of the students stay in the class for the entire term. A “success” could be defined as an individual who withdrew. The random variable [latex]X =[/latex] the number of students who withdraw from the randomly selected elementary physics class.

If at the start of a particular term, 300 students are enrolled in the elementary physics course, [latex]n = 300[/latex] and [latex]p = 0.3[/latex] and [latex]X \sim B(300, 0.3)[/latex]. The possible outcomes are [latex]x = 0, \ldots, 300[/latex] and the probability [latex]P(X=x)[/latex] is the probability that [latex]x[/latex] students will withdraw during the term.

Your Turn!

Approximately 70% of statistics students do their homework in time for it to be collected and graded. Each student does homework independently. In a statistics class of 50 students, what is the probability that at least 40 will do their homework on time?

a. This is a binomial problem because there is only a success or a [latex]\underline{\hspace{20pt}}[/latex], there are a fixed number of trials, and the probability of a success is 0.70 for each trial.

b. If we are interested in the number of students who do their homework on time, then how do we define [latex]X[/latex]?

c. What values does [latex]x[/latex] take on?

d. What is a “failure,” in words?

e. If [latex]p + q = 1[/latex], then what is [latex]q[/latex]?

f. The words “at least” translate as what kind of inequality for the probability question [latex]P(X \underline{\hspace{20pt}} 40)[/latex].

Solution

a. failure
b. [latex]X =[/latex] the number of statistics students who do their homework on time
c. 0, 1, 2, …, 50
d. Failure is defined as a student who does not complete his or her homework on time. The probability of a success is [latex]p=0.70[/latex]. The number of trials is [latex]n=50[/latex].
e. [latex]q = 0.30[/latex]
f. [latex]P(X \ge 40)[/latex]

Your Turn!

Suppose you play a game that you can only either win or lose. The probability that you win any game is 55%, and the probability that you lose is 45%. Each game you play is independent. If you play the game 20 times, write the function that describes the probability that you win 15 of the 20 times.

Solution

Here, if you define [latex]X[/latex] as the number of wins, then [latex]X[/latex] takes on the values [latex]0,1,2,3,…,20[/latex]. The probability of a success is [latex]p=0.55[/latex]. The probability of a failure is [latex]q=0.45[/latex]. The number of trials is [latex]n=20[/latex]. The probability question can be stated mathematically as [latex]P(X=15)[/latex] or [latex]P(15)[/latex].

Experiments That Are Not Binomial

Here are some common experiments that are not binomial:

  1. Flipping a coin until you get one head. While there are only two outcomes (heads and tails) to each coin flip, and while the probability of getting a head on each flip is consistent, the number of trials will vary.
  2. Most characteristics about people, such as weight, height, ethnicity, and gender. When we survey a group of people and ask for their weight, we get far more than two responses. When surveying ethnicity, as discussed in Chapter 1 an Other or Unknown category is needed to include all people in our results, specifically people who did not feel they fit into any of the ethnicity categories or declined to respond. While gender has historically been treated as a binomial outcome (male and female), not all people will feel they fit those two categories, so we may lose information and reliability by modeling gender as binomial.
  3. Drawing cards without replacement. While you can specify a set number of [latex]n[/latex] trials, and you can set it up as two outcomes (success = red, failure = black), the draws are not independent. If you draw a red on the first draw, it is now slightly more likely to draw black than red on the second draw.

Example

ABC College has a student advisory committee made up of ten staff members and six students. The committee wishes to choose a chairperson and a recorder by putting the names of all committee members into a box, and two names are drawn without replacement. The first name drawn determines the chairperson and the second name the recorder. Suppose the college wishes to find the probability that the chairperson and recorder are both students. Is this binomial?

Solution

There are two trials. However, the trials are not independent because the outcome of the first trial affects the outcome of the second trial. The probability of a student on the first draw is [latex]\frac{6}{16}[/latex]. The probability of a student on the second draw is [latex]\frac{5}{15}[/latex], when the first draw selects a student. The probability is [latex]\frac{6}{15}[/latex], when the first draw selects a staff member. The probability of drawing a student’s name changes for each of the trials and, therefore, violates the condition of independence.

Your Turn!

Suppose 55% of people pass the state driver’s exam on the first try. You survey fifty randomly chosen adults in the state.

Which of these are binomial problems?

  • The number of the adults who have a driver’s license.
  • The number of the adults who got their driver’s license on the first try.
  • The number of times each adult has taken the driver’s exam.
Solution

The first two of the three options can be modeled as a binomial random variable.

In the first scenario, a success is having a driver’s license, and each person either does or does not have a current driver’s license. We have [latex]n=50[/latex] trials. Since the adults are drawn randomly among the state’s population, we just need to know the percentage of adults in the state that have a driver’s license, then it can be modeled as a binomial random variable.

The second option, where we ask not only if they have a driver’s license, but if they got it on their first try, is a binomial problem. There is only a success or a failure – they did, or did not, get it on the first try — and there are a definite number of trials (each person gets one first try for [latex]n=50[/latex]). The probability of a success stays the same for each trial at [latex]p=0.55[/latex] and we can write [latex]X \sim B(50,0.55)[/latex].

The third option is not binomial. We may get any number of different responses, since it is quite probable that some of these adults took more than one try to pass the exam.

Binomial Probability Function

Once we have decided we can use the binomial for a given situation, we can use the binomial probability function to find the probability of a specific number of successes, [latex]P(X=x)[/latex].  The binomial PMF is made up of two parts:

First, we need to find out how many different ways we can get x successes in n trials.  To do this we can use the “Choose” function, also called the binomial coefficient, written as:

[latex]_n C_x= \binom nx =\frac{n!}{x!(n-x)!}[/latex]

Note

“!” is the factorial operator.

 

The next part gives us the probability of a single one of those ways to get x successes in n trials.  We can do this by using our independent multiplication rule.   We multiply the probability of success ([latex]p[/latex]) raised to the number of successes ([latex]x[/latex]) by the probability of failure ([latex]q=1-p[/latex]) raised to the number of failures ([latex]n-x[/latex]).

[latex]p^x q^{(n-x)}[/latex]

Since we know each of these ways are equally likely and how many ways are possible we can now put the two pieces together. We multiply the probability of one way by how many we have to give us our overall probability of x successes in n trials.

[latex]P(X = x) = \frac{n!}{x!(n-x)!} p^x q^{(n-x)}[/latex]

Unfortunately the binomial does not have a nice form of CDF, but it is simply the sum of PDFs up until that point. Consider the following example to demonstrate this point.

Example

It has been stated that about 41% of adult workers have a high school diploma but do not pursue any further education. Twenty adult workers are randomly selected.

Let [latex]X =[/latex] the number of workers who have a high school diploma but do not pursue any further education.

Then [latex]X[/latex] takes on the values [latex]0, 1, 2, \ldots, 20[/latex] where [latex]n = 20, p = 0.41[/latex], and [latex]q = 1-0.41 = 0.59[/latex]. Finally, [latex]X \sim B(20,0.41).[/latex]

The y-axis contains the probability of [latex]x[/latex], where [latex]X =[/latex] the number of workers who have only a high school diploma.

The graph of [latex]X \sim B(20, 0.41)[/latex] is as follows:

Histogram of Binomial Distribution with 20 trials and success probability 0.41

Find the probability that:

a. Exactly 12 of them have a high school diploma

Solution

We can simply plug into the Binomial PMF

[latex]P(X=x) = \frac{n!}{x!(n−x)!}p^x q^{n−x}[/latex]

for [latex]P(X=12)[/latex] with [latex]n=20[/latex] and [latex]p=0.41[/latex]

[latex]= \frac{20!}{12!(20−12)!} {0.41}^{12} \cdot {(1−0.41)}^{20−12} = 0.0417[/latex]

 

b. At most 12 of them have a high school diploma but do not pursue any further education. How many adult workers do you expect to have a high school diploma but do not pursue any further education?

Solution

We want to find [latex]P(X \le 12)[/latex], so we use the CDF function. On a calculator, that’s binomcdf(20,0.41,12). In Excel, we enter BINOM.DIST(12,20,0.41,TRUE). In both cases, we get about 0.9738.

So the probability that at most 12 workers have a high school diploma but do not pursue any further education is 0.9738.

Note

To compute binomial probabilities on a graphing calculator, go into 2nd DISTR. The syntax for the instructions are as follows:

To calculate [latex]P(X = x)[/latex]: binompdf(n, p, x).
If [latex]x[/latex] is left out, the result is the binomial probability table.

To calculate [latex]P(X \leq x)[/latex]: binomcdf(n, p, x).
If [latex]x[/latex] is left out, the result is the cumulative binomial probability table.

For the example problem: After you are in 2nd DISTR, arrow down to binomcdf(. Press ENTER. Enter 20,0.41,12). The result is [latex]P(X \leq 12) = 0.9738[/latex].

If you wanted to instead find [latex]P(X>12)[/latex], use 1 – binomcdf(20,0.41,12).

In Excel, both binomial probabilities are computed using BINOM.DIST(x, n, p, True/False), where False computes [latex]P(X=x)[/latex] and is equivalent to binompdf, and True computes [latex]P(X \leq x)[/latex] and is equivalent to binomcdf.

 

Your Turn!

About 32% of students participate in a community volunteer program outside of school. If 30 students are selected at random, find:

a. The probability that exactly 14 of them participate in a community volunteer program outside of school.  First try plugging in to the binomial formula by hand, then check yourself with technology.

b. The probability that exactly 14 of them participate in a community volunteer program outside of school. Rely on technology for this probability.

Solution

a. Since [latex]q=1−0.32=0.68[/latex], we have
[latex]P(X=14)= \frac{30!}{14!(30−14)!} (0.32^{14})(0.68^{(30−14)})=0.03589[/latex].

b. The answer of 0.03589 can also be found using binompdf(30,0.32,14) on a calculator or BINOM.DIST(14,30,0.32,FALSE).

Expected Value and Standard Deviation

The mean, [latex]\mu[/latex], and variance, [latex]\sigma^2[/latex], for the binomial probability distribution are [latex]\mu = np[/latex] and [latex]\sigma^2 = npq[/latex]. The standard deviation is then [latex]\sigma = \sqrt{npq}[/latex].

Example

In the 2013 Jerry’s Artarama art supplies catalog, there are 560 pages. Eight of the pages feature signature artists. Suppose we randomly sample 100 pages. Let [latex]X =[/latex] the number of pages that feature signature artists.

  1. What values does [latex]x[/latex] take on?
  2. What is the probability distribution? Find the following probabilities:
    1. the probability that two pages feature signature artists
    2. the probability that at most six pages feature signature artists
    3. the probability that more than three pages feature signature artists.
  3. Using the formulas, calculate the (i) mean and (ii) standard deviation.
Solution
  1. [latex]x = 0, 1, 2, 3, ..., n[/latex], where [latex]n=100[/latex]
  2. [latex]X \sim B \left(100,\frac{8}{560}\right)[/latex]
    1. [latex]P(x = 2) = \text{binompdf} \left(100,\frac{8}{560},2\right) = 0.2466[/latex]
    2. [latex]P(x \le 6) = \text{binomcdf} \left(100,\frac{8}{560},6\right) = 0.9994[/latex]
    3. [latex]P(x > 3) = 1 – P(x \le 3) = 1 – \text{binomcdf} \left(100,\frac{8}{560},3\right) = 1 – 0.9443 = 0.0557[/latex]
  3. (i) [latex]\text{Mean} = np = (100) \left(\frac{8}{560}\right)[/latex] = [latex]\frac{800}{560}[/latex] ≈ 1.4286
    (ii) [latex]\text{Standard Deviation} = \sqrt{npq} = \sqrt{\left(100\right)\left(\frac{8}{560}\right)\left(\frac{552}{560}\right)} \approx 1.1867[/latex]

Your Turn!

According to a Gallup poll, 60% of American adults prefer saving over spending. Let [latex]X =[/latex] the number of American adults out of a random sample of 50 who prefer saving to spending.

  1. What is the probability distribution for [latex]X[/latex]?
  2. Use your calculator to find the following probabilities:
    1. the probability that 25 adults in the sample prefer saving over spending
    2. the probability that at most 20 adults prefer saving
    3. the probability that more than 30 adults prefer saving
  3. Using the formulas, calculate the (i) mean and (ii) standard deviation of [latex]X[/latex].
    Solution
    1. [latex]X \sim B(50,0.6)[/latex]
    2. Using the TI-83, 83+, 84 calculator with instructions as provided:
      1. [latex]P(x = 25) = \text{binompdf}(50, 0.6, 25) = 0.0405[/latex]
      2. [latex]P(x \le 20) = \text{binomcdf}(50, 0.6, 20) = 0.0034[/latex]
      3. [latex]P(x > 30) = 1 - \text{binomcdf}(50, 0.6, 30) = 1 – 0.5535 = 0.446[/latex]
    3. [latex]\text{Mean} = np = 50(0.6) = 30[/latex]
      [latex]\text{Standard Deviation} = \sqrt{npq} = \sqrt{50(0.6)(0.4)} \approx 3.4641[/latex]

    Your Turn!

    During the 2013 regular NBA season, DeAndre Jordan of the Los Angeles Clippers had the highest field goal completion rate in the league. DeAndre scored with 61.3% of his shots. Suppose you choose a random sample of 80 shots made by DeAndre during the 2013 season. Let [latex]X =[/latex] the number of shots that scored points.

    1. What is the probability distribution for [latex]X[/latex]?
    2. Using the formulas, calculate the (i) mean and (ii) standard deviation of [latex]X[/latex].
    3. Use your calculator to find the probability that DeAndre scored with 60 of these shots.
    4. Find the probability that DeAndre scored with more than 50 of these shots.
    Solution
    1. [latex]X \sim B(80, 0.613)[/latex]
    2. [latex]\text{Mean} = np = 80(0.613) = 49.04[/latex]
      [latex]\text{Standard Deviation} = \sqrt{npq} = \sqrt{80(0.613)(0.387)} \approx 4.3564[/latex]
    3. Using the calculator with instructions as provided:
      [latex]P(x=60)= \text{binompdf} (80,0.613,60)=0.0036[/latex]
    4. [latex]P(x > 50) = 1 – P(x \le 50) = 1 – \text{binomcdf}(80, 0.613, 50) = 1 – 0.6282 = 0.3718[/latex]

    Section 4.3 Review

    A statistical experiment can be classified as a binomial experiment if the following conditions are met:

    1. There are a fixed number of trials [latex]n[/latex].
    2. There are only two possible outcomes, called “success” and “failure” for each trial. The letter [latex]p[/latex] denotes the probability of a success on one trial and [latex]q[/latex] denotes the probability of a failure on one trial.
    3. The [latex]n[/latex] trials are independent and are repeated using identical conditions.

    The outcomes of a binomial experiment fit a binomial probability distribution. The random variable [latex]X = \text{the number of successes obtained in the } n \text{ independent trials}[/latex]. The mean of [latex]X[/latex] can be calculated using the formula [latex]\mu = np[/latex], and the standard deviation is given by the formula [latex]\sigma = \sqrt{npq}[/latex].

    Formula Review

    [latex]X \sim B(n,p)[/latex] means that the discrete random variable [latex]X[/latex] has a binomial probability distribution with [latex]n[/latex] trials and probability of success [latex]p[/latex].

    [latex]X= \text{the number of successes in } n \text{ independent trials}[/latex]

    [latex]n = \text{the number of independent trials}[/latex]

    [latex]X[/latex] takes on the values [latex]x = 0, 1, 2, 3, \ldots, n[/latex]

    [latex]p= \text{the probability of a success for any trial}[/latex]

    [latex]q = \text{the probability of a failure for any trial}[/latex]

    [latex]p+q=1[/latex]

    [latex]q = 1-p[/latex]

    The mean of [latex]X[/latex] is [latex]\mu = np[/latex]. The standard deviation of [latex]X[/latex] is  [latex]\sigma = \sqrt{npq}[/latex].

    Section 4.3 Practice

    Use the following information to answer the next eight exercises: The Higher Education Research Institute at UCLA collected data from 203,967 incoming first-time, full-time freshmen from 270 four-year colleges and universities in the U.S. 71.3% of those students replied that, yes, they believe that same-sex couples should have the right to legal marital status. Suppose that you randomly pick eight first-time, full-time freshmen from the survey. You are interested in the number that believes that same sex-couples should have the right to legal marital status.

    1. In words, define the random variable [latex]X[/latex].

    Solution

    [latex]X =[/latex] the number that reply “yes”

     

    2. [latex]X \sim \underline{\hspace{2cm}} ( \underline{\hspace{2cm}} , \underline{\hspace{2cm}} )[/latex]

    Solution

    [latex]B(8,0.713)[/latex]

     

    3. What values does the random variable X take on?

    Solution

    0, 1, 2, 3, 4, 5, 6, 7, 8

     

    4. Construct the probability distribution function (PDF).

     

    5. On average ([latex]\mu[/latex]), how many would you expect to answer yes?

    Solution

    5.7

     

    6. What is the standard deviation ([latex]\sigma[/latex])?

    Solution

    1.2795

     

    7. What is the probability that at most five of the freshmen reply “yes”?

    Solution

    0.4151

     

    8. What is the probability that at least two of the freshmen reply “yes”?

    Solution

    0.9990

    According to a recent article the average number of babies born with significant hearing loss (deafness) is approximately two per 1,000 babies in a healthy baby nursery. The number climbs to an average of 30 per 1,000 babies in an intensive care nursery.

    Suppose that 1,000 babies from healthy baby nurseries were randomly surveyed. Find the probability that exactly two babies were born deaf.

    Solution

    0.2709

    Use the following information to answer the next four exercises. Recently, a nurse commented that when a patient calls the medical advice line claiming to have the flu, the chance that he or she truly has the flu (and not just a nasty cold) is only about 4%. Of the next 25 patients calling in claiming to have the flu, we are interested in how many actually have the flu.

    1. Define the random variable and list its possible values.

    Solution

    [latex]X =[/latex]the number of patients calling in claiming to have the flu, who actually have the flu.

    [latex]X = 0, 1, 2, ...25[/latex]

     

    2. State the distribution of [latex]X[/latex].

    Solution

    [latex]B(25,0.04)[/latex]

     

    3. Find the probability that at least four of the 25 patients actually have the flu.

    Solution

    0.0165

     

    4. On average, for every 25 patients calling in, how many do you expect to have the flu?

    Solution

    one

    People visiting video rental stores often rent more than one DVD at a time. The probability distribution for DVD rentals per customer at Video To Go is given in the following table. There is a five-video limit per customer at this store, so nobody ever rents more than five DVDs.

    x P(x)
    0 0.03
    1 0.50
    2 0.24
    3
    4 0.07
    5 0.04
    1. Describe the random variable [latex]X[/latex] in words.
    2. Find the probability that a customer rents three DVDs.
    3. Find the probability that a customer rents at least four DVDs.
    4. Find the probability that a customer rents at most two DVDs.
    Solution
    1. [latex]X =[/latex] the number of DVDs a Video to Go customer rents
    2. 0.12
    3. 0.11
    4. 0.77

    A school newspaper reporter decides to randomly survey 12 students to see if they will attend Tet (Vietnamese New Year) festivities this year. Based on past years, she knows that 18% of students attend Tet festivities. We are interested in the number of students who will attend the festivities.

    1. In words, define the random variable [latex]X[/latex].
    2. List the values that [latex]X[/latex] may take on.
    3. Give the distribution of [latex]X[/latex]. [latex]X \sim \underline{\hspace{2cm}} ( \underline{\hspace{2cm}} , \underline{\hspace{2cm}} )[/latex]
    4. How many of the 12 students do we expect to attend the festivities?
    5. Find the probability that at most four students will attend.
    6. Find the probability that more than two students will attend.
    Solution
    1. [latex]X =[/latex] the number of students who will attend Tet.
    2. 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
    3. [latex]X \sim B(12,0.18)[/latex]
    4. 2.16
    5. 0.9511
    6. 0.3702

    Use the following information to answer the next three exercises. The probability that the San Jose Sharks will win any given game is 0.3694 based on a 13-year win history of 382 wins out of 1,034 games played (as of a certain date). An upcoming monthly schedule contains 12 games.

    1. The expected number of wins for that upcoming month is:

    a. 1.67
    b. 12
    c. [latex]\frac{382}{1043}[/latex]
    d. 4.43

    Solution

    d. 4.43

     

    2. Let [latex]X =[/latex] the number of games won in that upcoming month. What is the probability that the San Jose Sharks win six games in that upcoming month?

    a. 0.1476
    b. 0.2336
    c. 0.7664
    d. 0.8903

    Solution

    a

     

    3. What is the probability that the San Jose Sharks win at least five games in that upcoming month

    a. 0.3694
    b. 0.5266
    c. 0.4734
    d. 0.2305

    Solution

    c

    A student takes a ten-question true-false quiz but did not study and randomly guesses each answer. Find the probability that the student passes the quiz with a grade of at least 70% of the questions correct.

    Solution

    [latex]X =[/latex] number of questions answered correctly
    [latex]X \sim B(10, 0.5)[/latex]
    We are interested in AT LEAST 70% of ten questions correct. 70% of ten is seven. We want to find the probability that X is greater than or equal to seven. The event “at least seven” is the complement of “less than or equal to six”.
    Using your calculator’s distribution menu: 1 – binomcdf(10, .5, 6) gives 0.171875
    The probability of getting at least 70% of the ten questions correct when randomly guessing is approximately 0.172.

    A student takes a 32-question multiple-choice exam, but did not study and randomly guesses each answer. Each question has three possible choices for the answer. Find the probability that the student guesses more than 75% of the questions correctly.

    Solution
    • [latex]X =[/latex]number of questions answered correctly
    • [latex]X \sim B \left(\text{32, }\frac{\text{1}}{\text{3}}\right)[/latex]
    • We are interested in MORE THAN 75% of 32 questions correct. 75% of 32 is 24. We want to find [latex]P(x > 24)[/latex]. The event “more than 24” is the complement of “less than or equal to 24.”
    • Using your calculator’s distribution menu: 1 – binomcdf[latex]\left(\text{32, }\frac{\text{1}}{\text{3}},\text{ 24}\right)[/latex]
    • [latex]P(x > 24) = 0[/latex]
    • The probability of getting more than 75% of the 32 questions correct when randomly guessing is very small and practically zero.

    Six different colored dice are rolled. Of interest is the number of dice that show a one.

    1. In words, define the random variable [latex]X[/latex].
    2. List the values that [latex]X[/latex] may take on.
    3. Give the distribution of [latex]X[/latex]. [latex]X \sim \underline{\hspace{2cm}} ( \underline{\hspace{2cm}} , \underline{\hspace{2cm}} )[/latex]
    4. On average, how many dice would you expect to show a one?
    5. Find the probability that all six dice show a one.
    6. Is it more likely that three or that four dice will show a one? Use numbers to justify your answer numerically.
    Solution
    1. [latex]X =[/latex] the number of dice that show a one
    2. [latex]0, 1, 2, 3, 4, 5, 6[/latex]
    3. [latex]X \sim B(6,16)[/latex]
    4. 1
    5. 0.00002
    6. three dice

    More than 96 percent of the very largest colleges and universities (more than 15,000 total enrollments) have some online offerings. Suppose you randomly pick 13 such institutions. We are interested in the number that offer distance learning courses.

    1. In words, define the random variable [latex]X[/latex].
    2. List the values that [latex]X[/latex] may take on.
    3. Give the distribution of [latex]X[/latex]. [latex]X \sim \underline{\hspace{2cm}} ( \underline{\hspace{2cm}} , \underline{\hspace{2cm}} )[/latex]
    4. On average, how many schools would you expect to offer such courses?
    5. Find the probability that at most ten offer such courses.
    6. Is it more likely that 12 or that 13 will offer such courses? Use numbers to justify your answer numerically and answer in a complete sentence.
    Solution
    1. [latex]X =[/latex] the number of colleges and universities that offer online offerings.
    2. [latex]0, 1, 2, …, 13[/latex]
    3. [latex]X \sim B(13, 0.96)[/latex]
    4. 12.48
    5. 0.0135
    6. [latex]P(x = 12) = 0.3186[/latex]; [latex]P(x = 13) = 0.5882[/latex]; You are more likely to get 13.

    Suppose that about 85% of graduating students attend their graduation. A group of 22 graduating students is randomly chosen.

    1. In words, define the random variable [latex]X[/latex].
    2. List the values that [latex]X[/latex] may take on.
    3. Give the distribution of [latex]X[/latex]. [latex]X \sim \underline{\hspace{2cm}} ( \underline{\hspace{2cm}} , \underline{\hspace{2cm}} )[/latex]
    4. How many are expected to attend their graduation?
    5. Find the probability that 17 or 18 attend.
    6. Based on numerical values, would you be surprised if all 22 attended graduation? Justify your answer numerically.
    Solution
    1. [latex]X =[/latex] the number of students who attend their graduation
    2. [latex]0, 1, 2, …, 22[/latex]
    3. [latex]X \sim B(22, 0.85)[/latex]
    4. 18.7
    5. 0.3249
    6. [latex]P(x = 22) = 0.0280[/latex] (less than 3%) which is unusual

    At The Fencing Center, 60% of the fencers use the foil as their main weapon. We randomly survey 25 fencers at The Fencing Center. We are interested in the number of fencers who do not use the foil as their main weapon.

    1. In words, define the random variable [latex]X[/latex].
    2. List the values that [latex]X[/latex] may take on.
    3. Give the distribution of [latex]X[/latex]. [latex]X \sim \underline{\hspace{2cm}} ( \underline{\hspace{2cm}} , \underline{\hspace{2cm}} )[/latex]
    4. How many are expected to not to use the foil as their main weapon?
    5. Find the probability that six do not use the foil as their main weapon.
    6. Based on numerical values, would you be surprised if all 25 did not use foil as their main weapon? Justify your answer numerically.
    Solution
    1. [latex]X =[/latex] the number of fencers who do not use the foil as their main weapon
    2. [latex]0, 1, 2, 3,... 25[/latex]
    3. [latex]X \sim B(25,0.40)[/latex]
    4. 10
    5. 0.0442
    6. The probability that all 25 not use the foil is almost zero. Therefore, it would be very surprising.

    Approximately 8% of students at a local high school participate in after-school sports all four years of high school. A group of 60 seniors is randomly chosen. Of interest is the number who participated in after-school sports all four years of high school.

    1. In words, define the random variable [latex]X[/latex].
    2. List the values that [latex]X[/latex] may take on.
    3. Give the distribution of [latex]X[/latex]. [latex]X \sim \underline{\hspace{2cm}} ( \underline{\hspace{2cm}} , \underline{\hspace{2cm}} )[/latex]
    4. How many seniors are expected to have participated in after-school sports all four years of high school?
    5. Based on numerical values, would you be surprised if none of the seniors participated in after-school sports all four years of high school? Justify your answer numerically.
    6. Based upon numerical values, is it more likely that four or that five of the seniors participated in after-school sports all four years of high school? Justify your answer numerically.
    Solution
    1. [latex]X =[/latex] the number of high school students who participate in after school sports all four years of high school.
    2. [latex]0, 1, 2, …, 60[/latex]
    3. [latex]X \sim B(60, 0.08)[/latex]
    4. 4.8
    5. Yes, [latex]P(x = 0) = 0.0067[/latex], which is a small probability
    6. [latex]P(x = 4) = 0.1873[/latex], [latex]P(x = 5) = 0.1824[/latex]. More likely to get four.

    The chance of an IRS audit for a tax return with over $25,000 in income is about 2% per year. We are interested in the expected number of audits a person with that income has in a 20-year period. Assume each year is independent.

    1. In words, define the random variable [latex]X[/latex].
    2. List the values that [latex]X[/latex] may take on.
    3. Give the distribution of [latex]X[/latex]. [latex]X \sim \underline{\hspace{2cm}} ( \underline{\hspace{2cm}} , \underline{\hspace{2cm}} )[/latex]
    4. How many audits are expected in a 20-year period?
    5. Find the probability that a person is not audited at all.
    6. Find the probability that a person is audited more than twice.
    Solution
    1. [latex]X =[/latex] the number of audits in a 20-year period
    2. [latex]0, 1, 2, …, 20[/latex]
    3. [latex]X \sim B(20, 0.02)[/latex]
    4. 0.4
    5. 0.6676
    6. 0.0071

    It has been estimated that only about 30% of California residents have adequate earthquake supplies. Suppose you randomly survey 11 California residents. We are interested in the number who have adequate earthquake supplies.

    1. In words, define the random variable [latex]X[/latex].
    2. List the values that [latex]X[/latex] may take on.
    3. Give the distribution of [latex]X[/latex]. [latex]X \sim \underline{\hspace{2cm}} ( \underline{\hspace{2cm}} , \underline{\hspace{2cm}} )[/latex]
    4. What is the probability that at least eight have adequate earthquake supplies?
    5. Is it more likely that none or that all of the residents surveyed will have adequate earthquake supplies? Why?
    6. How many residents do you expect will have adequate earthquake supplies?
    Solution
    1. [latex]X =[/latex] the number of California residents who do have adequate earthquake supplies.
    2. [latex]0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11[/latex]
    3. [latex]X \sim B(11, 0.30)[/latex]
    4. 0.0043
    5. [latex]P(x = 0) = 0.0198[/latex]; [latex]P(x = 11) = 0[/latex] or none
    6. 3.3

    There are two similar games played for Chinese New Year and Vietnamese New Year. In the Chinese version, fair dice with numbers 1, 2, 3, 4, 5, and 6 are used, along with a board with those numbers. In the Vietnamese version, fair dice with pictures of a gourd, fish, rooster, crab, crayfish, and deer are used. The board has those six objects on it, also. We will play with bets being $1. The player places a bet on a number or object. The “house” rolls three dice. If none of the dice show the number or object that was bet, the house keeps the $1 bet. If one of the dice shows the number or object bet (and the other two do not show it), the player gets back his or her $1 bet, plus $1 profit. If two of the dice show the number or object bet (and the third die does not show it), the player gets back his or her $1 bet, plus $2 profit. If all three dice show the number or object bet, the player gets back his or her $1 bet, plus $3 profit. Let X = number of matches and Y = profit per game.

    1. In words, define the random variable [latex]X[/latex].
    2. List the values that [latex]X[/latex] may take on.
    3. Give the distribution of [latex]X[/latex]. [latex]X \sim \underline{\hspace{2cm}} ( \underline{\hspace{2cm}} , \underline{\hspace{2cm}} )[/latex]
    4. List the values that [latex]Y[/latex] may take on. Then, construct one PDF table that includes both [latex]X[/latex] and [latex]Y[/latex] and their probabilities.
    5. Calculate the average expected matches over the long run of playing this game for the player.
    6. Calculate the average expected earnings over the long run of playing this game for the player.
    7. Determine who has the advantage, the player or the house.
    Solution
    1. [latex]X =[/latex] the number of matches
    2. [latex]0, 1, 2, 3[/latex]
    3. [latex]X \sim B \left(3,\frac{1}{6}\right)[/latex]
    4. In dollars: −1, 1, 2, 3
    5. [latex]\frac{1}{2}[/latex]
    6. Multiply each [latex]Y[/latex] value by the corresponding [latex]X[/latex] probability from the PDF table. The answer is −0.0787. You lose about eight cents, on average, per game.
    7. The house has the advantage.

    According to The World Bank, only 9% of the population of Uganda had access to electricity as of 2009. Suppose we randomly sample 150 people in Uganda. Let [latex]X =[/latex] the number of people who have access to electricity.

    1. What is the probability distribution for [latex]X[/latex]?
    2. Using the formulas, calculate the mean and standard deviation of [latex]X[/latex].
    3. Use your calculator to find the probability that 15 people in the sample have access to electricity.
    4. Find the probability that at most ten people in the sample have access to electricity.
    5. Find the probability that more than 25 people in the sample have access to electricity.
    Solution
    1. [latex]X \sim B(150,0.09)[/latex]
    2. [latex]\text{Mean} = np = 150(0.09) = 13.5[/latex]; [latex]\text{Standard Deviation} =npq=150(0.09)(0.91)\approx{3.5050}[/latex]
    3. [latex]P(x = 15) = \text{binompdf} (150, 0.09, 15) = 0.0988[/latex]
    4. [latex]P(x \le 10) = \text{binomcdf} (150, 0.09, 10) = 0.1987[/latex]
    5. [latex]P(x > 25) = 1 – P(x \le 25) = 1 – \text{binomcdf}(150, 0.09, 25) = 1 – 0.9991 = 0.0009[/latex]

    The literacy rate for a nation measures the proportion of people age 15 and over that can read and write. The literacy rate in Afghanistan is 28.1%. Suppose you choose 15 people in Afghanistan at random. Let [latex]X =[/latex] the number of people who are literate.

    1. Sketch a graph of the probability distribution of [latex]X[/latex].
    2. Using the formulas, calculate the (i) mean and (ii) standard deviation of [latex]X[/latex].
    3. Find the probability that more than five people in the sample are literate. Is it more likely that three people or four people are literate?
    Solution
    1. [latex]X \sim B(15, 0.281)[/latex]
      This histogram shows a binomial probability distribution. It is made up of bars that are fairly normally distributed. The x-axis shows values from 0 to 15, with bars from 0 to 9. The y-axis shows values from 0 to 0.25 in increments of 0.05.
      1. [latex]\text{Mean} = \mu = np = 15(0.281) = 4.215[/latex]
      2. [latex]\text{Standard Deviation} = \sigma = \sqrt{npq} = \sqrt{15\left(0.281\right)\left(0.719\right)} = 1.7409[/latex]
    2. [latex]P(x > 5) = 1 – P(x \le 5) = 1 – \text{binomcdf} (15, 0.281, 5) = 1 – 0.7754 = 0.2246[/latex]
      [latex]P(x = 3) = \text{binompdf} (15, 0.281, 3) = 0.1927[/latex]
      [latex]P(x = 4) = \text{binompdf} (15, 0.281, 4) = 0.2259[/latex]
    3. It is more likely that four people are literate than three people are.

    References

    “Access to electricity (% of population),” The World Bank, 2013. Available online at http://data.worldbank.org/indicator/EG.ELC.ACCS.ZS?order=wbapi_data_value_2009%20wbapi_data_value%20wbapi_data_value-first&sort=asc (accessed May 15, 2015).

    “Distance Education.” Wikipedia. Available online at http://en.wikipedia.org/wiki/Distance_education (accessed May 15, 2013).

    “NBA Statistics – 2013,” ESPN NBA, 2013. Available online at http://espn.go.com/nba/statistics/_/seasontype/2 (accessed May 15, 2013).

    Newport, Frank. “Americans Still Enjoy Saving Rather than Spending: Few demographic differences seen in these views other than by income,” GALLUP® Economy, 2013. Available online at http://www.gallup.com/poll/162368/americans-enjoy-saving-rather-spending.aspx (accessed May 15, 2013).

    Pryor, John H., Linda DeAngelo, Laura Palucki Blake, Sylvia Hurtado, Serge Tran. The American Freshman: National Norms Fall 2011. Los Angeles: Cooperative Institutional Research Program at the Higher Education Research Institute at UCLA, 2011. Also available online at http://heri.ucla.edu/PDFs/pubs/TFS/Norms/Monographs/TheAmericanFreshman2011.pdf (accessed May 15, 2013).

    “The World FactBook,” Central Intelligence Agency. Available online at https://www.cia.gov/library/publications/the-world-factbook/geos/af.html (accessed May 15, 2013).

    “What are the key statistics about pancreatic cancer?” American Cancer Society, 2013. Available online at http://www.cancer.org/cancer/pancreaticcancer/detailedguide/pancreatic-cancer-key-statistics (accessed May 15, 2013).

    Media Attributions

    • Picture1
    definition

    License

    Icon for the Creative Commons Attribution-ShareAlike 4.0 International License

    Introductory Statistics Copyright © 2024 by LOUIS: The Louisiana Library Network is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, except where otherwise noted.