Chapter 7: Confidence Intervals

7.4 Using Technology for Confidence Intervals

Learning Objectives

By the end of this section, the student should be able to:

  • use technology to find Confidence Intervals

This is another section to show technology tools that can be used to help with the statistics concepts taught in this chapter. This section will cover tools to help find Confidence Intervals.

Graphing Calculator

Similar to all the other technology sections, the main tool introduced will be the TI-83, 83+, 84, or 84+ Graphing Calculator. The graphing calculator can help to find values to help calculate the error bound formula and even construct the entire confidence interval.

 

A Confidence Interval with a Known Standard Deviation

We learned in 7.1 A Single Population Mean Using the Normal Distribution, to construct a confidence interval for a single unknown population, mean [latex]\mu[/latex], where the population standard deviation is known, we need [latex]\overline{x}[/latex] as an estimate for [latex]\mu[/latex] and we need the margin of error. Remember, the margin of error ([latex]\text{EBM}[/latex]) is called the error bound for a population mean (abbreviated [latex]\text{EBM}[/latex]). The sample mean [latex]\overline{x}[/latex] is the point estimate of the unknown population mean [latex]\mu[/latex].

The confidence interval estimate will have the form: [latex](\text{point estimate} - \text{error bound}, \text{point estimate} + \text{error bound})[/latex] or, in symbols, ([latex]\overline{x}-\text{EBM},\overline{x}+\text{EBM}[/latex]).

We did by-hand calculations to calculate the error bound [latex]\text{EBM}[/latex] and construct the confidence interval. The error bound formula for an unknown population mean, [latex]\mu[/latex], when the population standard deviation [latex]\sigma[/latex] is known is: [latex]\text{EBM} = {z}_{\frac{\alpha }{2}}\left(\frac{\sigma }{\sqrt{n}}\right)[/latex]. The by-hand work we did had us find the value of z that puts an area equal to the confidence level (in decimal form) in the middle of the standard normal distribution [latex]Z \sim N(0, 1)[/latex]:

  • The confidence level, [latex]\text{CL}[/latex], is the area in the middle of the standard normal distribution. [latex]\text{CL} = 1 - \alpha[/latex], so [latex]\alpha[/latex] is the area that is split equally between the two tails.
  • Each of the tails contains an area equal to [latex]\frac{\alpha}{2}[/latex].
  • The z-score that has an area to the right of [latex]\frac{\alpha }{2}[/latex] is denoted by [latex]{z}_{\frac{\alpha }{2}}[/latex].

For example, when [latex]\text{CL} = 0.95[/latex], [latex]\alpha = 0.05[/latex] and [latex]\frac{\alpha }{2} = 0.025[/latex]; we write [latex]{z}_{\frac{\alpha }{2}} = z_0.025[/latex]. The area to the right of [latex]z_{0.025}[/latex] is 0.025 and the area to the left of [latex]z_{0.025}[/latex] is [latex]1 – 0.025 = 0.975[/latex]. We then used a standard normal probability table to find [latex]{z}_{\frac{\alpha }{2}} = {z}_{0.025} = 1.96[/latex]. It was work, but we got there, right?

Well, the first thing we can do with the graphing calculator is to use the [latex]\text{invNorm}[/latex] function.

Finding the z-score for the stated Confidence Level using the TI-83, 83+, 84, or 84+ Graphing Calculator

  1. Access [DISTR] by pressing 2nd key, vars key. (Note: DISTR stands for "Distributions")
  2. The function to find is <invNorm()>
  3. The syntax is invNorm(Area to the Left of [latex]{z}_{\frac{\alpha }{2}}[/latex], 0, 1)

Legend

  • A blank calculator button represents a button press
  • [ ] represents yellow command or green letter behind a key
  • < > represents items on the screen

 

Notice, we are using 0 and 1 since we want a z-score and [latex]Z \sim N(0, 1)[/latex]. We still have to go through taking our [latex]\text{CL}[/latex], finding [latex]\alpha[/latex] and [latex]\frac{\alpha }{2}[/latex]. It is helpful to remember the graph below.

 

A normal curve with peak at [latex]\overline{x}[/latex] and [latex]CL = 1 - \alpha[/latex] region shaded.
Figure 1. A normal curve with peak at [latex]\overline{x}[/latex], points [latex]\overline{x}-\text{EBM}[/latex] and [latex]\overline{x}+\text{EBM}[/latex] labeled with each unshaded tail has area a [latex]\frac{\alpha }{2}[/latex]

 

For the example provided above, we would use [latex]\text{invNorm}(0.975, 0, 1)[/latex] and get [latex]z = 1.96[/latex]. We would take that z-score and use it to calculate the error bound [latex]\text{EBM}[/latex] and construct the confidence interval. Let's see another example.

Example

Suppose scores on exams in statistics are normally distributed with an unknown population mean and a population standard deviation of three points. A random sample of 36 scores is taken and gives a sample mean (sample mean score) of 68. Find a confidence interval estimate for the population mean exam score (the mean score on all exams).

Find a 90% confidence interval for the true (population) mean of statistics exam scores

We know the following from our problem:

  • [latex]\overline{x} = 68[/latex]
  • [latex]\sigma = 3[/latex]
  • [latex]n = 36[/latex]
  • The confidence level is 90%, so ([latex]\text{CL} = 0.90[/latex])

To find the confidence interval, you need the sample mean, [latex]\overline{x}[/latex], and the [latex]\text{EBM}[/latex], where [latex]\text{EBM} = \left({z}_{\frac{\alpha }{2}}\right) \left(\frac{\sigma }{\sqrt{n}}\right)[/latex]. Since [latex]\text{CL} = 0.90[/latex], [latex]\alpha = 1 – \text{CL} = 1 – 0.90 = 0.10[/latex], and [latex]\frac{\alpha }{2} = 0.05[/latex]. That makes [latex]{z}_{\frac{\alpha }{2}}={z}_{0.05}[/latex]. The area to the right of [latex]{z}_{0.05}[/latex] is 0.05 and the area to the left of [latex]{z}_{0.05}[/latex] is [latex]1 - 0.05 = 0.95[/latex].

Using [latex]\text{invNorm}(0.95, 0, 1)[/latex] on the TI-83, 83+, and 84+ calculator, we would get [latex]z=1.645[/latex].

  • [latex]\text{EBM} = (1.645)(\frac{3}{\sqrt{36}}) = 0.8225[/latex]
  • [latex]\overline{x} - \text{EBM} = 68 - 0.8225 = 67.1775[/latex]
  • [latex]\overline{x} + \text{EBM} = 68 + 0.8225 = 68.8225[/latex]

The 90% confidence interval is (67.1775, 68.8225).

Example

The Specific Absorption Rate (SAR) for a cell phone measures the amount of radio frequency (RF) energy absorbed by the user’s body when using the handset. Every cell phone emits RF energy. Different phone models have different SAR measures. To receive certification from the Federal Communications Commission (FCC) for sale in the United States, the SAR level for a cell phone must be no more than 1.6 watts per kilogram. The table below shows the highest SAR level for a random selection of cell phone models as measured by the FCC.

Table 1: Phone Models with different SAR measures
Phone Model SAR Phone Model SAR Phone Model SAR
Apple iPhone 4S 1.11 LG Ally 1.36 Pantech Laser 0.74
BlackBerry Pearl 8120 1.48 LG AX275 1.34 Samsung Character 0.5
BlackBerry Tour 9630 1.43 LG Cosmos 1.18 Samsung Epic 4G Touch 0.4
Cricket TXTM8 1.3 LG CU515 1.3 Samsung M240 0.867
HP/Palm Centro 1.09 LG Trax CU575 1.26 Samsung Messager III SCH-R750 0.68
HTC One V 0.455 Motorola Q9h 1.29 Samsung Nexus S 0.51
HTC Touch Pro 2 1.41 Motorola Razr2 V8 0.36 Samsung SGH-A227 1.13
Huawei M835 Ideos 0.82 Motorola Razr2 V9 0.52 SGH-a107 GoPhone 0.3
Kyocera DuraPlus 0.78 Motorola V195s 1.6 Sony W350a 1.48
Kyocera K127 Marbl 1.25 Nokia 1680 1.39 T-Mobile Concord 1.38

Find a 98% confidence interval for the true (population) mean of the Specific Absorption Rates (SARs) for cell phones. Assume that the population standard deviation is [latex]\sigma = 0.337[/latex].

For this problem, we first need to find the point estimate, the sample mean [latex]\overline{x}[/latex], before we can find the confidence interval. Reminder from 2.9 Using Technology for Descriptive Statistics, that we can use our List and 1-Var Stats features to do this. We would find [latex]\overline{x}=1.024[/latex]. To find the [latex]\text{EBM}[/latex], because you are creating a 98% confidence interval, [latex]\text{CL} = 0.98[/latex], you need to find [latex]z_{0.01}[/latex] having the property that the area under the normal density curve to the right of [latex]z_{0.01}[/latex] is 0.01 and the area to the left is 0.99. Using your calculator to find the z-score, [latex]\text{invNorm}(0.99, 0, 1)[/latex], we would get [latex]z=2.326[/latex].

  • [latex]EBM=(2.326)\frac{0.337}{\sqrt{30}}=0.1431[/latex]
  • [latex]\overline{x} - \text{EBM} = 1.024 - 0.1431 = 0.8809[/latex]
  • [latex]\overline{x} - \text{EBM} = 1.024 - 0.1431 = 1.1671[/latex]

 

Now, there is actually an even quicker way to use technology - and that's to have it construct the entire confidence interval.

Using the ZInterval in the TI-83, 83+, 84, or 84+ Graphing Calculator

  1. Access statistics mode by pressing stat key.

  2. Navigate to <TESTS> by using the arrow right key.
  3. Access <ZInterval>. Highlight it and press enter key.

Notes:

  • ZInterval is the confidence interval for mean when σ is known.
  • The confidence levels should be given as decimals (ex. enter ".95" for a 95% confidence level).
  • You can enter the data into a list and use <DATA> to have the calculator find the sample mean and standard deviation, or you may enter the sample mean and standard deviation directly by using <STATS>.

      Legend

      • A blank calculator button represents a button press
      • [ ] represents yellow command or green letter behind a key
      • < > represents items on the screen

       

      Let's look at some examples to use the ZInterval.

      Example

      Suppose scores on exams in statistics are normally distributed with an unknown population mean and a population standard deviation of three points. A random sample of 36 scores is taken and gives a sample mean (sample mean score) of 68. Find a confidence interval estimate for the population mean exam score (the mean score on all exams).

      Find a 90% confidence interval for the true (population) mean of statistics exam scores.

      Solution

      After accessing the ZInterval, arrow to Stats and press ENTER.

      Arrow down and enter three for [latex]\sigma[/latex], 68 for [latex]\overline{x}[/latex], 36 for n, and .90 for C-Level.

      Arrow down to Calculate and press ENTER.

      The confidence interval is (to three decimal places) (67.178, 68.822).

      Example

      The Specific Absorption Rate (SAR) for a cell phone measures the amount of radio frequency (RF) energy absorbed by the user’s body when using the handset. Every cell phone emits RF energy. Different phone models have different SAR measures. To receive certification from the Federal Communications Commission (FCC) for sale in the United States, the SAR level for a cell phone must be no more than 1.6 watts per kilogram. The table below shows the highest SAR level for a random selection of cell phone models as measured by the FCC.

      Table 2: Phone Models with different SAR measures
      Phone Model SAR Phone Model SAR Phone Model SAR
      Apple iPhone 4S 1.11 LG Ally 1.36 Pantech Laser 0.74
      BlackBerry Pearl 8120 1.48 LG AX275 1.34 Samsung Character 0.5
      BlackBerry Tour 9630 1.43 LG Cosmos 1.18 Samsung Epic 4G Touch 0.4
      Cricket TXTM8 1.3 LG CU515 1.3 Samsung M240 0.867
      HP/Palm Centro 1.09 LG Trax CU575 1.26 Samsung Messager III SCH-R750 0.68
      HTC One V 0.455 Motorola Q9h 1.29 Samsung Nexus S 0.51
      HTC Touch Pro 2 1.41 Motorola Razr2 V8 0.36 Samsung SGH-A227 1.13
      Huawei M835 Ideos 0.82 Motorola Razr2 V9 0.52 SGH-a107 GoPhone 0.3
      Kyocera DuraPlus 0.78 Motorola V195s 1.6 Sony W350a 1.48
      Kyocera K127 Marbl 1.25 Nokia 1680 1.39 T-Mobile Concord 1.38

      Find a 98% confidence interval for the true (population) mean of the Specific Absorption Rates (SARs) for cell phones. Assume that the population standard deviation is [latex]\sigma = 0.337[/latex].

      Solution

      Let's assume we didn't already find [latex]\overline{x}=1.024[/latex] for this problem. You would start by entering this data into a List in your calculator.

      Then, access the ZInterval, arrow to Data and press ENTER.

      Arrow down and enter 0.337 for [latex]\sigma[/latex], L1 for List (or the list name where you put the data), 1 for Freq, and .98 for C-Level.

      Arrow down to Calculate and press ENTER.

      The confidence interval is (to three decimal places) (0.881, 1.167).

      Or you could find [latex]\overline{x}=1.024[/latex] first by using 1-Var Stats, then access the ZInterval, arrow to Stats and press ENTER.

      Arrow down and enter the following values:

      • [latex]\sigma : 0.337[/latex]
      • [latex]\overline{x}: 1.024[/latex]
      • [latex]n: 30[/latex]
      • [latex]C-\text{Level} : 0.98[/latex]

      Arrow down to Calculate and press ENTER.

      The confidence interval is (to three decimal places) (0.881, 1.167).

       

      A Confidence Interval with an Unknown Standard Deviation

      Remember, in practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for [latex]\sigma[/latex] and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.

      In 7.2 A Single Population Mean using the Student t Distribution we saw how to find a confidence interval if the population standard deviation is not known and the population has an approximate normal distribution, by using the Student's t distribution.

      Calculators and computers can easily calculate any Student's t-probabilities. The TI-83,83+, and 84+ have a [latex]\text{tcdf}[/latex] function to find the probability for given values of t. The syntax for the [latex]\text{tcdf}[/latex] command is [latex]\text{tcdf(lower bound, upper bound, degrees of freedom)}[/latex]. However, for confidence intervals, we need to use inverse probability to find the value of t when we know the probability.

      • tpdf(x,df) yields the probability density function value (only useful to plot the student-t curve, in which case "x" is the variable)
      • tcdf(left bound, right bound, df) corresponds to P(left bound < t < right bound)

      For the TI-84+ you can use the [latex]\text{invT}[/latex] command on the [latex]\text{DISTRibution}[/latex] menu. The [latex]\text{invT}[/latex] command works similarly to the invnorm. The invT command requires two inputs: [latex]\text{invT(area to the left, degrees of freedom)}[/latex]. The output is the t-score that corresponds to the area we specified. Unfortunately, the TI-83 and 83+ do not have the [latex]\text{invT}[/latex] command.

      But we now know that there is an even quicker way to use technology - and that's to have it construct the entire confidence interval.

      Using the TInterval in the TI-83, 83+, 84, or 84+ Graphing Calculator

      1. Access statistics mode by pressing stat key.

      2. Navigate to <TESTS> by using the arrow right key.
      3. Access <TInterval>. Highlight it and press enter key.

      Notes:

      • TInterval is the confidence interval for mean when σ is unknown; s estimates σ.
      • The confidence levels should be given as decimals (ex. enter ".95" for a 95% confidence level).
      • You can enter the data into a list and use <DATA> to have the calculator find the sample mean and standard deviation, or you may enter the sample mean and standard deviation directly by using <STATS>.

          Legend

          • A blank calculator button represents a button press
          • [ ] represents yellow command or green letter behind a key
          • < > represents items on the screen

           

          Let's look at some examples to use the TInterval.

          Example

          Suppose you do a study of acupuncture to determine how effective it is in relieving pain. You measure sensory rates for 15 subjects with the results given.

          Use the sample data below to construct a 95% confidence interval for the mean sensory rate for the population (assumed normal) from which you took the data.

          8.6; 9.4; 7.9; 6.8; 8.3; 7.3; 9.2; 9.6; 8.7; 11.4; 10.3; 5.4; 8.1; 5.5; 6.9

          Solution

          You would start by entering this data in a List in your calculator.

          Then, access the TInterval, arrow to Data and press ENTER.

          Arrow down and enter L1 for List (or the list name where you put the data), 1 for Freq, and .95 for C-Level.

          Arrow down to Calculate and press ENTER.

          The confidence interval is (7.3006, 9.1527).

          Or you could find [latex]\overline{x}=8.2267[/latex] and [latex]s = 1.6722[/latex] first by using 1-Var Stats, then access the TInterval, arrow to Stats and press ENTER.

          Arrow down and enter 15 for n, L1 for List (or the list name where you put the data), 1 for Freq, and .95 for C-Level.

          Arrow down to Calculate and press ENTER.

          Example

          The Human Toxome Project (HTP) is working to understand the scope of industrial pollution in the human body. Industrial chemicals may enter the body through pollution or as ingredients in consumer products. In October 2008, the scientists at HTP tested cord blood samples for 20 newborn infants in the United States. The cord blood of the "In utero/newborn" group was tested for 430 industrial compounds, pollutants, and other chemicals, including chemicals linked to brain and nervous system toxicity, immune system toxicity, and reproductive toxicity, and fertility problems. There are health concerns about the effects of some chemicals on the brain and nervous system. The data below shows how many of the targeted chemicals were found in each infant’s cord blood.

          79; 145; 147; 160; 116; 100; 159; 151; 156; 126; 137; 83; 156; 94; 121; 144; 123; 114; 139; 99

          Use this sample data to construct a 90% confidence interval for the mean number of targeted industrial chemicals to be found in an infant’s blood.

          Solution

          Start by entering this data in a List in your calculator.

          Then, access the TInterval, arrow to Data and press ENTER.

          Arrow down and enter L1 for List (or the list name where you put the data), 1 for Freq, and .90 for C-Level.

          Arrow down to Calculate and press ENTER.

          The 90% confidence interval is (117.41, 137.49)

          Your Turn!

          A random sample of statistics students were asked to estimate the total number of hours they spend watching television in an average week. The responses are recorded in the table below. Use this sample data to construct a 98% confidence interval for the mean number of hours statistics students will spend watching television in one week.

          0; 3; 1; 20; 9; 5; 10; 1; 10; 4; 14; 2; 4; 4; 5

          Solution

          Start by entering this data in a List in your calculator.

          Then, access the TInterval, arrow to Data and press ENTER.

          Arrow down and enter L1 for List (or the list name where you put the data), 1 for Freq, and .98 for C-Level.

          Arrow down to Calculate and press ENTER.

          The 98% confidence interval is (2.3965, 9.8702).

           

          Confidence Interval for a Proportion

          Let's recall - how do you know you are dealing with a proportion problem?

          1. First, the underlying distribution is a binomial distribution. (There is no mention of a mean or average.)
          2. If [latex]X[/latex] is a binomial random variable, then [latex]X \sim B(n, p)[/latex] where [latex]n[/latex] is the number of trials and [latex]p[/latex] is the probability of a success.
          3. To form a proportion, take [latex]X[/latex], the random variable for the number of successes and divide it by [latex]n[/latex], the number of trials (or the sample size).
          4. The random variable [latex]P^{\prime}[/latex](read "P prime") is that proportion, [latex]{P}^{\prime }=\frac{X}{n}[/latex]

          By-hand, the calculation for the confidence interval of a proportion in 7.3 A Population Proportion is the most difficulty. Remember, we needed to find [latex]{p}^{\prime}[/latex], [latex]{q}^{\prime}[/latex], and [latex]\text{EBP}[/latex]:

          • [latex]{p}^{\prime }=\frac{x}{n}[/latex]
          • [latex]{q}^{\prime}= 1 - {p}^{\prime}[/latex]
          • [latex]\text{EBP}=\left({z}_{\frac{\alpha }{2}}\right)\sqrt{\frac{{p}^{\prime }{q}^{\prime }}{n}}[/latex]

          The confidence interval for the true binomial population proportion is [latex](p^{\prime} - \text{EBP}, p^{\prime} + \text{EBP})[/latex]. But who really wants to calculate that [latex]\text{EBP}=\left({z}_{\frac{\alpha }{2}}\right)\sqrt{\frac{{p}^{\prime }{q}^{\prime }}{n}}[/latex]?

          Well, [latex]P^{\prime}[/latex] follows a normal distribution for proportions. Which means, to find [latex]{z}_{\frac{\alpha }{2}}[/latex] in the [latex]\text{EBP}[/latex] formula, we could use the TI-83, 83+, or 84+ calculator command [latex]\text{invNorm}()[/latex]. So that would cut down on some of the work!

          But we now know that there is an even quicker way to use technology - and that's to have it construct the entire confidence interval.

          Using the 1-PropZInt in the TI-83, 83+, 84, or 84+ Graphing Calculator

          1. Access statistics mode by pressing stat key.

          2. Navigate to <TESTS> by using the arrow right key.
          3. Access <1-PropZInt>. Highlight it and press enter key.

          Notes:

          • 1-PropZInt is the confidence interval for proportion.
          • The confidence levels should be given as decimals (ex. enter ".95" for a 95% confidence level).
          • There is no <DATA> or <STATS> option for this interval.

              Legend

              • A blank calculator button represents a button press
              • [ ] represents yellow command or green letter behind a key
              • < > represents items on the screen

               

              The hardest thing to remember with these is that [latex]X[/latex] is the random variable for the number of successes and [latex]n[/latex] is the number of trials (or the sample size). Let's look at some examples to use the TInterval.

              Example

              Suppose that a market research firm is hired to estimate the percent of adults living in a large city who have cell phones. Five hundred randomly selected adult residents in this city are surveyed to determine whether they have cell phones. Of the 500 people surveyed, 421 responded yes - they own cell phones.

              Using a 95% confidence level, compute a confidence interval estimate for the true proportion of adult residents of this city who have cell phones.

              Solution

              Access 1-PropZint. Press ENTER.

              Arrow down to [latex]x[/latex] and enter 421.

              Arrow down to [latex]n[/latex] and enter 500.

              Arrow down to C-Level and enter .95.

              Arrow down to Calculate and press ENTER.

              The confidence interval is (0.81003, 0.87397).

              Example

              For a class project, a political science student at a large university wants to estimate the percent of students who are registered voters. He surveys 500 students and finds that 300 are registered voters. Compute a 90% confidence interval for the true percent of students who are registered voters and interpret the confidence interval.

              Solution

              Access 1-PropZint. Press ENTER.

              Arrow down to [latex]x[/latex] and enter 300.

              Arrow down to [latex]n[/latex] and enter 500.

              Arrow down to C-Level and enter 0.90.

              Arrow down to Calculate and press ENTER.

              The confidence interval is (0.564, 0.636).

              Your Turn!

              A student polls his school to see if students in the school district are for or against the new legislation regarding school uniforms. She surveys 600 students and finds that 480 are against the new legislation.

              In a sample of 300 students, 68% said they own an iPod and a smartphone. Compute a 97% confidence interval for the true percent of students who own an iPod and a smartphone.

              Solution

              Access 1-PropZint. Press ENTER.

              Arrow down to x and enter 300*0.68. (Note, here we are just letting the calculator find x for is)

              Arrow down to n and enter 300.

              Arrow down to C-Level and enter 0.97.

              Arrow down to Calculate and press ENTER.

              The confidence interval is (0.6216, 0.7384).

              Example

              A random sample of 25 statistics students was asked: “Have you smoked a cigarette in the past week?” Six students reported smoking within the past week.

              Use the “plus-four” method to find a 95% confidence interval for the true proportion of statistics students who smoke.

              Solution

              Access 1-PropZint. Press ENTER.

              Reminder: The plus-four method assumes an additional four trials: two successes and two failures. You do not need to change the process for calculating the confidence interval; simply update the values of x and n to reflect these additional trials.

              Arrow down to x and enter eight.

              Arrow down to n and enter 29.

              Arrow down to C-Level and enter 0.95.

              Arrow down to Calculate and press ENTER.

              The confidence interval is (0.113, 0.439).

              Your Turn!

              Out of a random sample of 65 freshmen at State University, 31 students have declared a major.

              Use the “plus-four” method to find a 96% confidence interval for the true proportion of freshmen at State University who have declared a major.

              Solution

              Access 1-PropZint. Press ENTER.

              Arrow down to x and enter 33.

              Arrow down to n and enter 69.

              Arrow down to C-Level and enter 0.96.

              Arrow down to Calculate and press ENTER.

              The confidence interval is (0.355, 0.602).

              Example

              The Berkman Center for Internet & Society at Harvard recently conducted a study analyzing the privacy management habits of teen internet users. In a group of 50 teens, 13 reported having more than 500 friends on Facebook. Use the “plus four” method to find a 90% confidence interval for the true proportion of teens who would report having more than 500 Facebook friends.

              Solution

              Access 1-PropZint. Press ENTER.

              Arrow down to x and enter 15.

              Arrow down to n and enter 54.

              Arrow down to C-Level and enter 0.90.

              Arrow down to Calculate and press ENTER.

              The confidence interval is (0.178, 0.378).

              Your Turn!

              The Berkman Center Study referenced in the previous example talked to teens in smaller focus groups, but also interviewed additional teens over the phone. When the study was complete, 588 teens had answered the question about their Facebook friends with 159 saying that they have more than 500 friends. Use the “plus-four” method to find a 90% confidence interval for the true proportion of teens that would report having more than 500 Facebook friends based on this larger sample. Compare the results to those in the previous example.

              Solution

              Access 1-PropZint. Press ENTER.

              Arrow down to x and enter 161.

              Arrow down to n and enter 592.

              Arrow down to C-Level and enter 0.90.

              Arrow down to Calculate and press ENTER.

              The confidence interval is (0.242, 0.302).

              The confidence interval for the larger sample is narrower than the interval from the previous example. Larger samples will always yield more precise confidence intervals than smaller samples. The “plus four” method has a greater impact on the smaller sample. It shifts the point estimate from 0.26 ([latex]\frac{13}{50}[/latex]) to 0.278 ([latex]\frac{15}{54}[/latex]). It has a smaller impact on the EPB, changing it from 0.102 to 0.100. In the larger sample, the point estimate undergoes a smaller shift: from 0.270 [latex](\frac{159}{588})[/latex] to 0.272 [latex](\frac{161}{592})[/latex]. It is easy to see that the plus-four method has the greatest impact on smaller samples.

               

              Helpful Videos for the Graphing Calculator

              Below are links to helpful videos for using the graphing calculator for the concepts covered on this page:

               

              Additional Technology Tools

              In addition to the graphing calculator, there are some additional technology tools that can be used for the concepts covered on this page. Below are links to helpful videos for those tools:

               

              Section Practice

              Among various ethnic groups, the standard deviation of heights is known to be approximately three inches. We wish to construct a 95% confidence interval for the mean height of male Swedes. Forty-eight male Swedes are surveyed. The sample mean is 71 inches. The sample standard deviation is 2.8 inches.

              1. [latex]\overline{x} =\underline{\hspace{2cm}}[/latex]
              2. [latex]\sigma = \underline{\hspace{2cm}}[/latex]
              3. [latex]n =\underline{\hspace{2cm}}[/latex]
              4. In words, define the random variables [latex]X[/latex] and [latex]\overline{X}[/latex].
              5. Which distribution should you use for this problem? Explain your choice.
              6. Construct a 95% confidence interval for the population mean height of male Swedes.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.
              7. What will happen to the level of confidence obtained if 1,000 male Swedes are surveyed instead of 48? Why?
              Solution
              1. 71
              2. 3
              3. 48
              4. [latex]X[/latex] is the height of a Swiss male and is the mean height from a sample of 48 Swiss males.
              5. Normal. We know the standard deviation for the population, and the sample size is greater than 30.
              6. Construct a 95% confidence interval for the population mean height of male Swedes.
                • [latex]\text{CI}: (70.151, 71.49)[/latex]
                • Graph below.
                • [latex]\text{EBM} = 0.849[/latex]
              7. The confidence interval will decrease in size, because the sample size increased. Recall, when all factors remain unchanged, an increase in sample size decreases variability. Thus, we do not need as large an interval to capture the true population mean.
              A normal distribution with an unknown shaded region between 70.15 and 71.85
              Figure 2. Normal Distribution with an unknown shaded region between 70.15 and 71.85

              Announcements for 84 upcoming engineering conferences were randomly picked from a stack of IEEE Spectrum magazines. The mean length of the conferences was 3.94 days, with a standard deviation of 1.28 days. Assume the underlying population is normal.

              1. In words, define the random variables [latex]X[/latex] and [latex]\overline{X}[/latex].
              2. Which distribution should you use for this problem? Explain your choice.
              3. Construct a 95% confidence interval for the population mean length of engineering conferences.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.

              A sample of 16 small bags of the same brand of candies was selected. Assume that the population distribution of bag weights is normal. The weight of each bag was then recorded. The mean weight was two ounces with a standard deviation of 0.12 ounces. The population standard deviation is known to be 0.1 ounce.

              1. Identify the following:
                • [latex]\overline{x} =\underline{\hspace{2cm}}[/latex]
                • [latex]\sigma  =\underline{\hspace{2cm}}[/latex]
                • [latex]s_{x} =\underline{\hspace{2cm}}[/latex]
              2. In words, define the random variable [latex]X[/latex].
              3. In words, define the random variable [latex]\overline{X}[/latex].
              4. Which distribution should you use for this problem? Explain your choice.
              5. Construct a 90% confidence interval for the population mean weight of the candies.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.
              6. Construct a 98% confidence interval for the population mean weight of the candies.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.
              7. In complete sentences, explain why the confidence interval in exercise 6 is larger than the confidence interval in exercise 5.
              8. In complete sentences, give an interpretation of what the interval in exercise 6 means.

              A camp director is interested in the mean number of letters each child sends during his or her camp session. The population standard deviation is known to be 2.5. A survey of 20 campers is taken. The mean from the sample is 7.9 with a sample standard deviation of 2.8.

              1. Identify the following:
                • [latex]\overline{x} =\underline{\hspace{2cm}}[/latex]
                • [latex]\sigma =\underline{\hspace{2cm}}[/latex]
                • [latex]n =\underline{\hspace{2cm}}[/latex]
              2. Define the random variables [latex]X[/latex] and [latex]\overline{X}[/latex] in words.
              3. Which distribution should you use for this problem? Explain your choice.
              4. Construct a 90% confidence interval for the population mean number of letters campers send home.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.
              5. What will happen to the error bound and confidence interval if 500 campers are surveyed? Why?
              Solution
              1. Identify the following:
                • 7.9
                • 2.5
                • 20
              2. [latex]X[/latex] is the number of letters a single camper will send home. [latex]\overline{X}[/latex] is the mean number of letters sent home from a sample of 20 campers.
              3. N[latex]7.9\left(\frac{2.5}{\sqrt{20}}\right)[/latex]
              4. Construct a 90% confidence interval for the population mean number of letters campers send home
                • [latex]\text{CI}: (6.98, 8.82)[/latex]
                • Graph below.
                • [latex]\text{EBM}: 0.92[/latex]
              5. The error bound and confidence interval will decrease.
              A normal distribution with an unknown shaded region between 6.98 and 8.882
              Figure 3. Normal Distribution with an unknown shaded region between 6.98 and 8.882

              The Federal Election Commission collects information about campaign contributions and disbursements for candidates and political committees each election cycle. During the 2012 campaign season, there were 1,619 candidates for the House of Representatives across the United States who received contributions from individuals. Below are the total receipts from individuals for a random selection of 40 House candidates rounded to the nearest $100. The standard deviation for this data to the nearest hundred is [latex]\sigma = \$ 909,200[/latex].

              $3,600; $1,243,900; $10,900; $385,200; $581,500; $7,400; $2,900; $400; $3,714,500; $632,500; $391,000; $467,400; $56,800; $5,800; $405,200; $733,200; $8,000; $468,700; $75,200; $41,000; $13,300; $9,500; $953,800; $1,113,500; $1,109,300; $353,900; $986,100; $88,600; $378,200; $13,200; $3,800; $745,100; $5,800; $3,072,100; $1,626,700; $512,900; $2,309,200; $6,600; $202,400; $15,800

              1. Find the point estimate for the population mean.
              2. Using 95% confidence, calculate the error bound.
              3. Create a 95% confidence interval for the mean total individual contributions.
              4. Interpret the confidence interval in the context of the problem.
              Solution
              1. [latex]\overline{x}[/latex] = $568,873
              2. [latex]\text{CL} = 0.95[/latex]; [latex]\alpha = 1 – 0.95 = 0.05[/latex]; [latex]{z}_{\frac{\alpha }{2}} = 1.96[/latex]
                [latex]\text{EBM} = {z}_{0.025}\frac{\sigma }{\sqrt{n}}[/latex] = 1.96 [latex]\frac{909200}{\sqrt{40}} = \$ 281,764[/latex]
              3. [latex]\overline{x} - \text{EBM} = 568,873 - 281,764 = 287,109[/latex]
                [latex]\overline{x} + \text{EBM} = 568,873+ 281,764 = 850,637[/latex]

                Alternate solution:

                1. Press STAT and arrow over to TESTS.
                2. Arrow down to 7:ZInterval.
                3. Press ENTER.
                4. Arrow to Stats and press ENTER.
                5. Arrow down and enter the following values:
                  • [latex]\sigma: 909,200[/latex]
                  • [latex]\overline{x}: 568,873[/latex]
                  • [latex]n: 40[/latex]
                  • [latex]\text{CL}: 0.95[/latex]
                6. Arrow down to Calculate and press ENTER.
                7. The confidence interval is ($287,114, $850,632).
                8. Notice the small difference between the two solutions—these differences are simply due to rounding errors in the hand calculations.
              4. We estimate with 95% confidence that the mean amount of contributions received from all individuals by House candidates is between $287,109 and $850,637.

              A pharmaceutical company makes tranquilizers. It is assumed that the distribution for the length of time they last is approximately normal. Researchers in a hospital used the drug on a random sample of nine patients. The effective period of the tranquilizer for each patient (in hours) was as follows: 2.7; 2.8; 3.0; 2.3; 2.3; 2.2; 2.8; 2.1; and 2.4.

              1. Identify the following:
                • [latex]\overline{x} = \underline{\hspace{2cm}}[/latex]
                • [latex]{s}_{x} = \underline{\hspace{2cm}}[/latex]
                • [latex]n = \underline{\hspace{2cm}}[/latex]
                • [latex]n – 1 = \underline{\hspace{2cm}}[/latex]
              2. Define the random variable [latex]X[/latex] in words.
              3. Define the random variable [latex]\overline{X}[/latex] in words.
              4. Which distribution should you use for this problem? Explain your choice.
              5. Construct a 95% confidence interval for the population mean length of time.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.
              6. What does it mean to be “95% confident” in this problem?

               

              Solution
              1. Identify the following:
                • [latex]\overline{x} = 2.51[/latex]
                • [latex]{s}_{x} = 0.318[/latex]
                • [latex]n = 9[/latex]
                • [latex]n - 1 = 8[/latex]
              2. the effective length of time for a tranquilizer
              3. the mean effective length of time of tranquilizers from a sample of nine patients
              4. We need to use a Student’s-t distribution, because we do not know the population standard deviation.
              5. Construct a 95% confidence interval
                • [latex]\text{CI}: (2.27, 2.76)[/latex]
                • Check student's solution.
                • [latex]\text{EBM}: 0.25[/latex]
              6. If we were to sample many groups of nine patients, 95% of the samples would contain the true population mean length of time.

              Suppose that 14 children, who were learning to ride two-wheel bikes, were surveyed to determine how long they had to use training wheels. It was revealed that they used them an average of six months with a sample standard deviation of three months. Assume that the underlying population distribution is normal.

              1. Identify the following:
                • [latex]\overline{x} = \underline{\hspace{2cm}}[/latex]
                • [latex]{s}_{x} = \underline{\hspace{2cm}}[/latex]
                • [latex]n = \underline{\hspace{2cm}}[/latex]
                • [latex]n – 1 = \underline{\hspace{2cm}}[/latex]
              2. Define the random variable [latex]X[/latex] in words.
              3. Define the random variable[latex]\overline{X}[/latex] in words.
              4. Which distribution should you use for this problem? Explain your choice.
              5. Construct a 99% confidence interval for the population mean length of time using training wheels.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.
              6. Why would the error bound change if the confidence level were lowered to 90%?

              The Federal Election Commission (FEC) collects information about campaign contributions and disbursements for candidates and political committees each election cycle. A political action committee (PAC) is a committee formed to raise money for candidates and campaigns. A Leadership PAC is a PAC formed by a federal politician (senator or representative) to raise money to help other candidates’ campaigns.

              The FEC has reported financial information for 556 Leadership PACs that operated during the 2011–2012 election cycle. The data below shows the total receipts during this cycle for a random selection of 20 Leadership PACs.

              $46,500.00

              $0

              $40,966.50

              $105,887.20

              $5,175.00

              $29,050.00

              $19,500.00

              $181,557.20

              $31,500.00

              $149,970.80

              $2,555,363.20

              $12,025.00

              $409,000.00

              $60,521.70

              $18,000.00

              $61,810.20

              $76,530.80

              $119,459.20

              $0

              $63,520.00

              $6,500.00

              $502,578.00

              $705,061.10

              $708,258.90

              $135,810.00

              $2,000.00

              $2,000.00

              $0

              $1,287,933.80

              $219,148.30

              [latex]\overline{x}=$251,854.23[/latex]

              [latex]s=\$521,130.41[/latex]

              Use this sample data to construct a 96% confidence interval for the mean amount of money raised by all Leadership PACs during the 2011–2012 election cycle. Use the Student's t-distribution.

              Solution

              Enter the data as a list.

              Press STAT and arrow over to TESTS.

              Arrow down to 8:TInterval.

              Press ENTER.

              Arrow to Data and press ENTER.

              Arrow down and enter the name of the list where the data is stored.

              Enter Freq: 1

              Enter C-Level: 0.96

              Arrow down to Calculate and press Enter.

              The 96% confidence interval is ($47,262, $456,447).

              The difference between solutions arises from rounding differences.

              Forbes magazine published data on the best small firms in 2012. These were firms that had been publicly traded for at least a year, have a stock price of at least $5 per share, and have reported annual revenue between $5 million and $1 billion. The data below shows the ages of the corporate CEOs for a random sample of these firms.

              48

              58

              51

              61

              56

              59

              74

              63

              53

              50

              59

              60

              60

              57

              46

              55

              63

              57

              47

              55

              57

              43

              61

              62

              49

              67

              67

              55

              55

              49

              Use this sample data to construct a 90% confidence interval for the mean age of CEO’s for these top small firms. Use the Student's t-distribution.

              Unoccupied seats on flights cause airlines to lose revenue. Suppose a large airline wants to estimate its mean number of unoccupied seats per flight over the past year. To accomplish this, the records of 225 flights are randomly selected and the number of unoccupied seats is noted for each of the sampled flights. The sample mean is 11.6 seats and the sample standard deviation is 4.1 seats.

              1. Identify the following information:
                • [latex]\overline{x} = \underline{\hspace{2cm}}[/latex]
                • [latex]{s}_{x} = \underline{\hspace{2cm}}[/latex]
                • [latex]n = \underline{\hspace{2cm}}[/latex]
                • [latex]n – 1 = \underline{\hspace{2cm}}[/latex]
              2. Define the random variables [latex]X[/latex] and [latex]\overline{X}[/latex] in words.
              3. Which distribution should you use for this problem? Explain your choice.
              4. Construct a 92% confidence interval for the population mean number of unoccupied seats per flight.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.

               

              Solution
              1. Identify the following information
                • [latex]\overline{x} = 11.6[/latex]
                • [latex]{s}_{x} = 4.1[/latex]
                • [latex]n = 225[/latex]
                • [latex]n - 1 = 224[/latex]
              2. [latex]X[/latex] is the number of unoccupied seats on a single flight. [latex]\overline{X}[/latex] is the mean number of unoccupied seats from a sample of 225 flights.
              3. We will use a Student’s-t distribution, because we do not know the population standard deviation.
              4. Construct a 92% confidence interval
                • [latex]\text{CI}: (11.12 , 12.08)[/latex]
                • Check student's solution.
                • [latex]\text{EBM}: 0.48[/latex]

              A national survey of 1,000 adults was conducted on May 13, 2013 by Rasmussen Reports. It concluded with 95% confidence that 49% to 55% of Americans believe that big-time college sports programs corrupt the process of higher education.

              1. Find the point estimate and the error bound for this confidence interval.
              2. Can we (with 95% confidence) conclude that more than half of all American adults believe this?
              3. Calculate a 75% confidence interval for the proportion of American adults that believe that major college sports programs corrupt higher education.
              4. Can we (with 75% confidence) conclude that at least half of all American adults believe this?
              Solution
              1. [latex]p^{\prime} = \frac{(0.55 + 0.49)}{2} = 0.52[/latex]; [latex]\text{EBP} = 0.55 - 0.52 = 0.03[/latex]
              2. No, the confidence interval includes values less than or equal to 0.50. It is possible that less than half of the population believe this.
              3. STAT TESTS A: 1-PropZinterval with [latex]x = (0.52)(1,000)[/latex], [latex]n = 1,000[/latex], [latex]\text{CL} = 0.75[/latex]. Answer is (0.502, 0.538)
              4. Yes – this interval does not fall less than 0.50 so we can conclude that at least half of all American adults believe that major sports programs corrupt education – but we do so with only 75% confidence.

              An article regarding interracial dating and marriage recently appeared in the Washington Post. Of the 1,709 randomly selected adults, 315 identified themselves as Latinos, 323 identified themselves as blacks, 254 identified themselves as Asians, and 779 identified themselves as whites. In this survey, 86% of blacks said that they would welcome a white person into their families. Among Asians, 77% would welcome a white person into their families, 71% would welcome a Latino, and 66% would welcome a black person.

              1. We are interested in finding the 95% confidence interval for the percent of all black adults who would welcome a white person into their families. Define the random variables [latex]X[/latex] and [latex]P^{\prime}[/latex], in words.
              2. Which distribution should you use for this problem? Explain your choice.
              3. Construct a 95% confidence interval.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.

              Use the information provided in the previous exercise to answer the following questions.

              1. Construct three 95% confidence intervals:
                • percent of all Asians who would welcome a white person into their families.
                • percent of all Asians who would welcome a Latino into their families.
                • percent of all Asians who would welcome a black person into their families.
              2. Even though the three-point estimates are different, do any of the confidence intervals overlap? Which?
              3. For any intervals that do overlap, in words, what does this imply about the significance of the differences in the true proportions?
              4. For any intervals that do not overlap, in words, what does this imply about the significance of the differences in the true proportions?
              Solution
              1. Construct three 95% confidence intervals
                • (0.72, 0.82)
                • (0.65, 0.76)
                • (0.60, 0.72)
              2. Yes, the intervals (0.72, 0.82) and (0.65, 0.76) overlap, and the intervals (0.65, 0.76) and (0.60, 0.72) overlap.
              3. We can say that there does not appear to be a significant difference between the proportion of Asian adults who say that their families would welcome a white person into their families and the proportion of Asian adults who say that their families would welcome a Latino person into their families.
              4. We can say that there is a significant difference between the proportion of Asian adults who say that their families would welcome a white person into their families and the proportion of Asian adults who say that their families would welcome a black person into their families.

              Stanford University conducted a study of whether running is healthy for men and women over age 50. During the first eight years of the study, 1.5% of the 451 members of the 50-Plus Fitness Association died. We are interested in the proportion of people over 50 who ran and died in the same eight-year period.

              1. Define the random variables [latex]X[/latex] and [latex]P^{\prime}[/latex] in words.
              2. Which distribution should you use for this problem? Explain your choice.
              3. Construct a 97% confidence interval for the population proportion of people over 50 who ran and died in the same eight–year period.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.
              4. Explain what a “97% confidence interval” means for this study.

              Refer to the previous example. Another question in the poll was “[How much are] you worried about the quality of education in our schools?” Sixty-three percent responded “a lot”. We are interested in the population proportion of adult Americans who are worried a lot about the quality of education in our schools.

              1. Define the random variables [latex]X[/latex] and [latex]P^{\prime}[/latex] in words.
              2. Which distribution should you use for this problem? Explain your choice.
              3. Construct a 95% confidence interval for the population proportion of adult Americans who are worried a lot about the quality of education in our schools.
                • State the confidence interval.
                • Sketch the graph.
                • Calculate the error bound.
              4. The sampling error given by Yankelovich Partners, Inc. (which conducted the poll) is [latex]\pm 3 \%[/latex]. In one to three complete sentences, explain what the [latex]\pm 3 \%[/latex] represents.

              License

              Icon for the Creative Commons Attribution-ShareAlike 4.0 International License

              Introductory Statistics Copyright © 2024 by LOUIS: The Louisiana Library Network is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, except where otherwise noted.