Chapter 8: Hypothesis Testing with One Sample

8.5 Additional Information and Full Hypothesis Test Examples

Learning Objectives

By the end of this section, the student should be able to:

  • Conduct and interpret hypothesis tests for a single population mean, population standard deviation known.
  • Conduct and interpret hypothesis tests for a single population mean, population standard deviation unknown.
  • Conduct and interpret hypothesis tests for a single population proportion.

Level of Significance and P-value

  • In a hypothesis test problem, you may see words such as "the level of significance is 1%." The "1%" is the preconceived or preset [latex]\alpha[/latex].
  • The statistician setting up the hypothesis test selects the value of α to use before collecting the sample data.
  • If no level of significance is given, a common standard to use is [latex]\alpha = 0.05[/latex].
  • When you calculate the p-value and draw the picture, the p-value is the area in the left tail, the right tail, or split evenly between the two tails. For this reason, we call the hypothesis test left-tailed, right-tailed, or two-tailed.
  • The alternative hypothesis, [latex]{H}_{a}[/latex], tells you if the test is left, right, or two-tailed. It is the key to conducting the appropriate test.
  • [latex]{H}_{a}[/latex] never has a symbol that contains an equal sign.
  • Thinking about the meaning of the p-value: A data analyst (and anyone else) should have more confidence that he made the correct decision to reject the null hypothesis with a smaller p-value (for example, 0.001 as opposed to 0.04) even if using the 0.05 level for alpha. Similarly, for a large p-value such as 0.4, as opposed to a p-value of 0.056 ([latex]\alpha = 0.05[/latex] is less than either number), a data analyst should have more confidence that she made the correct decision in not rejecting the null hypothesis. This makes the data analyst use judgment rather than mindlessly applying rules.
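The link between the form of the alternative hypothesis and the tail area can be sketched in a few lines of Python. This is a minimal illustration rather than part of the original text: the function name `p_value_from_z` and its tail labels are hypothetical, the z-scores are made up, and the sketch assumes the test statistic follows a standard normal distribution.

```python
from statistics import NormalDist

def p_value_from_z(z, tail):
    """Return the p-value for a z test statistic.

    tail is "left" (Ha uses <), "right" (Ha uses >), or "two" (Ha uses ≠).
    """
    std_normal = NormalDist()  # mean 0, standard deviation 1
    if tail == "left":
        return std_normal.cdf(z)
    if tail == "right":
        return 1 - std_normal.cdf(z)
    if tail == "two":
        # Area beyond |z| in one tail, doubled to cover both tails.
        return 2 * (1 - std_normal.cdf(abs(z)))
    raise ValueError("tail must be 'left', 'right', or 'two'")

# Illustrative z-scores (not taken from the text)
print(round(p_value_from_z(-1.5, "left"), 4))
print(round(p_value_from_z(1.5, "right"), 4))
print(round(p_value_from_z(1.5, "two"), 4))
```

For the same distance from the center, the two-tailed p-value is twice the one-tailed value, which is why the choice of [latex]{H}_{a}[/latex] matters before any data are examined.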

The following examples illustrate a left-, right-, and two-tailed test.

Example

[latex]{H}_{0}: \mu = 5, {H}_{a}: \mu \lt 5[/latex]

Test of a single population mean. [latex]{H}_{a}[/latex] tells you the test is left-tailed. The picture of the p-value is as follows:

Normal distribution curve of a single population mean with a value of 5 on the x-axis and the p-value points to the area on the left tail of the curve.
Figure 1. Normal distribution curve with a value of 5, as the population mean, and a p-value shaded region in the left tail.

Your Turn!

[latex]{H}_{0}: \mu = 10, {H}_{a}: \mu \lt 10[/latex]

Assume the p-value is 0.0935. What type of test is this? Draw the picture of the p-value.

Solution

left-tailed test

Normal distribution curve of a single population mean with a value of 10 on the x-axis and the p-value points to the area on the left tail of the curve.
Figure 2. Normal distribution curve with a value of 10, as the population mean, and a p-value shaded region in the left tail.

Example

[latex]{H}_{0}: p \le 0.2, {H}_{a}: p > 0.2[/latex]

This is a test of a single population proportion. [latex]{H}_{a}[/latex] tells you the test is right-tailed. The picture of the p-value is as follows:

Normal distribution curve of a single population proportion with the value of 0.2 on the x-axis. The p-value points to the area on the right tail of the curve.
Figure 3. Normal distribution curve with a value of 0.2, as the population proportion, and a p-value shaded region in the right tail.

Your Turn!

[latex]{H}_{0}: \mu \le 1, {H}_{a}: \mu > 1[/latex]

Assume the p-value is 0.1243. What type of test is this? Draw the picture of the p-value.

Solution

right-tailed test

Normal distribution curve of a single population mean with the value of 1 on the x-axis. The p-value points to the area on the right tail of the curve.
Figure 4. Normal distribution curve with a value of 1, as the population mean, and a p-value shaded region in the right tail.

Example

[latex]{H}_{0}: \mu = 50, {H}_{a}: \mu \neq 50[/latex]

This is a test of a single population mean. [latex]{H}_{a}[/latex] tells you the test is two-tailed. The picture of the p-value is as follows.

Normal distribution curve of a single population mean with a value of 50 on the x-axis. The p-value points to the areas in both tails.
Figure 5. Normal distribution curve with a value of 50, as the population mean. For a two-tailed test, an area of 1/2(p-value) is shaded in each tail.

Your Turn!

[latex]{H}_{0}: p = 0.5, {H}_{a}: p \neq 0.5[/latex]

Assume the p-value is 0.2564. What type of test is this? Draw the picture of the p-value.

Solution

two-tailed test

Normal distribution curve of a single population proportion with a value of 0.5 on the x-axis. The p-value points to the areas in both tails.
Figure 6. Normal distribution curve with a value of 0.5, as the population proportion. For a two-tailed test, an area of 1/2(p-value) is shaded in each tail.

Hypothesis Tests for a Single Population Mean, Population Standard Deviation Known

Now, let's use all the information we learned in this chapter to work full examples of hypothesis testing for a single sample. We will start with hypothesis tests for a single population mean when the population standard deviation is known.

Example

Jeffrey, as an eight-year-old, established a mean time of 16.43 seconds for swimming the 25-yard freestyle, with a standard deviation of 0.8 seconds. His dad, Frank, thought that Jeffrey could swim the 25-yard freestyle faster using goggles. Frank bought Jeffrey a new pair of expensive goggles and timed Jeffrey for 15 25-yard freestyle swims. For the 15 swims, Jeffrey's mean time was 16 seconds. Frank thought that the goggles helped Jeffrey to swim faster than the 16.43 seconds. Conduct a hypothesis test using a preset α = 0.05. Assume that the swim times for the 25-yard freestyle are normal.

Solution

Set up the Hypothesis Test:

Since the problem is about a mean, this is a test of a single population mean.

[latex]{H}_{0}: \mu = 16.43, {H}_{a}: \mu \lt 16.43[/latex]

For Jeffrey to swim faster, his time will be less than 16.43 seconds. The "<" tells you this is left-tailed.

Determine the distribution needed:

Random variable: [latex]\overline{X}[/latex] = the mean time to swim the 25-yard freestyle.

Distribution for the test: [latex]\overline{X}[/latex] is normal (population standard deviation is known: [latex]\sigma = 0.8[/latex])

[latex]\overline{X} \sim N\left(\mu ,\frac{{\sigma }_{X}}{\sqrt{n}}\right)[/latex]

Therefore, [latex]\overline{X} \sim N\left(16.43,\frac{0.8}{\sqrt{15}}\right)[/latex]

[latex]\mu = 16.43[/latex] comes from [latex]{H}_{0}[/latex] and not the data. [latex]\sigma = 0.8[/latex], and [latex]n = 15[/latex].

Calculate the p-value using the normal distribution for a mean:

[latex]\text{p-value} = 0.0187[/latex] (This is called the actual level of significance.) The p-value is the area to the left of the sample mean, which is given as 16.

Graph:

A normal distribution with 16.43 in the middle. A value of 16 with a shaded region shaded to the left of it as the p-value.
Figure 7. Normal distribution curve for the average time to swim the 25-yard freestyle with values 16, as the sample mean, and 16.43 on the x-axis. A shaded region in the left tail of the curve.

 

[latex]\mu = 16.43[/latex] comes from [latex]{H}_{0}[/latex]. Our assumption is [latex]\mu = 16.43[/latex].

Interpretation of the p-value: If [latex]{H}_{0}[/latex] is true, there is a 0.0187 probability (1.87%) that Jeffrey's mean time to swim the 25-yard freestyle is 16 seconds or less. Because a 1.87% chance is small, the mean time of 16 seconds or less is unlikely to have happened randomly. It is a rare event.

Compare [latex]\alpha[/latex] and the p-value:

[latex]\alpha = 0.05[/latex] and [latex]\text{p-value} = 0.0187[/latex], so [latex]\alpha > \text{p-value}[/latex]

Make a decision: Since [latex]\alpha > \text{p-value}[/latex], reject [latex]H_{0}[/latex].

This means that you reject [latex]\mu = 16.43[/latex]. In other words, you do not think Jeffrey swims the 25-yard freestyle in 16.43 seconds but faster with the new goggles.

Conclusion: At the 5% significance level, we conclude that Jeffrey swims faster using the new goggles. The sample data show there is sufficient evidence that Jeffrey's mean time to swim the 25-yard freestyle is less than 16.43 seconds.
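The numbers in this example can be reproduced with Python's standard library. The following is a minimal sketch, not the method used by the original text (which relies on a calculator or table); the variable names are chosen here for illustration.

```python
from math import sqrt
from statistics import NormalDist

mu_0 = 16.43     # mean under H0 (seconds)
sigma = 0.8      # known population standard deviation
n = 15           # number of timed swims
x_bar = 16.0     # observed sample mean
alpha = 0.05     # preset level of significance

standard_error = sigma / sqrt(n)
# Sampling distribution of the sample mean, assuming H0 is true
sampling_dist = NormalDist(mu=mu_0, sigma=standard_error)

p_value = sampling_dist.cdf(x_bar)        # left-tailed: area to the left of 16
z = (x_bar - mu_0) / standard_error       # standardized test statistic
decision = "reject H0" if p_value < alpha else "do not reject H0"

print(f"z = {z:.2f}, p-value = {p_value:.4f}, decision: {decision}")
```

Because the test is left-tailed, the p-value is the cumulative area up to the sample mean; a right-tailed test would use one minus that area instead.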

The Type I and Type II errors for this problem are as follows:

The Type I error is to conclude that Jeffrey swims the 25-yard freestyle, on average, in less than 16.43 seconds when, in fact, he actually swims the 25-yard freestyle, on average, in 16.43 seconds. (Reject the null hypothesis when the null hypothesis is true.)

The Type II error is that there is no evidence to conclude that Jeffrey swims the 25-yard free-style, on average, in less than 16.43 seconds when, in fact, he actually does swim the 25-yard free-style, on average, in less than 16.43 seconds. (Do not reject the null hypothesis when the null hypothesis is false.)

Historical Note

The traditional way to compare the two probabilities, [latex]\alpha[/latex] and the p-value, is to compare the critical value (z-score from [latex]\alpha[/latex]) to the test statistic (z-score from data). The calculated test statistic for the p-value is –2.08. (From the Central Limit Theorem, the test statistic formula is [latex]z=\frac{\overline{x}-{\mu }_{X}}{\left(\frac{{\sigma }_{X}}{\sqrt{n}}\right)}[/latex]. For this problem, [latex]\overline{x} = 16[/latex], [latex]\mu_{X} = 16.43[/latex] from the null hypothesis, [latex]\sigma_{X} = 0.8[/latex], and [latex]n = 15[/latex].) You can find the critical value for [latex]\alpha = 0.05[/latex] in the normal table. Remember to use the probability table found in the Back Matter - Statistics Tables to find the z-score.

The z-score for an area to the left equal to 0.05 is midway between –1.65 and –1.64 (0.05 is midway between 0.0505 and 0.0495). The z-score is –1.645. Since [latex]-1.645 > -2.08[/latex] (which demonstrates that [latex]\alpha > \text{p-value}[/latex]), reject [latex]H_{0}[/latex]. Traditionally, the decision to reject or not reject was done in this way. Today, comparing the two probabilities [latex]\alpha[/latex] and the p-value is very common. For this problem, the p-value, 0.0187, is considerably smaller than [latex]\alpha[/latex], 0.05. You can be confident about your decision to reject. The graph shows [latex]\alpha[/latex], the p-value, the test statistic, and the critical value.

As described in the text above.
Figure 8. Distribution curve comparing the α to the p-value.
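The critical-value comparison described in this note can also be sketched in code. This is an illustrative check only: it uses the inverse normal function from Python's standard library in place of the printed table, and the variable names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

alpha = 0.05
# Left-tailed test: the critical value cuts off an area of alpha on the left.
z_critical = NormalDist().inv_cdf(alpha)
# Test statistic from Jeffrey's data (x-bar = 16, mu = 16.43, sigma = 0.8, n = 15)
z_test = (16 - 16.43) / (0.8 / sqrt(15))

reject = z_test < z_critical   # test statistic falls in the rejection region
print(f"critical value = {z_critical:.3f}, test statistic = {z_test:.2f}")
print("reject H0" if reject else "do not reject H0")
```

Comparing z-scores and comparing probabilities always lead to the same decision, since a test statistic farther into the tail than the critical value has a tail area smaller than [latex]\alpha[/latex].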

Your Turn!

The mean throwing distance of a football for Marco, a high school freshman quarterback, is 40 yards, with a standard deviation of two yards. The team coach tells Marco to adjust his grip to get more distance. The coach records the distances for 20 throws. For the 20 throws, Marco’s mean distance was 45 yards. The coach thought the different grip helped Marco throw farther than 40 yards. Conduct a hypothesis test using a preset α = 0.05. Assume the throw distances for footballs are normal.

First, determine what type of test this is, set up the hypothesis test, find the p-value, sketch the graph, and state your conclusion.

Solution

Since the problem is about a mean, this is a test of a single population mean.

[latex]H_{0} : \mu = 40[/latex]

[latex]H_{a} : \mu > 40[/latex]

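[latex]\text{p-value} \approx 0[/latex] (the test statistic is [latex]z = \frac{45 - 40}{2/\sqrt{20}} \approx 11.18[/latex], so the area to the right of 45 is essentially zero)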

A normal distribution with 40 in the middle. A value of 45 with a shaded region shaded to the right of it as the p-value.
Figure 9. Normal distribution curve with values 40, as the population mean, and 45, as the point to determine the p-value

Because the p-value is less than [latex]\alpha = 0.05[/latex], we reject the null hypothesis. There is sufficient evidence to suggest that the change in grip improved Marco’s throwing distance.

Example

A college football coach thought that his players could bench press a mean weight of 275 pounds. It is known that the standard deviation is 55 pounds. Three of his players thought that the mean weight was more than that amount. They asked 30 of their teammates for their estimated maximum lift on the bench press exercise. The data ranged from 205 pounds to 385 pounds. The actual different weights were (frequencies are in parentheses) 205(3); 215(3); 225(1); 241(2); 252(2); 265(2); 275(2); 313(2); 316(5); 338(2); 341(1); 345(2); 368(2); and 385(1).

Conduct a hypothesis test using a 2.5% level of significance to determine if the bench press mean is more than 275 pounds.

Solution

Set up the Hypothesis Test:

Since the problem is about a mean weight, this is a test of a single population mean.

[latex]H_{0}: \mu = 275, H_{a}: \mu > 275[/latex]

This is a right-tailed test.

Determine the distribution needed:

Random variable: [latex]\overline{X}[/latex] = the mean weight, in pounds, lifted by the football players.

Distribution for the test: It is normal because [latex]\sigma[/latex] is known.

[latex]\overline{X} \sim N\left(275,\frac{55}{\sqrt{30}}\right)[/latex]

[latex]\overline{x}=286.2[/latex] pounds (from the data).

[latex]\sigma = 55 \text{ pounds}[/latex] (Always use [latex]\sigma[/latex] if you know it.) We assume [latex]\mu = 275 \text{ pounds}[/latex] unless our data shows us otherwise.

[latex]p\text{-value}= 0.1323[/latex].

Interpretation of the p-value: If [latex]H_0[/latex] is true, then there is a 0.1323 probability (13.23%) that the football players can lift a mean weight of 286.2 pounds or more. Because a 13.23% chance is large enough, a mean weight lift of 286.2 pounds or more is not a rare event.

A normal distribution with 275 in the middle. A value of 286.2 with a shaded region shaded to the right of it as p-value = 0.1323
Figure 10. Normal distribution curve of the average weight lifted by football players with values of 275 and 286.2 on the x-axis. The p-value points to the area to the right of 286.2.

 

Compare [latex]\alpha[/latex] and the p-value: [latex]\alpha = 0.025, \text{ p-value} = 0.1323[/latex]

Make a decision: Since [latex]\alpha \lt \text{p-value}[/latex], do not reject [latex]H_{0}[/latex].

Conclusion: At the 2.5% level of significance, from the sample data, there is not sufficient evidence to conclude that the true mean weight lifted is more than 275 pounds.
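As a check on the arithmetic, the sample mean and p-value can be computed directly from the frequency data. The sketch below is an illustration with hypothetical variable names. It computes the p-value twice: once with the sample mean rounded to 286.2, as in the text, and once with the unrounded mean, which gives a slightly different value of about 0.1331.

```python
from math import sqrt
from statistics import NormalDist

# weight lifted (pounds): frequency, as listed in the problem
lifts = {205: 3, 215: 3, 225: 1, 241: 2, 252: 2, 265: 2, 275: 2,
         313: 2, 316: 5, 338: 2, 341: 1, 345: 2, 368: 2, 385: 1}

n = sum(lifts.values())                               # sample size
x_bar = sum(w * f for w, f in lifts.items()) / n      # weighted sample mean

mu_0, sigma, alpha = 275, 55, 0.025
sampling_dist = NormalDist(mu_0, sigma / sqrt(n))     # distribution under H0

# Right-tailed test: area to the right of the sample mean
p_rounded = 1 - sampling_dist.cdf(round(x_bar, 1))    # uses 286.2
p_exact = 1 - sampling_dist.cdf(x_bar)                # uses the unrounded mean

print(f"n = {n}, x_bar = {x_bar:.4f}")
print(f"p-value (rounded mean) = {p_rounded:.4f}")
print(f"p-value (exact mean)   = {p_exact:.4f}")
print("reject H0" if p_rounded < alpha else "do not reject H0")
```

Either way the p-value is well above [latex]\alpha = 0.025[/latex], so the decision does not depend on the rounding.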

Hypothesis Tests for a Single Population Mean, Population Standard Deviation Unknown

We can also do full hypothesis testing for a single population mean when the population standard deviation is unknown.

Example

Statistics students believe that the mean score on the first statistics test is 65. A statistics instructor thinks the mean score is higher than 65. He samples ten statistics students and obtains the scores 65, 65, 70, 67, 66, 63, 63, 68, 72, and 71. He performs a hypothesis test using a 5% level of significance. The data are assumed to be from a normal distribution.

Solution

Set up the hypothesis test:

A 5% level of significance means that [latex]\alpha = 0.05[/latex]. This is a test of a single population mean.

[latex]H_{0}: \mu = 65, H_{a}: \mu > 65[/latex]

Since the instructor thinks the average score is higher, use a ">". The ">" means the test is right-tailed.

Determine the distribution needed:

Random variable: [latex]\overline{X}[/latex] = average score on the first statistics test.

Distribution for the test: If you read the problem carefully, you will notice that there is no population standard deviation given. You are only given [latex]n = 10[/latex] sample data values. Notice also that the data come from a normal distribution. This means that the distribution for the test is a Student's t. The distribution for the test is [latex]t_{9}[/latex] where [latex]n = 10[/latex] and [latex]df = 10 - 1 = 9[/latex].

Calculate the p-value using the Student's t-distribution:

[latex]\text{p-value} = 0.0396[/latex] where the sample mean and sample standard deviation are calculated as 67 and 3.1972 from the data.

Interpretation of the p-value: If the null hypothesis is true, then there is a 0.0396 probability (3.96%) that the sample mean is 67 or more.

A normal distribution with 65 in the middle. A value of 67 with a shaded region shaded to the right of it as p-value = 0.0396
Figure 11. Normal distribution curve of the scores of the first statistics test with values of 65 and 67 on the x-axis. The p-value points to the area to the right of 67.

 

Compare [latex]\alpha[/latex] and the p-value: Since [latex]\alpha = 0.05[/latex] and [latex]\text{p-value} = 0.0396[/latex], [latex]\alpha > \text{p-value}[/latex].

Make a decision: Since [latex]\alpha > \text{p-value}[/latex], reject [latex]H_{0}[/latex].

This means you reject [latex]\mu = 65[/latex]. In other words, you believe the average test score is more than 65.

Conclusion: At a 5% level of significance, the sample data show sufficient evidence that the mean (average) test score is more than 65, just as the statistics instructor thinks.
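The sample statistics and p-value for this example can be reproduced without a statistics package. The sketch below is one possible approach, not the text's method: it integrates the Student's t density numerically with Simpson's rule, and the helper names `t_pdf` and `t_right_tail` are hypothetical. A library routine for the t-distribution, where available, would normally be used instead.

```python
from math import gamma, pi, sqrt
from statistics import mean, stdev

def t_pdf(x, df):
    """Density of Student's t-distribution with df degrees of freedom."""
    coef = gamma((df + 1) / 2) / (sqrt(df * pi) * gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def t_right_tail(t, df, steps=2000):
    """P(T >= t), using Simpson's rule on [0, |t|]; steps must be even."""
    a = abs(t)
    h = a / steps
    total = t_pdf(0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central = total * h / 3      # area between 0 and |t|
    tail = 0.5 - central         # area beyond |t| in one tail
    return tail if t >= 0 else 1 - tail

scores = [65, 65, 70, 67, 66, 63, 63, 68, 72, 71]
n = len(scores)
x_bar = mean(scores)
s = stdev(scores)                          # sample standard deviation (divides by n - 1)
t_stat = (x_bar - 65) / (s / sqrt(n))      # 65 is the mean under H0
p_value = t_right_tail(t_stat, df=n - 1)   # right-tailed test

print(f"x_bar = {x_bar}, s = {s:.4f}, t = {t_stat:.4f}, p-value = {p_value:.4f}")
```

The sample standard deviation replaces the unknown [latex]\sigma[/latex], which is why the tail area comes from the t-distribution with 9 degrees of freedom rather than from the normal distribution.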

Example

It is believed that a stock price for a particular company will grow at a rate of $5 per week with a standard deviation of $1. An investor believes the stock won’t grow as quickly. The changes in stock price are recorded for ten weeks and are as follows: $4, $3, $2, $3, $1, $7, $2, $1, $1, $2. Perform a hypothesis test using a 5% level of significance. State the null and alternative hypotheses, find the p-value, state your conclusion, and identify the Type I and Type II errors.

Solution

[latex]H_{0}: \mu = 5[/latex]

[latex]H_{a}: \mu \lt 5[/latex]

[latex]\text{p-value} = 0.0013[/latex]

Because the p-value is less than [latex]\alpha = 0.05[/latex], we reject the null hypothesis. There is sufficient evidence to suggest that the stock price of the company grows at a rate less than $5 a week.

Type I Error: To conclude that the stock price is growing slower than $5 a week when, in fact, the stock price is growing at $5 a week (reject the null hypothesis when the null hypothesis is true).

Type II Error: To conclude that the stock price is growing at a rate of $5 a week when, in fact, the stock price is growing slower than $5 a week (do not reject the null hypothesis when the null hypothesis is false).
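The p-value in this solution can be traced through the test statistic. The intermediate values below are computed from the ten listed price changes and are not given in the original solution; [latex]\mu_{0}[/latex] is used here to denote the mean under the null hypothesis. The calculation uses the sample standard deviation with a Student's t-distribution, which is the approach that reproduces the stated p-value.

[latex]\overline{x} = \frac{26}{10} = 2.6, \quad s = \sqrt{\frac{30.4}{9}} \approx 1.838[/latex]

[latex]t = \frac{\overline{x} - \mu_{0}}{s/\sqrt{n}} = \frac{2.6 - 5}{1.838/\sqrt{10}} \approx -4.13, \quad df = 10 - 1 = 9[/latex]

The area to the left of [latex]t \approx -4.13[/latex] under the [latex]t_{9}[/latex] curve is approximately 0.0013.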

Hypothesis Tests for a Single Population Proportion

Lastly, we can do hypothesis tests for a single population proportion.

Example

Joon believes that 50% of first-time brides in the United States are younger than their grooms. She performs a hypothesis test to determine if the percentage is the same or different from 50%. Joon samples 100 first-time brides and 53 reply that they are younger than their grooms. For the hypothesis test, she uses a 1% level of significance.

Solution

Set up the hypothesis test:

The 1% level of significance means that α = 0.01. This is a test of a single population proportion.

[latex]H_{0}: p = 0.50, H_{a}: p \neq 0.50[/latex]

The words "is the same or different from" tell you this is a two-tailed test.

Determine the distribution needed:

Random variable: [latex]P^{\prime}[/latex] = the percentage of first-time brides who are younger than their grooms.

Distribution for the test: The problem contains no mention of a mean. The information is given in terms of percentages. Use the distribution for P′, the estimated proportion.

[latex]{P}^{\prime } \sim N\left(p,\sqrt{\frac{p\cdot q}{n}}\right)[/latex] Therefore, [latex]{P}^{\prime } \sim N\left(0.5,\sqrt{\frac{0.5\cdot 0.5}{100}}\right)[/latex] where [latex]p = 0.50[/latex], [latex]q = 1-p = 0.50[/latex], and [latex]n = 100[/latex].

Calculate the p-value using the normal distribution for proportions:

[latex]\text{p-value} = 0.5485[/latex] where [latex]x = 53[/latex], [latex]p^{\prime} = \frac{x}{n} = \frac{\text{53}}{100} = 0.53[/latex].

Interpretation of the p-value: If the null hypothesis is true, there is a 0.5485 probability (54.85%) that the sample (estimated) proportion [latex]p^{\prime}[/latex] is 0.53 or more OR 0.47 or less (see the graph below).

A normal distribution with 0.50 in the middle and values of 0.47 and 0.53 with shaded regions in both tails. 1/2(p-value) = 0.27425
Figure 12. Normal distribution curve of the percent of first-time brides who are younger than the groom with values of 0.47, 0.50, and 0.53 on the x-axis. 1/2(p-values) are calculated for the areas on outsides of 0.47 and 0.53.

 

[latex]\mu = p = 0.50[/latex] comes from [latex]H_{0}[/latex], the null hypothesis.

p′ = 0.53. Since the curve is symmetrical and the test is two-tailed, the p′ for the left tail is equal to 0.50 – 0.03 = 0.47 where μ = p = 0.50. (0.03 is the difference between 0.53 and 0.50.)

Compare [latex]\alpha[/latex] and the p-value:

Since [latex]\alpha = 0.01[/latex] and [latex]\text{p-value} = 0.5485[/latex], [latex]\alpha \lt \text{p-value}[/latex].

Make a decision: Since [latex]\alpha \lt \text{p-value}[/latex], you cannot reject [latex]H_{0}[/latex].

Conclusion: At the 1% level of significance, the sample data do not show sufficient evidence that the percentage of first-time brides who are younger than their grooms is different from 50%.

The Type I and Type II errors are as follows:

The Type I error is to conclude that the proportion of first-time brides who are younger than their grooms is different from 50% when, in fact, the proportion is actually 50%. (Reject the null hypothesis when the null hypothesis is true).

The Type II error is there is not enough evidence to conclude that the proportion of first time brides who are younger than their grooms differs from 50% when, in fact, the proportion does differ from 50%. (Do not reject the null hypothesis when the null hypothesis is false.)
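The two-tailed p-value in this example is the sum of two equal tail areas, one beyond 0.53 and one beyond 0.47. A short sketch of that calculation follows; the variable names are chosen here for illustration and the standard library stands in for the calculator used in the text.

```python
from math import sqrt
from statistics import NormalDist

p_0 = 0.50      # proportion under H0
n = 100         # number of brides sampled
x = 53          # number younger than their grooms
alpha = 0.01

p_prime = x / n
q_0 = 1 - p_0
# Sampling distribution of P' under H0
sampling_dist = NormalDist(p_0, sqrt(p_0 * q_0 / n))

distance = abs(p_prime - p_0)                         # 0.03 on either side of 0.50
left_area = sampling_dist.cdf(p_0 - distance)         # area below 0.47
right_area = 1 - sampling_dist.cdf(p_0 + distance)    # area above 0.53
p_value = left_area + right_area

print(f"p' = {p_prime}, p-value = {p_value:.4f}")
print("reject H0" if p_value < alpha else "do not reject H0")
```

Note that the standard deviation of the sampling distribution uses the hypothesized proportion 0.50, not the sample proportion 0.53, because the calculation assumes the null hypothesis is true.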

Your Turn!

A teacher believes that 85% of students in the class will want to go on a field trip to the local zoo. She performs a hypothesis test to determine if the percentage is the same or different from 85%. The teacher samples 50 students and 39 reply that they would want to go to the zoo. For the hypothesis test, use a 1% level of significance.

First, determine what type of test this is, set up the hypothesis test, find the p-value, sketch the graph, and state your conclusion.

Solution

Since the problem is about percentages, this is a test of a single population proportion.

[latex]H_{0}: p = 0.85, H_{a}: p \neq 0.85[/latex]

[latex]\text{p-value} = 0.1657[/latex]

A normal distribution with 0.85 in the middle and value of 0.78 to the left with shaded regions in both tails. 1/2(p-value) in both tails.
Figure 13. Normal distribution curve of the percent of students who want to go on a field trip with a value of 0.85 on the x-axis. 1/2(p-values) are calculated for the areas in both tails.

 

Because the p-value is greater than [latex]\alpha = 0.01[/latex], we fail to reject the null hypothesis. There is not sufficient evidence to suggest that the proportion of students that want to go to the zoo is not 85%.

Example

Suppose a consumer group suspects that the proportion of households that have three cell phones is 30%. A cell phone company has reason to believe that the proportion is not 30%. Before they start a big advertising campaign, they conduct a hypothesis test. Their marketing people survey 150 households with the result that 43 of the households have three cell phones.

  1. Set up the Hypothesis Test.
  2. Determine the distribution needed.
  3. Calculate [latex]p^{\prime}[/latex].
  4. What is a success for this problem?
  5. What is the level of significance?
  6. Draw the graph for this problem. Draw the horizontal axis. Label and shade appropriately. Calculate the p-value.
  7. Make a decision. [latex]\underline{\hspace{2cm}}[/latex](Reject/Do not reject) [latex]H_{0}[/latex] because [latex]\underline{\hspace{2cm}}[/latex].

Solution

  1. [latex]H_{0}: p = 0.30, H_{a}: p \neq 0.30[/latex]
  2. The random variable is [latex]P^{\prime} =[/latex] proportion of households that have three cell phones. The distribution for the hypothesis test is [latex]P^{\prime} \sim N\left(0.30,\sqrt{\frac{\left(0.30\right)\cdot \left(0.70\right)}{150}}\right)[/latex].
  3. The value that helps determine the p-value is [latex]p^{\prime}[/latex]. [latex]p^{\prime} = \frac{x}{n}[/latex] where [latex]x[/latex] is the number of successes and [latex]n[/latex] is the total number in the sample. [latex]x = 43, n = 150[/latex]. [latex]p^{\prime} = \frac{\text{43}}{\text{150}}[/latex]
  4. A success is having three cell phones in a household.
  5. The level of significance is the preset α. Since α is not given, assume that [latex]\alpha = 0.05[/latex].
  6. [latex]\text{p-value} = 0.7216[/latex]
  7. Assuming that [latex]\alpha = 0.05[/latex], [latex]\alpha \lt \text{p-value}[/latex]. The decision is do not reject [latex]H_{0}[/latex] because there is not sufficient evidence to conclude that the proportion of households that have three cell phones is not 30%.

Your Turn!

Marketers believe that 92% of adults in the United States own a cell phone. A cell phone manufacturer believes that number is actually lower. In a survey of 200 American adults, 174 report having cell phones. Use a 5% level of significance. State the null and alternative hypotheses, find the p-value, state your conclusion, and identify the Type I and Type II errors.

Solution

[latex]H_{0}: p = 0.92, H_{a}: p \lt 0.92[/latex]

[latex]\text{p-value} = 0.0046[/latex]

Because the p-value is less than [latex]\alpha = 0.05[/latex], we reject the null hypothesis. There is sufficient evidence to conclude that fewer than 92% of American adults own cell phones.

Type I Error: To conclude that fewer than 92% of American adults own cell phones when, in fact, 92% of American adults do own cell phones (reject the null hypothesis when the null hypothesis is true).

Type II Error: To conclude that 92% of American adults own cell phones when, in fact, fewer than 92% of American adults own cell phones (do not reject the null hypothesis when the null hypothesis is false).

The next example is a poem written by a statistics student named Nicole Hart. The solution to the problem follows the poem. Notice that the hypothesis test is for a single population proportion. This means that the null and alternative hypotheses use the parameter [latex]p[/latex]. The distribution for the test is normal. The estimated proportion [latex]p^{\prime}[/latex] is the proportion of fleas killed to the total fleas found on Fido. This is sample information. The problem gives a preconceived [latex]\alpha = 0.01[/latex] for comparison, and the solution includes a 95% confidence interval. The poem is clever and humorous, so please enjoy it!

Example

My dog has so many fleas,
They do not come off with ease.
As for shampoo, I have tried many types
Even one called Bubble Hype,
Which only killed 25% of the fleas,
Unfortunately I was not pleased.

I've used all kinds of soap,
Until I had given up hope
Until one day I saw
An ad that put me in awe.

A shampoo used for dogs
Called GOOD ENOUGH to Clean a Hog
Guaranteed to kill more fleas.

I gave Fido a bath
And after doing the math
His number of fleas
Started dropping by 3's!

Before his shampoo
I counted 42.
At the end of his bath,
I redid the math
And the new shampoo had killed 17 fleas.
So now I was pleased.

Now it is time for you to have some fun
With the level of significance being .01,
You must help me figure out
Use the new shampoo or go without?

 

Solution

Set up the hypothesis test:

[latex]H_{0}: p \le 0.25, H_{a}: p > 0.25[/latex]

Determine the distribution needed:

In words, CLEARLY state what your random variable [latex]\overline{X}[/latex] or [latex]P^{\prime}[/latex] represents.

[latex]P^{\prime} =[/latex] The proportion of fleas that are killed by the new shampoo

State the distribution to use for the test.

Normal: [latex]N\left(0.25,\sqrt{\frac{\left(0.25\right)\left(1-0.25\right)}{42}}\right)[/latex]

Test Statistic: [latex]z = 2.3163[/latex]

Calculate the p-value using the normal distribution for proportions:

[latex]\text{p-value} = 0.0103[/latex]

In one to two complete sentences, explain what the p-value means for this problem.

If the null hypothesis is true (the proportion is 0.25), then there is a 0.0103 probability that the sample (estimated) proportion is [latex]\left(\frac{17}{42}\right) = 0.4048[/latex] or more.

Use the previous information to sketch a picture of this situation. CLEARLY, label and scale the horizontal axis and shade the region(s) corresponding to the p-value.

Normal distribution curve of a single population proportion with a value of 0.25 on the x-axis. The p-value points to the area in the right tail.
Figure 14. Normal distribution graph of the proportion of fleas killed by the new shampoo with values of 0.25 and 0.4048 on the x-axis. The area to the right is shaded. The test statistic of the sample proportion is listed.

 

Compare [latex]\alpha[/latex] and the p-value: Indicate the correct decision (“reject” or “do not reject” the null hypothesis), the reason for it, and write an appropriate conclusion, using complete sentences.

alpha = 0.01

decision: Do not reject [latex]{H}_{0}[/latex]

reason for decision: [latex]\alpha \lt \text{p-value}[/latex]

Conclusion: At the 1% level of significance, the sample data do not show sufficient evidence that the percentage of fleas that are killed by the new shampoo is more than 25%.

Construct a 95% confidence interval for the true mean or proportion. Include a sketch of the graph of the situation.

Label the point estimate and the lower and upper bounds of the confidence interval.

Normal distribution curve of a single population proportion with a value of 17/42 on the x-axis. A shaded region of 0.95 is between 0.26 and 0.55
Figure 15. Normal distribution graph of the proportion of fleas killed by the new shampoo with values of 0.26, 17/42, and 0.55 on the x-axis. The area between these two points is equal to 0.95.

 

Confidence Interval: (0.26, 0.55) We are 95% confident that the true population proportion p of fleas that are killed by the new shampoo is between 26% and 55%.

Note

This test result is not very definitive since the p-value is very close to alpha. In reality, one would probably do more tests by giving the dog another bath after the fleas have had a chance to return.
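The test statistic, p-value, and confidence interval for the flea example can be checked with a short sketch. The variable names are hypothetical, and the interval uses the usual normal-approximation formula for a proportion, which is assumed here because the solution does not show how the bounds were computed.

```python
from math import sqrt
from statistics import NormalDist

killed, total = 17, 42
p_0 = 0.25                      # proportion killed by the old shampoo (H0)
p_prime = killed / total        # sample proportion, about 0.4048

# Hypothesis test: the standard error uses the null value p_0
se_null = sqrt(p_0 * (1 - p_0) / total)
z = (p_prime - p_0) / se_null
p_value = 1 - NormalDist().cdf(z)          # right-tailed test

# 95% confidence interval: the standard error uses the sample proportion
se_sample = sqrt(p_prime * (1 - p_prime) / total)
z_star = NormalDist().inv_cdf(0.975)       # about 1.96
lower = p_prime - z_star * se_sample
upper = p_prime + z_star * se_sample

print(f"z = {z:.4f}, p-value = {p_value:.4f}")
print(f"95% CI: ({lower:.2f}, {upper:.2f})")
```

The two standard errors differ on purpose: the test assumes [latex]p = 0.25[/latex], while the interval estimates [latex]p[/latex] from the sample. With a p-value of about 0.0103 against [latex]\alpha = 0.01[/latex], a single additional flea killed would have changed the decision, which is the point of the note above.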

Example

The National Institute of Standards and Technology provides exact data on conductivity properties of materials. Following are conductivity measurements for 11 randomly selected pieces of a particular type of glass.

1.11; 1.07; 1.11; 1.07; 1.12; 1.08; 0.98; 0.98; 1.02; 0.95; 0.95

Is there convincing evidence that the average conductivity of this type of glass is greater than one? Use a significance level of 0.05. Assume the population is normal.

Solution

Let’s follow a four-step process to answer this statistical question.

  1. State the Question: We need to determine if, at a 0.05 significance level, the average conductivity of the selected glass is greater than one. Our hypotheses will be [latex]H_{0}: \mu \le 1, H_{a}: \mu > 1[/latex]
  2. Plan: We are testing a sample mean without a known population standard deviation. Therefore, we need to use a Student's-t distribution. Assume the underlying population is normal.
  3. Do the calculations: This means to calculate the p-value and test statistic.
  4. State the Conclusions: Since the p-value (p-value = 0.036) is less than our alpha value of 0.05, we will reject the null hypothesis. It is reasonable to state that the data support the claim that the average conductivity level is greater than one.
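The calculations in step 3 are not shown in the solution. As a sketch, the values below are computed from the 11 listed measurements:

[latex]\overline{x} = \frac{11.44}{11} = 1.04, \quad s = \sqrt{\frac{0.0434}{10}} \approx 0.0659[/latex]

[latex]t = \frac{1.04 - 1}{0.0659/\sqrt{11}} \approx 2.01, \quad df = 11 - 1 = 10[/latex]

The area to the right of [latex]t \approx 2.01[/latex] under the [latex]t_{10}[/latex] curve is approximately 0.036, which agrees with the p-value used in step 4.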

Example

In a study of 420,019 cell phone users, 172 of the subjects developed brain cancer. Test the claim that cell phone users developed brain cancer at a greater rate than that for non-cell phone users (the rate of brain cancer for non-cell phone users is 0.0340%). Since this is a critical issue, use a 0.005 significance level. Explain why the significance level should be so low in terms of a Type I error.

Solution

We will follow the four-step process.

  1. We need to conduct a hypothesis test on the claimed cancer rate. Our hypotheses will be [latex]H_{0}: p \le 0.00034[/latex] and [latex]H_{a}: p > 0.00034[/latex].

    If we commit a Type I error, we reject a true null hypothesis and, in effect, accept a false claim that cell phone users develop brain cancer at a higher rate. Since the claim describes cancer-causing environments, we want to minimize the chances of incorrectly identifying causes of cancer.

  2. We will be testing a sample proportion with [latex]x = 172[/latex] and [latex]n = 420,019[/latex]. The sample is sufficiently large because we have [latex]np = 420,019(0.00034) = 142.8[/latex], [latex]nq = 420,019(0.99966) = 419,876.2[/latex], two independent outcomes, and a fixed probability of success [latex]p = 0.00034[/latex]. Thus we will be able to generalize our results to the population.
  3. Calculate the p-value and test statistic.
  4. Since the [latex]\text{p-value} = 0.0073[/latex] is greater than [latex]\alpha = 0.005[/latex], we cannot reject the null. Therefore, we conclude that there is not enough evidence to support the claim of higher brain cancer rates for the cell phone users.
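
The four steps above can be reproduced with a short Python sketch. It assumes Python 3.8 or later (for statistics.NormalDist) and uses the normal approximation to the binomial, as in the solution. The variable names are illustrative and not part of the text.

```python
import math
from statistics import NormalDist

x, n = 172, 420_019   # observed cases and sample size
p_0 = 0.00034         # brain cancer rate for non-cell phone users (0.0340%)
alpha = 0.005

# Check that the expected counts are large enough for the normal approximation.
print(f"n*p = {n * p_0:.1f}, n*q = {n * (1 - p_0):.1f}")

p_hat = x / n                                 # sample proportion
std_error = math.sqrt(p_0 * (1 - p_0) / n)    # standard error under H0
z = (p_hat - p_0) / std_error                 # test statistic
p_value = 1 - NormalDist().cdf(z)             # right-tailed test

print(f"p_hat = {p_hat:.6f}, z = {z:.2f}, p-value = {p_value:.4f}")
print("Reject H0" if p_value < alpha else "Do not reject H0")
```

The test statistic is about 2.44, which gives the p-value of roughly 0.0073 quoted in step 4.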

Your Turn!

According to the US Census there are approximately 268,608,618 residents aged 12 and older. Statistics from the Rape, Abuse, and Incest National Network indicate that, on average, 207,754 rapes occur each year (male and female) for persons aged 12 and older. This translates into a percentage of sexual assaults of 0.078%. In Daviess County, KY, 11 rapes were reported for a population of 37,937. Conduct an appropriate hypothesis test to determine if there is a statistically significant difference between the local sexual assault percentage and the national sexual assault percentage. Use a significance level of 0.01.

Solution

We will follow the four-step plan.

  1. We need to test whether the proportion of sexual assaults in Daviess County, KY is significantly different from the national average.
  2. Since we are presented with proportions, we will use a one-proportion z-test. The hypotheses for the test will be [latex]H_{0}: p = 0.00078[/latex] and [latex]H_{a}: p \neq 0.00078[/latex].
  3. Calculate the p-value and test statistic.
  4. Since the p-value, [latex]p = 0.00063[/latex], is less than the alpha level of 0.01, the sample data indicates that we should reject the null hypothesis. In conclusion, the sample data support the claim that the proportion of sexual assaults in Daviess County, Kentucky is different from the national average proportion.
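
Because this test is two-tailed, the p-value is twice the area beyond the test statistic in one tail. The sketch below wraps the one-proportion z-test in a small helper that handles left-, right-, and two-tailed alternatives. The function one_proportion_z_test and its tail parameter are hypothetical names introduced for illustration; the sketch assumes Python 3.8 or later.

```python
import math
from statistics import NormalDist

def one_proportion_z_test(x, n, p_0, tail="two"):
    """Return (z, p_value) for a one-proportion z-test.

    tail is "left", "right", or "two" and should match the alternative hypothesis.
    """
    p_hat = x / n
    std_error = math.sqrt(p_0 * (1 - p_0) / n)   # standard error under H0
    z = (p_hat - p_0) / std_error
    std_normal = NormalDist()
    if tail == "left":
        p_value = std_normal.cdf(z)
    elif tail == "right":
        p_value = 1 - std_normal.cdf(z)
    elif tail == "two":
        # Double the area beyond |z| in one tail.
        p_value = 2 * (1 - std_normal.cdf(abs(z)))
    else:
        raise ValueError("tail must be 'left', 'right', or 'two'")
    return z, p_value

alpha = 0.01
z, p_value = one_proportion_z_test(x=11, n=37_937, p_0=0.00078, tail="two")

print(f"z = {z:.2f}, p-value = {p_value:.5f}")
print("Reject H0" if p_value < alpha else "Do not reject H0")
```

The test statistic is about -3.42, so the local proportion lies well below the national figure, and the two-tailed p-value is close to the 0.00063 reported in step 4.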

Section 8.5 Review

The hypothesis test itself has an established process. This can be summarized as follows:

  • Determine [latex]H_{0}[/latex] and [latex]H_{a}[/latex]. Remember, they are contradictory.
  • Determine the random variable.
  • Determine the distribution for the test.
  • Draw a graph, calculate the test statistic, and use the test statistic to calculate the p-value. (A z-score and a t-score are examples of test statistics.)
  • Compare the preconceived [latex]\alpha[/latex] with the p-value, make a decision (reject or do not reject [latex]H_{0}[/latex]), and write a clear conclusion using English sentences.

Notice that in performing the hypothesis test, you use [latex]\alpha[/latex] and not [latex]\beta[/latex]. [latex]\beta[/latex] is needed to help determine the sample size of the data that is used in calculating the p-value. Remember that the quantity [latex]1 - \beta[/latex] is called the Power of the Test. A high power is desirable. If the power is too low, statisticians typically increase the sample size while keeping [latex]\alpha[/latex] the same. If the power is low, the null hypothesis might not be rejected when it should be.
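
To illustrate how power depends on sample size, the sketch below computes [latex]1 - \beta[/latex] for a left-tailed z-test with a known population standard deviation. All numbers in it (a null mean of 50, an assumed true mean of 48, and a standard deviation of 6) are hypothetical values chosen for illustration, and the function name power_left_tailed_z is likewise an assumption. It requires Python 3.8 or later.

```python
from math import sqrt
from statistics import NormalDist

def power_left_tailed_z(mu_0, mu_true, sigma, n, alpha=0.05):
    """Power of a left-tailed z-test of H0: mu >= mu_0 when the true mean is mu_true."""
    std_error = sigma / sqrt(n)
    # Largest sample mean that still leads to rejecting H0 at level alpha.
    critical_mean = mu_0 + NormalDist().inv_cdf(alpha) * std_error
    # Probability that the sample mean falls below that cutoff when mu = mu_true.
    return NormalDist(mu_true, std_error).cdf(critical_mean)

# Hypothetical scenario: alpha stays fixed while the sample size grows.
sample_sizes = [10, 25, 50, 100]
powers = [power_left_tailed_z(mu_0=50, mu_true=48, sigma=6, n=n) for n in sample_sizes]

for n, power in zip(sample_sizes, powers):
    print(f"n = {n:3d}: power = {power:.3f}, beta = {1 - power:.3f}")
```

With [latex]\alpha[/latex] held at 0.05, the power rises as n increases, which is why a larger sample is the usual remedy for low power.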

Section 8.5 Practice

Assume [latex]H_{0}: \mu = 9[/latex] and [latex]H_{a}: \mu \lt 9[/latex]. Is this a left-tailed, right-tailed, or two-tailed test?

Solution

This is a left-tailed test.

Assume [latex]H_{0}: \mu \le 6[/latex] and [latex]H_{a}: \mu > 6[/latex]. Is this a left-tailed, right-tailed, or two-tailed test?

Assume [latex]H_{0}: p = 0.25[/latex] and [latex]H_{a}: p \neq 0.25[/latex]. Is this a left-tailed, right-tailed, or two-tailed test?

Solution

This is a two-tailed test.

Draw the general graph of a left-tailed test.

Draw the graph of a two-tailed test.

Solution
Figure 16. Normal distribution curve centered at [latex]\mu[/latex] for a two-tailed test; the p-value is split between the two tails, with area 1/2(p-value) in each.

A bottle of water is labeled as containing 16 fluid ounces of water. You believe it is less than that. What type of test would you use?

Your friend claims that his mean golf score is 63. You want to show that it is higher than that. What type of test would you use?

Solution

a right-tailed test

A bathroom scale claims to be able to identify correctly any weight within a pound. You think that it cannot be that accurate. What type of test would you use?

You flip a coin and record whether it shows heads or tails. You know the probability of getting heads is 50%, but you think it is less for this particular coin. What type of test would you use?

Solution

a left-tailed test

If the alternative hypothesis has a not-equals [latex]( \neq )[/latex] symbol, which type of test do you use?

Assume the null hypothesis states that the mean is at least 18. Is this a left-tailed, right-tailed, or two-tailed test?

Solution

This is a left-tailed test.

Assume the null hypothesis states that the mean is at most 12. Is this a left-tailed, right-tailed, or two-tailed test?

Assume the null hypothesis states that the mean is equal to 88. The alternative hypothesis states that the mean is not equal to 88. Is this a left-tailed, right-tailed, or two-tailed test?

Solution

This is a two-tailed test.

A particular brand of tires claims that its deluxe tire averages at least 50,000 miles before it needs to be replaced. From past studies of this tire, the standard deviation is known to be 8,000. A survey of owners of that tire design is conducted. From the 28 tires surveyed, the mean lifespan was 46,500 miles with a standard deviation of 9,800 miles. Using [latex]\alpha = 0.05[/latex], is the data highly inconsistent with the claim?

Use the Hypothesis Testing with One Sample - Solution Sheet on the Introduction to Chapter 8 page. (Note: If you are using a Student's t-distribution, you may assume that the underlying population is normally distributed. In general, however, you must first prove that assumption.)

Solution
  1. [latex]H_0: \mu \ge 50,000[/latex]
  2. [latex]H_a: \mu \lt 50,000[/latex]
  3. Let [latex]\overline{X} =[/latex] the average lifespan of a brand of tires.
  4. normal distribution
  5. [latex]z = -2.315[/latex]
  6. [latex]\text{p-value} = 0.0103[/latex]
  7. Check student’s solution.
  8. Decision, reason, and conclusion:
    • Alpha: 0.05
    • Decision: Reject the null hypothesis.
    • Reason for decision: The p-value is less than 0.05.
    • Conclusion: There is sufficient evidence to conclude that the mean lifespan of the tires is less than 50,000 miles.
  9. (43537, 49463)
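
The values in this solution can be checked with a short Python sketch. Because the population standard deviation (8,000) is known, the test uses the normal distribution and the sample standard deviation of 9,800 is not needed. The sketch assumes Python 3.8 or later, and the variable names are illustrative.

```python
import math
from statistics import NormalDist

mu_0 = 50_000     # claimed mean lifespan under H0
sigma = 8_000     # known population standard deviation
n = 28
x_bar = 46_500    # sample mean
alpha = 0.05

std_error = sigma / math.sqrt(n)
z = (x_bar - mu_0) / std_error
p_value = NormalDist().cdf(z)                  # left-tailed test

# 95% confidence interval for the mean, using the known sigma.
z_crit = NormalDist().inv_cdf(1 - alpha / 2)
ci = (x_bar - z_crit * std_error, x_bar + z_crit * std_error)

print(f"z = {z:.3f}, p-value = {p_value:.4f}")
print(f"95% CI: ({ci[0]:.0f}, {ci[1]:.0f})")
print("Reject H0" if p_value < alpha else "Do not reject H0")
```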

In 1955, Life Magazine reported that the 25-year-old mother of three worked, on average, an 80-hour week. Recently, many groups have been studying whether or not the women's movement has, in fact, resulted in an increase in the average work week for women (combining employment and at-home work). Suppose a study was done to determine if the mean work week has increased. Eighty-one women were surveyed with the following results. The sample mean was 83; the sample standard deviation was ten. Does it appear that the mean work week has increased for women at the 5% level?

Use the Hypothesis Testing with One Sample - Solution Sheet on the Introduction to Chapter 8 page. (Note: If you are using a Student's t-distribution, you may assume that the underlying population is normally distributed. In general, however, you must first prove that assumption.)

Your statistics instructor claims that 60 percent of the students who take her Elementary Statistics class go through life feeling more enriched. For some reason that she can't quite figure out, most people don't believe her. You decide to check this out on your own. You randomly survey 64 of her past Elementary Statistics students and find that 34 feel more enriched as a result of her class. Now, what do you think?

Use the Hypothesis Testing with One Sample - Solution Sheet on the Introduction to Chapter 8 page. (Note: If you are using a Student's t-distribution, you may assume that the underlying population is normally distributed. In general, however, you must first prove that assumption.)

Solution
  1. [latex]H_0: p \ge 0.6[/latex]
  2. [latex]H_a: p \lt 0.6[/latex]
  3. Let [latex]P^{\prime} =[/latex] the proportion of students who feel more enriched as a result of taking Elementary Statistics.
  4. normal for a single proportion
  5. [latex]z = -1.12[/latex]
  6. [latex]\text{p-value} = 0.1308[/latex]
  7. Check student’s solution.
  8. Decision, reason, and conclusion:
    • Alpha: 0.05
    • Decision: Do not reject the null hypothesis.
    • Reason for decision: The p-value is greater than 0.05.
    • Conclusion: There is insufficient evidence to conclude that less than 60 percent of her students feel more enriched.
  9. Confidence Interval: (0.409, 0.654)

A Nissan Motor Corporation advertisement read, “The average man’s I.Q. is 107. The average brown trout’s I.Q. is 4. So why can’t man catch brown trout?” Suppose you believe that the brown trout’s mean I.Q. is greater than four. You catch 12 brown trout. A fish psychologist determines the I.Q.s as follows: 5; 4; 7; 3; 6; 4; 5; 3; 6; 3; 8; 5. Conduct a hypothesis test of your belief.

Use the Hypothesis Testing with One Sample - Solution Sheet on the Introduction to Chapter 8 page. (Note: If you are using a Student's t-distribution, you may assume that the underlying population is normally distributed. In general, however, you must first prove that assumption.)

Using the information from the previous problem, conduct a hypothesis test to see if your decision and conclusion would change if your belief were that the brown trout’s mean I.Q. is not four.

Use the Hypothesis Testing with One Sample - Solution Sheet on the Introduction to Chapter 8 page. (Note: If you are using a Student's t-distribution, you may assume that the underlying population is normally distributed. In general, however, you must first prove that assumption.)

Solution
  1. [latex]H_0: \mu = 4[/latex]
  2. [latex]H_a: \mu \neq 4[/latex]
  3. Let [latex]\overline{X} =[/latex] the average I.Q. of a set of brown trout.
  4. two-tailed Student's t-test
  5. [latex]t = 1.95[/latex]
  6. [latex]\text{p-value} = 0.076[/latex]
  7. Check student’s solution.
  8. Decision, reason, and conclusion:
    • Alpha: 0.05
    • Decision: Do not reject the null hypothesis.
    • Reason for decision: The p-value is greater than 0.05.
    • Conclusion: There is insufficient evidence to conclude that the average IQ of brown trout is not four.
  9. (3.8865, 5.9468)

According to an article in Newsweek, the natural ratio of girls to boys is 100:105. In China, the birth ratio is 100:114 (46.7% girls). Suppose you don’t believe the reported figures of the percent of girls born in China. You conduct a study. In this study, you count the number of girls and boys born in 150 randomly chosen recent births. There are 60 girls and 90 boys born of the 150. Based on your study, do you believe that the percentage of girls born in China is 46.7?

Use the Hypothesis Testing with One Sample - Solution Sheet on the Introduction to Chapter 8 page. (Note: If you are using a Student's t-distribution, you may assume that the underlying population is normally distributed. In general, however, you must first prove that assumption.)

References

Data from Amit Schitai. Director of Instructional Technology and Distance Learning. LBCC.

Data from Bloomberg Businessweek. Available online at http://www.businessweek.com/news/2011-09-15/nyc-smoking-rate-falls-to-record-low-of-14-bloomberg-says.html.

Data from energy.gov. Available online at http://energy.gov (accessed June 27, 2013).

Data from Gallup®. Available online at www.gallup.com (accessed June 27, 2013).

Data from Growing by Degrees by Allen and Seaman.

Data from La Leche League International. Available online at http://www.lalecheleague.org/Law/BAFeb01.html.

Data from the American Automobile Association. Available online at www.aaa.com (accessed June 27, 2013).

Data from the American Library Association. Available online at www.ala.org (accessed June 27, 2013).

Data from the Bureau of Labor Statistics. Available online at http://www.bls.gov/oes/current/oes291111.htm.

Data from the Centers for Disease Control and Prevention. Available online at www.cdc.gov (accessed June 27, 2013).

Data from the U.S. Census Bureau, available online at http://quickfacts.census.gov/qfd/states/00000.html (accessed June 27, 2013).

Data from the United States Census Bureau. Available online at http://www.census.gov/hhes/socdemo/language/.

Data from Toastmasters International. Available online at http://toastmasters.org/artisan/detail.asp?CategoryID=1&SubCategoryID=10&ArticleID=429&Page=1.

Data from Weather Underground. Available online at www.wunderground.com (accessed June 27, 2013).

Federal Bureau of Investigation. “Uniform Crime Reports and Index of Crime in Daviess in the State of Kentucky enforced by Daviess County from 1985 to 2005.” Available online at http://www.disastercenter.com/kentucky/crime/3868.htm (accessed June 27, 2013).

“Foothill-De Anza Community College District.” De Anza College, Winter 2006. Available online at http://research.fhda.edu/factbook/DAdemofs/Fact_sheet_da_2006w.pdf.

Johansen, C., J. Boice, Jr., J. McLaughlin, J. Olsen. “Cellular Telephones and Cancer—a Nationwide Cohort Study in Denmark.” Institute of Cancer Epidemiology and the Danish Cancer Society, 93(3):203-7. Available online at http://www.ncbi.nlm.nih.gov/pubmed/11158188 (accessed June 27, 2013).

Rape, Abuse & Incest National Network. “How often does sexual assault occur?” RAINN, 2009. Available online at http://www.rainn.org/get-information/statistics/frequency-of-sexual-assault (accessed June 27, 2013).

License

Introductory Statistics Copyright © 2024 by LOUIS: The Louisiana Library Network is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, except where otherwise noted.