OBJECTIVES By the end of this section, I will be able to …
1 The Essential Idea About Hypothesis Testing for the Mean
Recall that in Section 9.1, we wanted to determine whether the population mean systolic blood pressure was less than 110 and we considered the hypotheses
We stated that a large difference between the observed sample mean and the hypothesized mean would result in the rejection of the null hypothesis . The question is, “How large is large?”
The test for the mean tells us when our results are statistically significant. To learn how this test works, consider the following: A sample of patients who are taking the medication shows a sample mean systolic blood pressure level of . Further assume that the population standard deviation systolic blood pressure reading is , and that the population of such readings is normal. Would this value represent sufficient evidence to reject and conclude that ?
Recall from Chapter 7 that the sampling distribution of the sample mean is the collection of sample means of all possible samples of size . When the population is normal, or the sample size is large, the sampling distribution of is approximately normal, with mean and standard error . The idea behind the test is to determine where our sample mean falls within the sampling distribution. Is somewhere near the middle of the sampling distribution, or is it an outlier? Now, if is true, then and we may standardize to get
498
Substituting, we get
In other words, lies 3 standard errors below the hypothesized mean . Thus, if we accept that the null hypothesis is true, then is an outlier, an extreme value (see Figure 1). That is, if is true, then the probability of observing is very small , because the corresponding -value lies in the tail of the distribution, and nearly all the values of are greater than 104.
Note: Here, we are using Facts 1–4 and the Central Limit Theorem from Chapter 7.
We are developing the test using a left-tailed test, but the same idea applies to right-tailed tests and twotailed tests, too.
Thus, we must choose one of the following two scenarios:
Developing Your Statistical Sense
The Data Prevail!
When faced with the above situation, because we don't want to base our decisions on “amazingly unlikely events,” we therefore would conclude that is not correct. Remember that the null hypothesis is just a conjecture, but the sample mean represents directly observable “hard data.” The scientific method states that, when there is a conflict between a conjecture and the observed data, the data prevail, and we need to rethink our null hypothesis.
This conclusion illustrates the essential idea about hypothesis testing for the mean.
The Essential Idea About Hypothesis Testing for the Mean
When the observed value of is unusual or extreme in the sampling distribution of that assumes is true, we should reject . Otherwise, we should not reject .
All the remaining parts of Sections 9.2–9.4, all the steps and all the calculations, are really just ways to implement this essential idea. Stated roughly: If the difference between and is large, then reject . Otherwise don't.
499
2 Test Statistic
The statistic that we calculated earlier
contains four quantities—three of which are taken from data. The sample mean and the sample size are characteristics of the sample data, and the population standard deviation σ represents the population data. Thus, we call this statistic .
The Test Statistic
The test statistic used for the test for the mean is
summarizes the information in the data set regarding the hypothesis test.
For the blood pressure data, we have
is an example of a test statistic, a statistic generated from a data set for the purposes of testing a statistical hypothesis. We will discuss several other test statistics throughout the remainder of the text. The hypothesis test in this section and Section 9.3 is called the Z test because the test statistic comes from the standard normal distribution.
EXAMPLE 7 Calculating
Clothing Store Sales
Retail stores need to generate sales. A sample of 100 customers' data was taken from the Chapter 9 Case Study data set, Clothing Store, and the total sales were obtained for each customer during the six-month time period. Suppose this sample showed a sample mean of . Assume the population standard deviation is . The district manager has set a goal to achieve a population mean of more than $413 total sales per customer.
Solution
Using our strategy for constructing the hypotheses from Section 9.1, the key words “more than” means “>,” and the “>” symbol occurs only in the right-tailed test. Answering the question “More than what?” is . Thus, our hypotheses are
where represents the population mean total sales per customer.
The sample size is , with a sample mean of , and . Thus,
NOW YOU CAN DO
Exercises 9–22.
YOUR TURN #3
For Example 7, suppose everything else stayed the same, but now . Calculate .
(The solution is shown in Appendix A.)
500
standardizes the distance between the sample mean and the hypothesized population mean , so that this distance is now on the standard normal scale. Thus, we can sometimes tell with a glance at , using our knowledge of the standard normal distribution (Section 6.4), whether is extreme or not, and therefore whether to reject. Specifically, we recall that almost all values of lie between −3 and 3. Here are two examples:
Of course, not all cases are as obvious as these, thus the need for the hypothesis testing procedure.
3 Critical Regions and Critical Values
In the critical-value method for the test, we compare with a threshold value, or critical value of , called . The value of separates into two regions (see Table 4):
represents the boundary between values of that are statistically significant and those that are not statistically significant. The value of depends on the value of , the probability of wrongly rejecting . A smaller value of will make it harder to reject , that is, harder to find statistical significance. Thus, is called the level of significance of the hypothesis test.
The value of depends on (a) the form of the hypothesis test, and (b) the level of significance . Table 4 shows values of for the most commonly used levels of significance . It also shows the location of the critical region.
Form of hypothesis test | |||
---|---|---|---|
Right-tailed | Left-tailed | Two-tailed | |
Level of significance | |||
0.10 | |||
0.05 | |||
0.01 | |||
Critical region | |||
Rejection rule: | Reject if | Reject if | Reject if or |
501
EXAMPLE 8 Finding and the critical region
For the hypotheses,
where represents the population mean systolic blood pressure, let the level of significance .
Solution
We have a left-tailed test and level of significance , so Table 4 tells us that the critical value is . The graph showing the critical region is provided in Figure 2. We would reject for values of that are .
NOW YOU CAN DO
Exercises 23–26.
YOUR TURN #4
In Example 7, we had the hypothesis test:
where represents the population mean total sales per customer. Let the level of significance .
(The solutions are shown in Appendix A.)
Developing Your Statistical Sense
Why Is It Called a Left-Tailed Test Mean? Right-Tailed Test? Two-Tailed Test?
A hypothesis test of the form
is called a left-tailed test because the critical region lies in the left (lower) tail. Similarly, a hypothesis test of the form
is called a right-tailed test because its critical region lies in the right (upper) tail. Finally, a hypothesis test of the form
is called a two-tailed test because its critical region occupies both the lower and upper tails.
502
4 Performing the test for the Mean Using the Critical-value Method
We are now ready to learn the steps for performing the test for the population mean using the critical-value method.
Test for the Population Mean : Critical-Value Method
When a random sample of size is taken from a population where the population standard deviation σ is known, you can use the test if (a) the population is normal, or (b) the sample size is large .
Step 1 State the hypotheses.
Use one of the forms from Table 4. State the meaning of .
Step 2 Find and state the rejection rule.
Use Table 4 and the given level of significance .
Step 3 Calculate .
Step 4 State the conclusion and the interpretation.
If falls in the critical region, then reject ; otherwise, do not reject . Interpret your conclusion.
What Does This Conclusion Mean?
Interpreting Your Conclusion
Recall that a data analyst needs to interpret the results so that the general public can understand them. You can use the following generic interpretation for the two possible conclusions. Just remember that generic interpretations are no substitute for thinking clearly about the problem and the implications of the conclusion.
Interpreting the Conclusion
For example, suppose our conclusion for the hypotheses in Example 8
was to reject . Then the interpretation of this conclusion would be: There is evidence at level of significance that the population mean systolic blood pressure reading is less than 110.
Next, we illustrate the critical-value method of performing a right-tailed test, a left-tailed test, and a two-tailed test for .
EXAMPLE 9 Test for , critical-value method, right-tailed test
Clothing Store Sales
For the situation in Example 7, test at level of significance whether the population mean total sales per customer is more than $413.
Solution
We may apply the test because the sample is large , and the population standard deviation σ is known.
503
Step 1 State the hypotheses.
From Example 7, our hypotheses are
where represents the population mean total sales per customer.
Step 2 Find and state the rejection rule.
We have a right-tailed test and level of significance , which, from Table 4, tell us that . Because we have a right-tailed test, the rejection rule will be “Reject if ,” that is, “Reject if ” (see Figure 3).
Step 3 Find .
From Example 7, we have .
Step 4 State the conclusion and interpretation.
Our rejection rule states that we will reject if . Because , which is not ≥ 2.33, the conclusion is to not reject (Figure 4). Even though the sample mean of exceeds , it does not do so by a wide enough margin to overcome the reasonable doubt that the difference between and may have been due to chance. We interpret our conclusion as follows: “There is insufficient evidence at the 0.01 level of significance that the population mean total sales is greater than $413 per customer over the six-month period.”
NOW YOU CAN DO
Exercises 37–40.
EXAMPLE 10 Test for , critical-value method, left-tailed test
For the hypotheses in Example 8, perform the test for the population mean, using level of significance . Assume systolic blood pressure is normally distributed.
Solution
We may use the test, because the population of systolic blood pressure readings is normally distributed, and the population standard deviation σ is known.
Step 1 State the hypotheses.
From Example 8, we have
where represents the population mean systolic blood pressure reading.
Step 2 Find and state the rejection rule.
Example 8 gives us the critical value , and Table 4 tells us that, for level of significance , we will reject if , that is, if (Figure 4).
Step 3 Calculate .
From page 499, we know that
Step 4 State the conclusion and the interpretation.
In Step 2, we stated that we would reject if . Our of , therefore, we reject . Our interpretation is: “There is evidence at level of significance that the population mean systolic blood pressure reading is less than 110.”
NOW YOU CAN DO
Exercises 41–44.
504
EXAMPLE 11 Test for , critical-value method, two-tailed test
When the level of hemoglobin in the blood is too low, a person is anemic. Unusually high levels of hemoglobin are also undesirable and can be associated with dehydration. The optimal hemoglobin level is 13.8 grams per deciliter (g/dl). Suppose a random sample of women at a certain college showed a sample mean hemoglobin of , the population standard deviation of hemoglobin level is , and hemoglobin level is normally distributed. We are interested in testing whether the population mean hemoglobin level differs from 13.8 g/dl. Perform the appropriate hypothesis test, using level of significance .
Solution
We may use the test, because the population of hemoglobin levels is normally distributed, and the population standard deviation σ is known.
Step 1 State the hypotheses.
The key words “differs from” indicate a two-tailed test, with . Thus, our hypotheses are
where represents the population mean hemoglobin level.
Step 2 Find and state the rejection rule.
We have a two-tailed test and level of significance . Using this information, Table 4 tells us that the critical value and that we will reject if or if (Figure 5).
Step 3 Calculate .
We have , , , and . Substituting:
Step 4 State the conclusion and the interpretation.
, which is ≤−1.645. Therefore we reject . There is evidence at level of significance that the population mean hemoglobin level differs from 13.8 g/dl.
NOW YOU CAN DO
Exercises 45–48.