9.2 Test for the Population Mean: Critical-Value Method

OBJECTIVES By the end of this section, I will be able to …

  1. Explain the essential idea about hypothesis testing for the population mean.
  2. Calculate the test statistic .
  3. Find the critical region(s) and critical value(s) for a hypothesis test.
  4. Perform the test for the mean, using the critical-value method.

1 The Essential Idea About Hypothesis Testing for the Mean

Recall that in Section 9.1, we wanted to determine whether the population mean systolic blood pressure was less than 110 and we considered the hypotheses

We stated that a large difference between the observed sample mean and the hypothesized mean would result in the rejection of the null hypothesis . The question is, “How large is large?”

The test for the mean tells us when our results are statistically significant. To learn how this test works, consider the following: A sample of patients who are taking the medication shows a sample mean systolic blood pressure level of . Further assume that the population standard deviation systolic blood pressure reading is , and that the population of such readings is normal. Would this value represent sufficient evidence to reject and conclude that ?

Recall from Chapter 7 that the sampling distribution of the sample mean is the collection of sample means of all possible samples of size . When the population is normal, or the sample size is large, the sampling distribution of is approximately normal, with mean and standard error . The idea behind the test is to determine where our sample mean falls within the sampling distribution. Is somewhere near the middle of the sampling distribution, or is it an outlier? Now, if is true, then and we may standardize to get

498

Substituting, we get

In other words, lies 3 standard errors below the hypothesized mean . Thus, if we accept that the null hypothesis is true, then is an outlier, an extreme value (see Figure 1). That is, if is true, then the probability of observing is very small , because the corresponding -value lies in the tail of the distribution, and nearly all the values of are greater than 104.

Note: Here, we are using Facts 1–4 and the Central Limit Theorem from Chapter 7.

We are developing the test using a left-tailed test, but the same idea applies to right-tailed tests and twotailed tests, too.

image
Figure 9.1: FIGURE 1 An extreme value of calls for rejection of .

Thus, we must choose one of the following two scenarios:

  1. is true, the value of is accurate, and our observation of this extreme value of is an amazingly unlikely event.
  2. is not correct, and the true value of is closer to .

Developing Your Statistical Sense

The Data Prevail!

When faced with the above situation, because we don't want to base our decisions on “amazingly unlikely events,” we therefore would conclude that is not correct. Remember that the null hypothesis is just a conjecture, but the sample mean represents directly observable “hard data.” The scientific method states that, when there is a conflict between a conjecture and the observed data, the data prevail, and we need to rethink our null hypothesis.

This conclusion illustrates the essential idea about hypothesis testing for the mean.

The Essential Idea About Hypothesis Testing for the Mean

When the observed value of is unusual or extreme in the sampling distribution of that assumes is true, we should reject . Otherwise, we should not reject .

All the remaining parts of Sections 9.29.4, all the steps and all the calculations, are really just ways to implement this essential idea. Stated roughly: If the difference between and is large, then reject . Otherwise don't.

499

2 Test Statistic

The statistic that we calculated earlier

contains four quantities—three of which are taken from data. The sample mean and the sample size are characteristics of the sample data, and the population standard deviation σ represents the population data. Thus, we call this statistic .

The Test Statistic

The test statistic used for the test for the mean is

summarizes the information in the data set regarding the hypothesis test.

For the blood pressure data, we have

is an example of a test statistic, a statistic generated from a data set for the purposes of testing a statistical hypothesis. We will discuss several other test statistics throughout the remainder of the text. The hypothesis test in this section and Section 9.3 is called the Z test because the test statistic comes from the standard normal distribution.

EXAMPLE 7 Calculating

image

image Clothing Store Sales

Retail stores need to generate sales. A sample of 100 customers' data was taken from the Chapter 9 Case Study data set, Clothing Store, and the total sales were obtained for each customer during the six-month time period. Suppose this sample showed a sample mean of . Assume the population standard deviation is . The district manager has set a goal to achieve a population mean of more than $413 total sales per customer.

  1. Construct the hypotheses.
  2. Calculate the test statistic .

Solution

  1. Using our strategy for constructing the hypotheses from Section 9.1, the key words “more than” means “>,” and the “>” symbol occurs only in the right-tailed test. Answering the question “More than what?” is . Thus, our hypotheses are

    where represents the population mean total sales per customer.

  2. The sample size is , with a sample mean of , and . Thus,

NOW YOU CAN DO

Exercises 9–22.

YOUR TURN #3

For Example 7, suppose everything else stayed the same, but now . Calculate .

(The solution is shown in Appendix A.)

500

standardizes the distance between the sample mean and the hypothesized population mean , so that this distance is now on the standard normal scale. Thus, we can sometimes tell with a glance at , using our knowledge of the standard normal distribution (Section 6.4), whether is extreme or not, and therefore whether to reject. Specifically, we recall that almost all values of lie between −3 and 3. Here are two examples:

Of course, not all cases are as obvious as these, thus the need for the hypothesis testing procedure.

3 Critical Regions and Critical Values

In the critical-value method for the test, we compare with a threshold value, or critical value of , called . The value of separates into two regions (see Table 4):

  • The critical region consists of the range of values of the test statistic for which we reject the null hypothesis.
  • The noncritical region consists of the range of values of the test statistic for which we do not reject the null hypothesis.
  • The value of that separates the critical region from the noncritical region is called the critical value .

represents the boundary between values of that are statistically significant and those that are not statistically significant. The value of depends on the value of , the probability of wrongly rejecting . A smaller value of will make it harder to reject , that is, harder to find statistical significance. Thus, is called the level of significance of the hypothesis test.

The value of depends on (a) the form of the hypothesis test, and (b) the level of significance . Table 4 shows values of for the most commonly used levels of significance . It also shows the location of the critical region.

Table 9.7: Table 4 Table of critical values for common values of the level of significance
Form of hypothesis test
Right-tailed Left-tailed Two-tailed
Level of significance
0.10
0.05
0.01
Critical region image
Rejection rule: Reject if Reject if Reject if
or

501

EXAMPLE 8 Finding and the critical region

For the hypotheses,

where represents the population mean systolic blood pressure, let the level of significance .

  1. Find the critical value .
  2. Graph the distribution of , showing the critical region.

Solution

We have a left-tailed test and level of significance , so Table 4 tells us that the critical value is . The graph showing the critical region is provided in Figure 2. We would reject for values of that are .

image
Figure 9.2: FIGURE 2 Critical region for a left-tailed test lies in the left (lower) tail.

NOW YOU CAN DO

Exercises 23–26.

YOUR TURN #4

In Example 7, we had the hypothesis test:

where represents the population mean total sales per customer. Let the level of significance .

  1. Find the critical value .
  2. Graph the distribution of , showing the critical region.

(The solutions are shown in Appendix A.)

Developing Your Statistical Sense

Why Is It Called a Left-Tailed Test Mean? Right-Tailed Test? Two-Tailed Test?

A hypothesis test of the form

is called a left-tailed test because the critical region lies in the left (lower) tail. Similarly, a hypothesis test of the form

is called a right-tailed test because its critical region lies in the right (upper) tail. Finally, a hypothesis test of the form

is called a two-tailed test because its critical region occupies both the lower and upper tails.

502

4 Performing the test for the Mean Using the Critical-value Method

We are now ready to learn the steps for performing the test for the population mean using the critical-value method.

Test for the Population Mean : Critical-Value Method

When a random sample of size is taken from a population where the population standard deviation σ is known, you can use the test if (a) the population is normal, or (b) the sample size is large .

  • Step 1 State the hypotheses.

    Use one of the forms from Table 4. State the meaning of .

  • Step 2 Find and state the rejection rule.

    Use Table 4 and the given level of significance .

  • Step 3 Calculate .

  • Step 4 State the conclusion and the interpretation.

    If falls in the critical region, then reject ; otherwise, do not reject . Interpret your conclusion.

What Does This Conclusion Mean?

Interpreting Your Conclusion

Recall that a data analyst needs to interpret the results so that the general public can understand them. You can use the following generic interpretation for the two possible conclusions. Just remember that generic interpretations are no substitute for thinking clearly about the problem and the implications of the conclusion.

Interpreting the Conclusion

  • If you reject , the interpretation is: There is evidence at level of significance that [whatever says].
  • If you do not reject , the interpretation is: There is insufficient evidence at level of significance that [whatever says].

For example, suppose our conclusion for the hypotheses in Example 8

was to reject . Then the interpretation of this conclusion would be: There is evidence at level of significance that the population mean systolic blood pressure reading is less than 110.

Next, we illustrate the critical-value method of performing a right-tailed test, a left-tailed test, and a two-tailed test for .

EXAMPLE 9 Test for , critical-value method, right-tailed test

image Clothing Store Sales

For the situation in Example 7, test at level of significance whether the population mean total sales per customer is more than $413.

Solution

We may apply the test because the sample is large , and the population standard deviation σ is known.

503

  • Step 1 State the hypotheses.

    From Example 7, our hypotheses are

    where represents the population mean total sales per customer.

    image
    Figure 9.3: FIGURE 3 Critical region for a right-tailed test.
  • Step 2 Find and state the rejection rule.

    We have a right-tailed test and level of significance , which, from Table 4, tell us that . Because we have a right-tailed test, the rejection rule will be “Reject if ,” that is, “Reject if ” (see Figure 3).

  • Step 3 Find .

    From Example 7, we have .

  • Step 4 State the conclusion and interpretation.

    Our rejection rule states that we will reject if . Because , which is not ≥ 2.33, the conclusion is to not reject (Figure 4). Even though the sample mean of exceeds , it does not do so by a wide enough margin to overcome the reasonable doubt that the difference between and may have been due to chance. We interpret our conclusion as follows: “There is insufficient evidence at the 0.01 level of significance that the population mean total sales is greater than $413 per customer over the six-month period.”

NOW YOU CAN DO

Exercises 37–40.

EXAMPLE 10 Test for , critical-value method, left-tailed test

For the hypotheses in Example 8, perform the test for the population mean, using level of significance . Assume systolic blood pressure is normally distributed.

Solution

We may use the test, because the population of systolic blood pressure readings is normally distributed, and the population standard deviation σ is known.

  • Step 1 State the hypotheses.

    From Example 8, we have

    where represents the population mean systolic blood pressure reading.

    image
    Figure 9.4: FIGURE 4 Critical region for a left-tailed test.
  • Step 2 Find and state the rejection rule.

    Example 8 gives us the critical value , and Table 4 tells us that, for level of significance , we will reject if , that is, if (Figure 4).

  • Step 3 Calculate .

    From page 499, we know that

  • Step 4 State the conclusion and the interpretation.

    In Step 2, we stated that we would reject if . Our of , therefore, we reject . Our interpretation is: “There is evidence at level of significance that the population mean systolic blood pressure reading is less than 110.”

NOW YOU CAN DO

Exercises 41–44.

504

EXAMPLE 11 Test for , critical-value method, two-tailed test

When the level of hemoglobin in the blood is too low, a person is anemic. Unusually high levels of hemoglobin are also undesirable and can be associated with dehydration. The optimal hemoglobin level is 13.8 grams per deciliter (g/dl). Suppose a random sample of women at a certain college showed a sample mean hemoglobin of , the population standard deviation of hemoglobin level is , and hemoglobin level is normally distributed. We are interested in testing whether the population mean hemoglobin level differs from 13.8 g/dl. Perform the appropriate hypothesis test, using level of significance .

Solution

We may use the test, because the population of hemoglobin levels is normally distributed, and the population standard deviation σ is known.

  • Step 1 State the hypotheses.

    The key words “differs from” indicate a two-tailed test, with . Thus, our hypotheses are

    where represents the population mean hemoglobin level.

    image
    Figure 9.5: FIGURE 5 Critical region for a two-tailed test.
  • Step 2 Find and state the rejection rule.

    We have a two-tailed test and level of significance . Using this information, Table 4 tells us that the critical value and that we will reject if or if (Figure 5).

  • Step 3 Calculate .

    We have , , , and . Substituting:

  • Step 4 State the conclusion and the interpretation.

    , which is ≤−1.645. Therefore we reject . There is evidence at level of significance that the population mean hemoglobin level differs from 13.8 g/dl.

NOW YOU CAN DO

Exercises 45–48.