Processing math: 60%

9.4 t Test for the Population Mean

OBJECTIVES By the end of this section, I will be able to …

  1. Perform the t test for the mean using the critical-value method.
  2. Perform the t test for the mean using the p-value method.
  3. Use confidence intervals to perform two-tailed hypothesis tests.

1 t Test for μ Using the Critical-Value Method

In many real-world scenarios, the value of the population standard deviation s is unknown. When this occurs, we should use neither the Z interval nor the Z test. Recall that in Section 8.2, we used the t distribution to find a confidence interval for the mean when σ was not known. The situation is similar for hypothesis testing.

Let ˉx be the sample mean, μ be the unknown population mean, s be the sample standard deviation, and n be the sample size. The t statistic

t=ˉx-μs/n

with n-1 degrees of freedom may be used when either the population is normal or the sample size is large. We call this t statistic tdata because its value depends largely on the sample data.

The test statistic used for the t test for the mean is

tdata=ˉx-μ0s/n

tdata represents the number of standard errors ˉx lies above or below µ0.

Extreme values of ˉx, that is, values of ˉx that are significantly far from the hypothesized μ, will translate into extreme values of tdata. In other words, just as with Zdata, when ˉx is far from μ0, tdata will be far from 0. We answer the question “How extreme is extreme?” using the critical-value method by finding a critical value of t, called tcrit. This threshold value tcrit separates the values of tdata for which we reject H0 (the critical region) from the values of tdata for which we will not reject H0 (the noncritical region). Because a different t curve exists for every different sample size, you need to know the following to find the value of tcrit: (a) the form of the hypothesis test (right-tailed, left-tailed, or two-tailed), (b) the degrees of freedom (df=n-1), and (c) the level of significance α.

The degrees of freedom is a measure of how the t distribution changes as the sample size changes.

t Test for the Population Mean μ: Critical-Value Method

When a random sample of size n is taken from a population, you can use the t test if either the population is normal or the sample size is large (n30).

  • Step 1 State the hypotheses.

    Use one of the forms from Table 8. State the meaning of μ.

  • Step 2 Find tcrit and state the rejection rule.

    Use Table D in the Appendix and Table 8.

  • Step 3 Calculate tdata.

    tdata=ˉx-μ0s/n

  • Step 4 State the conclusion and the interpretation.

    If tdata falls within the critical region, then reject H0. Otherwise, do not reject H0. Interpret your conclusion so that a nonspecialist can understand.

Page 526
Table 9.21: Table 8 Critical regions and rejection rules for various forms of the t test for μ
Form of test Right-tailed test Left-tailed test Two - tailed test
Hypotheses H0:μ=μ0

Ha:μ>μ0


level of significance α
H0:μ=μ0

Ha:μ<μ0


level of significanc α
H0:μ=μ0

Ha:μμ0


level of significance α
Critical region image
Rejection rule Reject H0 if tdatatcrit Reject H0 if tdata-tcrit Reject H0 if tdatatcrit or

tdata-tcrit

EXAMPLE 18 t Test for μ using critical-value method: Left-tailed test

We are interested in testing, using level of significance α=0.05, whether the mean age at onset of anorexia nervosa in young women has been decreasing. Assume that the previous mean age at onset was 15 years old. Data were gathered for a study of the onset age for this disorder.6 From these data, a random sample (shown here) was taken of n=20 young women who were admitted under this diagnosis to the Toronto Hospital for Sick Children. The Minitab descriptive statistics shown here indicate a sample mean age of ˉx=14.250 years and a sample standard deviation of s=1.512 years. If appropriate, perform the t test.

image
Descriptive statistics for anorexia data.
14.50 15.75 14.17 14.00
14.67 17.25 11.00 16.00
14.50 15.17 12.00 13.00
13.00 13.50 15.42 16.08
14.00 12.58 13.50 14.92
Table 9.22: Age at onset of anorexia

Solution

The sample size n=20 is not large, so we need to verify normality. The normal probability plot of the ages at onset in Figure 18 indicates that the ages in the sample are normally distributed. We may proceed to perform the t test for the mean.

  • Step 1 State the hypotheses.

    The key word “decreasing” guides us to state our hypotheses as follows:

    H0:μ=15versusHa:μ<15

    where μ refers to the population mean age at onset.

  • Step 2 Find tcrit and state the rejection rule.

    Our hypotheses from Step 1 indicate that we have a left-tailed test, meaning that the critical region represents an area in the left tail (see Figure 20, page 528). To find tcrit, we turn to the t table, an excerpt of which is shown in Figure 19. Because we have a one-tailed test, under “Area in one tail,” select the column with our α value 0.05. Then choose the row with our df=n-1=20-1=19, so that we get tcrit=1.729. Because we have a left-tailed test, the rejection rule from Table 8 is “Reject H0 if tdata-tcrit ”; that is, we will reject H0 if tdata-1.729.

    Page 527
    FIGURE 18 Normal probability plot for age at onset of anorexia nervosa.
    image
    FIGURE 19 Finding tcrit for a one-tailed test. For a two-tailed test, use “Area in two tails.”
    image
  • Step 3 Calculate tdata.

    We have n=20, ˉx=14.250, and s=1.512 years. Also, μ0=15, because this is the hypothesized value of μ stated in H0. Therefore, our test statistic is

    tdata=ˉx-μ0s/n=14.250-151.512/20-2.2183

  • Step 4 State the conclusion and interpretation.

    The rejection rule from Step 2 says to reject H0 if tdata-1.729. From Step 3, we have tdata=-2.2183. Because −2.2183 is less than −1.729, our conclusion is to reject H0. If you prefer the graphical approach, consider Figure 20, which shows where tdata falls in relation to the critical region. Because tdata=-2.2183 falls within the critical region, our conclusion is to reject H0. There is evidence at level of significance α=0.05 that the population mean age of onset has decreased from its previous level of 15 years.

    Page 528
    image
    FIGURE 20 Our tdata=-2.2183 falls in the critical region.

NOW YOU CAN DO

Exercises 3–8.

YOUR TURN#8

  1. For the data in Example 18, test, using level of significance α=0.01, whether the mean age at onset of anorexia nervosa in young women has been decreasing.
  2. Discuss two possible resolutions to the contradiction between the conclusions in Example 18 and Part (a).

(The solutions are shown in Appendix A.)

EXAMPLE 19 t Test for μ using critical-value method: two-tailed test

CNN reported in 2014 that, even after factoring in part-time jobs, Americans worked an average of 38 hours per week. Suppose a social science researcher disputes this finding and is interested in testing whether the population mean number of hours worked per week differs from 38. A random sample of n=30 working Americans yields a sample mean of ˉx=35.26 hours worked, with a sample standard deviation of s=10 hours. If the conditions are met, perform the appropriate hypothesis test using level of significance α=0.10.

Solution

Because n=3030, we may proceed with the t test.

  • Step 1 State the hypotheses.

    The key words “differs from” indicate a two-tailed test, with μ0=38, because we are testing whether μ differs from 38. So our hypotheses are

    H0:μ=38versusHa:μ38

    where μ represents the population mean number of hours worked per week.

    image
    FIGURE 21 Critical region for two-tailed test.
  • Step 2 Find tcrit and state the rejection rule.

    To find tcrit for a two-tailed test with level of significance α=0.10, we look in the 0.10 column in the “Area in two tails” section of Table D in the Appendix. The degrees of freedom df=n-1=29 gives us tcrit=1.699. From Table 8, the rejection rule is: “Reject H0 if tdata1.699 or tdata-1.699.”

    Page 529
  • Step 3 Calculate tdata:

    tdata=ˉx-μ0s/n=35.26-3810/30-1.5

  • Step 4 State the conclusion and the interpretation.

    tdata=-1.5 is not ≥1.699 and it is not ≤−1.699; therefore, we do not reject H0. See Figure 21. There is insufficient evidence, at level of significance α=0.10, that the population mean number of hours worked per week differs from 38.

NOW YOU CAN DO

Exercises 9–14.

2 t Test for μ Using the p-Value Method

We may also use the p-value method for performing the t test for μ. The critical-value method and the p-value are equivalent, so they will provide identical conclusions.

t Test for the Population Mean μ: p-Value Method

When a random sample of size n is taken from a population, you can use the t test if either the population is normal or the sample size is large (n30).

  • Step 1 State the hypotheses and the rejection rule.

    Use one of the forms from Table 9. State the meaning of μ. The rejection rule is “Reject H0 if the p-valueα.”

  • Step 2 Calculate tdata.

    tdata=ˉx-μ0s/n

  • Step 3 Find the p-value.

    Either use technology to find the p-value or estimate the p-value using Table D, t Distribution, in the Appendix.

  • Step 4 State the conclusion and the interpretation.

    If the p-valueα, then reject H0. Otherwise, do not reject H0. Interpret your conclusion.

The definition of a p-value for a t test is similar to the p-value for a Z test. Unusual and extreme values of ˉx, and therefore of tdata, will have a small p-value, whereas values of ˉx and tdata nearer to the center of the distribution will have a large p-value. Table 9 summarizes the definition of the p-value for t tests. Note that we will not be finding these p-values manually but will either (a) use a computer or calculator or (b) estimate them using the t table.

Table 9.23: Table 9 p-Values for t tests
Form of test Right-tailed test Left-tailed test Two-tailed test
Hypotheses H0:μ=μ0Ha:μ>μ0level of significance α H0:μ=μ0Ha:μ<μ0level of significance α H0:μ=μ0Ha:μμ0level of significance α
p-Value = is tail area
associated with tdata
p-value=P(t>tdata)Area to the right of tdata p-value=P(t<tdata)Area to the left of tdata p-value=P(t>|tdata|)+P(t<-|tdata|)=2·P(t>|tdata|)Sum of the two tail areas
image
Page 530

EXAMPLE 20 t Test using the p-value method: Right-tailed test

milkprice

The U.S. Bureau of Labor Statistics reports that the mean price for a gallon of milk in July 2014 was $3.65. Gallons of milk were recently bought in a random sample of n=10 different cities, with the prices shown in the accompanying table. Test, using level of significance α=0.05, whether the population mean price for a gallon of milk is greater than $3.65.

Solution

We first check whether the conditions for performing the t test are met. Because our sample size is small, we must check for normality. The normal probability plot in Figure 22 shows acceptable normality, allowing us to proceed with the t test.

FIGURE 22 Normal probability plot of milk prices.
image
  • Step 1 State the hypotheses and the rejection rule.

    The key words “is greater than” means that we have a right-tailed test. Answering the question “Greater than what?” gives us μ0=3.65.

    H0:μ=3.65versusHa:μ>3.65

    where μ represents the population mean price of milk. We will reject H0 if the p-valueα=0.10.

  • Step 2 Calculate tdata.

    We use the instructions from the Step-by-Step Technology Guide on page 537. Figure 23 shows the TI-83/84 results from the t test for μ.

    FIGURE 23 TI-83/84 results for right-tailed t test.
    image

    For a more accurate calculation of the p-value, we retain 9 decimal places for the value of tdata.

    Using the statistics from Figure 23, we have the test statistic

    tdata=ˉx-μ0s/n=3.714-3.650.0873307888/10=2.3174618382.3175

  • Step 3 Find the p-value.

    From Figures 23 and 24, we have

    p-value 

    Page 531
    FIGURE 24 The p-value for a right-tailed t test.
    image
  • Step 4 State the conclusion and the interpretation.

    The p-value0.0228 is less than the level of significance α=0.05; therefore, reject H0. There is evidence, at level of significance α=0.05, that the population mean price of milk is greater than $3.65.

NOW YOU CAN DO

Exercises 15–20.

YOUR TURN#9

In Example 20, suppose the level of significance was α=0.01. Describe how this would affect the following, if at all:

  1. Hypotheses
  2. tdata
  3. p-value
  4. Conclusion

(The solutions are given in Appendix A.)

EXAMPLE 21 t Test using the p-value method: Two-tailed test

The Golden Ratio

image
FIGURE 25a The golden ratio.
image
FIGURE 25b The Parthenon follows the golden ratio.

Euclid's Elements, the Parthenon, the Mona Lisa, and the beadwork of the Shoshone tribe all have in common an appreciation for the golden ratio.

Suppose we have two quantities A and B, with A>B>0. Then A/B is called the golden ratio if

A+BA=AB

that is, if the ratio of the sum of the quantities to the larger quantity equals the ratio of the larger to the smaller (see Figure 25a).

Euclid wrote about the golden ratio in his Elements, calling it the “extreme and mean ratio.” The ratio of the width A and height B of the Parthenon, one of the most famous temples in ancient Greece, equals the golden ratio (Figure 25b). If you enclose the face of Leonardo da Vinci's Mona Lisa in a rectangle, the resulting ratio of the long side to the short side follows the golden ratio (Figure 26). The golden ratio has a value of approximately 1.618.

Now, we will test whether evidence exists for the use of the golden ratio in the artistic traditions of the Shoshone, a Native American tribe from the American West. Figure 27 shows a detail of a nineteenth-century Shoshone beaded dress that belonged to Nahtoma, the daughter of Chief Washakie of the Eastern Shoshone.7 It is intriguing to consider whether the Shoshone beaded rectangles, such as those on this dress, follow the golden ratio. Table 10 contains the ratios of lengths to widths of 18 beaded rectangles made by Shoshone artisans.8 We will perform a hypothesis test to determine whether the population mean ratio of Shoshone beaded rectangles equals the golden ratio of 1.618.

Page 532
image
FIGURE 26 Mona Lisa's face follows the golden ratio.
image
FIGURE 27 Beaded dress of Nahtoma, daughter of Chief Washakie of the Eastern Shoshone, showing rectangles that may follow the golden ratio.

shoshone

Table 9.24: Table 10 Ratio of length to width of a sample of Shoshone beaded rectangles
1.44 1.75 1.64 1.66 1.64
1.51 1.34 1.53 1.74 1.81
1.45 1.49 1.63 1.49
1.65 1.59 1.50 1.65

Solution

The population standard deviation for such rectangles is unknown, so we must use a t test instead of a Z test. Our sample size n=18 is not large, so we must assess whether the data are normally distributed. Figure 28 shows the normal probability plot, indicating acceptable support for the normality assumption. We proceed with the t test, using level of significance α=0.05.

FIGURE 28 Normal probability plot.
image

We use the TI-83/84 and CrunchIt! to perform this hypothesis test, using the Step-by-Step Technology Guide at the end of this section.

  • Step 1 State the hypotheses and the rejection rule.

    We are interested in whether the population mean length-to-width ratio of Shoshone beaded rectangles equals the golden ratio of 1.618, so we perform a two-tailed test:

    H0:μ=1.618versusHa:μ1.618

    where μ represents the population mean length-to-width ratio of Shoshone beaded rectangles. We will reject H0 if the p-value0.05.

    Page 533
  • Step 2 Find tdata.

    Using the statistics from Figure 29a, we have the test statistic

    tdata=x¯-μ0s/n=1.583888889-1.6180.1230083158/18-1.176545482-1.1765

  • Step 3 Find the p-value.

    From Figures 29a, 29b, and 30, we have

    p-value=P(t>|-1.1765|)+P(t<-|-1.1765|)0.2556

  • Step 4 State the conclusion and interpretation.

    Because p-value0.2556 is notα=0.05, we do not reject H0. Thus, there is insufficient evidence, at level of significance α=0.05, that the population mean ratio differs from 1.618. In other words, the data do not reject the claim that Shoshone beaded rectangles follow the same golden ratio exhibited by the Parthenon and the Mona Lisa.

    image
    FIGURE 29a TI-83/84 results.
    image
    FIGURE 29b CrunchIt! results.
    image
    FIGURE 30 p-Value for two-tailed t test is sum of two tail areas.

NOW YOU CAN DO

Exercises 21–26.

EXAMPLE 22 Estimating the p-value using the t table

Suppose we did not have access to technology. Estimate the p-value from Example 19 using the t table (Appendix Table D). For Example 19, our hypotheses are

H0:μ=38versusHa:μ38

where μ represents the population mean number of hours worked per week. Our test statistic is tdata=-1.5.

Solution

For a two-tailed test, choose the row of the t table with the heading “Area in two tails.” Then select the row in the table with the appropriate degrees of freedom, in this case df=n-1=30-1=29. Note the t-values in this row: 1.311, 1.699, 2.045, 2.462, and 2.756. Think of these values as existing on a horizontal number line. We want to place our tdata=-1.5 somewhere on this number line, but all the t-values in the table are positive. Fortunately, because of the symmetry of the t distribution about zero, we may take |tdata|=1.5. Now, where would |tdata|=1.5 fit on this “number line”? Between 1.311 and 1.699, as indicated in Figure 31, an excerpt from the t table. Therefore, we may estimate the p-value to be between 0.20 and 0.10. In fact, the actual p-value for this problem is about 0.144 (see Figure 32), so that our estimate is confirmed.

Page 534
image
FIGURE 31 Estimating the p-value using the t table.
image
FIGURE 32 Actual p-value confirms the estimate.

NOW YOU CAN DO

Exercises 27–30.

YOUR TURN#10

Estimate the p-value for the hypothesis test in Example 18.

(The solution is shown in Appendix A.)

3 Using Confidence Intervals to Perform Two-Tailed t tests

Just as we did for two-tailed Z tests in Section 9.3, we may use a 100(1-α)%t confidence interval to perform a two-tailed t test with level of significance α for various hypothesized values of μ0. The strategy is the same: if a certain value for μ0 lies outside the 100(1-α)%t confidence interval for μ, then the null hypothesis specifying this value for μ0 would be rejected. Otherwise, it would not be rejected.

EXAMPLE 23 Using a confidence interval to perform two-tailed t tests

In Example 14 of Chapter 8 (page 452), we found the 99% t-confidence interval for μ, the population mean sodium content per serving of all breakfast cereals, to be the following:

Lower bound=144.1 gramsUpper bound=227.7 grams

Test, using level of significance α=0.01, whether the population mean amount of sodium differs from the following values: (a) 100 grams, (b) 170 grams, (c) 250 grams.

Solution

The key words “differs from” mean that we are using two-tailed tests. Then, for each hypothesized value of μ0, we determine whether it falls inside or outside the given confidence interval.

  1. H0:μ=100versusHa:μ100

    The confidence interval is (144.1, 227.7), and because μ0=100 lies outside the interval (see Figure 33), we reject H0.

    Page 535
  2. H0:μ=170versusHa:μ170

    μ0=170 lies inside the interval, so we do not reject H0.

  3. H0:μ=250versusHa:μ250

    μ0=250 lies outside the interval, so we reject H0.

    image
    FIGURE 33 Reject H0 for values of μ0 that lie outside (144.1, 227.7).

NOW YOU CAN DO

Exercises 31–36.

In Section 9.3, we showed how to perform and interpret Z tests for μ using the TI-83/84 and Minitab software output. Here, in Section 9.4, we demonstrate how to perform and interpret t tests for μ using software output from SPSS and JMP.

EXAMPLE 24 Interpreting software output

Each of (a) and (b) represent software output from a t test for μ. For each, examine the indicated software output, and provide the following steps:

  • Step 1 State the hypotheses and the rejection rule.
  • Step 2 Find tdata.
  • Step 3 Find the p-value.
  • Step 4 State the conclusion and the interpretation.

Use level of significance α=0.10 for each hypothesis test.

  1. SPSS output for a t test for μ, where μ represents the population mean number of orchard farms per county, nationwide.

    image
  2. JMP output for a t test for μ, where μ represents the population mean number of grocery stores per county, nationwide.

    image
Page 536

Solution

  1. Interpreting the SPSS output.
    • Step 1 State the hypotheses and the rejection rule.

      In the SPSS output, the “Test Value=40” indicates that μ0=40. Also, the “2-tailed” in the output indicates that we have a two-tailed test. Thus, our hypotheses are:

      H0:μ=40versusHa:μ40

      where μ represents the population mean number of orchard farms per county, nationwide. We will reject H0 if the p-value is less than level of significance α=0.10.

    • Step 2 Find tdata.

      Under the “t” in the SPSS is the value for tdata,-1.079.

    • Step 3 Find the p-value.

      The abbreviation “Sig.” stands for “Significance,” which represents the p-value: 0.281.

    • Step 4 State the conclusion and the interpretation.

      The p-value of 0.281 is not less than the level of significance α=0.10, so we do not reject H0. There is insufficient evidence that the population mean number of orchard farms per county differs from 40.

  2. Interpreting the JMP output.
    • Step 1 State the hypotheses and the rejection rule.

      In the JMP output, the “Hypothesized Value” indicates that μ0=20. Now, JMP is unusual in that it performs all three types of hypothesis test simultaneously: two-tailed, right-tailed, and left-tailed, as shown in the JMP output. It does not specify a particular form of the test. Let us use the right-tailed test for this example. Thus, our hypotheses are:

      H0:μ=20vsHa:μ>20

      where μ represents the population mean number of grocery stores per county, nationwide. We will reject H0 if the p-value is less than level of significance α=0.10.

    • Step 2 Find tdata.

      Next to “Test Statistic” in the JMP output, we find the value of our test statistic, tdata, 0.3324.

    • Step 3 Find the p-value.

      Here, we need to be careful, because JMP gives us three different p-values, depending on which form of the hypothesis test is performed. We chose the right-tailed test, so our p-value is next to Prob>t”:p-value=0.3698, as indicated in the JMP output.

    • Step 4 State the conclusion and the interpretation.

      The p-value of 0.3698 is not less than the level of significance α=0.10, so we do not reject H0. There is insufficient evidence that the population mean number of grocery stores per county is greater than 20.

NOW YOU CAN DO

Exercises 37–40.

[Leave] [Close]