10.2 Inference for Two Independent Means

OBJECTIVES By the end of this section, I will be able to …

  1. Perform and interpret tests about using Welch's method.5
  2. Compute and interpret intervals for using Welch's method.
  3. Use confidence intervals for to perform two-tailed tests about .
  4. Perform and interpret tests and intervals about using the pooled variance method.
  5. Apply tests and intervals for when and are known.

1 Independent Sample test for

On page 178 in Chapter 3, we used boxplots to find evidence of a difference between male and female body temperature for a sample of 65 women and a sample of 65 men.6 The summary statistics are shown in Table 7.

Table 10.19: Table 7 Summary statistics for female versus male body temperatures in °F
Gender Sample
size
Sample mean
body temperature
Sample standard
deviation
Population mean
body temperature
Females (sample 1)
Males (sample 2)

However, because the female subjects did not determine the male subjects, and vice versa, the 65 women and 65 men represent independent samples, so we cannot use the dependent sampling methods we learned in Section 10.1.

Note that for independent samples, we have two sample sizes, and ; two sample means, and ; two sample standard deviations, and ; and two unknown population means, and . We are interested in the difference in the population means, so we consider the quantity

590

Developing Your Statistical Sense

The Difference Difference

A difference in interpretation exists between the quantity and the quantity from Section 10.1. Here, refers to the difference in population means, whereas represents the population mean of the paired differences.

In previous chapters, we used the statistic to learn about the parameter . Here, we will use the statistic to perform inference about the parameter , whose value is unknown. Note from Table 7 that the value of for these samples is

We use as a point estimate of . If we repeat the experiment an infinite number of times, then the values of will form a distribution called the sampling distribution of .

It is unlikely that the experimenter will have knowledge of both population standard deviations and . Therefore, we use the estimates of and provided by the sample standard deviations and . Recall from Section 8.2 that, when the population standard deviation σ is unknown, and if either the population is normally distributed or the sample size is large, the quantity

has a distribution with degrees of freedom. By analogy, we have the following sampling distribution.

Sampling Distribution of

When random samples are drawn independently from two populations with population means and , and either (a) the two populations are normally distributed, or (b) the two sample sizes are large (at least 30), then the quantity

approximately follows a distribution with degrees of freedom equal to the smaller of and , where and represent the mean and standard deviation of the sample taken from population 1, and and represent the mean and standard deviation of the sample taken from population 2.

This statistic is called Welch's approximate t, after the twentieth-century English statistician Bernard Lewis Welch. Although there are other distributions that statisticians use to estimate the difference between two population means, we use this approximation because it is conservative and easy to calculate.

Researchers are often interested in testing whether the mean of one population is greater than, less than, or different from the mean of another population. Thus, we next learn how to perform hypothesis tests for the difference in population means . Usually, the most important hypothesized value for is 0. Consider the two-tailed hypothesis test

which is equivalent to

In practice, the hypothesized difference between the two population means is nearly always . Thus, the test statistic takes the following form:

591

Just as in Section 9.4, if is extreme, then it represents evidence against the null hypothesis. The hypothesis test may be performed using either the critical-value method or the -value method.

Welch's Hypothesis Test for the Difference in Two Population Means: Critical-Value Method

The hypothesis test applies whenever either

  1. Both populations are normally distributed, or
  2. Both samples are large, that is, and .
  • Step 1 State the hypotheses.

    Use one of the forms from Table 8. State the meaning of and .

  • Step 2 Find and state the rejection rule.

    To find , use the table and degrees of freedom the smaller of and . To find the rejection rule, use Table 8.

  • Step 3 calculate .


    which follows an approximate distribution with degrees of freedom the smaller of and .

  • Step 4 State the conclusion and the interpretation.

    Compare with .

Table 10.20: Table 8 Critical regions and rejection rules for test for
image

EXAMPLE 7 Test for : Critical-value method

Using Table 7, test whether women's population mean body temperature differs from that of men, using the critical-value method and .

Solution

Both sample sizes are large (), so we can perform the hypothesis test.

592

  • Step 1 State the hypotheses.

    The key words “differs from” indicate a two-tail test:

    where and represent the population mean body temperature for women and men, respectively.

  • Step 2 Find and state the rejection rule.

    The required degrees of freedom is the smaller of and , which is . Unfortunately, is not in the table in Appendix Table D, so we use the conservative . For , this gives . We have a two-tailed test, so Table 8 gives us the following rejection rule:

  • Step 3 Find .

  • Step 4 State the conclusion and the interpretation.

    The test statistic is greater than (see Figure 11). We therefore reject . There is evidence, at level of significance , that the population mean body temperatures are not the same for women and men.

    image
    Figure 10.11: FIGURE 11 falls within the critical region.

NOW YOU CAN DO

Exercises 3–6.

We may also use the -value method to perform the independent sample test for .

Welch's Hypothesis Test for the Difference in Two Population Means: -value Method

The hypothesis test applies whenever either

  1. Both populations are normally distributed, or
  2. Both samples are large, that is, and .
  • Step 1 State the hypotheses and the rejection rule.

    Use one of the forms from Table 9. State the meaning of and . The rejection rule is: Reject if the -value is .

  • Step 2 Calculate .

    which follows an approximate distribution with degrees of freedom the smaller of and .

  • Step 3 Find the -value.

    Use technology or estimate using the table.

  • Step 4 State the conclusion and the interpretation.

    Compare the -value with .

593

Table 10.21: Table 9 -Values for t test for
image

EXAMPLE 8 Test for using the -value method

Note: Our degrees of freedom, the smaller of and , is . However, the TI-83/84 output on the next page shows . Why does the technology use different degrees of freedom than we do? Recall that we are using Welch's approximation to the distribution. The TI-83/84, Excel, Minitab, and other technology calculate the degrees of freedom as follows:7

This provides a more accurate determination of the degrees of freedom than our method. However, our method is a conservative estimate that is easier to calculate, and it is recommended for hand calculations.

image Bank Loans

Here, we do some analysis on our Chapter 10 Case Study data set: Bank loans. When evaluating loan applicants for approval of a loan, banks examine several aspects of an applicant's financial history. Of particular importance is the applicant's credit score. Here, we will look for differences in the mean credit score between applicants who were approved and those who were denied. Independent samples of size 100 were taken from the Bank loans data set, from among those approved and those denied the loan. The descriptive statistics for each group are shown in Table 10. Use the TI-83/84 or Excel, the -value method, and level of significance to test whether the population mean credit score for successful loan applicants is greater than that for those who were denied the loan.

Table 10.22: Table 10 Summary statistics for credit scores of loan applicants who were approved and those who were denied the loan
Loan Status Sample size Sample mean
credit score
Sample standard
deviation
Population mean
credit score
Approved
(sample 1)
Denied
(sample 2)
Table 10.22: Source: Data Mining and Predictive Analytics, by Daniel Larose and Chantal Larose, Wiley, 2015.

Solution

Because both samples are large (), we may proceed with the test for .

594

  • Step 1 State the hypotheses and the rejection rule.

    The approved applicants represent sample 1, and we are interested in whether the mean credit score for the approved applicants is greater than that of the denied applicants. Thus, we have the following hypotheses:

    where and represent the population mean credit score for the approved and denied loan applicants, respectively. The rejection rule is to reject if -value ≤ 0.01.

  • Step 2 Find .

    We use the instructions provided in the Step-by-Step Technology Guide at the end of this section. From Figure 12,

    image
    Figure 10.12: FIGURE 12 TI-83/84 output.
  • Step 3 Find the -value.

    From Figure 12,

  • Step 4 State the conclusion and the interpretation.

    Our -value of essentially zero is smaller than the level of significance 0.01. Therefore, we reject . There is evidence that the population mean credit score for approved applicants is greater than that of the denied loan applicants. In fact, with a -value so close to zero, the evidence is very strong indeed.

Remember that when the TI calculator gives you output such as “E-17,” it is indicating that the decimal point needs to be moved 17 places to the left.

NOW YOU CAN DO

Exercises 7–10.

2 Confidence Intervals for

Recall from Section 8.2 that to estimate the unknown population mean , we can use a confidence interval:

where is the margin of error. By analogy, here the interval for takes the following form.

Welch's Confidence Interval for

For two independent random samples taken from two populations with population means and , a confidence interval for is given by

where , , and represent the mean, standard deviation, and sample size of the sample taken from population 1, , , and represent the mean, standard deviation, and sample size of the sample taken from population 2, and is associated with the confidence level and degrees of freedom of the smaller of and .

The interval applies whenever either of the following conditions is met:

  • Both populations are normally distributed, or
  • Both sample sizes are large.

Margin of Error

The margin of error for a confidence interval for is given by

595

Thus, the confidence interval for takes the form .

image

This is a confidence interval for the difference in two population means, which is not the same as in Section 10.1, which was for the population mean of the differences of matched pairs. Here, we calculate the means of the samples and then compute the difference. In Section 10.1, we calculated the differences of sample values first and then computed the mean of these differences.

EXAMPLE 9 Confidence interval for

image Bank Loans

Find a 90% confidence interval for the difference in population mean credit scores for those approved and those denied bank loans, using the data in Table 10.

Solution

Both sample sizes are large (), so we may construct the interval. For , the required degrees of freedom is the smaller of and , which is . Because is not listed in the table, we use the next lower value listed as a conservative alternative: . For 90% confidence, then .

The margin of error is

The 90% confidence interval is then

We are 90% confident that the difference in population mean credit scores lies between 60.066 and 85.934. Because 0 is not contained in this interval, we may conclude that , just as we did in Example 8.

NOW YOU CAN DO

Exercises 11–16.

3 Using Confidence Intervals to Perform Hypothesis Tests

As in earlier sections, we may use a confidence interval for to perform two-tailed tests about .

Equivalence of a Two-Tailed Test About and a Confidence Interval for

  • If a certain value for lies outside the corresponding confidence interval for , then the null hypothesis specifying this value would be rejected for level of significance .
  • Alternatively, if a certain value for lies inside the confidence interval for , then the null hypothesis specifying this value would not be rejected for level of significance .

EXAMPLE 10 Using a confidence interval to perform a two-tailed test about

image Bank Loans

Use the 90% confidence interval for the difference in population mean credit scores from Example 9 to test, using level of significance , whether differs from the following values:

  1. 60
  2. 70

596

Solution

The hypotheses for our two-sample test look like this:

which is equivalent to the following, if we subtract from each side of the equations:

  1. Here, we replace zero with the value of 60 hypothesized for the difference in population means, obtaining

    The value of 60 lies outside the confidence interval (60.066, 85.934) from Example 9; therefore, we reject .

  2. Our hypotheses are

    The value of 70 lies inside the confidence interval (60.066, 85.934); therefore, we do not reject .

NOW YOU CAN DO

Exercises 17–20.

4 Inference for Using Pooled Variance

Recall that the variance equals the square of the standard deviation.

An alternative method for inference may be applied when the data analyst has reason to believe that , that is, the variances of the two populations are equal. A pooled estimate of the common variance is used.

Pooled estimate of the Common Variance

Some statisticians think that the pooled variance method should be used sparingly.8

The conditions for performing inference using pooled variance are the same as for Welch's method (page 591), with the additional condition that . The test statistic for the pooled variance test is then given by

We illustrate the pooled variance test and the pooled variance confidence interval using the following two examples.

EXAMPLE 11 Pooled variance test

image Bank Loans

Another indicator of financial health that banks look at when evaluating loan applicants is the debt-to-income ratio, which is defined as follows.

For example, if you owed $5000 on your credit card and $10,000 on your car, and your annual income was $50,000, then your debt-to-income ratio would be .

597

Here, we examine whether the mean debt-to-income ratio differed between those approved and those denied the bank loans. Samples of size 100 were obtained from each group, with the summary statistics shown in Table 11.

Table 10.23: Table 11 Summary statistics for debt-to-income ratios of loan applicants who were approved and those who were denied the loan
Loan status Sample size Sample mean
debt-to-income
ratio
Sample standard
deviation
Population mean
debt-to-income
ratio
Approved
(sample 1)
Denied
(sample 2)

Use the critical-value method for the pooled variance test to test whether the population mean debt-to-income ratio for those approved is less than that of those who were not approved for the loan. Assume and use level of significance .

Solution

  • Step 1 State the hypotheses.

    where and represent the population mean debt-to-income ratio for those approved and those denied the loan, respectively.

  • Step 2 Find .

    The degrees of freedom for the pooled variance test equals . Because is not in the table, we use the next lower value instead, obtaining the critical value . Reject if .

  • Step 3 Calculate and .

    Plugging this value into the following formula for the test statistic, we obtain

  • Step 4 Conclusion and interpretation.

    The test statistic is less than the critical value . Therefore, we reject . There is evidence that the population mean debt-to-income ratio for those whose loan was approved is less than that for those whose loan was not approved.

NOW YOU CAN DO

Exercises 21 and 22.

The pooled variance method may also be used to construct a confidence interval for .

598

EXAMPLE 12 Pooled variance confidence interval for

image Bank Loans

Use the data from Example 11 to construct a 95% confidence interval for the difference in population mean debt-to-income ratio. Use the pooled variance method.

Solution

The confidence interval for using the pooled variance method is given by the following formula:

where is found using degrees of freedom. Similar to Example 11, we use because is not in the table. So, we have . Thus, our 95% confidence interval is:

We are 95% confident that the difference in population mean debt-to-income ratios, lies between −0.09 and −0.01.

NOW YOU CAN DO

Exercises 23 and 24.

5 Inference for When and Are Known

Do not use inference for unless both and are known.

When the population standard deviations and are known, the data analyst may prefer to use inference for because the margin of error for inference is smaller than for inference. The conditions for performing inference for are similar to Welch's method (page 591), with the additional condition that and are known. We illustrate the two-sample test and the confidence interval for using the following two examples.

EXAMPLE 13 Two-sample test

A Kaiser Family Foundation report found that the mean amount of time that young people ages 8–18 spend talking on their cell phones is minutes per day, whereas the mean amount of time spent watching TV shows on their cell phones is minutes per day.9 Assume that the sample sizes are and , and that the population standard deviations are known to be minutes and minutes. Test, using the critical-value method and level of significance , whether the population mean amount of time young people spending talking on their cell phones is less than the population mean amount of time they spend watching TV shows on their cell phones.

Solution

  • Step 1 State the hypotheses.

    where and represent the population mean amount of time young people spend talking and watching TV shows, respectively, on their cell phones.

    599

  • Step 2 Find .

    From Table 4 in Chapter 9 (page 500), we have . Reject if .

  • Step 3 Calculate .

    The test statistic for the test for takes the form

  • Step 4 Conclusion and interpretation.

    The test statistic is less than the critical value . Therefore, we reject . There is evidence that the population mean amount of time young people spending talking on their cell phones is less than the population mean amount of time they spend watching TV shows on their cell phones.

NOW YOU CAN DO

Exercises 25 and 26.

When and are known, we can also construct a confidence interval for .

EXAMPLE 14 confidence interval for

Use the data from Example 13 to construct a 95% confidence interval for the difference in population mean amount of time spent using cell phones.

Solution

The confidence interval for is as follows:

From Table 1 in Chapter 8 (page 432) we have . Thus, our 95% confidence interval is

We are 95% confident that the difference in population mean amounts of time spent by young people on cell phones talking and watching TV shows lies between 223.463 minutes and −8.537 minutes. In other words, we are 95% confident that young people spend between 8.537 minutes and 23.463 minutes longer watching TV shows on their cell phones rather than talking on their cell phones.

NOW YOU CAN DO

Exercises 27 and 28.