OBJECTIVES By the end of this section, I will be able to …
1 Independent Sample test for
On page 178 in Chapter 3, we used boxplots to find evidence of a difference between male and female body temperature for a sample of 65 women and a sample of 65 men.6 The summary statistics are shown in Table 7.
Gender | Sample size |
Sample mean body temperature |
Sample standard deviation |
Population mean body temperature |
---|---|---|---|---|
Females (sample 1) | ||||
Males (sample 2) |
However, because the female subjects did not determine the male subjects, and vice versa, the 65 women and 65 men represent independent samples, so we cannot use the dependent sampling methods we learned in Section 10.1.
Note that for independent samples, we have two sample sizes, and ; two sample means, and ; two sample standard deviations, and ; and two unknown population means, and . We are interested in the difference in the population means, so we consider the quantity
590
Developing Your Statistical Sense
The Difference Difference
A difference in interpretation exists between the quantity and the quantity from Section 10.1. Here, refers to the difference in population means, whereas represents the population mean of the paired differences.
In previous chapters, we used the statistic to learn about the parameter . Here, we will use the statistic to perform inference about the parameter , whose value is unknown. Note from Table 7 that the value of for these samples is
We use as a point estimate of . If we repeat the experiment an infinite number of times, then the values of will form a distribution called the sampling distribution of .
It is unlikely that the experimenter will have knowledge of both population standard deviations and . Therefore, we use the estimates of and provided by the sample standard deviations and . Recall from Section 8.2 that, when the population standard deviation σ is unknown, and if either the population is normally distributed or the sample size is large, the quantity
has a distribution with degrees of freedom. By analogy, we have the following sampling distribution.
Sampling Distribution of
When random samples are drawn independently from two populations with population means and , and either (a) the two populations are normally distributed, or (b) the two sample sizes are large (at least 30), then the quantity
approximately follows a distribution with degrees of freedom equal to the smaller of and , where and represent the mean and standard deviation of the sample taken from population 1, and and represent the mean and standard deviation of the sample taken from population 2.
This statistic is called Welch's approximate t, after the twentieth-century English statistician Bernard Lewis Welch. Although there are other distributions that statisticians use to estimate the difference between two population means, we use this approximation because it is conservative and easy to calculate.
Researchers are often interested in testing whether the mean of one population is greater than, less than, or different from the mean of another population. Thus, we next learn how to perform hypothesis tests for the difference in population means . Usually, the most important hypothesized value for is 0. Consider the two-tailed hypothesis test
which is equivalent to
In practice, the hypothesized difference between the two population means is nearly always . Thus, the test statistic takes the following form:
591
Just as in Section 9.4, if is extreme, then it represents evidence against the null hypothesis. The hypothesis test may be performed using either the critical-value method or the -value method.
Welch's Hypothesis Test for the Difference in Two Population Means: Critical-Value Method
The hypothesis test applies whenever either
Step 1 State the hypotheses.
Use one of the forms from Table 8. State the meaning of and .
Step 2 Find and state the rejection rule.
To find , use the table and degrees of freedom the smaller of and . To find the rejection rule, use Table 8.
Step 3 calculate .
which follows an approximate distribution with degrees of freedom the smaller of and .
Step 4 State the conclusion and the interpretation.
Compare with .
EXAMPLE 7 Test for : Critical-value method
Using Table 7, test whether women's population mean body temperature differs from that of men, using the critical-value method and .
Solution
Both sample sizes are large (), so we can perform the hypothesis test.
592
Step 1 State the hypotheses.
The key words “differs from” indicate a two-tail test:
where and represent the population mean body temperature for women and men, respectively.
Step 2 Find and state the rejection rule.
The required degrees of freedom is the smaller of and , which is . Unfortunately, is not in the table in Appendix Table D, so we use the conservative . For , this gives . We have a two-tailed test, so Table 8 gives us the following rejection rule:
Step 4 State the conclusion and the interpretation.
The test statistic is greater than (see Figure 11). We therefore reject . There is evidence, at level of significance , that the population mean body temperatures are not the same for women and men.
NOW YOU CAN DO
Exercises 3–6.
We may also use the -value method to perform the independent sample test for .
Welch's Hypothesis Test for the Difference in Two Population Means: -value Method
The hypothesis test applies whenever either
Step 1 State the hypotheses and the rejection rule.
Use one of the forms from Table 9. State the meaning of and . The rejection rule is: Reject if the -value is .
Step 2 Calculate .
which follows an approximate distribution with degrees of freedom the smaller of and .
Step 3 Find the -value.
Use technology or estimate using the table.
Step 4 State the conclusion and the interpretation.
Compare the -value with .
593
EXAMPLE 8 Test for using the -value method
bankloan_approved
bankloan_denied
Note: Our degrees of freedom, the smaller of and , is . However, the TI-83/84 output on the next page shows . Why does the technology use different degrees of freedom than we do? Recall that we are using Welch's approximation to the distribution. The TI-83/84, Excel, Minitab, and other technology calculate the degrees of freedom as follows:7
This provides a more accurate determination of the degrees of freedom than our method. However, our method is a conservative estimate that is easier to calculate, and it is recommended for hand calculations.
Bank Loans
Here, we do some analysis on our Chapter 10 Case Study data set: Bank loans. When evaluating loan applicants for approval of a loan, banks examine several aspects of an applicant's financial history. Of particular importance is the applicant's credit score. Here, we will look for differences in the mean credit score between applicants who were approved and those who were denied. Independent samples of size 100 were taken from the Bank loans data set, from among those approved and those denied the loan. The descriptive statistics for each group are shown in Table 10. Use the TI-83/84 or Excel, the -value method, and level of significance to test whether the population mean credit score for successful loan applicants is greater than that for those who were denied the loan.
Loan Status | Sample size | Sample mean credit score |
Sample standard deviation |
Population mean credit score |
---|---|---|---|---|
Approved (sample 1) |
||||
Denied (sample 2) |
Solution
Because both samples are large (), we may proceed with the test for .
594
Step 1 State the hypotheses and the rejection rule.
The approved applicants represent sample 1, and we are interested in whether the mean credit score for the approved applicants is greater than that of the denied applicants. Thus, we have the following hypotheses:
where and represent the population mean credit score for the approved and denied loan applicants, respectively. The rejection rule is to reject if -value ≤ 0.01.
Step 2 Find .
We use the instructions provided in the Step-by-Step Technology Guide at the end of this section. From Figure 12,
Step 3 Find the -value.
From Figure 12,
Step 4 State the conclusion and the interpretation.
Our -value of essentially zero is smaller than the level of significance 0.01. Therefore, we reject . There is evidence that the population mean credit score for approved applicants is greater than that of the denied loan applicants. In fact, with a -value so close to zero, the evidence is very strong indeed.
Remember that when the TI calculator gives you output such as “E-17,” it is indicating that the decimal point needs to be moved 17 places to the left.
NOW YOU CAN DO
Exercises 7–10.
2 Confidence Intervals for
Recall from Section 8.2 that to estimate the unknown population mean , we can use a confidence interval:
where is the margin of error. By analogy, here the interval for takes the following form.
Welch's Confidence Interval for
For two independent random samples taken from two populations with population means and , a confidence interval for is given by
where , , and represent the mean, standard deviation, and sample size of the sample taken from population 1, , , and represent the mean, standard deviation, and sample size of the sample taken from population 2, and is associated with the confidence level and degrees of freedom of the smaller of and .
The interval applies whenever either of the following conditions is met:
Margin of Error
The margin of error for a confidence interval for is given by
595
Thus, the confidence interval for takes the form .
This is a confidence interval for the difference in two population means, which is not the same as in Section 10.1, which was for the population mean of the differences of matched pairs. Here, we calculate the means of the samples and then compute the difference. In Section 10.1, we calculated the differences of sample values first and then computed the mean of these differences.
EXAMPLE 9 Confidence interval for
Bank Loans
Find a 90% confidence interval for the difference in population mean credit scores for those approved and those denied bank loans, using the data in Table 10.
Solution
Both sample sizes are large (), so we may construct the interval. For , the required degrees of freedom is the smaller of and , which is . Because is not listed in the table, we use the next lower value listed as a conservative alternative: . For 90% confidence, then .
The margin of error is
The 90% confidence interval is then
We are 90% confident that the difference in population mean credit scores lies between 60.066 and 85.934. Because 0 is not contained in this interval, we may conclude that , just as we did in Example 8.
NOW YOU CAN DO
Exercises 11–16.
3 Using Confidence Intervals to Perform Hypothesis Tests
As in earlier sections, we may use a confidence interval for to perform two-tailed tests about .
Equivalence of a Two-Tailed Test About and a Confidence Interval for
EXAMPLE 10 Using a confidence interval to perform a two-tailed test about
Bank Loans
Use the 90% confidence interval for the difference in population mean credit scores from Example 9 to test, using level of significance , whether differs from the following values:
596
Solution
The hypotheses for our two-sample test look like this:
which is equivalent to the following, if we subtract from each side of the equations:
Here, we replace zero with the value of 60 hypothesized for the difference in population means, obtaining
The value of 60 lies outside the confidence interval (60.066, 85.934) from Example 9; therefore, we reject .
Our hypotheses are
The value of 70 lies inside the confidence interval (60.066, 85.934); therefore, we do not reject .
NOW YOU CAN DO
Exercises 17–20.
4 Inference for Using Pooled Variance
Recall that the variance equals the square of the standard deviation.
An alternative method for inference may be applied when the data analyst has reason to believe that , that is, the variances of the two populations are equal. A pooled estimate of the common variance is used.
Pooled estimate of the Common Variance
Some statisticians think that the pooled variance method should be used sparingly.8
The conditions for performing inference using pooled variance are the same as for Welch's method (page 591), with the additional condition that . The test statistic for the pooled variance test is then given by
We illustrate the pooled variance test and the pooled variance confidence interval using the following two examples.
EXAMPLE 11 Pooled variance test
Bank Loans
Another indicator of financial health that banks look at when evaluating loan applicants is the debt-to-income ratio, which is defined as follows.
For example, if you owed $5000 on your credit card and $10,000 on your car, and your annual income was $50,000, then your debt-to-income ratio would be .
597
Here, we examine whether the mean debt-to-income ratio differed between those approved and those denied the bank loans. Samples of size 100 were obtained from each group, with the summary statistics shown in Table 11.
Loan status | Sample size | Sample mean debt-to-income ratio |
Sample standard deviation |
Population mean debt-to-income ratio |
---|---|---|---|---|
Approved (sample 1) |
||||
Denied (sample 2) |
Use the critical-value method for the pooled variance test to test whether the population mean debt-to-income ratio for those approved is less than that of those who were not approved for the loan. Assume and use level of significance .
Solution
Step 1 State the hypotheses.
where and represent the population mean debt-to-income ratio for those approved and those denied the loan, respectively.
Step 2 Find .
The degrees of freedom for the pooled variance test equals . Because is not in the table, we use the next lower value instead, obtaining the critical value . Reject if .
Step 3 Calculate and .
Plugging this value into the following formula for the test statistic, we obtain
Step 4 Conclusion and interpretation.
The test statistic is less than the critical value . Therefore, we reject . There is evidence that the population mean debt-to-income ratio for those whose loan was approved is less than that for those whose loan was not approved.
NOW YOU CAN DO
Exercises 21 and 22.
The pooled variance method may also be used to construct a confidence interval for .
598
EXAMPLE 12 Pooled variance confidence interval for
Bank Loans
Use the data from Example 11 to construct a 95% confidence interval for the difference in population mean debt-to-income ratio. Use the pooled variance method.
Solution
The confidence interval for using the pooled variance method is given by the following formula:
where is found using degrees of freedom. Similar to Example 11, we use because is not in the table. So, we have . Thus, our 95% confidence interval is:
We are 95% confident that the difference in population mean debt-to-income ratios, lies between −0.09 and −0.01.
NOW YOU CAN DO
Exercises 23 and 24.
5 Inference for When and Are Known
Do not use inference for unless both and are known.
When the population standard deviations and are known, the data analyst may prefer to use inference for because the margin of error for inference is smaller than for inference. The conditions for performing inference for are similar to Welch's method (page 591), with the additional condition that and are known. We illustrate the two-sample test and the confidence interval for using the following two examples.
EXAMPLE 13 Two-sample test
A Kaiser Family Foundation report found that the mean amount of time that young people ages 8–18 spend talking on their cell phones is minutes per day, whereas the mean amount of time spent watching TV shows on their cell phones is minutes per day.9 Assume that the sample sizes are and , and that the population standard deviations are known to be minutes and minutes. Test, using the critical-value method and level of significance , whether the population mean amount of time young people spending talking on their cell phones is less than the population mean amount of time they spend watching TV shows on their cell phones.
Solution
Step 1 State the hypotheses.
where and represent the population mean amount of time young people spend talking and watching TV shows, respectively, on their cell phones.
599
Step 2 Find .
From Table 4 in Chapter 9 (page 500), we have . Reject if .
Step 3 Calculate .
The test statistic for the test for takes the form
Step 4 Conclusion and interpretation.
The test statistic is less than the critical value . Therefore, we reject . There is evidence that the population mean amount of time young people spending talking on their cell phones is less than the population mean amount of time they spend watching TV shows on their cell phones.
NOW YOU CAN DO
Exercises 25 and 26.
When and are known, we can also construct a confidence interval for .
EXAMPLE 14 confidence interval for
Use the data from Example 13 to construct a 95% confidence interval for the difference in population mean amount of time spent using cell phones.
Solution
The confidence interval for is as follows:
From Table 1 in Chapter 8 (page 432) we have . Thus, our 95% confidence interval is
We are 95% confident that the difference in population mean amounts of time spent by young people on cell phones talking and watching TV shows lies between 223.463 minutes and −8.537 minutes. In other words, we are 95% confident that young people spend between 8.537 minutes and 23.463 minutes longer watching TV shows on their cell phones rather than talking on their cell phones.
NOW YOU CAN DO
Exercises 27 and 28.