602
CLARIFYING THE CONCEPTS
1. What are the conditions that permit us to perform Welch's two-sample test? (p. 592)
10.2.1
The two populations are normally distributed. The sample sizes are large (at least 30).
2. If a confidence interval for contains 0, then with level of significance , what is our conclusion regarding the hypothesis that there is no difference in the population means? (p. 596)
PRACTICING THE TECHNIQUES
CHECK IT OUT!
To do | Check out | Topic |
---|---|---|
Exercises 3–6 | Example 7 |
test for : critical-value method |
Exercises 7–10 | Example 8 |
test for : -value method |
Exercises 11–16 | Example 9 |
confidence interval for |
Exercises 17–20 | Example 10 | Equivalence between confidence intervals and tests for |
Exercises 21–22 | Example 11 | Pooled variance test for |
Exercises 23–24 | Example 12 | Pooled variance confidence interval for |
Exercises 25–26 | Example 13 | test for |
Exercises 27–28 | Example 14 |
confidence interval for |
For Exercises 3–6, perform the indicated Welch's hypothesis test using the critical-value method. The summary statistics were taken from random samples that were drawn independently. For each exercise, follow these steps:
3. Test, at level of significance , whether .
Sample 1 | |||
---|---|---|---|
Sample 2 |
10.2.3
(a) vs. (b) . Reject if or . (c) . (d) Since , we reject . There is evidence that the population mean for Population 1 is different from the population mean for Population 2.
4. Test, at level of significance , whether .
Sample 1 | |||
Sample 2 |
5. Test, at level of significance , whether .
Sample 1 | |||
Sample 2 |
10.2.5
(a) vs. (b) . Reject if . (c) . (d) Since is , we reject . There is evidence at the level of significance that the population mean of Population 1 is less than the population mean of Population 2.
6. Test, at level of significance , whether .
Sample 1 | |||
Sample 2 |
For Exercises 7–10, perform the indicated Welch's hypothesis test using the -value method. The summary statistics were taken from random samples that were drawn independently. For each exercise follow these steps:
7. Test, at level of significance , whether .
Sample 1 | |||
Sample 2 |
10.2.7
(a) Reject if the ≤ 0.10.
(b) . (c) . (d) Since the is ≤ 0.10, we reject . There is evidence at the level of significance that the population mean of Population 1 is different from the population mean of Population 2.
8. Test, at level of significance , whether > .
Sample 1 | |||
Sample 2 |
9. Test, at level of significance , whether .
Sample 1 | |||
Sample 2 |
10.2.9
(a) . Reject if the ≤ 0.05.
(b) . (c) . (d) Since the is ≤ 0.05, we reject . There is evidence at the level of significance that the population mean of Population 1 is less than the population mean of Population 2.
10. Test, at level of significance , whether .
Sample 1 | |||
Sample 2 |
For Exercises 11–16, do the following for the designated data:
11. Data in Exercise 3, confidence level = 90%
10.2.11
(a) (b) . We can estimate the difference in the population means of Population 1 and Population 2 to within 0.797 with 90% confidence. (c) (1.203, 2.797). We are 90% confident that the difference in the population means of Population 1 and Population 2 lies between 1.203 and 2.797.
12. Data in Exercise 4, confidence level = 95%
13. Data in Exercise 5, confidence level = 99%
10.2.13
(a) (b) . We can estimate the difference in the population means of Population 1 and Population 2 to within 5.326 with 99% confidence. (c) (−15.326, −4.674). We are 99% confident that the difference in the population means of Population 1 and Population 2 lies between 215.326 and 24.674.
14. Data in Exercise 6, confidence level = 95%
15. Data in Exercise 7, confidence level = 95%
10.2.15
(a) (b) . We can estimate the difference in the population means of Population 1 and Population 2 to within 0.811 with 95% confidence. (c) (–1.811, −0.189). We are 95% confident that the difference in the population means of Population 1 and Population 2 lies between 2 1.811 and 20.189.
16. Data in Exercise 8, confidence level = 90%
For Exercises 17–20 a confidence interval for is given. Use the confidence interval to test, at level of significance , whether differs from each of the designated hypothesized values.
603
17. A 95% confidence interval for is (10, 15). Hypothesized values are
10.2.17
(a) lies outside of the interval (10, 15), so we reject at the level of significance. (b) . lies inside of the interval (10, 15), so we do not reject at the level of significance. (c) . lies outside of the interval (10, 15), so we reject at the level of significance.
18. A 99% confidence interval for is (0, 100). Hypothesized values are
19. A 90% confidence interval for is (−10, 10). Hypothesized values are
10.2.19
(a) . lies outside of the interval (–10, 10), so we reject at the level of significance. (b) . lies inside of the interval (–10, 10), so we do not reject at the level of significance. (c) . lies inside of the interval (–10, 10), so we do not reject at the level of significance.
20. A 95% confidence interval for is (−25, −15). Hypothesized values are
For Exercises 21–22, perform the indicated hypothesis test using the pooled variance method. The summary statistics were taken from random samples that were drawn independently. Assume .
21. Test, at level of significance , whether > .
Sample 1 | |||
Sample 2 |
10.2.21
. . Reject if . . . Since is not ≥ 1.294, we do not reject . There is insufficient evidence at the level of significance that the population mean of Population 1 is greater than the population mean of Population 2.
22. Test, at level of significance , whether .
Sample 1 | |||
Sample 2 |
For Exercises 23 and 24, construct a 95% confidence interval for for the indicated data using the pooled variance method.
23. The data in Exercise 21
10.2.23
(–2.940, 6.940). We are 95% confident that the difference in the population means of Population 1 and Population 2 lies between −2.940 and 6.940.
24. The data in Exercise 22
For Exercises 25 and 26, perform the indicated hypothesis test using the test. The summary statistics were taken from random samples that were drawn independently. Assume that and are known.
25. Test, at level of significance , whether > .
Sample 1 | |||
Sample 2 |
10.2.25
. . Reject if . . Since is , we reject . There is evidence at the level of significance that the population mean of Population 1 is greater than the population mean of Population 2.
26. Test, at level of significance , whether .
Sample 1 | |||
Sample 2 |
For Exercises 27 and 28, construct a 95% confidence interval for for the indicated data.
27. The data in Exercise 25
10.2.27
(0.289, 1.711). We are 95% confident that the difference in the population means of Population 1 and Population 2 lies between 0.289 and 1.711.
28. The data in Exercise 26
APPLYING THE CONCEPTS
For Exercises 29–49, assume normality and use Welch's test and interval unless otherwise indicated.
29. PC Sales. A personal computer company launched an advertising campaign in the hopes of boosting sales. A random sample (sample 1) of 16 days before the advertising blitz showed mean sales of 120 computers per day with a standard deviation of 30. A random sample (sample 2) of 15 days after the advertisements appeared showed mean sales of 125 computers per day with a standard deviation of 35. If it is appropriate, test whether . If not, explain why not.
10.2.29
Since both sample sizes are less than 30 and the distribution of both populations is unknown, it is not appropriate to use Welch's test.
30. Flight Delays. The U.S. Customs and Border Protection Agency keeps track of the flight delays at all U.S. international airports. The summary statistics for the waiting times at Los Angeles International Airport (Tom Bradley Terminal) on July 1, 2014 at 5 P.M. and at 11 P.M. are provided in the following table.
Waiting Times at 5 P.M. |
minutes |
minutes |
|
Waiting Times at 11 P.M. |
minutes |
minutes |
Test whether the mean waiting times at 5 P.M.are longer than those at 11 P.M., using level of significance .
31. Income in California Counties. According to random samples taken in 2010 by the Bureau of Economic Analysis, the mean income for Sacramento County and Los Angeles County, California, was $31,987 and $33,179, respectively. Suppose the samples had the following sample statistics.
Sacramento County | |||
los Angeles County |
10.2.31
(a) –$1192 (b) $2426.7954 (c) (–3618.7954, 1234.7954). We are 95% confident that the difference in population mean incomes lies between –$3618.7954 and $1234.7954. (d) versus . Reject if the . ; (TI-84: 0.1608312442). The is not , so we do not reject . There is insufficient evidence at level of significance that the population mean income of Sacramento County is less than that of Los Angeles County. (e) No. The hypothesis test in (d) is a one-tailed test and confidence intervals can only be used for two-tailed tests.
32. Math Scores. The Institute of Educational Sciences published the results of the Trends in International Math and Science Study. In 2011, the sample mean mathematics scores for 8th-grade students from the United States and Hong Kong were 509 and 586, respectively. Suppose independent random samples are drawn from each population, and assume that the populations are normally distributed with the following summary statistics.
USA | |||
Hong Kong |
Provide the point estimate of the difference in population means .
604
33. Children per Classroom. According to LocalSchoolDirectory.com, the sample mean number of children per teacher in the towns of Cupertino, California, and Santa Rosa, California, are 20.9 and 19.3, respectively. Suppose random samples of classrooms are taken from each county, with the following sample statistics.
Cupertino | |||
Santa Rosa |
10.2.33
(a) (–1.047, 4.247). We are 99% confident that the interval captures the difference in the population mean number of children per teacher in the towns of Cupertino, California, and Santa Rosa, California.
(b) . lies inside of the interval (–1.047, 4.247), so we do not reject . There is insufficient evidence at the level of significance that the population mean number of children per teacher in the town of Cupertino, California, differs from the population mean number of children per teacher in the town of Santa Rosa, California.
propertytax
34. Property Taxes. Suppose you want to move to either a small town in Ohio (sample 1) or a small town in North Carolina (sample 2). You did some research on property taxes in each state and chose two random samples shown in the table. The data represent the property taxes in dollars for a residence assessed at $250,000. Test whether using level of significance .
North Carolina | Ohio | ||
---|---|---|---|
164 | 206 | 298 | 270 |
147 | 129 | 270 | 315 |
207 | 176 | 165 | 177 |
138 | 120 | 400 | 245 |
143 | 154 | 268 | 180 |
201 | 123 | 289 | 292 |
285 | 291 | ||
225 |
35. Engineering Starting Salaries. The National Association of Colleges (NAC) reports mean starting salaries for college graduates in various disciplines. NAC reported that the mean salary for 2014 engineering graduates was $62,719, whereas the mean salary for 2013 engineering graduates was $62,535. Assume that these statistics were drawn from independent samples of size 100, each with a sample standard deviation of $10,000. Test whether there was a significant change in the salary of engineering graduates from 2013 to 2014, using level of significance .
10.2.35
versus ; Reject if the . . (TI-84: 0.8966133443). The is not , so we do not reject . There is insufficient evidence at level of significance that the population mean salary of engineering graduates has changed from 2013 to 2014.
36. Park Usage. Suppose that planners for the town of The Woodlands, Texas, were interested in assessing usage of their parks. Random samples were taken of the number of daily visitors to Windvale Park and Cranebrook Park, with the statistics as reported here.
Windvale Park | |||
Cranebrook Park |
Coaching for the SAT. Use this information for Exercises 37–39. The College Board reports that a pretest and post-test study was done to investigate whether coaching had a significant effect on SAT scores. The improvement from pretest to post-test was 29 points for the coached sample of students, with a standard deviation of 59 points. For the uncoached students, the pretest to post-test improvement was 21 points with a standard deviation of 52 points.
37. Suppose we consider a sample of 100 students from each group. Perform a test, at level of significance , to determine whether the population mean coached SAT pretest–post-test improvement is greater than that for the uncoached students.
10.2.37
. Reject if . . . Since the is not , we do not reject . There is insufficient evidence that the population coached SAT score improvement is greater than the population noncoached SAT score improvement. Critical-value method: . . Reject if . . Since is not , we do not reject . There is insufficient evidence at the level of significance that the population mean coached SAT improvement is greater than the population mean noncoached improvement.
38. Refer to Exercise 37.
39. What if the sample sizes for each group were some number greater than ?
10.2.39
(a) Since the width of the confidence interval is , an increase in the sample sizes will result in a decrease in the width of the confidence interval. This is good because smaller confidence intervals give a more precise estimate. (b) It depends on how large the new sample sizes are.
40. Nursing Support Services. A statistical study found that when nurses made home visits to pregnant teenagers to provide support services, discourage smoking, and otherwise provide care, the sample mean birth weight of the babies was higher for this treatment group (3285 grams) than for the control group (2922 grams) when the visits began before mid-gestation.10 There were 21 patients in the treatment group and 11 in the control group. Suppose the birth weights for the babies in both groups follow a normal distribution. Assume that the population standard deviation in each sample is 500 grams.
605
41. Nursing Support Services. Refer to Exercise 40. What if the birth weights of the babies in each group are the same certain amount greater? Explain how this would affect the following:
10.2.41
(a) Remain the same (b) Remain the same (c) Remain the same
(d) Remain the same
Zooplankton and Phytoplankton. Refer to the table below for Exercises 42 and 43. Meta-analysis refers to the statistical analysis of a set of similar research studies. In a meta-analysis, each data value represents an effect size calculated from the results of a particular study. The table contains effect sizes calculated in a meta-analysis for zooplankton and phytoplankton.11 Not surprisingly, the paper found that zooplankton biomass was reduced by the introduction of zooplanktivorous fish (that is, fish that eat zooplankton), but that phytoplankton biomass increased, because there were now fewer zooplankton, and the zooplankton eat the phytoplankton. Such an effect is called a trophic cascade. See if you can replicate the scientists' results in the following exercises.
Zooplankton | Phytoplankton | ||
---|---|---|---|
−2.37 | −3.00 | 10.61 | 3.04 |
−0.64 | −0.68 | 2.97 | 0.65 |
−2.05 | −1.39 | 1.58 | 2.55 |
−1.54 | −0.64 | 2.55 | 1.05 |
−6.60 | −3.88 | 5.67 | 2.11 |
0.26 | 1.57 |
plankton
42. Use technology to construct a comparison dot plot or a comparison histogram of the zooplankton and the phytoplankton effect sizes. Is there evidence of a difference in mean effect size for the two groups?
plankton
43. Test whether the population means effect sizes differ, at level of significance .
10.2.43
. Reject if the . . (TI-84: 0.000092880601). The is , so we reject . There is evidence at level of significance that the population mean effect sizes of zooplankton differs from that of the phytoplankton.
Studying Time for Biologists and Psychologists. Use the following information for Exercises 44–47: The National Survey of Student Engagement reported in 2012 that the mean number of hours spent studying per week for biology majors was 16.7, whereas the mean for psychology majors was 13.9. Assume that these are sample means from sample sizes of 100, and that the sample standard deviations were each 5 hours.
44. Calculate the margin of error for a 95% confidence interval for the difference in population mean studying times (Biology – Psychology).
45. Construct a 95% confidence interval for the difference in population mean studying times (Biology – Psychology).
10.2.45
(1.3950, 4.2050)
46. Use the confidence interval to determine whether a significant difference exists between the population mean studying times.
47. Test, using level of significance , whether the population mean studying times are equal. Does your conclusion agree with that in Exercise 46?
10.2.47
. Reject if the . . (TI-84: 0.0001045746); The is , so we reject . There is evidence at level of significance that the population mean study time of biology majors differs from that of psychology majors. Yes.
48. Does Multitasking Degrade Performance? The journal Computers and Education reported that multitasking during class may degrade student performance on assessments.12 Students were randomly assigned into two groups: multitaskers and non-multitaskers. The multitaskers have their laptops open during class, doing things such as surfing Facebook and so forth. The following table provides the summary statistics for a class quiz for the two groups. Test whether the population mean score for the multitaskers is lower than that of the non-multitaskers, using level of significance .
Multitaskers | |||
non-multitaskers |
49. Does Multitasking Degrade your neighbor's Performance? Refer to the previous exercise. The same journal article reported that those sitting next to the multitaskers may have also suffered from degraded quiz performance, due to being distracted by their neighbor's laptop multitasking. The following table provides the summary statistics for a class quiz for the two groups: those sitting where they could see the multitasker's screen (multitasker's neighbors), and those who could not (multitasker's non-neighbors). Test whether the population mean score for the multitasker's neighbors is lower than that of the multitasker's non-neighbors, using level of significance .
Multitasker's neighbors | |||
Multitasker's non-neighbors |
10.2.49
. Reject if the . s. (TI-84: 0.00005106366) The is , so we reject . There is evidence at level of significance that the population mean score for the multitasker's neighbors is less than that of the multitasker's non-neighbors.
BRINGING IT ALL TOGETHER
Do Prior Student Evaluations Influence Students' Ratings of Professors? A study in 1950 reported that instructor reputation affected students' ratings of their instructors.13 Towler and Dipboye uncovered experimental evidence in support of this phenomenon.14 They randomly assigned to students one of two summaries of prior student evaluations: one for a “charismatic instructor” and the other for a “punitive instructor.” The “charismatic” summary included such phrases as “always lively and stimulating in class” and “always approachable and treated students as individuals.” The “punitive” summary included such phrases as “did not show an interest in students' progress” and “consistently seemed to grade students harder.” All subjects were then shown the same 20-minute lecture video given by the same instructor. They were asked to rate the instructor using three questions, and a summary rating score was calculated. The summary statistics are shown below. Assume that both populations are normally distributed and that the samples are drawn independently. Were students' ratings influenced by the prior student evaluations? Find out by answering Exercises 50–58.
606
Reputation | Subjects | Sample mean rating |
Sample standard deviation |
---|---|---|---|
Charismatic (sample 1) |
|||
Punitive (sample 2) |
50. Test whether the population mean student evaluation rating for the charismatic group is greater than that of the punitive group, using a test and level of significance .
51. Construct a 95% confidence interval for .
10.2.51
(0.0588, 0.6952) {TI-84: (0.0676, 0.6864)}
52. Recall we explored some equivalences between confidence intervals and hypothesis tests. Could we have used the confidence interval in Exercise 51 to perform the hypothesis test in Exercise 50? Explain.
53. Test whether the population mean student ratings differ, using a test and level of significance .
10.2.53
. Reject if the . . (TI-84: 0.0180119956) The is , so we reject . There is evidence at level of significance that the population mean student evaluation rating for the charismatic group differs from that of the punitive group.
54. Use the confidence interval in Exercise 51 to conduct tests, at level of significance , for whether takes the following values:
55. Do the following:
10.2.55
(a) (b) . Reject if the . . . The is , so we reject . There is evidence at level of significance that the population mean student evaluation rating for the charismatic group differs from that of the punitive group. (c) The value of in (b) is a little bit larger than the value of in Problem 53, so the in (b) is a little smaller than the in Problem 53. The conclusions of both tests are to reject .
56. Construct a 95% pooled variance confidence interval for . Compare your results with Exercise 51.
For Exercises 57 and 58, assume that the given standard deviations are population standard deviations.
57. Use a test to test whether the population mean student ratings differ, using level of significance . Compare your results to Exercises 53 and 55.
10.2.57
. Reject if the . . . The is , so we reject . There is evidence at level of significance that the population mean student evaluation rating for the charismatic group differs from that of the punitive group. The value of in this problem equals the value of in Problem 53 but it is a little bit smaller than the value of in Problem 55 (b). The is smaller than the other s. All three hypothesis tests have the same conclusion of reject .
58. Construct a 95% confidence interval for . Compare your results with Exercises 51 and 56.
WORKING WITH LARGE DATA SETS
Case Study: Bank loans.
Open the Chapter 10 Case Study data sets, BankLoans_approved and BankLoans_Denied. Here, we will examine whether a difference exists in the loan request amount between those approved for and those denied a loan. Use technology to do the following:
bankloans_approved
bankloans_denied
59. Obtain independent random samples of size 100, one each from the bankloans_approved data set and the BankLoans_Denied data set.
bankloans_approved
bankloans_denied
60. For each sample, find the summary statistics for the variable Request amount, that is, the sample size, sample mean, and sample standard deviation.
bankLoans_approved
bankloans_denied
61. Perform a test, using level of significance , to determine whether the population mean request amount differs between those approved for a loan and those denied the loan.