Chapter 6: Introduction to Inference

SECTION 6.3 EXERCISES

For Exercise 6.90, see page 386; and for Exercise 6.91, see page 388.

Question 6.92

6.92 A role as a statistical consultant. You are the statistical expert for a graduate student planning her PhD research. After you carefully present the mechanics of significance testing, she suggests using α = 0.20 for the study because she would be more likely to obtain statistically significant results and she really needs significant results to graduate. Explain in simple terms why this would not be a good use of statistical methods.

Question 6.93

6.93 What do you know? A research report described two results that both achieved statistical significance at the 5% level. The P-value for the first is 0.048; for the second it is 0.0002. Do the P-values add any useful information beyond that conveyed by the statement that both results are statistically significant? Write a short paragraph explaining your views on this question.

Question 6.94

6.94 Selective publication based on results. In addition to statistical significance, selective publication can also be due to the observed outcome. A recent review of 74 studies of antidepressant agents found 38 studies with positive results and 36 studies with negative or questionable results. All but one of the 38 positive studies were published. Of the remaining 36, 22 were not published and 11 were published in such a way as to convey a positive outcome.²⁹ Describe how this selective reporting can have adverse consequences on health care.

Question 6.95

6.95 What a test of significance can answer. Explain whether a test of significance can answer each of the following questions.

(a) Is the sample or experiment properly designed?
(b) Is the observed effect compatible with the null hypothesis?
(c) Is the observed effect important?

Question 6.96

6.96 Vitamin C and colds. In a study to investigate whether vitamin C will prevent colds, 400 subjects are assigned at random to one of two groups. The experimental group takes a vitamin C tablet daily, while the control group takes a placebo. At the end of the experiment, the researchers calculate the difference between the percents of subjects in the two groups who were free of colds. This difference is statistically significant (P = 0.03) in favor of the vitamin C group. Can we conclude that vitamin C has a strong effect in preventing colds? Explain your answer.

390

Question 6.97

6.97 How far do rich parents take us? How much education children get is strongly associated with the wealth and social status of their parents, termed “socioeconomic status,” or SES. The SES of parents, however, has little influence on whether children who have graduated from college continue their education. One study looked at whether college graduates took the graduate admissions tests for business, law, and other graduate programs. The effects of the parents’ SES on taking the LSAT test for law school were “both statistically insignificant and small.”

(a) What does “statistically insignificant” mean?
(b) Why is it important that the effects were small in size as well as statistically insignificant?

Question 6.98

6.98 Do you agree? State whether or not you agree with each of the following statements and provide a short summary of the reasons for your answers.

(a) If the P-value is larger than 0.05, the null hypothesis is true.
(b) Practical significance is not the same as statistical significance.
(c) We can perform a statistical analysis using any set of data.
(d) If you find an interesting pattern in a set of data, it is appropriate to then use a significance test to determine its significance.
(e) It’s always better to use a significance level of α = 0.05 than to use α = 0.01 because it is easier to find statistical significance.

Question 6.99

6.99 Practical significance and sample size. Every user of statistics should understand the distinction between statistical significance and practical importance. A sufficiently large sample will declare very small effects statistically significant. Consider the study of elite female Canadian athletes in Exercise 6.74 (page 382). Female athletes were consuming an average of 2403.7 kcal/d with a standard deviation of 880 kcal/d. Suppose that a nutritionist is brought in to implement a new health program for these athletes. This program should increase mean caloric intake but not change the standard deviation. Given the standard deviation and how calorie deficient these athletes are, a change in the mean of 50 kcal/d to 2453.7 is of little importance. However, with a large enough sample, this change can be significant. To see this, calculate the P-value for the test of

H₀: μ = 2403.7

H_a: μ > 2403.7

in each of the following situations:

(a) A sample of 100 athletes; their average caloric intake is .
(b) A sample of 500 athletes; their average caloric intake is .
(c) A sample of 2500 athletes; their average caloric intake is .

Question 6.100

6.100 Statistical versus practical significance. A study with 7500 subjects reported a result that was statistically significant at the 5% level. Explain why this result might not be particularly important.

Question 6.101

6.101 More on statistical versus practical significance. A study with 14 subjects reported a result that failed to achieve statistical significance at the 5% level. The P-value was 0.051. Write a short summary of how you would interpret these findings.

Question 6.102

6.102 Find journal articles. Find two journal articles that report results with statistical analyses. For each article, summarize how the results are reported and write a critique of the presentation. Be sure to include details regarding use of significance testing at a particular level of significance, P-values, and confidence intervals.

Question 6.103

6.103 Create an example of your own. For each of the following cases, provide an example and an explanation as to why it is appropriate.

(a) A set of data or an experiment for which statistical inference is not valid.
(b) A set of data or an experiment for which statistical inference is valid.

Question 6.104

6.104 Predicting success of trainees. What distinguishes managerial trainees who eventually become executives from those who, after expensive training, don’t succeed and leave the company? We have abundant data on past trainees—data on their personalities and goals, their college preparation and performance, even their family backgrounds and their hobbies. Statistical software makes it easy to perform dozens of significance tests on these dozens of variables to see which ones best predict later success. We find that future executives are significantly more likely than washouts to have an urban or suburban upbringing and an undergraduate degree in a technical field.

Explain clearly why using these “significant” variables to select future trainees is not wise. Then suggest a follow-up study using this year’s trainees as subjects that should clarify the importance of the variables identified by the first study.

391

Question 6.105

6.105 Searching for significance. Give an example of a situation where searching for significance would lead to misleading conclusions.

Question 6.106

6.106 More on searching for significance. You perform 1000 significance tests using α = 0.05. Assuming that all null hypotheses are true, about how many of the test results would you expect to be statistically significant? Explain how you obtained your answer.

Question 6.107

6.107 Interpreting a very small P-value. Assume that you are performing a large number of significance tests. Let n be the number of these tests. How large would n need to be for you to expect about one P-value to be 0.00001 or smaller? Use this information to write an explanation of how to interpret a result that has P = 0.00001 in this setting.

Question 6.108

6.108 An adjustment for multiple tests. One way to deal with the problem of misleading P-values when performing more than one significance test is to adjust the criterion you use for statistical significance. The Bonferroni procedure does this in a simple way. If you perform two tests and want to use the α = 5% significance level, you would require a P-value of 0.05/2 = 0.025 to declare either one of the tests significant. In general, if you perform k tests and want protection at level α, use α/k as your cutoff for statistical significance. You perform six tests and obtain individual P-values of 0.075, 0.021, 0.285, 0.002, 0.015, and <0.001. Which of these are statistically significant using the Bonferroni procedure with α = 0.05?

Question 6.109

6.109 Significance using the Bonferroni procedure. Refer to the previous exercise. A researcher has performed 12 tests of significance and wants to apply the Bonferroni procedure with α = 0.05. The calculated P-values are 0.141, 0.519, 0.186, 0.753, 0.001, 0.008, 0.646, 0.038, 0.898, 0.013, <0.002, and 0.538. Which of these tests reject their null hypotheses with this procedure?