Clarifying the Concepts
8.1 |
What specific danger exists when reporting a statistically significant difference between two means? |
8.2 |
In your own words, define the word confidence—first as you would use it in everyday conversation and then as a statistician would use it in the context of a confidence interval. |
8.3 |
Why do we calculate confidence intervals? |
8.4 |
What are the five steps to create a confidence interval for the mean of a z distribution? |
8.5 |
In your own words, define the word effect—first as you would use it in everyday conversation and then as a statistician would use it. |
8.6 |
What effect does increasing the sample size have on standard error and the test statistic? |
8.7 |
Relate effect size to the concept of overlap between distributions. |
8.8 |
What does it mean to say an effect- |
8.9 |
What are Cohen’s guidelines for small, medium, and large effects? |
8.10 |
How does statistical power relate to Type II errors? |
8.11 |
In your own words, define the word power—first as you would use it in everyday conversation and then as a statistician would use it. |
8.12 |
How are statistical power and effect size different but related? |
8.13 |
Traditionally, what minimum percentage chance of correctly rejecting the null hypothesis is suggested in order to proceed with an experiment? |
8.14 |
Explain how increasing alpha increases statistical power. |
8.15 |
List five factors that affect statistical power. For each, indicate how a researcher can leverage that factor to increase power. |
8.16 |
What are the four basic steps of a meta- |
8.17 |
What is the goal of a meta- |
8.18 |
In statistics, concepts are often expressed in symbols and equations. For Mlower = −z(σ) + Msample, (i) identify the incorrect symbol, (ii) state what the correct symbol is, and (iii) explain why the initial symbol was incorrect. |
8.19 |
In statistics, concepts are often expressed in symbols and equations. For , (i) identify the incorrect symbol, (ii) state what the correct symbol is, and (iii) explain why the initial symbol was incorrect. |
8.20 |
In 2008, the Gallup poll asked people whether or not they were suspicious of steroid use among Olympic athletes. Thirty- |
207
8.21 |
In 2008, twenty- |
8.22 |
In 2013, the Gallup polling organization and the online publication Inside Higher Ed reported the results of a survey of 831 university presidents and chancellors. The report stated: “For results based on the sample size of 831 total respondents, one can say with 95 percent confidence that the margin of error attributable to sampling error is ±3.4 percentage points” (p. 6). Fourteen percent of respondents indicated that they strongly agreed that massive open online courses (MOOCs) could have a positive impact on higher education. Construct an interval estimate for the point estimate of 14%. |
8.23 |
For each of the following confidence levels, indicate how much of the distribution would be placed in the cutoff region for a one-
|
8.24 |
For each of the following confidence levels, indicate how much of the distribution would be placed in the cutoff region for a two-
|
8.25 |
For each of the following confidence levels, look up the critical z value for a one-
|
8.26 |
For each of the following confidence levels, look up the critical z values for a two-
|
8.27 |
Calculate the 95% confidence interval for the following fictional data regarding daily TV viewing habits: μ = 4.7 hours; σ = 1.3 hours; sample of 78 people, with a mean of 4.1 hours. |
8.28 |
Calculate the 80% confidence interval for the same fictional data regarding daily TV viewing habits: μ = 4.7 hours; σ = 1.3 hours; sample of 78 people, with a mean of 4.1 hours. |
8.29 |
Calculate the 99% confidence interval for the same fictional data regarding daily TV viewing habits: µ = 4.7 hours; σ = 1.3 hours; sample of 78 people, with a mean of 4.1 hours. |
8.30 |
Calculate the standard error for each of the following sample sizes when μ = 1014 and σ = 136:
|
8.31 |
For a given variable, imagine we know that the population mean is 1014 and the standard deviation is 136. A sample mean of 1057 is obtained. Calculate the z statistic for this mean, using each of the following sample sizes:
|
8.32 |
Calculate the effect size for the mean of 1057 observed in Exercise 8.31 where μ = 1014 and σ =136. |
8.33 |
Calculate the effect size for each of the following average SAT math scores. Remember, SAT math is standardized such that μ = 500 and σ = 100.
|
8.34 |
For each of the effect-
|
8.35 |
For each of the following d values, identify the size of the effect, using Cohen’s guidelines.
|
8.36 |
For each of the following d values, identify the size of the effect, using Cohen’s guidelines.
|
8.37 |
For each of the following z statistics, calculate the p value for a two-
|
208
8.38 |
A meta-
|
8.39 |
A meta- |
8.40 |
Assume you are conducting a meta-
|
8.41 |
Assume you are conducting a meta-
|
8.42 |
Margin of error and adult education: According to a report by Public Agenda and the Kresge Foundation, online education is popular among adults planning to return to university (2013). “The majority (73 percent) of adult prospective students want to take at least some classes online, and nearly 4 in 10 (37 percent) say it is absolutely essential for them that their future school offer online classes” (p. 25). The margin of error was reported to be 4.27. Calculate an interval estimate for each of these findings. |
8.43 |
Distributions and the Burakumin: A friend reads in her Introduction to Psychology textbook about a minority group in Japan, the Burakumin, who are racially the same as other Japanese people but are viewed as outcasts because their ancestors were employed in positions that involved the handling of dead animals (e.g., butchers). In Japan, the text reported, mean IQ scores of Burakumin were 10 to 15 points below mean IQ scores of other Japanese people. In the United States, where Burakumin experienced no discrimination, there was no mean difference (from Ogbu, 1986, as reported in Hockenbury & Hockenbury, 2013). Your friend says to you: “Wow— |
8.44 |
Sample size, z statistics, and the Consideration of Future Consequences scale: Here are summary data from a z test regarding scores on the Consideration of Future Consequences scale (Petrocelli, 2003): The population mean (μ) is 3.20 and the population standard deviation (σ) is 0.70. Imagine that a sample of students had a mean of 3.45.
|
8.45 |
Sample size, z statistics, and the Graded Naming Test: In an exercise in Chapter 7, we asked you to conduct a z test to ascertain whether the Graded Naming Test (GNT) scores for Canadian participants differed from the GNT norms based on adults in England. We also used these data in the How It Works section of this chapter. The mean for a sample of 30 adults in Canada was 17.5. The normative mean for adults in England is 20.4, and we assumed a population standard deviation of 3.2. With 30 participants, the z statistic was −4.97, and we were able to reject the null hypothesis.
|
8.46 |
Cheating with hypothesis testing: Unsavory researchers know that one can cheat with hypothesis testing. That is, they know that a researcher can stack the deck in her or his favor, making it easier to reject the null hypothesis.
|
209
8.47 |
Overlapping distributions and the LSATs: A Midwestern U.S. university reported that its behavioral science majors tended to outperform its humanities majors on the LSAT standardized test for law school admissions. Sadie, an English major, and Kofi, a sociology major, both just took the LSAT.
|
8.48 |
Confidence intervals, effect sizes, and tennis serves: Let’s assume the average speed of a serve in men’s tennis is around 135 mph, with a standard deviation of 6.5 mph. Because these statistics are calculated over many years and many players, we will treat them as population parameters. We develop a new training method that will increase arm strength, the force of the tennis swing, and the speed of the serve, we hope. We recruit 9 professional tennis players to use our method. After 6 months, we test the speed of their serves and compute an average of 138 mph.
|
8.49 |
Confidence intervals and football wins: In an exercise in Chapter 7, we asked whether college football teams tend to be more likely or less likely to be mismatched in the upper National Collegiate Athletic Association (NCAA) divisions. During one week of a college football season, the population of 53 Football Bowl Subdivision (FBS) games had a mean spread (winning score minus losing score) of 16.189, with a standard deviation of 12.128. We took a sample of 4 games that were played that week in the next-
|
8.50 |
Confidence intervals and football wins (continued): Using the football data presented in Exercise 8.49, practice evaluating data using confidence intervals.
|
8.51 |
Effect size and football wins: In Exercises 8.49 and 8.50, we considered the study of one week of a 2006 college football season, during which the population of 53 FBS games had a mean spread (winning score minus losing score) of 16.189, with a standard deviation of 12.128. The sample of 4 games that were played that week in the next highest league, the FCS, had a mean of 8.75.
|
8.52 |
Effect size and football wins (continued): In Exercise 8.51, you calculated an effect size for data from one week of a 2006 college football season with 4 games. Imagine that you had a sample of 20 games. How would the effect size change? Explain why it would or would not change. |
8.53 |
Confidence intervals, effect sizes, and Valentine’s Day spending: According to the Nielsen Company, Americans spend $345 million on chocolate during the week of Valentine’s Day. Let’s assume that we know the average married person spends $45, with a population standard deviation of $16. In February 2009, the U.S. economy was in the throes of a recession. Comparing data for Valentine’s Day spending in 2009 with what is generally expected might give us some indication of the attitudes during the recession.
|
210
8.54 |
More about confidence intervals, effect sizes, and tennis serves: Let’s assume the average speed of a serve in women’s tennis is around 118 mph, with a standard deviation of 12 mph. We recruit 100 amateur tennis players to use our new training method this time, and after 6 months we calculate a group mean of 123 mph.
|
8.55 |
Confidence intervals, effect sizes, and tennis serves (continued): As in the previous exercise, assume the average speed of a serve in women’s tennis is around 118 mph, with a standard deviation of 12 mph. But now we recruit only 26 amateur tennis players to use our method. Again, after 6 months we calculate a group mean of 123 mph.
|
8.56 |
Statistical power and football wins: In several exercises in this chapter, we considered the study of one week of a college football season, during which the population of 53 FBS games had a mean spread (winning score minus losing score) of 16.189, with a standard deviation of 12.128. The sample of 4 games that were played that week in the next-
|
8.57 |
Statistical power and tennis serves: Calculate statistical power based on the data presented in Exercise 8.55 using the following alpha levels in a one-
|
8.58 |
Effect size and homeless families: A New York Times article reported on the growing problem of homelessness among families (Bellafante, 2013). The reporter wrote that families in a city-
|
8.59 |
Meta- Many previous authors have advocated traditional mental health treatments be modified to better match clients’ cultural contexts. Numerous studies evaluating culturally adapted interventions have appeared, and the present study used meta-
|
8.60 |
Meta- Across all 76 studies, the random effects weighted average effect size was d = .45 (SE = .04, p < .0001), with a 95% confidence interval of d = .36 to d = .53. The data consisted of 72 nonzero effect sizes, of which 68 (94%) were positive and 4 (6%) were negative. Effect sizes ranged from d = −48 to d = 2.7. (Griner & Smith, 2006, p. 535)
|
211
8.61 |
Fantasy baseball: Your roommate is reading Fantasyland: A Season on Baseball’s Lunatic Fringe (Walker, 2006) and is intrigued by the statistical methods used by competitors in fantasy baseball leagues (in which competitors select a team of baseball players from across all major league teams, winning in the fantasy league if their eclectic roster of players outperforms the chosen mixes of other fantasy competitors). Among the many statistics reported in the book is a finding that Major League Baseball (MLB) players who have a third child show more of a decline in performance than players who have a first child or a second child. Your friend remembers that Red Sox player David Ortiz has three children and drops him from consideration for his fantasy team.
|
8.62 |
Hours of sleep: The table below provides information about hours of sleep.
|
8.63 |
Effect size and an intervention to increase college applications: Caroline Hoxby and Sarah Turner (2013) conducted an experiment to determine whether a simple intervention could increase the number of college applications among low- 212
|