Exercises

Clarifying the Concepts

Question 15.1

15.1

Distinguish among nominal, ordinal, and scale data.

Question 15.2

15.2

What are the three main situations in which we use a nonparametric test?

Question 15.3

15.3

What is the difference between the chi-square test for goodness of fit and the chi-square test for independence?

Question 15.4

15.4

What are the four assumptions for the chi-square tests?

Question 15.5

15.5

List two ways in which statisticians use the word independence or independent with respect to concepts introduced earlier in this book. Then describe how independence is used by statisticians with respect to chi square.

Question 15.6

15.6

What are the hypotheses when conducting the chi-square test for goodness of fit?

Question 15.7

15.7

How are the degrees of freedom for the chi-square hypothesis tests different from those of most other hypothesis tests?

Question 15.8

15.8

Why is there just one critical value for a chi-square test, even when the hypothesis is a two-tailed test?

Question 15.9

15.9

What information is presented in a contingency table in the chi-square test for independence?

Question 15.10

15.10

What measure of effect size is used with chi square?

Question 15.11

15.11

Define the symbols in the following formula: image

Question 15.12

15.12

What is the formula image used for?

Question 15.13

15.13

What information does the measure of relative likelihood provide?

Question 15.14

15.14

In order to calculate relative likelihood, what must we first calculate?

Question 15.15

15.15

What is the difference between relative likelihood and relative risk?

Question 15.16

15.16

Why might relative likelihood be easier to understand as an effect size than Cramér’s V ?

Question 15.17

15.17

Which graph is most useful for displaying the results of a chi-square test for independence?

Question 15.18

15.18

Why can relative likelihood and relative risk sometimes exaggerate risks? (Hint: Think about the base rates—how common a disorder or other occurrence is to start with.)

Question 15.19

15.19

When do we convert scale data to ordinal data?

Question 15.20

15.20

When the data on at least one variable are ordinal, the data on any scale variable must be converted from scale to ordinal. How do we convert a scale variable into an ordinal one?

Question 15.21

15.21

How does the transformation of scale data to ordinal data solve the problem of outliers?

Question 15.22

15.22

What does a histogram of rank-ordered data look like and why does it look that way?

Question 15.23

15.23

Explain how the relation between ranks is the core of the Spearman rank-order correlation.

Question 15.24

15.24

Define the symbols in the following term: image

Question 15.25

15.25

What is the possible range of values for the Spearman rank-order correlation and how are these values interpreted?

Question 15.26

15.26

How would you respond in a situation in which you are ranking a set of scale data and there are two numbers that are exactly the same?

Question 15.27

15.27

When is it appropriate to use the Wilcoxon signed-rank test?

Question 15.28

15.28

When do we use the Mann–Whitney U test?

Question 15.29

15.29

What are the assumptions of the Mann–Whitney U test?

467

Question 15.30

15.30

How are the critical values for the Mann–Whitney U test used differently than critical values for parametric tests?

Question 15.31

15.31

If the data meet the assumptions of the parametric test, why is it preferable to use the parametric test rather than the nonparametric alternative?

Calculating the Statistics

Question 15.32

15.32

For each of the following, (i) identify the incorrect symbol, (ii) state what the correct symbol should be, and (iii) explain why the initial symbol was incorrect.

  1. For the chi-square test for goodness of fit: dfχ2 = N − 1

  2. For the chi-square test for independence: dfχ2 = (krow − 1) + (kcolumn − 1)

  3. image

  4. image

  5. Expected frequency for each cell image

Question 15.33

15.33

For each of the following, identify the independent variable(s), the dependent variable(s), and the level of measurement (nominal, ordinal, scale).

  1. The number of loads of laundry washed per month was tracked for women and men living in college dorms.

  2. A researcher interested in people’s need to maintain their social image collected data on the number of miles on someone’s car and his or her rank for “need for approval” out of the 183 people studied.

  3. A professor of social science was interested in whether involvement in campus life is significantly impacted by whether a student lives on or off campus. Thirty-seven students living on campus and 37 students living off campus were asked whether they were an active member of a club.

Question 15.34

15.34

Use this calculation table for the chi-square test for goodness of fit to complete this exercise.

Category Observed (O) Expected (E) OE (OE )2 image
1 48 60
2 46 30
3 6 10
  1. Calculate degrees of freedom for this chi-square test for goodness of fit.

  2. Perform all of the calculations to complete this table.

  3. Compute the chi-square statistic.

Question 15.35

15.35

Use this calculation table for the chi-square test for goodness of fit to complete this exercise.

Category Observed (O) Expected (E ) OE (OE )2 image
1 750 625
2 650 625
3 600 625
4 500 625
  1. Calculate degrees of freedom for this chi-square test for goodness of fit.

  2. Perform all of the calculations to complete this table.

  3. Compute the chi-square statistic.

Question 15.36

15.36

Below are some data to use in a chi-square test for independence.

Observed
Accidents No Accidents
Rain 19 26 45
No rain 20 71 91
39 97 136
  1. Calculate the degrees of freedom for this test.

  2. Complete this table of expected frequencies.

  3. Calculate the test statistic.

  4. Calculate the appropriate measure of effect size.

  5. Calculate the relative likelihood of accidents, given that it is raining.

Expected
Accidents No Accidents
Rain
No rain

Question 15.37

15.37

The data below are from a study of lung cancer patients in Turkey (Yilmaz et al., 2000). Use these data to calculate the relative likelihood of the patient being a smoker, given that a person is female rather than male.

Nonsmoker Smoker
Female 186 13
Male 182 723

Question 15.38

15.38

In order to compute statistics, we need to have working formulas. For the following, (i) identify the incorrect symbol, (ii) state what the correct symbol should be, and (iii) explain why the initial symbol was incorrect.

  1. image

  2. image

468

Question 15.39

15.39

Consider the following scale data.

Participant Variable X Variable Y
1 134.5 64.00
2 186 60.00
3 157 61.50
4 129 66.25
5 147 65.50
6 133 62.00
7 141 62.50
8 147 62.00
9 136 63.00
10 147 65.50
  1. Convert the data to ordinal or ranked data, starting with a rank of 1 for the smallest data point.

  2. Compute the Spearman correlation coefficient.

Question 15.40

15.40

Consider the following scale data.

Participant Variable X Variable Y
1 $1250 25
2 $1400 21
3 $1100 32
4 $1450 54
5 $1600 38
6 $2100 62
7 $3750 43
8 $1300 32
  1. Convert the data to ordinal or ranked data, starting with a rank of 1 for the smallest data point.

  2. Compute the Spearman correlation coefficient.

Question 15.41

15.41

The following fictional data represent the finishing place for runners of a 5-kilometer race and the number of hours they trained per week.

Race Rank Hours Trained Race Rank Hours Trained
1 25 6 18
2 25 7 12
3 22 8 17
4 18 9 15
5 19 10 16
  1. Calculate the Spearman correlation for this set of data.

  2. Make a decision regarding the null hypothesis. Is there a significant correlation between a runner’s finishing place and the amount the runner trained?

Question 15.42

15.42

Compute the Mann–Whitney U statistic for the following data. Each participant has been assigned a group and a participant number; these are shown in the “Group 1” and “Group 2” columns.

Group 1 Scale Dependent Variable Group 2 Scale Dependent Variable
1 8 9 3
2 5 10 4
3 5 11 2
4 7 12 1
5 10 13 1
6 14 14 5
7 9 15 6
8 11

Question 15.43

15.43

Compute the Mann–Whitney U statistic for the following data. Each participant has been assigned a group and a participant number; these are shown in the “Group 1” and “Group 2” columns.

Group 1 Ordinal Dependent Variable Group 2 Ordinal Dependent Variable
1 1 1 11
2 2.5 2 9
3 8 3 2.5
4 4 4 5
5 6 5 7
6 10 6 12

469

Question 15.44

15.44

Assume a researcher compared the performance of two independent groups of participants on an ordinal variable using the Mann–Whitney U test. The first group had 8 participants and the second group had 11 participants.

  1. Using a p level of 0.05 and a two-tailed test, determine the critical value.

  2. Assume the researcher calculated U1 = 22 and U2 = 17. Make a decision regarding the null hypothesis and explain that decision.

  3. Assume the researcher calculated U1 = 24 and U2 = 30. Make a decision regarding the null hypothesis and explain that decision.

  4. Assume the researcher calculated U1 = 13 and U2 = 9. Make a decision regarding the null hypothesis and explain that decision.

Question 15.45

15.45

Are men or women more likely to be at the top of their class? The following table depicts fictional class standings for a group of men and women:

Student Gender Class Standing Student Gender Class Standing
1 Male 98 7 Male 43
2 Female 72 8 Male 33
3 Male 15 9 Female 17
4 Female 3 10 Female 82
5 Female 102 11 Male 63
6 Female 8 12 Male 25
  1. Compute the Mann–Whitney U test statistic.

  2. Make a decision regarding the null hypothesis. Is there a significant difference in the class ranks of men and women?

Applying the Concepts

Question 15.46

15.46

A nonparametric test and gender differences in obeying bicycling laws: Students at Hunter College studied bicycle safety in New York City (Tuckel & Milczarski, 2014). They reported data on cyclists who were riding their own bikes and were not cycling as part of their job (they were not, for example, riding as delivery workers). They reported that 28.4% of male cyclists stopped completely at a red light, whereas 38.3% of female cyclists did so. 30.9% of male cyclists and 35.4% of female cyclists paused but then rolled through the red light. And 40.7% of male cyclists and 26.3% of female cyclists just ran the light. What hypothesis test would the researchers conduct? Explain your answer.

Question 15.47

15.47

Gender, salary negotiation, and chi square: Researchers investigated whether or not language in job postings affected the likelihood that women and men would negotiate regarding salary (Leibbrandt & List, 2012). Some job postings clearly indicated that the salary was negotiable, and others contained no such statement. The postings were otherwise identical. The researchers examined the behavior of almost 2500 applicants for one of the jobs in these advertisements. The graph shows the proportions of women and men who negotiated in response to either type of listing.

image
  1. What are the variables in this study, what are their levels, and what types of variables are they?

  2. Explain why the researchers would have been able to use chi square in this study. Which type of chi-square test would have been appropriate?

  3. Based on the graph, explain in your own words what the researchers found.

Question 15.48

15.48

Gender, the Oscars, and nonparametric tests: In 2010, Sandra Bullock won an Academy Award for best actress. Shortly thereafter, she discovered that her husband was cheating on her. Headlines erupted about a supposed Oscar curse that befalls women, and many in the media wondered whether ambitious women—whether actors or corporate leaders—are more likely than ambitious men to run the risk of ruining their family lives. Reporters breathlessly listed female actors who were divorced within a couple of years of winning an Oscar—Julia Roberts, Helen Hunt, Kate Winslet, Halle Berry, and Reese Witherspoon among them.

  1. A good researcher always asks, “Compared to what?” In this case, what would be an appropriate comparison group to use to determine whether there really is a gender difference in likelihood of relationship breakups among Oscar winners? Explain your answer.

  2. Kate Harding (2010) reported that many men—including Russell Crowe, William Hurt, Dustin Hoffman, Robert Duvall, and Clark Gable—experienced the same outcome. Indeed, Harding counted 15 best actor winners, compared with just 8 best actress winners, who divorced not long after winning an Oscar. If she wanted to conduct statistical analyses, what test would Harding use? Explain your answer.

  3. Explain how an illusory correlation, bolstered by a confirmation bias, might have led to the headlines despite evidence to the contrary.

470

Question 15.49

15.49

Parametric or nonparametric test? For each of the following research questions, state whether a parametric or nonparametric hypothesis test is more appropriate. Explain your answers.

  1. Are women more or less likely than men to be economics majors?

  2. At a small company with 15 staff and one top boss, do those with a college education tend to make a different amount of money than those without one?

  3. At your high school, did athletes or nonathletes tend to have higher grade point averages?

  4. At your high school, did athletes or nonathletes tend to have higher class ranks?

  5. Compare car accidents in which the occupants were wearing seat belts with accidents in which the occupants were not wearing seat belts. Do seat belts seem to make a difference in the numbers of accidents that lead to no injuries, nonfatal injuries, and fatal injuries?

  6. Compare car accidents in which the occupants were wearing seat belts with accidents in which the occupants were not wearing seat belts. Were those wearing seat belts driving at slower speeds, on average, than those not wearing seat belts?

Question 15.50

15.50

Types of variables and student evaluations of professors: Weinberg, Fleisher, and Hashimoto (2007) studied almost 50,000 students’ evaluations of their professors in nearly 400 economics courses at the Ohio State University over a 10-year period. For each of their findings, outlined below, state (i) the independent variable or variables, and, where appropriate, their levels; (ii) the dependent variable(s); and (iii) which category of research design is being used:

I—Scale independent variable(s) and scale dependent variable

II—Nominal independent variable(s) and scale dependent variable

III—Only nominal variables

Explain your answer to part (iii).

  1. The researchers found that students’ ratings of their professors were predictive of grades in the class for which the professor was evaluated.

  2. The researchers also found that students’ ratings of their professors were not predictive of grades for other, related future classes. (The researchers stated that these first two findings suggest that student ratings of professors are tied to their current grades but not to learning—which would affect future grades.)

  3. The researchers found that male professors received statistically significantly higher student ratings, on average, than did female professors.

  4. The researchers reported, however, that average levels of students’ learning (as assessed by grades in related future classes) were not statistically significantly different for those who had male professors or those who had female professors.

  5. The researchers might have been interested in whether there were proportionally more female professors teaching upper-level than lower-level courses and proportionally more male professors teaching lower-level than upper-level courses (perhaps a reason for the lower average ratings of female professors).

  6. The researchers found no statistically significant differences in average student evaluations among non-tenure-track lecturers, graduate student teaching associates, and tenure-track faculty members.

Question 15.51

15.51

Grade inflation and types of variables: A New York Times article on grade inflation reported several findings related to a tendency for average grades to rise over the years and a tendency for the top-ranked institutions to give the highest average grades (Archibold, 1998). For each of the findings outlined below, state (i) the independent variable or variables, and, where appropriate, their levels; (ii) the dependent variable(s); and (iii) which category of research design is being used:

I—Scale independent variable(s) and scale dependent variable

II—Nominal independent variable(s) and scale dependent variable

III—Only nominal variables

Explain your answer to part (iii).

  1. In 1969, seven percent of all grades were A’s; in 1994, twenty-five percent of all grades were A’s.

  2. The average GPA for the graduating students of elite schools is 3.2, the average GPA for graduating students at selective schools (the level below elite schools) is 3.04, and the average GPA for graduating students at state colleges is 2.95.

  3. At Dartmouth College, an elite university, SAT scores of incoming students have increased along with their subsequent college GPAs (perhaps an explanation for grade inflation).

Question 15.52

15.52

High school academic performance and types of variables: Here are three ways to assess one’s performance in high school: (1) GPA at graduation, (2) whether one graduated with honors (as indicated by graduating with a GPA of at least 3.5), and (3) class rank at graduation. For example, Abdul had a 3.98 GPA, graduated with honors, and was ranked 10th in his class.

  1. Which of these variables could be considered a nominal variable? Explain.

    471

  2. Which of these variables is most clearly an ordinal variable? Explain.

  3. Which of these variables is a scale variable? Explain.

  4. Which of these variables gives us the most information about Abdul’s performance?

  5. If we were to use one of these variables in an analysis, which variable (as the dependent variable) would lead to the lowest chance of a Type II error? Explain why.

Question 15.53

15.53

Immigration, crime, and research design: “Do Immigrants Make Us Safer?” asked the title of a New York Times Magazine article (Press, 2006). The article reported findings from several U.S.-based studies, including several conducted by Harvard sociologist Robert Sampson in Chicago. For each of the following findings, draw the table of cells that would comprise the research design. Include the labels for each row and column.

  1. Mexican immigrants were more likely to be married (versus single) than either blacks or whites.

  2. People living in immigrant neighborhoods were 15% less likely than were people living in nonimmigrant neighborhoods to commit crimes. This finding was true among both those living in households headed by a married couple and those living in households not headed by a married couple.

  3. The crime rate was higher among second-generation than among first-generation immigrants; moreover, the crime rate was higher among third-generation than among second-generation immigrants.

Question 15.54

15.54

Sex selection and hypothesis testing: Across all of India, there are only 933 girls for every 1000 boys (Lloyd, 2006), evidence of a bias that leads many parents to illegally select for boys or to kill their infant girls. (Note that this translates into a proportion of girls of 0.483.) In Punjab, a region of India in which residents tend to be more educated than in other regions, there are only 798 girls for every 1000 boys. Assume that you are a researcher interested in whether sex selection is more or less prevalent in educated regions of India, and that 1798 children from Punjab constitute the entire sample. (Hint: You will use the proportions from the national database for comparison.)

  1. How many variables are there in this study? What are the levels of any variable you identified?

  2. Which hypothesis test would be used to analyze these data? Justify your answer.

  3. Conduct the six steps of hypothesis testing for this example. (Note: Be sure to use the correct proportions for the expected values, not the actual numbers for the population.)

  4. Report the statistics as you would in a journal article.

Question 15.55

15.55

Gender, op-ed writers, and hypothesis testing: Richards (2006) reported data from a study by the American Prospect on the genders of op-ed writers who addressed the topic of abortion in the New York Times. Over a 2-year period, the American Prospect counted 124 articles that discussed abortion (from a wide range of political and ideological perspectives). Of these, just 21 were written by women.

  1. How many variables are there in this study? What are the levels of any variable you identified?

  2. Which hypothesis test would be used to analyze these data? Justify your answer.

  3. Conduct the six steps of hypothesis testing for this example.

  4. Report the statistics as you would in a journal article.

Question 15.56

15.56

Romantic music, behavior, chi square, and effect size: Guéguen, Jacob, and Lamy (2010) investigated whether exposure to romantic music affects dating behavior. The participants, young, single French women, waited for the experiment to start in a room in which songs with either romantic lyrics or neutral lyrics were playing. After a few minutes, each woman who participated completed a marketing survey administered by a young male confederate. During a break, the confederate asked the participant for her phone number. Of the women who listened to romantic music, 52.2% (23 out of 44) gave him her phone number, whereas 27.9% (12 out of 43) of the women who listened to neutral music did so. The researchers conducted a chi-square test for independence, and found the following results: (χ2 (1, N = 83) = 5.37, p < .02).

  1. Calculate Cramér’s V. What size effect is this?

  2. Calculate the relative likelihood of providing her phone number for women listening to romantic music versus neutral music. Explain what we learn from this relative likelihood.

Question 15.57

15.57

The General Social Survey, an exciting life, and relative risk: In How It Works 15.2, we walked through a chi-square test for independence using two items from the General Social Survey (GSS)—LIFE and MOBILE16. Use these data to answer the following questions.

  1. Construct a table that shows only the appropriate conditional proportions for this example. For example, the percentage of people who find life exciting, given that they live in the same city, is 42.4. The proportion, therefore, is 0.424.

  2. Construct a graph that displays these conditional proportions.

  3. Calculate the relative risk (or relative likelihood) of finding life exciting if one lives in a different state compared to if one lives in the same city as one did at 16.

Question 15.58

15.58

472

Gender, ESPN, and chi square: Many of the numbers we see in the news could be analyzed with chi square. The feminist blog Culturally Disoriented examined the photos in the 2012 “body issue” of ESPN The Magazine—the publication’s annual spread of photographs of nude athletes. The blogger reported: “Female athlete after female athlete was photographed not as a talented, powerful sportswoman, but as. . .eye candy” (http://culturallydisoriented.wordpress.com/2012/07/12/the-bodies-we-want-female-athletes-in-espn-magazines-body-issue/). The blogger reported that there were 19 photos of male athletes and 17 of female athletes. Of these, 15 of the men were in active poses and 9 of the women were in active poses. Active poses were typically those in which they were engaged in their sport, whereas the passive photos looked more like a modeling shoot—“where they’re just looking hot for the camera,” in the words of Culturally Disoriented.

  1. Why is a chi-square statistical analysis a good choice for these data? Which kind of chi-square test should you use? Explain your answer.

  2. Conduct the six steps of hypothesis testing for this example.

  3. Calculate an effect size. Explain why there might be a fairly substantial effect size even though we were not able to reject the null hypothesis.

  4. Report the statistics as you would in a journal article.

  5. Construct a table that shows only the appropriate conditional proportions for this example—that is, the proportion of people in active poses given that they are men or women.

  6. Construct a graph that displays these conditional proportions.

  7. Calculate the relative likelihood of being photographed in an active pose if you are a male athlete compared to if you are a female athlete.

Question 15.59

15.59

Premarital doubts, divorce, and chi square: In an article titled “Do Cold Feet Warn of Trouble Ahead?”, researchers studied 464 married heterosexual spouses to determine whether or not doubts before marriage were predictive of marital troubles, and divorce, later on (Lavner, Karney, & Bradbury, 2012). The following is an excerpt from the results section of their paper: “For husbands, 9% of those who reported not having premarital doubts divorced by four years (n = 10 of 117) compared with 14% of those who did report premarital doubts (n = 15 of 106); these groups did not differ significantly, χ2 (1, n = 223) = 1.76, p > .10. Among wives, 8% of those who reported not having premarital doubts divorced by four years (n = 11 of 141) compared with 19% of those who did report premarital doubts (n = 16 of 84). Chi-square analyses indicated that these rates differed significantly, χ2 (1, n = 225) = 6.31, p < .05.”

  1. What are the variables in this study, and what are their levels?

  2. Explain why the researchers were able to use chi-square tests. Which kind of chi-square tests did they use?

  3. What changes would the APA want to see in the reporting of these results?

  4. Explain in your own words what the researchers found.

Question 15.60

15.60

University students, cell phone bills, and ordinal data: Here are some monthly cell phone bills, in dollars, for university students:

100 60 35 50 50 50 60 65
0 75 100 55 50 40 80
200 30 50 108 500 100 45
40 45 50 40 40 100 80
  1. Convert these data from scale to ordinal. (Don’t forget to put them in order first.) What happens to an outlier when you convert these data to ordinal?

  2. What approximate shape would the distribution of these data take? Would they likely be normally distributed? Explain why the distribution of ordinal data is never normal.

  3. Why does it not matter if the ordinal variable is normally distributed? (Hint: Think about what kind of hypothesis test you would conduct.)

Question 15.61

15.61

World cities, livability, and nonparametric hypothesis tests: CNN.com reported on a 2012 study that ranked the world’s cities in terms of how livable they are (http://travel.cnn.com/explorations/life/worlds-most-livable-city-525619), using a range of criteria related to stability, health care, culture and environment, education, and infrastructure. The top 10, in order, were: Melbourne, Australia; Vienna, Austria; Vancouver, Toronto, and Calgary, all in Canada; Adelaide and Sydney, both in Australia; Helsinki, Finland; Perth, Australia; and Auckland, New Zealand. For each of the following research questions, state which nonparametric hypothesis test is appropriate: the Spearman rank-order correlation coefficient, the Wilcoxon signed-rank test, the Mann–Whitney U test, or the Kruskal–Wallis H test. Explain your answers and indicate the equivalent parametric test.

  1. Which cities tend to receive higher rankings—those north of the equator or those south of the equator?

    473

  2. Did the top 10 cities tend to change rank relative to their position in the previous study?

  3. Are the livability rankings related to a city’s economic status?

  4. On which continent do cities tend to have the highest rankings?

Question 15.62

15.62

Fantasy baseball and the Spearman correlation coefficient: In fantasy baseball, groups of 12 league participants conduct a draft in which they can “buy” any baseball players from any teams across one of the two Major League Baseball (MLB) leagues (the American League and the National League). These makeshift teams are compared on the basis of the combined statistics of the individual baseball players. For example, statistics about home runs are transformed into points, and each fantasy team receives a total score of all combined points based on its baseball players, regardless of their real-life team. Many in the fantasy and real-life baseball worlds have wondered how success in fantasy leagues maps onto real MLB teams’ success in terms of winning baseball games. Walker (2006) compared the fantasy league performances of the players for each American League team with their actual American League finishes for the 2004 season, the year the Boston Red Sox broke the legendary “curse” against them and won the World Series. The data, sorted from highest to lowest fantasy league score, are shown in the accompanying table.

Team Fantasy League Points Actual American League Finish
Boston 117.5 2
New York 109.5 1
Anaheim 108 3.5
Minnesota 97 3.5
Texas 85 6
Chicago 80 7
Cleveland 79 8
Oakland 77 5
Baltimore 74.5 9
Detroit 68.5 10
Seattle 51 13
Tampa Bay 47.5 11
Toronto 35.5 12
Kansas City 20 14
  1. What are the two variables of interest? For each variable, state whether it’s scale or ordinal.

  2. Calculate the Spearman correlation coefficient for these two variables. Remember to convert any scale variables to ranks.

  3. What does the coefficient tell us about the relation between these two variables?

  4. Why couldn’t we calculate a Pearson correlation coefficient for these data?

Question 15.63

15.63

Test-taking speed, grade, and the Spearman correlation coefficient: Does speed in completing a test correlate with one’s grade? Here are test scores for eight students in one of our statistics classes. They are arranged in order from the student who turned in the test first to the student who turned in the test last.

98 74 87 92 88 93 62 67
  1. What are the two variables of interest? For each variable, state whether it’s scale or ordinal.

  2. Calculate the Spearman correlation coefficient for these two variables. Remember to convert any scale variables to ranks.

  3. What does the coefficient tell us about the relation between these two variables?

  4. Why couldn’t we calculate a Pearson correlation coefficient for these data?

  5. Does this Spearman correlation coefficient suggest that students should take their tests as quickly as possible? That is, does it indicate that taking the test quickly causes a good grade? Explain your answer.

  6. What third variables might be responsible for this correlation? That is, what third variables might cause both speedy test taking and a good test grade?

  7. Imagine that each of the following numbers represents the Spearman correlation coefficient that quantifies the relation between test grade and speed in taking the test:

1.00, −0.001, 0.52, −0.27, −0.98, 0.09

Specify which of these coefficients suggests the strongest relation between the two variables as well as which coefficient suggests the weakest relation between the two variables.

Question 15.64

15.64

Nonparametric tests and a longitudinal study of mathematically gifted teenagers: Researchers studied male and female teenagers who had scored in the top 1% on mathematical reasoning measures (Lubinski, Benbow, & Kell, 2014). The researchers were able to follow the participants’ progression over the next 40 years. They tracked two groups, or cohorts, of participants on a number of variables, including career-related variables. They reported “pronounced and significant sex differences in the percentage of participants who were working full time (Cohort 1: 89% of males and 69% of females; Cohort 2: 90% of males and 59% of females), χ2(1, N = 1,131) = 67.56, p < .001, and χ2(1, N = 481) = 68.55, p < .001.” So, the researchers looked at income only among those who were employed full time. For example, for one cohort they reported median incomes of $150,000 for men and $101,000 for women. This gender difference was statistically significant for both groups: “Mann–Whitney U test, zs ≥ 5.09, ps < .001.”

474

  1. Explain why the researchers conducted a chi-square test for their data on whether participants were working full time.

  2. Explain the finding on full-time work in your own words.

  3. Explain why the researchers conducted a Mann–Whitney U test for their data on incomes.

  4. Explain the finding on incomes in your own words.

  5. Why did the researchers report median income rather than mean income?

Question 15.65

15.65

Public versus private universities and the Mann–Whitney U test: Do public or private universities tend to have better sociology graduate programs? U.S. News & World Report publishes online rankings of graduate schools across a range of disciplines. The table below includes the Report’s 2013 list of the top 19 doctoral programs in sociology and notes whether the schools are public or private. Schools listed at the same rank are tied.

  1. What is the independent variable, and what are its levels? What is the dependent variable?

  2. Is this a between-groups or within-groups design? Explain.

  3. Why do we have to use a nonparametric hypothesis test for these data?

  4. Conduct all six steps of hypothesis testing for a Mann–Whitney U test.

  5. How would you present these statistics in a journal article?

University Rank Type of School Public Rank Private Rank
Princeton University 2 Private 2
University of California, Berkeley 2 Public 2
University of Wisconsin, Madison 2 Public 2
Stanford University 4.5 Private 4.5
University of Michigan, Ann Arbor 4.5 Public 4.5
Harvard University 7 Private 7
University of Chicago 7 Private 7
University of North Carolina, Chapel Hill 7 Public 7
University of California, Los Angeles 9 Public 9
Northwestern University 10.5 Private 10.5
University of Pennsylvania 10.5 Private 10.5
Columbia University 12.5 Private 12.5
Indiana University, Bloomington 12.5 Public 12.5
Duke University 14.5 Private 14.5
University of Texas, Austin 14.5 Public 14.5
New York University 16 Private 16
Cornell University 18 Private 18
Ohio State University 18 Public 18
Pennsylvania State University, University Park 18 Public 18

Question 15.66

15.66

Gender, aggression, and the interpretation of a Mann–Whitney U test: Spanish researchers examining aggression in children’s dreams reported the following: “Using the Mann–Whitney nonparametrical statistical test on the gender differences, we found a significant difference between boys and girls in Group 1 for overall [aggression] (U = 44.00, p = 0.004) and received aggression (U = 48.00, p = 0.005). So, in their dreams, younger boys not only had a higher level of general aggression but also received more severe aggressive acts than girls of the same age” (emphasis in original) (Oberst, Charles, & Chamarro, 2005, p. 175).

  1. What is the independent variable, and what are its levels? What is the dependent variable?

    475

  2. Is this a between-groups or within-groups design?

  3. Which hypothesis test did the researchers conduct? Why might they have chosen a nonparametric test? Why do you think they chose this particular nonparametric test?

  4. In your own words, describe what they found.

  5. Can we conclude that gender caused a difference in levels of aggression in dreams? Explain. Provide at least two reasons why gender might not cause certain levels of aggression in dreams even though these variables are associated.

Question 15.67

15.67

Cell phone bill, hours spent studying, and the shapes of distributions: The following figures display data that depict the relation between students’ monthly cell phone bills and the number of hours they report that they study per week.

  1. What does the accompanying scatterplot suggest about the shape of the distribution for hours studied per week? What does it suggest about the shape of the distribution for a monthly cell phone bill?

    image
  2. What does the accompanying grouped frequency histogram suggest about the shape of the distribution for a monthly cell phone bill?

    image
  3. Is it a good idea to use a parametric hypothesis test for these data? Explain.

Putting It All Together

Question 15.68

15.68

Stroke patients, treatment, and type of nonparametric test: A common situation faced by researchers working with special populations, such as neurologically impaired people or people with less common psychiatric conditions, is that the studies often have small sample sizes due to the relatively few number of patients. As a result, these researchers often turn to nonparametric statistical tests. For each of the following research descriptions, state which nonparametric hypothesis test is most appropriate: the Spearman rank-order correlation coefficient, the Wilcoxon signed-rank test, the Mann–Whitney U test, or the Kruskal–Wallis H test. Explain your answers.

  1. People who have had a stroke often have whole or partial paralysis on the side of their body opposite the side of the brain damage. Leung, Ng, and Fong (2009) were interested in the effects of a treatment program for constrained movement on the recovery from paralysis. They compared the arm-movement ability of eight stroke patients before and after the treatment.

  2. Leung and colleagues (2009) were also interested in whether the amount of improvement after the therapy was related to the number of months that had passed since the patient experienced the stroke.

  3. Five of Leung and colleagues’ (2009) patients were male and three were female. We could ask whether post-treatment movement performance was different between men and women.

  4. For parts (a), (b), and (c), what tests would you use if you had 180 patients instead of 8? Explain your answers.

  5. For parts (a), (b), and (c), what was the dependent variable in each case, and how might the researchers have operationalized this variable?

  6. The researchers used a treatment program to help patients recover from a stroke. If the researchers had enough patients to randomly assign some to treatment and others to no treatment, would they have been able to have either a blind or a double-blind design? Explain your answer.

Question 15.69

15.69

Gender bias, poor growth, and hypothesis testing: Grimberg, Kutikov, and Cucchiara (2005) wondered whether gender biases were evident in referrals of children for poor growth. They believed that boys were more likely to be referred even when there was no problem—which is bad for boys because families of short boys might falsely view their height as a medical problem. They also believed that girls were less likely to be referred even when there was a problem—which is bad for girls because real problems might not be diagnosed and treated. They studied all new patients at the Children’s Hospital of Philadelphia Diagnostic and Research Growth Center who were referred for potential problems related to short stature. Of the 182 boys who were referred, 27 had an underlying medical problem, 86 did not but were below norms for their age, and 69 were of normal height according to growth charts. Of the 96 girls who were referred, 39 had an underlying medical problem, 38 did not but were below norms for their age, and 19 were of normal height according to growth charts.

476

  1. How many variables are there in this study? What are the levels of any variable you identified?

  2. Which hypothesis test would be used to analyze these data? Justify your answer.

  3. Conduct the six steps of hypothesis testing for this example.

  4. Calculate the appropriate measure of effect size. According to Cohen’s conventions, what size effect is this?

  5. Report the statistics as you would in a journal article.

  6. Draw a table that includes the conditional proportions for boys and for girls.

  7. Create a graph with bars showing the proportions for all six conditions.

  8. Among only children who are below height norms, calculate the relative risk of having an underlying medical condition if one is a boy as opposed to a girl. Show your calculations.

  9. Explain what we learn from this relative risk.

  10. Now calculate the relative risk of having an underlying medical condition if one is a girl. Show your calculations.

  11. Explain what we learn from this relative risk.

  12. Explain how the calculations in parts (h) and (j) provide us with the same information in two different ways.

Question 15.70

15.70

The prisoner’s dilemma, cross-cultural research, and hypothesis testing: In a classic prisoner’s dilemma game with money for prizes, players who cooperate with each other both earn good prizes. If, however, your opposing player cooperates but you do not (the term used is defect), you receive an even bigger payout and your opponent receives nothing. If you cooperate but your opposing player defects, he or she receives that bigger payout and you receive nothing. If you both defect, you each get a small prize. Because of this, most players of such games choose to defect, knowing that if they cooperate but their partners don’t, they won’t win anything. The strategies of U.S. and Chinese students were compared. The researchers hypothesized that those from the market economy (United States) would cooperate less (i.e., would defect more often) than would those from the nonmarket economy (China).

Defect Cooperate
China 31 36
United States 41 14
  1. How many variables are there in this study? What are the levels of any variables you identified?

  2. Which hypothesis test would be used to analyze these data? Justify your answer.

  3. Conduct the six steps of hypothesis testing for this example, using the above data.

  4. Calculate the appropriate measure of effect size. According to Cohen’s conventions, what size effect is this?

  5. Report the statistics as you would in a journal article.

  6. Draw a table that includes the conditional proportions for participants from China and from the United States.

  7. Create a graph with bars showing the proportions for all four conditions.

  8. Create a graph with two bars showing just the proportions for the defections for each country.

  9. Calculate the relative risk (or relative likelihood) of defecting, given that one is from China versus the United States. Show your calculations.

  10. Explain what we learn from this relative risk.

  11. Now calculate the relative risk of defecting, given that one is from the United States versus China. Show your calculations.

  12. Explain what we learn from this relative risk.

  13. Explain how the calculations in parts (i) and (k) provide us with the same information in two different ways.