EXAMPLE 22 Using rank correlation to detect a nonlinear pattern
scrabble
In the Chapter 13 Case Study, “How Fair Is the Scoring in Scrabble?” we noted from a scatterplot of Scrabble point values versus English-language letter frequencies that the relationship between the variables was not linear. Thus, we could not use a linear regression analysis. However, we can use rank correlation to test whether the two variables are associated, because linearity is not a condition for applying the rank correlation test.
For the following random sample of letters, use the rank correlation test to investigate whether an association exists between English-language frequencies and Scrabble point values. Use level of significance . Note from Figure 24 that the relationship between the variables is certainly nonlinear.
14-49
Letter | Frequency | Scrabble points |
---|---|---|
Q | 0.003 | 10 |
L | 0.035 | 1 |
G | 0.016 | 2 |
E | 0.130 | 1 |
X | 0.005 | 8 |
T | 0.093 | 1 |
S | 0.063 | 1 |
K | 0.003 | 5 |
Solution
The data come from a random sample, so we may proceed with the hypothesis test.
Letter | Frequency | Scrabble points |
Frequency rank |
Scrabble rank |
Difference |
|
---|---|---|---|---|---|---|
Q | 0.003 | 10 | 1.5 | 8 | −6.5 | 42.25 |
L | 0.035 | 1 | 5 | 2.5 | 2.5 | 6.25 |
G | 0.016 | 2 | 4 | 5 | −1 | 1 |
E | 0.130 | 1 | 8 | 2.5 | 5.5 | 30.25 |
X | 0.005 | 8 | 3 | 7 | −4 | 16 |
T | 0.093 | 1 | 7 | 2.5 | 4.5 | 20.25 |
S | 0.063 | 1 | 6 | 2.5 | 3.5 | 12.25 |
K | 0.003 | 5 | 1.5 | 6 | −4.5 | 20.25 |
There are letters in the sample, so the value of the test statistic is given by