Clarifying the Concepts
What is a percentile?
When we look up a z score on the z table, what information can we report?
How do we calculate the percentage of scores below a particular positive z score?
How is calculating a percentile for a mean from a distribution of means different from doing so for a score from a distribution of scores?
184
In statistics, what do we mean by assumptions?
What sample size is recommended in order to meet the assumption of a normal distribution of means, even when the underlying population of scores is not normal?
What is the difference between parametric tests and nonparametric tests?
What are the six steps of hypothesis testing?
What are critical values and the critical region?
What is the standard size of the critical region used by most statisticians?
What does statistically significant mean to statisticians?
What do these symbolic expressions mean: H0: μ1 = μ2 and H1: μ1 ≠ μ2?
Using everyday language rather than statistical language, explain why the words critical region might have been chosen to define the area in which a z statistic must fall in order for a researcher to reject the null hypothesis.
Using everyday language rather than statistical language, explain why the word cutoff might have been chosen to define the point beyond which we reject the null hypothesis.
What is the difference between a one-
Why do researchers typically use a two-
Write the symbols for the null hypothesis and research hypothesis for a one-
What are three kinds of dirty data and what are their possible sources?
What are three ways to deal with missing data?
How can data that are misleading result in missing data?
Calculating the Statistic
Calculate the following percentages for a z score of 0.74, with a tail of 22.96%:
What percentage of scores falls below this z score?
What percentage of scores falls between the mean and this z score?
What proportion of scores falls below a z score of −0.74?
Using the z table in Appendix B, calculate the following percentages for a z score of −0.08:
Above this z score
Below this z score
At least as extreme as this z score
Using the z table in Appendix B, calculate the following percentages for a z score of 1.71:
Above this z score
Below this z score
At least as extreme as this z score
Rewrite each of the following percentages as probabilities, or p levels:
5%
83%
51%
Rewrite each of the following probabilities, or p levels, as percentages:
0.19
0.04
0.92
If the critical values for a hypothesis test occur where 2.5% of the distribution is in each tail, what are the cutoffs for z?
For each of the following p levels, what percentage of the data will be in each critical region for a two-
0.05
0.10
0.01
State the percentage of scores in a one-
0.05
0.10
0.01
You are conducting a z test on a sample of 50 people with an average SAT verbal score of 542 (assume we know the population mean to be 500 and the standard deviation to be 100). Calculate the mean and the spread of the comparison distribution (μM and σM).
You are conducting a z test on a sample of 132 people for whom you observed a mean SAT verbal score of 490. The population mean is 500, and the standard deviation is 100. Calculate the mean and the spread of the comparison distribution (μM and σM).
If the cutoffs for a z test are −1.96 and 1.96, determine whether you would reject or fail to reject the null hypothesis in each of the following cases:
z = 1.06
z = −2.06
A z score beyond which 7% of the data fall in each tail
If the cutoffs for a z test are −2.58 and 2.58, determine whether you would reject or fail to reject the null hypothesis in each of the following cases:
z = −0.94
z = 2.12
A z score for which 49.6% of the data fall between z and the mean
185
Use the cutoffs of −1.65 and 1.65 and a p level of approximately 0.10, or 10%. For each of the following values, determine whether you would reject or fail to reject the null hypothesis:
z = 0.95
z = −1.77
A z statistic that 2% of the scores fall above
You are conducting a z test on a sample for which you observe a mean weight of 150 pounds. The population mean is 160, and the standard deviation is 100.
Calculate a z statistic for a sample of 30 people.
Repeat part (a) for a sample of 300 people.
Repeat part (a) for a sample of 3000 people.
For each of the following, indicate whether or not the situation describes misleading data that the researcher may decide to investigate and potentially discard.
A sample of 50 students rate their agreement with 100 statements designed to assess their political attitudes. The rating scale goes from 1 (definitely disagree) to 7 (definitely agree). One participant provides a response of 1 to all 100 statements.
A researcher measures the time it takes participants to hit a button upon hearing a warning signal. In her sample of 34 participants, she finds that the mean response time is 413 milliseconds (ms) with a standard deviation of 30 ms. One participant has a response time of 420 ms.
A researcher measures the time it takes participants to hit a button upon hearing a warning signal. In previous studies, she found that the mean response time is 413 ms with a standard deviation of 30 ms. In the current study, one participant had a response time of 1220 ms, which drives up the overall mean of the sample.
Assume that the following set of data represents the responses of 10 participants to three similar statements. The participants rated their agreement with each statement on a scale from 1 to 7.
Participant | S1 | S2 | S3 |
---|---|---|---|
1 | 2 | 3 | 2 |
2 | 6 | 7 | 3 |
3 | 3 | 2 | 5 |
4 | 7 | 6 | 5 |
5 | 2 | 3 | 3 |
6 | 5 | 5 | 6 |
7 | 9 | 5 | 4 |
8 | 2 | 3 | 7 |
9 | 6 | 7 | 7 |
10 | 3 | 6 | 5 |
There is a piece of dirty data in this data set. Identify it and explain why it is dirty.
Assume that you have decided to throw out the piece of dirty data you identified in part (a) and replace it with the mean for that variable. What is the new data point?
Assume that you have decided to throw out the piece of dirty data you identified in part (a) and replace it with the mean of that participant’s responses. What is the new data point?
Applying the Concepts
Percentiles and unemployment rates: The U.S. Bureau of Labor Statistics’ annual report published in 2011 provided adjusted unemployment rates for 10 countries. The mean was 7, and the standard deviation was 1.85. For the following calculations, treat these as the population mean and standard deviation.
Australia’s unemployment rate was 5.4. Calculate the percentile for Australia—
The United Kingdom’s unemployment rate was 8.5. Calculate its percentile—
The unemployment rate in the United States was 8.9. Calculate its percentile—
The unemployment rate in Canada was 6.5. Calculate its percentile—
Height and the z distribution: Elena, a 15-
Calculate Elena’s z score.
What percentage of girls are taller than Elena?
What percentage of girls are shorter?
How much would Elena have to grow to be perfectly average?
If Sarah is in the 75th percentile for height at age 15, how tall is she?
How much would Elena have to grow in order to be at the 75th percentile with Sarah?
Height and the z distribution: Kona, a 15-
Calculate Kona’s z score.
What is Kona’s percentile score for height?
What percentage of boys this age is shorter than Kona?
What percentage of heights is at least as extreme as Kona’s, in either direction?
If Ian is in the 30th percentile for height as a 15-
186
Height and the z statistic: Imagine a class of thirty-
Calculate the z statistic.
How does this sample of girls compare to the distribution of sample means?
What is the percentile rank for this sample?
Height and the z statistic: Imagine a basketball team comprised of thirteen 15-
Calculate the z statistic.
How does this sample of boys compare to the distribution of sample means?
What is the percentile rank for this sample?
The z distribution and statistics test scores: Imagine that your statistics professor lost all records of students’ raw scores on a recent test. However, she did record students’ z scores for the test, as well as the class average of 41 out of 50 points and the standard deviation of 3 points (treat these as population parameters). She informs you that your z score was 1.10.
What was your percentile score on this test?
Using what you know about z scores and percentiles, how did you do on this test?
What was your original test score?
The z statistic, distributions of means, and height: Using what we know about the height of 15-
Calculate the mean and the standard error of the distribution of mean heights.
Calculate the z statistic for this group.
What percentage of mean heights, based on a sample size of 14 students, would we expect to be shorter than this group?
How often do mean heights equal to or more extreme than this size occur in this population?
If statisticians define sample means that occur less than 5% of the time as “special” or rare, what would you say about this result?
The z statistic, distributions of means, and height: Another teacher decides to average the height of all 15-
Calculate the mean and the standard error of the distribution of mean heights.
Calculate the z statistic for this group.
What percentage of groups of people would we expect to have mean heights, based on samples of this size (57), taller than this group?
How often do mean heights equal to or more extreme than 68.1 occur in this population?
How does this result compare to the statistical significance cutoff of 5%?
Directional versus nondirectional hypotheses: For each of the following examples, identify whether the research has expressed a directional or a nondirectional hypothesis:
A researcher is interested in studying the relation between the use of antibacterial products and the dryness of people’s skin. He thinks these products might alter the moisture in skin differently from other products that are not antibacterial.
A student wonders if grades in a class are in any way related to where a student sits in the classroom. In particular, do students who sit in the front row get better grades, on average, than the general population of students?
Cell phones are everywhere, and we are now available by phone almost all of the time. Does this translate into a change in the closeness of our long-
Null hypotheses and research hypotheses: For each of the following examples (the same as those in Exercise 7.45), state the null hypothesis and the research hypothesis, in both words and symbolic notation:
A researcher is interested in studying the relation between the use of antibacterial products and the dryness of people’s skin. He thinks these products might alter the moisture in skin differently from other products that are not antibacterial.
A student wonders if grades in a class are in any way related to where a student sits in the classroom. In particular, do students who sit in the front row get better grades, on average, than the general population of students?
Cell phones are everywhere, and we are now available by phone almost all of the time. Does this translate into a change in the closeness of our long-
The z distribution and Hurricane Katrina: Hurricane Katrina hit New Orleans on August 29, 2005. The National Weather Service Forecast Office maintains online archives of climate data for all U.S. cities and areas. These archives allow us to find out, for example, how the rainfall in New Orleans that August compared to that in the other months of 2005. The table below shows the National Weather Service data (rainfall in inches) for New Orleans in 2005.
187
January | 4.41 |
February | 8.24 |
March | 4.69 |
April | 3.31 |
May | 4.07 |
June | 2.52 |
July | 10.65 |
August | 3.77 |
September | 4.07 |
October | 0.04 |
November | 0.75 |
December | 3.32 |
Calculate the z score for August. (Note: These are raw data for the population, rather than summaries, so you have to calculate the mean and the standard deviation first.)
What is the percentile for the rainfall in August? Does this surprise you? Explain.
When results surprise us, it is worthwhile to examine individual data points more closely or even to go beyond the data. The daily climate data as listed by this source for August 2005 shows the code “M” next to August 29, 30, and 31 for all climate statistics. The code says: “[REMARKS] ALL DATA MISSING AUGUST 29, 30, AND 31 DUE TO HURRICANE KATRINA.” Pretend you were hired as a consultant to determine the percentile for that August. Write a brief paragraph for your report, explaining why the data you generated are likely to be inaccurate.
What raw scores mark the cutoff for the top and bottom 10% for these data? Based on these scores, which months had extreme data for 2005? Why should we not trust these data?
Percentiles and IQ scores: IQ scores are designed to have a mean of 100 and a standard deviation of 15. IQ testing is one way in which people are categorized as having different levels of mental disability; there are four levels of mental retardation between the IQ scores of 0 and 70.
People with IQ scores of 20–
People with IQ scores of 50–
A person has an IQ score of 66. What is her percentile?
A person falls at the 3rd percentile. What is his IQ score? Would he be classified as having a mental disability?
Step 1 of hypothesis testing for a study of the Wechsler Adult Intelligence Scale: Boone (1992) examined scores on the Wechsler Adult Intelligence Scale-
What are the two populations?
What would the comparison distribution be? Explain.
What hypothesis test would you use? Explain.
Check the assumptions for this hypothesis test. Label your answers (1) through (3).
What does Boone mean when he says significantly?
Step 2 of hypothesis testing for a study of the Wechsler Adult Intelligence Scale: Refer to the scenario described in Exercise 7.49.
State the null and research hypotheses for a two-
Imagine that you wanted to replicate this study. Based on the findings described in Exercise 7.49, state the null and research hypotheses for a one-
Step 1 of hypothesis testing for a study of college football: Let’s consider whether U.S. college football teams are more likely or less likely to be mismatched in the upper National Collegiate Athletic Association (NCAA) divisions. Overall, the 53 Football Bowl Subdivision (FBS) games (formerly Division I-
188
List the independent variable and the dependent variable in this example.
Did we use random selection? Explain.
Identify the populations of interest in this example.
State the comparison distribution.
Check the assumptions for this test.
Step 2 of hypothesis testing for a study of college football: Refer to Exercise 7.51.
State the null hypothesis and the research hypothesis for a two-
One of our students hypothesized that the spread would be bigger among the FCS teams because “some of them are really bad and would get crushed.” State the one-
Steps 3 through 6 of hypothesis testing for a study of college football: Refer to Exercise 7.51. Remember, the population mean is 16.189, with a standard deviation of 12.128. The results for the four FCS Patriot League games are as follows:
Holy Cross, 27/Bucknell, 10
Lehigh, 23/Colgate, 15
Lafayette, 31/Fordham, 24
Georgetown, 24/Marist, 21
Conduct steps 3 through 6 of hypothesis testing. (You already conducted steps 1 and 2 in Exercises 7.51(e) and 7.52(a), respectively.)
Would you be willing to generalize these findings beyond the sample? Explain.
Putting It All Together
The Graded Naming Test and sociocultural differences: Researchers often use z tests to compare their samples to known population norms. The Graded Naming Test (GNT) asks respondents to name objects in a set of 30 black-
Conduct all six steps of a z test. Be sure to label all six steps.
Some words on the GNT are more commonly used in England. For example, a mitre, the head-
When we conduct a one-
Under which circumstance—
If it becomes easier to reject the null hypothesis under one type of test (one-
When we change the p level that we use as a cutoff, there is a small change in step 4 of hypothesis testing. Although 0.05 is the most commonly used p level, other values, such as 0.01, are often used. For this example, conduct steps 4 and 6 of hypothesis testing for a two-
With which p level—
If it is easier to reject the null hypothesis with certain p levels, does this mean that there is a bigger difference between the samples with one p level versus the other p level? Explain.
Patient adherence and orthodontics: A research report (Behenam & Pooya, 2007) begins, “There is probably no other area of health care that requires…cooperation to the extent that orthodontics does,” and explores factors that affected the number of hours per day that Iranian patients wore their orthodontic appliances. The patients in the study reported that they used their appliances, on average, 14.78 hours per day, with a standard deviation of 5.31. We’ll treat this group as the population for the purposes of this example. Let’s say a researcher wanted to study whether a DVD with information about orthodontics led to an increase in the amount of time patients wore their appliances, but decided to use a two-
189
What is the independent variable? What is the dependent variable?
Did the researcher use random selection to choose his sample? Explain your answer.
Conduct all six steps of hypothesis testing. Be sure to label all six steps.
If the researcher’s decision in step 6 were wrong, what type of error would he have made? Explain your answer.
Radiation levels on Japanese farms: Fackler (2012) reported in the New York Times that Japanese farmers have become skeptical of the Japanese government’s assurances that radiation levels were within legal limits in the wake of the 2011 tsunami and radiation disaster at Fukushima. After reports of safe levels in Onami, more than 12 concerned farmers tested their crops and found dangerously high levels of cesium.
If the farmers wanted to conduct a z test comparing their results to the cesium levels found in areas that had not been exposed to the radiation, what would their sample be? Be specific.
Conduct step 1 of hypothesis testing.
Conduct step 2 of hypothesis testing.
Conduct step 4 of hypothesis testing for a two-
Imagine that the farmers calculated a z statistic of 3.2 for their sample. Conduct step 6 of hypothesis testing.
If the farmers’ conclusions were incorrect, what type of error would they have made? Explain your answer.
You have conducted a study with 120 participants (60 female, 60 male) about the relation between attitudes toward cohabitation before marriage (on a 30-
What are the possible causes of incomplete data on the sexual behavior scale?
What choices do you have regarding the missing data on the sexual behavior scale?
What might you do with the data from the participant who reported the highest possible scores on every item on both scales?
Explain why you would or would not report on how you made your decision about what to do with outliers or with the missing or incomplete data in your write-
You have just conducted a study testing how well two independent variables, daily sugar intake (as assessed by a 25-
What will you do with the data of the 3 participants who dropped out just before having their blood sugar levels assessed?
What are your options with regard to the data from the 2 participants who left one item blank on the physical activity scale?
What are your options with regard to the data from the 4 participants who did not respond to most of the items on the eating habits scale?
Do you recommend using these data at all? If so, how?
In Next Steps, we noted that the z distribution is sometimes used to identify potential outliers in a data set. Box Office Mojo (2013) provides data on U.S. box office receipts for major films. Here are worldwide box office grosses for a randomly selected sample of 15 of the 100 top-
190
Movie | Millions of dollars |
---|---|
Marvel’s The Avengers | 1512 |
Flight | 162 |
Skyfall | 1109 |
Wrath of the Titans | 305 |
Tyler Perry’s Madea’s Witness Protection | 66 |
Zero Dark Thirty | 109 |
Lincoln | 275 |
Moonrise Kingdom | 68 |
Life of Pi | 609 |
The Lucky One | 99 |
The Bourne Legacy | 276 |
The Watch | 68 |
Rock of Ages | 59 |
Cloud Atlas | 131 |
Snow White and the Huntsman | 397 |
Eyeball the data. Which score or scores seem like they might be outliers?
Sometimes potential outliers are defined as scores that are beyond 2 standard deviations from the mean—
Sometimes potential outliers are defined as scores that are beyond 3 standard deviations from the mean—
Why might it make sense to eliminate potential outliers from any data analyses?
Explain why the decision about how to identify potential outliers should be made before collecting data.