Clarifying the Concepts
What is the difference between descriptive and inferential statistics?
What is the difference between a sample and a population?
Identify and define the four types of variables that researchers use to quantify their observations.
Describe two ways that statisticians might use the word scale.
Distinguish between discrete and continuous variables.
What is the relation between an independent variable and a dependent variable?
What are confounding variables (or simply confounds), and how are they controlled using random assignment?
What is the difference between reliability and validity, and how are the two concepts related?
To test a hypothesis, we need operational definitions of the independent and dependent variables. What is an operational definition?
In your own words, define the word experiment—first as you would use it in everyday conversation and then as a researcher would use it.
What is the difference between experimental research and correlational research?
What is the difference between a between-
In statistics, it is important to pay close attention to language. The following statements are wrong but can be corrected by substituting one word or phrase. For example, the sentence “Only correlational studies can tell us something about causality” could be corrected by changing “correlational studies” to “experiments.” Identify the incorrect word or phrase in each of the following statements and supply the correct word.
In a study on exam preparation, every participant had an equal chance of being assigned to study alone or with a group. This was a correlational study.
A psychologist was interested in studying the effects of the dependent variable of caffeine on hours of sleep, and she used a scale measure for sleep.
A university assessed the reliability of a commonly used scale—
In a within-
The following statements are wrong but can be corrected by substituting one word or phrase. (See the instructions in Exercise 1.13.) Identify the incorrect word or phrase in each of the following statements and supply the correct word.
A researcher examined the effect of the ordinal variable “gender” on the scale variable “hours of reality television watched per week.”
A psychologist used a between-
In a study on the effects of the confounding variable of noise level on the dependent variable of memory, researchers were concerned that the memory measure was not valid.
A researcher studied a population of 20 rats to determine whether changes in exposure to light would lead to changes in the dependent variable of amount of sleep.
When John Snow investigated the 1854 outbreak of cholera in London, he paid particular attention to outliers.
What is an outlier?
What are potential benefits of outlier analysis?
Calculating the Statistics
University bookstore employees asked 225 students to complete a customer satisfaction survey after these customers bought their books. The bookstore manager wanted to find ways to improve the customer experience. Identify the sample and population for this example.
Over the course of 1 week, a grocery store randomly selected 100 customers to complete a survey about their favorite products. Identify the sample and population for this example.
A researcher studies the average distance that 130 people living in U.S. urban areas walk each week.
What is the size of the sample?
Identify the population.
Is this “average” a descriptive statistic or an inferential statistic if it is used to describe the 130 people studied?
How might you operationalize the average distance walked in 1 week as an ordinal measure?
How might you operationalize the average distance walked in 1 week as a scale measure?
As they leave a popular grocery store, 73 people are stopped and the number of fruit and vegetable items they purchased is counted.
18
What is the size of the sample?
Identify the population.
Is this number of items counted as a descriptive statistic or an inferential statistic if it is used to estimate the diets of all shoppers?
How might you operationalize the amount of fruit and vegetable items purchased as a nominal measure?
How might you operationalize the amount of fruit and vegetable items purchased as an ordinal measure?
How might you operationalize the amount of fruit and vegetable items purchased as a scale measure?
In the fall of 2008, the U.S. stock market plummeted several times, with grave consequences for the world economy. A researcher might assess the economic effect this situation had by seeing how much money people saved in 2009. That amount could be compared to the amount people saved in more economically stable years. How might you operationalize the economic implications at a national level?
A researcher might be interested in evaluating how a person’s physical and emotional distance from Manhattan at the time of the 9/11 terrorist attacks relates to the accuracy of memory about the event.
Identify the independent variables and the dependent variable.
Imagine that physical distance is assessed as within 100 miles or beyond 100 miles; also, imagine that emotional distance is assessed as knowing no one who was affected, knowing people who were affected but lived, and knowing someone who died in the attacks. How many levels do the independent variables have?
How might you operationalize the dependent variable, accuracy of memory about the event?
A study of the effects of skin tone (light, medium, and dark) on the severity of facial wrinkles in middle age might be of interest to cosmetic surgeons.
What would the independent variable be in this study?
What would the dependent variable be in this study?
How many levels would the independent variable have?
Since 1980, most of the cyclists who have won the Tour de France have won it just once. Several cyclists have won it two or three times. The Spanish cyclist Miguel Induráin won it five times, and the U.S. cyclist Lance Armstrong won it seven times (although he was stripped of his medals for violating the drug policy). Identify the outlier or outliers among the cyclists.
Referring to Exercise 1.23, what might be the purpose of an outlier analysis in this case? What might it reveal?
Applying the Concepts
Average weights in the United States: The CDC reported very large weight increases for U.S. residents of both genders and all age groups over the past four decades. Go to the Web site that reports these data (http:/
What were the average weights of 10-
Do you think the CDC weighed every girl in the United States to get these averages? Why would this not be feasible?
How does the average weight of 10-
Sample versus population in Norway: The Nord-
What is the sample used by these researchers?
What is the population to which the researchers would like to extend their findings?
Types of variables and Olympic swimming: At the 2012 London Summer Olympics, American Michael Phelps won four gold medals, bringing his overall Olympic career total to 18 gold medals, the all-
Phelps of the United States came in first, and Chad le Clos of South Africa and Evgeny Korotyshkin of Russia tied for second place.
Phelps finished in 51.21 seconds, and le Clos and Korotyshkin finished in 51.44 seconds.
One might examine the hemisphere from which swimmers hailed. Phelps and Korotyshki live in the Northern Hemisphere, whereas le Clos lives in the Southern Hemisphere.
Types of variables and the Kentucky Derby: The Kentucky Derby is perhaps the premier event in U.S. horse racing. For each of the following examples from the derby, identify the type of variable—
As racing fans, we would be very interested in the variable finishing position. A horse called Orb won in 2013, followed by Golden Soul in second and Revolutionary in third.
We also might be interested in the variable finishing time. Orb won in 2 minutes, 2.89 seconds.
19
Derby attendance was 151,616 in 2013, not as high as the record-
If we were the betting type, we might examine the variable payoffs. For each $2.00 bet on Orb, a gambler won $12.80.
We might be interested in the history of the Derby and the demographic variables of jockeys, such as gender or race. For example, in the first 28 runnings of the Kentucky Derby, 15 of the winning jockeys were African American.
In the luxury boxes, high fashion reigns; we might be curious about the variable hat wearing, observing how many women wear hats and how many do not.
Discrete versus continuous variables: For each of the following examples, state whether the scale variable is discrete or continuous.
The capacity, in terms of songs, of an iPod
The playing time of an individual song
The cost in cents to download a song legally
The number of posted reviews that a CD has on Amazon.com
The weight of an MP3 player
Reliability and validity: Go online and take the personality test found at http:/
What does it mean for a test to be reliable? Take the test a second time. Does it seem to be reliable?
What does it mean for a test to be valid? Does this test seem to be valid? Explain.
The test asks a number of demographic questions at the end, including “In what country did you spend most of your youth?” Can you think of a hypothesis that might have led the developers of this Web site to ask this question?
For your hypothesis in part (c), identify the independent and dependent variables.
Reliability, validity, and wine ratings: You may have been in a wine store and wondered just how useful those posted wine ratings are. (They are usually rated on a scale from 50 to 100, with 100 being the top score.) After all, aren’t ratings subjective? Corsi and Ashenfelter (2001) studied whether wine experts are consistent. Knowing that the weather is the best predictor of price, the researchers wondered how well weather predicted experts’ ratings. The variables used for weather included temperature and rainfall, and the variable used for wine experts’ ratings was the number they assigned to each wine.
Name one independent variable. What type of variable is it? Is it discrete or continuous?
Name the dependent variable. What type of variable is it? Is it discrete or continuous?
How does this study reflect the concept of reliability?
Let’s say that you frequently drink wine that has been rated highly by Robert Parker, one of this study’s wine experts. His ratings were determined to be reliable, and you find that you usually agree with Parker. How does this observation reflect the concept of validity?
Operationalizing variables and rap statistics: The Web site Rap Genius analyzes rap music using what they call RapMetrics (http:/
How does Rap Genius operationalize the best rapper?
List at least three other variables that might be considered in determining the best rapper.
Across all songs, MF Doom came in first with an overall rhyme density of 0.44, and Cam’ron came in second with a rhyme density of 0.41. What kind of variable is the ranking, and what kind of variable is the rhyme density?
Rap Genius summarized its rhyme density discussion by saying: “MF Doom surprised us a little. First the big New Yorker profile and now this…it’s too much! Cam’ron is much more culturally relevant and interesting (plus he doesn’t need to wear a mask, he’s naturally silly), so we at RapMetrics™ consider him the G.O.A.T. (greatest of all time) MC [rapper] overall.” Rap Genius, therefore, changed its operational definition a bit. What has it added to its operational definition of the overall G.O.A.T. MC?
Identifying and operationalizing variables: For each of the following hypotheses, identify the independent variable and the most likely levels of that independent variable. Then identify the likely dependent variable and a likely way to operationalize that dependent variable. Be specific.
Teenagers are better at video games, on average, than are adults in their 30s.
20
Spanking children tends to lead them to be more violent.
Weight Watchers leads to more weight loss, on average, for people who go to meetings than for those who only participate online.
Students do better in statistics, on average, if they study with other people than if they study alone.
Drinking caffeinated beverages with dinner tends to make it harder to get to sleep.
Between-
Describe how you could study the exercise program using a between-
Describe how you could study the exercise program using a within-
What is a potential confound of a within-
Correlational research and smoking: For decades, researchers, politicians, and tobacco company executives debated the relation between smoking and health problems such as cancer.
Why was this research necessarily correlational in nature?
What confounding variables might make it difficult to isolate the health effects of smoking tobacco?
How might the nature of this research and these confounds buy time for the tobacco industry with regard to acknowledging the hazardous effects of smoking?
All ethics aside, how could you study the relation between smoking and health problems using a between-
Experimental versus correlational research and culture: A researcher interested in the cultural values of individualistic and collectivist societies collects data on the rate of relationship conflict experienced by 32 people who test high for individualism and 37 people who test high for collectivism.
Is this research experimental or correlational? Explain.
What is the sample?
Write a possible hypothesis for this researcher.
How might we operationalize relationship conflict?
Experimental versus correlational research and recycling: A researcher wants to know if people’s concerns about the environment vary as a function of incentives provided for recycling. Students living on a university campus are recruited to participate in a study. Some students are randomly assigned to a group in which they are rewarded financially for their recycling efforts for one month. The other students are randomly assigned to a group in which they are assessed a fine that is based on the amount of material that they could have, but did not, recycle.
Is this research experimental or correlational? Explain.
Write a hypothesis for this researcher.
Outliers, exercise, and weight gain: Imagine that you conducted the study described in Exercise 1.34 and that one person had gained many, many pounds while in the exercise program.
Why would this individual be considered an outlier?
Explain why outlier analysis might be useful in this situation.
What kinds of things are we looking for in an outlier analysis?
Outliers and response time: Imagine that a researcher is measuring the time it takes participants to identify whether a string of letters constitutes a word (e.g., duke) or a nonword (e.g., dake). She measures the response time of 40 participants. She finds that most participants took from ½ to 1 second to make their decision but that one participant took 3 minutes to make a decision.
Why would the participant who took 3 minutes be considered an outlier?
What kinds of things might the researcher look for in an outlier analysis of this situation?
Putting It All Together
Romantic relationships: Goodman and Greaves (2010) reported findings from the Millennium Cohort Study, a large research project in the United Kingdom. They stated that “while it is true that cohabiting parents are more likely to split up than married ones, there is very little evidence to suggest that this is due to a causal effect of marriage. Instead, it seems simply that different sorts of people choose to get married and have children, rather than to have children as a cohabiting couple, and that those relationships with the best prospects of lasting are the ones that are most likely to lead to marriage” (p. 1).
What is the sample in this study?
What is the likely population?
21
Is this a correlational study or an experiment? Explain.
What is the independent variable?
What is the dependent variable?
What is one possible confounding variable? Suggest at least one way in which the confounding variable might be operationalized.
Experiments, HIV, and cholera: Several studies have documented the susceptibility of people who are HIV-
Describe a way in which the researchers could have conducted an experiment to examine the effectiveness of the cholera vaccine among people who are HIV-
If the researchers did conduct an experiment, would this have been a between-
The researchers did not randomly assign participants to vaccine or no-
The researchers did not use random assignment when conducting this study. List at least one practical reason and at least one ethical reason that they might not have used random assignment.
Ability and wages: Arcidiacono, Bayer, and Hizmo (2008) analyzed data from a national longitudinal survey called NLSY79, which includes data from over 12,000 men and women in the United States who were in the 14-
List any independent variables.
List any dependent variables.
What is the sample in this study?
What is the population about which researchers want to draw conclusions?
What do the authors mean by “longitudinal” in this study?
The researchers used the Armed Forces Qualification Test (AFQT) as their measure of ability. The AFQT combines scores on word knowledge, paragraph comprehension, arithmetic reasoning, and mathematics knowledge subscales. Can you suggest at least one confounding variable in the relation between ability and wages when comparing college graduates to high school graduates?
Suggest at least two other ways in which the researchers might have operationalized ability.
Assessing charitable organizations: Many people do research on charitable organizations before deciding where to donate their money. Tina Rosenberg (2012) reported that traditionally many people have used sources such as Charity Navigator or the Better Business Bureau’s Web sites. Both of these sites rate organizations more highly if the organizations use less of their donation money for fundraising or administration and more of it for the cause they are supporting. On Charity Navigator, for example, Doctors Without Borders, a nonprofit focused on health and medical needs, gets a rating of 57.11 out of 70 based on its financial practices, accountability, and transparency; this puts that organization in the second of Charity Navigator’s five tiers (http:/
How does Charity Navigator operationalize a good charity?
What kind of variable is the score of 57.11 out of 70, nominal, ordinal, or scale? Explain your answer.
What kind of variable is the tier, second out of five—
There are many types of charities. Doctors Without Borders focuses on health and medical needs. What kind of variable is its type of charity—
According to Rosenberg (2012), Toby Ord, a moral philosopher from Oxford University, thinks the traditional operational definition of what constitutes a good charity is too limited. He has five criteria that he sees as important for a good charity: It targets the most serious problems (disease over art, for example). It uses evidence-
22
Which is more likely to be reliable, the rating by Charity Navigator or the verdict by GiveWell? Explain your answer.
Which is more likely to be valid, the rating by Charity Navigator or the verdict by GiveWell? Explain your answer.
If you were to monitor whether increased donation funds led to a lower death rate in a country, would that be an experiment or a correlational study? Explain your answer.
If you were to randomly assign some regions to receive more donation funds and other regions to receive fewer, and then track the death rate in both sets of regions, would that be an experiment or a correlational study? Explain your answer.