Almost everything worth evaluating requires a sample, from voting trends to sales patterns to the effectiveness of flu vaccines. The goal of sampling is simple: Collect a sample that represents the population. As Lillian Gilbreth reminds us, efficient living and efficient sampling are both satisfying and possible—
A random sample is one in which every member of the population has an equal chance of being selected into the study.
A convenience sample is one that uses participants who are readily available.
There are two main types of samples: random samples and convenience samples. A random sample is one in which every member of the population has an equal chance of being selected into the study. A convenience sample is one that uses participants who are readily available, such as college students. A random sample remains the ideal and is far more likely to lead to a representative sample, but it is usually expensive and can present a lot of practical problems. It is often almost impossible to get access to every member of the population in order to be able to choose a random sample from among them. Technologies such as Amazon Mechanical Turk, SurveyMonkey, and many other Internet tools offer new ways of obtaining convenience samples from a more diverse sample of participants.
5.1: There are two main types of samples in social science research. In the ideal type (a random sample), every member of the population has an equal chance of being selected to participate in a study. In the less ideal but more common type (a convenience sample), researchers use participants who are readily available.
Imagine that there has recently been a traumatic mass murder in a town and that there are exactly 80 officers in the town’s police department. You have been hired to determine whether peer counseling or professional counseling is the more effective way to address the department’s concerns in the aftermath of this trauma. Unfortunately, budget constraints dictate that the sample you can recruit must be very small—
Here’s one way: Assign each of the 80 officers a number, from 01 to 80. Next, use a random numbers table (Table 5-
Excerpt from a Random Numbers Table | ||||||
---|---|---|---|---|---|---|
04493 | 52494 | 75246 | 33824 | 45862 | 51025 | 61962 |
00549 | 97654 | 64501 | 88159 | 96119 | 63896 | 54692 |
35963 | 15307 | 26898 | 09354 | 33351 | 35462 | 77974 |
59808 | 08391 | 45427 | 26842 | 83609 | 49700 | 46058 |
For example, say you begin with the 6th number of the second row of Table 5-
104
Were you surprised that random sampling selected both the number 46 and the number 45? Truly random numbers often have strings of numbers that do not seem to be random. For example, notice the string of three 3’s in the third row of the table.
You can also search online for a “random numbers generator,” which is how we came up with the following numbers: 10, 23, 27, 34, 36, 67, 70, 74, 77, and 78. You might be surprised that 4 of the 10 numbers were in the 70s. Don’t be. Random numbers are truly random, even if they don’t look random.
Random samples are almost never used in the social sciences because we almost never have access to the whole population. For example, if we were interested in studying the eating behavior of voles, we would never be able to list the whole population of voles from which to then select a random sample. If we were interested in studying the effect of video games on the attention span of teenagers in the United Kingdom, we would never be able to identify all U.K. teenagers from which to choose a random sample.
Generalizability refers to researchers’ ability to apply findings from one sample or in one context to other samples or contexts; also called external validity.
It is far more convenient (faster, easier, and cheaper) to use voles that we bought from an animal supply company or to gather teenagers from the local school—
105
Replication refers to the duplication of scientific results, ideally in a different context or with a sample that has different characteristics.
Fortunately, we can increase external validity through replication, the duplication of scientific results, ideally in a different context or with a sample that has different characteristics. In other words, do the study again. And again. Then ask someone else to replicate it, too. That’s the slow but trustworthy process by which science creates knowledge that is both reliable and valid. That’s also why some of the real “scientific breakthroughs” you hear about are really just the tipping point based on many smaller research discoveries.
A volunteer sample, or self-
Liars’ Alert! We must be even more cautious when we use a volunteer sample (also called a self-
Let’s be blunt: if you don’t understand sampling, then you make it easy for others to take advantage of you. For example, the colorful cosmetics catalog Lush Times uses the following testimonial about the amazing skin-
The population of interest is women close to age 60. The sample is the one woman who wrote to Lush. Assuming that it is a real letter, there are two major problems. First, one person is not a trustworthy sample size. Second, this is a volunteer sample. The customer who had this experience chose to write to Lush. Was she likely to write to Lush if she did not feel very strongly about this product? Moreover, would Lush be likely to publish her statement if it weren’t positive? The moisturizer still might be effective, but this testimonial doesn’t provide evidence worth listening to.
Lush touts its products as meant for people of all ages. But with colorful, cartoon-
We could randomly assign a certain number of people to use Skin’s Shangri La and an equal number of people to use another product (or no product), and then see which group has better skin a certain number of weeks later. Which do you find more persuasive: a dubious testimonial or a well-
Random assignment is the distinctive signature of a scientific study. Why? Because it levels the playing field when every participant has an equal chance of being assigned to any level of the independent variable. Random assignment is different from random selection. Random selection is the ideal way to gather a sample from a population; random assignment is what we do with participants once they have been recruited into a study, regardless of how they got there. Practical problems related to getting access to an entire population mean that random selection is almost never used; however, random assignment is used whenever possible—
106
5.2: Replication and random assignment to groups help overcome problems of convenience sampling. Replication involves repeating a study, ideally with different participants or in a different context, to see whether the results are consistent. With random assignment, every participant has an equal chance of being assigned to any level of the independent variable.
Random assignment involves procedures similar to those used for random selection. If a study has two levels of the independent variable, as in the study of police officers, then you would need to assign participants to one of two groups. You could decide, arbitrarily, to number the groups 0 and 1 for the “peer counseling” and “therapist counseling” groups, respectively. Then, (a) select any point on the table; (b) decide to go across, back, up, or down to read through the numbers—
For example, if you began at the first number of the last row of Table 5-
An online random numbers generator lets us tell the computer to give us one set of 10 numbers that range from 0 to 1. We would instruct the program that the numbers should not remain unique because we want multiple 0’s and multiple 1’s. In addition, we would request that the numbers not be sorted because we want to assign participants in the order in which the numbers are generated. When we used an online random numbers generator, the 10 numbers were 1110100001. In an experiment, we usually want equal numbers in the groups. If the numbers were not exactly half 1’s and half 0’s, as they are in this case, we could decide in advance to use only the first five 1’s or the first five 0’s. Just be sure to establish your rule for random assignment ahead of time and then stick to it!
Reviewing the Concepts
Clarifying the Concepts
Calculating the Statistics
Applying the Concepts
Solutions to these Check Your Learning questions can be found in Appendix D.