You don’t have to eat the entire pot of soup to know it needs more salt. That is the idea of sampling: to gain information about the whole by examining only a part. Sample surveys are an important kind of observational study. They survey some group of individuals by studying only some of its members, selected not because they are of special interest but because they represent the larger group. Here is the vocabulary we use to discuss sampling.
9
Populations and samples
The population in a statistical study is the entire group of individuals about which we want information.
A sample is the part of the population from which we actually collect information and is used to draw conclusions about the whole.
Notice that the population is the group we want to study. If we want information about all U.S. college students, that is our population—even if students at only one college are available for sampling. To make sense of any sample result, you must know what population the sample represents. Did a preelection poll, for example, ask the opinions of all adults? Or citizens only? Registered voters only? Democrats only? The sample consists of the people we actually have information about. If the poll can’t contact some of the people it selected, those people aren’t in the sample.
[Design Element Start: Math Inserted by Jouve]
(1.4)
[Design Element End]
The distinction between population and sample is basic to statistics. The following examples illustrate this distinction and also introduce some major uses of sampling. These brief descriptions also indicate the variables measured for each individual in the sample.
[Design Element Start: Inserted From IPS9e]
77 | 289 | 128 | 59 | 19 | 148 | 157 | 203 |
126 | 118 | 104 | 141 | 290 | 48 | 3 | 2 |
372 | 140 | 438 | 56 | 44 | 274 | 479 | 211 |
179 | 1 | 68 | 386 | 2631 | 90 | 30 | 57 |
89 | 116 | 225 | 700 | 40 | 73 | 75 | 51 |
148 | 9 | 115 | 19 | 76 | 138 | 178 | 76 |
67 | 102 | 35 | 80 | 143 | 951 | 106 | 55 |
4 | 54 | 137 | 367 | 277 | 201 | 52 | 9 |
700 | 182 | 73 | 199 | 325 | 75 | 103 | 64 |
121 | 11 | 9 | 88 | 1148 | 2 | 465 | 25 |
[Design Element End]
Example 4 Public opinion polls
Polls such as those conducted by Gallup and many news organizations ask people’s opinions on a variety of issues. The variables measured are responses to questions about public issues. Though most noticed at election time, these polls are conducted on a regular basis throughout the year. For a typical opinion poll:
Population: U.S. residents 18 years of age and over. Noncitizens and even illegal immigrants are included.
Sample: Between 1000 and 1500 people interviewed by telephone.
Example 5 The Current Population Survey
Government economic and social data come from large sample surveys of a nation’s individuals, households, or businesses. The monthly Current Population Survey (CPS) is the most important government sample survey in the United States. Many of the variables recorded by the CPS concern the employment or unemployment of everyone over 16 years old in a household. The government’s monthly unemployment rate comes from the CPS. The CPS also records many other economic and social variables. For the CPS:
10
Population: The more than 123 million U.S. households. Notice that the individuals are households rather than people or families. A household consists of all people who share the same living quarters, regardless of how they are related to each other.
Sample: About 60,000 households interviewed each month.
Example 6 TV ratings
Market research is designed to discover what consumers want and what products they use. One example of market research is the television-rating service of Nielsen Media Research. The Nielsen ratings influence how much advertisers will pay to sponsor a program and whether or not the program stays on the air. For the Nielsen national TV ratings:
Population: The more than 116 million U.S. households that have a television set.
Sample: About 25,000 households that agree to use a “people meter” to record the TV viewing of all people in the household.
[Design Element Start: Math Inserted by Jouve]
[Design Element End]
The variables recorded include the number of people in the household and their age and sex, whether the TV set is in use at each time period, and, if so, what program is being watched and who is watching it.
[Design Element Start: Inserted From IPS9e]
THE MEAN x
To find the mean of a set of observations, add their values and divide by the number of observations. If the n observations are x1, x2, . . . , xn, their mean is
or, in more compact notation,
[Design Element End]
Example 7 The General Social Survey
Social science research makes heavy use of sampling. The General Social Survey (GSS), carried out every second year by the National Opinion Research Center at the University of Chicago, is the most important social science sample survey. The variables cover the subject’s personal and family background, experiences and habits, and attitudes and opinions on subjects from abortion to war.
Population: Adults (aged 18 and over) living in households in the United States. The population does not include adults in institutions such as prisons and college dormitories. It also does not include persons who cannot be interviewed in English.
Sample: About 3000 adults interviewed in person in their homes.
NOW IT’S YOUR TURN
1.1 Legalizing marijuana. The Pew Research Center conducted a poll on March 25–29, 2015. They asked:
Do you think the use of marijuana should be made legal or not?
[Design Element Start: Math Inserted by Jouve]
[Design Element End]
The Pew Research Center reported that the poll consisted of telephone interviews with 1500 randomly selected adult Americans. What do you think the population is? What is the sample?
11
Most statistical studies use samples in the broad sense. For example, the 638 children with leukemia in Example 3 are supposed to represent all children with leukemia. We usually reserve the dignified term “sample survey” for studies that use an organized plan to choose a sample that represents some specific population. The children with leukemia were patients at centers that specialize in treating children’s cancer. Expert judgment says they are typical of all leukemia patients, even though they come only from special types of hospitals. A sample survey doesn’t rely on judgment: it starts with an entire population and uses specific, quantifiable methods to choose a sample to represent the population. Chapters 2, 3, and 4 discuss the art and science of sample surveys.