655
Note: Page numbers in boldface type indicate pages where key terms are defined.
Aaron, Hank, 267, 268–
Abecedarian Project, 123
acceptance sampling, 558–
accuracy
of data, 12
of measurement, 171–
achievement tests, 169
ACT test. See American College Testing (ACT) test
Adams, Evelyn Marie, 411
Advanced Placement (AP) exams, 169
advertising, gender and, 128
age, height and, 345
aging population statistics, 193, 244, 247, 253–
alcohol consumption, facial attractiveness and, 141
alternative hypothesis, 525, 525–
American College Testing (ACT) test, 304, 305
American Community Survey (ACS), 12, 68
American Football League (AFL), 339
American Medical Association, 39
American Psychological Association, Ethical Principles of, 152
American Time Use Survey (ATUS), 533
anger, heart disease and, 581–
anonymity, 145
antidepressants, placebo vs., 551–
approximate level C confidence interval, 504, 508
approximately Normal, 496, 506
Archaeopteryx fossils, classifying, 321–
arithmetic average, 281
Arizona State University, 50
Armed Forces Qualifying Test, 224
asbestos in schools, 417–
aspirin and reduction of heart attacks, 149
astragali, 408–
astrological sign, health and, 556–
atomic clock, 175, 176
authoritarian personality, 177–
auto manufacturer loans, 188–
average, 175, 175–
back-
Bailar, John, 142
banks, big data and, 250
bar fights, 69
bar graphs, 218–
baseball
ballpark beer and hot dog prices, 348, 352
divorce and game attendance with spouse, 557
home run statistics, 267–
player salaries, 367, 372–
probability of appearing in World Series, 415
.300 hitters, 452
base period, 368, 368–
Basic and Applied Social Psychology (BASP), 555
basketball
field-
player salaries, 281–
run of baskets in, 410
Bayes, Thomas, 417
Bayes’s procedure, 417
Bayes’s theorem, 417
Beardstown Ladies’ Common-
beer prices at the ballpark, 348, 352
Behavioral Risk Factor Surveillance System (BRFSS), 493–
behavioral science experiments, 151–
bell curve, 300
Benford’s law, 488
Berra, Yogi, 7
bias
big data and, 354
nonadherers and, 121
reducing, 41, 43, 43–
social desirability, 67
biased sampling, 21–
Big Bang, 317
big data, 24, 250
correlation, prediction and, 353–
block design, 127, 127–
blood pressure of executives, 509
body mass index (BMI), of mothers and daughters, 349–
body temperature, 293–
656
Bonds, Barry, 267–
bone marrow treatment (BMT), 151
boxplots, 272–
Bradley, Tom, 67
Bradley effect, 67
brain size, intelligence and, 163, 174, 319, 322, 327
Broca, Paul, 174
Buffon, Count, 407, 416, 526–
bullying, depression and, 104–
Bureau of Economic Analysis, 377
Bureau of Justice Statistics, 377
Bureau of Labor Statistics (BLS), 166, 167, 176, 377, 378
Consumer Price Index and, 368, 374–
seasonal adjustments, 225
unemployment rate, 166, 176
use of Internet surveys, 78
burger joints, most popular, 174
burglaries during summer, 192–
Burt, Cyril, 201
buying power, adjusting for changes in, 367, 371–
Cadillac brand, 189–
caffeine dependence, 117
calculator, finding mean and standard deviation on, 277, 279
call-
cancer clusters, 412–
car accidents, risk of, 417–
car sales, 189–
Carter, Jimmy, 341–
categorical variables, 4, 218, 220–
causation, 348–
evidence for, 352–
cause, chance and, 412
cause and effect
direct, 350
experiments and, 13–
cause-
cell counts, 581
cell phones, telephone surveys and, 76
censuses, 11, 11–
center
of density curve, 295–
of distribution, 248
Centers for Disease Control and Prevention (CDC), 39, 353, 354, 493
central limit theorem, 507, 507–
cereals, fiber content of, 243
chance, 403, 405–
ancient history of, 408–
myths about, 409–
probability and (See probability(ies))
chart junk, 230
cheating on exams, 533
check fraud, 525
children
number of related in American households, 469
probability of sex of, 414, 452–
chi-
chi-
chi-
using, 581–
chosen in stages, 73
cigarette smoking, lung cancer and, 348, 352–
classes, 245–
Cleveland, William, 246
Cleveland Cavaliers, 281–
climate change, 93
clinical trials, 97, 102, 120
data ethics and, 147–
measurement and, 164
minorities in, 120
of Orlistat, 121–
patient treatment in, 120–
Clinton, Bill, 148
Clinton, Hillary, 67
clusters, 73
cocaine addiction treatment, 574–
coffee
brewing methods, 126
preference for fresh-
cohabitation, 226–
coincidence, myth of surprising, 411
coin tosses, 405–
College Board, 169, 170, 315
colleges and universities
academic rank and gender, 571
average SAT scores of entering, 188
decline in students’ face-
measuring readiness for, 164–
race and graduation rates, 572
rankings of, 187
rise in women with degrees, 230
sample surveys and, 379–
SAT scores and college grades, 351
tuition and fees in Illinois, 248–
657
column variable, 572
common response, 349, 350–
comparative studies, 104, 101–
completely randomized experiment, 124
computation errors, 191–
computer-
computers, privacy and confidentiality of data and, 146
confidence
in polls, 80–
in sample, 30–
confidence intervals, 493–
advantages of, 554
estimation and, 494–
level C, 500, 504, 508
for population mean, 508–
for population proportion, 502–
sampling distribution of sample mean and, 505–
statistical inference and, 548–
confidence level C, 500, 508
confidence statements, 47–
confidentiality, 142, 145–
confounded variables, 96
confounding, 350–
matching and, 104–
consistency of data, 188–
Consumer Expenditure Survey, 374
Consumer Price Index (CPI), 368–
understanding, 374–
using, 370–
Consumer Reports, 127
control
block design and, 127
placebo, 149
control group, 99, 99–
convenience sampling, 22, 23–
Coordinated Universal Time, 175
correlation, 323, 323–
big data and, 353–
causation and, 352–
ecological, 338
independence and, 451
nonsense, 349
regression and, 345–
square of the, 346, 347
cost of living, CPI and, 375–
counts, 4, 168, 217, 295
cell, 581
data tables and, 245
expected, 575, 576
Crested Butte (Colorado), 188
crime, gun control and, 351–
critical values, 503, 503–
Crohn’s disease, 96–
cth percentile, 304
Current Population Survey (CPS), 9–
on cohabitation, 226
reduction of bias and, 176
sample design for, 73
on top causes of death, 215–
unemployment rate and, 166
Curry, Stephen, 473
data, 1–
accuracy of, 12
big, 24, 250, 353–
in censuses, 11–
computation errors and, 191–
consistency of, 188–
excessive precision or regularity of, 191
in experiments, 12–
falsification of, 190, 191
hidden agendas influencing, 194–
incomplete information about, 187–
individuals and variables and, 4–
in observational studies, 7–
organizing, 213
ownership of published, 148
plausibility of, 190–
privacy and confidentiality of, 146
quality of, 1
in sample surveys, 8–
statistical inference and, 548
uses of, 1
data ethics, 141–
behavioral and social science experiments and, 151–
clinical trials and, 147–
confidentiality and, 142, 145–
informed consent and, 142, 144–
institutional review boards and, 142, 143
data production design, 548
data source, in tables, 216
data tables, 215–
day care effects, 123
death, causes of, 215, 216
decision, inference as, 557–
decision rule, 559
decision theory, 560, 561
Declaration of Helsinki, 143, 148
degrees of freedom, 579
density curves, 295, 295–
histograms compared with, 295–
median and mean of, 296–
normal (See normal distributions)
dependent variables, 95
depression, history of bullying and, 104–
deviations, 224, 247
in scatterplot, 320
658
dice rolls, 408, 413, 430–
digits
really random, 447
simulation and, 447, 449–
direct causation, 350, 352
direction of scatterplot, 320
Dirksen, Everett, 345
discrimination in mortgage lending, 585–
distributions, 217, 267–
boxplots and, 272–
centers of, 248
chi-
five-
mean of, 277, 277–
median of, 268, 268–
normal (See Normal distributions)
numerical descriptions of, 281–
overall pattern of, 224, 247
quartiles of, 268, 268–
sampling (See sampling distributions)
shape of, 248
skewed to the left, 249, 250–
skewed to the right, 249, 251–
standard deviation of, 277, 277–
symmetric, 248, 249, 251, 252–
variability of, 248
variance of, 277
domestic violence experiments, 153
double-
driver fatigue, 168
dropouts from research studies, 121, 121–
dying, probability of, 408
Dyson vacuum cleaners, 222–
earnings. See income
ecological correlation, 338
Edmonton Oilers, 170
education. See also colleges and universities
earnings and, 267
grades and video-
level attained by adults, 216–
unemployment by level of, 223–
Einstein, Albert, 408
elderly people in population, 193, 244, 247, 253–
election polls, 49, 50–
elections
predicting states votes in, 341–
vote counting and, 347
Electronic Encyclopedia of Statistical Examples and Exercises (EESEE), 117
energy conservation, 100
equations, regression, 242–
errors. See also bias
computation, 191–
margin of (See confidence statements; margin of error)
measurement, 172
nonsampling, 64, 66–
processing, 66, 67
random, 64, 172
response, 66, 66–
roundoff, 217–
sampling, 64, 64–
standard, 496, 496–
Type I, 560
Type II, 560
estimation
confidence levels and, 494–
using samples, 40–
Euclid, 562
evaluation of poll results, 80–
event, 428
exclusive classes, 244
exercise, weight loss vs., 549
exhaustive classes, 244
exit polls, 50, 50–
expected counts, 575, 576
expected values, 465–
finding by simulation, 472–
law of large numbers and, 469–
winning systems for gambling and, 471–
experimental design, 117–
block, 127, 127–
completely randomized, 124
logic of, 101–
matched pairs, 126, 126–
one-
randomization in, 101–
in the real world, 124–
experiments, 12–
double-
ethics and (See data ethics)
generalization and, 122–
nonresponse and, 120–
observational studies vs., 93–
poorly conducted, 95–
randomized comparative, 98–
statistical significance and, 103, 103–
vocabulary of, 93–
explanatory variables, 94, 94–
extrapolation, 345
Facebook, 353
facial attractiveness, alcohol consumption and, 141
falsification of data, 190, 191
659
Fatality Analysis Reporting System, 165, 167–
FDA (Food and Drug Administration), 130
Fermat, Pierre de, 409
fiber content of cereals, 243
first quartile Q1, 270, 270–
Fisher, Ronald A., 529, 558
five-
5% significance level, 555–
fixed market basket price indexes, 369–
flu trends, 353, 354
food stamp participation, 225–
football
Pick 4 lottery and, 411
probability of winning Super Bowl, 339, 427, 428, 431
Forbes magazine, 195–
Ford Motor Company, 189–
form of scatterplot, 320, 321
fossils, classifying, 321–
Fox & Friends, 192
fruit and vegetable intake, 493–
frustration study, 122
F-
Gallup Polls, 9
on amount of federal income tax paid, 71
on vaccinations and autism, 30–
on voting and abortion issue, 47, 49
weighting responses, 72
Well-
World Poll, 76
Galton, Francis, 300, 344
gambling
ancient history of, 408–
expected values and, 465–
legalized, 465, 471
slot machines, 470
teen approval of, 434
winning systems for, 471–
games of chance, 408–
gasoline price index number, 368
Gauss, Carl Friedrich, 300
GDP, life expectancy and, 318–
gender
academic rank and, 571
advertising and, 128
probability of sex of children, 414, 452–
SAT exam and, 169–
generalization, from experiments, 122–
General Motors, 189–
General Social Survey (GSS), 10, 68, 379–
Global Positioning System, 174
global warming statistics, 192
Gnedenko, B. V.., 428
Goodall, Jane, 7, 12
Google, 250, 353–
government. See also U.S. Census Bureau
Consumer Price Index and, 368–
databases maintained by, 146
statistics used by, 377–
tax revenue breakdown, 221–
grades, video-
graduation rates, race and, 572
Graphic, Visualization, and Usability Center (GVU), 77
graphs, 215–
bar, 218–
constructing effective, 229–
data tables and, 215–
histograms, 243, 243–
line, 223, 223–
pictograms, 222, 222–
pie charts, 218, 218–
scales in, 226–
stemplots, 253, 253–
variables and, 217–
Greenspan, Alan, 376
Gretzky, Wayne, 170
gun control, crime and, 351–
Gut (journal), 96–
haphazard, 406–
Harris Poll Online, 77, 78–
Hawthorne effect, 128
health, astrological sign and, 556–
heart attacks, aspirin and, 149
heart disease
anger and incidence of, 581–
incidence in women, 195
sex bias in treating, 104, 105–
height
age and, 345
of children vs. parents, 344
heart attack risk and, 315
height distribution, 252–
Helsinki Declaration, 143, 148
Hennekens, Charles, 149
hidden agendas, influencing data, 194–
Higher Education Research Institute, 521
highway safety, 165, 167–
Hill, Theodore P., 488
histograms, 243, 243–
interpreting, 247–
home run statistics, 267–
660
honesty, 145
horse racing
payoff odds, 470
starting position in, 445
hot dog prices, 348, 352
Hubble, Edwin, 317
Hubble’s law, 317
human subjects research. See data ethics
Humphries, Robert, 412
Hurricane Katrina, 190–
hydroxyurea for sickle-
hypotheses, 524–
alternative, 525, 525–
null, 525, 526, 549, 555, 557, 558–
testing, 561
incoherent, 431
income
education level and, 267
mean, 282
median annual, 373–
income distribution, 282, 300
income inequality, 195–
incomplete information about data, 187–
independence, 448, 448–
independent trials, 448
independent variables, 95
index numbers, 368, 368–
individuals, 4, 4–
inference. See statistical inference
inflation, 374, 376
informed consent, 142, 144–
insect repellant effectiveness, 127
Inside Higher Education, 521
institutional review boards (IRBs), 142, 143
instrument, 164
intelligence
brain size and, 163, 174, 319, 322, 327
measurement of, 169, 300, 535
intercept, 343
International Bureau of Weights and Measures (BIPM), 175
International Committee for Weights and Measures, 174–
Internet surveys, 76–
InterSurvey, 78
investment returns, 279–
IQ tests, 169, 300, 535
JAMA (Journal of the American Medical Association), 142–
James, LeBron, 281–
Johansson, Mattias Petter, 25
Kerrich, John, 407, 416
kidney transplant, 453–
knowledge retention, improving, 101
labels, on tables, 216, 229
Lake Murray (South Carolina), elevation levels in, 250–
Landers, Ann, 23, 30
Landrieu, Mary, 190
Landsberger, Henry A., 128
large populations, sampling from, 49–
law of averages, 414–
myth of, 413–
law of large numbers, 414, 469, 469–
leaf, 253
Leap Day births, 405
least-
least-
legalization of marijuana, 3, 11, 21
legalized gambling, 465, 471
legends, 229
leukemia, power lines and, 7–
level C confidence interval, 500, 504, 508
level of confidence, 48
Lewis, C. S., 414
life expectancy
GDP and, 318–
television set ownership and, 348–
Lincoln brand, 189–
line graphs, 223, 223–
lists, 315
logic, of experimental design, 101–
Lott, John, 351–
lotteries
expected values and, 465–
rigging of, 468
winning, 411–
Love, Kevin, 281
low-
lung cancer, cigarette smoking and, 348, 352–
lurking variables, 96, 101, 102, 585, 586
Major League Baseball. See baseball
mall interviews, 23, 30
margin of error, 45–
sample survey and, 69
marijuana, legalization of, 3, 11, 21
marital status of young women, 427–
market basket, 369–
market research, 10
Marks, Bruce, 347
Mars Climate Orbiter, 164
matched pairs design, 126, 126–
matching, 104, 104–
661
McNamara, John, 189
mean, 277, 277–
of density curve, 296–
population, 508–
regression and, 344
sample, 300, 505–
of sampling distribution, 496
measles outbreaks, 39
measurement, 163–
accuracy of, 171–
defining variables and, 164–
errors in, 172 (See also bias)
in psychology, 176–
reliability of, 173–
validity of, 166–
median, 268, 268–
of density curve, 296–
medical helicopters, 584–
melon field infestation, 191
meta-
Meyer, Eric, 188
midpoint, 248, 281
miles per gallon, 320
minorities, underrepresentation in clinical trials, 120
Misterpoll.com, 77
MLive poll on legalization of marijuana, 3, 21
Mondale, Walter, 341
mortgage lending, discrimination in, 585–
mountain man price index, 369–
multiple-
mutual funds, 547
myths about chance behavior, 409–
NASDAQ composite stock index, 193–
National Assessment of Educational Progress (NAEP) scores, 509
National Cancer Institute, 413
National Center for Health Statistics (NCHS), 215, 377, 408
national deficit, predicting, 345
National Football League (NFL), 339
National Health Survey, 66
National Hockey League (NHL), 170
National Household Survey, 12
National Institute of Standards and Technology (NIST), 175, 176
National Opinion Research Center (NORC), 10, 379
natural supplements, placebo effect and, 130
negative association, 320, 321, 325
New England Journal of Medicine, 142, 150
New England Patriots, 427, 431
New York
mean income per person, 282
telephone surveys and, 75
Neyman, Jerzey, 561
Neyman-
Nielsen Media Research, 10
Nielsen TV ratings, 10, 78
95% confidence interval, 495, 501–
95% confident, 45, 47, 499, 499–
99% confidence interval, 504–
nonadherers in research studies, 121
nonrandom samples, inference based on, 535
nonresponse, 67, 67–
in experiments, 120–
Internet surveys and, 76–
nonsampling errors, 64, 66–
nonsense correlations, 349
Normal curve, 293, 299, 433–
Normal distributions, 293–
critical values of, 503–
density curves and, 295, 295–
percentiles of, 304, 304–
68–
standard scores and, 302–
Normal percentiles, 434–
Nova Southeastern University, 94, 95, 96
(n + 1)/2 rule, 271
null hypothesis, 525, 526, 549, 555–
null hypothesis significance testing procedures (NHSTP), 555
numerical descriptions, choosing, 281–
numerical variables, 4, 164
Obama, Barack, 49, 67
obesity
low-
in mothers and daughters, 349–
Orlistat study, 121–
observational studies, 7–
experiments vs., 93–
odds, 418, 431
One Million Random Digits, 447
one-
one-
online learning, 94
online social media, 521
opinion polls, 9, 63
accurate information about samples and, 63, 64–
call-
write-
Orlistat study, 121–
outliers, 245, 247
correlation and regression and, 346
population mean and, 508
scatterplots and, 320, 326
standard deviation and, 282–
overall pattern, 224, 247, 320
662
parameters, 40, 494
pari-
Pascal, Blaise, 409
Pearson, Egon S., 561
Pearson, Karl, 407, 416
percentage change, 194
percentages, 4
error and, 191–
two-
percentiles of normal distributions, 304, 304–
personal probabilities, 415–
personal space experiment, 152
Pew Research Center for the People and the Press, 63
Pew Research Center polls
on legalization of marijuana, 11
nonresponse and, 68–
on right to subpoena phone records, 70–
use of Internet surveys, 78
Pew Research Internet Project, 354
Pick 4 lottery, 411
pictograms, 222, 222–
pie charts, 218, 218–
pig whipworms, 96–
placebo effect, 77, 97, 97–
plausibility of data, 190–
Playfair, William, 293
playlist “shuffle” feature, 25
plus four estimate, 519
Point of Purchase Survey, 375
political influence, government statistics and, 378
polls. See also Gallup Polls; opinion polls
election, 47, 49, 50–
public opinion, 9
telephone, 63, 65, 68–
population(s), 9, 40, 40–
elderly people in, 193, 244, 247, 253–
sampling from large, 49–
population mean
confidence intervals for, 508–
significance tests for, 531–
population proportion
confidence intervals for, 494–
estimation from sample proportion, 45
positive association, 320, 321, 325
power lines, leukemia and, 7–
precipitation rates, 346
precision, excessive, of data, 191
prediction
big data and, 353–
regression and, 344–
of states’ votes in elections, 341–
predictive validity, 170, 170–
preelection polls, 50
pregnancies, length of human, 531–
price indexes
fixed market basket, 369–
index number and, 368, 368–
primary sampling units (PSUs), 73
privacy, 146, 147
probability(ies), 403, 405–
of dying, 408
odds and, 431
personal, 415–
of rain, 413
randomness and, 406–
risk and, 417–
simulation and, 446–
probability models, 427–
rules and, 429–
for sampling, 432–
simulation and, 446–
probability samples, 79, 79–
probability theory, 409
processing errors, 66, 67
professional athletes’ salaries, 281–
ProFunds Internet Inv Fund, 547
proportion, 295
sample, 494, 494–
pseudo-
psychology, measurement and, 176–
public opinion polls. See opinion polls
P-values, 524–
calculating, 529–
naked, 553
quantitative variables, 4, 218, 243
quartiles, 268, 268–
calculating, 270–
questions, wording of, 69, 70–
race and ethnicity
census form categories, 5–
discrimination in mortgage lending, 585–
elections and, 67
graduation rates and, 572
nonresponse and, 70
radio format, most popular, 1
rain, probability of, 413
RAND Corporation, 447
random, meaning of, in statistics, 406–
random digits, 26–
table of, 27, 27–
random drawings, 30
random error, 172
randomization in experimental design, 101–
663
randomized comparative experiments, 98–
random samples
simple (See simple random samples (SRSs))
stratified, 73, 73–
systematic, 90
random sampling error, 64
Rasmussen Report Poll, 192
rates, 168, 217
error and, 191
Reagan, Ronald, 341–
really random digits, 447
real-
reasoning of tests of significance, 522–
recession velocity, 317–
recycling, 5
Rees, Martin, 551
refusals, 120–
regression
correlation and, 345–
prediction using, 344–
toward the mean, 344
regression equations, 342–
regression lines, 340, 340–
least-
regularity
chance and, 409–
excessive, of data, 191
reliability, 172, 173–
averages and, 175–
reporter phone records, right to subpoena, 70–
Research Randomizer, 27, 30
response error, 66, 66–
responses, 8
weighting, 72, 79
response variables, 94, 94–
returns on investments, 279–
risk
height and heart attack, 315
probability and, 417–
return on investments and, 279–
Romney, Mitt, 49
roundoff errors, 217–
row variables, 572
rules, probability, 429–
Ruth, Babe, 271, 273, 367
sales tax, 221
sample(s), 9, 9–
accuracy of data produced by, 12
confidence statements and, 47–
estimation using, 40–
margin of error and, 45–
population size and, 49–
size of, 41, 44–
statistics describing, 40–
stratified, 129
variability of, 41–
voluntary response, 22, 22–
sample errors, random, 64
sample means, 300
sampling distribution of, 505–
sample proportion, 300, 494, 494–
sample surveys, 8, 8–
evaluating poll results and, 80–
Internet surveys and, 76–
nonsampling errors and, 64, 66–
probability samples and, 79–
real-
sampling errors and, 64, 64–
university, 379–
wording of questions and, 69, 70–
sampling
acceptance, 558–
biased, 21–
confidence in, 30–
convenience, 22, 23–
from large populations, 49–
probability models for, 432–
random (See random samples; simple random samples (SRSs))
sampling distributions, 432–
of sample mean, 505–
standard deviation of, 506
sampling errors, 64, 64–
sampling frame, 64, 64–
SAT exams
average scores of entering students, 188
college grades and, 351
as college readiness measure, 164–
gender gap and, 169–
percentiles for, 305–
ranking states and, 315
standard scores and, 302–
Vietnam effect and, 224
scales, 226–
scatterplots, 317–
independence and, 451
schools, asbestos in, 417–
Science, 191, 193
seasonally adjusted, 225
seasonal variation, 225, 247
second, defined, 175
several-
sex. See gender; women
sexual assault resistance program effects, 94–
Shakespeare, length of words in, 251, 252
shape of distribution, 248
664
sickle-
sigma (σ), 505–
significance, tests of. See tests of significance
significance level, 528, 528–
Simple Random Sample applet, 75
simple random samples (SRSs), 24–
choosing in two steps, 29–
random digits and, 26–
stratified random samples vs., 73–
of telephone numbers, 75–
variability and, 42
Simpson’s paradox, 584–
simulation, 446, 445–
finding expected values by, 472–
independence and, 448–
probability models and, 446–
68-
skewed to the left distributions, 249, 250–
skewed to the right distributions, 249, 251–
slope of line, 343
slot machines, 470
Slutsky, Robert, 191
smoking, 499
snowfall amounts, 188
social desirability bias, 67
socializing, decline in face-
social media, online, 521
social science experiments, 151–
Social Security Administration, privacy policy, 147
social statistics, 379–
soda consumption, 494–
software
choosing simple random sample using, 26
normal curve and, 293
statistical test, 526
Spielberger Trait Anger Scale test, 581
Spotify, 25
square of the correlation, 346, 347
standard deviation, 277, 277–
normal curves and, 298
outliers and, 282–
properties of, 279
of sampling distribution, 496, 506
standard error, 496, 496–
Standard & Poor’s 500 index, 227–
standard scores, 302–
starting value, 194
statistic(s), 4, 40, 494
causation and, 348–
government, 377–
social, 379–
test, 530
Statistical Abstract of the United States, 190, 193, 215, 371
statistical inference, 491, 494, 494–
confidence intervals and, 548–
data and, 548
as decision, 557–
limitations of tests and, 550–
meaning of, 549–
requirements for, 550
statistical significance and, 549–
for two-
wise use of, 547–
statistically significant, 103, 103–
statistical inference and, 549–
statistical significance at level α, 528–
Statistics Canada, 377
stem, 253
stemplots, 253, 253–
back-
Stinson, William, 347
stock prices, 227–
strata, 73
stratified random samples, 73, 73–
strength of relationship in scatterplot, 320, 321
subjects, 94, 94–
treatment of medical, 123–
substitute other households, 72
Sullivan, Robert, 199
Super Bowl, probability of winning, 339, 427, 428, 431
Super Bowl Indicator, 339
Supplemental Nutrition Assistance Program (SNAP), 225–
surgery, sham, 150
surveys
Internet, 76–
sample, 8, 8–
symmetric distributions, 248, 249, 251, 252–
symmetry
of density curve, 297–
of normal curve, 298
systematic random sample, 90
tables
data, 215–
of random digits, 27, 27–
three-
two-
taxation
international comparison of, 220, 221–
sales tax, 221
665
teaching assistants, evaluating, 30
telemarketer’s pause, 48
telephone samples, 75–
television ratings, 10, 78
television set ownership, life expectancy and, 348–
testing hypotheses, 561
tests of significance, 521–
hypotheses and, 524–
limitations of, 550–
for population means, 531–
P-values and, 524–
reasoning of, 522–
searching for significance and, 555–
statistical significance and, 528–
test statistic, 530
text messages sent, 293–
The Theory of Probability (Gnedenko), 428
third quartile Q3, 270, 270–
three-
time spent eating, 533–
Town Talk call-
treatment, 94, 94–
tree diagram, 453, 453–
trend, 224, 247
Tri-
Tuskegee syphilis study, 148
Tversky, Amos, 476, 489
Twitter, 354
two-
two-
Type I error, 560
Type II error, 560
undercoverage, 64, 65
Internet surveys and, 76–
unemployment
education level and, 223–
measuring, 166, 167, 175
units, 164
university sample surveys, 379–
unmarried couples living together, 226–
Urban Institute, 13
U.S. Census Bureau, 377, 378
American Community Survey (ACS), 68
income inequality data, 269–
income statistics, 196
racial categories, 5–
voluntary response issue and, 78
U.S. Geological Survey (USGS), 251
U.S. News & World Report, 187
utilities, 560
vaccinations, autism and, 30–
validity
measurement and, 166–
predictive, 170, 170–
Van Buren, Abigail, 23
variability, 43, 43–
of density curve, 296–
of distribution, 248
reducing, 44–
sampling, 41–
standard deviation and, 279
variables, 4, 4–
categorical, 4, 218, 243
column, 572
confounded, 96
dependent, 95
explanatory, 94, 94–
independent, 95
lurking, 96, 101, 102, 585, 586
numerical, 4, 164
quantitative, 4, 218, 243
response, 94, 94–
row, 572
types of, 218
variance, 173, 277
vehicles per household, 468
video-
Vietnam effect, 224
visual perception, 548
Vitter, David, 190
voluntary response samples, 22, 22–
Internet surveys and, 76–
Wainer, Howard, 224
Wald, Abraham, 320
Washington Post/ABC News poll, 70–
weighting of responses, 72, 79
weightlifting records, 246
weight loss, 549, 554
welfare mothers and employment, 13–
welfare systems, comparing, 129
winning systems, in gambling, 471–
women
academic rank and gender, 571
heart disease in, 195
height and risk of heart attack, 315
height distribution for, 301–
marital status of young, 427–
obesity in mothers and daughters, 349–
rise in college-
write-
Zogby International, 75