OBJECTIVES By the end of this section, I will be able to …
In Chapter 6, we dealt with probability distributions, which describe populations. Here, in Chapter 7, we return to the use of sample data, in order to show how populations and their samples are connected.
1 Sampling Distribution of ˉx for a Normal Population
In this chapter, we will develop methods that will allow us to quantify the behavior of statistics like ˉx. For the sample mean ˉx, this behavior is expressed in the sampling distribution of the sample mean.
The sampling distribution of the sample mean ˉx for a given sample size n consists of the collection of the means of all possible random samples of size n from the population.
For example, consider the population of all three North American countries, the USA, Canada, and Mexico. In the 2012 Olympics, the USA won 104 medals, Canada won 18, and Mexico won 7. So, the population mean number of medals is
μ=104+18+73=43
Table 1 shows all possible random samples of size n=2 from this population.
Sample 1 | Sample 2 | Sample 3 |
---|---|---|
USA and Canada | USA and Mexico | Canada and Mexico |
ˉx=104+182=61 | ˉx=104+72=55.5 | ˉx=18+72=12.5 |
So, for this tiny population, the sampling distribution of ˉx for n=2 is the collection of sample means, {61, 55.5, 12.5}. Note from Table 1 that the value for the sample mean ˉx varies from sample to sample. Thus, ˉx is a random variable, because its value depends on chance; that is, its value depends on which countries are drawn in the random sample. We say that ˉx exhibits sampling variability because its value changes from sample to sample. Fortunately, there are patterns (predictable behaviors) in how the sample mean ˉx varies. These patterns underlie all the statistical inference about μ that we will perform in Chapters 8–10. And learning those patterns is what this chapter is all about.
First, like any distribution, the sampling distribution of the sample mean has a balance point and, therefore, a mean. Let's find the mean of the sample means in Table 1, which is denoted μˉx:
μˉx=61+55.5+12.53=43
Figure 1 provides a dotplot of the sample means in Table 1, along with the mean of these sample means, indicated at the balance point μ=43.
Note that the mean of the sample means μˉx equals the population mean μ=43. This is always true. So, we generalize it as follows.
Fact 1: Mean of the Sampling Distribution of ˉx
The mean of the sampling distribution of the sample mean ˉx is the value of the population mean μ. It can be denoted as μˉx=μ and read as “the mean of the sampling distribution of ˉx is μ.”
Note: It is convenient to number a set of important facts as we build toward the Central Limit Theorem for Means and the Central Limit Theorem for Proportions.
Now, the populations and samples you will encounter, both here and in the real world, are too large to list out all the possible sample means, as we did in Table 1. Instead, we will examine population distributions of data, with a mean μ and standard deviation σ, just as we did in Chapter 6. However, in Chapter 7, the population may or may not be normally distributed.
Next let's explore some more behavior of the sampling distribution of ˉx, using a graphical example. In Example 1 we compare the population distribution of X with the sampling distribution of ˉx.
EXAMPLE 1 Sampling distribution of ˉx for a normal population
In Example 37 in Chapter 6 (page 369), we saw that X=temperature in Georgia in the month of April was normally distributed with a mean of μ=61.5° and a standard deviation of . We want to compare this population distribution of with the sampling distribution of . Using Minitab, 1000 samples of size were generated from this normal distribution, and the sample means were calculated for each sample. The histogram of green rectangles in Figure 2 was constructed, representing the sampling distribution of for .
Solution
Figure 2 shows the histogram (rectangles) of the means from the 1000 samples of size .
In Example 1, we estimated the standard deviation of the sampling distribution of to be , compared with the original standard deviation . Fact 2 shows the relationship between these two quantities in general.
Note: Fact 2 is derived using methods found in texts on advanced statistics.
Fact 2: Standard Deviation of the Sampling Distribution of
The standard deviation of the sampling distribution of the sample mean is
where is the population standard deviation and is the sample size. is called the standard error of the mean.
Thus, for Example 1, , so our estimate was correct. Note the in the denominator of the formula. Because of this factor, the larger the sample size, the tighter the resulting sampling distribution. Larger sample sizes lead to smaller variability, which results in more precise estimation.
EXAMPLE 2 Finding the mean and standard deviation of the sampling distribution of
According to the American Time Use Survey, the mean amount of sleep that 20- to 24-year-old women get per night is 9.3 hours (Source: Bureau of Labor Statistics1). Assume that the population standard deviation is hour. Find the mean and standard deviation for the sampling distribution of for the following sample sizes: (a) 4, (b) 100, (c) 400.
Solution
We have . Note that this value for does not depend on the sample size, so the value is true for any sample size. We also have .
NOW YOU CAN DO
Exercises 9–16.
YOUR TURN#1
For the situation in Example 2, find the mean and standard deviation for the sampling distribution of for .
(The solution is shown in Appendix A.)
What Does This Number Mean?
Consider for in Example 2(b). This is a measure of the variability of the sampling distribution of for this sample size. That is, if we take samples of size 100, our estimation of the population mean amount of sleep of all 20- to 24-year-old women will be within 0.1 hour (6 minutes) of the true population mean most of the time.
Recall in Example 1 that when the original population distribution is normal, the sampling distribution of is also normal. This is true in general, as stated in Fact 3, which also includes what we have learned so far in Fact 1 and Fact 2.
Fact 3: Sampling Distribution of the Sample Mean for a Normal Population
Note: Let the notation
denote a normal distribution with mean of and standard deviation of .
The good news is that, once we know that the sampling distribution is normal, we can use the methods we learned in Section 6.5 to solve probability problems. Specifically, we can standardize and produce , just as we would for any normal random variable, using Fact 4.
Fact 4: Standardizing a Normal Sampling Distribution for Means
When the sampling distribution of is normal, we may standardize to produce the standard normal random variable as follows:
where is the population mean, is the population standard deviation, and is the sample size.
2 Central Limit Theorem for Means
The discussion so far has been for the case when the original population is normally distributed. However, what if the population (distribution of ) is not normal? In this section, we use a simulation study to learn how the sampling distribution of the sample mean for non-normal populations becomes approximately normal as the sample size increases.
EXAMPLE 3 Simulation study: sample means from a strongly skewed population
nutrition
The data set Nutrition contains nutrition information on a population of 961 foods.
Solution
For a skewed population, we have seen that the sampling distribution of the sample mean becomes approximately normal as the sample size reaches 30. For a less skewed population, we can expect that the sampling distribution of approximates a normal distribution for smaller sample sizes. Our simulation study has shown us that regardless of the population, the sampling distribution of the sample mean becomes approximately normal as the sample size gets larger. We can then combine this statement with Fact 3 (page 399) to form the Central Limit Theorem for Means.
Central Limit Theorem for Means
When random samples of size are taken from a population with mean and standard deviation , the sampling distribution of the sample mean becomes approximately normal as the sample size gets larger, regardless of the shape of the population.
How large does the sample size have to be before the Central Limit Theorem for Means takes effect? In general, it depends on the degree of symmetry, or skewness, of the population. In the simulation study (Figure 4), we saw that the sampling distribution of was approximately normal, even for a skewed population when . Thus, we shall abide by the following rule of thumb.
Rule of Thumb for When to Use the Central Limit Theorem for Means
We consider as large enough to apply the Central Limit Theorem for Means for any population.
Developing Your Statistical Sense
The Central Limit Theorem
The Central Limit Theorem (CLT) is one of the most important results in statistics. Worldwide, much statistical inference is based on the CLT. It actually makes fairly intuitive sense, doesn't it? If we find the mean of a sample of data values, in many cases the extreme values will tend to balance out. However, remember that the mean is very sensitive to outliers. In a small sample, there may not be enough nonextreme values to balance the influence of the outliers. This is what was happening early in the potassium simulation (for example, Figure 4a). However, as the sample sizes increase, the influence of extreme values diminishes and the resulting sample means start to migrate toward the center.
Combining Fact 3 and the Central Limit Theorem for Means, we can identify three possible cases for the sampling distribution of .
Three Possible Cases for the Sampling Distribution of the Sample Mean
Of course, in the real world, no one will tell you which of the three cases applies. You need to investigate the assumptions of each of the cases to determine for yourself which one applies.
EXAMPLE 4 Is the sampling distribution of normal?
For each of (a)–(c), determine whether the sampling distribution of is normal, approximately normal, or unknown.
Solution
NOW YOU CAN DO
Exercises 17–26
3 Finding Probabilities Using a Sampling Distribution
We know that the sampling distribution of the sample mean is normal when the population is normally distributed, so we can use the techniques of Section 6.5 to answer questions about the means of samples taken from normal populations.
EXAMPLE 5 Finding probabilities for the sample mean: Comparing with
Suppose the number of “likes” for Facebook pages is normally distributed with mean 70 likes and standard deviation 10 likes.
Solution
This is a normal probability problem, which we learned how to do in Section 6.5
using Case 2 from Table 8 in Chapter 6 (page 355), which shows how to find area to the right of a Z-value. Therefore, a 15.87% probability exists that a randomly chosen student will like more than 80 Facebook pages (Figure 5a).
It is especially important when answering sampling distribution problems to draw graphs of the original distribution of and the sampling distribution of . Always draw graphs when solving probability problems. They bring in the “artistic side” of your brain to help the “analytic side” solve the problem.
We are given and . So, by Fact 1, . And by Fact 2,
Next, we are given that the population of Facebook likes is normal. Therefore, by Fact 3, the sampling distribution of is distributed as normal (70, 2) (see Figure 5b). This represents Case 1 from the Three Cases above.
Applying Fact 4,
We need to standardize the value of 80 page likes, too.
Thus,
as shown in Figure 5b. Because is standard normal, nearly all observations lie between −3 and 3. Thus, the table does not go up to 5 because the probabilities are so close to 0. The TI-83 provides the more precise probability of , or about 3 in 10 million.
NOW YOU CAN DO
Exercises 27–32.
YOUR TURN#2
For the scenario in Example 5, find the probability that a sample of 25 Facebook users will have a mean number of page likes that is less than 75.
(The solution is shown in Appendix A.)
What Does This Probability Mean?
There is essentially no chance that the sample mean number of page likes will be greater than 80. Compare this to the nearly 16% chance that a particular Facebook user's number of page likes would be above 80. Figure 5 shows the graphs of the distributions of (a) the individual Facebook users, and (b) the means of sets of 25 users. Both distributions are centered at , but the standard deviations differ. The arrow in Figure 5a represents the standard deviation of , and it shows that is only 1 standard deviation above the mean . The arrows in Figure 5b represent the standard error of the mean, , and they illustrate that lies 5 standard errors above the mean . Thus, group means are less variable than individual data values.
In Chapter 6, we found the percentiles of normally distributed random variables. The sampling distribution of is normal, so we are able to find the percentiles of the 's, too. Once the appropriate -value is found, we use the following equation to transform the -value into an -value:
EXAMPLE 6 Finding a value of , given a probability or area
Use the information in Example 5 to do the following:
Solution
The 95th percentile of the sample mean number of Facebook likes is the value of with area 0.95 to the left of it.
We want the 95th percentile, so we seek 0.95 on the inside of the table. Because 0.95 is not in the table, we take the closest value. The two closest values, 0.9495 and 0.9505, are equally close, so we split the difference. Working backward from 0.9495, we find , and for 0.9505 we find . Splitting the difference, we get . This value of is the 95th percentile of the standard normal distribution.
Because we are looking for a sample mean number of page likes, 1.645 is probably not the answer. We need to “unstandardize” by transforming this value of to an -value:
Thus, the 95th percentile of the sample means for the number of Facebook page likes is 73.29.
This is just another way of asking for the 5th and 95th percentiles, which we found in parts (a) and (b). (See Figure 6.) The answer is 66.71 and 73.29.
We seek , as shown in Figure 6. Proceeding with the calculations, we have, as expected,
NOW YOU CAN DO
Exercises 33–58.
YOUR TURN#3
Using the information in Example 5, find the 2.5th percentile of the mean number of Facebook page likes.
(The solution is shown in Appendix A.)
EXAMPLE 7 Finding probabilities using the Central Limit Theorem for Means
fuelefficiency
The data set FuelEfficiency contains information about the fuel efficiency of more than one thousand 2014 model year vehicles (Source: fueleconomy.gov). Figure 7 shows a histogram of the city miles per gallon (city mpg) of these vehicles, illustrating that the distribution is not normal, but somewhat right-skewed. The mean city mpg for all vehicles is , and the standard deviation is . Find the probability that a random sample of size vehicles will have a mean city mpg greater than 21.
Solution
Clearly, the population is not normal, but the sample size is large enough, so the Central Limit Theorem applies. The sampling distribution of the sample mean is approximately normal. This is Case 2 from the Three Cases (page 401). Next, we need to find and . Facts 1 and 2 tell us that
Therefore, as the CLT indicates, the sampling distribution of is approximately normal . We are then left to solve a normal probability problem using the methods of Section 6.5. Figure 8 shows the sampling distribution of and the probability we are interested in, . Using Fact 4, we standardize:
Thus, , as shown in Figure 9. We therefore look up in the table and subtract this table area (0.8413) from 1 to get the desired tail area:
The probability is 0.1587 that a random sample of 36 vehicles will have a mean city mpg greater than 21.
NOW YOU CAN DO
Exercises 59–70
YOUR TURN#4
For the situation in Example 7, find the probability that a random sample of size vehicles will have a mean city mpg that is less than 18.5.
(The solution is shown in Appendix A.)
EXAMPLE 8 Finding two symmetric sample mean values using the Central Limit Theorem for Means
Refer to Example 7. If we take samples of size , find the two symmetric values of that contain the middle 95% of sample mean city mpg values.
Solution
Here, we use the instructions from the Finding Percentiles for Any Normal Distribution portion of the Section 6.5 Step-by-Step Technology Guide on page 380. We have and . We want the middle 95% of sample means, so we ask the TI-83/84 calculator to give us the sample mean with 2.5% to the left of it and the sample mean with 97.5% to the left of it, as shown in Figure 10. We get as the 2.5th percentile of sample means and as the 97.5th percentile of sample means. These are the two symmetric values of the sample mean that contain the middle 95% of all sample mean city mpg values.
NOW YOU CAN DO
Exercises 71–80.
YOUR TURN#5
For the situation in Example 7, if we take samples of size , find the two symmetric values of that contain the middle 90% of sample mean city mpg values.
(The solution is shown in Appendix A.)
EXAMPLE 9 Sometimes there is insufficient information to solve the problem
Using the same data set as in Example 8, suppose the sample size is only . Now try again to find the probability that a random sample of size vehicles will have a mean city mpg greater than 21.
Solution
The population is skewed (not normal) and the sample size is less than the minimum required to apply the Central Limit Theorem. Therefore, we have insufficient information to conclude that the sampling distribution of the sample mean is either normal or approximately normal. This is Case 3 from the Three Cases (page 401). Unfortunately, we cannot find the probability that a random sample of vehicles will have a mean city mpg greater than 21.
Trial of the Pyx: How Much Gold Is in Your Gold Coins?
Medieval English kings devised a procedure to ensure that the coins of the realm contained the proper amount of gold. A sample of 100 of the gold coins that were cast each year was placed in a ceremonial box called the Pyx. At the chosen time, the Company of Goldsmiths jury weighed the gold coins. The mean weight of the coins was supposed to be 128 grams. If the mean weight was much less than 128 grams, the jury concluded that the Master of the Mint was cheating the crown by pocketing the excess gold, and he was severely punished. If the mean weight of the coins was within 0.32 gram of the expected 128 grams, the jury accepted the year's gold as pure. Thus, the mean weight had to lie between 127.68 grams and 128.32 grams.
Problem 1. Can we estimate what the jury used for a standard deviation?
Solution to Problem 1. Let's assume that “much less than” indicated a measurement that is 2 or more standard deviations below average. For the sampling distribution of , then, this would indicate a range of between 127.68 and the mean 128. Therefore, . And therefore, by the Empirical Rule, for instance, approximately 95% of the sample mean observations for the Trial of the Pyx would have been between 127.68 and 128.32. Because , it follows that .
Problem 2. What were the chances that the Master of the Mint would have been caught and punished if he were in fact cheating the throne?
Solution to Problem 2. What if the Master of the Mint set the mean amount of gold per coin in the population of all coins to be instead of the required 128, shortchanging the crown by a tenth of a gram of gold per coin? The jury would never have noticed this, would they?
Let's calculate the probability that the Master of the Mint would have passed the Trial of the Pyx if the mean amount of gold per coin had been only 127.9 grams. We've seen that the Master of the Mint would have passed the Trial of the Pyx if . Now, because 100 is a large sample size, the Central Limit Theorem tells us that the sampling distribution of is approximately normal, with
Standardizing using Fact 5:
That is, the chances of the crown accepting the coins as pure, even if the Master of the Mint had been shortchanging by a tenth of a gram per coin, were over 91% (Figure 11).
Note: Sir William Sharington, 1493–1553, Master of the Mint during the turbulent Tudor era in England. He debased the currency, issued worthless coinage, and diverted the real gold to fund Thomas Seymour's conspiracy to topple the government and seize young King Edward VI. Sharington was arrested in 1548 or 1549, but he later received pardon and became Sheriff of Wiltshire for a short time before he died.
Problem 3. Would the Master of the Mint have been satisfied with this small amount of debasement? Would he have quit while he was ahead?
Solution to Problem 3. No way! The following year, the Master of the Mint decided to debase the currency even further, setting the mean amount of gold in the coins to be .
We need to find the probability of the Master passing the Trial of the Pyx if the mean amount of gold in a coin was 127.3 grams instead of the required 128 grams per coin. We use the same calculations, with . Standardizing:
Then, .
In other words, the Master of the Mint actually would have stood very little chance—less than 1% probability—of passing the Trial of the Pyx if he cheated by this much (Figure 12).
England is a great country for retaining fine old traditions. Today, England's Company of Goldsmiths still operates the London Assay Office where the purity of the kingdom's coin is tested at the annual Trial of the Pyx.