OBJECTIVES By the end of this section, I will be able to …
1 Point Estimate ˆp of the Population Proportion p
So far, we have dealt with interval estimates of the population mean μ only. However, we may also be interested in an interval estimate for the population proportion of successes, p. Recall from Section 7.2 that the sample proportion of successes
ˆp=xn=number of successessample size
is a point estimate of the population proportion p.
EXAMPLE 17 Point estimate ˆp of the population proportion p
Suppose that a random sample of 100 Starbucks' sales transactions is taken, and that 10 of these transactions were made using a cell phone. Calculate the sample proportion ˆp, and use it as a point estimate of the population proportion p.
Solution
We have n=100 transactions and x=10. Thus,
ˆp=xn=10100=0.1
The point estimate of the population proportion p of Starbucks' transactions made using a cell phone is 0.1. (This sample proportion of 0.1 reflects the results from a survey made by the Wall Street Journal in 2013.16)
NOW YOU CAN DO
Exercises 3–6.
YOUR TURN#11
For the following values of n and x, calculate the sample proportion ˆp, and use it as a point estimate of the population proportion p.
(The solutions are shown in Appendix A.)
Of course, different samples of Starbucks' customers may turn up different sample proportions ˆp. These are point estimates, and thus they carry no measure of confidence in their accuracy. The point estimates are probably close to the true values, but it's possible that they are not. They may be far from the true values. Only by using confidence intervals can we make probability statements about the accuracy of the estimates.
2 Z Interval for the Population Proportion p
Alternatively, the conditions may be expressed as follows: x≥5 and (n−x)≥5, that is, the number of successes ≥5 and the number of failures ≥5. Feel free to use these alternative conditions when the calculations are easier.
Recall the Central Limit Theorem for Proportions in Section 7.2.
Central Limit Theorem for Proportions
The sampling distribution of the sample proportion ˆp follows an approximately normal distribution with mean μˆp=p and standard deviation σˆp=√p⋅qn when both the following conditions are satisfied: (1) n⋅p≥5 and (2) n⋅q≥5, where q=1—p.
We can use the Central Limit Theorem for Proportions to construct confidence intervals for the population proportion p. Because the confidence interval for p is based on the standard normal Z distribution, it is called the Z interval for the population proportion p. Because p is unknown, the conditions and the formula for σˆp substitute ˆp for p.
Z Interval for p
The Z interval for p may be performed only if both the following conditions are met: n⋅ˆp≥5 and n⋅ˆq≥5 (alternatively, x≥5 and (n−x)≥5) where ˆq=1−ˆp. When a random sample of size n is taken from a binomial population with unknown population proportion p, the 100(1−α)% confidence interval for p is given by
lower bound=ˆp−Zα/2√ˆp⋅ˆqnupper bound=ˆp+Zα/2√ˆp⋅ˆqn
Alternatively,
ˆp±Zα/2√ˆp⋅ˆqn
where ˆp is the sample proportion of successes, ˆq=1−ˆp, n is the sample size, and Zα/2 depends on the confidence level.
For convenience, we repeat Table 1 here, showing the Zα/2 values for the most common confidence levels.
Confidence level | α | α/2 | Zα/2 |
---|---|---|---|
80% | 0.20 | 0.10 | 1.28 |
90% | 0.10 | 0.05 | 1.645 |
95% | 0.05 | 0.025 | 1.96 |
99% | 0.01 | 0.005 | 2.576 |
EXAMPLE 18 Z interval for the population proportion p
Using the Starbucks' data from Example 17, (a) verify that the conditions for constructing the Z interval for p have been met, and (b) construct a 95% confidence interval for the population proportion of all Starbucks' transactions that are made using a cell phone.
Solution
lower bound=ˆp−Zα/2√ˆp⋅ˆqn=0.1−1.96√0.1(0.9)100=0.1−1.96(0.03)=0.1−0.0588=0.0412upper bound=ˆp+Zα/2√ˆp⋅ˆqn=0.1+1.96√0.1(0.9)100=0.1+1.96(0.03)=0.1+0.0588=0.1588
We are 95% confident that the population proportion of Starbucks' sales transactions made using a cell phone lies between 0.0412 and 0.1588. (See Figure 27.)
NOW YOU CAN DO
Exercises 7–20.
YOUR TURN#12
For the following values of n and x, (i) confirm that the conditions have been met, and (ii) construct a 95% confidence interval for the population proportion p.
(The solutions are shown in Appendix A.)
EXAMPLE 19 Z intervals for p using technology
A Pew Research Center survey of 1895 Internet users found 1118 who agree that “online dating is a good way to meet people.” Use technology to find a 95% confidence interval for the population proportion of all Internet users who agree that online dating is a good way to meet people.
Solution
We use the instructions provided in the Step-by-Step Technology Guide at the end of this section (page 469). The results for the TI-83/84 in Figure 28 display the 95% confidence interval for the population proportion of Americans who agree that online dating is a good way to meet people to be
(lower bound=0.56783, upper bound=0.61212)
They also show the sample proportion ˆp=0.5899736148 and the sample size n=1895.
The results for Minitab are shown in Figure 29. Minitab provides the sample number of successes X=1118, the sample size n=1895, the sample proportion ˆp=0.589974, and the 95% confidence interval for p(0.567829,0.612118) .
The results for CrunchIt! are shown in Figure 30. CrunchIt! provides the sample size n=1895, the sample number of successes X=1118, the sample proportion ˆp=0.59, and the 95% confidence interval for p(0.5678, .
3 Margin of Error for the Interval for
For the interval for the population proportion , the margin of error is given as follows.
Margin of Error for the Interval for
The margin of error for a interval for can be interpreted as follows:
“We can estimate the population proportion to within with confidence.”
Note that, just like the confidence interval for , the interval for takes the form
EXAMPLE 20 Margin of error: The famous “plus or minus 3 percentage points”
Hardly a day goes by without some new poll being published. Polls influence the choice of candidates and the direction of their policies, especially during election campaigns. For example, the Gallup Organization polled 1012 American adults, asking them, “Do you think there should or should not be a law that would ban the possession of handguns, except by the police and other authorized persons?” Of the 1012 randomly chosen respondents, 374 said that there should be such a law.
Solution
The sample size is . The observed proportion is , so
The confidence level of 95% implies that our equals 1.96 (from Table 8.1). Thus, the margin of error equals
We can estimate the population proportion of all Americans who think that there should be such a law to within with 95% confidence.
Note: Here we see the “plus or minus 3 percentage points.”
Thus, we are 95% confident that the population proportion of all American adults who think that there should be such a law lies between 34% and 40%.
NOW YOU CAN DO
Exercises 21–32.
YOUR TURN#13
Refer to Your Turn #12 after Example 18. Find and interpret the margin of error for the following:
(The solutions are shown Appendix A.)
Developing Your Statistical Sense
Famous “Plus or Minus 3 Points”
Note that this confidence interval was obtained by adding and subtracting 3% from the 37% point estimate. That is, the poll has a margin of error of . This is the famous “plus or minus 3 percentage points” used in many news reports. However, newscasters rarely announce the confidence level of the poll. National pollsters most often use 95% as their confidence level and usually try to select the sample size necessary to create a margin of error of about 3%. We learn how they do this next.
4 Sample Size for Estimating the Population Proportion
Next, we consider the question: How large a sample size do I need to estimate the population proportion to within margin of error with confidence? The margin of error of the confidence interval for proportions equals
Solving for gives us
Unfortunately, Equation 1 depends on prior knowledge of . So, if we have such information about available from some earlier sample, then we use Equation 1 to determine the required sample. However, what if we do not know the value of ?
Figure 31 plots the sample size requirements for a 95% confidence interval for , with a desired margin of error of 0.03, for values of ranging from 0.01 to 0.99, representing all sample proportions from 1% to 99%. Note that the plot is symmetric, and therefore the largest required sample size occurs at the midpoint . Thus, is the most conservative value for . When the actual value of is not known, we use the following formula:
Sample Size for Estimating a Population Proportion
When is known, the sample size needed to estimate the population proportion to within a margin of error with confidence is given by
where is the value associated with the desired confidence level, is the desired margin of error, and is the sample proportion of successes available from some earlier sample and Round up to the next integer.
When is unknown, we use
These formulas are illustrated using the following two examples.
EXAMPLE 21 Sample size for estimating when is known
Refer to Example 20. Suppose that the Gallup Organization now wanted to estimate the population proportion of those who think there should be a law that would ban the possession of handguns to within a margin of error of with 95% confidence. How large a sample size is needed?
Solution
From Example 20, we have the sample proportion . The confidence level of 95% implies that our , and the desired margin of error is . Thus, the required sample size is
Rounding up, this gives us a minimum required sample size of 8955. The smaller margin of error requires a larger sample size.
NOW YOU CAN DO
Exercises 33–38.
YOUR TURN#14
For the situation in Example 21, suppose Gallup wants the estimate to be within a margin of error of 0.03 with 99% confidence. How large a sample size is needed?
(The solution is shown in Appendix A.)
EXAMPLE 22 Sample size for estimating when is unknown
Suppose your state wants to take a poll on the proportion of its citizens who support a single statewide primary instead of primaries for each party. No poll on this subject has been taken before, so no prior information is available on the value of the sample proportion, . How large a sample size does the state need to estimate the proportion to within plus or minus 3 percentage points () with 95% confidence?
Solution
The 95% confidence implies that the value for is 1.96. Because no information is available about the value of the population proportion of all state citizens who support a single statewide primary, we use 0.5 as our most conservative value of :
So if the pollsters want to estimate the population proportion of all state citizens who support a single statewide primary to within 3% with 95% confidence, they will need a sample of 1068 voters (don't forget to round up!).
NOW YOU CAN DO
Exercises 39–46.
YOUR TURN#15
For the scenario in Example 22, suppose the state does not have the funds to contact 1068 voters, and it wants the estimate to be within a margin of error of 0.05 with 95% confidence. How large a sample size is needed?
(The solution is shown in Appendix A.)