For Exercises 5.57 and 5.58, see page 276; and for 5.59, see page 278.
5.60 What population and sample?
Twenty fourth-year students from your college who are majoring in English are randomly selected to be on a committee to evaluate changes in the statistics requirement for the major. There are 76 fourth-year English majors at your college. The current rules say that a statistics course is one of four options for a quantitative competency requirement. The proposed change would be to require a statistics course. Each of the committee members is asked to vote Yes or No on the new requirement.
5.61 Simulating Poisson counts.
Most statistical software packages can randomly generate Poisson counts for a given . In this exercise, you will generate 1000 Poisson counts for .
282
5.61
Software, answers will vary. (a) The shape should be roughly symmetric. (b) The mean should be close to 9. (c) The standard deviation should be close to .
5.62 Simulate a sampling distribution for .
In the previous exercise, you were asked to use statistical software’s capability to generate Poisson counts. Here, you will use software to generate binomial counts from the distribution. We can use this fact to simulate the sampling distribution for . In this exercise, you will generate 1000 sample proportions for and .
5.63 Simulate a sampling distribution.
In Exercise 1.72 (page 41) and Example 4.26 (pages 213–214), you examined the density curve for a uniform distribution ranging from 0 to 1. The population mean for this uniform distribution is 0.5 and the population variance is 1/12. Let’s simulate taking samples of size 2 from this distribution.
Use the RAND() function in Excel or similar software to generate 100 samples from this distribution. Put these in the first column. Generate another 100 samples from this distribution, and put these in the second column. Calculate the mean of the entries in the first and second columns, and put these in the third column. Now, you have 100 samples of the mean of two uniform variables (in the third column of your spreadsheet).
5.63
Software, answers will vary. (a) The shape should be roughly Normal. (b) The mean should be close to 0.5. (c) The standard deviation should be close to .
5.64 What is the effect of increasing the number of simulations?
Refer to the previous exercise. Increase the number of simulations from 100 to 500. Compare your results with those you found in the previous exercise. Write a report summarizing your findings. Include a comparison with the results from the previous exercise and a recommendation regarding whether or not a larger number of simulations is needed to answer the questions that we have regarding this sampling distribution.
5.65 Change the sample size to 12.
Refer to Exercise 5.63. Change the sample size to 12 and answer parts (a) through (c) of that exercise. Note that the theoretical mean of the sampling distribution is still 0.5 but the standard deviation is the square root of 1/144 or, simply, 1/12. Investigate how close your simulation estimates are to these theoretical values. In general, explain the effect of increasing the sample size from two to 12 using the results from Exercise 5.63 and what you have found in this exercise.
5.65
Software, answers will vary. The distribution should be more Normal, the mean should be potentially closer to 0.5, but the spread will have decreased to .
283
5.66 Increase the number of simulations.
Refer to the previous exercise and to Exercise 5.64. Use 500 simulations to study the sampling distribution of the mean of a sample of size 12 from a uniform distribution. Write a summary of what you have found.
5.67 Normal distributions.
Many software packages generate standard Normal variables by taking the sum of 12 uniform variables and subtracting 6.
5.67
Software, answers will vary. The distribution should look Normal. The mean should be close to 0, and the standard deviation should be close to 1.
5.68 Is it unbiased?
A statistic has a sampling distribution that is somewhat skewed. The median is 5 and the quartiles are 2 and 10. The mean is 8.
5.69 The effect of the sample size.
Refer to Exercise 5.63, where you simulated the sampling distribution of the mean of two uniform variables, and Exercise 5.65, where you simulated the sampling distribution of the mean of 12 uniform variables.
5.69
(a) The simulated mean using 12 uniform variables should have smaller variability than the simulated mean using only 2 uniform variables. (b) The simulations should have confirmed this.
5.70 What’s wrong?
State what is wrong in each of the following scenarios.
5.71 Describe the population and the sample.
For each of the following situations, describe the population and the sample.
5.71
(a) The population is all students in the United States at four-year colleges. The sample is the 17,096 people surveyed. (b) The population is all restaurant workers. The sample is the 100 people asked. (c) The population all 584 longleaf pine trees. The sample is the 40 trees measured.
5.72 Bias and variability.
Figure 5.15 shows histograms of four sampling distributions of statistics intended to estimate the same parameter. Label each distribution relative to the others as high or low bias and as high or low variability.