3.3 Working with Grouped Data

OBJECTIVES By the end of this section, I will be able to …

  1. Calculate the weighted mean.
  2. Estimate the mean for grouped data.
  3. Estimate the variance and standard deviation for grouped data.

1 The weighted Mean

Note: Before tackling this section, you may wish to review Section 2.2, “Graphs and Tables for Quantitative Data” (page 60).

Sometimes, not all the data values in a data set are of equal importance. Certain data values may be assigned greater importance or weight than others when calculating the mean. For example, have you ever figured out what your final grade for a course was based on the percentages listed in the syllabus? What you actually found was the weighted mean of your grades.

149

Note: The weights, , do not have to be percentages, nor do they have to add up to 1.

In the special case when all the weights equal 1, the weighted mean equals the sample mean from Section 3.1.

Weighted Mean

To find the weighted mean:

  1. Multiply each weight, , by its corresponding data value, .
  2. Add up the products to get .
  3. Divide the result by the sum of the weights, .

EXAMPLE 17 weighted mean of course grades

The syllabus for the Introduction to Management course at a local college specifies that the midterm exam is worth 30%, the term paper is worth 20%, and the final exam is worth 50% of your course grade. Now, say you did not get serious about the course until after Halloween, so that you got a 40 on the midterm. You then began working harder, and got a 70 on the term paper. Finally, you remembered that you had to pay for the course again if you did not pass and had to retake it, so you worked really hard for the last month of the course and got a 90 on the final exam. Calculate your course average, that is, the weighted mean of your grades.

Solution

The data values are 40, 70, and 90. The weights are 0.30, 0.20, and 0.50. Your course weighted mean is then calculated as follows:

Because the final exam had the most weight, you were able to raise your course weighted mean to 71, and you passed the course.

NOW YOU CAN DO

Exercises 4–8.

YOUR TURN #10

The author's syllabus for his Business Statistics I course during Summer 2014 stated that the quiz average was worth 50% of the course grade, with the midterm worth 20% and the final exam worth 30%. One of the students had a 90 quiz average, a 70 midterm grade, and an 85 final exam grade. Calculate the student's course grade.

(The solution is shown in Appendix A.)

2 Estimating the Mean for grouped Data

Thus far in Chapter 3, we have computed measures of center and spread from a raw data set. However, data are often reported using grouped frequency distributions. Without the original data, we cannot calculate the exact values of the measures of center and spread. The remainder of this section examines methods for approximating the mean, variance, and standard deviation of grouped data— that is, population data summarized using frequency distributions.

For each class in the frequency distribution, we estimate the class mean using the class midpoint. The class midpoint, denoted , is defned as the mean of two adjoining lower class limits.

The product of the class frequency, , and class midpoint, , is used as an estimate of the sum of the data values within that class. Summing these products across all classes and dividing by the size of the data set thus provides us with an estimated mean for data grouped into a frequency distribution.

150

Note: Even though we are working with population data, we will notate these values using and because we are estimating the values of the mean and standard deviation.

Estimated Mean for Data grouped into a Frequency Distribution

Given a population frequency distribution, the estimated mean for the variable is given by

where and represent the class midpoints and class frequencies, respectively.

EXAMPLE 18 Calculating the estimated mean for grouped data

The first two columns of Table 17 contain the frequency distribution of the number of Americans younger than 85 years old who were living in the United States in 2013, by age group, as reported by the U.S. Census Bureau.

  1. Find the class midpoints.
  2. Calculate the product of each class frequency with its midpoint.
  3. Find the sum of the frequencies, , and the sum of the products, .
  4. Divide by to find the estimated mean age of all Americans under the age of 85.

Solution

  1. The midpoint for the first class (ages 0–20) is the mean of the lower class limits for this class (0) and the adjoining class (20). That is, the midpoint is . Similarly, the midpoint for the second class (ages 20–40) is . The remainder of the class midpoints are calculated in the same way and are shown in Table 17.
  2. We multiply the frequency for the first age group by its midpoint to get . We do the same for the other age groups, as shown in Table 17.
  3. We add up all the frequencies to get . Also, we add up all the products from (b) to obtain .
  4. Finally, obtain the estimated mean, as follows:

    The estimated mean age of all Americans under 85 is 37.4 years.

Table 3.63: Table 17 Frequency distribution of Americans, by age group, in millions
Class: age Frequency Midpoint Product
83.3 10
82.8 30
85.6 50
51.6 72.5
Total

NOW YOU CAN DO

Exercises 9–14.

151

3 Estimating the Variance and Standard Deviation for Grouped Data

We also use class midpoints and class frequencies to calculate the estimated variance for data grouped into a frequency distribution and the estimated standard deviation for data grouped into a frequency distribution.

Estimated Variance and Standard Deviation for Population Data Grouped into a Frequency Distribution

The estimated variance for data grouped into a frequency distribution is given by

and the estimated standard deviation is given by

where represents the class midpoints, represents the class frequencies, and is the estimated mean.

You should carry as many decimal places as you can for the value of when calculating , and for when calculating .

EXAMPLE 19 Calculating the estimated variance and standard deviation for grouped data

Calculate the estimated variance and standard deviation of the ages of Americans under age 85 from Table 17.

Solution

Table 18 contains the calculations required for finding . The variance is therefore estimated as

and the standard deviation is estimated as

Table 3.64: Table 18 Calculating
Class: age Midpoint Frequency
10.0 83.3 37.4 −27.4 62,538.31
30.0 82.8 37.4 −7.4 4,534.13
50.0 85.6 37.4 12.6 13,589.86
72.5 51.6 37.4 35.1 63,571.72

In other words, the age of Americans under 85 typically differs from the mean age of 37.4 years by about 21.8 years.

NOW YOU CAN DO

Exercises 15–20.

152

EXAMPLE 20 Using technology to find the estimated mean, variance, and standard deviation for grouped data

Use the TI-83/84 calculator to find the estimated mean, variance, and standard deviation for the frequency distribution in Table 17.

Solution

Following the instructions in the Step-by-Step Technology Guide, we get the estimated mean, (which we round to 37.4), the estimated standard deviation, (shown in the output as ) , and the estimated variance as .

usa-ages

image