EXAMPLE 21 The Mean of the Probability Model for benford’s Law

In Self Check 8, you were asked to create the first digits for 20 fictitious invoice numbers. To ensure these numbers were random, you used a random digits table. That way each integer, 1 through 9, was equally likely to be chosen as a first digit. The table below shows the probability model governing your selection of first digits.

First digit 1 2 3 4 5 6 7 8 9
Probability

The mean of this model is

377

If, on the other hand, legitimate records obey Benford’s law, the distribution of the first digit is

First digit 1 2 3 4 5 6 7 8 9
Probability 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046

The mean of Benford’s model is

The comparison of means between Benford’s law and random digits, , reflects the greater probability of smaller first digits under Benford’s law. Probability histograms for these two models appear in Figure 8.21. Because the histogram for random digits (Figure 8.21a) is symmetric, the mean lies at the center of symmetry. We can’t determine the mean of the right-skewed Benford’s law model precisely by simply looking at Figure 8.21b; calculation is needed.

image
Figure 8.21: Figure 8.21 Probability histograms of two models for first digits in numerical records. The mean of each distribution is marked. (a) Digits are equally likely, (b) Digits follow Benford’s law.