Question 5.34

4. Figure 5.29 is a histogram of the lengths of words used in Shakespeare’s plays. Because there are so many words in the plays, the vertical axis of the graph is the percentage of words that are of each length, rather than the count. In this case, the class intervals are centered at integer values, since the data consist only of counting numbers.

What is the overall shape of this distribution? What does this shape say about word lengths in Shakespeare? Do you expect other authors to have word-length distributions of the same general shape? Why?

image
Figure 5.31: Figure 5.29 Relative frequency histogram of the lengths of words used in Shakespeare’s plays (the percentage, rather than the count, of words that are of each length).
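
To make the construction of such a histogram concrete, the following is a minimal sketch of how the percentage of words of each length could be computed. The function name `word_length_percentages`, the punctuation-stripping rule, and the one-line sample text are assumptions made for illustration; this is not the data or the method behind the figure.

```python
from collections import Counter

def word_length_percentages(text):
    """Return {word length: percentage of words with that length}.

    Hypothetical helper: words are split on whitespace and common
    punctuation is stripped from both ends (an illustrative assumption).
    """
    words = [w.strip(".,;:!?\"'()") for w in text.split()]
    lengths = [len(w) for w in words if w]  # drop tokens that were only punctuation
    counts = Counter(lengths)
    total = len(lengths)
    # Each integer length k plays the role of a class interval centered at k,
    # i.e. the bar covering k - 0.5 up to k + 0.5.
    return {k: 100.0 * counts[k] / total for k in sorted(counts)}

# A tiny illustrative sample, far too small to show the real distribution.
sample = "To be, or not to be, that is the question"
percentages = word_length_percentages(sample)

for length, pct in percentages.items():
    print(f"length {length}: {pct:.1f}%")
```

Because each bar's height is a percentage of the total, the heights sum to 100 regardless of how many words the text contains, which is what makes relative frequency histograms convenient for comparing authors whose works differ greatly in length.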