Graphs That Depict Numerical Information

Graphs can also be used as a convenient way to summarize and display data without assuming some underlying causal relationship. Graphs that simply display numerical information are called numerical graphs. Here we will consider four types of numerical graphs: time-series graphs, scatter diagrams, pie charts, and bar graphs. These are widely used to display real, empirical data about different economic variables because they often help economists and policy-makers identify patterns or trends in the economy. But as we will also see, you must be careful not to misinterpret or draw unwarranted conclusions from numerical graphs. That is, you must be aware of both the usefulness and the limitations of numerical graphs.

Figure2A-8Time-Series Graph Time-series graphs show successive dates on the x-axis and values for a variable on the y-axis. This time-series graph shows real gross domestic product per capita, a measure of a country’s standard of living, in Canada from 1980 to late 2011.
Source: Statistics Canada.

Types of Numerical Graphs

A time-series graph has dates on the horizontal axis and values of a variable that occurred on those dates on the vertical axis.

You have probably seen graphs in newspapers that show what has happened over time to economic variables such as the unemployment rate or stock prices. A time-series graph has successive dates on the horizontal axis and the values of a variable that occurred on those dates on the vertical axis. For example, Figure 2A-8 shows real gross domestic product (GDP) per capita—a rough measure of a country’s standard of living—in Canada from 1980 to 2011. A line connecting the points that correspond to real GDP per capita for each calendar quarter during those years gives a clear idea of the overall trend in the standard of living over these years.

Figure2A-9Scatter Diagram In a scatter diagram, each point represents the corresponding values of the x- and y-variables for a given observation. Here, each point indicates the GDP per capita and the amount of carbon emissions per capita for a given country for a sample of 34 countries in a single year. The upward-sloping fitted line here is the best linear approximation of the general relationship between the two variables.
Source: OECD Factbook, 2012.
Figure2A-10Pie Chart A pie chart shows the percentages of a total amount that can be attributed to various components. This pie chart shows the percentage of the unemployed in Canada in 2012 by age group. Source: Statistics Canada.

A scatter diagram shows points that correspond to actual observations of the x- and y-variables. A curve is usually fitted to the scatter of points.

Figure 2A-9 is an example of a different kind of numerical graph. It represents information from a sample of 34 countries in a single year on the standard of living, again measured by GDP per capita, and the amount of carbon emissions per capita, a measure of environmental pollution. Each point here indicates an average resident’s standard of living and his or her annual carbon emissions for a given country. The points lying in the upper right of the graph, which show combinations of a high standard of living and high carbon emissions, represent economically advanced countries such as Canada. Points lying in the bottom left of the graph, which show combinations of a low standard of living and low carbon emissions, represent economically less developed countries such as Indonesia and Chile. The pattern of points indicates that there is generally a positive relationship between living standard and carbon emissions per capita: on the whole, people create more pollution in countries with a higher standard of living. This type of graph is called a scatter diagram, a diagram in which each point corresponds to an actual observation of the x-variable and the y-variable. In scatter diagrams, a curve is typically fitted to the scatter of points; that is, a curve is drawn that approximates as closely as possible the general relationship between the variables. As you can see, the fitted line in Figure 2A-9 is upward sloping, indicating the underlying positive relationship between the two variables. Scatter diagrams are often used to show how a general relationship can be inferred from a set of data.

A pie chart shows how some total is divided among its components, usually expressed in percentages.

A pie chart shows the share of a total amount that is accounted for by various components, usually expressed in percentages. For example, Figure 2A-10 is a pie chart that depicts what percentage of all unemployed in Canada in 2012 came from different age groups. The fraction of unemployed is the highest among people in the 15 to 24 years age group; about 30% of the unemployed in 2012 fell into this age group.

Bar graphs use bars of various heights or lengths to indicate values of a variable. In the bar graph in Figure 2A-11, the bars show the percent change in the number of unemployed workers in Canada between 2010 and 2011 by different immigrant status—those born in Canada, immigrants landed in Canada more than 10 years earlier, immigrants landed in Canada between 5 and 10 years earlier, and immigrants landed in Canada fewer than 5 years earlier. Exact values of the variable that is being measured may be written at the end of the bar, as in this figure. For instance, the number of unemployed workers who were born in Canada fell by 4.9% between 2010 and 2011.

Figure2A-11Bar Graph A bar graph measures a variable by using bars of various heights or lengths. This bar graph shows the percent decrease in the number of unemployed workers in Canada between 2010 and 2011 by different immigrant status.
Source: Statistics Canada.

A bar graph uses bars of varying height or length to show the comparative sizes of different observations of a variable.

Problems in Interpreting Numerical Graphs Although the beginning of this appendix emphasized that graphs are visual images that make ideas or information easier to understand, graphs can be constructed (intentionally or unintentionally) in ways that are misleading and can lead to inaccurate conclusions. This section raises some issues that you should be aware of when you interpret graphs.

Features of Construction Before drawing any conclusions about what a numerical graph implies, you should pay attention to the scale, or size of increments, shown on the axes. Small increments tend to visually exaggerate changes in the variables, whereas large increments tend to visually diminish them. So the scale used in construction of a graph can influence your interpretation of the significance of the changes it illustrates—perhaps in an unwarranted way.

Take, for example, Figure 2A-12, which shows real GDP per capita in Canada from 1980 to 1984 using increments of $500. You can see that real GDP per capita fell from $26 081 in 1981 to $25 035 in 1982. A decrease, sure, but is it as enormous as the scale chosen for the vertical axis makes it seem? If you go back and re-examine Figure 2A-8, which shows real GDP per capita in Canada from 1980 to 2011, you can see that this would be a misguided conclusion. Figure 2A-8 includes the same data shown in Figure 2A-12, but it is constructed with a scale having increments of $5000 rather than $500. From it, you can see that the fall in real GDP per capita from 1981 to 1982 was, in fact, relatively insignificant. In fact, the story of real GDP per capita—a measure of the standard of living—in Canada is mostly a story of ups, not downs. This comparison shows that if you are not careful to factor in the choice of scale in interpreting a graph, you can arrive at very different, and possibly mistaken, conclusions.

Figure2A-12Interpreting Graphs: The Effect of Scale Some of the same data for the years 1980 and 1984 used in Figure 2A-8 are represented here, except that here they are shown using increments of $500 rather than increments of $5000. As a result of this change in scale, changes in the standard of living look much larger in this figure compared to Figure 2A-8.
Source: Statistics Canada.

An axis is truncated when some of the values on the axis are omitted, usually to save space or to make changes to a variable appear larger.

Related to the choice of scale is the use of truncation in constructing a graph. An axis is truncated when part of the range is omitted. This is indicated by two slashes (//) in the axis near the origin. You can see that the vertical axis of Figure 2A-12 has been truncated—some of the range of values from 0 to $24 000 has been omitted and a // appears in the axis. Truncation saves space in the presentation of a graph and allows smaller increments to be used in constructing it. As a result, changes in the variable depicted on a graph that has been truncated appear larger compared to a graph that has not been truncated and that uses larger increments.

You must also pay close attention to exactly what a graph is illustrating. For example, in Figure 2A-11, you should recognize that what is being shown here are percentage changes in the number of unemployed, not numerical changes. The percentage change in the number of unemployed immigrants landed between 5 and 10 years earlier decreased by the highest percentage, 11.9% in this example. If you were to confuse numerical changes with percentage changes, you would erroneously conclude that the largest absolute reduction of unemployed workers were immigrants landed between 5 and 10 years earlier. But, in fact, Figure 2A-11 shows that the greatest reduction of unemployed workers were those who were born in Canada: the total number of Canadian-born unemployed workers fell by 52 300 workers, which is greater than the decrease in the number of unemployed immigrants landed between 5 and 10 years earlier, which is 8300 in this example.

An omitted variable is an unobserved variable that, through its influence on other variables, may create the erroneous appearance of a direct causal relationship among those variables.

Omitted Variables From a scatter diagram that shows two variables moving either positively or negatively in relation to each other, it is easy to conclude that there is a causal relationship. But relationships between two variables are not always due to direct cause and effect. Quite possibly an observed relationship between two variables is due to the unobserved effect of a third variable on each of the other two variables. An unobserved variable that, through its influence on other variables, creates the erroneous appearance of a direct causal relationship among those variables is called an omitted variable. For example, in Manitoba, a greater amount of snowfall during a given week will typically cause people to buy more snow shovels. It will also cause people to buy more lock de-icer. But if you omitted the influence of the snowfall and simply plotted the number of snow shovels sold versus the number of bottles of lock de-icer sold, you would produce a scatter diagram that showed an upward tilt in the pattern of points, indicating a positive relationship between snow shovels sold and lock de-icer sold. To attribute a causal relationship between these two variables, however, is misguided; more snow shovels sold does not cause more lock de-icer to be sold, or vice versa. They move together because they are both influenced by a third, determining, variable—the weekly snowfall, which is the omitted variable in this case. So before assuming that a pattern in a scatter diagram implies a cause-and-effect relationship, it is important to consider whether the pattern is instead the result of an omitted variable. Or to put it succinctly: correlation does not imply causation.

The error of reverse causality is committed when the true direction of causality between two variables is reversed.

Reverse Causality Even when you are confident that there is no omitted variable and that there is a causal relationship between two variables shown in a numerical graph, you must also be careful that you don’t make the mistake of reverse causality—coming to an erroneous conclusion about which is the dependent and which is the independent variable by reversing the true direction of causality between the two variables. For example, imagine a scatter diagram that depicts the grade point averages (GPAs) of 20 of your classmates on one axis and the number of hours that each of them spends studying on the other. A line fitted between the points will probably have a positive slope, showing a positive relationship between GPA and hours of studying. We could reasonably infer that hours spent studying is the independent variable and that GPA is the dependent variable. But you could make the error of reverse causality: you could infer that a high GPA causes a student to study more, whereas a low GPA causes a student to study less.

The significance of understanding how graphs can mislead or be incorrectly interpreted is not purely academic. Policy decisions, business decisions, and political arguments are often based on interpretation of the types of numerical graphs that we’ve just discussed. Problems of misleading features of construction, omitted variables, and reverse causality can lead to very important and undesirable consequences.