Topic: How Do Taste Perceptions Differ Between Individuals?
Statistical Concepts Covered: In this applet we will discover how to summarize data distributions using measures of central tendency and variability, and learn why picking the right summary statistics is critically important when describing data.
Introduction
When we collect data on individuals’ perceptions of taste, we do not expect every single person to agree on exactly how sweet, sour, bitter, or salty a particular stimulus is. Given this reality, when we analyze data from a group of people, we must have ways to describe a range of results clearly and concisely. Unfortunately, there is no “one size fits all” solution when it comes to describing data. While you might think that describing the average value of a data set is a reasonable approach (and often it is), the truth is that you must understand the shape of your data distribution and the point that you want to convey before you can decide on the best way to represent your data.
In this exercise, you will investigate several data distributions and think about possible ways to describe the data. It will be helpful to review the text’s coverage of graphic representations and descriptive statistics before you begin. In this exercise, we focus on three measures of central tendency—that is, ways to describe an entire data distribution with a single number. You are probably already familiar with these three measures of central tendency: the mean, the median, and the mode. As you explore the data, you’ll see that sometimes the mean value will be a fair and unbiased model of the data, but at other times it may misrepresent the data or even be completely uninformative. As you analyze these data and answer the related questions, you should begin to get a feel for how to select the most representative measure of central tendency for describing your findings.
The data we will use to illustrate these concepts were collected by Dr. Linda Bartoshuk of the University of Florida, as she traveled around the country giving lectures and presentations. She asked her audiences to fill out questionnaires about their perceptions of various tastes and other sensations (e.g., the brightness of the light in the room; their perceived sweetness of a soda; how much they liked or disliked various tastes such as cheddar cheese, grapefruit juice, or buttered popcorn). Over more than 20 years, Dr. Bartoshuk collected data from thousands of individuals. Our goal in this exercise is to accurately summarize some of those results with just a single number.
Statistical Lesson. It may come as a surprise to learn that more than half of the values in a distribution can be below average. In a skewed distribution, the mean value is pulled toward the tail and more than half of the values—and sometimes far more than half of the values—can be above or below the average. In such situations, the mean value may not be the best representation of the entire data set. We will explore this concept more in the following questions.
Statistical Lesson. You may sometimes hear friends or even professors say something like “75% of all students think that they are above average!”, implying that such a distribution is not possible. In fact, it is possible, and it is not even unusual. Whenever you are dealing with a negatively skewed distribution, the mean value is pulled toward the tail, and more than half of the values—and sometimes vastly more than half of the values—can be above the mean (“above average”). For the “strongest pain experienced” data, approximately 55% of the responses are above the mean value. In most Introductory Psychology classes, the exam scores are negatively skewed, so it’s quite likely that far more than half of your classmates will have an above-average score on the next exam!
Statistical Lesson. Another very important aspect of describing data distributions is indicating the variability, or how “spread out” the data are. If we collected data on the ages of the students in your Introduction to Psychology class, there may be a few people who are several years younger or older than the others, but for the most part we would expect the ages to be grouped into a fairly tight range between perhaps 18 and 23 years old. Because the ages are mostly grouped together, we would say that these data have a low variability. But now suppose we asked the members of your class to report the size of their high school graduating class. In this case, we would expect to get a much larger range of answers: Some students may have come from schools with only 20 or 30 graduating seniors, whereas others may have graduated with 2,000 or more other students. Because these values are so spread out, we would say that the data set has a much higher variability. Variability is key concept in statistics, and the next questions will introduce one commonly used measure of variability, the standard deviation of the data.
Congratulations! You have completed this activity.