6 Probability Distributions

6.2 Binomial Probability Distribution

OBJECTIVES By the end of this section, I will be able to …

Explain what constitutes a binomial experiment.
Compute probabilities using the binomial probability formula, binomial tables, and technology.
Calculate the mean, variance, and standard deviation of the binomial random variable and find the mode of the distribution.

1 Binomial Experiment

Many different types of discrete probability distributions are used. Perhaps the most important is the binomial distribution, which we will learn about in this section. Life is full of situations where there are only two possible outcomes to a process.

A baby is about to be born. Will it be a boy or a girl?
A basketball player is about to attempt a free throw. Will she make it or miss?
A friend of yours is also taking statistics. Will he pass or fail?

Because situations for which there are only two possible outcomes are so widespread, methods have been developed to make it more convenient to analyze them. These methods begin with the definition of a binomial experiment.


Binomial Experiment

A probability experiment that satisfies the following four requirements is said to be a binomial experiment:

Each trial of the experiment has only two possible mutually exclusive outcomes (or is defined in such a way that the number of outcomes is reduced to two). One outcome is denoted a success and the other a failure.
A fixed number of trials exists, which is known in advance of the experiment.
The experimental outcomes are independent of each other.
The probability of observing a success remains the same from trial to trial.

Many experiments having more than two outcomes can often be defined so that only two outcomes are possible. For example, the answer to a multiple-choice question that has five answer choices may be recorded as either correct or incorrect.

Let's take a moment to discuss what these requirements really mean.

A success denotes simply the outcome in which we are interested, without necessarily implying that the outcome is desirable. For example, for a researcher investigating college dropout rates, a dropout would be considered a success in the context of a binomial experiment.
Tossing a coin 10 times is a binomial experiment because we know the fixed number of trials. A salesman contacting customers one-by-one until he makes a sale is not a binomial experiment because he doesn't know how many customers he will have to contact.
Sampling without replacement would technically violate the independence requirement. However, recall that we may apply the 1% Guideline from Section 5.3, so that when the sample is small compared to the population, successive trials can be considered independent.
Suppose four friends are wondering how many of them will get an A in statistics. This is not a binomial experiment because the four friends presumably do not all have the same probability of success.
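To see why a small sample makes sampling without replacement nearly indistinguishable from independent trials, the following Python sketch compares exact without-replacement probabilities (the hypergeometric distribution, which this section does not otherwise use) with the corresponding binomial probabilities. The population of 10,000 with 3,400 successes and the sample of 36 are hypothetical numbers chosen only for illustration, and the function names are ours.

```python
from math import comb

# Hypothetical setup: a population of 10,000 items, 3,400 of which are "successes".
# We draw a sample of 36 without replacement (well under 1% of the population).
N, K, n = 10_000, 3_400, 36
p = K / N  # proportion of successes in the population

def without_replacement(x: int) -> float:
    """Exact probability of x successes when sampling without replacement."""
    return comb(K, x) * comb(N - K, n - x) / comb(N, n)

def independent_trials(x: int) -> float:
    """Probability of x successes when the draws are treated as independent."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)

# Largest disagreement between the two models over all possible counts.
largest_gap = max(
    abs(without_replacement(x) - independent_trials(x)) for x in range(n + 1)
)
print(largest_gap)  # a very small number: the two models nearly agree
```

With a sample this small relative to the population, treating the trials as independent changes each probability only slightly, which is the practical content of the guideline.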

The outcomes of a binomial experiment, together with their probabilities, generate a special discrete probability distribution called the binomial probability distribution. For binomial probability distributions, each trial always has exactly two possible outcomes, and each outcome has a probability associated with it. The binomial random variable, denoted by $X$, represents the number of successes observed in the $n$ trials. Note that $0 \leq X \leq n$.

EXAMPLE 14 Recognizing binomial experiments

Determine whether each of the following experiments fulfills the conditions for a binomial experiment. If the experiment is binomial, identify the random variable $X$, the number of trials, the probability of success, and the probability of failure. If the experiment is not binomial, explain why not.

(a) A fisherman is going fishing and will continue to fish until he catches a rainbow trout.
(b) We flip a fair coin three times and observe the number of heads.
(c) A market researcher at a shopping mall is asking consumers whether they use Fib detergent. She asks a sample of four men, one of whom is clearly the employer of the other three.
(d) The National Burglar and Fire Alarm Association reports that 34% of burglars get in through the front door. A random sample of 36 burglaries is taken, and the number of entries through the front door is noted.


Solution

(a) This is not a binomial experiment because we don't know how many fish he will catch before the rainbow trout shows up, so the number of trials is not fixed in advance.
(b) This is a binomial experiment because it fulfills the requirements:
1. There are only two possible outcomes on each trial, with heads defined as success and tails as failure.
2. We know in advance that we are tossing the coin three times.
3. The coin doesn't remember its result from toss to toss, and so the trials are independent.
4. The coin is fair on each toss, and so the probability of observing heads is the same on each toss.
The binomial random variable $X$ is the number of heads observed on the $n = 3$ trials; because the coin is fair, the probability of success is $p = 0.5$ and the probability of failure is $1 - p = 0.5$. The possible values for $X$ are 0, 1, 2, or 3.
(c) This is not a binomial experiment because the responses are not independent. The response given by the employer is likely to affect the employees' responses.
(d) This is a binomial experiment because it fulfills the requirements:
1. There are only two possible outcomes on each trial: entering through the front door or not entering through the front door.
2. We know in advance that the size of the random sample is 36 burglaries.
3. The sample is random, so the trials are independent.
4. The sample is quite small compared to the size of the population, so the probability of entering through the front door remains the same from burglary to burglary.

The binomial random variable $X$ is the number of front-door-entry burglaries noted for the $n = 36$ break-ins; the probability of success is $p = 0.34$ and the probability of failure is $1 - p = 1 - 0.34 = 0.66$.

NOW YOU CAN DO

Exercises 5–14.

Table 7 gives some notation regarding binomial experiments and the binomial distribution.

Table 6.33: Table 7 Notation for binomial experiments and the binomial distribution

Symbol	Meaning
$S$	The outcome denoted as a success
$F$	The outcome denoted as a failure
$P(S) = p$	The probability of observing a success
$P(F) = 1 - p$	The probability of observing a failure
$n$	The number of trials

Using this notation in the experiment in Example 14(d), we have $S$ = burglary through front door, with $P(S) = p = 0.34$ and $n = 36$, and $F$ = burglary not through front door, with $P(F) = 1 - p = 0.66$.

Note: In Section 5.4, we used ${}_nC_r$ to indicate the number of combinations. Now that we have learned about random variables, which can be denoted $X$, we use ${}_nC_X$ to represent the number of combinations.

2 Computing Binomial Probabilities

We demonstrate three ways of computing binomial probabilities: (a) the binomial probability formula, (b) binomial tables, and (c) technology. Before we examine the binomial probability distribution formula, let us recall from Section 5.4 (page 296) the formula for the number of combinations.


Note: You may find the following special combinations useful. For any positive integer $n$: ${}_nC_n = 1$, ${}_nC_0 = 1$, ${}_nC_1 = n$, and ${}_nC_{n-1} = n$.

The number of combinations of $X$ items chosen from $n$ different items is given by

$${}_nC_X = \frac{n!}{X!\,(n - X)!}$$

where $n!$ represents $n$ factorial, which equals $n(n-1)(n-2)\cdots(2)(1)$, and $0!$ is defined to be 1.
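As a quick check of the combinations formula, the following Python sketch compares a direct factorial computation with the standard library's `math.comb`. The helper name `combinations_count` is ours, chosen only for illustration.

```python
from math import comb, factorial

def combinations_count(n: int, x: int) -> int:
    """Number of ways to choose x items from n different items: n! / (x! (n - x)!)."""
    return factorial(n) // (factorial(x) * factorial(n - x))

# The factorial formula agrees with the built-in math.comb for small n.
for n in range(0, 8):
    for x in range(0, n + 1):
        assert combinations_count(n, x) == comb(n, x)

print(combinations_count(3, 2))  # 3
print(combinations_count(4, 0))  # 1, because 0! is defined to be 1
```

In practice `math.comb` is the more convenient choice; the factorial version is shown only because it mirrors the formula term by term.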

We are often interested in finding probabilities associated with a binomial experiment.

EXAMPLE 15 Constructing a binomial probability distribution

A recent study reported that about 40% of online dating survey respondents are “hoping to start a long-term relationship” (LTR).² Consider the experiment of choosing three online daters at random, and let

$X$ = the number of the three online daters who are hoping to start a long-term relationship,

so that a success is defined as choosing someone hoping to start a long-term relationship.

(a) Construct a tree diagram for this experiment.
(b) Suppose that we are interested in finding the probability that exactly two of the three online daters would be LTRers, $P(X = 2)$. In the tree diagram, highlight in blue the outcomes where exactly two of the three online daters are LTRers. Find the probability for each outcome, and use these to find $P(X = 2)$.
(c) Suppose that we are interested in finding $P(X = 1)$. In the tree diagram, highlight in red the outcomes where exactly one of the three online daters is an LTRer. Find the probability for each outcome, and use these to find $P(X = 1)$.

Solution

(a) Figure 7 shows the tree diagram for this experiment.
(b) As we can see from Figure 7, there are ${}_3C_2 = 3$ different ways that exactly two of the three online daters could be LTRers (highlighted in blue).

Figure 6.9: FIGURE 7 Tree diagram and binomial probabilities.


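Each of these three outcomes has the same probability, 0.096. Writing $S$ for an LTRer and $F$ for a non-LTRer:
- The outcome $SSF$ (second row in Figure 7) has probability $(0.4)(0.4)(0.6) = 0.096$.
- The outcome $SFS$ has probability $(0.4)(0.6)(0.4) = 0.096$.
- The outcome $FSS$ has probability $(0.6)(0.4)(0.4) = 0.096$.
Note that each of these products equals $(0.4)^2(0.6)^1$, with $p = 0.4$ having exponent $X = 2$, and $(1 - p) = 0.6$ having exponent $n - X = 1$. Thus, $P(X = 2) = ({}_3C_2)(0.4)^2(0.6)^1 = 3(0.096) = 0.288$.
(c) Similarly, suppose that we are interested in whether exactly one of the three online daters is an LTRer. Then, Figure 7 shows us, highlighted in red, that there are ${}_3C_1 = 3$ different ways this could happen. Each of these outcomes has probability $(0.4)^1(0.6)^2 = 0.144$, where $p = 0.4$ has exponent $X = 1$, and $(1 - p) = 0.6$ has exponent $n - X = 2$. Thus, $P(X = 1) = ({}_3C_1)(0.4)^1(0.6)^2 = 3(0.144) = 0.432$.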
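The tree-diagram reasoning in this example can be reproduced by listing every branch and adding up branch probabilities. The sketch below is one way to do that in Python; the labels `S` (LTRer) and `F` (not an LTRer) and the variable names are our own choices for illustration.

```python
from itertools import product

p = 0.4  # probability that a randomly chosen online dater is an LTRer (from the study)
n = 3    # number of online daters chosen

# Each branch of the tree is a sequence of S (LTRer) and F (not an LTRer).
prob_by_count = {x: 0.0 for x in range(n + 1)}
for branch in product("SF", repeat=n):
    branch_prob = 1.0
    for outcome in branch:
        branch_prob *= p if outcome == "S" else 1 - p
    # Add this branch's probability to the total for its number of successes.
    prob_by_count[branch.count("S")] += branch_prob

for x, prob in prob_by_count.items():
    print(x, round(prob, 4))
```

The totals for two successes and one success come out to 0.288 and 0.432, and the four totals together add up to 1, as they must for a probability distribution.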

We can generalize these procedures and use the binomial probability distribution formula to find probabilities for the number of successes for any binomial experiment.

Remember:

The Binomial Probability Distribution Formula

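The probability of observing exactly $X$ successes in $n$ trials of a binomial experiment is

$$P(X) = ({}_nC_X)\, p^X (1 - p)^{n - X}$$

That is,

$$P(X) = \frac{n!}{X!\,(n - X)!}\, p^X (1 - p)^{n - X}$$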

We often call this the binomial probability formula.
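For readers who want to experiment, here is a minimal Python sketch of the binomial probability formula. The function name `binomial_probability` is hypothetical, and returning 0 for an impossible number of successes is our own convention rather than something stated in the text at this point.

```python
from math import comb

def binomial_probability(x: int, n: int, p: float) -> float:
    """P(X = x): probability of exactly x successes in n independent trials."""
    if x < 0 or x > n:
        return 0.0  # an impossible number of successes
    # (number of ways to place x successes) * p^x * (1 - p)^(n - x)
    return comb(n, x) * p**x * (1 - p) ** (n - x)

# Three trials with success probability 0.4, exactly two successes.
print(round(binomial_probability(2, 3, 0.4), 4))  # 0.288

# The probabilities over all possible values of X add up to 1.
total = sum(binomial_probability(x, 3, 0.4) for x in range(4))
print(round(total, 4))  # 1.0
```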

Steps for Solving Binomial Probability Problems

To solve a binomial probability distribution problem, follow these steps:

Step 1 Find the number of trials $n$, and the probability of success on a given trial $p$.
Step 2 Find the number of successes $X$ about which the question is asking.
Step 3 Using the values from Steps 1 and 2, find the required probabilities using either the binomial probability formula, the binomial tables (which we learn below), or technology.

EXAMPLE 16 Applying the binomial probability distribution formula

A report from SleepFoundation.org stated that 20% of Americans are sleep-deprived (defined as getting less than six hours of sleep per night, on average). This has serious consequences for our nation's highways and productivity. Suppose we take a random sample of four Americans. Find the probability that the following numbers of people are sleep-deprived:

(a) None
(b) At least one
(c) Between one and three, inclusive
(d) Five


Solution

We apply the steps for solving binomial probability problems.

Step 1 We have a random sample of four Americans, so the number of trials is $n = 4$. “Success” is denoted as a particular American being sleep-deprived. The report states that 20% of Americans are sleep-deprived, so $p = 0.2$ and $1 - p = 0.8$.
Step 2 For (a), $X = 0$. For (b), $X \geq 1$; that is, $X = 1$, 2, 3, or 4. For (c), $1 \leq X \leq 3$; that is, $X = 1$, 2, or 3. For (d), $X = 5$.
Step 3 We apply Step 3 for each of (a)–(d) as follows:

(a) To find the probability that none ($X = 0$) of the Americans are sleep-deprived, we use the binomial probability formula:

$$P(X = 0) = ({}_4C_0)(0.2)^0(0.8)^4 = (1)(1)(0.4096) = 0.4096$$

Therefore, the probability that none of the Americans in the sample are sleep-deprived is 0.4096.
(b) Note that “at least one” includes all possible values of $X$ except $X = 0$. In other words, the two events ($X \geq 1$) and ($X = 0$) are complements of each other. Therefore, from the formula for the probability for complements in Section 5.2 (page 260), we have

$$P(X \geq 1) = 1 - P(X = 0) = 1 - 0.4096 = 0.5904$$

The probability that at least one of the Americans is sleep-deprived is 0.5904.
(c) We need to find the probability that either $X = 1$ or $X = 2$ or $X = 3$ of the Americans are sleep-deprived. Because these three values of $X$ are mutually exclusive, we find the required probability by using the Addition Rule for Mutually Exclusive Events.

$$P(1 \leq X \leq 3) = P(X = 1) + P(X = 2) + P(X = 3)$$

So we calculate the following:

$$P(X = 1) = ({}_4C_1)(0.2)^1(0.8)^3 = (4)(0.2)(0.512) = 0.4096$$
$$P(X = 2) = ({}_4C_2)(0.2)^2(0.8)^2 = (6)(0.04)(0.64) = 0.1536$$
$$P(X = 3) = ({}_4C_3)(0.2)^3(0.8)^1 = (4)(0.008)(0.8) = 0.0256$$

Thus, $P(1 \leq X \leq 3) = 0.4096 + 0.1536 + 0.0256 = 0.5888$. The probability is 0.5888 that between one and three, inclusive, of the Americans in the sample of four are sleep-deprived.
(d) In a binomial experiment, the number of successes $X$ can never exceed the number of trials $n$. In other words, $X \leq n$, always. So, if our sample has only $n = 4$ Americans, then $P(X = 5) = 0$. It is not possible for there to be five Americans who are sleep-deprived.
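The four answers in this example can be cross-checked with a few lines of Python. This is only a sketch: the helper `binomial_probability` and the variable names are ours, and the values $n = 4$ and $p = 0.2$ come from the sleep-deprivation report described above.

```python
from math import comb

def binomial_probability(x: int, n: int, p: float) -> float:
    """P(X = x) for a binomial experiment; 0 if x successes are impossible."""
    if x < 0 or x > n:
        return 0.0
    return comb(n, x) * p**x * (1 - p) ** (n - x)

n, p = 4, 0.2  # four Americans sampled, 20% sleep-deprived

none = binomial_probability(0, n, p)                                   # part (a)
at_least_one = 1 - none                                                # part (b), complement rule
one_to_three = sum(binomial_probability(x, n, p) for x in (1, 2, 3))   # part (c), addition rule
five = binomial_probability(5, n, p)                                   # part (d), impossible

print(round(none, 4), round(at_least_one, 4), round(one_to_three, 4), five)
# 0.4096 0.5904 0.5888 0.0
```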

NOW YOU CAN DO

Exercises 15–28.

YOUR TURN #8

For a binomial experiment with and , find the probability that $X$ equals the following:

(a) 0
(b) 1
(c) At most 1

(The solutions are shown in Appendix A.)


As you can imagine, calculations involving binomial probabilities can sometimes get tedious. For example, to find the probability of observing at least 60 heads on 100 tosses of a fair coin, we would have to use the binomial formula for $X = 60$, $X = 61$, $X = 62$, and so on, right up to $X = 100$. For this type of problem, you can use Table B, Binomial Distribution, in the Appendix. If you are trying to answer a question involving unusual values of $n$, such as 103, or unusual values of $p$, such as 0.47, then you can use technology instead.
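To illustrate how technology removes the tedium, the sketch below adds up the 41 binomial probabilities needed for the coin-tossing question. It is a minimal Python illustration, not the method of any particular statistics package, and the variable names are ours.

```python
from math import comb

n = 100  # tosses of a fair coin

# For a fair coin, every sequence of 100 tosses has probability (1/2)**100,
# so P(X = x) = C(100, x) / 2**100. Add these for x = 60, 61, ..., 100.
favorable = sum(comb(n, x) for x in range(60, n + 1))
prob_at_least_60 = favorable / 2**n

print(round(prob_at_least_60, 4))
```

The result is roughly 0.028, so observing 60 or more heads in 100 tosses of a fair coin happens a little less than 3% of the time.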

EXAMPLE 17 Finding probabilities using the binomial table

Use the binomial table and the binomial distribution from Example 16 to find the following probabilities:

(a) No Americans are sleep-deprived.
(b) At least one American is sleep-deprived.

Solution

(a) From Example 16, we have a binomial distribution with $n = 4$ and $p = 0.2$. We next find $n = 4$ and $p = 0.2$ in the binomial table. In Figure 8:
- Look under the $n$ column until you find $n = 4$. That is the portion of the table you will use.
- Then go across the top of the table until you get to $p = 0.20$.
- For part (a), $X = 0$, so go down the $p = 0.20$ column until you see 0 under the $X$ column on the left (and in the subgroup with $n = 4$).
- The number in the $p = 0.20$ column is 0.4096 (see Figure 8), which is the same answer we calculated in Example 16(a).
  
  Figure 6.10: FIGURE 8 Excerpt from the binomial tables.
(b) In this case, “at least 1” means 1 or 2 or 3 or 4. So, by the Addition Rule for Mutually Exclusive Events, find the probabilities for $X = 1$, $X = 2$, $X = 3$, and $X = 4$, and add them up. Using the same column with column head 0.20 in the table as in part (a), we add up the four probabilities.

$$P(X \geq 1) = 0.4096 + 0.1536 + 0.0256 + 0.0016 = 0.5904$$

This is the same answer we calculated in Example 16(b), but it is arrived at in a different way.


Next, a word about cumulative probability. Cumulative probability refers to the probability of, at most, a particular value of $X$. For example, what is the probability that, at most, $X = 2$ Americans are sleep-deprived? This is the cumulative probability that $X = 0$, $X = 1$, or $X = 2$. Statistical software and the TI-83/84 graphing calculator each have a function that will find cumulative binomial probabilities for you.
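A cumulative binomial probability is simply a running sum of individual binomial probabilities. The Python sketch below makes that explicit for the sleep-deprivation setting ($n = 4$, $p = 0.2$); the function names `binomial_probability` and `binomial_cumulative` are hypothetical and are not the names used by any calculator or package.

```python
from math import comb

def binomial_probability(x: int, n: int, p: float) -> float:
    """P(X = x): probability of exactly x successes in n trials."""
    if x < 0 or x > n:
        return 0.0
    return comb(n, x) * p**x * (1 - p) ** (n - x)

def binomial_cumulative(x: int, n: int, p: float) -> float:
    """P(X <= x): add the probabilities of 0, 1, ..., x successes."""
    return sum(binomial_probability(k, n, p) for k in range(0, min(x, n) + 1))

# Probability that at most two of four sampled Americans are sleep-deprived.
print(round(binomial_cumulative(2, 4, 0.2), 4))  # 0.9728
```

Here 0.9728 is the sum 0.4096 + 0.4096 + 0.1536 of the probabilities for zero, one, and two successes.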

EXAMPLE 18 Using technology to find binomial probabilities

Using the binomial distribution from Example 16, use the TI-83/84 and CrunchIt! to find the following probabilities:

(a) $P(X = 4)$, the probability that all four Americans are sleep-deprived.
(b) $P(X \leq 2)$, the (cumulative) probability that, at most, two Americans are sleep-deprived.

Solution

We use the instructions in the Step-by-Step Technology Guide at the end of this section (page 336).

(a) Figure 9 shows that we use the TI-83/84 function binompdf with $n = 4$, $p = 0.2$, and $X = 4$. Figure 10 shows the result: $P(X = 4) = 0.0016$. Figure 11 shows the same input and final answer using CrunchIt!.

Figure 6.11: FIGURE 9 TI-83/84 menu.

Figure 6.12: FIGURE 10 TI-83/84 result.

Figure 6.13: FIGURE 11 CrunchIt!
(b) With the TI-83/84, we use the function binomcdf with $n = 4$, $p = 0.2$, and $X = 2$. Figure 12 shows the result: $P(X \leq 2) = 0.9728$. Figure 13 shows the input and final answer using CrunchIt!.

Figure 6.14: FIGURE 12 TI-83/84 result.

Figure 6.15: FIGURE 13 CrunchIt!

NOW YOU CAN DO

Exercises 29–48.


YOUR TURN #9

For a binomial experiment with and , use the binomial tables or technology to calculate the following probabilities:

(The solutions are shown in Appendix A.)

3 Binomial Mean, Variance, Standard Deviation, and Mode

In Section 6.1, we examined the mean, variance, and standard deviation of a discrete random variable. The binomial random variable is discrete, so it also has a mean, variance, and standard deviation, which are shown here.

Mean, Variance, and Standard Deviation of a Binomial Random Variable

Mean (or expected value): $\mu = n \cdot p$
Variance: $\sigma^2 = n \cdot p \cdot (1 - p)$
Standard deviation: $\sigma = \sqrt{n \cdot p \cdot (1 - p)}$

These formulas work only for a binomial random variable.
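One way to build confidence in these shortcut formulas is to compare them with the general definitions of the mean and variance of a discrete random variable, which sum over every possible value of $X$. The Python sketch below does this for the sleep-deprivation setting ($n = 4$, $p = 0.2$); the variable names are ours.

```python
from math import comb, sqrt

n, p = 4, 0.2

# Binomial probabilities P(X = x) for x = 0, 1, ..., n.
probs = [comb(n, x) * p**x * (1 - p) ** (n - x) for x in range(n + 1)]

# General definitions for a discrete random variable.
mean_by_definition = sum(x * probs[x] for x in range(n + 1))
var_by_definition = sum((x - mean_by_definition) ** 2 * probs[x] for x in range(n + 1))

# Binomial shortcut formulas.
mean_shortcut = n * p
var_shortcut = n * p * (1 - p)

print(round(mean_by_definition, 4), round(mean_shortcut, 4))  # 0.8 0.8
print(round(var_by_definition, 4), round(var_shortcut, 4))    # 0.64 0.64
print(round(sqrt(var_shortcut), 4))                           # 0.8
```

The shortcut formulas give the same answers as the long sums, but with far less arithmetic.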

EXAMPLE 19 Binomial mean, variance, and standard deviation

SAT Scores and AP Exam Scores

The College Board reports (2014) that 90% of students taking the Natural Sciences Subject SAT exam have taken high school chemistry. Suppose we take a sample of 100 students.

(a) Find the mean or expected number of Natural Sciences exam takers who have taken a chemistry course.
(b) Calculate the variance and standard deviation of the number of Natural Sciences exam takers who have taken a chemistry course.
(c) In our sample of 100, would it be unusual to observe 80 Natural Sciences exam takers who have taken a chemistry course?

Solution

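The binomial random variable here is $X$ = the number of Natural Sciences SAT exam takers who have taken a chemistry course, with sample size $n = 100$, probability of success $p = 0.9$, and probability of failure $1 - p = 0.1$.

(a) The mean or expected number who have taken a chemistry course is $\mu = n \cdot p = (100)(0.9) = 90$.
(b) $\sigma^2 = n \cdot p \cdot (1 - p) = (100)(0.9)(0.1) = 9$, expressed in “students squared.” Then $\sigma = \sqrt{9} = 3$ students.
(c) We use the $Z$-score method (Section 6.1, page 320) to determine whether $X = 80$ would be unusual. The $Z$-score for 80 is:

$$Z = \frac{X - \mu}{\sigma} = \frac{80 - 90}{3} \approx -3.33$$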


According to the $Z$-score method of identifying outliers, $X = 80$ Natural Sciences SAT exam takers having taken a chemistry course would be unusual because it is an outlier, with $Z \approx -3.33 < -3$.
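The calculations in this example are short enough to check in a few lines of Python. In the sketch below, the cutoff of 3 standard deviations for an outlier is our reading of the $Z$-score method from Section 6.1 and should be treated as an assumption; the variable names are ours.

```python
from math import sqrt

n, p = 100, 0.9  # 100 students sampled, 90% have taken chemistry

mu = n * p                 # mean (expected value)
variance = n * p * (1 - p)
sigma = sqrt(variance)     # standard deviation

observed = 80
z = (observed - mu) / sigma

# Assumption: a value more than 3 standard deviations from the mean is an outlier.
is_outlier = abs(z) > 3

print(round(mu, 2), round(variance, 2), round(sigma, 2))  # 90.0 9.0 3.0
print(round(z, 2), is_outlier)                            # -3.33 True
```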

NOW YOU CAN DO

Exercises 49–52.

YOUR TURN #10

For a binomial experiment with and , answer the following:

Find the mean $\mu$.
Calculate the variance $\sigma^2$ and standard deviation $\sigma$.
In a sample of 50, would it be unusual to observe ?

(The solutions are shown in Appendix A.)

What Do $\mu$ and $\sigma$ Mean?

The value $\mu = 90$ is the “long-run” mean, and the value $\sigma = 3$ is the “long-run” standard deviation. That is, if we could repeat this experiment an infinite number of times, record the number of Natural Sciences SAT exam takers who took a chemistry course in each sample, and then take the mean and standard deviation of those counts, they would equal $\mu = 90$ and $\sigma = 3$.
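We cannot repeat an experiment infinitely many times, but a computer simulation with many repetitions gives a feel for the long-run idea. The Python sketch below simulates the Example 19 setting (samples of 100 students, each with a 0.9 chance of having taken chemistry). The 5,000 repetitions and the random seed are arbitrary choices of ours, so the simulated values are only approximately 90 and 3.

```python
import random
from statistics import mean, pstdev

random.seed(0)  # fixed seed so the run is repeatable

n, p = 100, 0.9      # sample size and success probability from Example 19
num_samples = 5_000  # a stand-in for "the long run"; more samples give closer agreement

counts = []
for _ in range(num_samples):
    # One simulated sample: count how many of the n students are "successes".
    successes = sum(1 for _student in range(n) if random.random() < p)
    counts.append(successes)

# Mean and standard deviation of the simulated counts.
print(round(mean(counts), 2), round(pstdev(counts), 2))
```

The printed values land close to 90 and 3, and they tend to get closer as `num_samples` grows.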

Next, we consider the mode of a binomial distribution.

The mode of a binomial distribution is the most likely outcome of the binomial experiment for the given values of $n$ and $p$, that is, the outcome with the largest probability.

The next example shows how to find the mode for a binomial distribution.

EXAMPLE 20 The binomial mode: the most likely outcome of a binomial experiment

Example 5 (page 314) shows how to construct a probability distribution graph.

Sixty percent of American adults use their cell phones to access the Internet, according to a 2013 report by the Pew Research Center. Suppose we take a random sample of $n = 3$ American adults.

(a) Calculate the mean number of American adults who use their cell phones to access the Internet.
(b) Use the binomial table to construct a probability distribution graph of the random variable $X$ = the number of Americans who use their cell phones to access the Internet.
(c) Use the binomial table or the probability distribution graph to find the most likely number of American adults who use their cell phones to access the Internet. Note that this represents the mode of the distribution.

Solution

(a) The mean is $\mu = n \cdot p = (3)(0.6) = 1.8$.
(b) Figure 14 is an excerpt from the binomial table, highlighting the probabilities for $X = 0, 1, 2, 3$, for $n = 3$ and $p = 0.6$. We use these probabilities to construct the probability distribution graph shown in Figure 15.

Figure 6.16: FIGURE 14 Probabilities for $n = 3$, $p = 0.6$.

Figure 6.17: FIGURE 15 Probability distribution graph of $X$.
(c) The most likely number of Americans using their cell phones to access the Internet is associated with the largest probability in the boxed section of Figure 14, 0.4320, which is $P(X = 2)$. Note from Figure 15 that $X = 2$ has the tallest bar of probability. Thus, $X = 2$ is the most likely number of American adults using their cell phones to access the Internet. We say that $X = 2$ is the mode of the distribution of $X$.
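Finding the mode amounts to scanning the binomial probabilities for the largest one. The Python sketch below does this for $p = 0.6$, assuming a sample size of $n = 3$, which is the value consistent with the probability 0.4320 quoted in this solution. The variable names are ours.

```python
from math import comb

n, p = 3, 0.6  # assumed sample size; 60% use their cell phones to access the Internet

# Binomial probabilities P(X = x) for x = 0, 1, ..., n.
probs = {x: comb(n, x) * p**x * (1 - p) ** (n - x) for x in range(n + 1)}

# The mode is the value of x with the largest probability.
mode = max(probs, key=probs.get)

for x, prob in probs.items():
    print(x, round(prob, 4))
print("mode:", mode)  # mode: 2
```

The four probabilities are 0.064, 0.288, 0.432, and 0.216, so the tallest bar in the probability distribution graph sits at two.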

NOW YOU CAN DO

Exercises 53–56.