OBJECTIVES By the end of this section, I will be able to …
1 Introduction to Conditional Probability
As we progress through this book, you will notice a recurring theme: the more information available, the better. Very often, when we are investigating the probability of a certain event A, we learn that another event B has occurred. If events A and B are related, then the occurrence of event B often influences the probability that event A will occur.
EXAMPLE 18 Having more information often affects the probability of an event
Table 19 contains the Academy Award winners for Best Actress and Best Actor for 2009–2014, along with their ages at the time of the award.
Year | Best actress | Film | Age | Best actor | Film | Age |
---|---|---|---|---|---|---|
2009 | Kate Winslet | The Reader | 33 | Sean Penn | Milk | 48 |
2010 | Sandra Bullock | The Blind Side |
45 | Jeff Bridges | Crazy Heart | 60 |
2011 | Natalie Portman | Black Swan | 29 | Colin Firth | The King's Speech |
50 |
2012 | Meryl Streep | Iron Lady | 62 | Jean Dujardin | The Artist | 39 |
2013 | Jennifer Lawrence | Silver Linings Playbook |
22 | Daniel Day- Lewis |
Lincoln | 55 |
Table 20 is a contingency table, summarizing the information in Table 19, providing the counts of the performers' genders and whether the performer was under age 40.
Female | Male | Total | |
---|---|---|---|
Under age 40 | 3 | 1 | 4 |
Age 40 or older | 2 | 4 | 6 |
Total | 5 | 5 | 10 |
Now, if we choose a performer at random from Table 20, the probability of choosing a female is P(F)=510=0.5. But what if we were given the extra information that the performer is age 40 or older? How does this extra information affect the probability of selecting a female?
Solution
Notice that when we are given that the person is age 40 or older, we may restrict our attention to the performers who are age 40 or older in Table 20 (highlighted). In other words, this extra information reduces the number of possible outcomes in the sample space from the 10 performers to the 6 performers who are age 40 or older. Of these six performers, two of them are female. Thus, the probability of selecting a female, given that the performer is age 40 or older, is 2/6=1/3≈0.33. The extra information we were given changed the probability of selecting a female, from 1/2 to 1/3.
The extra information about a related event changed the probability of the event of interest. This type of probability is an example of what is called conditional probability.
Note: The vertical line | is read as the word “given.” Whatever follows the word “given” is assumed to have occurred already.
For two related events A and B, the probability of B given A is called a conditional probability and is denoted as P(B|A), or P(B given A).
Thus, if we let F represent the event that a female is selected and A represent the event that the performer is age 40 or older, then
P(F)=510=12 but P(F given A)=P(F|A)=26=13
Figure 17 can help us visualize how conditional probability works. The idea is that, once event A has occurred, the only chance for event F to occur is in the overlap—the intersection (F and A). Therefore, the conditional probability that F will occur, given that event A has already taken place, is found by taking the ratio P(F and A)/P(A).
Developing Your Statistical Sense
The difference between P(F given A) and P(F and A)
Students sometimes confuse the meanings of P(F given A) with P(F and A). The two are very different.
In general, the formula for computing conditional probability is given as follows.
Calculating Conditional Probability
The conditional probability that B will occur, given that event A has already taken place, equals
P(B given A)=P(B|A)=P(A and B)P(A)=N(A and B)N(A)
EXAMPLE 19 Calculating conditional probability
Marketing companies are looking to statistical analysis and data mining in an effort to increase the customer response to their promotions. Therefore, these marketing companies are looking to hire more statistics-savvy students. Table 21 is adapted from a study on direct mail marketing. It contains the numbers of custom-ers who either responded or did not respond to a direct mail marketing campaign, along with whether they had a credit card on file with the company. The two events are:
Response | Credit card on file? | ||
---|---|---|---|
No | Yes | Total | |
Did not respond | 161 | 79 | 240 |
Did respond | 17 | 31 | 48 |
Total | 178 | 110 | 288 |
Solution
P(R)=N(R)N(S)=48288≈0.1667
P(R given C)=P(R|C)=N(R and C)N(C)=31110≈0.2818
That is, the probability that a randomly chosen customer responded to the direct mail marketing campaign, given that the customer had a credit card on file, is 0.2818
NOW YOU CAN DO
Exercises 9–32.
YOUR TURN#12
Using Table 21, find the probability that a randomly selected customer
(The solutions are shown in Appendix A.)
What Do These Numbers Mean?
Conditional Probability
Conditional probabilities can often be interpreted as percentages of some subset of a population. For example, the conditional probability that a customer responded, given that the customer has a credit card on file, may be interpreted as the percent-age of customers with credit cards who responded.
2 Independent Events
Having a credit card on file increased the probability of a customer responding from 0.1667 to 0.2818, so we can say that the probability of responding depends in part on whether the customer has a credit card on file. In other words, the events R and C are dependent events.
On the other hand, if the probability of responding had been unaffected by whether the customer had a credit card on file, then we would have said that R and C were independent events. That is, R and C would have been independent events had P(R| C) equaled P(R). In general, if the occurrence of an event does not affect the probability of a second event, then the two events are independent.
Events A and B are independent if
P(A given B)=P(A) or if P(B given A)=P(B)
Equivalently, using symbols, events A and B are independent if
P(A|B)=P(A) or if P(B|A)=P(B)
Otherwise, the events are said to be dependent.
Developing Your Statistical Sense
Dependent and Independent Events
It may help to think of it this way: Andrew Luck is a superb quarterback for the Indianapolis Colts. Let B = the event the Colts win, and let A be the event that Andrew Luck gets injured. The Colts are certainly better with their great quarter-back than without him. So P(B) is greater than P(B|A), because their chances of winning decrease if Luck gets injured. Because P(B)≠P(B given A), the two events A and B are dependent. This makes sense, because the probability of the Colts winning is dependent on whether Andrew Luck gets injured. Now, do the same calculations—this time letting event C be the event that the water boy for the Colts gets injured. It is not likely that the water boy getting injured will affect the Colts’ chances of winning, so that P(B)=P(B given C). Thus, the events B = Colts win-ning and C = water boy gets injured are independent.
Alternatively, in Step 1 you can find P(A) and in Step 2 you can find P(A given B). Then compare these two quantities for Step 3.
Strategy for Determining Whether Two Events Are Independent
EXAMPLE 20 Determining whether two events are independent
Table 22 contains a contingency table summarizing the gender and survival status of the passengers and crew of RMS Titanic.
Female | Male | Total | |
---|---|---|---|
Did not survive | 126 | 1364 | 1490 |
Survived | 344 | 367 | 711 |
Total | 470 | 1731 | 2201 |
Suppose our experiment is to select one person at random from the passengers and crew. Define the following events:
Determine whether events F and S are independent.
Solution
We use the strategy for determining whether two events are independent.
NOW YOU CAN DO
Exercise 33–52.
YOUR TURN#13
In Example 20, determine whether being a male and not surviving are independent.
(The solution is shown in Appendix A.)
Developing Your Statistical Sense
Don’t Confuse Independent Events and Mutually Exclusive Events
It is important to stress the difference between independent events and mutually exclusive events. Mutually exclusive events have no outcomes in common. For two events to be independent means that the occurrence of one does not affect the probability of the other. The concepts are different.
EXAMPLE 21 Gambler's Fallacy
Suppose we have tossed a fair coin 10 times and have observed heads come up every time. Find the probability of tails on the next toss.
Solution
We have observed an unusual number of heads, so we might think that the probability of tails on the next toss is increased. However, the short answer is “Not so.” Successive tosses of a fair coin are independent because the coin has no memory of its previous tosses. Thus, what happened on the first 10 tosses has no effect on the next toss. Probability theory tells us that, in the long run, the proportion of heads and tails will eventually even out if the coin is fair. Therefore, the probability of tails on the next toss is 0.5. This is an example of the Gambler’s Fallacy.
3 Multiplication rule
Just as the Addition Rule is used to find probabilities of unions of events, the Multiplication rule is used to find probabilities of intersections of events. Recall the formula for the conditional probability of event B given event A:
P(B given A)=P(A and B)P(A)where P(A)≠0
We solve for P(A and B) by multiplying each side by P(A):
P(A and B)=P(A)·P(B given A)
Similarly, consider the conditional probability of event A given event B:
P(A given B)=P(A and B)P(B)where P(B)≠0
Solving for P(A and B) gives us a second equation for P(A and B):
P(A and B)=P(B)·P(A given B)
The two equations for P(A and B) lead directly to the Multiplication Rule.
Multiplication Rule
P(A and B)=P(A)·P(B given A) or equivalently P(A and B)=P(B)·P(A given B)
EXAMPLE 22 Multiplication Rule
According to the Pew Internet and American Life Project,11 35% of American adults have cell phones with apps, but only 68% of those who have apps on their cell phones actually use the apps. Define the following events:
Solution
P(A and U)=P(A)·P(U given A)=0.35(0.68)=0.238
The probability that an American adult has a cell phone with apps and uses them is 0.238.
NOW YOU CAN DO
Exercises 53–58.
YOUR TURN#14
Suppose our experiment is to toss a fair die twice. Find the probability of rolling two sixes in a row.
(The solution is shown in Appendix A.)
By the definition of independent events (page 274), when events A and B are independent, P(A given B)= P(A) or P(B given A)=P(B). Using these identities, we can formulate a special case of the Multiplication Rule. Using P(A given B)= P(A), we can write the Multiplication Rule as
P(A and B)=P(B)·P(A|B)=P(B)P(A)=P(A)P(B)
Equivalently, the Multiplication Rule also states that P(A and B)= P(A)·P(B given A), but if A and B are independent, P(B and A)= P(B), so, again, P(A and B)= P(A)P(B).
Multiplication Rule for Two Independent Events
If A and B are any two independent events, P(A and B)= P(A)P(B).
EXAMPLE 23 Multiplication Rule for two independent events
Successive spins of the roulette wheel are independent because the wheel does not remember its result from the previous spin. Suppose the experiment is to spin the roulette wheel two times and to bet on red each time. There are 18 red numbers out of a total of 38 num-bers on the wheel. What is the probability that red will occur twice in succession?
Solution
Define the following events:
We have the probability of each event given as:
P(A)=P(B)=1838
Then, because the successive spins are independent, the Multiplication Rule for Two Independent Events tells us:
P(Winning)=P(A and B)=P(A)·P(B)=(1838)(1838)≈0.2244
NOW YOU CAN DO
Exercises 59–66.
YOUR TURN#15
For the roulette experiment in Example 23, what is the probability of not getting red on the first spin and not getting red on the second spin?
(The solution is shown in Appendix A.)
Sampling With and Without Replacement
The relationship between two events can be determined by the way the samples are chosen. Two methods of choosing samples are sampling with replacement and sam-pling without replacement.
In sampling with replacement, the randomly selected unit is returned to the population after being selected. When sampling with replacement, it is possible for the same unit to be sampled more than once.
In sampling without replacement, the randomly selected unit is not returned to the population after being selected. When sampling without replacement, it is not possible for the same unit to be sampled more than once.
EXAMPLE 24 Sampling with replacement
We draw a card at random from a shuffled deck, observe the card, and return it to the deck. The deck is then reshuffled, and we draw another card at random. What is the probability that both cards we select will be aces?
Solution
Define the following events:
We want to find P(A and B), the probability of observing an ace on the first draw and an ace on the second draw. From the Multiplication Rule, P(A and B)= P(A)·P(B|A). To find P(A), recall that there are 4 aces in the deck of 52 cards. It is reasonable to assume that all cards are equally likely to be selected, so using the classical method, P(B given A)=4/52. Similarly, P(B)= 4/52.
Next, we need to find P(B given A), the probability of observing an ace on the second draw, given that we observe an ace on the first draw. Because the deck of 52 cards has not changed (except for shuffling), there are still 52 cards—4 of which are aces. Therefore, P(B given A)=4/52. Thus, the probability that both cards we select will be aces is P(A and B)= P(A)·P(B given A)=(4/52)(4/52)≈0.0059.
Note that P(B given A)=P(B)=4/52. Thus, by the alternative method for deter-mining independence, A and B are independent events when sampling with replacement.
NOW YOU CAN DO
Exercises 67–70.
YOUR TURN#16
For the experiment in Example 24, what is the probability of observing hearts on the first draw and observing hearts on the second draw, if we sample with replacement?
(The solution is shown in Appendix A.)
We can generalize this result as follows.
When sampling with replacement, successive draws can be considered independent.
EXAMPLE 25 Sampling without replacement
Suppose we alter the experiment in Example 24 as follows: We draw a card at random from a shuffled deck, hold onto the card (do not replace it) while the deck is reshuffled, and then select another card at random. What is the probability that both cards we select will be aces?
Solution
Define events A and B as in Example 24. Again we use the Multiplication Rule to find P(A and B). The difference in this experiment comes when finding P(B|A), the probability of observing an ace on the second draw given an ace on the first draw. Once we select the first ace, we do not replace it in the deck. Therefore, when the deck is reshuffled, it has only 51 cards left, only 3 of which are aces. The classical method then gives the probability of observing an ace on the second draw:
P(B given A)=number of aces in the decknumber of cards in the deck=351
Thus, the probability that both cards we select will be aces is
P(A and B)=P(A)·P(B given A)=452·351=122652≈0.0045
This probability is somewhat less than the probability that both cards will be aces when sampling with replacement. Note that here we found that P(B given A) was not equal to P(B). Thus, by the alternative method for determining independence, A and B are not independent events; they are dependent events.
NOW YOU CAN DO
Exercises 71–74.
YOUR TURN#17
For the experiment in Example 25, what is the probability of observing hearts on the first draw and observing hearts on the second draw, if we sample without replacement?
(The solution is shown in Appendix A.)
We can generalize this result as follows.
When sampling without replacement, successive draws should be considered dependent.
Note that the Multiplication Rule for Independent Events provides us with an alternative method for determining whether two events are indeed independent.
Alternative Method for Determining Independence
If P(A)·P(B)=P(A and B), then events A and B are independent.
If P(A)·P(B)≠P(A and B), then events A and B are dependent.
EXAMPLE 26 Determining independence using the alternative method
We return to the direct mail marketing data from Example 19, reproduced here in Table 23. Use the alternative method for determining independence to determine whether the following two events are independent:
Response | Credit card on file? | ||
---|---|---|---|
No | Yes | Total | |
Did not respond | 161 | 79 | 240 |
Did respond | 17 | 31 | 48 |
Total | 178 | 110 | 288 |
Solution
Using Table 23, we may find the following probabilities:
P(R)=48288 P(C)=110288 P(R and C)=31288≈0.1076P(R)·P(C)=48288·110288≈0.0637
Because 0.0637≠0.1076, we have P(R)·P(C)≠P(R and C); therefore, R and C are dependent.
NOW YOU CAN DO
Exercises 79–84.
The next example illustrates the relationship between mutually exclusive events and independence.
EXAMPLE 27 Conditional probability for mutually exclusive events
Suppose two events A and B are mutually exclusive, with P(A)>0 and P(B)>0.
Solution
Because A and B are mutually exclusive, P(B and A)=0. Then
P(B given A)=P(A and B)P(A)=0
That is, if event A has occurred, then event B cannot occur. This is a natural consequence of events A and B being mutually exclusive.
What Results Might We Expect?
Two events are independent if the occurrence of one does not affect the probability that the other will occur. However, as we saw in (a), if event A occurs, then the probability that event B will occur is 0. Thus, we would expect events A and B to be dependent.
In other words, if two events are mutually exclusive, then they are dependent.
NOW YOU CAN DO
Exercises 85–92.
We can extend the Multiplication Rule to cover n independent events.
Multiplication Rule for n Independent Events
If A, B, C, … are independent events, then P(A and B and C and …)=P(A)·P(B)·P(C)…
EXAMPLE 28 Multiplication rule for n independent events
According to the National Health Interview Survey, 24% of Americans ages 18–44 smoke tobacco. In a random sample of n=3 Americans ages 18–44, find the probability that all three smoke.
Solution
The U.S. Census Bureau estimates that over 100 million Americans are ages 18–44. We will shortly see that, because the sample of 3 is so small compared to the population of over 100 million, it is reasonable to assume that the successive draws are independent. Let Si denote the event that the ith American in the 18–44 age group smokes.
P(S1)=P(S2)=P(S3)=0.24
Then, using the Multiplication Rule for n Independent Events,
P(S1 and S2 and S3)=P(S1)·P(S2)·P(S3)=(0.24)(0.24)(0.24)=(0.24)3=0.013824
NOW YOU CAN DO
Exercises 93–100.
YOUR TURN#18
Refer to Example 28. In a random sample of n=10 Americans ages 18–44, find the probability that all 10 smoke.
(The solution is shown in Appendix A.)
Next, we turn to solving an “at least” problem. This involves a technique that uses complements in order to give us a nice shortcut for calculating a probability.
EXAMPLE 29 Solving an “at least” problem
Using information in Example 28, find the probability that, in a random sample of 5 Americans ages 18–44, at least 1 of them smokes.
Solution
The phrase “at least” means that one or more of the five Americans smoke, so the probability we are looking for is:
P(1 or 2 or 3 or 4 or 5 Americans smoke)
Now, each of these events is mutually exclusive, meaning that our sample can’t yield exactly 1 American who smokes and exactly 2 Americans who smoke. Thus, by the Addition Rule for Mutually Exclusive Events, the above probability equals:
P(1 American smokes)+P(2 Americans smoke)+…+P(5 Americans smoke)
Calculating all these probabilities would take a while. So, we can use the probability of the complement we learned earlier to get us a shortcut. Note that “at least 1 American smokes” is the complement of “no Americans smoke.” Then, because the complement rule for probability is: P(AC)=1-P(A), we get:
P(At least 1 of the 5 Americans smokes)=1-P(None of the 5 Americans smokes because the events are independent)=1-P(1st doesn'
NOW YOU CAN DO
Exercises 101–104.
YOUR TURN#19
For the experiment in Example 29, find the probability that, in a random sample of four Americans ages 18–44, at least one of them smokes.
(The solution is shown in Appendix A.)
4 Approximating Probabilities for Dependent Events
When the sample size is small compared to the population size, we can estimate the probability of a dependent event as if it were independent. The question is: How small is a small sample? We will use the following 1% Guideline.
The 1% Guideline
Suppose successive draws, such as those for a random sample, are being made from a population. If the sample size is no larger than 1% of the size of the population, then the probability of dependent successive draws from the population may be approximated using the assumption that the draws are independent.
EXAMPLE 30 Applying the 1% Guideline
The Gardasil vaccine
In the Section 5.2 exercises, we found the probability that a randomly chosen patient completed the treatment to be . Suppose we take a random sample of size patients.
Solution
Define the following events:
The 1% Guideline is also helpful when we do not know the size of the population, but may presume that the population is very large compared to the sample size.
NOW YOU CAN DO
Exercises 75–78
5 Bayes’ Rule
When new information becomes available, our estimates of probability need to be revised to reflect the presence of the new information. An English Presbyterian Minister, Thomas Bayes (1702–1761) developed a rule, called Bayes’ rule, for updating previous probabilities using new information to arrive at revised probabilities.
Bayes' Rule
For any two events and ,
We apply Bayes’ Rule in the following example.
EXAMPLE 31 Bayes' Rule
The Gardasil vaccine
Table 24 contains a contingency table (crosstabulation) of the variables com-pleted and insurance type for the data set Gardasil.12
Insurance type | |||||||
---|---|---|---|---|---|---|---|
Hospital- based |
Medical assistance |
Military | Private payer | Total | |||
Completed | No | 45 | 220 | 209 | 470 | 944 | |
Yes | 39 | 55 | 122 | 253 | 469 | ||
Total | 84 | 275 | 331 | 723 | 1413 |
Define the following events:
Use Bayes’ Rule to find the probability that a randomly selected patient used military insurance, given that he or she completed the treatment.
Solution
We are interested in the probability . Substituting for and for in the formula for Bayes’ Rule, we obtain:
From Table 24, we have . The event consists of all other insurance types besides military. So we have , and . Thus,
Note that this result is confirmed by the direct calculation of using Table 24, which is the single cell of 122 military completers divided by the total number of completers.
NOW YOU CAN DO
Exercises 105–108.
YOUR TURN#20
Using Table 24 and Bayes’ Rule, find the probability that a randomly selected patient used hospital-based insurance, given that he or she completed the treatment.
(The solution is shown in Appendix A.)
EXAMPLE 32 The ELISA test for the presence of HIV
The ELISA test is used to screen blood for the presence of HIV. Like most diagnostic procedures, the test is not foolproof.
A positive result means that the test says that the person has the HIV infection. A negative result means that the test says that the person does not have the virus. The prevalence rate for HIV in the general population is 0.5%. That is, 5 of 1000 persons in the general population have HIV.
Suppose we have samples of blood from 100,000 randomly chosen people.
Problem 1. How many people in the sample of 100,000 have HIV? How many do not?
Solution
The prevalence rate of 0.5% means that people in the sample have HIV. The remainder—99,500—do not.
Problem 2. A positive result is given 99.6% of the time for blood containing HIV. For the 500 people with HIV, how many positive results will the ELISA test return? How many of the 500 people with HIV will receive a negative result?
Solution
The ELISA test will return a positive result for of the 500 people. Thus, two people who actually have HIV will receive a test result indicating that they do not have the virus.
Problem 3. A negative result is given 98% of the time for blood without HIV. For the 99,500 people without HIV, how many negative results will the ELISA test return? Positive results?
Solution
The ELISA test will return a negative result for of the 99,500 people without HIV. The remaining 2%, or 1990 people, will receive positive ELISA test results, even though they do not have the virus.
We can use the counts we found to fill in Table 25.
In reality | |||
---|---|---|---|
ELISA test results | Person has HIV | Person does not have HIV | Total |
Positive | 498 | 1,990 | 2,488 |
Negative | 2 | 97,510 | 97,512 |
Total | 500 | 99,500 | 100,000 |
We will use the information in the ELISA test contingency table to solve Problems 4 and 5. If a person is chosen at random from the sample of 100,000, define the following events:
Problem 4. What is the probability that a randomly chosen person actually does have HIV, given that the ELISA results are negative? In other words, find .
Solution
Problem 5. What is the probability that a randomly chosen person actually does not have HIV, given that the ELISA test results are positive? In other words, find .
Solution
Developing Your Statistical Sense
Which Error Is More Dangerous?
In Problems 4 and 5, we examined the probabilities of the two ways that the ELISA test can be wrong. Which error do you think is more dangerous? represents the probability that HIV is present, even though the ELISA test says otherwise. represents the probability that HIV is not present, even though the ELISA test says it is present. The designers of the ELISA test worked hard to reduce the false-negative rate to as low a level as possible. They rightly considered that it is the more dangerous type of error because of the epidemic nature of the illness. A person who receives a false-negative ELISA result could spread the infection further. Therefore, the designers tried to keep this probability as low as they could.
There is a price to be paid, however, which is the high false-positive rate, , a very high 80%. Thus, if a random person receives a positive ELISA test result, the probability that the person does not have HIV is 80%. When the ELISA test comes back positive, a second batch of tests that have a more reasonable false-positive rate is usually administered.