We have shown how to compute optimal pure and mixed strategies, and the values ensured by using them, in constant-sum games. In variable-sum games, we focused on Nash equilibria as a solution concept in the Prisoners’ Dilemma and Chicken, but we found that this notion of a stable outcome did not justify the choice of cooperative strategies in either of these games.
We turn next to a game in which there may be any number of players, each of whom can choose among an infinite number of strategies! Although this game sounds more complex, it demonstrates that it may be easy to determine a Nash equilibrium, and it may be easy for each player to determine an optimal strategy.
The game models a specific type of auction. An auction is a common way to sell an item when the seller is unsure of the going price of the item; the auction is designed to get the seller the highest price for the item. For example, Sotheby’s auction house specializes in high-end contemporary art and rare items such as the British Guiana one-cent magenta stamp, considered the rarest and most valuable stamp in the world. Because these items are one of a kind, the seller has difficulties in knowing what the market value for the item is. Auctions have become more common as websites like eBay hold auctions in which bidders can participate online at any hour of the day. Stamp collectors had used the following type of auction well before Columbia University economist William Vickrey analyzed it formally in 1961.
A Vickrey Auction DEFINITION
In a Vickrey auction, also called the second-price sealed-bid auction, bidders independently submit sealed bids for an item being sold. The winner is the one who bids the highest, but he or she pays only the amount of the second-highest bid.
644
A Vickrey auction involves a bidding strategy that is at least as good as any other. This notion weakens the idea of a dominant strategy that was introduced when analyzing the Prisoners’ Dilemma. A weakly dominant strategy is one that is at least as good as any other strategy. Flipping perspective, we define a dominated strategy.
Dominated Strategy DEFINITION
A player’s strategy is dominated by his or her strategy if the outcome under is always better for the player than the outcome under , regardless of the decisions made by the other players.
In the location/schedule game of Example 1 with payoff matrix from Table 15.1 (page 623), Mark chooses the location of the hospital, while Lisa chooses the shift. Recall that Mark is concerned about making the highest hourly wage possible for his emergency room work. For this reason, Mark will never select the Rural hospital because he would receive a lower hourly wage than he would at the Suburban hospital, regardless of which shift Lisa selects. This is apparent because the numbers in the Suburban row of Table 15.1 are greater than the numbers in the Rural row. We say that the Rural strategy is dominated by the Suburban strategy.
Dominated strategies can be eliminated from consideration, as there will never be a situation in which a dominated strategy should be played.
EXAMPLE 9 The Vickrey Auction
Suppose that Anneliese, Binh, and Charlie bid for a stamp in a Vickrey auction. Anneliese, Binh, and Charlie value the stamp at $300, $200, and $100, respectively. To see that it is always at least as good for Anneliese to bid her true valuation of the stamp, consider the two possible results of bidding $300. To simplify the analysis, we will assume that each bidder’s bid is distinct, so no two bidders bid the same amount.
645
Anneliese’s strategy of bidding $300 weakly dominates any other strategy she could use. A similar analysis holds for Binh and Charlie. Collectively, it is a Nash equilibrium for Anneliese, Binh, and Charlie to each bid his or her true valuation for the stamp. To see this, if Anneliese, Binh, and Charlie bid $300, $200, and $100, respectively, for the stamp, then Anneliese will be the highest bidder and pay $200 for the stamp. The payoffs for Anneliese, Binh, and Charlie are given by the triplet . Neither Binh nor Charlie could change his bid to receive a positive payoff. Anneliese’s bid cannot affect the price she pays; it can determine only whether or not she is the highest bidder. Likewise, she cannot change her bid to do better and could only do worse by bidding too little and losing the stamp to Binh.
If more than one player has the highest bid, then one of the players is selected at random to receive the item and the second highest bid is equal to the highest bid. The analysis in the example extends for a Vickrey auction with any number of players. Each player has a weakly dominant strategy to bid his or her true valuation of the item. It leads to the following result.
Nash Equilibrium in a Vickrey Auction THEOREM
In a Vickrey auction, it is a Nash equilibrium for each bidder to bid the amount at which he or she values the item.
The Vickrey auction induces the bidders to reveal truthfully their values for the item being auctioned (see Chapter 13, Section 13.8, for more on Vickrey auctions). However, there are other equilbria for the Vickrey auction game. For the stamp auction, bids of $201, $200, and $100 and of $200, $301, and $100 by Anneliese, Binh, and Charlie, respectively, are also Nash equilibria. The second of these two equilibria is surprising because Binh can win the item! For games in which each player has a dominant strategy (like the Prisoners’ Dilemma), then there is a single Nash equilibrium. It is also a Nash equilibrium for every player to use a weakly dominant strategy, but there may be more than one equilibrium if each player has a weakly dominant strategy, as demonstrated by the Vickrey auction game.
Suppose Anneliese, Binh, and Charlie bid $150, $301, and $100, respectively, at the stamp auction. Why does Anneliese not prefer to change her bid? Why does Binh prefer not to change his bid? Does your answer to the question about Anneliese also hold for Charlie? Answering these three questions shows that when Anneliese, Binh, and Charlie bid as described, it is a Nash equilibrium: Collectively, if no player wishes to change his or her bid, the bids form a Nash equilibrium.
When Anneliese, Binh, and Charlie have bids of $150, $301, and $100, respectively, then Binh gets the item, pays $150 (the second highest bid), and achieves a payoff. Because Anneliese and Charlie don’t get the item and don’t pay anything, each receives a payoff of 0.
If Anneliese bids anything less than $301, then she will not get the item; this means that she has a payoff of 0. If she bids $301 or more, then she could receive the item and pay more than what it is worth to her. This is a negative payoff. Because she cannot increase her payoff, she has no desire to change her bid. Binh already gets the item, so bids above $150 do not change who gets the item or how much Binh pays. If Binh bids less than $150, then he doesn’t get the item and his payoff is 0. Binh doesn’t wish to risk bidding $150 and possibly not getting the item. Binh has no incentive to change his bid. Like Anneliese, if Charlie bids more than $301, he will win the stamp but pay too much for it. This result could occur if he bids $301, too. For bids less than $301, Charlie still doesn’t get the item and pays nothing. Charlie has no reason to change his bid, either.
646
Mechanism design is the act of designing a game to achieve a particular outcome as a Nash equilibrium. Despite the multiple equilibria for the Vickrey auction game, the Vickrey auction does give each bidder an incentive to reveal truthfully how much he or she values the item. Another example of mechanism design is the design of matching markets, described in Spotlight 15.2. Similar objectives border on the intersection of economics and psychology, in which decision problems are posed in a particular way to encourage people to make better decisions. For example, the placement of healthy food by a checkout register in a school’s lunchroom results in more healthy choices by students.
The Nobel Prize in Economics Spotlight 15.2
The Nobel Memorial Prize in Economics was awarded to three game theorists in 1994, marking the 50th anniversary of the publication of the first text on game theory (Theory of Games and Economic Behavior) by mathematician John von Neumann and economist Oskar Morgenstern.
In 2005, mathematician Robert J. Aumann was awarded the prize for a variety of advances in cooperative and noncooperative game theory, and economist Thomas C. Schelling also was awarded the prize for his contributions to the study of conflicts involving promises, threats, and other kinds of commitments. Mathematician Lloyd Shapley and economist Alvin Roth were awarded the 2012 Nobel Prize for their application of game theory to the design of markets (see the Case Study at the end of Part I, page 174). Highly successful implementations of these types of matching markets include “The Match,” in which physicians are paired up with residency programs, and school choice, in which students are enrolled in schools across Boston and New York based on different criteria.
647
Because the bidders could bid any amount, it was not practical to use a payoff matrix to describe the Vickrey auction game. However, for two-player games, we have used payoff matrices to describe games in strategic form. In these games, the row and column players’ choices of strategies led to an outcome from which each player received a payoff. These strategy choices were assumed to be simultaneous, though mentally the players might eliminate some dominated strategies. This was the case of the schedule/location game from Example 1, as Mark would never select the Rural hospital.
In the next example, which involves a larger game, we start by assuming simultaneous choices and show what outcome would occur. Then we assume that the choices of the players need not be simultaneous, instead allowing for one player to move first. We use a game tree to analyze the sequential choices players can then make, as occurs when first you move, then I move, and so on—which are called games in extensive form. As we will see, the outcome in such a game may be wholly different from the outcome in a game with simultaneous choices, which raises the question of which kind of game is the most realistic model of a situation.
EXAMPLE 10 A Truel
A truel is like a duel, except that there are three players. Truels are depicted in several movies, including The Good, the Bad and the Ugly (1966), Reservoir Dogs (1992), and Pulp Fiction (1994). The photo below shows three of the characters in The Good, the Bad and the Ugly, about to engage in a truel—each man is armed and planning his next move.
In a truel, each player can either fire or not fire his or her gun at either of the other two players. We assume the goal of each player is, first, to survive and, second, to survive with as few other players as possible. Each player has one bullet and is a perfect shot; no communication (e.g., to pick out a common target) leading to a binding agreement with other players is allowed, making the game noncooperative. We will discuss the answers that simultaneous choices, on the one hand, and sequential choices, on the other, give to what is optimal for the players to do in the truel.
648
If choices are simultaneous, at the start of play, each player will fire at one of the other two players, killing that player.
Why will the players all fire at each other? Because their own survival does not depend an iota on what they do. Since they cannot affect what happens to themselves but can affect how many others survive (the fewer the better, according to the postulated secondary goal), they should all blaze away at each other. In fact, the players all have dominant strategies to shoot at each other because whether or not a player survives—we will discuss shortly the probabilities of doing so—he or she does at least as well shooting an opponent.
The game, and optimal strategies in it, would change if the players (1) were allowed more options, such as to fire in the air and thereby disarm themselves, or (2) did not have to choose simultaneously but, instead, a particular order of play were specified. Thus, if the order of play were , followed by and choosing simultaneously, followed by any player with a bullet remaining choosing, then would fire in the air, and and would subsequently shoot each other. ( is no threat to or , so neither of the latter will fire at and waste a bullet; on the other hand, if one of or did not fire immediately at the other, that player would not survive to get in the last shot, so both and will fire at each other.) Thus, will be the sole survivor.
In 1992, a modified version of this scenario was played out in late-night television programming among the three major TV broadcasting networks of the time, with ABC effectively going first with Nightline, its well-established news program, and CBS and NBC dueling about which host, David Letterman or Jay Leno, to choose for their entertainment shows. Regardless of their ultimate choices, ABC “won” when CBS and NBC were forced to divide the entertainment audience. In 2002, ABC, presumably to attract a younger audience than the one that watched Nightline, attempted unsuccessfully to hire Letterman from CBS. Ted Koppel retired in 2005, but Nightline continues with other hosts.
To return to the original game (all choose simultaneously), the players’ strategies of all firing have two possible consequences: Either one player survives (even if two players fire at the same person, the third must fire at one of them, leaving only one survivor), or no player survives (if each player fires at a different person). In either event, there is no guarantee of survival. In fact, if each player has an equal probability of firing at one of the two other players, the probability that any player will survive is only 25%. The reason is that if the three players are , , and , will be killed if fires at him or her, does, or both do. The only circumstance in which will survive is if and fire at each other, which gives 1 chance in 4.
If choices are sequential, no player will fire at any other, so all will survive.
At the start of the truel, all the players are alive, which satisfies their primary goal of survival, though not their secondary goal of surviving with as few others as possible. Now assume that contemplates shooting , thereby reducing the number of survivors. Looking ahead, however, knows that by firing first and killing , he or she will be defenseless and be immediately shot by , who will then be the sole survivor.
It is in ’s interest, therefore, not to shoot anybody at the start, and the same logic applies to each of the other players. Hence, everybody will survive, which is a happier outcome than when choices are simultaneous, in which case everyone’s primary goal of survival is not satisfied—or, quantitatively speaking, satisfied only 25% of the time.
While sequential choices produce a “happier” outcome, do they provide a plausible model of a strategic situation that mimics what people might actually think and do in such a situation? We believe that the players in the truel, artificial as this kind of shootout may seem, would be motivated to think ahead, given the dire consequences of their actions. Therefore, they would hold their fire, knowing that if one fired first, he or she would be the next target. Indeed, this truel scenario is parodied in “Stand Off,” a Saturday Night Live skit broadcast on November 17, 2012, in which the three characters go about a day in their lives with guns trained on one another and no one wanting to fire first.
649
In Figure 15.4, we show this logic somewhat more formally with a game tree, in which A has three strategies, as indicated by the three branches that sprout from : not shoot , shoot , or shoot . The latter two branches, in turn, give survivors and , respectively, two strategies: not shoot or shoot .
We assume that the players rank the outcomes as follows, which is consistent with their primary and secondary goals: 4 = best (lone survivor), 3 = next best (survivor with one other), 2 = next worst (survivor with two others), and 1 = worst (nonsurvivor). These payoffs are given for ordered triples (, , ); thus, (3, 3, 1) indicates the next-best payoffs for and and the worst payoff for .
Note that play necessarily terminates when there is only one survivor, as is the case at (1, 1, 4) and (1, 4, 1). To keep the tree simple, we assume that play also terminates when either initially chooses , or or subsequently chooses , giving outcomes of (2, 2, 2), (3, 3, 1), and (3, 1, 3), respectively. Of course, we could allow the two or three surviving players in the latter cases to make subsequent choices in an extended game tree, but this example is meant only to illustrate the analysis of a game tree, not to be the definitive statement on truel possibilities. (More will be explored in the exercises.)
In a game in extensive form represented by a game tree, players work backward, starting the analysis at the bottom of the tree. By “bottom” we mean where play terminates; because this is where the tree branches out, the tree looks upside down in Figure 15.4. The players then work up the tree, using backward induction. Backward induction is a reasoning process in which players, working backward from the last possible moves in a game, anticipate each other’s rational choices.
650
To illustrate, because prefers (1, 1, 4) to (3, 1, 3), we indicate that would not choose by “cutting” this branch with scissors. Similarly, would not choose . Thus, if play got to the bottom of the tree, would shoot if were the survivor, and would shoot if were the survivor, following ’s shooting or , respectively.
Moving up to the next level, would know that if he or she shot , (1, 1, 4) would be the outcome. If he or she shot , (1, 4, 1) would be the outcome, making one or the other the outcome from the bottom level. Choosing between these two outcomes and (2, 2, 2), would prefer the latter, so A would cut the two branches, would shoot , and would shoot . Hence, would choose , terminating play with nobody shooting anybody else.
This, of course, is the conclusion that we reached earlier, based on the reasoning that if shot either or , he or she would end up dead, too. Because we could allow each player, like , to choose among his or her three initial strategies in a game, and subsequently make moves and countermoves from the initial state (if feasible), the foregoing analysis applies to all players.
Use backward induction to determine how players and would make their sequential decisions for the game tree in Figure 15.5.
Underlying the completely different answers given by the simultaneous and sequential choices in a truel is a change in the rules of play. If play is sequential, the players do not have to fire simultaneously at the start, as assumed in a strategic-form game. Rather, a player who moves first (A in our example)—and then the later players—would not fire, given that play continues until all bullets are expended or nobody chooses to fire.
In the extensive-form analysis, we ask of each player (it need not be ): Given your present situation (all alive), and the situation you anticipate will ensue if you fire first, should you do so? Because each player prefers living to the state he or she would induce by being the first to shoot (certain death), no one shoots.
651
This analysis suggests that truels might be more effective than duels, at least when played sequentially, in preventing the outbreak of conflict.
We will not try to develop this argument into a more general model. The main point is that a game tree allows for a look-ahead approach, whereby players compare the present state with possible future states—perhaps several steps ahead—to determine which moves to make. These choices, as we have seen, may lead to radically different outcomes compared with outcomes based on simultaneous choices.