Ultimatum Game

https://arbital.com/p/ultimatum_game

by Eliezer Yudkowsky Aug 10 2016 updated Aug 10 2016

A Proposer decides how to split $10 between themselves and the Responder. The Responder can take what is offered, or refuse, in which case both parties get nothing.


[summary(Gloss): The experimenter offers \$10 to two players, the Proposer and the Responder. The Proposer offers a split, such as \$6 for the Proposer and \$4 for the Responder. If the Responder accepts, the money is divided up accordingly; otherwise both players get nothing.

Is it ever [principle_rational_choice rational] to reject a low offer? ]

[summary: In the Ultimatum bargaining game:

On Causal decision theories, a '[principle_rational_choice rational]' Responder should accept an offer of \$1. Thus a 'rational' Proposer that knows it is facing a 'rational' Responder should offer \$1 (and keep \$9).

On Logical decision theories, the situation is considerably more complicated. E.g., a 'rational' Responder might accept an offer of \$4 with 83% probability, implying an expected gain to the Proposer of \$4.98, since the Responder knows the Proposer was previously reasoning about its likely current behavior.]

In the Ultimatum bargaining game, the experimenter sets up two subjects to play a game only once.

The experimenter offers \$10, to be divided between the subjects.

One player, the Proposer, offers a split to the other player, the Responder.

The Responder can either accept, in which case both parties get the Proposer's chosen split; or else refuse, in which case both the Responder and Proposer receive nothing.

What is the minimum offer a rational Responder should accept? What should a rational Proposer offer a rational Responder?

Analysis

The Ultimatum Game stands in for the problem of dividing gains from trade in non-liquid markets. Suppose:

In principle the trade could take place at any price between \$5001 and \$7999. In the former case, I've only gained \$1 of value from the trade; in the latter case, I've gained \$2999 of the \$3000 of surplus value generated by the trade. If we can't agree on a price, no trade occurs and no surplus is generated at all.

Unlike the laboratory Ultimatum Game, the used-car scenario could potentially involve reputation effects, and multiple offers and counteroffers. But there's a sense in which the skeleton of the problem closely resembles the Ultimatum Game: if the car-buyer seems credibly stubborn about not offering any more than \$5300, then this is the equivalent of the Proposer offering \$1 of \$10. I can either accept 10% of the surplus value being generated by the trade, or both of us get nothing.

In situations like unions at the bargaining table (with the power to shut down a company and halt further generation of value) or non-monetary trades, the lack of any common market and competing offers makes the Ultimatum Game a better metaphor still.

Responses

[Pretheoretical]

A large experimental literature exists on the Ultimatum Game and its variants.

The general finding is that human subjects playing the Ultimatum Game almost always accept offers of \$5, and reject lower offers with increasing probability.

If it's known that, e.g, the experimental setup prohibits the Proposer from offering more than \$2, offers of \$2 are far more likely to be accepted. %note: Falk, A., Fehr, E., Fischbacher, U., 2003. On the nature of fair behavior. Economic Inquiry 41, 20–26.%

85% of Proposers offer at least \$4. [todo: find citation]

One paper lists Responder acceptance rates as follows:

Among participants with a high score on the [ Cognitive Reflection Test], the graph looks like it says:

Causal decision theory

On the academically standard causal decision theory, a rational Responder should accept \$1 (or higher), since the causal result of accepting the \$1 offer is being \$1 richer than the causal result if you reject the \$1. Thus, a rational Proposer should propose \$1 if it knows it is facing a rational Responder. The much lower acceptance rates by human subjects for \$1 offers therefore demonstrates human irrationality.

Evidential decision theory

As in the Transparent Newcomb's Problem, by the time the \$1 offer is received, the EDT agent thinks it is too late for its decision to be news about the probability of receiving a \$1 offer. Thus, rejecting the \$1 offer would be bad news, and the EDT agent will accept the \$1. Thus, among two EDT agents with common knowledge of each other's EDT-rationality, the Proposer will offer \$1 and the Responder will accept \$1.

Logical decision theory

Under LDT, the Responder can take into account that the Proposer may have reasoned about the Responder's output in the course of deciding which split to offer, meaning that the Responder-algorithm's logical reply to different offers can affect how much money is offered in the first place, not just whether the Responder gets the money at the end. Under updateless forms of LDT, there is no obstacle to the Responder taking into account both effects even under the supposition that the Responder has already observed the Proposer's offer.

Suppose the Proposer and Responder have [common_knowledge common knowledge] of each other's algorithms, predicting each other by simulation, [proof_based_dt proofs], or abstract reasoning.

If an LDT Proposer is facing a CDT/EDT Responder, it seems straightforward that an LDT Proposer will offer \$1 and the CDT/EDT Responder will take it. E.g. under [proof_based_dt] the Proposer will check to see if it can get \$10 by any action, then check to see if it can get \$9 by any action, and should prove that if it offers \$1 the Responder will accept it.

Next suppose that a pure EDT or CDT Proposer is facing an updateless LDT Responder. The outcome should be that the EDT/CDT Proposer runs a simulation of the LDT agent facing every possible offer from \$1 to \$9, and discovers that the simulated LDT agent "irrationally" (and very predictably) rejects any offer less than \$9. E.g., a proof-based updateless Responder will look for a proof that some policy leads to gaining \$10, then a proof that some policy leads to gaining \$9, and will prove that the policy "reject any offer less than \$9" leads to this outcome. So the EDT/CDT Proposer will, with a sigh, give up and offer the LDT agent \$9. %note: On EDT or CDT, there is nothing else you can do with an agent that exhibits such irrational and self-destructive behavior, except give it almost all of your money.%

The case where two LDT agents face each other seems less straightforward. If one were to try to gloss LDT as the rule "Do what you would have precommitted to do", then at first glance the situation seems to dissolve into precommitment warfare where neither side has any obvious advantage or way to precommit "logically first". Imagine that both agents are self-modifying: clearly the Proposer must not be so foolish as to offer \$9 if they see that the Responder immediately self-modifies to accept only \$9, since otherwise that is just what the Responder will do.

No formal solution to this problem has been derived from scratch in any plausible system. Informally, Yudkowsky has suggested (in private discussion) that an LDT equilibrium might work as follows:

Suppose the LDT Responder thinks that a 'fair' solution ('fair' being a term of art we'll define by example) is \$5 apiece for Proposer and Responder; e.g. because \$5 is the Shapley value.

However, the Responder is not sure that the LDT Proposer considers Shapley division to be the 'fair' division of the gains. Then:

This suggests that an LDT Responder should definitely accept any offer at or above what the Responder thinks is the 'fair' amount of \$5, and probabilistically reject any offer below \$5, such that the expected value to the Proposer slopes gently downward as the Proposer offers lower splits. E.g., the Responder might accept an offer $~$q$~$ beneath \$5 with probability:

$$~$p = \big ( \frac{\$5}{\$10 - q} \big ) ^ {1.01}$~$$

This implies that, e.g., an offer of \$4 might be accepted with 83% probability, implying an expected gain to the Proposer of \$4.98, and an expected gain to the Responder of \$3.32.

From the Responder's perspective, the important feature of this response function is not that the Responder gets the same amount as the Proposer. Rather, the key feature is that the Proposer gains no expected benefit from giving the Responder less than the 'fair' amount (and further-diminished returns as the Responder's returns decrease further).

What about the LDT Proposer? It does not want to be the sort of agent which, faced with another agent that accepts only offers of \$9, gives in and offers \$9. Faced with an agent that is or might be like that, one avenue might be to simply offer \$5, and another avenue might be to offer \$9 with 50% probability and $1 otherwise (leading to an expected Responder gain of \$4.50). No Responder should be able to do better by coming in with a notion of 'fairness' that demands more than what the Proposer thinks is 'fair'.

Even if the notion of 'fairness' so far seems arbitrary in terms of derivation from any fundamental decision principle, this gives 'fairness' a very [schelling_point Schelling Point]-like status. From the perspective of both agents:

The resulting expected values are not on the [pareto_optimal Pareto boundary]. In fact, Yudkowsky proposed the above equilibrium in response to an earlier proof by Stuart Armstrong that it was impossible to develop a bargaining solution that did lie on the Pareto boundary and gave no incentive to other agents to lie about their utility functions (which was that problem's analogue of lying about what you believe is 'fair').

Yudkowsky's solution was meant to move as little away from the Pareto boundary as possible, while still maintaining stable incentives. But if we look back at the human Responder behaviors, we find that the Proposer's expected gains were:

$$~$\begin{array}{r|c|c} \text{Offer} & \text{Average subject} & \text{High CRT} \\ \hline \$5 & \$4.8 & \$4.95 \\ \hline \$4 & \$4.74 & \$5.52 \\ \hline \$3 & \$3.15 & \$3.57 \\ \hline \$2 & \$2.16 & \$2.64 \\ \hline \$1 & \$1.80 & \$2.34 \end{array}$~$$

This does not suggest that human Responders are trying to implement a gently declining curve of Proposer gains, trying to keep the bargained intersection close to the Pareto boundary. It does suggest that human Responders could be implementing some variant of: "Give the Proposer around as much in expectation as they tried to offer me." That is, clipping somewhere around \$0.80 (average) or \$0.67 (high-CRT) of expected value per \$1 decrease in the offer, with extra forgiveness around the \$4-offer mark.

This pattern is more vengeful than the proposed LDT equilibrium. But this degree of vengeance may also make ecological sense in a context where some 'dove' agents will accept sub-\$3 offers with high probability. In this case, you might respond harshly to \$3 offers to disincentivize the Proposer from trying to exploit the probability of encountering a dove. If you accept \$3 offers with 70% probability for an expected Proposer gain of \$4.90, but more than 1 in 14 agents are 'dove' agents that accept \$3 offers with 90% probability for a Proposer gain of \$6.30, then a Proposer should rationally offer \$3 to exploit the possibility that you are a 'dove'.

Similarly, the acceptance rate for \$8/\$2 splits is much higher if the experimenter is known to have imposed a maximum \$2 offer on the Proposer. As Falk et. al. observed, this behavior makes no sense if humans have an innate distaste for unfair outcomes, but in an LDT scenario the answer here changes to "Accept \$2." So as an LDT researcher would observe that the human subjects are behaving suggestively like algorithms that know other algorithms are reasoning about them.

Arguably, the human subjects in this experiment might still have their rationality critiqued on the grounds that the humans do not actually have common knowledge of each other's code and therefore shouldn't behave like LDT agents that do. But "human subjects in the Ultimatum Game behave a lot like rational agents would if those agents understood each other's algorithms" seems like a large step away from "human subjects in the Ultimatum Game behave irrationally". Perhaps humans simply have good ecological knowledge about the distribution of other agents they might face; or something resembling the equilibrium among LDT agents emerged evolutionarily and humans now instinctively behave as part of it. Arguendo, this LDT-like instinctive equilibrium seems to still take into account whether the Proposer has the option of offering more than \$2; it isn't just an equilibrium about demanding LDT-like distributions of final gains.

Also, the intuitive notion of precommitment warfare suggests that, even if you are facing a robot whose code says to reject all offers below \$9, you should offer \$9 with at most 55% probability--at least if you believe that robot was designed, rather than it being a natural feature of the environment. (E.g. you should not reject the 'offer' of a field that yields an 'unfair' amount of grain!) If you allowed 'unfair' splits with simple algorithms, a self-modifying agent facing you could self-modify into a simple algorithm that rejects all offers below \$9. The corresponding [program_equilibrium program equilibrium] would be unstable. So if humans are behaving like they are part of an equilibrium of LDT agents reasoning about other LDT agents, their uncertainty about exactly which other algorithms they are facing need not imply that they should 'rationally' accept lowball offers.

The author of the initial version of this page does not know whether a similarly stabilized solution around a 'fair' Schelling Point has been suggested in the rather large literature on the Ultimatum Game; it would not be at all surprising if a similar analysis exists somewhere. But Yudkowsky notes that the solution suggested here comes more readily to mind, if we don't think of the rational answer as CDT's \$1. If rejecting \$1 is 'irrational', an agent trying to throw a 'usefully irrational' tantrum might as easily demand \$9 as \$5. LDT results like [robust_cooperation robust cooperation in the oneshot Prisoner's Dilemma given common knowledge of algorithms] are much more suggestive that rational agents in an Ultimatum Game might coordinate in some elegant and stable way.

LDT equilibria are arguably a better point of departure from which to view human reasoning, since human behaviors on Ultimatum Game variants are nothing like EDT or CDT equilibria. Even if we think of people as trying to acquire a useful reputation for rejecting unfair bargains, we can still see those people as trying to acquire a reputation for acting like an LDT-rational agent whose rationality is known, rather than a reputation for useful irrationality.

Yudkowsky finally suggested that LDT analyses of Ultimatum Games--particularly as they regard 'fairness'--might have significant implications for how we can think economically about non-market bargaining and division-of-gains in the real world.


Comments

Eric Rogstad

Similarly, the acceptance rate for \$8/\$2 splits is much higher if the experimenter is known to have imposed a maximum \$2 offer on the Proposer\. As Falk et\. al\. observed, this behavior makes no sense if humans have an innate distaste for unfair outcomes, but in an LDT scenario the answer here changes to "Accept \$2\." So as an LDT researcher would observe that the human subjects are behaving suggestively like algorithms that know other algorithms are reasoning about them\.

Omit the 'as'