Newcomb’s paradox

Editorial Staff

15 april 2021

Let’s suppose that we have a game with two players. One is a person that will make a decision and the other is a supercomputer that will try and predict that decision beforehand. This computer has been tested many times and the testers have worked out that the computer can predict the decision of the player with 99% accuracy. The game works as follows: two boxes are placed in front of the player. One box is clear and contains a visible 1,000 euros. The other box is opaque, which means the contents are unknown to the player. This box can either contain nothing or 1,000,000 euros. The player can either choose to open both boxes and keep the contents or only open the second box.

This seems like an easy decision but there is a twist. The computer has predicted the player’s choice beforehand. If the computer predicts that the player will open both boxes it will put nothing in the second box. But if the computer predicts that the player will only open the second box it will put 1,000,000 euros in it. What is the optimal choice for the player? To get a better understanding of the problem we first need to know a little bit about strategic dominance and expected utility theory, the concepts that form the paradox.

Strategic dominance

Strategic dominance is a well known concept in game theory. This phenomenon occurs when, for a given player, a certain strategy always gives the best outcome independent of the decision of the other player. The best way to visualize strategic dominance is by looking at an example using a payoff matrix.

Let’s imagine there are two pharmaceutical companies that both want to develop a vaccine for COVID-19. We call them company A and company B. Now of course the profits they will receive if they decide to develop a vaccine is dependent on the choice of the other company. Since the companies never share their research projects beforehand they also do not know what the other company has decided. Now we have the following numbers. If a company decides not to go for the project their net profits are 0, since they did not spend anything and also did not gain anything. Then if either company makes the vaccines while the other company does not, that company will have a net profit of 1000. Then finally, if both companies make the vaccine, both companies will have a net profit of 400. Now let’s put all this information into a payoff matrix.

	B does not develop a vaccine	B develops vaccines
A does not develop a vaccine	0 , 0	0 , 1000
A develops a vaccine	1000 , 0	400 , 400

We continue by looking from the perspective of company A. If company B chooses to develop the vaccines then it is optimal for company A to also develop the vaccines since 400 is more than 0. And if B chooses not to develop the vaccines it is still optimal for A to develop the vaccines since 1000 is more than 0. Note in both cases it is optimal for company A to develop the vaccines. This is known as a dominant strategy since no matter what company B decides, company A will always make the same choice. We will need this concept for one of the interpretations of Newcomb’s paradox later, so keep this concept in mind.

Expected utility

The second concept we will need is expected utility. Utility can be interpreted as a number that represents how happy something makes you. To keep things simple, assume that utility increases with the same rate as monetary value. This means for example that an increase in money of 1000 euro will result in an increase of 1000 units of utility.

Let’s look at another example with numbers to make things a bit more clear. Once again assume that a company is trying to develop a COVID-19 vaccine. But now we will only look from the perspective of company A and it has become a race between companies A and B. Let’s say the probability of company A being faster in developing a vaccine than B is 0.4. This means that the probability of company B being faster is 0.6. If company A is faster, then their net profits are 800. But if company B is faster, company A will only have a profit of 200. The expected utility of company A can now be calculated as $0.4*800 + 0.6*200 = 440$. Hence the expected utility of company A is 440. This can be interpreted as how happy company A will be on average after developing a COVID-19 vaccine.

The paradox

Now let’s get back to the problem at hand. Why does this seemingly simple choice stir up so much controversy? To get a better understanding of this, we have to see what the difference in decision is when using both earlier discussed methods. Let’s first make the payoff matrix and look at the strategic dominance. Note there are two players; the person making the choice and the supercomputer predicting it.

	Computer makes box 2 empty	Computer puts a million in box 2
Picks both boxes	1,000 + 0	1,000 + 1,000,000
Picks box 2 only	0	1,000,000

Note if the supercomputer makes box 2 empty, it is optimal to choose both boxes since 1,000 euros is more than nothing. If the computer puts 1,000,000 euros in box 2 it is also optimal for the player to pick both boxes since 1,001,000 euros is more than 1,000,000. Hence the dominant strategy for the player is to choose both boxes. Thus the choice seems easy, just pick both boxes since the computer has already made its prediction.

But now let’s look at the alternative interpretation. We know that the supercomputers predictions are 99% accurate. Hence we can calculate the expected utility. The expected utility from picking both boxes is $1000 + 0.99*0 + 0.01*1000000 = 11000$. Then the expected utility of choosing only the second box is $0.99*1000000 + 0.01*0 = 990000$. Now note that 990,000 is a lot more than 11,000. Hence, it seems optimal to only go for box 2.

So how can two valid methods of determining the optimal strategy give different answers? This conundrum what we call Newcomb’s paradox. In the end the optimal strategy is up to the one that needs to take the decision, since both arguments seem very much valid.