GTORangeBuilder Blog

Sure, you can add that data to the post. I can giv...

2014-04-14T18:17:25.382-07:00

Sure, you can add that data to the post. I can give it a second check to be sure figures are fine.

You are right that the nash equilibrium strategy h...

2014-04-14T12:15:02.291-07:00

You are right that the nash equilibrium strategy has him playing scissors as often as we play rock, but he can lose less than 1 quarter. If you tell me the exact strategy that you are proposing we play, I can tell you the counter strategy for him that loses less than 1/4th of the time.

then our strategy should be to play rock as often ...

2014-04-14T12:09:12.266-07:00

then our strategy should be to play rock as often as our opponent plays scissors and paper the rest of the time:
then
we win = r * (1 -s ) + s * s
tie = p * (1 - s) + r * s
we lose = s * (1 - s)

because r >= 1/2 and s <= 1/2 my opponents strategy of playing scissors as often as our paper as much as possible would push him to play 1/2 scissors to counteract our strategy and losing 1/4 of the time is the best he can do.

I'm not 100% sure I understand the strategy yo...

2014-04-14T10:55:45.251-07:00

I'm not 100% sure I understand the strategy you are proposing, but if you are saying we should play 50% rock, 50% paper, our opponent could always play paper when he is allowed to and effectively be playing 50% rock and 50% paper as well. We would then break even against him.

I don't buy any of this. Please explain where ...

2014-04-14T10:29:25.633-07:00

I don't buy any of this. Please explain where I'm wrong.

Over time my opponent will know my strategy and I'll know his.

My opponent would just match the scissors % to my paper %, insofar as he could.

My paper % should always be >= 50% as that is where my advantage lies.

Therefore my opponent can best counter that advantage by playing scissor 1/2 the time.

My only gain then is when I play rock to counteract the scissor.

That gives me
1/4 P v R = +1/4
1/4 P v S = -1/4
1/4 R v S = +1/4
1/4 R v R = 0

That means I win 1/4 of the time.

If I play 2/3 paper, I only win 1/6 of the time vs 1/2 R & 1/2 S

Awesome, thanks for doing that, what program did y...

2014-04-14T06:18:19.031-07:00

Awesome, thanks for doing that, what program did you use if I may ask?

The vs Tight numbers are interesting, especially how giant the exploitative leak is. If its okay with you, I'll edit the post to mention your data.

I improved your analysis in the push/fold scenario...

2014-04-14T05:37:30.796-07:00

I improved your analysis in the push/fold scenario using a better program that will enable me to compute the exact gains in each scenario. I supposed the fish will employ the best ranges (ie the best 35% of hands he should call with as opposed to a sub optimal 35% of hands) which means that obviously in reality he may do worse.
These are the results I got:
GTO bb/100 Max Expl. bb/100 GTO WR % Expl. Leak
vs GTO 0 0 100.00% 0
vs Small Fish 0.39 0.81 48.15% -1.26
vs Med. Fish 1.77 3.54 50.00% -2.82
vs Huge fish 21.18 26.04 81.34% -2.64
vs Tight 20% 1.47 8.82 16.67% -27.63
vs vTight 14% 5.07 28.14 18.02% -41.79

Actually, no, nevermind, I think I screwed up the ...

2014-04-13T14:12:19.128-07:00

Actually, no, nevermind, I think I screwed up the math on that (wrong probabilities for waiting on [2R,1B]). The Anon is correct.

I don't think this is quite right. "For ...

2014-04-13T14:04:25.020-07:00

I don't think this is quite right. "For [2R,1B], betting gives EV = +100/3, but waiting gives and EV = +100/2 (0.5 chance or [2R,0B] which has EV = +100 and 0.5 chance of [1R,1B] which has EV = 0)." You have a 2/3rds chance of going to [1R,1B] and a 1/3rd of going to [2R,0B]

For the four card game, I actually find that the E...

2014-04-13T13:58:07.761-07:00

For the four card game, I actually find that the EV is positive. If you bet with four cards, the EV is obviously 0. If you draw, then you either end up in with [2R,1B] or [1R,2B]. For [2R,1B], betting gives EV = +100/3, but waiting gives and EV = +100/2 (0.5 chance or [2R,0B] which has EV = +100 and 0.5 chance of [1R,1B] which has EV = 0). Thus for [2R,1B] you should draw with EV = +50. For [1B,2R], the EVs are reversed with an EV of -100/3 for betting and -50 for drawing. Thus for [1B,2R] you should bet overall with EV -100/3. Overall, the EV for drawing in the 4-card game is: 0.5*EV(draw,[2R,1B]) + 0.5*EV(bet,[1R,2B]) = +25/3.

So the strategy for the 4-card game is to first draw a card. If the card is red, then bet on the next card being red. If the card is black, keep drawing until you draw the last black card from the deck or are forced to bet on the last card.

It also seems true for eight and ten, so I'm a...

2014-04-12T07:10:19.927-07:00

It also seems true for eight and ten, so I'm a little skeptical that it shifts at a critical point. Unlike a game that does shift (say the pirate game once you extend beyond 2*g) there is no numerical constraint that would alter the play. However, I'm incredibly rusty at game theory, such that I don't remember anything beyond basic backwards induction, so I could be missing something obvious. I look forward to seeing the more advanced techniques and the solution.

As an aside, please keep posting these. They're a nice way to practice game theory.

This is pretty good logic and you are right in the...

2014-04-12T05:42:24.815-07:00

This is pretty good logic and you are right in the 4 and the 6 case. There are techniques for backwards inducting through all 52 cards which I will show in the solution and there are cases where things appear true for small numbers like 4 and 6 and then shift at some critical point. I won't give away the solution by saying whether this is one of these cases or not :)

You are definitely correct, that is an error, sorr...

2014-04-12T05:37:45.341-07:00

You are definitely correct, that is an error, sorry about that. It doesn't effect the solution, and I've corrected it above, thanks for pointing it out!

Glad you like the brainteasers :)

I'm loving these brainteasers, really hope to ...

2014-04-12T03:48:24.176-07:00

I'm loving these brainteasers, really hope to see more of them.

One thing though that I'm a bit confused about: in the bonus puzzle solution when we solve for player 1's EVs, why is the second round EV represented by 50 * r2? Should it not be 50 * (1 - r2) since we only auto-win the second round if player 2 *doesn't* play rock? It's also inconsistent with one of the following paragraphs saying that, if we plug r2=p2=s2=1/3 in we get an EV of $33.33... which we don't since it simplifies to 50 * 1/3 = $16.67. Seems like an error to me...

Here are my thoughts, which may be wrong due to ca...

2014-04-11T21:25:53.295-07:00

Here are my thoughts, which may be wrong due to calculation error but seem intuitive:

Backwards induction for both a four and six card game suggests there is no difference between betting and not betting at the first card. The stipulation that one must bet on the final card removes the slight advantage that would be otherwise present from trying for a more favorable ratio of colors, because it counteracts every potential positive outcome with a forced loss. It is unreasonable and impractical to conduct backwards induction for the 52 card game, but the analysis can be extrapolated from that on the games of manageable size; it becomes clear that every game starting with an even ratio is composed of even subgames (with expected value zero) and offsetting uneven subgames. Therefore the optimal strategy is simply to bet on the first card every time, since you are not indifferent to the time it would take to progress through a game. Under this logic, of course, the real optimal strategy is not to play.

2014-04-11T13:18:08.374-07:00

This comment has been removed by the author.

That's a nice and clean math.

2014-04-08T00:55:03.054-07:00

That's a nice and clean math.

Repost of my first comment which failed to appear ...

2014-04-07T05:39:59.594-07:00

Repost of my first comment which failed to appear :

Bonus :
The optimal strategy if my opponent is intelligent and can predict my moves as I can predict his would be for me to make a random choice (head or tails) betwwen paper and rock on the first round, and for him to do the same.
I would then gain 25$ on average which is what I would be willing to pay.
(By "on average" here I mean : a means of all possible results. I know there are only two rounds.)

Explanation :
The only difficulty lies in the strategy for the first round. In the second round, my opponent will play rock if he hasn't play rock before, and a random choice of P/R/S if he has. I will play paper if he hasn't played rock before, and gain 50$, or play a random selection of P/R/S and gain 0$ on average if he has played rock before.

There are only 9 possible games for the first round :
If I play R and he plays R, I will gain 0$ on average for the two rounds.
If I play R and he plays P, I will gain 0$ on average for the two rounds.
If I play R and he plays S, I will gain 1000$ for the two rounds.

If I play P and he plays R, I will gain 50$ on average for the two rounds.
If I play P and he plays P, I will gain 50$ for the two rounds.
If I play P and he plays S, I will gain 0$ for the two rounds.

If I play S and he plays R, I will lose 50$ on average for the two rounds.
If I play S and he plays P, I will gain 100$ on average for the two rounds.
If I play S and he plays S, I will gain 50$ on average for the two rounds.

So if my opponent plays randomly in the first round, all options will give me the same gain on average.
But if I play randomly, the situation is very different for him, as he can minimize my gain to an average of zero by playing rock.
But of course I could predict that and play paper in the first round and gain 50$ if he has played rock.
Which he could then predict, and play scissors to beat me, which I could predict and play rock, which he could predict, which I could predict...
If I can predict his move I can counter him, and if he can predict mine he can counter me.
So my best choice is to be unrpedictable and that goes for him too, while I still keep in mind that playing rock first is the best strategy for him if I make a random choice between the three options.
So my best choice is to make a random choice between paper and rock, thus optimizing my results against rock while remaining upredictable.
If he can predict that, it will be best for him to avoid playing scissors, because against my rock or paper his scissors will gain me 50$ on average, instead of 25$ for his rock or paper against mine.

Follow-up to my earlier comment : I got it wrong....

2014-04-07T05:34:08.161-07:00

Follow-up to my earlier comment :
I got it wrong. I can still improve my strategy and pay as much as 33.33$to play if I throw a die, and play paper if I get 1,2,3,4, and rock if I get 5 or 6. So a 2/3 chance to play paper, 1/3 chance which of course just matches the result of the first problem.
In that case, there is no optimal strategy for my opponent, he can play whatever he wants.
If I increase the odds that I play paper to more than 50%, thent it becomes interesting for him to play scissors which will give me a 0$ gain against my paper, but it is balanced by the risk he might lose 100$ if I still play rock. A 2/3 chance of paper vs 1/3 chance of rock is the equilibrium. Whatever he chooses to play then will make me gain 33.33$ on average, whether he plays scissors or not.

I can see that several other people have come up w...

2014-04-07T04:15:15.581-07:00

I can see that several other people have come up with this answer already but I will post my solution as well.

The Expected Value (EV) of regular RPS is 0 for both players, using the strategy of 1/3 for each possibility. The constrained player is unable to play this strategy so EV(us) > 0.

Because the constrained player must play rock at least 1/2 of the time, if we play scissors with P > 0, then we will lose at least 50% of the time. If the game is to have positive EV, then we should be able to do better.

In any Nash equilibrium, neither player must be able to better by changing their strategies. After ruling out every pure strategy (which I will leave up to the reader), we can see that mixed strategies are required. In any continuous, mixed strategy equilibrium, players will be indifferent between the strategies that play that have non-zero probability (otherwise they could do better by adjusting the probabilities).

We want to find probabilities that will make our opponent indifferent to playing paper or scissors. Therefore:
Payoff (opponent, scissors) = P(us, paper) - P(us, rock)
and Payoff (opponent, paper) = P(us, rock) - P(us, scissors)
are equal. But P(us, scissors) = 0, so this simplifies to:

P(us, paper) - P(us rock) = P(us, rock) or
P(us, paper) = 2 * P(us rock)

Since P(us, paper) + P(us, rock) + P(us, scissors) = 1, we get:
3 * P(us rock) = 1 or,
P(us, rock) = 1/3, and
P(us, paper) = 2/3.

In an equilibrium, we must be indifferent between paper and rock. Using the same logic above, we get P(opponent, scissors) = 2/3 and P(opponent, paper) = 1/3, otherwise it would our payoff would be higher under rock or paper.

From this we can calculate the EV of 100/6 = 16.66.

Bonus : The optimal strategy if my opponent is int...

2014-04-07T03:28:31.459-07:00

Bonus :
The optimal strategy if my opponent is intelligent and can predict my moves as I can predict his would be for me to make a random choice (head or tails) betwwen paper and rock on the first round, and for him to do the same.
I would then gain 25$ on average which is what I would be willing to pay.
(By "on average" here I mean : a means of all possible results. I know there are only two rounds.)

Explanation :
The only difficulty lies in the strategy for the first round. In the second round, my opponent will play rock if he hasn't play rock before, and a random choice of P/R/S if he has. I will play paper if he hasn't played rock before, and gain 50$, or play a random selection of P/R/S and gain 0$ on average if he has played rock before.

There are only 9 possible games for the first round :
If I play R and he plays R, I will gain 0$ on average for the two rounds.
If I play R and he plays P, I will gain 0$ on average for the two rounds.
If I play R and he plays S, I will gain 1000$ for the two rounds.

If I play P and he plays R, I will gain 50$ on average for the two rounds.
If I play P and he plays P, I will gain 50$ for the two rounds.
If I play P and he plays S, I will gain 0$ for the two rounds.

If I play S and he plays R, I will lose 50$ on average for the two rounds.
If I play S and he plays P, I will gain 100$ on average for the two rounds.
If I play S and he plays S, I will gain 50$ on average for the two rounds.

So if my opponent plays randomly in the first round, all options will give me the same gain on average.
But if I play randomly, the situation is very different for him, as he can minimize my gain to an average of zero by playing rock.
But of course I could predict that and play paper in the first round and gain 50$ if he has played rock.
Which he could then predict, and play scissors to beat me, which I could predict and play rock, which he could predict, which I could predict...
If I can predict his move I can counter him, and if he can predict mine he can counter me.
So my best choice is to be unrpedictable and that goes for him too, while I still keep in mind that playing rock first is the best strategy for him if I make a random choice between the three options.
So my best choice is to make a random choice between paper and rock, thus optimizing my results against rock while remaining upredictable.
If he can predict that, it will be best for him to avoid playing scissors, because against my rock or paper his scissors will gain me 50$ on average, instead of 25$ for his rock or paper against mine.

2014-04-06T22:14:39.488-07:00

This comment has been removed by the author.

Spoiler. The most you should be willing to pay is...

2014-04-06T22:07:35.254-07:00

Spoiler.

The most you should be willing to pay is 100/6 = $16.66.

I think that the equilibrium strategy is for the unconstrained player to play paper 2/3 of the time and rock 1/3 of the time. The constrained player plays scissors 2/3 and paper 1/3 of the time he gets to choose.

That is entertaining, I like that a lot, thanks fo...

2014-04-06T18:26:14.110-07:00

That is entertaining, I like that a lot, thanks for sharing!

Your explanation was not wlog. Using your logic ch...

2014-04-06T18:09:59.380-07:00

Your explanation was not wlog. Using your logic choose paper 100%. Then wlog fix R at 100%. You win every game! Wow, Clever!

Lets look at a better strategy against yours. Suppose your opponent chooses Scissors and Rock just as often. Then we have as the expected number of wins the following: .5(2/3 -1/6) + .5(-2/3 + 1/6) = 0 when using your strategy. You break even. If you never play scissors, this works out better.