Game Theory 101 (#58): Grim Trigger in the Repeated Prisoner's Dilemma

Поділитися
Вставка
  • Опубліковано 29 лис 2024

КОМЕНТАРІ • 75

  • @mrnormalnormal
    @mrnormalnormal 5 років тому +60

    Your game theory 101 collection is absolutely insane.

  • @shaliniramgoolam9987
    @shaliniramgoolam9987 5 років тому +5

    These videos saved me in my econ class! Literally would of failed without you and our dear friend the stag game! Much love from Paris!

  • @ekjungoh
    @ekjungoh 8 років тому +5

    This video has the clearest explanation for Grim Trigger in UA-cam. Thank you!

  • @quakeburns5521
    @quakeburns5521 5 років тому +3

    thank you so much. dont usually post comments but this is one of the best educational videos ive seen

  • @witchiswhich
    @witchiswhich 8 років тому +40

    life saving! i'm having my final in an hour and this helps a lot

    • @Gametheory101
      @Gametheory101  8 років тому +5

      Good luck!

    • @KobiJaro
      @KobiJaro 7 років тому +3

      hahaha me too so life saving x

    • @davidsanta8633
      @davidsanta8633 6 років тому +2

      how was your final?

    • @KobiJaro
      @KobiJaro 6 років тому +4

      @@Omophagy666 this got me a B+ in micro econ

    • @PunmasterSTP
      @PunmasterSTP 3 роки тому +1

      Yeah, how did your final go?

  • @salyz7141
    @salyz7141 5 років тому +14

    Hey, just wanted to tell you that i really appreciate you putting this stuff out here for free. Really enjoyed my game theory class in university this year but our prof was pretty bad with putting up learning material for the exams. This is all explained so well and has helped me so much. Is there any way to compensate you for this instead of continue watching your videos?

    • @Gametheory101
      @Gametheory101  5 років тому +11

      Just spread the word! (Orrrrr...you can also check out the books that I have on Amazon.)

  • @efecancevik602
    @efecancevik602 6 років тому +3

    Thank you William Spaniel. I passed microeconomy theory lesson with your videos help.

  • @daughterofunicorns3873
    @daughterofunicorns3873 2 роки тому +6

    better than my economics lecturer in 18 minutes :)

  • @MrAtmiller5
    @MrAtmiller5 3 роки тому +2

    🙏 I think it's been years since I first watched your videos---Glad I'm still able to return to them for a refresher in my Master's program.

  • @PunmasterSTP
    @PunmasterSTP 3 роки тому +2

    Grim trigger? More like "wisdom: bigger", now that I've been watching your series on game theory! I haven't quite a fun time over the past several weeks making my way through all of these videos; thanks for making them.

  • @lorax4732
    @lorax4732 5 років тому +6

    It'd be better to play this in my Game theory classes rather than taking a stupid lecture with my unqualified professor. Thanks William Spaniel

  • @barshusmenoglu1857
    @barshusmenoglu1857 6 років тому +2

    Thank you so much, you don't have to do this but you do it anyway and it is free!

  • @environmentalsupervisor7611
    @environmentalsupervisor7611 2 роки тому

    I find it completely contained in the way a Cocker Spaniel can't escape a Carpenter who can't get over a traditional drinking problem but still poops where it sleeps. Right next to a little Blue Book of measurements. It's a profound Equalibriam because you can't get over a mistake ⁸ until is cost you everything. It's really about Narcissistic realism.

  • @Sam-fu8rp
    @Sam-fu8rp Рік тому

    Thank you so much William !

  • @JayPatelAFC
    @JayPatelAFC 3 роки тому +1

    helped me a lot! thank you very much!

  • @bofengli5148
    @bofengli5148 6 років тому +1

    Thanks! Finally understood it! Life's saver

  • @dfdfgdfkih
    @dfdfgdfkih 2 роки тому

    Well explained. Thanks for the help

  • @carlobardoli
    @carlobardoli 7 років тому +7

    Thanks a lot ! from a Cambridge University Economics Student

  • @mylesallen8055
    @mylesallen8055 4 роки тому

    Thanks William, super helpful information once more!

  • @acctsys
    @acctsys 2 роки тому

    It's kind of sad, but my conjecture here is that IRL, taking advantage of others when it suits you seems to have the highest payoff when people tend to forgive.

  • @JAFeser
    @JAFeser 8 років тому +2

    wow thx m8 u rly helped me ! greetings from cologne university!

  • @zafadoodle
    @zafadoodle 11 місяців тому

    I’m curious why you’re adding the present value of the 4 on the right side but not the present value of 3 on the left side. When I attempted the exercise before you, I got x = 1?

  • @christospolitis5305
    @christospolitis5305 6 років тому

    Hello! Very good job! But let me ask you: 1) why it doesn't wotk to an arbitarily long game? I mean, cooperation can work for the first stages and stop afterwards. 2) Why defecting in the first period yields a profit of 4? If defecting is beneficial (i.e. a profitable deviation), then a rational opponent is expected to also defect. So it must be 2 instead of 4.

  • @arishamahreenpirzadeh5326
    @arishamahreenpirzadeh5326 7 років тому +1

    Thank you so much for sharing this! 1/2 is the result of the cooperation stage. But shouldn't we compare that with the delta in the defection stage? I don't understand how one should proceed from there. 1/2 only tells us that there is a 50% change that the game continues into a new period, I guess. I don't understand how we should use this information. And what about the delta in the defection stage? Delta ranges from 0-1, if this number is negative does this suggest that the should deviate, as they are assured that the game will end.
    I would love to know how I should interpret delta, and how it solves the game

  • @gabrieljames1579
    @gabrieljames1579 6 років тому +1

    ...covertly applied this strategy to my divorce negotiations. It saved my ass/mitigated losses in the long game; thank you.

  • @gabriellacotton-gambara5153
    @gabriellacotton-gambara5153 7 років тому +2

    So is the subgame perfect nash equilibrium 3, 3?

  • @orktv4673
    @orktv4673 4 роки тому

    I am not quite convinced that cooperation will be deterred in finite prisoner's dilemma games. It sounds sensible: all bets are off during the last game, and if the strategies in the last game are fixed then the second to last game can be considered the last game before the assured mutual defection, so we again have to make the optimal choice, etc. But look at this from a distance, and suppose we play 100 times. Is it truly unreasonable to play as if the game is infinte, up to the last game? Will I be worse off if I tried this?
    All in all this is very reminiscent of the unexpected hanging paradox, which uses backwards induction to get to an absurd conclusion. As far as I have understood, the impossibility of cooperation is suggested by backwards induction and nothing else. It makes me wonder whether backwards induction is really the right tool to analyze games generally. For the infinite game we cannot even use this method in the first place, but if that is the case how can we even compare finite and infinite and say one does not allow for cooperation while the other one does?

  • @g4lger895
    @g4lger895 9 місяців тому

    So excellent

  • @SimonaDemel25
    @SimonaDemel25 5 років тому

    Very helpful videos, thank you so much for posting them! Quick question. In the example where player 1 deviates in the first stage, why is his payoff 4 + 2d + ...? I understand why the 4 is there, but if he's only deviating in the first stage, would he not still be playing "cooperate" in stages 2 - infinity (and thus his payoff would be 4 + 1d + ...)? Or does he intentionally deviate in stages 2 - infinity because he knows that player 2 will play the grim trigger and he will be better off by defecting for the rest of the game?

  • @Borzacchinni
    @Borzacchinni 4 роки тому

    Is it possible to calculate the equilibrium, if the payoffs are not the same?

  • @dongxing7104
    @dongxing7104 5 років тому

    I'm a little bit confused about the one shot deviation principle in this case. If the base strategy is cooperating forever, why isn't its one shot deviation strategy "...cooperate-defect-cooperate-cooperate-cooperate..." ?

    • @Gametheory101
      @Gametheory101  5 років тому

      Well, that is a superficially different deviation. But it turns out that the proof in the video covers it. If deviating in the second period is profitable, then it must be true that it pays more in total for the periods 2 to infinity than maintaining the grim trigger strategy. If you set up the calculation for this, you wind up with the exact same payoffs as in the video, except everything is multiplied by \delta.

    • @dongxing7104
      @dongxing7104 5 років тому

      @@Gametheory101 Thanks for your reply, and here is my calculation. If I choose to cooperate forever, my total payoff is 3+3δ+3δ^2+...=3/(1-δ). If I take the one-shot-deviation-strategy and, say, defecting in the first stage and then returning to cooperate, my total payoff will be 4+1δ+1δ^2+...=4+δ/(1-δ), instant payoff beyond the first stage being 1 because my opponent has triggered to defect forever. Solving 3/(1-δ)>=4+δ/(1-δ) we get δ>=1/3, and this result is consistent no matter at which stage I choose to defect(which covers the whole space of one-shot deviation strategy), so δ>=1/3 => no one-shot deviation exists => SPE, which contradicts with the conclusion at 12:29. I can't figure out where I made the mistake...

  • @tong8326
    @tong8326 5 років тому +1

    life saver!

  • @noyz-anything
    @noyz-anything 6 років тому

    that's called grudger, mister

  • @ivegotaname5400
    @ivegotaname5400 8 років тому

    It's funny that you have to go through all those calculations to prove that getting 3 every round is better than getting 4 one time, and then getting 2 every round for the rest of the game.

    • @danielklein5560
      @danielklein5560 7 років тому

      depends on the discount rate... for small delta its better to get the high payoff in the first round

  • @Lucas-me8ef
    @Lucas-me8ef 6 років тому

    Cant cooperation work on finite game with discount factors?

  • @48laws45
    @48laws45 2 роки тому

    I never liked Bushes rhetoric about 'you are either with us or against us' ultimatum and it certainly didnt work for me LOL this country was always been questionable at best, so I guess that proves the point that you can't have had any previous defectors of cooperation for that to be held as an ultimatum unto to others LOL..highly informative if applied correctly to how the psychology of the group thinks in real time and how they apply this to marketing

  • @aparthipan5903
    @aparthipan5903 7 років тому +2

    what does delta represent?

    • @geetatiwari5781
      @geetatiwari5781 4 роки тому +1

      It's the present value factor. We are in period one so we know that the payoff is going to be 3 but for later periods we need to know the present value of future payoffs. So by using delta which incorporates interest rate, we basically convert the future payoffs in terms of their value in period 1.
      Reply is too late. I hope that it helps

  • @supergreatsuper
    @supergreatsuper 8 років тому +1

    What if it is known that the game will end, but it is not known after how many periods?

    • @Gametheory101
      @Gametheory101  8 років тому

      +supergreatsuper So there is some finite period by which the game MUST end, but the players don't know know in which period the game will end before that? That's a pretty straightforward model to solve if you have worked through the section on repeated games.

    • @supergreatsuper
      @supergreatsuper 8 років тому

      +William Spaniel
      Are you assuming that the players know that the game will end within x turns where x is finite and known? I am asking about where x is finite but not known to the players.

    • @zacharytaylor2423
      @zacharytaylor2423 8 років тому +1

      Then it depends on the players' beliefs about the probability distribution of x.
      Case 1: X has an upper bound which is known to the players. Here, the same logic of finitely repeated games applies. In the prisoner's dilemma example, we know that everyone will defect in the very last possible period, if the game lasts that long. This means everyone will defect in the second to last possible period, because there is no incentive to cooperate. Etc. (You can still get some degree of cooperation in games with multiple equillibria, using "Nash Threats." I'm sure Will covers that somewhere, but I'm not sure where.)
      Case 2: X does not have an upper bound. There is a constant probability p that the game will end in any given period. We can treat this as an infinitely repeated game, where p is the discount factor. (Or, if players are impatient, p is an upper bound on the discount factor.)
      Case 3: X does not have an upper bound, but the probability of the game ending in a given period depends on the period and/or on previous play. In this case, cooperation may still be possible, but depends on the specifics of the game and is well out of the scope of an introductory course.
      I believe Will interpreted your question as asking about what I am calling Case 1.
      -Helpful coauthor

    • @supergreatsuper
      @supergreatsuper 8 років тому

      Is there a simple explanation of what should happen if the players are only told "the game will end at some point" (but are given absolutely no information about when it will end)?

  • @kikones34
    @kikones34 8 років тому

    You forgot to put this video on the playlist! I was confused when I accidentally watched #59 right after #57...

  • @chasemorello60
    @chasemorello60 5 днів тому

    📥💀

  • @nathanbarnard7896
    @nathanbarnard7896 3 роки тому

    Does this require common knowledge?

  • @dylans8198
    @dylans8198 5 років тому

    video takes way too long