I created an AI to Play Poker

Steven Gong

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 22 гру 2024

КОМЕНТАРІ • 295

@AskiFin 5 місяців тому ⁺⁸⁹⁶
There is the famous guy who won the AI only poker tournament.. His AI just always went all in, and the other AI's thought since its a large bet, they have better hand and just folded.
@Sandeeppandek21 5 місяців тому ⁺¹⁰
😂😂
@Ohrami 5 місяців тому ⁺⁶⁷
Then those AIs were completely horrible, or the one that only went all-in got incredibly lucky.
@AskiFin 5 місяців тому ⁺⁵²
@@Ohrami It was a school project kinda thing, so I assume it was mix of both.
@preedeelosoponkul993 5 місяців тому ⁺¹
Others got no AA,KK ?
@djdev182 5 місяців тому
Lol😂
@stevengongg 5 місяців тому ⁺¹⁷⁵
This has been a project that I've been working on for a pretty long time... so I'm happy to finally release it! Let me know if you guys want a follow-up, since there are still quite a few things I can improve the AI on. I'd like to challenge an actual professional poker player next time. If you enjoyed watching the video, make sure to leave a like, it helps grow the channel :))
@thecodingmentor7701 5 місяців тому ⁺⁵
This is an incredible video. You combined two of my facorite things, AI and Poker. Cant believe this does not have more views! Such a high production quality also. As for me, I would definitely like a follow up to this video!
@JimStanfield-zo2pz 5 місяців тому ⁺³
It sucks dude. I built a way better one. My poker bot would destroy yours
@Imokay304 5 місяців тому ⁺¹
AI will take over eventually. Its not smart to be making videos like this. It will definitely ruin online poker one day if it hasn't already... Sad shame
@ahsidodna3355 5 місяців тому
for sure, this is very interesting, please continue
@perfectionist8380 5 місяців тому
Maybe try training the AI on all the top poker tournaments (or just Dan Negreanu strategy) & also use facial recognition for player emotion/stress.
Or simply take into account the time to react factor.
Great project overall though! Kudos
@worldofpoker 5 місяців тому ⁺¹⁰⁰
Well done on the poker AI project, challenging your friends. It was definitely worth it, but there are obviously a few areas that use some improvement. Your explanation of Nash and CFR was quite solid, while the practical implementation seemed somewhat missing in depth, especially handling complex game states beyond simple abstractions.
This, in consequence of the pre-flop and flop separation, is what some may consider a big flaw for a more robust AI. Your idea to use Equity for card abstraction is great, but it seems that your method hugely oversimplifies the complex situations, as you noted with very similar equity hands playing differently.
Now, this would be of interest: a tournament of poker AI systems. If you're game for it, let's set up a match against Poker AI from pokerbotAI. It will be impressive to see how different strategies and their various implementations fare against each other.
@DalePumento Місяць тому
Variable change is a big factor too. Like the 66 and KQ equity, and each of their distinct strategy. In a single raise pots, both 66 and KQ are 100% raise to an unopened pot in any position. Post flop is where it differs. 66 needs to pot control on 1 over card board texture. Meanwhile, KQ can Cbet 1/3 pot to deny equity and/or semibluff as the aggressor. Variable change for poker is then the product of nut advantage and range advantage.
@worldofpoker Місяць тому
@@DalePumento The adaptation of strategy according to hand strength and position is key, and the variations of changes in post-flop decisions are huge. Distinctions between hands like 66 and KQ, and their respective strategies based on board texture, all point to complexities that a more advanced AI should address.
@DalePumento Місяць тому
@@worldofpoker I agree with you that OP oversimplified his AI model. The decision tree is too large for the model that he designed. Hopefully with better time and more model training, he could improve it.
@ffishcakess 5 місяців тому ⁺¹⁰⁵
Hahhaa the punt jam on AA2 followed by the even bigger punt call "I'm folding, don't get used to it" - I believe it lmao 😂
@JonCookeBridge 5 місяців тому ⁺⁴
Confused why it calls the huge jam on AA2 with QT. Is it because pre-flop and flop are decoupled, somehow? Or is it not realising the difference between all in and small bet?
@BarvGwydh 5 місяців тому ⁺¹¹
It could mainly be because this spot never comes up in nash equilibrium strategy. I think the bot is not playing GTO in general and has some flaws, but even a well made bot might struggle with this spot because 6x pot jam should never be played.
@JonCookeBridge 5 місяців тому ⁺⁵
@@BarvGwydh Possibly. But watching the film I suspect his preflop model isn’t putting enough aces in this line because it’s decoupled from post flop action. Obviously with more aces in his range here, the need to call weaker hands reduces. To get the algorithm to compute a balanced preflop strategy requires CFR to be done across multiple streets, and he implies that he solved preflop and postflop separately. Basically if you just train a preflop model based on maximising equity before you see the flop, your AI won’t have enough board coverage when it’s deep, and it won’t slow play enough because it won’t see the value it gets from those things post-flop.
@Kingcrowish 5 місяців тому ⁺²¹
The timing on the Helmuth rage clip was great
@tfilipeac 5 місяців тому ⁺²¹
Hey! Poker player here, first of all grat job, must not be easy at all!
I think you would have better success playing against your friends (and bad poker player in general) if you programme your AI not to play in equilibrium but to play unbalanced in order to exploit these weaker players' leaks!
@PsylocibeChillZen 5 місяців тому ⁺²²
why there is 2 cards 6 of diamonds on the deck? at 12:55
@Johnny2Feathers 5 місяців тому
Nice catch
@VeriStrawberi 5 місяців тому ⁺²
At 13:09 its a completely different board ?3QQK too lol He wins with 6d7c
@MarijnRoorda 5 місяців тому ⁺⁶¹
I went from a honest government ad warning me about AI, to this...
@Christoff070 5 місяців тому
Great vid that one
@authenticallysuperficial9874 5 місяців тому ⁺⁸
"Honest government" is an oxymoron
@Christoff070 5 місяців тому
@@authenticallysuperficial9874 yup,that's the point of it
@MarijnRoorda 5 місяців тому
@@authenticallysuperficial9874 It's a youtube channel that warns people for the bullshit during elections. It has pretty girls and plenty of swearing in australien. The Juice media.
@HamguyBacon 5 місяців тому
"government warning about ai" 😂
@CreativeMindstorms 5 місяців тому ⁺¹⁴
I have been wanting to see this for so long! Great job on the AI, it seemed to be making right plays most of the time!
@fauge7 5 місяців тому ⁺¹
Yeah but most of the time is bad enough to make you go broke...
@B00nater 5 місяців тому
It really isn’t. Seems easy to exploit.
@fanouzhang3887 5 місяців тому ⁺⁷
The AI didn't seem to be playing very close to a gto solution of most spots
@haviick1014 5 місяців тому ⁺⁷
This needs a part 2
@tehf00n 5 місяців тому ⁺³
A friend of mine coded a poker bot for texas hold 'em back around 2006. He set one instance running 1c/2c cash game tables and was returning around 10c/day profit playing the odds and position only. I haven't seen him for years to see how he went once he weighted it for variance and stake styles.
@spacebomb9126 5 місяців тому ⁺²
He's probably in prison now for theft on gambling sites.
@tehf00n 5 місяців тому
@@spacebomb9126 I highly doubt it. The guy was a top level coder and wrote most of the original office polymorphic viruses for bug bounties with Microsoft. And worked a government job. More likely he knew his way around a VPN and a dummy account.
@ChrisMaveric 5 місяців тому ⁺⁹
Just a heads up... your video shows two 6 of Diamonds, one is in the AI hand and then a second is the turn card...
@icanyo9301 5 місяців тому ⁺⁵
this is so sick! i need to learn to code cus my god it seems so satisfying completing a project like this
@AbomidablePlayer 5 місяців тому ⁺⁵
The Phil Helmuth reference is gold!
@pipipupukok 5 місяців тому ⁺²
I absolutely love your videos man, you inspired me alot, and now i will be an intern in the biggest it company in poland at 16 years old!
@stevengongg 5 місяців тому ⁺¹
good job!! keep pushing
@JoseLopez-qm4ro 5 місяців тому ⁺¹
Good job man, 👏 props to you regardless of the loss...Take failure, learn from it and embrace it to always make your self better..
@SeanP1982 17 днів тому
The hand at 12:55 has 2 x 6 of Diamonds, how did your AI not throuw an error for 2 of the same card (an invalid outcome in poker) did you give the AI rules to follow? How can the AI calculate anything like this?
@anghungnguyen5115 Місяць тому
I can not train . it always fails on the turn or river . please help me
@kaitan5508 4 місяці тому ⁺³
AI was on tilt after the punt all-in call, its ok, we've all been there 😫😫
@jasonchan64 5 місяців тому ⁺¹
nice work!! keep going! dont be surprised that AI would ask you to call the last hand or even QTs hands, just the solution come from the AI is based on the optimum play against the player play perfectly (or at least much more bluff than your friends) so to compete with your friends you might need to do a lot of adjustment which your AI couldnt understand
@rogerzhang5993 5 місяців тому
It just means the humans are not playing at Nash equilibrium
@MarioOLL6251 9 днів тому
Oh heck yeah, the Daniel vs Phil clip is so iconic
@marcuslaxson6237 5 місяців тому
wow this channel is going to blow up, great idea and very fun execution!
@jujubaloolian 5 місяців тому ⁺⁴
It would be interesting to see this how this would develop further if the AI takes into account the position it is sitting at the table to see how many players bet before or would have an opportunity to bet after. Its interesting to see how other players could interpret the AI's decisions, most poker players would tell you to always bet/raise/reraise the same amount preflop for hands you play, as changing the bet amount would only signal that you have a strong hand or were bluffing. 😃
@BarvGwydh 5 місяців тому ⁺¹
The bot will use randomization so that sizes do not make it exploitable, it will also simply be balanced no matter what, it will have an unexploitable mix of value and bluffs. Nash equilibrium cannot be beaten, at best the opponent can break even if they also use a nash equilibrium strategy, like the rock paper scissors example where the NE strategy is to pick each action 1/3 of the time. If the bot used multiple sizes it will still balance them, or for example, if it uses multiple sizes preflop then it might use 2bb half the time and 2.5bb the other half of the time if it wanted to open raise a hand to 2 different sizes.
@jujubaloolian 5 місяців тому ⁺³
@@BarvGwydh That makes sense for the betting, but I still think it would make a difference based off the position you are sitting. If the bot calls/raises a lower equity hand at the bottom of its range, it has a higher chance of winning when it bets last at the table and there is only one other person in the pot then vs when its UTG and is betting the same hand into 6 people or calling the hand with 6 people in the pot. Nash equilibrium can be beaten in a game with more than two people if others have shifting strategies - it only ensures that a single player can't improve by deviating.
@Zarl451 2 місяці тому
Interesting effort, part of the problem you may be encountering is that the GTO solvers generally need an input on your opponents expected range.
GTO by default assumes the other person is also playing GTO. But as they deviate the optimal strategy is for you to also deviate.
@MrZweene 2 місяці тому ⁺²
That’s bs. How can a game theory optimum depend on other people behavior?
@Zarl451 2 місяці тому
@@MrZweene The maths has no problem at all with the "optimum" strategy existing at the level that your response would be unique to every possible opponent in every possible spot. The need for a generalised set of actions is an engineering and practical one not a mathematical one.
@BadMouth 20 днів тому ⁺¹
give me that program
@Isaac-bb3pj 4 місяці тому
This was really fascinating, would love to see a tournament against pros (but like a table of 6)
@lemmyorleans 5 місяців тому
Youd want to get a complete solve of the game tree for several different opening bet sizes and feed that into your ai. It wouldnt even need to do any ai magic unless you end up in a branch of the tree you didnt solve for. Every presolved scenario would just be a database lookup.
@authenticallysuperficial9874 5 місяців тому
0:30 Please fix this issue in your audio where you played both channels at once and they got garbled.
@laijiahoe3815 Місяць тому
may I know how to install it?
@puffypastels1172 5 місяців тому
Hi! I was wondering what tutorial you watched at 5:55, i'm super interesting in this stuff lol
@carter3144 4 місяці тому
It would be difficult to have the AI factor in the opponents body language and other tells, but what about their strategy? Would it be possible for the AI to utilize in game machine learning to adapt its playing style based on the opponents hands (that they don't muck), raise frequency, bet sizing... etc.
Additionally, as we all know, playing poker with fake money doesn't really work super well. I was wondering if you and your friends were using real money or just chips? This would drastically change how your friends were playing.
Great video, I learned a lot. Thanks for taking the time!!
@TrygveVang 5 місяців тому ⁺¹
This vid deserves 1000x more views that what it is at now holy moly
@pocket8s 5 місяців тому ⁺¹
okay now make it solve multiway hands! nice vid 😊
@AshtonRiddell 5 місяців тому
I wonder if the ai treats calling the all in as a semi bluff
@JMRSplatt 5 місяців тому
"Private" Cards.. I love it. I'm using this instead of hole cards from now on. Sir! These are my on cards, they are private!
@mrjaekwan73 Місяць тому
What RL model did u use?
@RandomFunZer 5 місяців тому ⁺¹
As an actual full-time professional kt was quite interesting to watch. But this ai at its current state would get absolutely destroyed by a real pro :).
Still a fun endeavour and for sure very hard to come up with making this system. Good job
@marcusmartin7576 5 місяців тому
I wanted to attempt a project like this myself I would consider using poker solvers as the basis of a strategy and using solvers as a database because they implement gto and give you the proper move in every scenario
@terencehill3972 5 місяців тому
why the dancing animals? WHY?? 3:58..."Poker is a game of people" - D. Brunson check and mate no seriously, well-made video you made me laugh thanks, love the AI voice its hilarious 🤭
@qlf9_ Місяць тому
did not expect cornell cs 2850 to make a cameo in this video
@marekbalco9594 5 місяців тому
How can I run this code to be able adding my hand cards and board cards?
@KanakManghi-xp5qv 4 місяці тому
how can I use this software?
@kailinaleekuk 2 місяці тому ⁺¹
The goal of Texas Holdem is not to make the best 5 card hand. Your goal in Texas Holdem is to put yourself in situations where you have the larger share of equity.
@ortizbryan27 2 місяці тому
I love by the end, the A. I, through training of other people playing, become a degenerative gambler. 😂😂😂
@freakoutz2707 4 місяці тому
Love the video man. Just wish your Mike was a little bit bit better. Only complaint though keep running.
@noThankyou-g5c 5 місяців тому
what is second top pair? idk if that’s a real term, not trying to be an asshole. also im not sure id call that raise a semi bluff, i at least normally associate semi bluffs w a hand that has weak showdown value but has considerable equity to a hand that could win at showdown. I’m not very good at poker all things considered and dont play a lot of heads up but from what I do know about heads up I’d say it’s more likely that second pair is just a good enough hand to raise. In heads up poker things like second, third, even bottom pair, are relatively good hands in the grand scheme of things, or at the very least theyre considerably better than second pair in 6max or full ring poker where 2nd pair is generally just seen as a bluff catcher/showdown hand
@AlessandroOrlandi83 5 місяців тому
Thanks, this was very interesting as for curiosity I wanted to implement it myself but never got the time to do it.
@Bot-Frenzy 5 місяців тому ⁺¹
Nice work Steven! Keep it up 👍🏻
@blueplayer6197 8 днів тому
So.. Does it just call and raise all the time?
@queenstown9534 5 місяців тому ⁺²
12:51 Heads up, I wouldn't exactly call this a semi-bluff as BB's range should be way weaker than the AI's, so it's a value raise. Very standard.
13:26 This hand is atrocious. I'm guessing you guys are playing 50/100 this hand, hard to tell. In which case we're ~30bb effective and this should just be a fold for the AI as even against the bottom of SB's range, it's not looking too hot. Plus, AI blocks the more natural semi-bluffs like Qxhh for a second nut flush draw. If we can really call a 7.25x shove a natural semi-bluff.
Why does the AI call? If I *had* to rationalize a call, I would say that this shove is either the nuts or air and because we block the Qh, we have more equity against most air hands, except Khx. This is a horrible justification for a call but, it's what it might be thinking.
Interesting note, at 20bb effective, solver calls QhTd at ~10% frequency.
15:02 It might seem insane but this hand is not so horrible from your AI's perspective. It's actually correct from a GTO perspective. Reason for the call is that you are beating 137/185 hands (74.5% equity) and getting ~1.53:1 pot odds (~40% equity required to call and break even). If we folded every time, we would be over folding and losing money long run given the price is just so good.
I think your AI plays kinda okay, considering it's only learnt over 1m hands. The QT hand is crazy, one other consideration is that it's not gonna adjust to the mistakes made by other players so it's gonna make some whack decisions like this one. I'd be interested to play it and also see how it plays against GTO Wizard.
@TheOnTiltBoys 5 місяців тому ⁺⁵
we will demolish that AI
@jttech44 5 місяців тому ⁺¹
ChatGPT? More like ChatATM
@matteovc2158 5 місяців тому
You guys don't always seem to be playing all that great, but I think even you would be able to beat this so called GTO playing AI. I think every action was just horrible.
@tunglee4349 5 місяців тому
impressive! I love this topic! Did u use RL to train?
@cococam3411 4 місяці тому
Well, first I'd like to congratulate and thank you for your immaculate work. But I'm running into some problems, I did everything by the book, downloaded the stuff with git, but when I run the script, I see a table, with 2 players, cards folded, Total pot $800, but how does it work, I'm completely clueless, I'm running Ubuntu 22.04.
Thanks in advance for your reply.
@anghungnguyen5115 Місяць тому
Hi , I also have some problems. Please help me. It not work
@rainluo4831 5 місяців тому
Love your videos!!! Keep pushing❤
@pipishrimp_poker 5 місяців тому
why dont you just use reinforcement learning? itspretty ez to code, just need a long time to train
@seanrobinson5740 4 місяці тому
you should secretly scan your friends cards and make your Ai win 100% of the time 😂😂
@TomRauhe 5 місяців тому ⁺¹
So how is this different from all the existing solvers out there?
@stevengongg 5 місяців тому ⁺³
not that much different, I implemented it for my own intellectual curiosity :)
@u10-bb5lf 5 місяців тому
it would be so lovely if you would do the Python tutorial or some kind of Playlist to educate your audience. Keep up, love your content!!
@spyseefan975 4 місяці тому ⁺³
Isn't poker solved already?
@Ярослав-н3р3ь Місяць тому
Wdym
@chancelyons7401 Місяць тому
Heads up poker is "solved" but add one more person and all the math becomes gibberish (for now)
@robbolewis6878 5 місяців тому
is it allowed to use AI in online poker tournaments?
@timonix2 5 місяців тому
No, its against terms of service for just about every platform. I bet there are tons of them out there though
@spacebomb9126 5 місяців тому
Nope, and it could get you in prison in some cases if you try it.
@belmicah 25 днів тому
His AI was a total calling station lol
@timhutter 5 днів тому
Wow you are absolutely impressive!
@xxtripl3thr33xx Місяць тому
An I wrong to think the AI saw quickly that “he who bets last and bets the biggest usually wins” then figured the best way to do that was to bet as much as it could as soon as possible?
@TheKooldude75 5 місяців тому
pretty cool idea, imagine if you got glasses w/ a camera and an earpiece and took this AI to a casino
@jttech44 5 місяців тому ⁺¹
Sounds like a great way to get kicked out of a casino
@spacebomb9126 5 місяців тому ⁺¹
There's an old saying "F*ck around and find out."
Try it and see what happens, kiddo.
@frey8893 5 місяців тому
this AI plays worse than the biggest fishes I have ever seen lmao
@natereg6558 8 днів тому
Can tell this guy was struggling to say equilibrium lol
@Jordan-Tech 14 днів тому
the problem with playing optimal strat against relatively not good players is that they don't play correctly so it's harder to beat them.. "The unpredictability of suboptimal play"
- In game theory, optimal strategies assume rational decision-making and predictable behavior based on maximizing utility or winning chances. Bad players often deviate from these rational strategies, making their moves harder to predict. Good players, who are skilled at countering optimal strategies, may find themselves struggling against chaotic or irrational play.
@francolauciello4429 5 місяців тому
You should’ve made a results part at the end, good video thooo 🎉🎉
@TsarHare 4 місяці тому ⁺¹
the cards delt, of 52 say exactly the cards that will come. like it am known. play the new drones am greater fun
@zoroark567 4 місяці тому
The concept of a style of play that can't be exploited is interesting but not actually the 'best' strategy in poker. The biggest winners are the ones who are able to exploit their opponents blindspots and weaknesses, which necessarily involves exposing themselves to risk of counter-exploitation.
@TsarHare 4 місяці тому ⁺¹
like the percentage of the cards yourself have increase every fold, as well with every card delt.
@StatisticianInStilettos 27 днів тому
Love this!
@Techn0man1ac 5 місяців тому
8:12 LOL
@angelma9087 17 днів тому
Jajaja me gustó mucho, si le agregas el razonamiento del rango de manos que puede tener el rival según sus acciones, su nivel va a mejorar drásticamente, por eso hacia call en los all in, porque no consideraba la mano del rival fuera buena, solo que fuera una al azar.
@jb123rnd 5 місяців тому ⁺³
I agree with the general reaction to the last hand I don't see how the math could be right, it should be a significantly lower EV to stay in (easy to calculate) especially if you consider the weight of the bet / pool size against the percentages you reported, in fact I think your opponent started to realize that issue/calculation bug in the previous all in bet, I haven't read any papers on this but even with imperfect information the bet/pool size should have a significant effect on the odds (even if some Nash optimization requires some additional randomness injected into that decision) . I would gamble a normal CNN with a lot of training should outperform this vastly (in its current form, you should be able to beat it if you can figure out the true Nash value of each state).
Either way its a cool project, but I think there are much more simpler approaches and code implementations that could perform better here. Thumbs up for a great project idea regardless.
@TheInternetofRandom 5 місяців тому
Id say that you programmed it in a mathematical way using equity. I dont know much about programming so please correct me if I am wrong but a better way would be the AI narrowing the opponents range down using the information that the player gives and use that to make accurate decisions
@tehf00n 5 місяців тому
You could have all kinds of heuristics for a player if you were recording their decisions but that might contravene any kind of rules of a provider. But for personal use you could do all what you suggested and go even further to exploit the Mersenne Twist algorithm that all casinos use to shuffle their poker decks.
@SchmidtyBrothers 5 місяців тому
This video was awesome! Don't really have anything else to say just commenting so this gets more views.
@NSelsted 5 місяців тому
It didn't only make 1 mistake. After it calls with the first all-in with the QT, it is clearly that it is easy to exploit. You wait until you hit a hand, where the AI hasn't shown particular strength, and then you go all-in if you have hit. That it want's to call 1/3 of the time in the last hand, shows that this simple strategy will most likely win close to 100% of the time. There are other exploits, but you are far behind to win against any pro poker player. You have a lot more work to do, before it stand a chance against even semi-pros.
@daylen577 5 місяців тому ⁺³
What I'm missing in this is the AI trying to figure out the cards other players have based on their individual strategies, actions, and known data. For example if it figures out after a round or two that a specific player bets high and ends up winning, that's a pretty good indicator that when they bet high they have cards that would let them win. The AI knows the cards it has and the cards on the table, so by matching all of that together it could make a reasonable assumption about how likely each player is to have a winning hand.
@BarvGwydh 5 місяців тому ⁺¹⁰
It doesn't need to. In poker it's just extremely hard to not lose tremendously vs a nash equilibrium strategy and it's impossible to win. The bot can just play as if it's playing against an unexploitable opponent, and it will crush someone whose strategy has nothing to do with the unexploitable strategy, and it will comfortably beat a human who has spent a decade studying nash equilibrium strategies. If it adjusted to play against the poor strategy it would absolutely decimate them even moreso. However, in such a small sample, we will not discover someone's strategy. And trying to make significant adjustments based on a small sample just opens up the bot to being exploited itself. Sometimes human vs human you can find ways to exploit someone's strategy in a comically small sample like 100 hands, mostly because a lot of humans are terrible at the game, but a bot would really need like 10,000 hands just to get a vague idea of someone's strategy. Plus, if it adjusts before that, a smart player could intentionally make it adjust one way then overadjust their own strategy the other way to exploit the bot.
If anything, it might be better to datamine online heads-up games and take an aggregate strategy of all profitable humans, and the bot could maybe even play a strategy that's mostly GTO with some adjustments mixed in from this online data.
TLDR trying to make a bot adjust its strategy based on its opponent is probably a bad idea unless it's making very small careful adjustments over 10k+, 100k+ hands.
@BarvGwydh 5 місяців тому ⁺⁸
Actually I take back some of what I said. I think a bot could squeeze out a lot of EV by making adjustments over small samples as small as like 50 hands. But it could open the bot up to being exploited for sure. And in any case, the nash equilibrium strategy doesn't need to be improved, it's unbeatable.
@queenstown9534 5 місяців тому
It's just playing according to its preset understanding of optimal play. If it was an actual AI that was taking in other player's tendencies, it wouldn't bother making theoretically correct calls like the 54o hand as it'd know it's rarely good against Brandon. Against Olivier Busquet, though, I'd say it's not a horrible call.
Edit:
Think of solvers (like this AI) more as extremely complex and detailed flow charts.
@evadecaptcha Місяць тому
The problem is that calling there 30% of the time is WAY too much lol. It seems to provide a percentage chance to take actions that should be a 0% frequency (like the call with Q high). This AI will hemorrhage money. I'm guessing the issue is that it's not able to account for your opponent's range, and assumes they could have any 2 random cards for any situation, though I don't know the exact details of setting the AI up, so that may just be an assumption on my part.
@edwardwav 5 місяців тому
Worth the wait
@mattloulou123123 5 місяців тому ⁺¹
Lets go Steven!!
@stevengongg 5 місяців тому ⁺¹
my number 1 supporter ❤️
@RT-he6zf 5 місяців тому
Ive been getting crushed by these "AI`s" in online rooms for years
@D3MONFIEND Місяць тому
That why the paper he derived from say in the end, you still need human data to trained on. The ai self play expect the opposition to play optimal. But human played differently and each player has different behavior to exploit into or being exploit by.
@wesch6354 5 місяців тому ⁺³
You need to rework your AI. Facing a river jam with bottom pair is a call at most maybe 5% not 30%.
@OmateYayami 5 місяців тому
Not if you have infinite chips or games i.e. you can afford consequences of losing few times.
@cpatterson365 5 місяців тому ⁺⁵
@@OmateYayami I've been playing poker wrong all this time. Turns out I just need infinite money!
@OmateYayami 5 місяців тому
@@cpatterson365 you've been playing different poker. Not necessarily wrong. You manage risk differently depending on the relative bet, stack and pot sizes. The bot didn't. Look if you feel your chance of winning is 1 in 3 on a bluff, you won't go into it as likely with all-in as opposed to having 10 more games ahead.
If you have infinite chips or games there is no difference.
I'm just saying that calling vs folding being 5 or 30% is wrong. This percentage should change depending on the chips involved.
@NSelsted 5 місяців тому
@@OmateYayami theoretically the Martingale system works on roulette. The reason it does this, is because it assumes you have infinite money, and infinite bet sizes. For these things to have any relevance, you need to look at non infinite. And funny enough, as soon as you look at non infinite at the Martingale system, it theoretically always fails.
Creating a system with an assumption that is not real, will make a system that is not relatable. Even then, this bot would not stand a chance against a pro, even with infinite money and time.
@OmateYayami 5 місяців тому
@@NSelsted i might have oversimplified but you do not have infinite bet size here. There is always a limit. Despite the name "no limit" the pot limit is the sum of all stacks which is always finite. You can't always place a bigger bet and reduce game to one win need.
I have oversimplified allowing to argue that infinite aggression will always win in an invite game, but that was not the point. The point was that finite aggression was too much and unstable. With invite games it would have been alleviated but that was too broad relaxation, which you properly exploited.
I am not sure if a pro could reliably win against a properly written bot. I have my doubts.
@ElKolimar Місяць тому
Awesome! Congratulations on getting so far with the project! I find it very interesting how much you achieved without knowing anything about poker. I believe that, precisely because of your lack of knowledge about the game, you may have underestimated it in some way or expected positive results where they don't necessarily hold significant value.
First of all, playing GTO is the foundational study for any good poker player, but it’s not how they actually play. Why? Because they constantly rely on adaptations, with GTO being the pivot. For example, if you play against someone who goes All-In on every post-flop hand, you would likely need to start calling with a stronger range of hands. That’s how you would win, and it has less to do with strict GTO and more with adapting to the player's behavior and moves.
An interesting approach could be to implement a model similar to reinforcement learning or perhaps include more parameters that indicate the game context when training the model. The reality is that GTO play is exploitable; to be unexploitable, it would need to face another player who is also using GTO.
@KtoBylPierwszy 29 днів тому
What you wrote above is fundamentally untrue. The GTO paradigm assumes that this strategy cannot be exploited. By playing GTO, you guarantee that your opponent cannot gain any advantage over you. While GTO is a defensive strategy, in cases where your opponent does not play GTO, this strategy generates an automatic profit as a result of the opponent's suboptimal play. Deviating from GTO, meaning adapting to your opponent, serves to maximize EV (expected value). However, this always comes with the risk of opening yourself up to potential counter-exploitation.
@ElKolimar 28 днів тому
@@KtoBylPierwszy Hi! I understand your point, and I agree with the theoretical premise of GTO being unexploitable. That’s absolutely correct in a vacuum. However, my argument is more about the practical application of GTO in real games.
The reality is that while GTO guarantees no one can exploit you, it doesn’t inherently guarantee that you’ll maximize your Expected Value (EV) against non-GTO players. In real games, players deviate from GTO all the time, often significantly. By sticking strictly to GTO, you might leave money on the table against opponents who make obvious mistakes. That’s why strong poker players adapt to maximize EV in those situations, even if it means stepping away from GTO temporarily.
Of course, adaptation introduces the risk of counter-exploitation, as you pointed out, but the best players know how to manage that balance. GTO is an amazing baseline, a theoretical foundation, but it’s not always the most practical way to play in a dynamic and unpredictable environment.
@KtoBylPierwszy 28 днів тому
@@ElKolimar I'am a former professional.
As I mentioned, GTO is a defensive strategy, so it is obvious that it does not maximize EV against suboptimal oponents. The paradox of the situation is that there is no way to maximize EV in the long term without a very good understanding of GTO, because you need a reference point to do so-a point to which you can return if you are being exploited.
In practice, it is exactly the opposite of what you write: the higher the level, the smaller the edge; the smaller the edge, the pool must play closer to GTO.
@SuhendraLie 5 місяців тому ⁺²
I'm missing the point here but so this is a 16minutes video on reinventing GTO Wizard / PIO Solver?
@Gatorbait1869 5 місяців тому
This bot isn’t even close to as sophisticated as those softwares
@SuhendraLie 5 місяців тому
@@Gatorbait1869 I was kidding this is still a commendable endeavor.
@Menschnskind 9 днів тому
"2nd top pair" isnt a term and also isnt a semibluff
@aspzx 5 місяців тому
Great work. Loved the video. It seems like your friends worked out how to read the AI by the last round. When it checked after seeing each Ace and then the King, Brandon correctly guessed it had a bad hand. A human in the AI's position probably would have bluffed.
@BarvGwydh 5 місяців тому ⁺¹
If the AI is working correctly, this is similar to saying that you picked up on the strategy of a nash equilibrium bot in rock paper scissors - where the bot's strategy is to randomly pick each action 1/3 of the time.
@OmateYayami 5 місяців тому
I think it doesn't consider consequences enough. Fold Vs call or agressivness should probably be weighted against stack and pot size. You wouldn't call 5% chance hand because of a coin flip on an all-in, while you could on a small stake.
Bot went too aggressive on two occasions and lost.
Bot played just for a single game and didn't value in stack and pot sizes, did it?
@stevengongg 5 місяців тому ⁺¹
It's due to the fact that I abstract away bet sizes, so it sometimes sees raising and going all-in as essentially the same, which can lead to sub-optimal decisions at times
@OmateYayami 5 місяців тому
@@stevengongg Thanks for a fast reply. By "abstract away", do you mean completely ignore or substitute with a simpler idea?
To me it's not only a raise/call being mixed with all in. It is about how many games you will have left if you lose. Maybe that will make the bot more exploitable, because it would play less aggressively when low on chips but on the other hand it would have more games to come back.
I think the optimal strategy changes depending on chip state. Bet vs pot vs stack.
To me it seems the bot lost because it got heavily punished for taking big risks at high stakes and getting unlucky.
@justinlau9377 5 місяців тому
what a sick video!
@WowkoHrac Місяць тому
So the whole time i've been playing like AIs best performance?
@MrWhatsyourflavour 5 місяців тому
Nice work !
@nicksommer5916 2 місяці тому
I might be late to the party but you could make it easier on yourself by hard coding some preflop charts and only focusing on your postflop strategy.
@jttech44 5 місяців тому
Heads up all in calls need to be much more conservative. Really, overall, it needs to have less weight put on the opponent's bet if it's going to survive any real players at all.
Also understand that, poker solvers already exist, no need to reinvent the wheel there, all the AI needs to do is take a broader look at equity, not just a single momentary hand but the totality of hands, and what the pot values are because pot value has a whole lot to do with strategy. Think of a bet as buying a ticket to the lotto, and the cards in play determine the odds of winning. It appears to not be doing any of that.
Also understand that, data is available on how the professional players play certain hands, as is data on how people play online as well. That's a powerful tool that you don't have access to at the moment, but it's simple enough to get access, you just buy it. You could even have a way to have your system ingest that data, and a separate play mode for unknown players, such that you could eventually learn a given player and profile them.
Good luck.
@anotherperson76 5 місяців тому ⁺¹
That was unreal
@GutsFreedom 5 місяців тому ⁺¹
................................................................................ That is how I felt watching this video
@spacebomb9126 5 місяців тому
I agree.
They are teaching people how to scam people online in poker.
@andycai9866 5 місяців тому
Sick video!
@junaeddhiman9001 5 місяців тому
looks like the AI developed it's own ego😂

Наступне

Автоматичне відтворення

If You See Someone Do This, That's the FISH