Thompson sampling, one armed bandits, and the Beta distribution
Вставка
- Опубліковано 5 лип 2021
- Thompson sampling is a strategy to explore a space while exploiting the wins. In this video we see an application to winning at a game of one-armed bandits.
Beta distributions video: • The Beta distribution ...
Tom Denton blog: inventingsituations.net/
Icons made by Freepik from www.flaticon.com
Announcement: Book by Luis Serrano! Grokking Machine Learning. bit.ly/grokkingML
40% discount code: serranoyt - Наука та технологія
That's the clearest and easiest to understand explanation of Thompson sampling I've ever seen! Thank you!
A real talent to explain complex math in simple and clear terms. Excellent use of graphics and animation.
Such an informative video. All these complex concepts are cleared so nicely and simply.
You are the best teacher always, Dr. Serrano. Thanks for the upload.
“Best” (most intuitive, concise, and practical) explanation of both Thompson sampling and OAB I’ve found. Rivtik should be jealous.
EXCELLENT Video! And one of the few that explain the Bandit problem clearly and succinctly with Thompson sampling.
another great video, really appreciate the simplicity and knowledge in it
Because there's no Chinese subtitle in the video so I would like use English to reply the comment。 the video is cool and easy to understand and I understand it and benefit from it lot. Thank you, thank you very much
Thank you for this video. Very informative as always !
Thank u so much sir... Want more videos on machine learning and deep learning topics
Amazing explaination!
Nice video, I learned something today.
Very well done!
Great video ☺️
Great video.
Happy Teachers day sir
Love from India.
🙂
8:23 an error in the third graph when adding the third trial. The graph should move skew to the right instead of left.
You’re right, thank you for the correction!
9:17 you say "picking a random point from the distribution" which makes it seem like the x coordinate is randomly chosen. I think it's much clearer to say draw a random sample from the distribution.
Hi Luis, I have a doubt, at 8:46 how did M2, M3 and M4 achieved a Right Skewed, Left Skewed and Right Skewed curves respectively
How doe Thompson sampling ensure we choose the unexplored machine? Did you make that clear in the video explanation?
Awesome
❤️
5:52 kindly distinguish between likelihood and probability,i am a little lost here