Explaining the Kullback-Leibler divergence through secret codes
- Published 14 May 2018
- Explains the concept of the Kullback-Leibler (KL) divergence through a 'secret code' example. The KL divergence is a directional measure of separation between two distributions (although it is not a 'distance').
This video is part of a lecture course which closely follows the material covered in the book, "A Student's Guide to Bayesian Statistics", published by Sage, which is available to order on Amazon here: www.amazon.co.uk/Students-Gui...
For more information on all things Bayesian, have a look at: ben-lambert.com/bayesian/. The playlist for the lecture course is here: • A Student's Guide to B...
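The directional measure described above can be made concrete with a small numeric sketch. The distributions below, P = (1/2, 1/4, 1/4) and Q = (1/4, 1/2, 1/4) over three letters, are assumed for illustration (they match the numbers discussed in the comments, but the video's exact values may differ):

```python
import math

# Assumed example distributions over the letters a, b, c
P = {"a": 0.5, "b": 0.25, "c": 0.25}
Q = {"a": 0.25, "b": 0.5, "c": 0.25}

def kl_divergence(p, q):
    """D_KL(p || q) in bits: sum over x of p(x) * log2(p(x) / q(x))."""
    return sum(p[x] * math.log2(p[x] / q[x]) for x in p)

print(kl_divergence(P, Q))  # 0.25 bits
print(kl_divergence(Q, P))  # also 0.25 here, but in general the two directions differ
```

For this particular pair the two directions happen to coincide, but KL divergence is not symmetric in general, which is why it is a directional measure rather than a distance.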
I am ridiculously overjoyed to have found this remarkably clear & concise explanation of the Kullback-Leibler divergence, thanks Ben Lambert!
You are just too good. Every sentence conveys maximum information and a complete argument!! I have never heard even the best of professors be so precise and complete in their arguments!! 👍👍
This is the best explanation I've found yet. Thanks!
The codes you present are uniquely decodable but they are not instantaneously decodable. Would it not have been better to say, for example for P, c(a) = 0, c(b) = 10, c(c) = 11?
Yes, you are right. We are using the principle of entropy coding, but the main point is that the expected code length will be the same.
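The prefix-free (instantaneously decodable) property suggested in this thread can be checked mechanically. A minimal sketch, assuming the codebook c(a) = 0, c(b) = 10, c(c) = 11 proposed in the comment:

```python
def is_prefix_free(codes):
    """True if no codeword is a prefix of another, i.e. the code is
    instantaneously decodable."""
    words = list(codes.values())
    return not any(
        w1 != w2 and w2.startswith(w1) for w1 in words for w2 in words
    )

# Codebook proposed in the comment for language P
prefix_code = {"a": "0", "b": "10", "c": "11"}

# A uniquely decodable but NOT prefix-free alternative (hypothetical),
# where "0" is a prefix of "01"
non_prefix_code = {"a": "0", "b": "01", "c": "11"}

print(is_prefix_free(prefix_code))      # True
print(is_prefix_free(non_prefix_code))  # False
```

Both codebooks assign the same lengths (1, 2, 2 bits), which is why the expected length, and hence the KL calculation, is unaffected by the choice.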
Great explanation, thank you!
wow, wonderful explanation!
Thank you for explaining this :)
Fantastic. Great job
very insightful, cheers!
great tutorial! thanks
Amazing! Great explanation :-)
This is a great explanation. Thanks.
Thank you for this explanation! Could you do some about the Jensen Shannon Divergence and the relation of it with Mutual Information ?
Nice explanation! It's Kullback LEIbler divergence, though. Spelled with ei, not ie :)
The expected length for Q should use two times one quarter, rather than one half, which gives Q an expected length of three over two. It is the same for both languages, P and Q: three over two. All of this presents no particular divergence, which is an alternate application of the Kullback-Leibler divergence. Given such a symmetrical solution, we might have employed an equation to begin with.
Thanks for the nice tutorial. If my understanding is correct, you showed us two ways of calculating the information loss from P(X) to Q(X). But it seems that the second way is independent of L(X), since the length L is not used at all. Does that mean the encoding length does not affect the KL divergence, or is my understanding incorrect? Thanks a lot!
Nice observation. I think it is pre-assumed that P(x) is ideal for the coding, and if we use Q(x) for that coding we shall deviate by 1/4. So I think the explicit coding length is not required. I might be wrong, though.
What is the relationship between log_2(p(x)) and the encoding length of x? Intuitively, the higher p(x), the shorter the encoding length. How is this relation formulated concisely in mathematics?
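The relation asked about here is the Shannon code-length formula: for an optimal code, the ideal codeword length is L(x) = -log_2 p(x) bits, so more probable symbols get shorter codes, and the expected length equals the entropy H(P). A small sketch, assuming the example distribution P = (1/2, 1/4, 1/4):

```python
import math

P = {"a": 0.5, "b": 0.25, "c": 0.25}  # assumed example distribution

# Ideal codeword lengths: L(x) = -log2 p(x)
lengths = {x: -math.log2(p) for x, p in P.items()}
print(lengths)  # a -> 1 bit, b -> 2 bits, c -> 2 bits

# The expected codeword length equals the entropy H(P)
expected_length = sum(P[x] * lengths[x] for x in P)
print(expected_length)  # 1.5 bits
```

These lengths match the 1-, 2-, and 2-bit codewords discussed in the comments above.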
Mind blown!
Good explanation! But I think this is just information coding, not ciphering.
Isn't the example toooo specific, as the probabilities are very particular? Though the idea is fairly correct, and the two things will match if the number of letters is large instead of only 3.
Shouldn't the code for language P be a=0, b=10, c=11? And likewise for Q, a=10, b=0, c=11? That way if the first digit seen is 0, you know it is the commonest letter; if it is 1, you now select between the two rarer letters
you mean LEIbler
Did he not make a mistake? E[L(Q)|P] = 1/4 (not 1/2) × 2 + 1/2 × 1 + 1/4 × 2
P has ½ probability for letter a and Q has ¼ probability for letter a, and you are given Language P, so I believe the video is correct
This isn't really helpful... I still don't understand the secret code example and how the KL divergence relates to it.