Kullback-Leibler divergence (KL divergence) intuitions

  • Published 29 Sep 2024

COMMENTS • 11

  • @cabbagecat9612
    @cabbagecat9612  1 year ago

    demo code: github.com/szhaovas/blog-ytb/blob/master/NES/kl_demo.py

  • @alexkelly757
    @alexkelly757 3 months ago

    Thank you. This really helped.

  • @blackkyurem7166
    @blackkyurem7166 3 months ago

    What do you mean by the statement that the “positive and negative log ratios will cancel each other out?”
    Attempting to verify this, suppose we have X∈{1, 2, 3, 4} and two simple PMFs:
    - P(X), with probabilities 0.1, 0.2, 0.3, and 0.4 respectively
    - Q(X), with probabilities 0.25, 0.25, 0.25, and 0.25 respectively
    But ln(0.1/0.25) + ln(0.2/0.25) + ln(0.3/0.25) + ln(0.4/0.25) = -0.487109, not 0. Perhaps I'm doing something wrong or misinterpreting the video, but I don't see why this should be true.

    • @cabbagecat9612
      @cabbagecat9612  3 months ago

      Since the area under the curve of a PDF is 1, if P(x1) is very large at some point x1, then the values at other points, P(x2), P(x3), etc., have to be smaller so that the total area under the curve does not exceed 1. So if P(x1) > Q(x1), there must be other points xi where P(xi) < Q(xi), and the positive log-ratio at x1 will be cancelled out by the negative log-ratios at those xi (I write xi because there can be an arbitrary number of such points).
      To be honest, I haven't proven whether the positive and negative log-ratios cancel out "exactly" to 0, but they definitely cancel to some extent, which is not what we want here: points where P(xi) > Q(xi) and points where P(xj) < Q(xj) should both contribute towards a larger divergence between P and Q, not work against each other.
      My math is a bit rusty, but here's a sketch of the proof idea:
      - First of all, you would need to integrate log(P(x)/Q(x)) over x from -inf to +inf. (You can't pick specific x values here, because you don't know in advance where P(x) or Q(x) is large.)
      - log(P(x) / Q(x)) = log(P(x)) - log(Q(x))
      - Then the integral splits into something like ∫ log(P(x)) dx - ∫ log(Q(x)) dx.
      - Both ∫ P(x) dx and ∫ Q(x) dx would be 1, because the area under the curve of a PDF is 1. I'm not sure exactly how taking the log changes things (log(P(x)) is no longer a bell curve, so its integral isn't simply 1), but the point is that whatever ∫ log(P(x)) dx - ∫ log(Q(x)) dx works out to, it doesn't track the actual "distance" between the distributions P and Q, so on its own it can't serve as a divergence.
      Hope this helps.
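
For the discrete example in the question above, here is a quick numeric check (a sketch of my own, not the linked demo code). It computes both the raw, unweighted sum of log ratios that the question evaluates and the KL divergence proper, which weights each log ratio by P(x) and therefore comes out non-negative:

    # Quick check of the numbers in the question above.
    import math

    p = [0.1, 0.2, 0.3, 0.4]      # P(X)
    q = [0.25, 0.25, 0.25, 0.25]  # Q(X)

    raw_sum = sum(math.log(pi / qi) for pi, qi in zip(p, q))   # what the question computed
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))   # KL(P || Q)

    print(f"unweighted sum of log ratios: {raw_sum:.6f}")  # about -0.487109, as in the question
    print(f"KL(P || Q):                   {kl:.6f}")       # about  0.106440, always >= 0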
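
For the continuous case discussed in the reply, here is a rough numerical sketch of my own (not the video's code; the helper gaussian_pdf is hypothetical). It uses two equal-variance Gaussians: on a grid symmetric about the midpoint of the two means, the unweighted integral of log(p/q) cancels to roughly 0 no matter how far apart the means are, while the p-weighted integral, which is the KL divergence, grows with the gap:

    # Two equal-variance Gaussians: unweighted vs. p-weighted integral of log(p/q).
    import numpy as np

    def gaussian_pdf(x, mu, sigma):
        # hypothetical helper, not part of the video's demo code
        return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

    sigma = 1.0
    for gap in [0.5, 1.0, 2.0]:
        mu_p, mu_q = -gap / 2, gap / 2
        x = np.linspace(-10.0, 10.0, 200_001)      # grid symmetric about the midpoint (here 0)
        dx = x[1] - x[0]
        p = gaussian_pdf(x, mu_p, sigma)
        q = gaussian_pdf(x, mu_q, sigma)
        log_ratio = np.log(p) - np.log(q)
        unweighted = np.sum(log_ratio) * dx        # integral of log(p/q) dx, cancels to ~0
        kl = np.sum(p * log_ratio) * dx            # integral of p(x) log(p/q) dx
        closed_form = gap ** 2 / (2 * sigma ** 2)  # known KL for equal-variance Gaussians
        print(f"gap={gap}: unweighted ~ {unweighted:+.4f}, KL ~ {kl:.4f}, closed form = {closed_form:.4f}")

In other words, the plain log-ratios really do cancel in this setting, and it is the weighting by p(x) in the KL definition that turns them into a meaningful measure of distance.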

  • @KianSartipzadeh
    @KianSartipzadeh 6 months ago

    Thank you for making a very intuitive video about the KL divergence 🙏

  • @priyankjain9970
    @priyankjain9970 10 months ago

    This is probably the best and simplest explanation. Thanks @CabbageCat for the video
    👍

  • @Blu3B33r
    @Blu3B33r 8 months ago

    Amazing explanation and the code is such a smart idea
    Thank you for sharing 🙏

  • @sunasheerbhattacharjee4760
    @sunasheerbhattacharjee4760 8 months ago

    I think the points on the PDF curves are not probability values, since for a continuous random variable the probability at any single point is 0. It is the integral between two points that gives a probability. Hence, when you integrate over the whole range, the area under the curve comes out to 1 (a probability cannot exceed 1).

    • @cabbagecat9612
      @cabbagecat9612  8 months ago

      You are right, it was a sloppy use of terms. Should have been probability density.

    • @sunasheerbhattacharjee4760
      @sunasheerbhattacharjee4760 8 months ago +1

      @cabbagecat9612 Nonetheless, it was a great effort explaining the concept
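
On the density-vs-probability point above, here is a small sketch of my own (not from the video; gaussian_pdf is a hypothetical helper) showing that the values read off a PDF can exceed 1, while probabilities, which come from integrating the density, never do:

    # A narrow Gaussian: pointwise density values exceed 1, yet the total area is 1.
    import numpy as np

    def gaussian_pdf(x, mu, sigma):
        # hypothetical helper, not part of the video's demo code
        return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

    sigma = 0.1
    x = np.linspace(-5.0, 5.0, 1_000_001)
    dx = x[1] - x[0]
    p = gaussian_pdf(x, 0.0, sigma)

    print(f"peak density p(0):      {p.max():.3f}")                          # about 3.989, > 1
    print(f"total area under curve: {np.sum(p) * dx:.6f}")                   # about 1.0
    print(f"P(-0.1 < X < 0.1):      {np.sum(p[np.abs(x) < 0.1]) * dx:.4f}")  # about 0.68, an actual probability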

  • @TongLi-g2f
    @TongLi-g2f 9 months ago

    thank you so much :)