Linear discriminant analysis (LDA) - simply explained

TileStats

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 28 гру 2024

КОМЕНТАРІ • 57

@hopelesssuprem1867 Рік тому ⁺⁴
the best explanation in all internet. Thank you a lot
@nikeforo2612 3 роки тому ⁺⁹
Thanks for sharing. This is amazingly crafted and easy to follow. I finally understood the math behind LDA. Brilliant!
@tilestats 3 роки тому
Thank you! That's great to hear.
@快來快 11 місяців тому ⁺²
I like your video always with subtitles so i can better understand what are you saying
@ratnakarbachu2954 3 роки тому ⁺¹
really you are genius and your help always remembered until my last breath.
@tilestats 3 роки тому
Thanks for your comment!
@finansborsakanali 11 днів тому
Hello Sir. Thanks for the video. at "18:12" we get LD1 from first eigenvector. Does Ld2 come from second eigenvector? Thanks for your answer in advance..
@tilestats 10 днів тому
Yes
@gwendolyneortiz4152 11 місяців тому
I went to the MANOVA video, and it said I needed to first understand the LDA video so now I am here, and now it says I have to go to the PCA video hehe
@KayYesYouTuber Рік тому ⁺¹
Beautiful explanation. Thanks
@44hyt Рік тому ⁺²
Excellent video!!!
@maydin34 2 роки тому ⁺²
Great explanation.
@sergeypigida4834 2 роки тому
Could you please check if the legend colors for bacterial and viral are correct on the figures (e.g. at 6:21)? With scikit-learn I get low values for bacterial and high values for viral data transformed samples. Therefore, viral data points, transformed with LDA, should be above the bacterial ones.
@tilestats 2 роки тому
What values did you get for alpha 1 and alpha 2, using scikit?
@sergeypigida4834 2 роки тому
@@tilestats [-0.3627029 -2.3969302]
import pandas as pd
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
import matplotlib.pyplot as plt
data = pd.DataFrame({
"infection":["Viral","Viral","Viral","Viral","Viral","Viral",
"Bacterial","Bacterial","Bacterial","Bacterial","Bacterial","Bacterial"],
"crp":[40.,11.1,30.,21.4,10.7,3.4,42.0,31.1,50.,60.4,45.7,17.3],
"temp":[36.,37.2,36.5,39.4,39.6,40.7,37.6,42.2,38.5,39.4,38.6,42.7]})
X, y = data[["crp","temp"]], data["infection"].ravel()
lda = LinearDiscriminantAnalysis(n_components=1, store_covariance=True)
X_t = lda.fit(X, y).transform(X)
colors = ["red", "green"]
plt.figure()
for color, cls in zip(colors, lda.classes_):
plt.scatter(["LDA"]*len(X_t[y==cls]), X_t[y==cls], alpha=0.8, color=color, label=cls)
plt.legend()
plt.show()
@sergeypigida4834 2 роки тому
@@tilestats Hi Andreas, I managed to get the same picture as you. To do this I used the eigen solver (by default sklearn uses singular value decomposition).
lda = LinearDiscriminantAnalysis(solver="eigen", shrinkage=0.003, n_components=1, store_covariance=True)
Thank you for your
@tilestats 2 роки тому
Great!
@danialb9894 Рік тому ⁺¹
Thank you for the video. Could you please make an extended video for LDA? I mean videos like what you did for PCA
@tilestats Рік тому
Have you seen this one?
ua-cam.com/video/9SuIE_tGF-g/v-deo.html
@mrgomelonsolaris 2 роки тому ⁺¹
Very clear explanation!
@saifh.al-nimer2887 2 роки тому ⁺²
Thank you so much for this very useful video.
@tilestats 2 роки тому
Thank you!
@ig55549 2 роки тому ⁺²
Thanks! This video is so helpful! Coffee on me :)
@tilestats 2 роки тому
Thank you!
@rekhapriya2178 Рік тому
Your videos have been amazing, can you please upload content related to Bayesian analysis
@iwwyl847 3 роки тому
Hello! great video, however, I have a question/need clarification: is LD1 found at 18:19 the actual line, on which when projected upon best seperates the two classes? And then the following calculations in the video is the data being projected on the line, or is this line found somewhere else? So to clarify; im looking specifically for the line that best seperates the two classes
@tilestats 3 роки тому
The discriminant axis is given by the first eigenvector. If you like to plot the line on the centered data (unrotated data), plot the first eigenvector (0.15 0.989 or 0.11 0.7, both will give the same slope/direction of the line). Data points should project on this line.
@spp626 10 місяців тому
Hello sir, can you please explain why first eigen vector is considered for LDA?
@hongdao9349 5 місяців тому
How to calculate score from to matrix
@silesoul Рік тому
Hello, thank you for the excellent explanation! So a variable contributes more to the groups' separation when it has a high weight. How do we interpret the negative weights of the variables (in LDA)?
@tilestats Рік тому
Hi
Use the absolute values for interpretation, which means that values far away from zero are important for separation.
@silesoul Рік тому
@@tilestats Thank you !
@MariaMartinezGarcia-ky6ru Рік тому
I greatly appreciate the video. I only have one question: Does this linear discriminant analysis approach rely on the Bayesian, Fisher or some other approach?
@tilestats Рік тому
Well, the original discriminant analysis method was developed by Fisher, but note that LDA is not used to make any inference about a population (no p-value is calculated).
@MariaMartinezGarcia-ky6ru Рік тому
@@tilestats Thank you for your response. That being said, I would like to know to which specific category or approach the method presented in the video could be associated. This is considering that its primary function is to classify known data, without conducting broader inferential analyses. Could you provide additional information on how this particular approach would be classified?
@tilestats Рік тому
I would say that it is a classification method. Have a look at this video where I group LDA with other methods.
ua-cam.com/video/XMZ_0aloVAc/v-deo.html
@ritiksuri7248 6 місяців тому
Can you please provide the dataset on which you worked
@tilestats 6 місяців тому
The data set is the same as shown in the video.
@jimjohnson357 3 роки тому
In your calculation of matrix W, when the sizes of the classes or groups is UNEQUAL, what are the variables n_1 and n_2? (i.e. in the equation shown at 15:43). Also, thanks so much for these videos (and the PCA ones). Well explained, with good examples and you did it in half the time everyone else takes!
@tilestats 3 роки тому
Thank you! n1 and n2 are the sample sizes of group 1 (group A) and group 2 (group B). In this example, you should set n1=6 and n2=6. Note that you should use the same formula at 19:22 if you like to normalize the weights when you have an unequal sample size.
@jimjohnson357 3 роки тому
@@tilestats Thanks very much... you guessed what my second question was going to be and answered it too! Looking forward to working my way through all your videos when I have some extra time.
@NN-if9lr 3 роки тому
@@tilestats what if we have 3 or more classes? should the denumerator still n1+n2 - 2? or it change to n1+n2 - 3 and so on?
@tilestats 3 роки тому
@@NN-if9lr Yes but n1+n2+n3-3
@NN-if9lr 3 роки тому
@@tilestats oh i forgot the n3, thank you for the reply.
@paolopanlaqui9887 3 роки тому
Hello! I would like to ask what the null hypothesis for LDA is or DA (Discrimination Analysis) in general? And also if you would know what the hypothesis is when MANOVA and DA together is used?
@tilestats 3 роки тому
LDA or DA has no null hypothesis because it does not compute any p-value, in comparison to MANOVA. When you use LDA you can use some sort of validation method to check its performance. Have you seen my second video about LDA and the video about MANOVA?
@ahmadalmomani1553 2 роки тому
Thank you very much for these beneficial videos. However, you mentioned that you used software to calculate the Eigenvectors. Please recommend which software to use (the simpler, the better)?
Again, thanks a lot for your videos
@tilestats 2 роки тому
Hi
I use R, which is free! You can compute the eigenvectors in R by:
mat=matrix(c(2.05,14.15,0.16,0.96),2,2)
eigen(mat)
Note that I also have videos on how to understand and calculate the eigenvectors by hand:
ua-cam.com/video/9CT0jnem4vM/v-deo.html
ua-cam.com/video/JtcNe--fsyA/v-deo.html
@upanshisharma2313 2 роки тому
hey can you tell me how you calculated 0.11 and 0.70?
@andreasjansson2366 2 роки тому
It is explained later on in the video…
@azizullah6360 3 роки тому
how you computed alpha1 and alpha2?
@tilestats 3 роки тому
That is explained from 14:40 and forward.
@sunitharamcse 3 роки тому
How you assign alpha1 and alpha2 value?
@tilestats 3 роки тому
Do you mean how they are calculated? That is explained from 11:30.
@azizullah6360 3 роки тому
how to calculate covariance matrix of W?
@tilestats 3 роки тому
Can you refer to the time point in the video?
@morecharacterswithamix9067 2 роки тому
The visuals and the commentary are not in sync.

Наступне

Автоматичне відтворення

MANOVA - explained with a simple example