Tutorial 23-Univariate, Bivariate and Multivariate Analysis- Part2 (EDA)-Data Science

Поділитися
Вставка
  • Опубліковано 7 жов 2024
  • If you are looking for Career Tansition Advice and Real Life Data Scientist Journey. Please check the below link
    Spring board India UA-cam url: / channel
    Please join as a member in my channel to get additional benefits like materials in Data Science, live streaming for Members and many more
    / @krishnaik06
    github url: github.com/kri...
    Connect with me here:
    Twitter: / krishnaik06
    Facebook: / krishnaik06
    instagram: / krishnaik06

КОМЕНТАРІ • 82

  • @himanshumangoli6708
    @himanshumangoli6708 3 роки тому +7

    ur teaching skills are damn good man keep it up man lots of respect

  • @SwavimanKumar
    @SwavimanKumar 4 роки тому +29

    One small correction. That Hue is pronounced "Hiu" instead of "Hui". You are making absolutely great content. Love them all. Keep growing. (Y)

  • @krishnasahoo4598
    @krishnasahoo4598 2 роки тому

    thank you so much for this..I dont know why I was unable to understand this concept. Thanks for this

  • @sengnawawnghkyeng9179
    @sengnawawnghkyeng9179 Рік тому

    The best explanation about these variates ...

  • @pbanerjee4008
    @pbanerjee4008 4 роки тому +1

    Great job. Your sincerity shows. Wonderful effort.

  • @sumantwankhede
    @sumantwankhede 2 роки тому +3

    Just one tiny correction for Univariate x label should be Sepal Length ...all other good ..Thanks Krish

  • @sunitapatil381
    @sunitapatil381 4 роки тому

    you are grate sir .i am really grateful to your vedios thank you thank you so much sir.

  • @AadityaAgarwal-qo1km
    @AadityaAgarwal-qo1km 4 місяці тому

    I love when krish calls Hue as Huiii

  • @Uma7473
    @Uma7473 4 роки тому +1

    Thank you

  • @wasimshaikh9147
    @wasimshaikh9147 3 роки тому +8

    X lab should have been 'Sepal length' instead of 'Petal Length'

    • @adis6867
      @adis6867 3 роки тому +2

      I came in comment box to check same

  • @kalyanipadaraju5715
    @kalyanipadaraju5715 4 роки тому

    Thank you so much sir . Great explanation

  • @The_Hive_Mind8878
    @The_Hive_Mind8878 3 роки тому

    Really helpful. Thanks

  • @SaurabhSaurabh-uh6eq
    @SaurabhSaurabh-uh6eq 3 роки тому

    Wow what a nice explaination! 👌 👋

  • @0Fallen0
    @0Fallen0 2 роки тому

    Another easy way to do the bivaruate plot at 11:20 is sns.scatterplot(df['sepal_length'],df['sepal_width'],hue=df['species'])

  • @venkateshbb2926
    @venkateshbb2926 4 роки тому

    Thanks for tutorial.Please arrange tutorials in proper sequential of related tutorials.

  • @ashita1130
    @ashita1130 4 роки тому +1

    Pretty badass :) Thanks!

  • @nabiltech1366
    @nabiltech1366 3 роки тому

    So here are objective u can obtained by using this statistical method,
    1)Which features have good impact for ur model
    2)Which type of algorithms u should choos

  • @adarshpatodi5300
    @adarshpatodi5300 4 роки тому

    You need to have x label as sepal length in univariate analysis.

  • @gokulansundaraj8149
    @gokulansundaraj8149 2 роки тому

    Wow...

  • @rohithmn3378
    @rohithmn3378 4 роки тому +1

    Thanks for the excellent tutorial..!
    But this works well for classification problems. How shall we perform the similar analysis for Regression problem..!?

  • @piushsingh6066
    @piushsingh6066 4 роки тому +6

    univariate, bivariate and multivariate analysis should be done before data prep-processing or after......Please Reply...

  • @manishshukla125
    @manishshukla125 4 роки тому

    Thanks Sir!

  • @kamran_desu
    @kamran_desu 3 роки тому

    Interesting method to plot univariate, I generally create scatterplots to make similar deductions in terms of what kind of classifier will make sense.
    Here's some sample code:
    import matplotlib.pyplot as plt
    from sklearn import datasets
    iris = datasets.load_iris()
    X = iris.data
    y = iris.target
    F = iris.feature_names
    fig, ax = plt.subplots(1, len(F), figsize=(15,2))
    for i,f in enumerate(F):
    ax[i].scatter(X[:,i],y, c=y)
    ax[i].set(xlabel=f)
    ax[i].get_yaxis().set_visible(False)

  • @ratheesh_tabla
    @ratheesh_tabla 3 роки тому +3

    May be I am wrong, should that be "sepal length" instead of "petal length" in xlabel? based on your plot variables or feature used for univariate analysis

    • @pulkitarora6605
      @pulkitarora6605 3 роки тому +1

      ya its sepal length may be there is some mistake

  • @alokranjanthakur5746
    @alokranjanthakur5746 4 роки тому

    Sir can you make. Video on EDA only using python. Means what are necessary steps in EDA

  • @marioluoni3899
    @marioluoni3899 4 роки тому

    In the uni-variate analysis, why do you put all data points on the same level? By putting them onto different levels, e.g. by setting np.zeros_like()+0, np.zeros_like()+1 and np.zeros_like()+2, it will be very clear that these 3 data sets overlap very heavily as opposed to what you say @9:00 (unless I have misunderstood what you said there). Otherwise great lectures, thanks a lot!

  • @ashukol
    @ashukol Рік тому

    Line 17th code needs modification as follows:
    sns.FacetGrid(df,hue="species").map(plt.scatter,"petal_length","sepal_width").add_legend();
    plt.show()

  • @birinaboro8391
    @birinaboro8391 4 роки тому +2

    Hello Sir, could you please help me out with multivariate correlation through SPSS??

  • @sohamsarkar5255
    @sohamsarkar5255 3 роки тому

    Isn't multivariant analysis a consolidated representation of bivariant analysis, where all possible combinations of bivariant analysis are represented together?

  • @krishnakanthbandaru9308
    @krishnakanthbandaru9308 4 роки тому +3

    Hi I have a doubt these plots are ok for small datasets and interesting while learning but is these graphs helps when handling real time data or while working with real data science projects.

  • @mitrabhanuroutkali
    @mitrabhanuroutkali 4 роки тому +1

    Use DataExplorer package in r

  • @zainabzafari2336
    @zainabzafari2336 8 місяців тому

    Thank you very much for your great videos.
    However, this is the first video of your playlist that I could not understand. The dataset was not clear and you did not explained much.

  • @nijalparmar5951
    @nijalparmar5951 4 роки тому

    Sir can u plz make one video with use of spss and univariate, bivariates and multivariate analysis

    • @aination7302
      @aination7302 4 роки тому

      Just use the graph node and plot your histograms and scatter plots for all the variables you require.

  • @makanjudavid992
    @makanjudavid992 Рік тому

    Question: it is possible to use categorical features to make predictions for a numerical targer variable ??

  • @d39-nischithhegde65
    @d39-nischithhegde65 7 місяців тому

    can you also include link to dataset used

  • @yugen3968
    @yugen3968 3 роки тому

    Why not just plot histograms for every feature for univariate analyis?

  • @tanujsharma5492
    @tanujsharma5492 2 роки тому +1

    sir i think there is 'sepal length' instead of 'petal length' in xlabel. am i wrong or right??

  • @pritamchowdhury3152
    @pritamchowdhury3152 3 роки тому

    sir can you provide some practice dataset

  • @erumalibhatti1218
    @erumalibhatti1218 2 роки тому

    Hi krosh what will be the codes for R for same analysis??

  • @simanchalpatnaik2566
    @simanchalpatnaik2566 4 роки тому +3

    Hi Krish, Why you are keeping the Y-axis as 0. In the previous lecture also it's not explained. In graph you just kept it as 0.
    Please reply.

    • @HimanshuYadav-re7cp
      @HimanshuYadav-re7cp 4 роки тому +1

      hey , he's just trying to visualize the dependency of output feature on that particular feature i.e. "petal_width" .so there is no need for y axis if u want u can put x =0 , and plot it on y axis and we endup with a vertical stack :)

  • @ramthiagu2330
    @ramthiagu2330 3 роки тому

    if we have more than 10 or 20 features, how can we do multivariate analysis. will it be visible clearly in pairplot

  • @rahuldas6777
    @rahuldas6777 2 роки тому

    why put semicolons after your lines of code?

  • @vatsalshingala3225
    @vatsalshingala3225 Рік тому

    ❤❤❤❤❤❤❤❤❤❤

  • @hepengye4239
    @hepengye4239 3 роки тому

    Are those 4 plots along with the diagonal density plots?

  • @MageDigest
    @MageDigest 3 роки тому

    so from multivariate if we some graphs with overlapping variables like sepal length and sepal width, we can ignore one of them while doing any further analysis ? Please help here

  • @siddharthjain4361
    @siddharthjain4361 2 роки тому

    what if we have dimension in order of 100s...??

  • @souravsaha7751
    @souravsaha7751 2 роки тому

    sir a virginica or versicolor kaya ha

  • @Pankaj.6658
    @Pankaj.6658 8 місяців тому

    sir, what is web address you are using and is it free or paid please give some details about that also.

  • @toppoashish7
    @toppoashish7 3 роки тому

    How to do eda when we have many features, say 20+ and all are non correlated.

  • @Gamer_hai_hum
    @Gamer_hai_hum 3 роки тому

    Hello sir huge fan following ur ML playlist and I'm getting error in stringIO sir I also saw youtube video but I'm not able to slove the error it say No module something can u please guide me I'm stuck in your 7th playlist pls let me know sir it will be helpful

  • @hyhyb
    @hyhyb 4 роки тому

    How orange , green colours came into picture, coz we didn't mention any color parameters like palette, colour?

    • @urvishmahajan
      @urvishmahajan 4 роки тому

      Colors are automatically assigned if you don't mention them in the parameters

  • @niraliborad7475
    @niraliborad7475 Рік тому

    After executing the same code for univariate analysis my output is not color distributed as shown in video. can anyone help

  • @mlwithstats1703
    @mlwithstats1703 3 роки тому

    Sir how we can the data ???

  • @tejassutar4198
    @tejassutar4198 4 роки тому

    Hello sir how to know categories of given data in python? For eg. Here We want to know species categories?

    • @viveksingh881
      @viveksingh881 3 роки тому

      if u r talking about getting the unique values in species then following code will help:-
      for unique numbers of species - iris_data['Species'].nunique()
      for names of those unique species - iris_data['Species'].unique()

  • @adarshtiwari6742
    @adarshtiwari6742 4 роки тому

    Sir how much is necessary to know to get job in data science (is there any bounds)

    • @ClickyKitsune
      @ClickyKitsune 4 роки тому

      My personal recommendation would be to start with python , basics of SQL and couple of ML algorithms i.e regression. It all comes to how many projects you have actually created..good luck 👍

  • @anandacharya9919
    @anandacharya9919 4 роки тому

    When I import iris in python , no commands is working I am getting error as "AttributeError: info" , and also "AttributeError: describe" , please solve this, why I am getting this error

  • @SATISHKUMAR-bj2kl
    @SATISHKUMAR-bj2kl 3 роки тому

    sir evertime whenever i am running code then also error messege comes with "name df is not defined" can you please help me

  • @ankita684
    @ankita684 4 роки тому

    Hi Krish...when I am executing this code 'plt.plot(df_setosa['Sepal.Length'],np.zeros(df_setosa['Sepal.Length']),'o') it is returning a value error that reads as 'sequence too large; cannot be greater than 32'. How did you execute without getting this error. How to resolve?

    • @toyazpandey8669
      @toyazpandey8669 4 роки тому

      U haven't written like after np. Zeros_like

  • @dharmatejaadepu8597
    @dharmatejaadepu8597 4 роки тому +1

    In univariate analysis, you have taken sepal length and labelled it as petal length , can you explain me about that.

  • @ashishkumarsingh2910
    @ashishkumarsingh2910 4 роки тому

    how you are calling a url or internet file to read in pandas..... its like impossible for me to do... plztellme how?

  • @vishalrai2859
    @vishalrai2859 3 роки тому

    coaching institutes just looted me
    taught nothing like this

  • @re-cordinglyf7176
    @re-cordinglyf7176 2 роки тому

    I can't believe you pronounced it as hueee....😂😂

  • @shaminmohammed672
    @shaminmohammed672 3 роки тому

    Thank you