Exploratory Data Analysis in R: Towards Data Understanding

Поділитися
Вставка
  • Опубліковано 21 січ 2025

КОМЕНТАРІ • 59

  • @DataProfessor
    @DataProfessor  5 років тому +10

    QUESTION OF THE DAY: How do you use R to perform "Exploratory Data Analysis"? What R functions or packages?

    • @gerardcanetfajas9048
      @gerardcanetfajas9048 4 роки тому +2

      Well, there is package 'FactoMineR', pretty usefull.

    • @stretch8390
      @stretch8390 4 роки тому

      I feel like ggplot2 is an easy go to for EDA in R. And now having watched the video I'd say view(), head( , ), tail( , ), summary(), and sum(is.na()) are a good place to start with skimr package skim() function being my favourite from this video.

    • @9226448708
      @9226448708 3 роки тому

      We can also use "DataExplorer" package and can gain all the insights regarding data.

  • @mack.cheerdesk
    @mack.cheerdesk 4 роки тому +3

    ขอบคุณอาจารย์ที่ทำ content อย่างนี้ครับ your speciality in R with Bioinformatics is very helpful for my case as I am interning at Johns Hopkins University where they mainly use R programming. A lot of contents on youtube are in python and yours truly help as you have both. Greatly appreciated and thank you krup!

    • @DataProfessor
      @DataProfessor  4 роки тому

      Waz, thanks krub for your kind comments! It is certainly a pleasure that you find the contents of this channel helpful.
      Earlier today I just released a new tutorial video on using Python for Computational Drug Discovery ua-cam.com/video/VXFFHHoE1wk/v-deo.html

  • @josephjoestar995
    @josephjoestar995 3 роки тому +1

    What a legend man! Thank you for sharing your knowledge, going to follow this R project series - dropped a sub 😄

  • @FrankMLoSchiavo
    @FrankMLoSchiavo Рік тому

    When I try Method 2 using get URL, I get the error below. Any suggestions? Thx.
    Error in function (type, msg, asError = TRUE) :
    SSL certificate problem: certificate has expired

  • @MrKalin7
    @MrKalin7 5 років тому +3

    Thank you for the video! I found it very intresting, especially skimr package.

    • @DataProfessor
      @DataProfessor  5 років тому +1

      Thanks Kalin for your comment. 😄

  • @atiqahshafie7466
    @atiqahshafie7466 3 роки тому +1

    Thank you, professor, it is really helped me with my project

  • @claudiocrespo4703
    @claudiocrespo4703 Рік тому

    Hi, I have problems with function skim(): Error in base::nchar(wide_chars$test, type = "width") :
    lazy-load database '/Library/Frameworks/R.framework/Versions/3.6/Resources/library/cli/R/sysdata.rdb' is corrupt
    In addition: Warning messages:
    1: In base::nchar(wide_chars$test, type = "width") :
    restarting interrupted promise evaluation
    2: In base::nchar(wide_chars$test, type = "width") :
    internal error -3 in R_decompress1.
    ¿Could you advice me, please?

  • @sp-francinagoh2313
    @sp-francinagoh2313 4 роки тому +1

    Thank you. Your explanation is very clear

    • @DataProfessor
      @DataProfessor  4 роки тому

      SP-Francina GOH Thank you so much for the kind words 😊

  • @ekolenelsonbetobe4711
    @ekolenelsonbetobe4711 4 роки тому +1

    thanks for the explicit video,

  • @nebakyon811
    @nebakyon811 4 роки тому +2

    You know this job. Nice content.

  • @abhipsatripathy3934
    @abhipsatripathy3934 4 роки тому

    Prof. my skim(iris ) is not working even if after installing the package of skimr. It shows skimr package successfully unpacked but when i run the command it says skim not found.
    I run the code.
    library(dplyr)
    iris%>%
    dplyr::group_by(species)%>%
    skim()
    but the output is...
    iris%>%
    + dplyr::group_by(species)%>%
    + skim()
    Error in get(nm, envir = fn, mode = "function") :
    object 'skim' of mode 'function' was not found

  • @SneakerFreaker97
    @SneakerFreaker97 4 роки тому

    I get this error when executing the dplyr function: Error in UseMethod("group_by_") :
    no applicable method for 'group_by_' applied to an object of class "factor"

  • @antonioverissimo5162
    @antonioverissimo5162 4 роки тому +2

    Thank you. Can we use the R -version 4.0 for this tutturials??

    • @DataProfessor
      @DataProfessor  4 роки тому +1

      Hi, I haven't tested this in version 4 yet.

  • @NewNew-y7s
    @NewNew-y7s 3 місяці тому

    How can I download the dataset you used ???

  • @leylamerlo4999
    @leylamerlo4999 3 роки тому +1

    Thank you! Your videos are super helpful :)

  • @siamaksiamak5583
    @siamaksiamak5583 4 роки тому +1

    Hi, thanks for video. you mentioned at the beginning of video about a link to 6 steps... where is the link?

    • @DataProfessor
      @DataProfessor  4 роки тому

      Thanks Siamak for pointing this out, I've added the link in the description. Links to videos in Data Science 101:
      bit.ly/dataprofessor-ds101

  • @chiefmyroncappermoon6135
    @chiefmyroncappermoon6135 2 роки тому

    Is the 101 video you're talking about supposed to be in the "R Data Science Project" library?
    I don't see it :(

  • @gbubemierics1510
    @gbubemierics1510 2 роки тому

    Thank you data doctor for the detailed explanations. On my Rstudio i keep getting error messages when running this code
    iris %%
    dplyr::group_by(species) %%
    skim() - (Error in UseMethod("group_by") :
    no applicable method for 'group_by' applied to an object of class "factor")...please what am i doing wrong?

  • @fahadreda3060
    @fahadreda3060 4 роки тому +1

    Great Video, Thanks

  • @desmondojei3868
    @desmondojei3868 4 роки тому +1

    Amazing video.. Thanks alot

  • @gohanlopez5330
    @gohanlopez5330 2 роки тому

    I am still struggling with my dataset, could we meet over zoom?

  • @jisaelza
    @jisaelza 4 роки тому

    Hi, could you please share the link for the next video ? Thank you so much for this tutorial

  • @mandarvaidya7947
    @mandarvaidya7947 4 роки тому +1

    I am doing a masters degree in data science adn in my final year we have an elective of bioinformatics.Do I need to have an understanding of biology to study bioinformatics .I am from tech background and do not know much about biology except some high school knowledge

    • @DataProfessor
      @DataProfessor  4 роки тому +1

      All you need to get started is high school biology, the rest you can read up on when needed. The hardest part of bioinformatics is the computational proficiency, although the biology is important at the mode interpretation phase but to get started computational proficiency can go a long way. Have fun exploring this exciting field, please check out the Bioinformatics playlist I’ve created at bit.ly/dataprofessor-bioinformatics

  • @dresthipaliwal9455
    @dresthipaliwal9455 3 роки тому

    hello sir i got an assignment can you please help me with the same

  • @kisakyeangela9416
    @kisakyeangela9416 4 роки тому +1

    Could you kindly do a video about Extreme Value theory especially for Peak Over threshold and Annual Maximum

    • @DataProfessor
      @DataProfessor  4 роки тому +1

      Hi Kisakye, thanks for the suggestion, I came across this book chapter that covers the topic, please have a look here link.springer.com/chapter/10.1007/978-3-030-28669-9_3

    • @kisakyeangela9416
      @kisakyeangela9416 4 роки тому

      Data Professor thank you

    • @kisakyeangela9416
      @kisakyeangela9416 4 роки тому

      Data Professor Is there one with R codes someone can follow through while using Rstudio

  • @shehabberam78
    @shehabberam78 3 роки тому +2

    Hi professor thanks for this amazing content
    Do you have any similar projects but using python ?

    • @DataProfessor
      @DataProfessor  3 роки тому +2

      Yes, here it is the Python EDA video ua-cam.com/video/9m4n2xVzk9o/v-deo.html

  • @oliesting4921
    @oliesting4921 3 роки тому

    Wish you do more R videos

  • @valenciawalker6498
    @valenciawalker6498 11 місяців тому

    Thank you

  • @mxaraujo
    @mxaraujo 4 роки тому +1

    I will try

  • @danielit5845
    @danielit5845 5 років тому +1

    I think u can do more than that with R. However, Thanks

    • @DataProfessor
      @DataProfessor  5 років тому +1

      Thanks for your comment. Yes, exactly, more videos on R data science projects coming up. Next 2 videos will be to 1) visualize the iris data set and 2) building a classification model for predicting the class label. Please stay tuned. 😀

    • @DataProfessor
      @DataProfessor  5 років тому

      A sneak peak of what's to come is that I will eventually cover is how you can build a data-driven web app using R and shiny. An example of the web app that I've developed is codes.bio/osfp and further detail of the implementation is published at jcheminf.biomedcentral.com/articles/10.1186/s13321-016-0185-8

  • @BigPooPooKaka
    @BigPooPooKaka 4 роки тому

    you can use a pistachio to open a pistachio

  • @highonsportzzz
    @highonsportzzz 4 роки тому

    Too long video, thumbs down.