Intro to Data Visualization with R & ggplot2

Поділитися
Вставка
  • Опубліковано 18 вер 2024

КОМЕНТАРІ • 128

  • @kushagramishra9257
    @kushagramishra9257 7 років тому +54

    I would suggest everyone beginning with ggplot2 to go through this 1hr vedio, it will save you a lot of time understanding the basics.

    • @Datasciencedojo
      @Datasciencedojo  7 років тому +3

      @Kushagra Mishra - You are too kind, glad you liked the video!
      Dave

    • @deepakhemadri
      @deepakhemadri 6 років тому

      Kushagra Mishra uwl

  • @Jenna-iu2lx
    @Jenna-iu2lx 3 роки тому +5

    This video is literally PERFECT for ggplot2 beginners! In only one hour, you'll learn the basics of ggplot2 R coding and you'll end up falling in love with ggplot2 (I thought this language was weird and not intuitive at first, but after this video I think it's very useful and practical to visualize more accurate data plots!)

  • @ayasuniverse
    @ayasuniverse 3 роки тому +1

    THIS IS ABSOLUETELY ONE OF THE BEST TUTORIALS ON CODING THAT I'VE EVER SEEN !!!! THANK YOUUU !! UP UP UP

  • @asimmunshi4830
    @asimmunshi4830 4 роки тому +5

    This was amazing - thanks. It was, literally, the first time I've ever coded anything in my life, I've wanted to learn about data vis stuff for sports analytics for a long time and this video was the perfect introduction.
    If anyone is in a similar position to myself, and has zero ! previous knowledge of R or programming, and wants to learn about data vis, I'd just start with this video. The only thing I needed to hit google for was to learn how to import the dataset into RStudio (yes, really).
    Thank you!

  • @shahronak47
    @shahronak47 6 років тому +184

    26:30 - Actual video

  • @AliAwadh980
    @AliAwadh980 7 років тому +26

    The seventh question, I believe the labs should be as:
    labs(x = "Age", y = "Density"), and
    labs(x = "Age", y = "Survived Count")

  • @Jxdemelo1961
    @Jxdemelo1961 3 роки тому +1

    Simply exceptional. Thank you. I'm hooked. And I'm not even a Data Scientist. If I was 20 years younger, I'd get into this field.

    • @Datasciencedojo
      @Datasciencedojo  3 роки тому

      Glad to hear that, Jaime. It's still not too late to start with data science. Take a look at our Bootcamp, which might be a great way for you to start: datasciencedojo.com/data-science-bootcamp/

  • @khoadang4844
    @khoadang4844 5 років тому +3

    Currently a college student pursuing a degree in Economics. I'm taking Intro to Economic Data Analysis, and we have the choice of using R or Excel. Our first Homework Project directly coincides with using ggplot2. I haven't even finished half of the video but can already say I have learned so much about R. Sweet vid!

  • @mursyidhb
    @mursyidhb 3 роки тому +3

    Thanks Dave... I think the way you present the code and interpreting the result is awesome. even I have just new to ggplot2, the presentation bring me as if we have so familiar with the code. You make R is not that difficult.

    • @Datasciencedojo
      @Datasciencedojo  3 роки тому +1

      glad to help you out, keep following us for more content!

  • @constantineau9686
    @constantineau9686 3 роки тому +1

    An excellent video! Thanks a lot!

  • @maurocarvalho9835
    @maurocarvalho9835 4 роки тому +1

    Great presentation! Thanks for making ggplot2 easier to be understood.

  • @joanabucheri1380
    @joanabucheri1380 4 роки тому +1

    Thank you for this video, it was indeed helpful. Didn't have sufficient knowledge in ggplots but now i do. Thanks a lot!

  • @charleschungudaka6879
    @charleschungudaka6879 3 роки тому

    Wow, the Best video on ggplot2. Love you Data Science Dojo. So very much helpful and really got me excited.

  • @AncillenoDavis
    @AncillenoDavis 6 років тому

    Definitely one of the best intros to ggplot2

  • @najibbht9727
    @najibbht9727 4 роки тому +1

    Super helpful and crystal clear intro. Thank you very much!

  • @ruizhang3956
    @ruizhang3956 4 роки тому +2

    Thanks for the tutorial! Small caveats on the density plot and the histogram towards the end. The axes are mislabeled. Y should be probability density or counts, while X should be age

  • @darren46166
    @darren46166 5 років тому +4

    Mistakes in your code 162, 163, 171 and 172. The x axis should be "Age" and y axis should be "Survived". By the way, great tutorial!

  • @pewolo
    @pewolo 3 роки тому +1

    Clear and pertinent!

  • @techknowhow4802
    @techknowhow4802 6 років тому

    Good lecture on ggplot and its functionalities. I liked the examples. I would have liked to see it go a little deeper into examples coders and analysts can use directly in their analysis and data science problems. Thank you.

  • @RamadanElsharif
    @RamadanElsharif 3 роки тому +1

    So clear and nice lecture. Thank you so much.

    • @Datasciencedojo
      @Datasciencedojo  3 роки тому +1

      Glad you liked it, stay tuned for more lectures!

  • @thedeadman8361
    @thedeadman8361 6 років тому

    Great intro to ggplot2. Made the basics very clear.

  • @mohdyounisbhat2447
    @mohdyounisbhat2447 3 роки тому

    Such a wonderful video!!!So simple and easy way to make it understand

  • @tassoskat8623
    @tassoskat8623 3 роки тому +1

    I just started loving R

  • @nkristianschmidt
    @nkristianschmidt 6 років тому +1

    Very helpful. I think at the end, the density plot vs histograms issue is, the layered density plots show two different distributions of age and the histograms show one distribution of age and bi-color that distribution by survival. Two different things.

    • @brantleygrey696
      @brantleygrey696 3 роки тому

      i guess im asking randomly but does anybody know a trick to log back into an instagram account??
      I was dumb forgot the password. I would appreciate any assistance you can give me

    • @chaimvance8608
      @chaimvance8608 3 роки тому

      @Brantley Grey Instablaster :)

    • @brantleygrey696
      @brantleygrey696 3 роки тому

      @Chaim Vance I really appreciate your reply. I found the site on google and I'm trying it out now.
      Seems to take a while so I will reply here later when my account password hopefully is recovered.

    • @brantleygrey696
      @brantleygrey696 3 роки тому

      @Chaim Vance it worked and I now got access to my account again. Im so happy:D
      Thanks so much you saved my ass :D

    • @chaimvance8608
      @chaimvance8608 3 роки тому

      @Brantley Grey Happy to help :D

  • @rahulfederer20
    @rahulfederer20 Рік тому

    Legit the perfect video for a beginner. Thanks a ton man

  • @kylelarson5074
    @kylelarson5074 6 років тому

    What would be fantastic is if you could please create 10-15min or less summary videos of your lessons just to provide a snap shot of the different codes. That way it would make it extremely easy to revise your information without needing to sit through the repetition of the more indepth explanations we have already heard.

  • @Justme1635438
    @Justme1635438 4 роки тому

    I loaded the dataset in both SPSS and R and did all of the plots - to me SPSS was more easy to use, but the plots actually look better in R. Great video.

  • @mohamedbelo5462
    @mohamedbelo5462 6 років тому

    Wonderful, this is was so useful and one hour full of knowledge and hand on practice.
    Thanks alot guys !
    Belo

    • @lcaslokonon3979
      @lcaslokonon3979 6 років тому

      Thank you for making this great tutorial! It's easy and simple to follow! I've learned a ton from it; keep making more, please!

  • @martinwilson1729
    @martinwilson1729 7 років тому +1

    Very helpful and appreciated, thanks for uploading

  • @shreyachauhan9181
    @shreyachauhan9181 4 роки тому

    oh man I can't thank enough, you are so good I lost my mind in understanding u hold my back, thanks

  • @This_aint_rocket_science
    @This_aint_rocket_science 4 роки тому +1

    thanks very informative

  • @examtest2883
    @examtest2883 6 років тому

    worth watching 1 hr..Really helpful. Thanks a lot

  • @afifkhaja
    @afifkhaja 5 років тому +1

    Superb presentation

  • @michaelrockinger
    @michaelrockinger 6 років тому +5

    Show really starts after 25min. You should have discussed passengerid and name when you discussed the variables. Is ggplot smart to use factors for visualization? In a few days i will be desperate to remember that i need to factor to get certain visu. It should be the programmer to have contol not the program. No? Is it really sooo complicated to put % in the plot? Not good publicity for such a great package as ggplot.

  • @saurabhyelmame
    @saurabhyelmame 3 роки тому +1

    Why the column names are converted to factors in 32:38 ?
    I tried using them without converting to factors.
    Some were executed but some gave errors.

    • @HarshPatel-ph1lp
      @HarshPatel-ph1lp 3 роки тому

      I think if you don't convert them into factors (which are basically categorical variables), then R will think them just as a string of words rather than recurring categories.

  • @jiangxu3895
    @jiangxu3895 4 роки тому

    Thank you very much for your explanation.

  • @andymonks2808
    @andymonks2808 6 років тому

    excellent video! Thank you very much Dave

  • @victorsamuel6199
    @victorsamuel6199 3 роки тому

    This is so awesome. Thank you so much.

  • @josephmakambala6395
    @josephmakambala6395 2 роки тому +1

    I need some help with my R language biostatistics and I'm glad to pay the affordable R tutorial through Zoom or any other platform.

    • @Datasciencedojo
      @Datasciencedojo  2 роки тому

      Hello Joseph, do check out our free course on R: online.datasciencedojo.com/course/R-Programming

  • @gyandeepsharma5516
    @gyandeepsharma5516 4 роки тому

    thanks for such good vedio. Loved it,

  • @WahranRai
    @WahranRai 6 років тому +1

    46:08 May be instead of using copy and paste, we could use, for example:
    ggplt = ggplot2(titanic,aes...) and add layers to that
    ggplt +
    theme_bw().+
    labs()....

  • @levibauer375
    @levibauer375 4 роки тому

    Found this super helpful! Thanks so much

  • @asifkhan-fr4wb
    @asifkhan-fr4wb 7 років тому +2

    very nice explanation with the dataset. Thank You.

  • @statisticstime4734
    @statisticstime4734 4 роки тому

    Excellent!

  • @sahalnaz3904
    @sahalnaz3904 7 років тому +1

    very useful video... thank you

  • @JulioCCavalcanti
    @JulioCCavalcanti 4 роки тому

    Very good!

  • @pamelabenitez7077
    @pamelabenitez7077 6 років тому

    Nice video to get you hook with ggplot2

  • @jiyaadnaeem4802
    @jiyaadnaeem4802 4 роки тому

    Great tutorial, thank you!

  • @schraudi5763
    @schraudi5763 3 роки тому

    Isn't there a mistake in the axis description of the last histogram?

  • @dreznik
    @dreznik 6 років тому

    you should do geom_boxplot(notch=T) so folks understand the concept of visually comparing medians; also read_csv preferred over read.csv

  • @LucasSantos-sh8tu
    @LucasSantos-sh8tu 5 років тому

    Hi! How can I change the graphic color? I created the graphic but I don't want it to have only the basic colors of the ggplot2, like this pink and blue colors. I did a little research on google, but I only find how I can change the color on graphics that has continous variables and my variables are discrets. Can you help me?

  • @eyadha1
    @eyadha1 3 роки тому

    thank you very much, very helpful for me.

  • @aks1008
    @aks1008 6 років тому

    Sir very good video...I just had a doubt ...if we have 8-10 categories instead of 3 for pclass is there an option to select and show the top 5 pclass from the 8-10 categories and plot them.using ggplot...because I work in the aerospace industry and have multiple categories for each variable...thanks
    Amod Shirke

  • @RanaMuhammadWaqas
    @RanaMuhammadWaqas 5 років тому

    ggplot(titanic, aes(x = Age)) +
    theme_bw()+
    geom_histogram(binwidth = 5) +
    labs(y="Passanger Count",
    x="Age (binwidth=5)",
    title = "Titanic Age Distribution")
    This doesnt work getting an error
    Error: StatBin requires a continuous x variable: the x variable is discrete. Perhaps you want stat="count"?
    >

  • @poojadhonde3169
    @poojadhonde3169 6 років тому

    Very nice sir,
    Please make a videos for 3d visualization .

  • @akakhbod
    @akakhbod 4 роки тому

    Thanks a ton david ...

  • @aegystierone8505
    @aegystierone8505 4 роки тому

    Incredible, telling a story with data!

  • @tpeldonyou
    @tpeldonyou 6 років тому

    What is the use of factorise here? I thought factorising some variable was going to be used later in exercise.

  • @RavinderRam
    @RavinderRam 6 років тому +1

    ggplot2 best package in data science for visulaization

  • @ramenchetia7529
    @ramenchetia7529 5 років тому

    Hi, I tried running the code for 2nd question w.r.t Sex but still getting the grey bars. The color for survived is not coming. Please help.

  • @ntimdomfeh1959
    @ntimdomfeh1959 7 років тому

    Thank you very much. You are far too kind

  • @kmlim4657
    @kmlim4657 5 років тому +1

    Thanks for great lecture, and I'm wondering in the lecture around 1:05:58, the code indicates that aes(x is Age), but the label says it is Survived. Shouldn't it be flipped? labs(y = Survived, x = Age ...)

  • @AYODEJIIYANDA
    @AYODEJIIYANDA 6 років тому

    This is a great tur]torial, good job

  • @aikagyan999
    @aikagyan999 4 роки тому

    Thank you so much..

  • @sharanyasredharan9625
    @sharanyasredharan9625 3 роки тому

    Hello..the titanic.csv dataset is stored in code.datasciencedojo..May I please know where or how else can I retrieve the file as I would like to go through these class? Thank you.

  • @sureshkharel1514
    @sureshkharel1514 4 роки тому

    please create more content on prediction and fitting function

  • @yaoliao3517
    @yaoliao3517 4 роки тому

    Really thanks

  • @shubhangisuralkar8436
    @shubhangisuralkar8436 7 років тому +2

    really helpfull

  • @georgemathematician1186
    @georgemathematician1186 3 роки тому

    Pretty cool

  • @CharwakApte
    @CharwakApte 6 років тому

    Infinite SNR - Thanks!

  • @robinredhu1995
    @robinredhu1995 6 років тому +4

    I think in last two graphs both density and histogram are wrongly labeled

    • @headhunterz1000
      @headhunterz1000 6 років тому

      Would you elaborate on why this is?

    • @juanpablorb7150
      @juanpablorb7150 6 років тому

      I was thinking the same. It doesn't make sense to label the y axis as survived, but rather frequency? while the x axis should just be age.

    • @FRANCESCO-wj8rs
      @FRANCESCO-wj8rs 6 років тому

      I also believe it is wrong. It should be age on the x-axis.

    • @aakash4488
      @aakash4488 5 років тому

      The age is on the x-axis and survived is on the y-axis.

  • @KK-ty6ez
    @KK-ty6ez 3 роки тому

    DATA SCIENCE DOJO Piliz share the R code , my not running properly

  • @dmtv_5559
    @dmtv_5559 7 років тому

    Please how do I display equation of the line and r^2 on my plots in R? In excel it is very easy to do this. I am buying into R because of R markdown. Please help out as I need my equation displayed just the way I use to in excel

    • @rexevan6714
      @rexevan6714 7 років тому

      Liberty Mgbanyi you can use anotation function and use Paste0 to show the r^2 and equation. If you have r^2 and equation for every facet, you would need to make more variable / column.

  • @atomii9455
    @atomii9455 4 роки тому

    Wow. Thanks

  • @ButterfaceGMusicSlump
    @ButterfaceGMusicSlump 5 років тому +1

    for people that are familiar with R skip to 33:33

  • @mehriban9003
    @mehriban9003 4 роки тому

    I wanted to practice with the file while watching this video. But I haven't been able to download the file that he mentions as 'easily downloadable'. I wish he made the file available for downloading on github.

  • @the29couple7
    @the29couple7 6 років тому

    Hi I have started working with ggplot2 recently
    install.packages("ggplot2")is working fine but while using library(ggplot2) I am having below and can't come out.
    library(ggplot2)
    Error: package or namespace load failed for ‘ggplot2’:
    object ‘enexprs’ is not exported by 'namespace:rlang'
    In addition: Warning message:
    package ‘ggplot2’ was built under R version 3.4.4
    Your help is highly needed

  • @manonabrilestades
    @manonabrilestades 5 років тому

    can i do you one cuestion? is about a graphic that i can't resolve

  • @hasibulislam2187
    @hasibulislam2187 6 років тому

    in 42:30min, you have a color on you bars. but with same code, my bars are having the same color. Why? Please give me a solution. Thanks in advance

    • @RobertFisher
      @RobertFisher 6 років тому

      You likely forgot to set up the factors. Go back to around 32:00 in the video for the explanation.

    • @hasibulislam2187
      @hasibulislam2187 6 років тому

      Yes. I have done after making comment. Thanks for replying.

  • @punchline9131
    @punchline9131 4 роки тому

    I would like to use "ggplot2" to create a graph showing whether life satisfaction returns to the value it had before the unemployment event occurred.
    Unemployment I have coded with 0 = not unemployed and 1 = unemployed. General life satisfaction is coded 0 - 10.
    I have already created the data set df_emp with a subset command, which contains all persons who were unemployed at least once. It also contains all years of observation of these persons. So all years before, during and after unemployment, as well as the corresponding values for life satisfaction
    The years before, during and after the event should now be entered on the x-axis. Where 0 is the event unemployment. The values -1 -2 and 1 2 etc. show values for the years before and after unemployment.
    On the y-axis the values for life satisfaction should then be deducted (centered).
    Unfortunately, I can't manage to model this graphic in R and would therefore be very pleased if somebody could help me or give me some tips on how to proceed.
    Best regards
    ps. I got the graphic from Lucas et al. (2004) - Unemployment alters the set point for life satisfaction

  • @tuanlong9238
    @tuanlong9238 6 років тому

    thanks a lot !!!

  • @vincenzo4259
    @vincenzo4259 2 роки тому

    Thanks

  • @djangoworldwide7925
    @djangoworldwide7925 3 роки тому

    That video was fantastic. I now know i should be a girl if i want to go on a cruise

  • @bikashpokharel478
    @bikashpokharel478 3 роки тому +1

    Wait a minute, I am gonna update my linked In bio to R expert

  • @didimoescobar2247
    @didimoescobar2247 5 років тому

    I don´t read the CSV file.....please help..

  • @pravinmsp8025
    @pravinmsp8025 5 років тому

    I need the excel or csv file to practice

    • @Datasciencedojo
      @Datasciencedojo  5 років тому

      You can find our supplemental material here: code.datasciencedojo.com/datasciencedojo/tutorials/tree/master/Introduction%20to%20Data%20Visualization%20with%20R%20and%20ggplot2

  • @sayedmohammad2394
    @sayedmohammad2394 6 років тому

    start from @24:00

  • @ahsanamrirohman6865
    @ahsanamrirohman6865 4 роки тому

    Install equiser package

  • @Dlorensia123
    @Dlorensia123 4 роки тому

    tq dojo

  • @jovial129
    @jovial129 3 роки тому

    Histogram 54:44

  • @Bkoerier
    @Bkoerier 3 роки тому

    what a load of unnecessary talking.... 25 minutes of not needed introduction. 5 minutes to tell why passengerID and name are not relevant.... get to the point already...