I would suggest that everyone beginning with ggplot2 go through this 1 hr video; it will save you a lot of time understanding the basics.
@Kushagra Mishra - You are too kind, glad you liked the video!
Dave
Kushagra Mishra uwl
This video is literally PERFECT for ggplot2 beginners! In only one hour, you'll learn the basics of ggplot2 R coding and you'll end up falling in love with ggplot2 (I thought this language was weird and not intuitive at first, but after this video I think it's very useful and practical to visualize more accurate data plots!)
THIS IS ABSOLUTELY ONE OF THE BEST TUTORIALS ON CODING THAT I'VE EVER SEEN !!!! THANK YOUUU !! UP UP UP
This was amazing, thanks. It was literally the first time I've ever coded anything in my life. I've wanted to learn about data vis for sports analytics for a long time, and this video was the perfect introduction.
If anyone is in a similar position, with zero previous knowledge of R or programming, and wants to learn about data vis, I'd just start with this video. The only thing I needed to Google was how to import the dataset into RStudio (yes, really).
Thank you!
26:30 - Actual video
thnks bruh
Thank you!
I thought this was an exaggeration, but no. Thanks for the heads-up!
Thank you
Thanks man
For the seventh question, I believe the labs should be:
labs(x = "Age", y = "Density"), and
labs(x = "Age", y = "Survived Count")
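For anyone following along, here is a minimal sketch of what the corrected labels could look like. The small `titanic` data frame below is a made-up stand-in (an assumption, not the real dataset), and "Passenger Count" is one reasonable y label for the histogram since the bars count all passengers and only the fill shows survival.

```r
library(ggplot2)

# Toy stand-in for the tutorial's titanic data frame (values are made up).
titanic <- data.frame(
  Age      = c(22, 38, 26, 35, 54, 2, 27, 14, 58, 20),
  Survived = factor(c(0, 1, 1, 1, 0, 0, 1, 1, 1, 0))
)

# Density plot: Age on the x-axis, estimated density on the y-axis.
p_density <- ggplot(titanic, aes(x = Age, fill = Survived)) +
  theme_bw() +
  geom_density(alpha = 0.5) +
  labs(x = "Age", y = "Density", title = "Titanic Age Density by Survival")

# Histogram: Age on the x-axis, number of passengers per bin on the y-axis.
p_hist <- ggplot(titanic, aes(x = Age, fill = Survived)) +
  theme_bw() +
  geom_histogram(binwidth = 5) +
  labs(x = "Age", y = "Passenger Count", title = "Titanic Age Distribution by Survival")
```

Printing `p_density` or `p_hist` at the console draws the plot.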
Simply exceptional. Thank you. I'm hooked. And I'm not even a Data Scientist. If I was 20 years younger, I'd get into this field.
Glad to hear that, Jaime. It's still not too late to start with data science. Take a look at our Bootcamp, which might be a great way for you to start: datasciencedojo.com/data-science-bootcamp/
Currently a college student pursuing a degree in Economics. I'm taking Intro to Economic Data Analysis, and we have the choice of using R or Excel. Our first Homework Project directly coincides with using ggplot2. I haven't even finished half of the video but can already say I have learned so much about R. Sweet vid!
Thanks Dave... I think the way you present the code and interpret the results is awesome. Even though I am new to ggplot2, the presentation made me feel as if I were already familiar with the code. You make R seem not that difficult.
glad to help you out, keep following us for more content!
An excellent video! Thanks a lot!
Glad you liked it!
Great presentation! Thanks for making ggplot2 easier to be understood.
Thank you for this video, it was indeed helpful. Didn't have sufficient knowledge in ggplots but now i do. Thanks a lot!
Wow, the Best video on ggplot2. Love you Data Science Dojo. So very much helpful and really got me excited.
Definitely one of the best intros to ggplot2
Super helpful and crystal clear intro. Thank you very much!
Thanks for the tutorial! Small caveats on the density plot and the histogram towards the end. The axes are mislabeled. Y should be probability density or counts, while X should be age
Mistakes in your code 162, 163, 171 and 172. The x axis should be "Age" and y axis should be "Survived". By the way, great tutorial!
Clear and pertinent!
Keep following us for more content!
Good lecture on ggplot and its functionalities. I liked the examples. I would have liked to see it go a little deeper into examples coders and analysts can use directly in their analysis and data science problems. Thank you.
So clear and nice lecture. Thank you so much.
Glad you liked it, stay tuned for more lectures!
Great intro to ggplot2. Made the basics very clear.
Such a wonderful video!!! Such a simple and easy way to make it understandable.
I just started loving R
Very helpful. I think at the end, the density plot vs histograms issue is, the layered density plots show two different distributions of age and the histograms show one distribution of age and bi-color that distribution by survival. Two different things.
Legit the perfect video for a beginner. Thanks a ton man
Keep following us for more crash courses!
What would be fantastic is if you could please create 10-15min or less summary videos of your lessons just to provide a snap shot of the different codes. That way it would make it extremely easy to revise your information without needing to sit through the repetition of the more indepth explanations we have already heard.
I loaded the dataset in both SPSS and R and did all of the plots - to me SPSS was more easy to use, but the plots actually look better in R. Great video.
Wonderful, this was so useful: one hour full of knowledge and hands-on practice.
Thanks a lot, guys!
Belo
Thank you for making this great tutorial! It's easy and simple to follow! I've learned a ton from it; keep making more, please!
Very helpful and appreciated, thanks for uploading
Oh man, I can't thank you enough. You are so good. I was losing my mind trying to understand this and you had my back. Thanks.
thanks very informative
worth watching 1 hr..Really helpful. Thanks a lot
Superb presentation
The show really starts after 25 min. You should have discussed PassengerId and Name when you discussed the variables. Is ggplot smart to use factors for visualization? In a few days I will be desperate to remember that I need to convert to factors to get certain visuals. It should be the programmer who has control, not the program. No? Is it really so complicated to put % in the plot? Not good publicity for such a great package as ggplot.
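On the % question, one possible approach (a sketch, not necessarily what the video does) is to stack the bars to 100% with `position = "fill"` and format the axis with `scales::label_percent()`. The `scales` package is installed as a dependency of ggplot2; the data frame below is a made-up stand-in for the tutorial's `titanic` data.

```r
library(ggplot2)

# Toy stand-in for the titanic data frame (values are made up).
titanic <- data.frame(
  Pclass   = factor(c(1, 1, 2, 2, 3, 3, 3, 3)),
  Survived = factor(c(1, 1, 1, 0, 0, 0, 1, 0))
)

# position = "fill" rescales each bar to total 1;
# label_percent() prints the axis ticks as percentages.
p_pct <- ggplot(titanic, aes(x = Pclass, fill = Survived)) +
  theme_bw() +
  geom_bar(position = "fill") +
  scale_y_continuous(labels = scales::label_percent()) +
  labs(y = "Share of Passengers", title = "Titanic Survival Share by Pclass")
```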
Why are the columns converted to factors at 32:38?
I tried using them without converting to factors.
Some plots ran, but some gave errors.
I think if you don't convert them into factors (which are basically categorical variables), R will treat them as plain strings or numbers rather than as recurring categories.
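To make the factor point concrete, here is a minimal sketch with a made-up stand-in for the `titanic` data frame (the values are assumptions). When `Survived` is read from a CSV as the numbers 0 and 1, ggplot2 treats it as continuous, which is a likely reason for the single-color grey bars some commenters describe; after `as.factor()` it becomes a two-level category and the fill splits the bars.

```r
library(ggplot2)

# Toy stand-in: Survived and Pclass arrive as plain numbers, Sex as text.
titanic <- data.frame(
  Sex      = c("male", "female", "female", "male", "female", "male"),
  Pclass   = c(3, 1, 2, 3, 1, 2),
  Survived = c(0, 1, 1, 0, 1, 1)
)

# Convert the categorical columns to factors before plotting.
titanic$Sex      <- as.factor(titanic$Sex)
titanic$Pclass   <- as.factor(titanic$Pclass)
titanic$Survived <- as.factor(titanic$Survived)

# With Survived as a factor, fill produces one color per level.
p_sex <- ggplot(titanic, aes(x = Sex, fill = Survived)) +
  theme_bw() +
  geom_bar() +
  labs(y = "Passenger Count", title = "Titanic Survival by Sex")
```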
Thank you very much for your explanation.
excellent video! Thank you very much Dave
This is so awesome. Thank you so much.
Keep following us for more content!
I need some help with my R language biostatistics and I'm glad to pay the affordable R tutorial through Zoom or any other platform.
Hello Joseph, do check out our free course on R: online.datasciencedojo.com/course/R-Programming
Thanks for such a good video. Loved it.
46:08 Maybe instead of copying and pasting, we could store the base plot in a variable, for example:
ggplt = ggplot(titanic, aes...) and add layers to that:
ggplt +
theme_bw() +
labs()....
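A runnable sketch of that idea, using a made-up stand-in for the `titanic` data frame. The variable names `base_plot`, `p_counts`, and `p_by_class` are only illustrative.

```r
library(ggplot2)

# Toy stand-in for the titanic data frame (values are made up).
titanic <- data.frame(
  Sex      = factor(c("male", "female", "female", "male", "female", "male")),
  Pclass   = factor(c(3, 1, 2, 3, 1, 2)),
  Survived = factor(c(0, 1, 1, 0, 1, 1))
)

# Store the shared data, aesthetics, and theme once.
base_plot <- ggplot(titanic, aes(x = Sex, fill = Survived)) +
  theme_bw()

# Reuse the stored object and add different layers to it.
p_counts <- base_plot +
  geom_bar() +
  labs(y = "Passenger Count", title = "Titanic Survival by Sex")

p_by_class <- base_plot +
  geom_bar() +
  facet_wrap(~ Pclass) +
  labs(y = "Passenger Count", title = "Titanic Survival by Sex and Pclass")
```

Adding layers with `+` returns a new plot object, so `base_plot` itself stays unchanged and can be reused as often as needed.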
Found this super helpful! Thanks so much
very nice explanation with the dataset. Thank You.
Excellent!
very useful video... thank you
Very good!
Nice video to get you hooked on ggplot2
Great tutorial, thank you!
Isn't there a mistake in the axis description of the last histogram?
you should do geom_boxplot(notch=T) so folks understand the concept of visually comparing medians; also read_csv preferred over read.csv
Hi! How can I change the plot colors? I created the plot, but I don't want it to have only the default ggplot2 colors, like this pink and blue. I did a little research on Google, but I only found how to change the colors on plots with continuous variables, and my variables are discrete. Can you help me?
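One way to do this for a discrete variable, as a minimal sketch: `scale_fill_manual()` assigns a color to each factor level. The data frame is a made-up stand-in for `titanic`, and the two color names are arbitrary choices.

```r
library(ggplot2)

# Toy stand-in for the titanic data frame (values are made up).
titanic <- data.frame(
  Sex      = factor(c("male", "female", "female", "male", "female", "male")),
  Survived = factor(c(0, 1, 1, 0, 1, 1))
)

# Map each level of the discrete fill variable to a color of your choice.
p_colors <- ggplot(titanic, aes(x = Sex, fill = Survived)) +
  theme_bw() +
  geom_bar() +
  scale_fill_manual(values = c("0" = "grey40", "1" = "steelblue")) +
  labs(y = "Passenger Count", title = "Titanic Survival by Sex")
```

If the color is mapped with `color = ...` instead of `fill = ...` (for points or lines), the matching function is `scale_color_manual()`.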
thank you very much, very helpful for me.
Sir, very good video... I just had a question: if we have 8-10 categories instead of 3 for Pclass, is there an option to select the top 5 categories out of the 8-10 and plot only those using ggplot? I work in the aerospace industry and have multiple categories for each variable. Thanks
Amod Shirke
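A possible approach, sketched in base R plus ggplot2: count the categories, keep the five most frequent, and plot only those rows. The `parts` data frame and its `category` column are hypothetical stand-ins for a variable with eight levels.

```r
library(ggplot2)

# Hypothetical data: one row per record, eight categories with known counts.
parts <- data.frame(
  category = rep(LETTERS[1:8], times = c(12, 3, 9, 1, 7, 15, 2, 5))
)

# Rank the categories by frequency and keep the names of the top five.
counts <- sort(table(parts$category), decreasing = TRUE)
top5   <- names(counts)[1:5]

# Keep only rows in the top five and order the bars by frequency.
top_parts <- parts[parts$category %in% top5, , drop = FALSE]
top_parts$category <- factor(top_parts$category, levels = top5)

p_top5 <- ggplot(top_parts, aes(x = category)) +
  theme_bw() +
  geom_bar() +
  labs(x = "Category", y = "Count", title = "Top 5 Categories")
```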
ggplot(titanic, aes(x = Age)) +
theme_bw() +
geom_histogram(binwidth = 5) +
labs(y = "Passenger Count",
x = "Age (binwidth = 5)",
title = "Titanic Age Distribution")
This doesn't work. I get this error:
Error: StatBin requires a continuous x variable: the x variable is discrete. Perhaps you want stat="count"?
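One likely cause (an assumption, since the comment does not show how the data was loaded) is that `Age` ended up as a factor or character column, for example by converting every column to a factor. A histogram needs a numeric x, so converting `Age` back to numbers should clear the error. The sketch below reproduces the situation with a made-up stand-in data frame.

```r
library(ggplot2)

# Toy stand-in where Age was (accidentally) stored as a factor.
titanic <- data.frame(
  Age = c("22", "38", "26", "35", NA, "54", "2", "27"),
  stringsAsFactors = TRUE
)

# Go through as.character() first; as.numeric() on a factor alone
# would return the internal level codes, not the ages.
titanic$Age <- as.numeric(as.character(titanic$Age))

# na.rm = TRUE drops missing ages without a warning.
p_age <- ggplot(titanic, aes(x = Age)) +
  theme_bw() +
  geom_histogram(binwidth = 5, na.rm = TRUE) +
  labs(y = "Passenger Count",
       x = "Age (binwidth = 5)",
       title = "Titanic Age Distribution")
```

Running `str(titanic)` before plotting is a quick way to check which type each column has.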
Very nice sir,
Please make a video on 3D visualization.
Thanks a ton david ...
Incredible, telling a story with data!
What is the use of factorise here? I thought factorising some variable was going to be used later in exercise.
ggplot2 is the best package in data science for visualization
Hi, I tried running the code for 2nd question w.r.t Sex but still getting the grey bars. The color for survived is not coming. Please help.
Thank you very much. You are far too kind
Thanks for great lecture, and I'm wondering in the lecture around 1:05:58, the code indicates that aes(x is Age), but the label says it is Survived. Shouldn't it be flipped? labs(y = Survived, x = Age ...)
This is a great tutorial, good job
Thank you so much..
Hello, the titanic.csv dataset is stored at code.datasciencedojo.. May I please know where or how else I can retrieve the file, as I would like to go through this class? Thank you.
please create more content on prediction and fitting function
Really thanks
really helpful
Pretty cool
Infinite SNR - Thanks!
I think in last two graphs both density and histogram are wrongly labeled
Would you elaborate on why this is?
I was thinking the same. It doesn't make sense to label the y axis as survived, but rather frequency? while the x axis should just be age.
I also believe it is wrong. It should be age on the x-axis.
The age is on the x-axis and survived is on the y-axis.
DATA SCIENCE DOJO, please share the R code; mine is not running properly
Please, how do I display the equation of the line and r^2 on my plots in R? In Excel it is very easy to do this. I am buying into R because of R Markdown. Please help, as I need my equation displayed just the way I used to have it in Excel.
Liberty Mgbanyi, you can use the annotate() function with paste0() to show the r^2 and the equation. If you have an r^2 and equation for every facet, you would need to make more variables / columns.
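A minimal sketch of that suggestion: fit the line with `lm()`, build the label text, and place it with `annotate()`. It uses `sprintf()` rather than `paste0()` so the sign of the intercept prints cleanly. The data frame `df` is made up for illustration.

```r
library(ggplot2)

# Made-up example data, roughly linear.
df <- data.frame(
  x = 1:10,
  y = c(2.1, 3.9, 6.2, 7.8, 10.1, 12.2, 13.8, 16.1, 18.0, 19.9)
)

# Fit the regression and pull out slope, intercept, and R^2.
fit <- lm(y ~ x, data = df)
b   <- coef(fit)
r2  <- summary(fit)$r.squared

# Build the label; %+.2f prints the intercept with its sign.
eq_label <- sprintf("y = %.2fx %+.2f,  R^2 = %.3f",
                    b[["x"]], b[["(Intercept)"]], r2)

# Scatter plot, fitted line, and the label in the top-left corner.
p_fit <- ggplot(df, aes(x = x, y = y)) +
  theme_bw() +
  geom_point() +
  geom_smooth(method = "lm", formula = y ~ x, se = FALSE) +
  annotate("text", x = min(df$x), y = max(df$y), label = eq_label, hjust = 0)
```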
Wow. Thanks
for people that are familiar with R skip to 33:33
I wanted to practice with the file while watching this video. But I haven't been able to download the file that he mentions as 'easily downloadable'. I wish he made the file available for downloading on github.
Hi, I started working with ggplot2 recently.
install.packages("ggplot2") works fine, but when I run library(ggplot2) I get the error below and can't get past it.
library(ggplot2)
Error: package or namespace load failed for ‘ggplot2’:
object ‘enexprs’ is not exported by 'namespace:rlang'
In addition: Warning message:
package ‘ggplot2’ was built under R version 3.4.4
Your help is highly needed
Can I ask you a question? It's about a plot that I can't get right.
At 42:30, you have colors on your bars, but with the same code my bars all have the same color. Why? Please give me a solution. Thanks in advance
You likely forgot to set up the factors. Go back to around 32:00 in the video for the explanation.
Yes. I have done after making comment. Thanks for replying.
I would like to use "ggplot2" to create a graph showing whether life satisfaction returns to the value it had before the unemployment event occurred.
Unemployment I have coded with 0 = not unemployed and 1 = unemployed. General life satisfaction is coded 0 - 10.
I have already created the data set df_emp with a subset command, which contains all persons who were unemployed at least once. It also contains all years of observation of these persons. So all years before, during and after unemployment, as well as the corresponding values for life satisfaction
The years before, during and after the event should now be entered on the x-axis. Where 0 is the event unemployment. The values -1 -2 and 1 2 etc. show values for the years before and after unemployment.
On the y-axis, the (centered) values for life satisfaction should then be plotted.
Unfortunately, I can't manage to model this graphic in R and would therefore be very pleased if somebody could help me or give me some tips on how to proceed.
Best regards
ps. I got the graphic from Lucas et al. (2004) - Unemployment alters the set point for life satisfaction
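A possible starting point, as a sketch under stated assumptions: the column names `id`, `year`, `unemployed`, and `lifesat` are hypothetical, the toy values are made up, and centering on each person's own mean is just one choice (the comment does not say how the original figure was centered). The idea is to compute the years relative to the first unemployment year, average the centered satisfaction per relative year, and plot that.

```r
library(ggplot2)

# Hypothetical layout of df_emp: one row per person and year.
df_emp <- data.frame(
  id         = rep(1:3, each = 5),
  year       = rep(2000:2004, times = 3),
  unemployed = c(0, 0, 1, 0, 0,  0, 1, 0, 0, 0,  0, 0, 0, 1, 0),
  lifesat    = c(7, 7, 4, 5, 6,  8, 5, 6, 7, 7,  6, 6, 6, 3, 5)
)

# First year of unemployment for each person, repeated on every row.
onset <- ave(ifelse(df_emp$unemployed == 1, df_emp$year, NA),
             df_emp$id,
             FUN = function(y) min(y, na.rm = TRUE))

# Event time: 0 is the first unemployment year, -1 the year before, etc.
df_emp$event_time <- df_emp$year - onset

# Center life satisfaction on each person's own mean (an assumption).
df_emp$lifesat_c <- df_emp$lifesat - ave(df_emp$lifesat, df_emp$id)

# Average the centered values for each event time across persons.
avg <- aggregate(lifesat_c ~ event_time, data = df_emp, FUN = mean)

p_event <- ggplot(avg, aes(x = event_time, y = lifesat_c)) +
  theme_bw() +
  geom_line() +
  geom_point() +
  geom_vline(xintercept = 0, linetype = "dashed") +
  labs(x = "Years Before/After Unemployment (0 = event)",
       y = "Life Satisfaction (centered)",
       title = "Life Satisfaction Around Unemployment")
```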
thanks a lot !!!
Thanks
Keep following us for more tutorials.
That video was fantastic. I now know i should be a girl if i want to go on a cruise
Wait a minute, I am gonna update my linked In bio to R expert
I can't read the CSV file... please help.
I need the excel or csv file to practice
You can find our supplemental material here: code.datasciencedojo.com/datasciencedojo/tutorials/tree/master/Introduction%20to%20Data%20Visualization%20with%20R%20and%20ggplot2
start from @24:00
Install equiser package
tq dojo
Histogram 54:44
what a load of unnecessary talking.... 25 minutes of not needed introduction. 5 minutes to tell why passengerID and name are not relevant.... get to the point already...