I love your detailed tutorials. Maybe one day you'll have me as your Postdoc ;) Id like to point out that in 6:25 you could rename your columns from the select function (which is not trivial since it's defined as selective, not operative), so the %>% rename is unnecessary...
Pat, Again a very nice sequence of related videos. One thing I noticed in the weather data I used was there were were years that had few daily measurements. After some thinking about this I decided to figure out how to find the years with most missing days. I thought it might be helpful to others to share this so here is the code: my_weather %>% count(year = year(date)) %>% ggplot(aes(x = year, y = n - 365)) + geom_line() And I found some "bad" years. Next I decide to extract those years with the day counts. my_weather %>% count(year = year(date)) %>% arrange(n) I had nine years with measurements less than 360. The most obvious were the terminal year (start and finish) which is not surprising. So what do we do? Toss those years? Or do set a mark of say no more than 20% missing? A better approach might be to do an running window functions such as those found in the slider packge and aggregate 2-3 days. Just a thought.
Good stuff. Keep in mind that NA might mean zero. I’d dig into the documentation deeper if this is an important point for you. Also if a year is completely absent it won’t show up as a zero - it won’t show up at all
Thank u for this Pat Got an idea how to combine progress bars with list.files? I got some directories on onedrive which have large and many files that take ages to read in as it automatically download them
Hi Pat, will it be possible to label the x-axis labels with two different colors, say genus as grey and genus+fit as dodgerblue? I tried using element_text. However it seems element_text has some issues as its returns this error message, Vectorized "Warning message: input to `element_text()` is not officially supported. Results may be unexpected or may change in future versions of ggplot2". Is there a way to circumvent this problem? I even tried the element markdown option. It also doesn't help much. Many thanks for your help.
Lets say I have a 13 week trial split into a one-week run in phase and a 6 week supplement phase and a 6 week post supplement phase, is it possible to add additional titles on the x axis to indicate this
I would probably make three facets - one row, three columns. Then I would put the facet title on the bottom outside the x-axis text. I've done this with moving them to the y-axis for a one column, three row figure, but you could surely do it with what you describe
What other questions do you have about using facet_wrap?
Is facet grid the best thing ever? Yes!
Thank you again, and it is a gem for students to have lectures like this from a Master.
You're very welcome!
I love your detailed tutorials. Maybe one day you'll have me as your Postdoc ;)
Id like to point out that in 6:25 you could rename your columns from the select function (which is not trivial since it's defined as selective, not operative), so the %>% rename is unnecessary...
Hah! Yeah sometime since this video was made I started using select to rename columns
Pat,
Again a very nice sequence of related videos. One thing I noticed in the weather data I used was there were were years that had few daily measurements. After some thinking about this I decided to figure out how to find the years with most missing days. I thought it might be helpful to others to share this so here is the code:
my_weather %>%
count(year = year(date)) %>%
ggplot(aes(x = year, y = n - 365)) +
geom_line()
And I found some "bad" years. Next I decide to extract those years with the day counts.
my_weather %>%
count(year = year(date)) %>%
arrange(n)
I had nine years with measurements less than 360. The most obvious were the terminal year (start and finish) which is not surprising. So what do we do? Toss those years? Or do set a mark of say no more than 20% missing? A better approach might be to do an running window functions such as those found in the slider packge and aggregate 2-3 days. Just a thought.
Good stuff. Keep in mind that NA might mean zero. I’d dig into the documentation deeper if this is an important point for you. Also if a year is completely absent it won’t show up as a zero - it won’t show up at all
Thank u for this Pat
Got an idea how to combine progress bars with list.files?
I got some directories on onedrive which have large and many files that take ages to read in as it automatically download them
Sorry no idea
How the hell did you learn all of those details? :D
Practice! 😂
What about Rainbow parentheses @ pat ? :D
Again thank you wonderful inside of your process!
some day ... I could use all the help I can get!
Hi Pat, will it be possible to label the x-axis labels with two different colors, say genus as grey and genus+fit as dodgerblue? I tried using element_text. However it seems element_text has some issues as its returns this error message, Vectorized "Warning message: input to `element_text()` is not officially supported. Results may be unexpected or may change in future versions of ggplot2".
Is there a way to circumvent this problem? I even tried the element markdown option. It also doesn't help much. Many thanks for your help.
I think the easiest would be to use element_markdown from ggtext and then use an html to set the text color to what you want it to be in the label
Lets say I have a 13 week trial split into a one-week run in phase and a 6 week supplement phase and a 6 week post supplement phase, is it possible to add additional titles on the x axis to indicate this
I would probably make three facets - one row, three columns. Then I would put the facet title on the bottom outside the x-axis text. I've done this with moving them to the y-axis for a one column, three row figure, but you could surely do it with what you describe
@Riffomonas sorted it. Between yours and a couple of other videos 💪🏼 thanks for the reply