How to create a grouped plot of jittered data with the ggplot2 R package (CC108)

  • Published 9 Sep 2024

COMMENTS • 14

  • @Riffomonas  3 years ago +3

    What do you think? Besides the critiques I give of the jitter plot in the video, what do you see as the pros and cons of these plots?

    • @RasmusKirkegaard 3 years ago

      Really great episode! An alternative to "random" jitter, which can cause overlapping data points, could be ggbeeswarm, which might make it a bit easier to see the distribution of the data. github.com/eclarke/ggbeeswarm

    • @Riffomonas  3 years ago +1

      @RasmusKirkegaard Thanks! In the next episode I'll talk about adding some summary layers to the plot, and about the tradeoff of having way too much going on when you are looking at more than a handful of taxa.
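
For readers who want to try the suggestion above, here is a minimal sketch that puts a dodged random jitter next to a ggbeeswarm layout. It is not the code from the video: the built-in `ToothGrowth` dataset, the object names, and the width values are assumptions chosen only for illustration, and the ggbeeswarm package is assumed to be installed.

```r
library(ggplot2)
library(ggbeeswarm)  # assumption: installed, e.g. from CRAN

# Illustrative data only: treat dose as the x-axis category and supp as the group
tooth <- transform(ToothGrowth, dose = factor(dose))

# Grouped plot with random jitter; points can still land on top of each other
p_jitter <- ggplot(tooth, aes(x = dose, y = len, color = supp)) +
  geom_point(position = position_jitterdodge(jitter.width = 0.2,
                                             dodge.width = 0.8,
                                             seed = 1))

# Same data with a quasirandom layout from ggbeeswarm, dodged by group
p_swarm <- ggplot(tooth, aes(x = dose, y = len, color = supp)) +
  geom_quasirandom(dodge.width = 0.8)
```

Printing `p_jitter` and `p_swarm` one after the other makes the difference in point overlap easy to compare.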

  • @federicoroggio3052 2 years ago +1

    Hi man, thank you for your guide! I've a tip for you to get the median line with a specific color. If you set "fill=variable, color=variable" and then in the stat_summary you add color="red", you keep all the lines and they change colour.
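
The tip above can be sketched as follows. This is one reading of the comment, not the video's code: the `ToothGrowth` data, the object names, and the widths are assumptions. Because `fill` stays mapped to the grouping variable, the summary layer is still split by group even though its `color` is fixed to red.

```r
library(ggplot2)

# Illustrative data only: dose on the x-axis, supp as the grouping variable
tooth <- transform(ToothGrowth, dose = factor(dose))

p_median <- ggplot(tooth, aes(x = dose, y = len, fill = supp, color = supp)) +
  geom_point(position = position_jitterdodge(jitter.width = 0.2,
                                             dodge.width = 0.8,
                                             seed = 1)) +
  # A crossbar needs y, ymin and ymax; using the median for all three
  # collapses it to a single line per group
  stat_summary(fun = median, fun.min = median, fun.max = median,
               geom = "crossbar", color = "red", width = 0.6,
               position = position_dodge(width = 0.8))
```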

  • @shantanutamuly6932 A year ago

    Excellent video

  • @mshparber 2 years ago +1

    Excellent! I love how jittered strip plots show the actual data points. What if I have millions of data points and don't want to use box plots? Can heat-map-like charts help? Do you have any experience with these?

    • @Riffomonas  2 years ago

      Eh, I’m not a big fan of heat-map-like things. Perhaps a violin plot? I think a boxplot is really your best bet.

  • @shantanutamuly6932 5 months ago

    How do I add an error bar for the standard deviation when using the mean in a crossbar?

    • @Riffomonas  5 months ago

      You should check out the documentation for the stat_summary function. They have several examples. The default is to use the mean and standard error - ggplot2.tidyverse.org/reference/stat_summary.html
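
One possible way to get what the question asks for is to pass `stat_summary` a function that returns the mean plus and minus one standard deviation. The sketch below is an illustration, not something taken from the video or the linked documentation: `mean_sd` is a hypothetical helper, and the `ToothGrowth` data and width values are assumptions.

```r
library(ggplot2)

# Hypothetical helper: returns the y, ymin and ymax columns that fun.data expects
mean_sd <- function(x) {
  m <- mean(x)
  s <- sd(x)
  data.frame(y = m, ymin = m - s, ymax = m + s)
}

# Illustrative data only: dose on the x-axis, supp as the grouping variable
tooth <- transform(ToothGrowth, dose = factor(dose))

p_sd <- ggplot(tooth, aes(x = dose, y = len, color = supp)) +
  # Mean drawn as a flat crossbar (ymin = ymax = mean)
  stat_summary(fun = mean, fun.min = mean, fun.max = mean,
               geom = "crossbar", width = 0.6,
               position = position_dodge(width = 0.8)) +
  # Error bar spanning mean - sd to mean + sd
  stat_summary(fun.data = mean_sd, geom = "errorbar", width = 0.2,
               position = position_dodge(width = 0.8))
```

ggplot2 also provides `mean_sdl`, which wraps a function from the Hmisc package and by default spans two standard deviations on each side; the hand-written helper avoids that extra dependency.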

  • @russtin1 3 years ago

    Nice plot. Did you reject box plots or violin plots because you lose sample size information?

    • @Riffomonas  3 years ago

      Thanks @russtin1! I covered those types of plots in other episodes. I'm not a fan of violin plots because I think they emphasize the mode rather than the median of the distribution and when you have a pretty uniform distribution, all you get are rectangles. For this much data, I'd prefer to use stat_summary with geom="pointrange" and fun.data=median_hilow. I cover this type of plot (and box plots) in the next episode of the series - ua-cam.com/video/7TaGcHsoQpM/v-deo.html
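
The summary layer described in the reply above might look roughly like this. Only `stat_summary`, `geom = "pointrange"`, and `fun.data = median_hilow` come from the comment; the `ToothGrowth` data, the dodge width, and the `conf.int` value are assumptions. Note that `median_hilow` relies on the Hmisc package, which has to be installed for the plot to render.

```r
library(ggplot2)

# Illustrative data only: dose on the x-axis, supp as the grouping variable
tooth <- transform(ToothGrowth, dose = factor(dose))

p_range <- ggplot(tooth, aes(x = dose, y = len, color = supp)) +
  # Point at the median, line spanning the central 50% of the data;
  # conf.int = 0.5 is an assumed choice (the default is 0.95)
  stat_summary(fun.data = median_hilow, fun.args = list(conf.int = 0.5),
               geom = "pointrange",
               position = position_dodge(width = 0.8))
```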

  • @tormaferenc9796 3 years ago +1

    You R my hero!.. hahah got it? (-.-) :D

    • @Riffomonas  3 years ago +1

      Hilarious! You’re not a dad are you? Because that was 💯 dad joke material 😂