Discover 7 Hidden Gems in the R Package Ecosystem

Поділитися
Вставка

КОМЕНТАРІ • 52

  • @tomhenry-datasciencewithr6047
    @tomhenry-datasciencewithr6047  4 роки тому +5

    ▶️ Tidyverse tips in R (my top 18 tips & tricks) - ua-cam.com/video/ZM04jn95YP0/v-deo.html
    🎉 *Subscribe* if you want more videos like this! - ua-cam.com/channels/b5aI-GwJm3ZxlwtCsLu78Q.html
    😃 *Comment* below to share which tricks you liked the most!!
    I figured out what was going on with the warning at 11:12.
    Hint: filter(is.na(start_date)) to look at the NA start_dates (rather than !is.na(start_date) which selects rows which have a correctly parsed start_date). You'll see that startday, startmonth, and startyear are empty (NA) for those rows.

  • @wildaceds
    @wildaceds 3 роки тому +1

    Ive been working with R for like 5 years now and this is extremely helpful. I need to keep up to date with packages.

  • @falayeadeyinka168
    @falayeadeyinka168 3 роки тому +1

    Thank you mr Tom for the great job. It is deep and refreshing. God bless you and your lovely family.

  • @brucefox4954
    @brucefox4954 4 роки тому +5

    Tom, it would be great if you did a video focusing on tsibble. Thanks.

  • @OtRatsaphong
    @OtRatsaphong 2 роки тому

    Tom, Thanks for this great tutorial! Relatively new to R and was looking to familiarise myself with packages that would be useful for financial analysis, and found your channel. Great work!

  • @pradeepviv3672
    @pradeepviv3672 4 роки тому +1

    Thanks very much for this great video, I loved the way you explained how to use these packages very simply. Well done

  • @sloperspinches3122
    @sloperspinches3122 4 роки тому +1

    Thank you for sharing these 7 underrated R packages.

  • @shreyaroraa2234
    @shreyaroraa2234 3 роки тому

    Great video TOM even with R crashing and coming on it was easy to follow, concise and to the point.

  • @denisrastegaev608
    @denisrastegaev608 3 роки тому

    Very helpful! Thank you!

  • @ahmed007Jaber
    @ahmed007Jaber 2 роки тому

    thank u so much, Tom.
    love tsibble and tidytext, never head of them before. would like more such content about text mining

  • @SergioUribe
    @SergioUribe 4 роки тому +1

    very good, thank you for sharing, and thank you also for sharing the mistakes!

  • @porlando12
    @porlando12 4 роки тому +1

    tidylog is an absolute game changer! Thanks for sharing these hidden gems!

  • @solomonoguche2360
    @solomonoguche2360 3 роки тому +2

    Tidytext tutorial please. Now I am highly interested.

  • @bhabishyaneupane2073
    @bhabishyaneupane2073 4 роки тому +1

    I have been watching your videos since day one, and they have really helped me. Thanks for sharing these absolute gems! If you could do a video talking about text analysis or a walkthrough, that'd be great!

    • @tomhenry-datasciencewithr6047
      @tomhenry-datasciencewithr6047  4 роки тому

      Thanks, Bhabishya! I'm glad they are helpful for you! I'll make a video in the next couple of weeks focusing on text analysis with a real dataset. It will be fun.

    • @bhabishyaneupane2073
      @bhabishyaneupane2073 4 роки тому

      @@tomhenry-datasciencewithr6047 I am excited and looking forward to it!
      This might be too much to ask for, but is there a way that I can get in touch with you? I am an aspiring data scientist, and I feel like there's a lot to learn at this point to get there. I would be really grateful if you could list out some things that I might wanna do to get there or mentor me throughout this journey. FYI: Right now, I am in my senior year in college with a good amount of research experience, and I like to think that I have a decent R experience as well. But, yeah, no pressures!

  • @susanbenson4941
    @susanbenson4941 2 роки тому

    Great video - short, clear and showing stumbles is helpful

  • @raould2590
    @raould2590 Рік тому

    Just found this gem! Thank you so much for this! Very useful!

  • @prison9865
    @prison9865 Рік тому

    This video proves, how unstable and shyyt R is to be honest. Crashes, errors etc. this is what im facing every single day at work. Good video though!

  • @haraldurkarlsson1147
    @haraldurkarlsson1147 2 роки тому

    Tom,
    Nice summary of interesting packages. I have been using janitor for a while and it is a life saver. tsibble looks interesting but I have heard that the padr package does similar things and might work better for filling in missing dates. The description of the package claims that it is better than lubridate in that regard. Another package I use quite a bit and love is flextable. It produces great looking output and works with the tidyverse. Finally, a collection of packages that works well with imports and finding your 'path' are rio, pacman and here. Thanks!

  • @aliramadan7425
    @aliramadan7425 3 роки тому

    I am lucky I came across your channel! Thank you.

  • @Yanceyray
    @Yanceyray 3 роки тому

    Thanks. Tidylog is great addition to my workload.

  • @JOHNSMITH-ve3rq
    @JOHNSMITH-ve3rq 3 роки тому +1

    Pls consider making some content on data.table. Package does not get enough love.

  • @maksim0933
    @maksim0933 3 роки тому

    A very nice user-friendly explanation!

  • @ridwanshittu617
    @ridwanshittu617 3 роки тому +1

    Hi Tom, thanks for the tips. I will like a complete tips on the tsibble package. Can this package be used to extract and plot only summer months for time-series datasets of 10 years?

  • @taiwankyh
    @taiwankyh 2 роки тому

    Thanks for sharing

  • @elkamel
    @elkamel 4 роки тому +3

    Thank you Tom for sharing these tips, really these packages are very useful but they did not take their right to fame.
    For the lubridate example, you could have used: make_date() function as follow:
    protests_simplified %
    select(country, startday, startmonth, startyear) %>%
    mutate(start_date = make_date(startyear, startmonth, startday))

  • @ahmed007Jaber
    @ahmed007Jaber 2 роки тому

    thank you for this. wonder if you could help me out with presentations. I would like to produce a customised presentation and apparently the best approach would be by using officer package, the challenge is splitting tables automatically based on allotted space and size of content. Got any idea how to do it???

  • @JOHNSMITH-ve3rq
    @JOHNSMITH-ve3rq 3 роки тому

    Data.table & quanteda would be great.

  • @paulosergioschlogl9550
    @paulosergioschlogl9550 2 роки тому +1

    Hi there...any way to work with multiple csv files at once to select only few columns and create a final csv?

    • @tomhenry-datasciencewithr6047
      @tomhenry-datasciencewithr6047  2 роки тому +2

      There are some different ways to do it - if you install `vroom` package, you can run something like this:
      library(tidyverse)
      library(vroom)
      library(janitor)
      library(fs)
      # also check out the `here` package for a good way to find the working directory
      vroom(dir_ls("*.csv")) %>%
      clean_names() %>%
      select(col_1, col_2, col_3, ......) %>%
      distinct()
      (assumes all your csv files have the same format).
      What do your csv files look like? If your columns have mixed types etc. or the files are very short, the procedure would need to be changed a bit to make sure all the types get picked up accurately.

    • @paulosergioschlogl9550
      @paulosergioschlogl9550 2 роки тому +1

      @@tomhenry-datasciencewithr6047 Hi there! thanks for the answer. Yeah! They have the same format and each column has values with the same types (doubles). And some are big like 65000 lines and 8 columns.

    • @tomhenry-datasciencewithr6047
      @tomhenry-datasciencewithr6047  2 роки тому +2

      @@paulosergioschlogl9550 Fantastic. In that case, the vroom() technique will work well. Or, if you want to use the standard tidyverse packages, you can get the list of files like so:
      library(tidyverse)
      file_list % vroom(delim = "|")
      # because vroom takes a list of files
      all_data_CLEANED %
      janitor::clean_names() %>%
      select(col_1, col_2, col_3, ...)
      all_data_CLEANED %>%
      write_delim("final_csv.csv", delim = "|", na = "")

    • @paulosergioschlogl9550
      @paulosergioschlogl9550 2 роки тому

      @@tomhenry-datasciencewithr6047 Thank you very much Tom. That is awesome.8)

  • @user-ve3no9pm9w
    @user-ve3no9pm9w 10 місяців тому

    make_date renders str_c a waste of time in lubridate walk through

  • @irfanullah235
    @irfanullah235 3 роки тому

    need your email sir

  •  Рік тому

    Excellent! tidytext looks very interesting.

  • @ahmed007Jaber
    @ahmed007Jaber 6 місяців тому +1

    Hi Tom,
    thank you for this. for using tidylog I need to add tidylog() at the end of code chain?
    somehow if I don't add tidylog() at the end I don't see any transformation steps

    • @tomhenry-datasciencewithr6047
      @tomhenry-datasciencewithr6047  6 місяців тому

      Did you run this line near the top of your code?
      library(tidylog)
      You may also need to use these options where appropriate, but normally putting library(tidylog) at the top of your code is fine:
      # turn logging-output on
      options("tidylog.display" = NULL)
      # turn logging-output off
      options("tidylog.display" = list())
      (more details on those here: rdrr.io/github/elbersb/tidylog/f/README.Rmd)
      One possibility is that another package you are using is overriding tidylog, but that is unlikely.

    • @ahmed007Jaber
      @ahmed007Jaber 6 місяців тому

      @@tomhenry-datasciencewithr6047 thank you Tom for the prompt reply
      Ll give it a go
      Any interesting packages, add-ins and/or tips using R? A video is due:)
      Appreciate the efforts, they really make a difference