How to Clean Up Messy Excel files in R | 7 Easy Strategies

  • Published 9 Jan 2025

COMMENTS • 26

  • @rappa753
    @rappa753  6 months ago

    If you enjoyed this video, I am certain that you're going to love my Data Cleaning Master Class. It shows you even more advanced tricks for Excel file and text cleaning. You can check it out at data-cleaning.albert-rapp.de/

  • @rcepka
    @rcepka 1 year ago +5

    Great video, many thanks. If I may add a suggestion: for us less advanced R users :), I would like to see the partial tasks/steps you are performing explained in more detail. For example, the regex.

    • @rappa753
      @rappa753  1 year ago +1

      Hi Robert, thanks for reaching out. It is indeed hard to find a balance between what to explain and what to leave to the user 🙈 As for your question, the call you mentioned is constructing a regular expression using the glue function. A great guide on regular expressions is r4ds.hadley.nz/regexps.html
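To illustrate the idea mentioned in this reply (assembling a regular expression with glue), here is a minimal sketch. The `months` vector and the pattern are hypothetical and not taken from the video; it only assumes the {glue} package is installed.

```r
library(glue)

# Hypothetical alternatives to match; not taken from the video.
months <- c("Jan", "Feb", "Mar")

# Join the alternatives with "|" so the regex matches any one of them.
alternatives <- paste0(months, collapse = "|")

# glue() interpolates {alternatives} into the surrounding pattern.
pattern <- glue("^({alternatives})_[0-9]+$")

# Base R's grepl() tests each string against the assembled pattern.
matches <- grepl(pattern, c("Jan_2023", "Total", "Mar_7"))
```

Here `paste0(..., collapse = "|")` does the job of collapsing a vector into one string, which `glue()` on its own does not do; `glue_collapse()` would be the {glue} equivalent.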

  • @dasrotrad
    @dasrotrad 1 year ago +2

    Very nice. You sure packed a lot into this 7:44 min:secs. Thank you.

    • @rappa753
      @rappa753  1 year ago

      Glad that you like it, Robert 😊

  •  1 year ago +5

    Date columns in Excel are nightmare fuel, luckily we have janitor::excel_numeric_to_date(). Great video!

    • @rappa753
      @rappa753  1 year ago

      It is indeed 😀 Janitor's convenience functions are great for fixing these types of problems.
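For readers who have not used the function mentioned in this thread, here is a small sketch of `janitor::excel_numeric_to_date()`. The serial numbers are made-up values standing in for a date column that was read from Excel as plain numbers.

```r
library(janitor)

# Hypothetical serial numbers, as they might appear when an Excel date
# column is imported as numeric.
serials <- c(44927, 44928)

# Convert with the default "modern" Excel date system.
dates <- excel_numeric_to_date(serials)
```

With the default date system, 44927 corresponds to 2023-01-01. Files written by very old Mac versions of Excel use a different origin, which the `date_system` argument covers.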

  • @muhammedhadedy4570
    @muhammedhadedy4570 11 months ago

    Excellent tutorial. I literally watch it every day so that I can learn it very well. Would you please make more of these cleaning data tutorials?

    • @rappa753
      @rappa753  11 months ago +1

      Happy to have a fan 😊 I'm planning on making a data cleaning course but that's still a bit in the future 😅

    • @muhammedhadedy4570
      @muhammedhadedy4570 11 months ago

      @rappa753
      Excellent. Can't wait to join.

  • @haraldurkarlsson1147
    @haraldurkarlsson1147 10 months ago

    Nice. I was a bit surprised to see paste0 inside a glue function. I thought glue was used instead of paste and was simpler. It is also worth noting, as you probably know, that there are three packages specifically aimed at dealing with Excel files (even messier than the ones you did, for instance with subheaders at different levels throughout the document). These are tidyxl, unheadr, and unpivotr. The tidyxl package actually works on the "cellular" level, that is, it allows you to deal with the contents of the Excel cells and manipulate them. Tidyxl is powerful but takes experience to work with. It would be nice to see a video on tidyxl. Thanks.
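As a rough illustration of the cell-level approach described in this comment, the sketch below reads a workbook with `tidyxl::xlsx_cells()`, which returns one row per cell. It assumes the example workbook bundled with {tidyxl} is available at `extdata/examples.xlsx`; substitute any `.xlsx` path of your own.

```r
library(tidyxl)

# Example workbook that ships with tidyxl (assumed to be present in the
# installed package); replace with your own file path if needed.
path <- system.file("extdata/examples.xlsx", package = "tidyxl")

# Import every cell as its own row, with its position and data type.
cells <- xlsx_cells(path)

# Inspect where each cell lives and what kind of value it holds.
head(cells[, c("sheet", "address", "row", "col", "data_type")])
```

Because every cell keeps its sheet, address, row, and column, irregular layouts such as scattered subheaders can be filtered and rearranged with ordinary data-frame tools.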

  • @Sensemakingdata
    @Sensemakingdata 1 year ago +1

    Great video, well done!

  • @ahmed007Jaber
    @ahmed007Jaber 1 year ago +1

    Excellent one. Thank you, Albert.

    • @rappa753
      @rappa753  1 year ago

      Glad that you like it 😊

  • @Aaqib..
    @Aaqib.. 1 year ago +1

    So much information packed in, you're the best.

    • @rappa753
      @rappa753  1 year ago

      Thank you, that's very kind of you 😊

  • @hassanhijazi4757
    @hassanhijazi4757 1 year ago +1

    Thank you very much. Very informative.

    • @rappa753
      @rappa753  1 year ago +1

      Thank you, Hassan. Glad that you enjoy my video!

  • @MKhan-zo8xo
    @MKhan-zo8xo 11 months ago

    YES! WOW! I didn't know I needed this video

    • @rappa753
      @rappa753  11 months ago +1

      Nice, looks like you got a lot out of this video. Thanks for sharing that with me 🤗

  • @andrea-mj9ce
    @andrea-mj9ce 1 year ago +2

    A proper way is to use the `tidyxl` package

    • @rappa753
      @rappa753  1 year ago +1

      Uhh, {tidyxl} sounds promising. It sounds a bit like an add-on to {readxl}. Thanks for the hint 🙂

  • @johnkaruitha2527
    @johnkaruitha2527 1 year ago +1

    Great job Albert. But next time you use setwd() I will look for you and burn your computer (Jenny Bryan) (projects are better).

    • @rappa753
      @rappa753  1 year ago

      😀 I am on board with not using setwd() for static file paths. I think that's what Jenny meant 🤔 I do love projects but still sometimes navigating inside of them is unavoidable. That's why I use here() from the {here} package to make file paths relative.
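A minimal sketch of the `here()` approach mentioned in this reply, assuming the {here} package is installed. The folder and file names are hypothetical.

```r
library(here)

# Hypothetical file location inside the project: <project root>/data/messy.xlsx
# here() joins the pieces onto the project root it detected when the
# package was loaded, so no setwd() call is needed.
path <- here("data", "messy.xlsx")

# The resulting path can then be handed to a reader such as
# readxl::read_excel(path).
```

If no project root can be detected, {here} falls back to the working directory that was active when it was loaded, so the sketch still produces a usable path outside of a project.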