Це відео не доступне.
Перепрошуємо.

Join Tables with {dplyr}

Поділитися
Вставка
  • Опубліковано 6 сер 2024
  • If you know how to tidy up data within one table, you’re already a skilled data scientist! However, as data continues to grow exponentially, taking your skills to the next level involves mastering the art of working with multiple tables within a database, typically done using SQL. In this post, we’ll learn three essential techniques using {dplyr} that will allow you to handle databases with ease: merging multiple tables, reducing redundancy through table joins, and effortlessly modifying values within the resulting table.
    If you only want the code (or want to support me), consider join the channel (join button below any of the videos), because I provide the code upon members requests.
    Enjoy! 🥳

КОМЕНТАРІ • 18

  • @abdulbouraa4529
    @abdulbouraa4529 Рік тому

    Thank you so much !

  • @WilOspinoC
    @WilOspinoC Рік тому +1

    I freaking love this!!! Honestly old me used SQL scripts into R to avoid complexity, but now I can see dplyr makes it even simpleR. Thanks Yury. Greetings from Colombia

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Рік тому +1

      Glad you liked it! Yeah, since I know dplyr, I stopped using SQL too

  • @muhammedhadedy4570
    @muhammedhadedy4570 Рік тому +1

    I love all your work, and I'm a big fan of your channel. Excellent tutorial as usual.
    Please, keep up the great work and thanks for everything.
    ❤❤❤❤❤

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Рік тому +1

      Thank you very much, Muhammed! That means a lot to me! I will produce more content.

  • @chacmool2581
    @chacmool2581 4 місяці тому

    I too think that 'dplyr' walks all over SQL. When I've told databases SQL guys who don't know R, they look at me like I'm crazy. The only limitation is the memory limitation of R.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  3 місяці тому

      Exactly! I new SQL before R. But since I learned all I need for SQL in dplyr, I never touched SQL again ;)

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  3 місяці тому

      yeah, the memory limitation is a problem, I got a new PC with tons of RAM and can't complain so far

  • @harryporter9965
    @harryporter9965 Рік тому

    Hi Sir, (Out of context).
    How can I carry out a cross tabulation by setting some exclusion of values. For instance, variables under observation are x and y where x variable has zero values and should be excluded during plotting of the table.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Рік тому

      I don't think so, but i you need to exclude them anyway, why not just use dplyr to exclude them from x first, and then execute a join on x-table?

  • @richardbennett4365
    @richardbennett4365 9 місяців тому

    Duckdb with Ibis.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  9 місяців тому +1

      It’s similar, but it’s python, right?

    • @richardbennett4365
      @richardbennett4365 9 місяців тому

      Yes.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  8 місяців тому +1

      as I needed to use python, I found a couple of R similar packages our of laziness, but it worked well :)

    • @richardbennett4365
      @richardbennett4365 8 місяців тому

      ​@@yuzaR-Data-Scienceby the way, I like your presentations. The animations and graphics are top-notch, clear, and easy to understand.

    • @richardbennett4365
      @richardbennett4365 8 місяців тому

      ​@@yuzaR-Data-Scienceone more thing, I find Miniforge3 or micromamba, not Anaconda, to be the best way to manage python projects with their various packages and dependencies. Anaconda just installs way too much stuff.
      I'm also finding that LFortran can be installed as a kernel in Jupyter, so that's nice, too. And yes, one more thing, DataSpell can manage both R and python.