duckplyr: Tight Integration of duckdb with R and the tidyverse - posit::conf(2023)

  • Published Aug 6, 2024
  • Presented by Kirill Müller
    The duckplyr R package combines the convenience of dplyr with the performance of DuckDB. Better than dbplyr: Data frame in, data frame out, fully compatible with dplyr.
    duckdb is a new high-performance analytical database system that works great with R, Python, and other host systems. dplyr is the grammar of data manipulation in the tidyverse, tightly integrated with R, but it works best for small or medium-sized data. duckdb, by contrast, has been designed with large data in mind, but currently you need to formulate your queries in SQL.
    The new duckplyr package offers the best of both worlds. It transforms a dplyr pipe into a query object that duckdb can execute, using an optimized query plan. It is better than dbplyr because the interface is "data frames in, data frames out", and no intermediate SQL code is generated.
    The talk presents our results, then a bit of the mechanics, and finally an outlook for this ambitious project.
    Materials: github.com/duckdblabs/duckplyr/
    Presented at Posit Conference, Sept 19-20, 2023.
    Learn more at posit.co/conference.
    --------------------------
    Talk Track: Databases for data science with duckdb and dbt.
    Session Code: TALK-1100
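
The description says a dplyr pipe is turned into a query object that is executed later, with data frames going in and coming out and no SQL text in between. The sketch below is only a toy illustration of that general idea, written in Python rather than R. It is not duckplyr's implementation and performs no query optimization; `LazyPipeline`, its methods, and the `flights` sample rows are all hypothetical names invented for this example.

```python
from dataclasses import dataclass
from typing import Any, Callable

Row = dict[str, Any]


@dataclass(frozen=True)
class LazyPipeline:
    """Hypothetical stand-in for a query object: records steps, runs nothing."""

    source: list[Row]
    steps: tuple[tuple[str, Callable], ...] = ()

    def filter(self, predicate: Callable[[Row], bool]) -> "LazyPipeline":
        # Record the step and return a new pipeline; no rows are touched yet.
        return LazyPipeline(self.source, self.steps + (("filter", predicate),))

    def mutate(self, **new_columns: Callable[[Row], Any]) -> "LazyPipeline":
        def add_columns(row: Row) -> Row:
            # Build a new row so the input data is never modified in place.
            return {**row, **{name: fn(row) for name, fn in new_columns.items()}}

        return LazyPipeline(self.source, self.steps + (("mutate", add_columns),))

    def plan(self) -> list[str]:
        # The recorded steps are plain objects, not SQL text.
        return [name for name, _ in self.steps]

    def collect(self) -> list[Row]:
        # Only here are the recorded steps applied, in order.
        rows = list(self.source)
        for name, fn in self.steps:
            if name == "filter":
                rows = [row for row in rows if fn(row)]
            else:
                rows = [fn(row) for row in rows]
        return rows


# Made-up sample data, for illustration only.
flights = [
    {"carrier": "AA", "distance": 1000, "air_time": 120},
    {"carrier": "UA", "distance": 200, "air_time": 60},
    {"carrier": "AA", "distance": 2500, "air_time": 250},
]

pipeline = (
    LazyPipeline(flights)
    .filter(lambda r: r["distance"] > 500)
    .mutate(speed=lambda r: r["distance"] * 60 / r["air_time"])
)

result = pipeline.collect()
```

Building `pipeline` does no work on the rows; `collect()` is the point where the recorded steps run, and the result is the same kind of plain data that went in. A real engine would hand the recorded plan to its optimizer at that point instead of looping over rows.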

COMMENTS • 5

  • @MrThackeray 4 hours ago

    duckdb is awesome. Very happy to see this development

  • @aswanijehangeer 5 months ago

    Great Presentation, gonna try duckplyr!

  • @mubangansofu7469 7 months ago +1

    Excellent development, I must say

  • @My-NaMeS_jEfF 7 months ago +2

    It’s got a drake meme I’m sold

  • @haraldurkarlsson1147 7 months ago +2

    Can duckdb work with shape files (maps)?