Leverage Apache Hudi incremental query to process new & updated data

Поділитися
Вставка
  • Опубліковано 16 січ 2023
  • Credit: / @soumilshah
    Follow us on Twitter: / apachehudi
    Star the GitHub project: www.github.com/apache/hudi
    Join the community: hudi.apache.org/community/syncs
    Hudi Labs uncovers 1 service or tool in hudi that you can learn more in-depth. In this lab, Soumil walks you through how to leverage Hudi for an incremental query to process new & updated data.
    Code sample: github.com/soumilshah1995/hud...
    To find all labs: github.com/soumilshah1995/hud...
    Gitbook: coming soon

КОМЕНТАРІ • 2

  • @1ma4ighter
    @1ma4ighter 8 місяців тому

    I'm a little lost on precisely why there are three records in the table, but when we finally query it, we see only two records.
    My presumption is that; Hudi stores versioned data and that we only fetch data for the latest version, the same way it's done in databricks, is that correct?

  • @nikhilsimhar
    @nikhilsimhar 25 днів тому

    the screen reader resolution is very low