Leverage Apache Hudi incremental query to process new & updated data
Вставка
- Опубліковано 16 січ 2023
- Credit: / @soumilshah
Follow us on Twitter: / apachehudi
Star the GitHub project: www.github.com/apache/hudi
Join the community: hudi.apache.org/community/syncs
Hudi Labs uncovers 1 service or tool in hudi that you can learn more in-depth. In this lab, Soumil walks you through how to leverage Hudi for an incremental query to process new & updated data.
Code sample: github.com/soumilshah1995/hud...
To find all labs: github.com/soumilshah1995/hud...
Gitbook: coming soon
I'm a little lost on precisely why there are three records in the table, but when we finally query it, we see only two records.
My presumption is that; Hudi stores versioned data and that we only fetch data for the latest version, the same way it's done in databricks, is that correct?
the screen reader resolution is very low