Reading Data from Hudi INC & Joining with Delta Tables using HudiStreamer & SQL-Based Transformer

Поділитися
Вставка
  • Опубліковано 22 лип 2024
  • Reading Data from Hudi Tables Incrementally, Joining with Delta Tables using HudiStreamer and SQL-Based Transformer to Build Denormalized Table: An Interior Peek into Hued Iceberg and Delta Lakes (XTable)
    Exercise Files
    github.com/soumilshah1995/Hud...
  • Наука та технологія

КОМЕНТАРІ • 4

  • @pavan792reddy
    @pavan792reddy 3 місяці тому

    Hi soumil,
    I am not able to access the git lab for the exercise looks like the link wasn't found

  • @ashraf7172
    @ashraf7172 3 місяці тому

    I fetched data from
    Postgres as CDC pipeline,2-3 tables
    Then with help of deltastreamer I creste a universal data lakehouse this will become my bronze layer.
    I had a question now what approach should I use? For creating a silver level data i.e joined and transformed table
    A - fetching data again from postgres and joining at the time of hui streamer pyspark job
    Or B fetching data from the universal datalakehouse bronze layer and then creating a silver level lakehouse.
    Or any other method
    I want it like whenever I update data in postgres all my data in bronze and silver lvl both gets updated ?

    • @SoumilShah
      @SoumilShah  3 місяці тому

      Ofcourse bronze why would you want to fetch from postgres again it’s already bronze lagers
      Read data from Hudi and join them build your silver

    • @ashraf7172
      @ashraf7172 3 місяці тому

      Great,
      I was confused
      That when I will create silver from my hudi bronze tables
      If I make changes in postgres will it reflect in my silver level or only in bronze
      I got clarified