Data modeling interview filters so many data engineers! How to model slowly-changing dimensions

Поділитися
Вставка

КОМЕНТАРІ • 22

  • @SnazzyKicks
    @SnazzyKicks 3 місяці тому +7

    This is great and to the point. But pls add more to this topic .. some challenges/real time examples. Would be of great help to lot of people in the DE community @Zach

  • @InsightsofAJ
    @InsightsofAJ Рік тому +4

    Great one. I would say that leaning normalization is a good start.

    • @Levy957
      @Levy957 6 місяців тому +1

      de-normalization too

  • @jonskaggs2891
    @jonskaggs2891 4 місяці тому +6

    I’ve always wanted to model data, but I’ve never had the right dimensions for it. 🤓

  • @vamau40
    @vamau40 Рік тому +2

    good one, keeps making plz

  • @denist80
    @denist80 Рік тому +6

    Hi Zach, thanks for sharing your thoughts. I just wanted to see if there was a mistyped last name in your CC (0:16) -- "... go read the Kimball book, some people say, go read the Eman book ..." Should this be -- "... go read the Kimball book, some people say, go read the Inmon book ..." (Bill Inmon)

    • @EcZachly_
      @EcZachly_  Рік тому +2

      You’re totally right! Nice catch!

    • @Milhouse77BS
      @Milhouse77BS 4 місяці тому

      Inmon is unreadable. Makes Kimball look like Shakespeare.

  • @srinubathina4495
    @srinubathina4495 3 місяці тому

    I want to learn data modeling from you do you offer any course to do that because I want to learn in depth
    knowledge on this concept

  • @Paul-yq5ym
    @Paul-yq5ym Рік тому +2

    Opposite for me. Have done OLTP and OLAP data modeling for years, but can't get past the deep questions on latest tools of the day, PySpark, Azure Data Factory, etc.

    • @workmode2073
      @workmode2073 Рік тому

      why is azure data factory compared to a stack of spark+airflow?

  • @patparillo
    @patparillo Рік тому +2

    I know you mentioned learning by doing which is my preferred approach as well however do you have any resources on learning about scd type 1, type 2 etc?

    • @ZachRenwickData
      @ZachRenwickData 9 місяців тому

      kimballs the data warehouse toolkit has in depth explanations of slowly changing dimensions (all types)

  • @wtfzalgo
    @wtfzalgo 3 місяці тому +1

    I like the metaphorical explanation. Why don't you write your own platform agnostic data modeling book?

  • @maleldil1
    @maleldil1 3 місяці тому +2

    Why have the end date be in the future instead of just null?

    • @EcZachly_
      @EcZachly_  3 місяці тому +5

      BETWEEN syntax doesn’t work if end date is NULL

  • @workmode2073
    @workmode2073 Рік тому +2

    What about modeling in MPPs like Redshift? Traditional dimensions/facts does not match the archi of MPPs

    • @EcZachly_
      @EcZachly_  Рік тому

      Those are more denormalized, you’re right!

    • @workmode2073
      @workmode2073 Рік тому

      @@EcZachly_ most companies are using MPPs these days just from the sheer speed/efficiency to cost ratio; then why are companies still testing facts/dimension/PK-FK based data modeling knowledge?

  • @TheHermitProcess
    @TheHermitProcess 4 місяці тому +2

    Std or SCD.😂😂😂😂😂 thanks!

    • @techgraph1233
      @techgraph1233 3 місяці тому

      I was going to comment same 😝😂😂😂😂😂😂.
      He said that’s how you kind of get the STD. Lol

  • @filbertejess8711
    @filbertejess8711 Рік тому +1

    *promosm*