This is great and to the point. But pls add more to this topic .. some challenges/real time examples. Would be of great help to lot of people in the DE community @Zach
Hi Zach, thanks for sharing your thoughts. I just wanted to see if there was a mistyped last name in your CC (0:16) -- "... go read the Kimball book, some people say, go read the Eman book ..." Should this be -- "... go read the Kimball book, some people say, go read the Inmon book ..." (Bill Inmon)
Opposite for me. Have done OLTP and OLAP data modeling for years, but can't get past the deep questions on latest tools of the day, PySpark, Azure Data Factory, etc.
I know you mentioned learning by doing which is my preferred approach as well however do you have any resources on learning about scd type 1, type 2 etc?
@@EcZachly_ most companies are using MPPs these days just from the sheer speed/efficiency to cost ratio; then why are companies still testing facts/dimension/PK-FK based data modeling knowledge?
This is great and to the point. But pls add more to this topic .. some challenges/real time examples. Would be of great help to lot of people in the DE community @Zach
Great one. I would say that leaning normalization is a good start.
de-normalization too
I’ve always wanted to model data, but I’ve never had the right dimensions for it. 🤓
good one, keeps making plz
Hi Zach, thanks for sharing your thoughts. I just wanted to see if there was a mistyped last name in your CC (0:16) -- "... go read the Kimball book, some people say, go read the Eman book ..." Should this be -- "... go read the Kimball book, some people say, go read the Inmon book ..." (Bill Inmon)
You’re totally right! Nice catch!
Inmon is unreadable. Makes Kimball look like Shakespeare.
I want to learn data modeling from you do you offer any course to do that because I want to learn in depth
knowledge on this concept
Opposite for me. Have done OLTP and OLAP data modeling for years, but can't get past the deep questions on latest tools of the day, PySpark, Azure Data Factory, etc.
why is azure data factory compared to a stack of spark+airflow?
I know you mentioned learning by doing which is my preferred approach as well however do you have any resources on learning about scd type 1, type 2 etc?
kimballs the data warehouse toolkit has in depth explanations of slowly changing dimensions (all types)
I like the metaphorical explanation. Why don't you write your own platform agnostic data modeling book?
Why have the end date be in the future instead of just null?
BETWEEN syntax doesn’t work if end date is NULL
What about modeling in MPPs like Redshift? Traditional dimensions/facts does not match the archi of MPPs
Those are more denormalized, you’re right!
@@EcZachly_ most companies are using MPPs these days just from the sheer speed/efficiency to cost ratio; then why are companies still testing facts/dimension/PK-FK based data modeling knowledge?
Std or SCD.😂😂😂😂😂 thanks!
I was going to comment same 😝😂😂😂😂😂😂.
He said that’s how you kind of get the STD. Lol
*promosm*