How Would You Model This Data? (Example)

What tools should you know as a Data Engineer?

What is Data Pipeline? | Why Is It So Popular?

#JasonDeruloTV // Funny #GotPermissionToPost From @SofiManassyan #SlowLow

"Бажано відбити посадку без втрат": військовий розповів, як загибель побратимів впливає на психіку

"ХИТРЕЦ": Трамп РОЗЛЮТИВ Скабєєву / Оля ЛИЄ ЯДОМ #shorts

The Missing Piece in Many Data Pipelines

Kahan Data Solutions

Переглядів 6 266

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 21 гру 2024

КОМЕНТАРІ • 20

@KahanDataSolutions 5 місяців тому
Looking for help with your team's data strategy? → www.kahandatasolutions.com
Looking to improve your data engineering skillset?→ bit.ly/more-kds
@sunilbabu588 4 місяці тому ⁺³
Thanks man. Hope this channel blows up in the days to come.
@Billbillbillhahagdvdve 4 місяці тому
Your data modelling playlist is fantastic !
@SQLGuyLLC 3 місяці тому ⁺¹
What I like - is your English and punctuation, but not those people from India who thinks that their English is native
@bertjanvdberg 5 місяців тому ⁺²
Nice! Question: Do you also use views in your warehouse and mart layers? I've been at companies where the marts were basically views based on views based on views times 10 which was terrible for the performance of getting the data.
@ramtadam1469 5 місяців тому ⁺²
We always use tables as marts and then sometimes on top build views that do things with the materialized marts data.
@iARAVIND666 3 місяці тому
Excellent series! Thank you :)
@andresarmua 5 місяців тому
Nice! I use a staging layer as a view and then 4 more layers for the pipeline until I get to the mart. I usually alternate between views and materialized tables, but I am not quite sure how to know the optimal way to decide between tables and views at each time. How do you compare performance, storage and other practical factors?
@thedavidabides 5 місяців тому ⁺²
Nice work! Where should the staging layer come when using a bronze, silver, gold medallion structure ?
@muhammadbadar6089 5 місяців тому ⁺²
from my understanding you would use your bronze layer as a staging layer pulling from all source systems
@personalbranddata 5 місяців тому ⁺¹
It's the silver layer. Bronze = raw data in this video. Silver = "staging"/cleaned data in this video. Gold = Warehouse in this video. I don't like that he's using the term "staging" to refer to cleaned data because in traditional data warehousing a staging table typically refers to uncleaned data straight after you've loaded it from a source system and the cleaning happens later.
@ArmandsPutnis 5 місяців тому ⁺³
it does not really matter how you call them if you have agreed on the purpose. Bronze layer can be raw_source or it can be staging.
personally i like to keep the source out of the way and use bronze for staging - cleaning/transforming.
silver for joining multiple bronze tables, what i know can be reused for multiple use cases in a gold layer.
gold layer for the final solution/consumption joining some silver and bronze tables.
@gatorpika 5 місяців тому
@@ArmandsPutnis yeah, this. Bronze, silver and gold is an abstraction to help you think about your structure, not something with set rules you have to follow dogmatically. Figure out what layers you need to solve your problems and then just structure your layers appropriately. Staging serves a purpose to help you shift the transforms left so changes are easier down the road given they will propagate through all your downstream transforms. Then transform on top of that assuming the stage takes care of most of the cleaning/formatting for you. If your management makes you pick a metal, I suggest the titanium layer.
@sarfarazanjum007 Місяць тому ⁺¹
Thanks a lot. Can you please take the real world project and covert into data model.
@johnpower1458 5 місяців тому
Do you truncate the data each batch pipeline run on staging and capture the cleaned data in snapshots? If not, how do you avoid duplicates down stream if you’re using say SCD Type 2?
@williamchurch711 5 місяців тому
The staging layer would be equivalent to a landing zone?
@senarl 5 місяців тому
Migh be wrong but I take that the staging layer would be a bronze layer in the Medallion architecture, so we would have landing with raw data, bronze with cleaned raw data, silver with any new columns or any enhancement to the data and Gold with the joins and business logic. But thats just how I use at work and it can be changed to fit your needs
@KahanDataSolutions Місяць тому ⁺¹
Here's a new video I made about the Landing Zone - ua-cam.com/video/TaSIdUX4YXk/v-deo.htmlsi=DXOWjummSZWHQ-un
@williamchurch711 Місяць тому
@@KahanDataSolutions thank you
@Milhouse77BS 5 місяців тому
Stage All the Things

Наступне

Автоматичне відтворення

How Would You Model This Data? (Example)

How Would You Model This Data? (Example)

What tools should you know as a Data Engineer?

What tools should you know as a Data Engineer?

What is Data Pipeline? | Why Is It So Popular?

What is Data Pipeline? | Why Is It So Popular?

#JasonDeruloTV // Funny #GotPermissionToPost From @SofiManassyan #SlowLow

#JasonDeruloTV // Funny #GotPermissionToPost From @SofiManassyan #SlowLow

"Бажано відбити посадку без втрат": військовий розповів, як загибель побратимів впливає на психіку

"Бажано відбити посадку без втрат": військовий розповів, як загибель побратимів впливає на психіку

"ХИТРЕЦ": Трамп РОЗЛЮТИВ Скабєєву / Оля ЛИЄ ЯДОМ #shorts

"ХИТРЕЦ": Трамп РОЗЛЮТИВ Скабєєву / Оля ЛИЄ ЯДОМ #shorts

Как найти себе жену? Больше - тут @stas.yornik.shorts

Как найти себе жену? Больше - тут @stas.yornik.shorts

How to Create a Data Modeling Pipeline (3 Layer Approach)

How to Create a Data Modeling Pipeline (3 Layer Approach)

Memory Arenas - Explained Simply

Memory Arenas - Explained Simply

Extracting Data From APIs As Data Engineers - The Basics And Challenges You'll Run Into

Extracting Data From APIs As Data Engineers - The Basics And Challenges You'll Run Into

Data Modeling in the Modern Data Stack

Data Modeling in the Modern Data Stack

Modern Data Engineering Workflows, Explained

Modern Data Engineering Workflows, Explained

Common Table Expressions vs Subqueries vs Views vs Temp Tables for data engineers

Common Table Expressions vs Subqueries vs Views vs Temp Tables for data engineers

Data Architecture 101: The Lambda Strategy

Data Architecture 101: The Lambda Strategy

APIs Explained (in 4 Minutes)

APIs Explained (in 4 Minutes)

Why I Quit Copilot | Prime Reacts

Why I Quit Copilot | Prime Reacts

МІША ЛЕБІГА і АНДРІЙ ЛУЗАН в СРАЧІ #32

МІША ЛЕБІГА і АНДРІЙ ЛУЗАН в СРАЧІ #32

ПРОВЕРКА НА ВШИВОСТЬ (смешное видео, юмор, поржать, приколы)

ПРОВЕРКА НА ВШИВОСТЬ (смешное видео, юмор, поржать, приколы)

Удержаться на воде?? 🌊 #симбочкапимпочка #симбочка #симба

Удержаться на воде?? 🌊 #симбочкапимпочка #симбочка #симба

Unexpected way to open the new Audi A6 e-tron Frunk 😮! #shorts

Unexpected way to open the new Audi A6 e-tron Frunk 😮! #shorts

ПРАНК НАД БОЯРСКИМ | КОНФЛИКТ НА ДОРОГЕ

ПРАНК НАД БОЯРСКИМ | КОНФЛИКТ НА ДОРОГЕ

Разобрался голыми руками 😎 #start #кино #фильм #сериал #молотведьм #полиция #пацаны

Разобрался голыми руками 😎 #start #кино #фильм #сериал #молотведьм #полиция #пацаны

Рабочий способ бросить вредную привычку

Рабочий способ бросить вредную привычку

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments