Dynamic Partition Pruning in Apache Spark

Processing 25GB of data in Spark | How many Executors and how much Memory per Executor is required.

Systems Design in an Hour

Unexpected way to open the new Audi A6 e-tron Frunk 😮! #shorts

Заява ЗАЛУЖНОГО ШОКУВАЛА увесь СВІТ😱ТРЕТЯ СВІТОВА ВІЙНА ПОЧАЛАСЬ?

Перший наступ КНДРівців

Dynamic Partition Pruning: How It Works (And When It Doesn’t)

Afaque Ahmad

Переглядів 5 418

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 5 січ 2025

КОМЕНТАРІ • 23

@coolraviraj24 8 днів тому
I have completed this whole playlist, full of knowledge and it gave me enough confidence to handle questions.....
I saw your bucketing video and it was awesome then i decided to complete this whole playlist and here I am at the end.
I really learnt alot. please make more videos as you are our senior DE.😅
@afaqueahmad7117 8 днів тому ⁺¹
Hey @coolraviraj24, I'm elated to know this and super glad that the playlist has helped strengthen your concepts. Appreciate you putting out this note here. Please also share it w/ your friends and colleagues :)
@Wonderscope1 Рік тому
Thanks for great video; you make these concept so simple. Thanks
@gopinathdhanasekar3286 7 місяців тому ⁺¹
you deserve more subscribers !! thanks for explaining the concepts
@afaqueahmad7117 7 місяців тому
Those words mean a lot, thank you @gopinathdhanasekar328! If you wouldn't mind, a request to kindly share with your friends and colleagues, I would greatly appreciate your help in spreading the word
@choubeysumit246 4 місяці тому
Great tutorials 🙏, please create more videos on spark from beginners point of view
@iamkiri_ Рік тому
Loving ur videos Bro !
@VenkatakrishnaGangavarapu Рік тому ⁺¹
thanks for another indeapth video yes we need how spark uses it's memory executors and on what basis it split data to multiple executors
@afaqueahmad7117 Рік тому
Resource level optimisation videos upcoming in the next few weeks, stay tuned! :)
@iamexplorer6052 Рік тому
Thank you sharing , new thing I learned from you
@roksig3823 Рік тому ⁺¹
Can you make a video on how to decide driver/executor memory size, no of executor based file size like 100 GB in Spark ?
@afaqueahmad7117 Рік тому ⁺¹
Resource level optimisation videos upcoming in the next few weeks, stay tuned! :)
@animeshrajjha 2 місяці тому
this is awesome, what tools do you use for drawing , recording and presenting?
@afaqueahmad7117 2 місяці тому ⁺¹
Thanks @animeshrajjha, Ecamm Live for recording, Notes on iPad for drawing and Notion for writing :)
@plearns4551 11 місяців тому
Hello, I think one correction, I think even if the dimension table(songs) don't have filter condition on release date still DPP would work right?? as it will forward the release date selected after the filter, irrespective of the filter condition. eg even if we apply filter on songID in songs table is there and after filter few record are selected in those records whatever the release dates are it will be forwarded.
@anandchandrashekhar2933 7 місяців тому
Thanks Afaque. Terminology wise, Is this the same as Filter pushdown which you explained during the Query Plan video?
@afaqueahmad7117 7 місяців тому
Hey @anandchandrashekhar2933 Appreciate it :)
On the question - DPP is different from "filter pushdown", although it uses filter pushdown to prune the large dataset based on the filters from the smaller dataset. It's effective when you have a large and a small dataset (which can be broadcasted) and want to use the small dataset to filter records from the large dataset at scan-time
@sathyamoorthy2362 7 місяців тому
All videos are great and nicely explained , video clarity is bad even for 4k.
@afaqueahmad7117 7 місяців тому
Thanks, @sathyamoorthy2362, for the kind words. On the video quality, I was trying out a new tool and it didn't work out, but hope the other ones are good and you like them :)
@rohitshingare5352 11 місяців тому
What if both datasets are too big , so in that case broadcast exchange is still happens?
@afaqueahmad7117 7 місяців тому
Hey @rohitshingare5352, Good question. DPP generally works best when one table is large and the other table is small enough to be broadcasted. The most significant reason for this if the two tables are large, the filters being moved will also be large (in the worst case) and this filter propagation mechanism over the network is the biggest bottleneck
@mohitupadhayay1439 5 місяців тому
Dead gorgeous stuff.
@afaqueahmad7117 4 місяці тому
Appreciate it man :)

Наступне

Автоматичне відтворення

Dynamic Partition Pruning in Apache Spark

Dynamic Partition Pruning in Apache Spark

Processing 25GB of data in Spark | How many Executors and how much Memory per Executor is required.

Processing 25GB of data in Spark | How many Executors and how much Memory per Executor is required.

Systems Design in an Hour

Systems Design in an Hour

Unexpected way to open the new Audi A6 e-tron Frunk 😮! #shorts

Unexpected way to open the new Audi A6 e-tron Frunk 😮! #shorts

Заява ЗАЛУЖНОГО ШОКУВАЛА увесь СВІТ😱ТРЕТЯ СВІТОВА ВІЙНА ПОЧАЛАСЬ?

Заява ЗАЛУЖНОГО ШОКУВАЛА увесь СВІТ😱ТРЕТЯ СВІТОВА ВІЙНА ПОЧАЛАСЬ?

Перший наступ КНДРівців

Перший наступ КНДРівців

Что будет если украсть в магазине шоколадку 🍫

Что будет если украсть в магазине шоколадку 🍫

Bucketing - The One Spark Optimization You're Not Doing

Bucketing - The One Spark Optimization You're Not Doing

Database Sharding and Partitioning

Database Sharding and Partitioning

How Partitioning Works In Apache Spark?

How Partitioning Works In Apache Spark?

Transformers (how LLMs work) explained visually | DL5

Transformers (how LLMs work) explained visually | DL5

24 Fix Skewness and Spillage with Salting in Spark | Salting Technique | How to identify Skewness

24 Fix Skewness and Spillage with Salting in Spark | Salting Technique | How to identify Skewness

Broadcast Joins & AQE (Adaptive Query Execution)

Broadcast Joins & AQE (Adaptive Query Execution)

Understanding Databricks & Apache Spark Performance Tuning: Lesson 01 - Spark Architecture

Understanding Databricks & Apache Spark Performance Tuning: Lesson 01 - Spark Architecture

7+ million Postgres tables | Kailash Nadh | IndiaFOSS 2024 | FOSS United

7+ million Postgres tables | Kailash Nadh | IndiaFOSS 2024 | FOSS United

How to Read Spark DAGs | Rock the JVM

How to Read Spark DAGs | Rock the JVM

ГРАВИТАЦИЯ! ВЫЖИВАНИЕ на ЛЕТАЮЩЕМ ОСТРОВЕ(DDprod.) в РАСТ/RUST

ГРАВИТАЦИЯ! ВЫЖИВАНИЕ на ЛЕТАЮЩЕМ ОСТРОВЕ(DDprod.) в РАСТ/RUST

Женская супер-сила 😂 #ComedyClub #КамедиКлаб #харламов #тнт4 #тнт #демискарибидис #богатство #кравец

Женская супер-сила 😂 #ComedyClub #КамедиКлаб #харламов #тнт4 #тнт #демискарибидис #богатство #кравец

Ветеран війни отримав гроші на житло

Ветеран війни отримав гроші на житло

Cat mode and a glass of water #family #humor #fun

Cat mode and a glass of water #family #humor #fun

Lp. Сердце Вселенной #60 РОЖДЕНИЕ ЛОЛОЛОШКИ [Финал] • Майнкрафт

Lp. Сердце Вселенной #60 РОЖДЕНИЕ ЛОЛОЛОШКИ [Финал] • Майнкрафт

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

Сестра обхитрила!

Сестра обхитрила!