033 Shuffle and Sort in Hadoop

  • Published 25 Aug 2024

COMMENTS • 16

  • @its_joel7324 • 2 years ago

    Thank you very much for this.

  • @rytmf • 3 years ago

    Great explanation. Thank you.

  • @judesoosai8648 • 6 years ago +1

    I understand that the merging of files on the reducer side happens in multiple rounds, with a maximum of 10 files in each round (configurable, known as the merge factor). The final merge happens in reducer memory, and the number of files in the final round is kept equal to the merge factor (default 10). To achieve this, the merge logic groups the files accordingly.
    With 40 files it goes like this:
    merge 4 files -> 1 file (round 1)
    merge 10 files -> 1 file (round 2)
    merge 10 files -> 1 file (round 3)
    merge 10 files -> 1 file (round 4)
    At this point we have 4 merged files and 6 unmerged files (10 in total).
    In round 5, these 10 files are merged in reducer memory.
    However, I am not clear how this logic makes the disk I/O efficient. (See the merge-pass sketch below the comments.)

  • @mohammadsadaquat3624 • 8 years ago

    Very nice explanation. Keep posting new content. Thanks.

  • @nsb5467 • 8 years ago +2

    Hi, can you explain why using three files for the first reducer split increases disk I/O efficiency?

  • @akashgaikwad6847 • 7 years ago +1

    How is disk I/O efficiency increased by merging the first 3 files into one and then processing the rest in batches of ten?
    The files have already been moved over the network, so how does this increase I/O efficiency? How is the example given at the end related? Please elaborate.

  • @JMK2928 • 2 years ago

    Are there any notes?

  • @mahendarkusuma • 7 years ago

    Very good presentation. Can you please tell me which tool you are using to generate the simulations?

  • @sunnyjain4774 • 6 years ago +1

    I already read this in Hadoop: The Definitive Guide. Can you explain how partitioning takes place during a spill? Thanks. (See the partitioner sketch below the comments.)

  • @charleygrossman8368 • 8 years ago

    Hello, I have a question.
    Regarding the sort phase, would you consider the theoretical sort (the first one), with three even splits, to be a bucket sort? And for the actual sort (the second one) that is implemented, why does it begin with three partitions, then 10, 10, and finally the remaining 7 files?
    Thank you, sir.

  • @sonalisharma9654 • 6 years ago

    Very helpful

  • @VibeWithSingh • 8 years ago

    Nice explanation, though I didn't understand the last splitting part. Still, kudos. :)

  • @kirantvbk • 6 years ago

    When files spill over to disk, the data gets partitioned and sorted. Does it need to read the data back into memory, sort it, and write it back out? Or does the sort happen on disk?

  • @shaikhmohammedatif2391 • 3 years ago

    Have you made another channel?

  • @spirridd • 5 years ago

    This video is impossible to understand.
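
For reference, here is a minimal sketch of the merge-pass grouping that @judesoosai8648 describes above (40 map outputs, merge factor 10). It is written in Java, but it is only an illustration of the pass-factor idea, not Hadoop's actual Merger code; the class and method names are made up for this sketch.

// Plans the reducer-side merge rounds for a given number of files and merge factor.
public class MergePlanSketch {

    // Files to merge in this pass. For the first pass, merge just enough files
    // ((n - 1) % (factor - 1) + 1) so that every later pass can take a full
    // `factor` files and exactly `factor` streams remain for the final merge.
    static int passFactor(int factor, int passNo, int remaining) {
        if (passNo > 1 || remaining <= factor || factor == 1) {
            return Math.min(factor, remaining);
        }
        int mod = (remaining - 1) % (factor - 1);
        return (mod == 0) ? factor : mod + 1;
    }

    public static void main(String[] args) {
        int files = 40;   // map outputs fetched by one reducer (example from the comment)
        int factor = 10;  // merge factor (mapreduce.task.io.sort.factor, default 10)

        int pass = 1;
        while (files > factor) {
            int toMerge = passFactor(factor, pass, files);
            files = files - toMerge + 1;  // merged files are replaced by one file on disk
            System.out.printf("round %d: merge %d files -> 1 file (%d files remain)%n",
                              pass, toMerge, files);
            pass++;
        }
        // Final round: the remaining streams (exactly `factor` of them) are fed
        // straight into the reduce function rather than merged into one more file.
        System.out.printf("round %d: final merge of %d streams feeds the reducer%n",
                          pass, files);
    }
}

Running this prints the same schedule as the comment (4, 10, 10, 10, then a final 10-way merge). As far as I understand, merging only 4 files in the first round is the minimum needed to leave exactly the merge factor for the last round; that last round then streams its inputs directly into reduce instead of writing one more fully merged file, which keeps the amount of data written to and re-read from disk as small as possible.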
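
On @sunnyjain4774's question about partitioning during a spill: as each map output record is collected, it is assigned a partition number (one partition per reducer) and written into the in-memory buffer; when the buffer spills, the records are sorted by partition and, within each partition, by key, then written to the spill file partition by partition. Below is a small illustration of the default rule, Hadoop's HashPartitioner. The demo class, key values, and reducer count are made up for the example, and it needs the Hadoop client libraries on the classpath.

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.lib.partition.HashPartitioner;

// Shows how map output keys are assigned to reduce partitions before the spill.
public class PartitionDemo {
    public static void main(String[] args) {
        HashPartitioner<Text, IntWritable> partitioner = new HashPartitioner<>();
        int numReduceTasks = 3; // one partition per reducer

        for (String word : new String[] {"shuffle", "sort", "merge", "spill"}) {
            // HashPartitioner computes (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks
            int partition = partitioner.getPartition(
                    new Text(word), new IntWritable(1), numReduceTasks);
            System.out.println(word + " -> partition " + partition);
        }
    }
}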