Parquet File Format - Explained to a 5 Year Old!

What is Apache Spark in less than 10 minutes | An Introduction to Apache Spark architecture

25+ Amazing Excel Shortcuts | Boost Work Speed by 10X | Learn Microsoft Excel

Creative Justice at the Checkout: Bananas and Eggs Showdown #shorts

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

Жінка-головнокомандувач зневажає партнера? - Кохання на виживання - Сезон 5 - Випуск 3 - 04.12.2024

Row Format vs Column Format | Why Parquet is better than Avro | Why Columnar formats are preferred

Learning Journal

Переглядів 13 411

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 8 гру 2024

КОМЕНТАРІ • 11

@PANKAJKUMAR-fe8zn 5 місяців тому
Wonderful explanation. I was studying data cloud in salesforce and they were mentioning this data format multiple time. I was clueless but I got clarity from your video. Thank you sir
@evilgoogle6986 Місяць тому
sql example should have used aggregates. Probably that’s where columnar storage shines
@SanjayKumar-rw2gj 5 місяців тому
Great explanation, to the point no exaggeration. Thanks for the video
@MrSravan84 Рік тому ⁺²
Very nicely explained. But @8:40 you mentioned that the column 2 can go in the different same block or different block and @11:29 you mentioned that Spark knows that column 2 is stored in Block-2. These 2 statements are sort of causing confusion. i.e., if a column of each row can be spread across multiple blocks how does Spark know which block to search ?
@mustafabohra2070 Місяць тому
Sir, you are genius!!
@nindersingh Рік тому ⁺¹
In Block 1 R3C3 is mentioned as wrong 🚫, this must be R2C3. Because R3C3 is coming in Block 2 as expected.
@cheluveshab9525 2 роки тому ⁺¹
Pleasure do make a video on compression techniques
@sumanthb3280 2 роки тому ⁺¹
So, why is Avro used in some projects?
@sumitnekar8965 2 роки тому ⁺¹
One scenario i can think of,Avro over plain json offers benefits like schema evolution which can be beneficial in case of multiple producers and consumers setup. If you are using json data format with kafka topics in a data pipeline, avro format can be leveraged instead of json.
@josephjoestar995 Рік тому
@@sumitnekar8965could you explain further please? I’m doing some investigation work on choosing avro v parquet v delta tables for Azure Event Hubs output, your explanation would be appreciated 🙏
@James-l5s7k Рік тому ⁺¹
As a mathematician I must inform you that having a row space vs a column space is an isomorphism. There is no difference; it's in your head.

Наступне

Автоматичне відтворення

Parquet File Format - Explained to a 5 Year Old!

Parquet File Format - Explained to a 5 Year Old!

What is Apache Spark in less than 10 minutes | An Introduction to Apache Spark architecture

What is Apache Spark in less than 10 minutes | An Introduction to Apache Spark architecture

25+ Amazing Excel Shortcuts | Boost Work Speed by 10X | Learn Microsoft Excel

25+ Amazing Excel Shortcuts | Boost Work Speed by 10X | Learn Microsoft Excel

Creative Justice at the Checkout: Bananas and Eggs Showdown #shorts

Creative Justice at the Checkout: Bananas and Eggs Showdown #shorts

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

Жінка-головнокомандувач зневажає партнера? - Кохання на виживання - Сезон 5 - Випуск 3 - 04.12.2024

Жінка-головнокомандувач зневажає партнера? – Кохання на виживання – Сезон 5 – Випуск 3 – 04.12.2024

Як в Уторопах варять сіль із соровиці з місцевого джерела

Як в Уторопах варять сіль із соровиці з місцевого джерела

File Formats [Row based vs Columnar Format] #parquet #avro #orc

File Formats [Row based vs Columnar Format] #parquet #avro #orc

Row based & Column based formats | Demystifying RC Format in Big Data

Row based & Column based formats | Demystifying RC Format in Big Data

Big Data File Format Performance Comparison [CSV Vs JSON Vs AVRO vs PARQUET]

Big Data File Format Performance Comparison [CSV Vs JSON Vs AVRO vs PARQUET]

File Formats: Big Data- Parquet, Avro, ORC | The Data Channel

File Formats: Big Data- Parquet, Avro, ORC | The Data Channel

What is Apache Parquet file?

What is Apache Parquet file?

Data Caching in Apache Spark | Optimizing performance using Caching | When and when not to cache

Data Caching in Apache Spark | Optimizing performance using Caching | When and when not to cache

Different Data File Formats in Big Data Engineering

Different Data File Formats in Big Data Engineering

The Parquet Format and Performance Optimization Opportunities Boudewijn Braams (Databricks)

The Parquet Format and Performance Optimization Opportunities Boudewijn Braams (Databricks)

Spark Data Frame Internals | Map Reduce Vs Spark RDD vs Spark Dataframe | Look inside the Dataframe

Spark Data Frame Internals | Map Reduce Vs Spark RDD vs Spark Dataframe | Look inside the Dataframe

Unexpected way to open the new Audi A6 e-tron Frunk 😮! #shorts

Unexpected way to open the new Audi A6 e-tron Frunk 😮! #shorts

Cheerleader Transformation That Left Everyone Speechless! #shorts

Cheerleader Transformation That Left Everyone Speechless! #shorts

Подарував батьку-військовому машину його мрії

Подарував батьку-військовому машину його мрії

УГАДАЙ ПРЕДМЕТ chill 🍁| WICSUR #shorts

УГАДАЙ ПРЕДМЕТ chill 🍁| WICSUR #shorts

Creative Justice at the Checkout: Bananas and Eggs Showdown #shorts

Creative Justice at the Checkout: Bananas and Eggs Showdown #shorts

УДИВИЛ ВСЕХ СВОИМ УХОДОМ!😳 #shorts

УДИВИЛ ВСЕХ СВОИМ УХОДОМ!😳 #shorts

99.9% IMPOSSIBLE

99.9% IMPOSSIBLE

АЗИЯ! 1000 дней ВЫЖИВАНИЯ на КЛАНОВОМ СЕРВЕРЕ в РАСТ/RUST

АЗИЯ! 1000 дней ВЫЖИВАНИЯ на КЛАНОВОМ СЕРВЕРЕ в РАСТ/RUST