One of the best interview series. Thank you, Sumit sir.
glad to know that you liked it.
One of the best explanations so far on YouTube. I wish I could afford your course :(
Need more PySpark interview solutions like this 😊
Best selection of questions and very good explanation.
Thanks a lot, Sumit! I am a senior data engineer with 5 years of experience, but since we mostly don't work with DataFrames or PySpark, I am not able to do these simple things.
You are doing a great job posting these❤
Very useful, informative video that gives more confidence to big data aspirants. Thanks, Sumit.
00:03 Recently asked PySpark coding questions
02:37 Writing and executing PySpark pseudo code
05:21 Creating a Spark DataFrame from input and performing group-by aggregation
08:04 Using aggregation functions and collect_list in PySpark
11:15 Spark SQL solution for creating a DataFrame and running queries
14:18 Understanding the DataFrame reader API for reading JSON and the usage of the explode function
17:11 Creating a Spark DataFrame and performing operations on it
19:44 Converting string to date and performing group by in a PySpark DataFrame
22:32 Finding the average stock value using PySpark
25:38 Practice more on DataFrames for interviews
28:15 Practice more to gain confidence in writing correct PySpark syntax
Thank you, sir, for the best explanation. Can you please come up with more examples?
Sir... we need more. Please continue this playlist.
Hi Sumit, about the last question on aggregation and the max average of a stock: there should be a time along with the date, because stock prices actually change at different times during the day.
Then we need to convert it into yyyy-MM-dd format to get the day-specific stock prices, take their average, and then the max of the averages. Just thought of sharing. The overall implementation would still be the same :) cheers
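A minimal sketch of this approach, assuming a hypothetical DataFrame with stock, trade_time, and price columns (these names are illustrative, not from the video):

from pyspark.sql.functions import to_date, avg, max as max_

# Hypothetical input: each stock has multiple prices per day at different times
stocks = spark.createDataFrame(
    [('AAPL', '2024-01-02 09:30:00', 185.0),
     ('AAPL', '2024-01-02 15:45:00', 187.0),
     ('AAPL', '2024-01-03 10:00:00', 182.0)],
    "stock string, trade_time string, price double")

# Truncate the timestamp to a yyyy-MM-dd date, then average per stock per day
daily_avg = (stocks
    .withColumn('trade_date', to_date('trade_time', 'yyyy-MM-dd HH:mm:ss'))
    .groupBy('stock', 'trade_date')
    .agg(avg('price').alias('avg_price')))

# Max of the per-day averages for each stock
daily_avg.groupBy('stock').agg(max_('avg_price').alias('max_avg_price')).show()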
It would be great if you put the questions in a comment, so others can try without looking at the solution first.
Thank you sir😄
Best explanation sir thanks
I am happy to hear this
We can apply distinct() too, I guess, to avoid duplicate values in the df.
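For example (an illustrative sketch, assuming an existing SparkSession named spark):

df = spark.createDataFrame([(1, 'a'), (1, 'a'), (2, 'b')], "id int, val string")
df.distinct().show()                 # drops fully duplicate rows
df.dropDuplicates(['id']).show()     # dedupes on a chosen subset of columns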
Superb
Much needed sir.....!!!
Sujoy, I am sure you will enjoy watching this.
Thanks Sumit, please make more videos like this.
definitely
Nice explanation sir, kindly post scenario based questions
yes for sure
Thank you, Sir, greatly explained. It would be good if you could also post the data/schemas in the description box for us to query and do hands-on practice. Thanks! :)
Hi Sumit,
Could you please create a video explaining end-to-end pipelines on AWS Databricks, along with their orchestration?
What about the remaining 10 PySpark questions? You said we would be covering them in the next video, but you still haven't uploaded it on YouTube. When will you upload it? We are waiting for the remaining 10 PySpark questions.
Thank you ❤
Amazing sir
Nikhil, I am sure you will find it useful.
Hi Sir, can we not write Spark SQL in an interview, since there is no difference in performance?
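For reference, a sketch of how the same group-by/collect_set aggregation could be expressed in Spark SQL (using made-up sample data like the Q2 snippet below, not the video's exact dataset):

df = spark.createDataFrame(
    [('a', 'aa', 1), ('a', 'aa', 2), ('b', 'bb', 5)],
    "col1 string, col2 string, col3 int")
df.createOrReplaceTempView("t")

# Same aggregation as the DataFrame API version, written in Spark SQL
spark.sql("""
    SELECT col1, col2, collect_set(col3) AS col3_values
    FROM t
    GROUP BY col1, col2
""").show()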
This is great!
thank you Umesh
Nice video
thank you
In question number 2, do we not need to remove duplicates at the end? Can you please clarify this for me?
Hello sir, how can I run PySpark code online? Are you also using an online utility to run the PySpark code shown in this video? Could you please share the source? It would be very helpful.
Sir, please create a coding interview playlist.
Q2.

from pyspark.sql.functions import col, collect_set

# Sample input: (col1, col2, col3)
data = [('a', 'aa', 1),
        ('a', 'aa', 2),
        ('b', 'bb', 5),
        ('b', 'bb', 3),
        ('b', 'bb', 4)]
data_schema = "col1 string, col2 string, col3 int"

df_data = spark.createDataFrame(data=data, schema=data_schema)
df_data.display()

# Group by col1/col2 and collect the col3 values;
# collect_set also removes duplicates (use collect_list to keep them)
result = (df_data.groupBy(col('col1'), col('col2'))
          .agg(collect_set(col('col3')).alias('col3_values')))
result.display()