Data Engineering Mock Interview | Spark Optimization Interview Questions | Best Coding Practices
- Published 20 Aug 2024
- To enhance your career as a Cloud Data Engineer, check trendytech.in/... for curated courses developed by me.
I have trained 20,000+ professionals in the field of Data Engineering in the last 5 years.
Want to master SQL? Learn SQL the right way through the most sought-after course - SQL Champions Program!
"An 8-week program designed to help you crack the interviews of top product-based companies by developing a thought process and an approach to solve an unseen problem."
Here is how you can register for the program -
Registration Link (Course Access from India) : rzp.io/l/SQLINR
Registration Link (Course Access from outside India) : rzp.io/l/SQLUSD
30 INTERVIEWS IN 30 DAYS - BIG DATA INTERVIEW SERIES
This mock interview series is launched as a community initiative under Data Engineers Club, aimed at aiding the community's growth and development.
Expert guest interviewer Sachin R (/ sachin-r27) imparts invaluable insights and practical advice derived from extensive experience.
Suman Basu (/ basusuman23), a skilled guest interviewee, showcases an exceptional approach to answering interview questions.
Links to the free SQL & Python series developed by me are given below -
SQL Playlist - • SQL tutorial for every...
Python Playlist - • Complete Python By Sum...
Don't miss out - Subscribe to the channel for more such informative interviews and unlock the secrets to success in this thriving field!
Social Media Links :
LinkedIn - / bigdatabysumit
Twitter - / bigdatasumit
Instagram - / bigdatabysumit
Student Testimonials - trendytech.in/...
Discussed Questions : Timestamp
1:37 Introduction
2:50 Brief about your project responsibilities
5:26 Discuss SQL code documentation best practices for ensuring query efficiency.
9:56 What are transformations and actions in PySpark DataFrames?
10:35 What are the best practices you have followed specific to PySpark?
12:39 What is the difference between cache and persist?
13:33 Explain the concept of partitioning.
14:58 When allocating multiple worker nodes/executors, how to increase or decrease the number of partitions?
16:38 What is data skewness? Which is more effective in avoiding it: repartition or coalesce?
18:07 Coding questions
36:20 Dealing with data quality issues
38:30 After fetching data from CSV files, how would you define the schema?
41:00 Preferred file format for data loading.
Tags
#mockinterview #bigdata #career #dataengineering #data #datascience #dataanalysis #productbasedcompanies #interviewquestions #apachespark #google #interview #faang #companies #amazon #walmart #flipkart #microsoft #azure #databricks #jobs
I like how Sachin framed the questions, asking which best practices you follow in SQL and PySpark, to understand the candidate better.
Sachin is really knowledgeable, and he is helping to answer the questions as well with Suman.
Yes, both have been great. Kudos to Sachin & Suman.
This is such an amazing initiative... While watching the video I felt as if I was being interviewed... I can't stress enough how helpful this will be for so many people. It gave me a very good idea of the level of my preparation. Thanks a lot, and I hope you will create more videos like this.
36:03 1. He is asking only for the highest salary.
2. Department-wise highest salary.
Use SQL code as follows:
1. select max(salary) from emp;
2. select dept, max(salary) from emp group by dept;
It is as simple as that. He did not ask you to write a window function; if he asks, then do it 😊
In case 1, we should use a window function because we need to print the id and name as well.
@sriharidhanakshirur9245 in this case you can use a subquery as well. If anyone explicitly asks whether there is another way, or asks you to do it using window functions, the interviewer will be impressed at that point 😊
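The three approaches debated in this thread (plain aggregate, subquery, window function) can be compared side by side. The following is only a minimal sketch: it runs the SQL through Python's built-in sqlite3 module, the emp table's columns (id, name, dept, salary) and all sample rows are assumptions made for illustration, and the window-function query assumes SQLite 3.25 or newer. The exact question asked in the video may have differed.

```python
import sqlite3

# Hypothetical emp table; the columns and rows are assumptions for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (id INTEGER, name TEXT, dept TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO emp VALUES (?, ?, ?, ?)",
    [
        (1, "Asha", "ENG", 120),
        (2, "Ravi", "ENG", 150),
        (3, "Meera", "HR", 90),
        (4, "John", "HR", 95),
    ],
)

# Department-wise maximum only: a plain aggregate is enough.
dept_max = conn.execute(
    "SELECT dept, MAX(salary) FROM emp GROUP BY dept ORDER BY dept"
).fetchall()

# Highest-paid employee overall, including id and name, via a subquery.
top_subquery = conn.execute(
    "SELECT id, name, salary FROM emp "
    "WHERE salary = (SELECT MAX(salary) FROM emp)"
).fetchall()

# Highest-paid employee per department, including id and name,
# via a window function (needs SQLite 3.25+).
top_window = conn.execute(
    """
    SELECT id, name, dept, salary
    FROM (
        SELECT id, name, dept, salary,
               RANK() OVER (PARTITION BY dept ORDER BY salary DESC) AS rnk
        FROM emp
    ) AS ranked
    WHERE rnk = 1
    ORDER BY dept
    """
).fetchall()

print(dept_max)
print(top_subquery)
print(top_window)
```

The aggregate answers "what is the highest salary", while the subquery and window versions also return who earns it. RANK() keeps ties, so two employees sharing the top salary in a department would both be returned.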
Great initiative. Thank you Sumit Sir 🙏. Looking forward to more such videos. Keep up the good work 👍
Thanks a ton
Thank you so much, Sumit sir. Really a great initiative.
Thank you very much.
Sir, as I have seen over the last 3 days, the cloud tech you use every time is Azure only. Please cover AWS too; it would be very helpful.
Definitely, you will see a lot of variety.
Please increase the complexity of the interviews a little, because actual interviews are more complex 😊
Candidates mostly get stuck on basic fundamentals. These are actual people who conduct interviews in companies.
Informative!
If possible, also mention the experience level these interviews are targeting (for example, fresher, 1 year, or 3 years of experience).
Thank you Sumit Sir
You are welcome.
Thank you, Sumit sir, for this initiative.