PySpark Advanced Interview Questions, Part 1

  • Published 11 Sep 2024

COMMENTS • 31

  • @abhilash0410
    @abhilash0410 3 years ago +8

    Bro, bring more real-time interview questions like these. Thank you so much!

  • @vedanthasm2659
    @vedanthasm2659 3 years ago +3

    One of the best explanations. Bro, please make more videos on PySpark.

  • @sjitghosh
    @sjitghosh 2 years ago +3

    You are doing excellent work. It helps a lot!!

  • @saachinileshpatil
    @saachinileshpatil 7 months ago +1

    Thanks for sharing 👍, very informative

  • @rocku4evr
    @rocku4evr 2 years ago +1

    Great... fortunate to be your subscriber.

  • @seshuseshu4106
    @seshuseshu4106 3 years ago +1

    Very good, detailed explanation. Thanks for your efforts; please keep it up.

  • @janardhanreddy3267
    @janardhanreddy3267 6 months ago

    Nice explanation. Please attach a CSV or JSON file in the description for practice.

  • @nsrchndshkh
    @nsrchndshkh 3 years ago +1

    Thanks, man. This was a detailed explanation. Kudos!

  • @akashpb4044
    @akashpb4044 2 years ago +1

    Awesome video... cleared my doubts 👍👍👍

  • @varuns4472
    @varuns4472 2 years ago

    Nice one

  • @sanooosai
    @sanooosai 5 months ago

    Great, thank you!

  • @achintamondal1494
    @achintamondal1494 1 year ago +1

    Awesome video.
    Could you please share the notebook? It would really help.

  • @janardhanreddy3267
    @janardhanreddy3267 6 months ago

    Please upload all the PySpark interview question videos.

  • @fratkalkan7850
    @fratkalkan7850 2 years ago

    Very clean explanation, thank you sir.

  • @shreekrishnavani7868
    @shreekrishnavani7868 2 years ago

    Nice explanation 👌 thanks!

  • @rajanib9057
    @rajanib9057 11 months ago

    Can you please explain how Spark filtered those 2 columns as bad data? I don't see any WHERE condition mentioned for the corrupt column.
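
    A minimal PySpark sketch of the mechanism being asked about (the schema and file path here are assumptions, not taken from the video): in the default PERMISSIVE read mode there is no user-written WHERE condition; the CSV reader itself routes every row that fails to parse against the schema into _corrupt_record, so filtering on that column being non-null is what surfaces the bad rows.

    from pyspark.sql.types import StructType, StructField, IntegerType, StringType

    schema = StructType([
        StructField("cust_id", IntegerType(), True),
        StructField("cust_name", StringType(), True),
        StructField("_corrupt_record", StringType(), True),  # catch-all for bad rows
    ])

    df = (spark.read
               .option("mode", "PERMISSIVE")  # the default: keep bad rows, don't fail
               .option("header", True)
               .schema(schema)
               .csv("/FileStore/tables/csv_with_bad_records.csv"))
    df.cache()  # see the thread below on why this is needed before filtering
    df.filter("_corrupt_record is not null").show()  # the bad rows, no other WHERE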

  • @rahulyeole6411
    @rahulyeole6411 2 years ago

    Please share a basic big data video.

  • @johnsonrajendran6194
    @johnsonrajendran6194 3 years ago

    Are any such mode options available while reading Parquet files?

  • @balajia8376
    @balajia8376 2 years ago

    It seems querying _corrupt_record is not working. I tried it today, and it's not allowing me to query with the column name: cust_df.filter("_corrupt_record is not null"). AnalysisException: Since Spark 2.3, the queries from raw JSON/CSV files are disallowed when the
    referenced columns only include the internal corrupt record column
    (named _corrupt_record by default). For example:
    spark.read.schema(schema).csv(file).filter($"_corrupt_record".isNotNull).count()
    and spark.read.schema(schema).csv(file).select("_corrupt_record").show().
    Instead, you can cache or save the parsed results and then send the same query.
    For example, val df = spark.read.schema(schema).csv(file).cache() and then
    df.filter($"_corrupt_record".isNotNull).count().

    • @TRRaveendra
      @TRRaveendra  2 years ago

      cust_df.cache()
      Cache the dataframe and it won't raise the exception.
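
      A minimal sketch of this workaround: cache (or save) the parsed result and run an action first, which is exactly what the AnalysisException text above suggests. The schema and file path are assumptions; the schema must declare _corrupt_record as a string column.

      from pyspark.sql.types import StructType, StructField, StringType

      schema = StructType([
          StructField("cust_id", StringType(), True),
          StructField("cust_name", StringType(), True),
          StructField("_corrupt_record", StringType(), True),
      ])

      cust_df = spark.read.schema(schema) \
                    .csv("/FileStore/tables/csv_with_bad_records.csv")
      cust_df.cache()   # cache the parsed result first...
      cust_df.count()   # ...and run an action so the cache is materialized
      cust_df.filter("_corrupt_record is not null").show()   # no AnalysisException now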

    • @balajia8376
      @balajia8376 2 years ago

      @@TRRaveendra Yes, I did. Even after that it's still not allowing me to write a query on _corrupt_record is null or not null.

    • @balajia8376
      @balajia8376 2 years ago

      It seems badRecordsPath is the only solution.
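
      A minimal sketch of the badRecordsPath alternative; note this is a Databricks-specific option, and the output path and schema here are assumptions.

      from pyspark.sql.types import StructType, StructField, StringType

      schema = StructType([          # no _corrupt_record column needed here
          StructField("cust_id", StringType(), True),
          StructField("cust_name", StringType(), True),
      ])

      df = (spark.read
                 .option("badRecordsPath", "/tmp/bad_records")   # Databricks-specific
                 .schema(schema)
                 .csv("/FileStore/tables/csv_with_bad_records.csv"))
      # rows that fail to parse are written out as JSON files under
      # /tmp/bad_records instead of surfacing inside the DataFrame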

  • @sachintiwari6846
    @sachintiwari6846 1 year ago

    Whoa, what an explanation!

  • @balajia8376
    @balajia8376 2 years ago

    cust_df.select("_corrupt_record").show() is working, but it's not allowing is null or not null: cust_df.select("_corrupt_record is null").show(). Let me know if this is working for you. Thank you.

  • @naveendayyala1484
    @naveendayyala1484 11 months ago

    Please share the notebook in .dbc format.

  • @swagatikatripathy4917
    @swagatikatripathy4917 2 years ago +1

    Why do we write inferSchema = true?

    • @TRRaveendra
      @TRRaveendra  2 years ago +2

      inferSchema = True creates the column data types by sampling the data.
      header = True takes the column names from the file's first line.
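
      A minimal sketch showing the two options together (the file path is an assumption):

      df = (spark.read
                 .option("header", True)        # first line supplies the column names
                 .option("inferSchema", True)   # scan the data to choose int/long/string, etc.
                 .csv("/FileStore/tables/customers.csv"))
      df.printSchema()   # without inferSchema, every column would come back as string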

  • @srikanthbachina7764
    @srikanthbachina7764 1 year ago

    Hi, please share your contact details. I am looking for Python, PySpark, and Databricks training.

  • @balajia8376
    @balajia8376 2 years ago

    root
    |-- cust_id: integer (nullable = true)
    |-- cust_name: string (nullable = true)
    |-- manager: string (nullable = true)
    |-- city: string (nullable = true)
    |-- phno: long (nullable = true)
    |-- _corrupt_record: string (nullable = true)

    display(cust_df.filter("_corrupt_record is not null")) raises:
    FileReadException: Error while reading file dbfs:/FileStore/tables/csv_with_bad_records.csv.
    Caused by: IllegalArgumentException: _corrupt_record does not exist. Available: cust_id, cust_name, manager, city, phno
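
    For reference, a sketch combining the fixes discussed in this thread: declare _corrupt_record explicitly in the user-supplied schema (the column names below come from the printSchema output above) and cache plus run an action before filtering, as Spark's own AnalysisException message advises. display() is the Databricks notebook helper used in the comment; the header option is an assumption.

    from pyspark.sql.types import StructType, StructField, IntegerType, StringType, LongType

    schema = StructType([
        StructField("cust_id", IntegerType(), True),
        StructField("cust_name", StringType(), True),
        StructField("manager", StringType(), True),
        StructField("city", StringType(), True),
        StructField("phno", LongType(), True),
        StructField("_corrupt_record", StringType(), True),   # declared explicitly
    ])

    cust_df = spark.read.schema(schema).option("header", True) \
                  .csv("dbfs:/FileStore/tables/csv_with_bad_records.csv")
    cust_df.cache()
    cust_df.count()   # materialize the parse before touching _corrupt_record
    display(cust_df.filter("_corrupt_record is not null"))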