Question 14: Interview question for data engineers

  • Published 16 Sep 2024
  • In this video I discuss a question asked in an MNC interview for a data engineer role, which checks whether the interviewee has worked with JSON files.
    You are tasked with processing a JSON file containing information about sales transactions. Each transaction record consists of the transaction ID, the customer ID, the product ID, the quantity sold, and the timestamp of the transaction. Your goal is to analyze this data using PySpark and perform the following tasks:
    Calculate the total sales revenue generated from each product.
    Identify the top-selling product.
    Determine the total number of transactions for each customer.
    Find the customer who made the most transactions.
    Sample JSON
    [
    {"transaction_id": 1, "customer_id": 101, "product_id": 1, "quantity": 2, "timestamp": "2024-01-01 08:00:00"},
    {"transaction_id": 2, "customer_id": 102, "product_id": 2, "quantity": 1, "timestamp": "2024-01-01 08:30:00"},
    {"transaction_id": 3, "customer_id": 103, "product_id": 1, "quantity": 3, "timestamp": "2024-01-01 09:00:00"},
    {"transaction_id": 4, "customer_id": 101, "product_id": 3, "quantity": 1, "timestamp": "2024-01-01 10:00:00"},
    {"transaction_id": 5, "customer_id": 102, "product_id": 1, "quantity": 2, "timestamp": "2024-01-01 10:30:00"},
    {"transaction_id": 6, "customer_id": 103, "product_id": 2, "quantity": 2, "timestamp": "2024-01-01 11:00:00"}
    ]
    To create the DataFrame (a full worked sketch covering the four tasks follows below):
    sales_df = spark.read.option("multiline",True).json("dbfs:/FileStore/transaction.json")
    #pyspark #mnc #dataengineer #azure #databricks #interview #questions #bigdata #bigdataquestions #json
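    A minimal PySpark sketch for the four tasks, using the same read call as above. Note the sample JSON has no price field, so the revenue step assumes a hypothetical prices_df lookup table; the product prices and the app name are illustrative assumptions, not from the video.

    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.appName("SalesTransactions").getOrCreate()

    # Read the JSON array; "multiline" is needed because each record spans the file as a list
    sales_df = spark.read.option("multiline", True).json("dbfs:/FileStore/transaction.json")

    # Hypothetical price lookup -- the sample JSON contains no price column
    prices_df = spark.createDataFrame(
        [(1, 10.0), (2, 25.0), (3, 40.0)],
        ["product_id", "price"],
    )

    # 1. Total sales revenue per product (quantity * assumed price)
    revenue_df = (
        sales_df.join(prices_df, "product_id")
        .groupBy("product_id")
        .agg(F.sum(F.col("quantity") * F.col("price")).alias("total_revenue"))
    )

    # 2. Top-selling product by total quantity sold
    top_product = (
        sales_df.groupBy("product_id")
        .agg(F.sum("quantity").alias("total_quantity"))
        .orderBy(F.desc("total_quantity"))
        .limit(1)
    )

    # 3. Total number of transactions per customer
    customer_txns = sales_df.groupBy("customer_id").agg(
        F.count("transaction_id").alias("txn_count")
    )

    # 4. Customer who made the most transactions
    top_customer = customer_txns.orderBy(F.desc("txn_count")).limit(1)

    revenue_df.show()
    top_product.show()
    customer_txns.show()
    top_customer.show()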

COMMENTS • 2

  • @rawat7203 6 months ago +1

    Thank you sir

    • @pysparkpulse 6 months ago

      Thank you for your appreciation 😊