How can i dedup data in Pyspark Dataframe which has Str datetime or Timestamp | Explained with Code

Data Engineering Made Easy: Build Datalake on S3 with Apache Hudi & Glue Hands-on Labs for Beginners

Building Data Lakes on AWS: Build a simple Data Lake on AWS with AWS Glue, Amazon Athena, and S3

Beautiful gymnastics 😍☺️

КОЛИЧЕСТВО СЛОВ! Не говори, иначе зашьют рот 👹☠️

ФРАНКІВСЬК ВБИВАЄ: темна історія України

Build your Data-Lake with AWS S3 and Athena using the Glue crawler | correct S3 Folder Structure

Soumil Shah

Переглядів 4 566

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 22 лип 2024
Наука та технологія

КОМЕНТАРІ • 7

@aacasd Рік тому ⁺¹
good demo man
@rankena 2 роки тому ⁺¹
errr, what if you want to filter by FolderA? :D
@josemanuelgutierrez4095 Рік тому
I have a question my friend , the last part wnen you show us all your data on athena , what are the benefits as a company for example to use it ?? . Can you tell me pls , because many company are using the same service but to be honest I don't know exactly the right use of those service . Thank you
@rushikeshkadam5282 2 роки тому
Brother please help
So i have created a custom endpoint URL for Amazon elasticsearch (Open Search) .Certificate is issued from AWS itself and i have configured Route53 with Cname. But still it can't load my custom URL. But the default URL provided by AWS it's working.
I don't what's happening. I am thinking elastic search is not accepting my SSL Certificate.
Any solution how can i connect to my elastic search and kibana via custom url?
@navinsai5726 2 роки тому
Good video brother
Great video soumil...couple of questions: I could not relate your Case A & Case B on folder structure. What is difference between Folder A vs Projectfiles or Folder B vs ProjectFiles 1? Aren't they both same, you just calling a different name folder a refers to projectfiles and folder b refers to projectfiles 1? Can you give a practicle example of case A and Case b folder structure?
Where do you get the values of yyyy/mm/dd for your folder structure? are those load year, month ,day values or date that represent when event or sales occurred?
there are a lot of things that can be done on AWS console but none of the video is teaching a complete deployable code from one environment to another environment, it's the crux of data engineering principles with agile development ....
also, is it practical to ask your data analyst to keep querying with year/dd/mm all the time? my users just want to do "select * from table" ...that's all they know :-0
assume Tableu/quicksight connects to Athena and if it doesn't generate the right partitioning values, how does these queries react, do you get a FAT BILL at the end of the month?
@adityasunny99 2 роки тому
I think bucket structure will be determined by use case, in your case you want all this by year. Suppose if i want it by project files by year, then your 2 architecture will be right. Please correct if i am wrong?
@SoumilShah 2 роки тому ⁺¹
Yes you are right

Наступне

Автоматичне відтворення

How can i dedup data in Pyspark Dataframe which has Str datetime or Timestamp | Explained with Code

How can i dedup data in Pyspark Dataframe which has Str datetime or Timestamp | Explained with Code

Data Engineering Made Easy: Build Datalake on S3 with Apache Hudi & Glue Hands-on Labs for Beginners

Data Engineering Made Easy: Build Datalake on S3 with Apache Hudi & Glue Hands-on Labs for Beginners

Building Data Lakes on AWS: Build a simple Data Lake on AWS with AWS Glue, Amazon Athena, and S3

Building Data Lakes on AWS: Build a simple Data Lake on AWS with AWS Glue, Amazon Athena, and S3

Beautiful gymnastics 😍☺️

Beautiful gymnastics 😍☺️

КОЛИЧЕСТВО СЛОВ! Не говори, иначе зашьют рот 👹☠️

КОЛИЧЕСТВО СЛОВ! Не говори, иначе зашьют рот 👹☠️

ФРАНКІВСЬК ВБИВАЄ: темна історія України

ФРАНКІВСЬК ВБИВАЄ: темна історія України

🤔Какой Орган самый длинный ? #shorts

🤔Какой Орган самый длинный ? #shorts

AWS Tutorials - Partition Data in S3 using AWS Glue Job

AWS Tutorials - Partition Data in S3 using AWS Glue Job

AWS Hands-On: ETL with Glue and Athena

AWS Hands-On: ETL with Glue and Athena

AWS Glue Tutorial for Beginners [FULL COURSE in 45 mins]

AWS Glue Tutorial for Beginners [FULL COURSE in 45 mins]

Deep Dive Into AWS Lake Formation - Level 300 (United States)

Deep Dive Into AWS Lake Formation - Level 300 (United States)

Building a Data Lake on AWS with AWS Glue, Glue Studio, Amazon Athena, and S3

Building a Data Lake on AWS with AWS Glue, Glue Studio, Amazon Athena, and S3

Build and automate Serverless DataLake using an AWS Glue , Lambda , Cloudwatch

Build and automate Serverless DataLake using an AWS Glue , Lambda , Cloudwatch

AWS Athena Tutorial |What is Amazon Athena |Athena + Glue + S3 Data | Athena AWS Tutorial | Edureka

AWS Athena Tutorial |What is Amazon Athena |Athena + Glue + S3 Data | Athena AWS Tutorial | Edureka

I Analyze Data - Best Practices for Implementing a Data Lake in Amazon S3 (Level 200)

I Analyze Data - Best Practices for Implementing a Data Lake in Amazon S3 (Level 200)

Top AWS Services A Data Engineer Should Know

Top AWS Services A Data Engineer Should Know

🖼️Этот девайс не купить в магазине! Самоделка с нейросетью

🖼️Этот девайс не купить в магазине! Самоделка с нейросетью

Electricians have been hiding this for years! Say Goodbye to Batteries-Innovative Tv Remote Solution

Electricians have been hiding this for years! Say Goodbye to Batteries–Innovative Tv Remote Solution

Infinix NOTE 40Pro+5G. НЕДОРОГОЙ СМАРТФОН С ФИШКАМИ ФЛАГМАНА

Infinix NOTE 40Pro+5G. НЕДОРОГОЙ СМАРТФОН С ФИШКАМИ ФЛАГМАНА

Это Xiaomi Su7 Max 🤯 #xiaomi #su7max

Это Xiaomi Su7 Max 🤯 #xiaomi #su7max

Новый питомец! Робот с искусственным интеллектом! Он меня узнал! Anki Cozmo

Новый питомец! Робот с искусственным интеллектом! Он меня узнал! Anki Cozmo

Windows 7. 15 лет спустя. Что она ЕЩЁ может?

Windows 7. 15 лет спустя. Что она ЕЩЁ может?

Россия вводит спутниковое управление дронами Геоскан

Россия вводит спутниковое управление дронами Геоскан

iPhone 16 - НАРЕШТІ ДОЧЕКАЛИСЯ!

iPhone 16 – НАРЕШТІ ДОЧЕКАЛИСЯ!