Lets Build Streaming Solution using Kafka + PySpark and Apache HUDI Hands on Lab with code
Вставка
- Опубліковано 2 жов 2024
- Lets Build Streaming Solution using Kafka + PySpark and Apache HUDI Hands on Lab with code
Code
github.com/sou...
Looking foreword to configure HUDI on Windows
Apache Hudi on Windows Machine Spark 3.3 and hadoop2.7 Step by Step guide and Installation Process
• Apache Hudi on Windows...
#aws #cloud #cloudcomputing #azure #devops #technology #python #amazonwebservices #linux #amazon #programming #awscloud #cybersecurity #coding #googlecloud #developer #kubernetes #bigdata #datascience #microsoft #machinelearning #software #java #tech #it #gcp #awstraining #javascript #security #dockerOnehousenadine farah 🇺🇦
Nice video, thanks. I tried this on AWS Glue. It is working but it is not able to generate Glue catalog based table + It generated many many small files (It is not automatically compacting)? Have you tried running this on AWS Glue?
Thanks for the video. It's a good one. Do you have any samples related to the scenario where we have to read the Avro data from a Kafka topic and upsert into the Hudi tables?
How will you stop spark steaming after certain time,I don't want to manually stop the streaming job,any idea?
www.linkedin.com/pulse/how-shutdown-spark-streaming-job-gracefully-lan-jiang
just awesome work :)
Thank you brother..
just wow
Soumil, this video was amazing !!
Love all your content :)
By any chance do you have a linkedin do connect ?
Thanks bro 🎉
Yes I have a LinkedIn and happy to connect
@@SoumilShah already sent LinkedIn request :)
amazing sir, Now I m your fan
Thanks and welcome
tu mast kaam karta hai soumil bhai
Thanks a lot bhai ❤