Thank you for watching! If you found Part 1 valuable and want to dive deeper, the full tutorial is available on Udemy.
▶ Get the Full Course on Udemy -> www.udemy.com/course/end-to-end-azure-data-engineering-real-time-project/?referralCode=626B44A4C9AA848ACB53
Thank you for supporting my work, and I’m excited to help you continue your learning journey!
Why did you hide your content, sir? I thought you were the only teacher helping students like us by providing the best content for free.
I sincerely apologize for this situation. Unfortunately, due to Udemy's policies, I had to remove the content. Thank you so much for your understanding and continued support.
Great playlist for someone with zero knowledge of ETL/Azure. Good for clearing up the fundamentals of Azure resources.
Thank you so much :)
This is the cleanest explanation of Azure I have ever come across.
This helped me find my job. Thank you!
which company and role?
I just started doing data engineering projects, and to be honest, this series deserves to be in the top results. Very useful content, easily understandable for newbies. Eagerly waiting for new projects using new tools and cloud services.
Thank you so much :)
@@mr.ktalkstech Hello sir, can I have your LinkedIn ID?
Do you know if they are all open source?
Wow, what an explanation... huge respect.
Keep it up, you will have a good following soon.
Thank you so much :)
I am a starter in DE... your illustrations are awesome, and I have subscribed to the channel for more updates.
Thank you so much :)
Excellent Explanation...🔥🔥🔥🔥
Thank you so much :)
This video is a savior for new aspirants.
Please bring more... you teach very well!
Thank you so much :) sure, will do
Excellent content brother
Thank you so much :)
Excellent... superb tutorial. A fantastic explanation in a nutshell.
Thank you so much :)
Hi Mr. K, your lessons give a view of the roles in Data Engineering. I really appreciate your videos and would like to thank you, sir. May God bless you and your family.
Thank you so much :)
Amazing content. Congrats and thanks!
Please carry on with more videos; your knowledge sharing is helping us a lot 🙏
Thank you so much :)
Amazing content, thank you for sharing this video series!
Thank you for explaining the concept simply with the presentation.
Thank you so much :)
This video series is a game changer for me
You are really amazing. Seriously waiting for more such projects!
Thank you so much :)
Hi, great tutorial and indeed good learning for starters like me. Could you also please make an end-to-end Azure data engineering real-time project with a continuous data stream and readily available big data (so that we can download it from your link)? It would be of great help to us.
Thank you for these videos, really appreciate the time and efforts.
Thank you so much :)
Such a good explanation, great work. Please post about complex challenges faced by data engineers and their solutions.
Thank you so much :)
Awesome, some key concepts finally clicked in my brain. Great breakdown!
Thank you so much :)
Very good session... thanks for bringing this project. Subscribed to the channel after seeing the content.
Thank you so much :)
The concept you just showed in 11 minutes is worth more than others' entire playlists 😂. Good to be your subscriber, man ❤ Please keep making videos and helping students!
Thank you so much for the biggest compliment :)
@@mr.ktalkstech even I feel the same. Brother, your concepts are very clear, awesome videos.
Why do we use Databricks? Azure Synapse Analytics does ETL.
Perhaps he wants to expose you to as many tools as possible.
In the real world, with a similar setup, may I know what the reason would be for using Databricks instead of Data Factory for transforming the data between the layers?
Although we can use Data Flow in ADF for transformation, it is easier to use PySpark with DataFrames, and PySpark is fast too. One more thing: we mostly use ADF for orchestration.
we are using the Medallion architecture at my job now.
Great informative video! Quick question: why is Synapse Analytics needed? Can't Power BI get its feed directly from the gold layer in the data lake?
Thank you so much :) We can connect directly from the Data Lake as well, but it's always recommended to use a structured database as the serving layer for reporting: it's scalable, and handling security is simpler :)
Good explanation
Thank you so much :)
hi,
Can you make a video covering Azure and Spark related interview questions and answers with respect to real-time scenarios, focusing on the optimizations done specifically for the use case and not general methodologies?
These are the questions I was asked recently:
1) How do you recover a corrupt Parquet data file?
2) You have millions of records in the bronze layer, and after transformations you have 50 million records in the gold layer.
You find that there are corrupt files in only one partition at the gold layer.
How will you recover the files of that particular partition without rerunning the entire pipeline, given that there are millions of rows in both layers?
3) What are the actual optimizations done in your project to achieve a) execution-time optimization and b) join-level optimization?
The interviewer did not want the generic answers that we know or would have read theoretically.
He wanted specifics on how I implemented it in the project.
Please do a video with such tricky questions.
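For what it's worth, a common answer to question 2 is: filter the bronze source down to the affected partition key, re-run only the transformation for those rows, and overwrite just that partition (with a Delta Lake table this last step would be a write with the `replaceWhere` option). A plain-Python sketch of the idea, with in-memory dicts standing in for the partitioned gold table and a made-up transformation:

```python
def transform(row):
    # Stand-in for the real bronze-to-gold transformation.
    return {**row, "total_with_tax": round(row["total"] * 1.1, 2)}

# Toy bronze source rows; gold is "partitioned" by order_date.
bronze = [
    {"order_id": 1, "order_date": "2024-01-05", "total": 100.0},
    {"order_id": 2, "order_date": "2024-01-05", "total": 50.0},
    {"order_id": 3, "order_date": "2024-01-06", "total": 75.0},
]

# Pretend the "2024-01-05" gold partition holds corrupt files.
gold = {"2024-01-05": ["<corrupt>"], "2024-01-06": [transform(bronze[2])]}

bad_key = "2024-01-05"
subset = [r for r in bronze if r["order_date"] == bad_key]  # only that partition's source
gold[bad_key] = [transform(r) for r in subset]              # overwrite only that partition
```

With Spark the overwrite would look roughly like `df.write.format("delta").mode("overwrite").option("replaceWhere", "order_date = '2024-01-05'")`, which rewrites one partition while leaving the rest of the table untouched.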
Hi sir, why have you kept this playlist hidden? It's very useful from an interview perspective... please make it visible
Damn, I did a project exactly like this during my internship at Amazon.
Can you make an end-to-end project using Microsoft Fabric? And please make more end-to-end projects like this.
Hi, sure, I am already looking into Fabric, you can expect a video in the near future. Thanks for understanding :)
Hi Mr. K,
I also worked on a similar migration project, where we migrated data from an on-prem SQL Server to Azure Data Lake Gen2. We had already transformed the data in SQL Server as per the business requirements and then copied it to Data Lake Gen2 using activities in ADF.
In this video you explained the lakehouse architecture, which I was not aware of when I worked on that project.
So I have a small doubt:
Since we transformed the data in SQL Server as per the client's requirements before migrating it to Azure, after loading it into the data lake, into which layer of the lakehouse architecture (bronze, silver, or gold) would we have copied it? And is it possible to copy data directly to the gold layer? It was my first project, so I couldn't pay attention to the details. Could you please help me understand?
Thanks in advance!
Thanks for reaching out :) If the data is already transformed and doesn't require any further transformation at all, then we can load it directly into the gold layer.
@@mr.ktalkstech : Thanks for clearing the doubt.
Awesome explanation
It's a very useful video, sir. Why did you hide those videos? 😢 Could you please provide them if possible?
Can you please let me know if I would be able to find the rest of the videos on Udemy?
Amazing, pls add AKS too
Sir, this use case doesn't make sense to me. If they want to retire the on-prem data warehouse in favor of the Azure environment, why would we keep connecting to it? For a one-time bulk load it is understandable, but for daily refreshes shouldn't the source be an OLTP system?
Still, thanks for making this playlist... it is really helpful for understanding the important Azure services.
I am not able to find the dataset for this project.
Please refer to the other comments before commenting.
Thank you so much for this content. Can you also please make a video on ADF to Snowflake?
Please share the dataset to complete this project. Really amazing videos!
How do we integrate data from sources like Salesforce, AWS, Azure data lakes, Genesys, and SAP?
Hello sir, I have a problem with the transformation in Part 6 (data transformation). I keep getting the error:
AnalysisException: Found duplicate column(s) in the data to save: ship__to__address__id, sub__total, credit__card__approval__code, ship__method, ship__date, purchase__order__number, account__number, modified__date, order__date, revision__number, tax__amt, customer__id, due__date, sales__order__number, online__order__flag, bill__to__address__id, total__due, sales__order__id
and no matter how I try to redefine the logic, it still gives the same error.
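Not having seen the notebook, one likely cause: the double underscores in the error suggest the column-rename logic inserted an underscore twice (for example, by running the rename cell more than once over already-renamed columns). A hedged, plain-Python sketch of an idempotent rename that cannot produce duplicates on a second pass; in the notebook it would be applied with something like `df = df.toDF(*renamed)`:

```python
import re

def to_snake(name: str) -> str:
    """Insert '_' only between a lower-case letter and the upper-case
    letter that follows it, then lower-case the result. The output has
    no upper-case letters, so applying it a second time changes nothing
    and can never produce double underscores."""
    return re.sub(r"(?<=[a-z])(?=[A-Z])", "_", name).lower()

cols = ["SalesOrderID", "ShipToAddressID", "TaxAmt"]  # sample names, not the full schema
renamed = [to_snake(c) for c in cols]

# Guard against saving a frame with duplicate column names.
assert len(renamed) == len(set(renamed)), "duplicate column names after rename"
```

If the frame already has the double-underscore names, restarting from the untouched silver-layer source and renaming once is simpler than patching the broken names.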
Hi sir, thank you for the video. Can you also do an 'End to End (Snowflake + Azure) Data Engineering Project'?
The project is amazing. Can I get the database with the tables you used in this project?
Thanks for the amazing video... please provide the link to the dataset so we can practice. Thanks in advance!
It's the open-source AdventureWorks database. Follow the link below to import the database into SSMS (I used the lightweight version):
learn.microsoft.com/en-us/sql/samples/adventureworks-install-configure?view=sql-server-ver16&tabs=ssms
@@mr.ktalkstech bro, what is the project name?
If I buy the Udemy course, can I do this project and gain the knowledge from it, sir?
How do we load data from the gold layer into Synapse... using ADF or Databricks?
Can you please bring more videos like this? Also, DP-203 certification guide videos.
Sure!
Thank you. Do you have a community? I want to join, please.
Hi sir, can you please advise how much it will cost to keep using Azure for this project after the free tier is over?
Thank you so much for explaining the architecture. Wonderful content 😊
I have a question though: what is the use of Azure Synapse Analytics, as we already have the gold layer with clean data? Why can't we connect a BI tool directly to the gold layer?
Can you please let me know, sir?
+1
It's a very useful video. Can you please let me know if you have any project on Hadoop data migration from HDFS to Azure SQL Server? If yes, kindly share the link.
Why do we need Synapse if Power BI can read from any Gen2 storage at the gold level?
Bro, why are the rest of the videos hidden now?
Hey bro, I have already supported you. I was charged and tried to copy the link, but by the time I accessed it, it was gone. I want access to this project.
Can you please share the title of this project?
thanks for this!!
Sir, can you please upload the dataset... I'm unable to do the project.
Please state the use case for the project.
Dear sir, really great video. I have a couple of questions on this architecture: a. What type of challenges do we face if we connect Power BI directly to Databricks to prepare dashboards? b. We can do the transformations in Synapse as well; how do we connect the gold layer to Synapse to prepare the data before connecting to the Power BI dashboards? c. What challenges do we face if we connect on-premises SQL Server data directly to Power BI to prepare the dashboards? Kindly help me with these.
Previously, Databricks didn't have a serverless DB (I believe they added it recently). Having a serverless DB for the Power BI integration is better, as we don't need to wait for the cluster to start up; it is readily available to query the tables.
@@mr.ktalkstech one more query: suppose the client does not want to go to the cloud. c. What challenges do we face if we connect on-premises SQL Server data directly to Power BI to prepare the dashboards?
Some challenges could be related to scalability.
Why did you disable the other parts, brother? I was following the tutorial :(
Please bring more using semi-structured and unstructured data.
Sure, will do that :)
Is the project OS-independent? Can anyone on Mac, Linux, or Ubuntu try it out, or is Azure only for Microsoft users?
Hi, where can I get the on-prem data? Can you share that link? It would be helpful.
Is this project available on Udemy?
Hi Mr. K,
Kindly help me figure out how to put this project on my resume. What's the best way to present this project on a resume so that we can explain everything we used in it?
Kindly tell me.
Please suggest.
I have not watched the entire video, but you can put something like 'migrated an on-premises SQL DB to Azure'.
Why do you need Databricks AND Synapse? Synapse does data transformation/loading too. Seems duplicative to me. Can you please explain? Thanks!
Yes, you are right, Synapse does both. In most cases Databricks is preferred for the data transformation; it works really well for big data workloads and streaming data.
But the main idea of using Databricks in this project is to cover as many different resources as possible in the architecture, so that it helps people understand how the resources work together. Hope that makes sense :)
Please provide the dataset you used during this project.
Can the entire project be done using the free subscription of Azure?
I think we can transfer this data using the Data Migration Service, right, if it's just a one-time load?
Hi, yes, that's right :)
@@mr.ktalkstech where is the dataset?
thanks bro
Could you please make this same project using AWS services?
Sure, will do in the future :)
Did you use PySpark in Databricks, sir?
Yes :)
Hello sir,
What should I mention as the project name on my resume?
The description?
Roles and responsibilities?
I am new to this field, so please help, sir.
Can I also do this project alongside this video? I mean, without paying anything to use Azure.
Hey, thanks for reaching out. You can create a free Azure account, which will give you a free credit of 200 dollars for a 1-year period, and you can use it to do the project if you'd like :)
azure.microsoft.com/en-us/free/
@@mr.ktalkstech it's 30 days, I think
@@mr.ktalkstech Hi, is 200 dollars enough to complete the whole project?
Do we need any subscription to build this project at any stage?
Same question
Sir, can you bring something on a healthcare project?
Sure, will do that in the future :)
Is this project doable with free resources?
Can you please tell me the project name?
Can you please suggest a good institute for an Azure data engineering course?
I am not sure about that, sorry :)
Buy your own subscription and learn one module at a time from open sources like YouTube and documentation.
Can I ask: is Microsoft Fabric basically using these same services?
Fabric contains Data Factory + Synapse + Data Lake (it does not have the other services used in this project).
👍
Sir, the dataset please?
Nice job hiding the videos. Now someone can sell your videos as a course for more than the Udemy price. You're really good at making the wrong decision.
Why did you hide the other videos, man... unsubscribed.
unsub...
Hi Kishore, you explain things simply. Superb! How can I contact you?
Thank you :) email: mrktalkstech@gmail.com
🤌🤌🤌🤌 this is perfect
Thank you so much :)
Amazing content, thanks for this video!!
Thank you so much :)
Very nicely explained.
Thank you so much :)