Part 1- End to End Azure Data Engineering Project | Project Overview

Поділитися
Вставка
  • Опубліковано 29 гру 2024

КОМЕНТАРІ • 154

  • @mr.ktalkstech
    @mr.ktalkstech  24 дні тому +3

    Thank you for watching! If you found Part 1 valuable and want to dive deeper, the full tutorial is available on Udemy.
    ▶ Get the Full Course on Udemy -> www.udemy.com/course/end-to-end-azure-data-engineering-real-time-project/?referralCode=626B44A4C9AA848ACB53
    Thank you for supporting my work, and I’m excited to help you continue your learning journey!

    • @TheRawFootages
      @TheRawFootages 23 дні тому +3

      why did you hide your content sir? I thought you are the only teacher who help poor students like us providing best content in free.

    • @mr.ktalkstech
      @mr.ktalkstech  23 дні тому +2

      I sincerely apologize for this situation. Unfortunately, due to Udemy's policies, I had to remove the content. Thank you so much for your understanding and continued support.

  • @seedhiBaatNoBakwas.
    @seedhiBaatNoBakwas. 5 місяців тому +10

    Great playlist for someone who has zero knowledge on ETL/AZURE. Good to clear fundamentals of azure resources

  • @pavankulkarni352
    @pavankulkarni352 2 місяці тому +1

    This is the cleanest explanation I have ever come across on azure.

  • @RukshanEdirisinghe-v9q
    @RukshanEdirisinghe-v9q 4 місяці тому +7

    this helped me to find my job. Thank you

  • @madhavtn7947
    @madhavtn7947 Рік тому +7

    I just started doing projects on data engineering and to be honest, this series needs to be on top results. Very useful content and easily understandable to newbees. Eagerly waiting for new projects using new tools and cloud services

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому

      Thank you so much :)

    • @BOSS-AI-20
      @BOSS-AI-20 Рік тому

      @@mr.ktalkstech Hello Sir, can I have your linkedIn Id

    • @blackspring4605
      @blackspring4605 7 місяців тому

      Do you if they are all open source ?

  • @prabhatgupta6415
    @prabhatgupta6415 Рік тому +6

    wow what a explanantion ..Huge respect.
    keep doing u will have good followers soon

  • @jabalraji7883
    @jabalraji7883 Рік тому

    Am a starter in DE....your illustration is awesome and I have subscribed to page for more updates...

  • @ajinkyadhoke4713
    @ajinkyadhoke4713 Рік тому +1

    Excellent Explanation...🔥🔥🔥🔥

  • @sandeepkumar0612
    @sandeepkumar0612 2 місяці тому

    this video is savior for new aspirants

  • @prabhatgupta6415
    @prabhatgupta6415 Рік тому +3

    Plz plzz bring more.. U teach very well

  • @kiruthikal5910
    @kiruthikal5910 6 місяців тому +1

    Excellent content brother

  • @shyamsunderMerugu
    @shyamsunderMerugu Рік тому +1

    Excellent....Superb tutorial. Fantastic explanation in a nut shell...

  • @IceMan299-kj5
    @IceMan299-kj5 10 місяців тому

    Hi Mr.K your lessons give the view of the roles of Data Engineering. I really appreciate your videos and would like to thank you sir. May God bless you and your family.

  • @jansenoliveira2823
    @jansenoliveira2823 Рік тому +3

    Amazing content. Congrats and thanks!

  • @vamshiikrishna
    @vamshiikrishna Рік тому +1

    Please carry with more viedos your knowledge sharing is helping us a lot🙏

  • @dhruba454
    @dhruba454 5 місяців тому

    Amazing content, Thank you for sharing this video series..

  • @pradeeppeace4541
    @pradeeppeace4541 6 місяців тому

    Thank you for explaining concept simple with presentation.

  • @Win_whatsimportantnow
    @Win_whatsimportantnow 5 місяців тому

    This video series is a game changer for me

  • @bhavindedhia3976
    @bhavindedhia3976 Рік тому +1

    You are really amazing seriously waiting for more such projects

  • @charankatta
    @charankatta Рік тому +4

    hi, great tutorial and indeed good learning for starters as me. Can you also please make end to end azure data engineering real time project with continuous data stream & readily available big data (so that we can readily download from your link). It would be of great help for us.

  • @chubsmash7602
    @chubsmash7602 6 місяців тому

    Thank you for these videos, really appreciate the time and efforts.

  • @justvenkyy...3423
    @justvenkyy...3423 11 місяців тому

    such a good explanation. great work. please post on complex challenges that faced by data engineers and its solutions.

  • @tao-adl
    @tao-adl 7 місяців тому

    Awesome, some key concepts finally clicked in my brain. Great breakdown!

  • @passions9730
    @passions9730 Рік тому

    very good session...thanks for brining this project. subscribed to channel by seeing the content..

  • @Dataenginner
    @Dataenginner Рік тому +5

    The concept you just showed in 11 mins is more worth then others playlist 😂, good to be ur subscriber man ❤ please keep making videos and help student

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому

      Thank you so much for the biggest compliment :)

    • @jayanttiwari3762
      @jayanttiwari3762 Рік тому

      @@mr.ktalkstech even i feel the same. Bhai your concepts are very clear, awesum videos.

  • @mansouralshamri1387
    @mansouralshamri1387 7 місяців тому +7

    Why do we use Databricks? Azure Synapse Analytics does ETL.

    • @kenamia9136
      @kenamia9136 2 місяці тому +2

      Perhaps He wants to expose you to as many tools as possible

  • @Manohar-q7k
    @Manohar-q7k 2 місяці тому +4

    In real world, if we take similar setup, may I know what would be the reason for using Databricks instead of Data Factory for the transformation of the data between the layers?

    • @AmanSingh-ig1en
      @AmanSingh-ig1en Місяць тому

      Although we can use dataflow in adf for transformation but it is easy to use pyspark with dataframe and all for transformation and pyspark is fast also. And one more thing we most use adf for orchestration

  • @helovesdata8483
    @helovesdata8483 Рік тому +1

    we are using the Medallion architecture at my job now.

  • @vps071
    @vps071 Рік тому +1

    great informative video! quick question..why is Synapse analytics needed? Can't PowerBi directly get feed from the gold layer in datalake?

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому +1

      Thank you so much :) We can connect directly from Data Lake as well- but its always recommended to use a structured database as a serving layer for reporting which will be scalable and handling the security will be simpler :)

  • @Badr_ouz
    @Badr_ouz 7 місяців тому

    Good explaination

  • @sowmyakotapally6677
    @sowmyakotapally6677 Місяць тому

    hi,
    Can u make Video to cover Azure and Spark relates interview questions and answers wrt to real time scenarios focusing on optimization done in specific for the use case and not the general methadologies.
    These are the questions I was asked recently.
    1) How do u recover a corrupt parquet data file
    2) U have millions of records in bronze layer and after transformations u have 50 million records in gold layer.
    U find that there are corrupt files in only one partition at the gold layer.
    How will u recover the file of that particular partition without rerunning the entire pipeline because we have millions of rows in both bronze layers
    3) What are the actual optimization done in project by you to achieve a) Execution time optimization b) Join level optimization
    Interviewer did not want generic answers which we know or would have read theortically.
    He wanted in specific How i implemented in the project
    Please do video with such tricky questions

  • @RAHULSHINDE-ky5si
    @RAHULSHINDE-ky5si 12 днів тому

    Hi sir, why have you kept this playlist hidden? It's very useful from an interview perspective... please make it visible

  • @sidsrivastava6987
    @sidsrivastava6987 10 місяців тому

    Damn i did an exact project like this in my internship at Amazon

  • @Dipsvloggermany2021
    @Dipsvloggermany2021 Рік тому +3

    Can you make an end to end project using Microsoft Fabric ? And please make more end to end to project like this

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому +1

      Hi, sure, I am already looking into Fabrics, you can expect the video in the near future, thanks for understanding :)

  • @rasikakurhade1011
    @rasikakurhade1011 11 місяців тому +2

    Hi Mr.K,
    I have also worked on the same migration project wherein we migrated data from on prem sql server to azure data lake gen2. We have already transformed data into SQL server as per the business requirement and then copied it to data lake gen2 using activities in ADF.
    In this video you explained about lake house architecture which I was not aware earlier when I worked on this project.
    So I have a small doubt:
    As we transformed data already before migrating it to azure as per the client requirements in SQL server then after loading it to the azure data Lake, in which layer of lake house architecture it would have been copied by us among bronze, silver and gold? And is it possible to copy data directly to gold layer? It was my first project so I couldn't pay attention to more details, could you please help me understand about it.
    Thanks in advance!

    • @mr.ktalkstech
      @mr.ktalkstech  10 місяців тому +1

      Thanks for reaching out :) If the data is already transformed and it doesn't require any further transformation at all- then we can load directly to the Gold layer.

    • @rasikakurhade1011
      @rasikakurhade1011 9 місяців тому

      @@mr.ktalkstech : Thanks for clearing the doubt.

  • @AltafAnsari-tf9nl
    @AltafAnsari-tf9nl Рік тому

    Awesome explanation

  • @dommarajuchaitanya7284
    @dommarajuchaitanya7284 10 днів тому

    It's very use full video sir . Why your hide those videos 😢 . could you please if possible provide those videos.

  • @dommarajuchaitanya7284
    @dommarajuchaitanya7284 10 днів тому

    Can you please let me know could be able to find rest if videos in Udemy?

  • @seeemant
    @seeemant Рік тому

    Amazing, pls add AKS too

  • @atulbisht9019
    @atulbisht9019 3 місяці тому +2

    Sir this usecase doesn't make sense. They would want to eliminate/cut the on prem data warehouse to azure environment then why wiill we be connecting to it. For one bulk loat it is understandable but for daily refreshes the source should be an OLTP system?
    Still thanks for making this playlist...it is really helpful to understand important azure services.

  • @azureportol
    @azureportol Рік тому +6

    I am not able to find data set about this project

    • @sayantanpodder2478
      @sayantanpodder2478 10 місяців тому +1

      Please refer other comments before commenting

  • @hashmatsulthana
    @hashmatsulthana Рік тому

    Thank you so much for this content .can you also please bring up video for ADF to snowflake?

  • @sonusolanki9927
    @sonusolanki9927 5 місяців тому

    Please share dataset to complete this project, really amazing videos

  • @dp9794
    @dp9794 6 місяців тому +1

    How to integrate data from sources like salesforce, AWS, Azure data lakes, Genesys, SAP

  • @sergendula3256
    @sergendula3256 21 день тому

    hello sir i have a problem with the transformation in part 6(data transformation) i keep getting the error
    AnalysisException: Found duplicate column(s) in the data to save: ship__to__address__id, sub__total, credit__card__approval__code, ship__method, ship__date, purchase__order__number, account__number, modified__date, order__date, revision__number, tax__amt, customer__id, due__date, sales__order__number, online__order__flag, bill__to__address__id, total__due, sales__order__id
    and the more i try to redefine the logic it still gives thesame errors

  • @raghuprasad3920
    @raghuprasad3920 Рік тому

    Hi Sir, Thank you for the video can you also do a 'End to End (Snowflake + Azure) Data Engineering Project' ?

  • @UjjwalDhiman-lm5pj
    @UjjwalDhiman-lm5pj 6 місяців тому

    Project is amazing, can I get the database with tables you used in this project

  • @nuzhatnsu
    @nuzhatnsu Рік тому +1

    thanks for providing an amazing video... please provide the link to the dataset so we can practice.. thanks in advance

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому

      It's an open source Adventure works database- follow the below link to import the database to the SSMS (I used the light weight version)
      learn.microsoft.com/en-us/sql/samples/adventureworks-install-configure?view=sql-server-ver16&tabs=ssms

    • @Mehtre108
      @Mehtre108 Рік тому

      ​@@mr.ktalkstechbro what is project name

  • @arunrs425
    @arunrs425 4 дні тому

    if i buy udemy i can do and get the knowledge of this project sir?

  • @ShanumUmaira-l3f
    @ShanumUmaira-l3f 3 місяці тому

    How do we load data from gold layer to synopsis...using ADF? or data bricks?

  • @pandeyvivak8223
    @pandeyvivak8223 Рік тому +1

    can you please bring more videos like this. Also DP203 certification guide videos.

  • @DrayCool-df8kj
    @DrayCool-df8kj 2 місяці тому

    Thank you. Do you have a community ? I wanna join please.

  • @likhim
    @likhim Рік тому

    Hi Sir can u pls advise after free tier over how much cost it will come to use azure for learning this project

  • @rammik1494
    @rammik1494 8 місяців тому

    Thank you so much for explaining the architecture. Wonderful content 😊
    I have a question though- what is the use is azure synapse analytics as we already have gold layer with clean data. Why can’t we connect bi tool directly to gold layer?
    Can you please let me know sir?

  • @nallakumarp2886
    @nallakumarp2886 6 місяців тому

    Its very useful video . Can you please let me know if you have any hadoop data migration from hdfs to Azure sql server project . if Yes kindly share the link

  • @venzotv1976
    @venzotv1976 Рік тому

    Why do we need Synapse if PowerBI can read from any Gen2 storage at Gold level?

  • @abhishekkalia6990
    @abhishekkalia6990 22 дні тому

    bro why rest of the videos are hidden now?

  • @abhishekkalia6990
    @abhishekkalia6990 22 дні тому

    hey bro i have already supported you. I being charged and tried to copy the link but at the time of access it has gone. i want this project access

  • @Chennairthymes
    @Chennairthymes 2 місяці тому

    Can you please share the project title for this project

  • @UnrealK9999
    @UnrealK9999 Рік тому

    thanks for this!!

  • @abhaybhatnate7428
    @abhaybhatnate7428 11 місяців тому

    Sir can you please upload the data set plz...... unable to do the project

  • @AngamVijaykumar
    @AngamVijaykumar Місяць тому

    Please say the use case for the project

  • @ranjansrivastava9256
    @ranjansrivastava9256 Рік тому +1

    Dear really great video. I have couple of questions on this architecture a. Which type of challenges do we face if we connect Power BI to Databricks directly to prepare dashboards. b. We can do the transformations in Synapse as well , and how do we connect Gold Layer to Synapse to prepare the data before connect to the Power BI dashboards. c. What challenges do we face if we connect On-Premises SQL Server data to Power BI directly to prepare the dashboards. Kindly help me on that.

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому +1

      Previously databricks doesn't have a serverless DB (I guess they recently added it)- having serverless DB to Power BI integration will be better as we don't need to wait for the cluster to turn on as it will be readily available to query the tables.

    • @ranjansrivastava9256
      @ranjansrivastava9256 Рік тому +1

      @@mr.ktalkstech one more query was there like:- suppose client does not like to go on cloud :- c. What challenges do we face if we connect On-Premises SQL Server data to Power BI directly to prepare the dashboards.

    • @alaricmbooh3628
      @alaricmbooh3628 Рік тому +1

      Some challenges could be related to scalability

  • @moizmirza9179
    @moizmirza9179 23 дні тому

    why did you disabled other parts brother, I was following the tutorial :(

  • @ricardogomes4077
    @ricardogomes4077 Рік тому

    plzz bring more using semi-structured and unstructured data

  • @DipuApple
    @DipuApple 10 місяців тому

    is the project OS independent ? like any1 using mac linux ubuntu try it out ? or azure is only for Microsoft ?

  • @saikumarjakki3802
    @saikumarjakki3802 11 місяців тому

    HI where i can get the on prem data can u share that link it will be help full

  • @shravyakulal5756
    @shravyakulal5756 Місяць тому

    Is this project available in udemy?

  • @zahidalam7831
    @zahidalam7831 9 місяців тому

    Hi Mr k,
    Kindly help me out how to put this project in our resume. Whats the best way to present this project into resume so that we
    can explain the thing whatever we used in this.

    • @zahidalam7831
      @zahidalam7831 8 місяців тому

      Kindly tell me

    • @zahidalam7831
      @zahidalam7831 8 місяців тому

      Plz suggest me

    • @pavankumard5276
      @pavankumard5276 3 місяці тому

      I have not watched the entire video but you can put something like migrated on premise sql db to azure

  • @udaynj
    @udaynj Рік тому

    Why do you need DataBricks AND Synapse? Synapse does data transformation/loading also. Seems duplicative to me. Can you pls explain? Thanks

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому +4

      Yes, you are right- synapse does both- in most cases Databricks is preferred for doing the data transformation, which works really well for the big data workloads and with the streaming data.
      But the main Idea of using databricks for this projects is to cover different resources as possible in the architecture, so that it would help people to understand how each resources works together. Hope that makes sense :)

  • @atharvbajare7398
    @atharvbajare7398 4 місяці тому

    please provide me dataset you have used during this project

  • @anishsaha1777
    @anishsaha1777 5 місяців тому

    Can the entire project be done by using the free subscription of Azure?

  • @PavanKalyan-ec9mw
    @PavanKalyan-ec9mw Рік тому

    i think we can transfer this data using data migration service right, if it's just for one time.

  • @adityadhawle6735
    @adityadhawle6735 8 місяців тому

    thanks bro

  • @nitikjain993
    @nitikjain993 Рік тому

    Could you please make this same project in using AWS services?

  • @Mehtre108
    @Mehtre108 Рік тому

    Did pyspark use in databrics sir

  • @Mehtre108
    @Mehtre108 Рік тому

    Hello Sir,
    What should i mention project name on resume
    Description
    Roles n responsibilities

    • @Mehtre108
      @Mehtre108 Рік тому

      I am new in this field so pls help sir

  • @ashabhumza3394
    @ashabhumza3394 Рік тому

    Can I also do this project along side this video? I mean without paying anything for using Azure.

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому +1

      Hey. thanks for reaching out. You can create a free azure account which will give you free credit of 200 dollars for 1 Year period, and you can use it to do the project if you would like to :)
      azure.microsoft.com/en-us/free/

    • @rajkumarbandi6195
      @rajkumarbandi6195 Рік тому

      @@mr.ktalkstech its 30 days I think

    • @AbhishekParmar-gy3fz
      @AbhishekParmar-gy3fz 11 місяців тому

      @@mr.ktalkstech Hi is 200 dollars enough to complete the whole project?

  • @mansinayak3360
    @mansinayak3360 6 місяців тому

    Do we need any subscription to build this project at any stage?

  • @prabhatgupta6415
    @prabhatgupta6415 Рік тому

    SIR CAN YOU BRING SOMETHING ON HEALTH CARE PROECT

  • @satyajeetdesai6076
    @satyajeetdesai6076 2 місяці тому

    is this project with free resources?

  • @Chennairthymes
    @Chennairthymes 2 місяці тому

    Can you please say me the project name

  • @kanthikumar122
    @kanthikumar122 Рік тому

    Can you please suggest good institute to learn azure data engineer course

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому

      I am not sure about that, Sorry :)

    • @Ady_Sr
      @Ady_Sr Рік тому

      Buy ur own subscription. Learn 1 module at a time from open sources like youtube n documents.

  • @beniffland7310
    @beniffland7310 Рік тому

    Can I ask is Microsoft Fabric basically using these same services?

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому

      Fabric contains Data Factory + Synapse + Data Lake (It does not have other services used in this Project)

  • @gautamgovinda5140
    @gautamgovinda5140 Рік тому

    👍

  • @TheAdventureArchive1
    @TheAdventureArchive1 11 місяців тому

    sir datasets

  • @DipakChavan-g7l
    @DipakChavan-g7l 5 днів тому

    Good for that hidden videos. Now someone can sell your videos as course more than udemy price. your dam good to took wrong decision.

  • @MDMODASSIRALAM-v6n
    @MDMODASSIRALAM-v6n 13 днів тому

    why u hide others video man-----unsubscribe

  • @portiseremacunix
    @portiseremacunix 20 днів тому

    unsub...

  • @chamarthysowjanya
    @chamarthysowjanya Рік тому

    Hai Kishore u r explaining simply Superab how can I contact u

    • @mr.ktalkstech
      @mr.ktalkstech  Рік тому

      Thank you :) email: mrktalkstech@gmail.com

  • @carterh7470
    @carterh7470 9 місяців тому +1

    🤌🤌🤌🤌 this is perfect

  • @karthireddy5838
    @karthireddy5838 9 місяців тому

    Amazing content, thanks for this video!!

  • @sureshk8882
    @sureshk8882 6 місяців тому

    very nice explained