What is Data Pipeline | How to design Data Pipeline ? - ETL vs Data pipeline (2024)

Поділитися
Вставка
  • Опубліковано 30 сер 2020
  • What is Data Pipeline | How to design Data Pipeline? - ETL vs Data pipeline
    #datapipeline
    📢📢 Subscribe to my FREE newsletter "Normal I.T. Guy" to know more about what I am thinking, learning and doing in my career & life !!
    CLICK THE LINK BELOW 👇👇👇
    anshultiwari.substack.com
    **Do check out our popular playlists**
    1) Latest technology tutorial (2020) -
    • What is a Data Vault ?...
    2) Google Cloud Platform Beginner Series (2020) -
    • Google Cloud Platform ...
    3) Crunching Data Series (2020) - • Learn - Data Engineeri...
    Hi Friends, I am Anshul Tiwari, and welcome to our youtube channel I.T. k Funde.
    More about this video -
    In this video, we will understand what is a data pipeline with the help of a real-life example. Data pipelines are designed to move data from one point to another. In this video, we will cover below topics -
    1 - What is a Data Pipeline?
    2 - Why you need a data pipeline?
    3 - Basic design of a data pipeline
    4 - Types of Data Pipeline - Batch, Streaming, Lambda architecture
    5 - Advanced data pipeline design
    We will also learn about various products that can be used in a data pipeline - SAP BODS, Mongo DB, Apache Kafka, Big Query, MDM, Teradata, SAP Business Objects, Tableau.
    Credits & Resources -
    Wikipedia
    www.alooma.com/blog/what-is-a...
    *********************FOLLOW US ON*************************
    Facebook - / itkfunde
    Linkedin - / ansh9685
    Twitter - / ansh9685
    Blog - blogs.itkfunde.com/
    Instagram - / itkfunde
    ****************************************************************
    *****************About This Channel************************
    Friends ITkFUNDE channel wants to bring I.T related knowledge, information, career advice, and much more to every individual regardless of whether he or she belongs to I.T or not. This channel is for everyone interested in learning something new!

КОМЕНТАРІ • 1,2 тис.

  • @sivanagarajugamidi
    @sivanagarajugamidi 3 роки тому +400

    Master piece tutorial for data engineering

    • @ITkFunde
      @ITkFunde  3 роки тому +8

      Thanks Siva

    • @indexima6517
      @indexima6517 3 роки тому

      hey! don't hesitate to follow us and to take a look at our videos which deal with the same topics :)

    • @vijayjayaram606
      @vijayjayaram606 3 роки тому

      @@indexima6517 I guess the videos on ur channel deals with more on, what do we do after receiving the data, analytics if I understand correctly.
      Here, its more of pumping the data from one place to a common place, and make it available for interested people down the lane

    • @gautamdeusa
      @gautamdeusa 2 роки тому

      @@ITkFunde It's truly one of the finest and easiest video to follow and relate. Many thanks. Will check other videos.

    • @lwhieldon1
      @lwhieldon1 2 роки тому

      Thank you for breaking down concepts that are difficult to understand!

  • @nathancarranza9860
    @nathancarranza9860 3 роки тому +548

    Something I’ve noticed is that Indians are good teachers and give great illustrations. Good work. Greetings from the US.

    • @ITkFunde
      @ITkFunde  3 роки тому +68

      Thanks Nathan for making me feel even more proud of being an Indian thank you for the compliment means a lot brother 🙏😊

    • @fsfernandes20
      @fsfernandes20 3 роки тому +25

      Yes Indians like to make difficult concept easy

    • @nathancarranza9860
      @nathancarranza9860 3 роки тому +27

      I don’t use my real name online, but I do give real compliments.

    • @AutitsicDysexlia
      @AutitsicDysexlia 2 роки тому +11

      @@nathancarranza9860 Plot Twist: His real name was not Nathan. It was always Vladimir Putin.

    • @visionxx8656
      @visionxx8656 2 роки тому +12

      Can't believe Putin is from US

  • @altamashjawad6691
    @altamashjawad6691 2 роки тому +63

    Loved this video, probably the best explanation on advanced data pipeline out there. If in your next videos, maybe create a playlist which can show each of the section of this pipeline in detail with little examples using Python or any language etc. Just an idea, brilliant work!

  • @sid1r
    @sid1r 3 роки тому +3

    Thank you so much for a great and easy to understand data pipeline introduction. I love how you focus on the concepts and not jargons, as it allows for people to understand the essence of data pipeline.

  • @MrBignate12345
    @MrBignate12345 3 роки тому +33

    Please continue to create videos like these! So easy to understand. Love your visual teaching style and the examples you give.

    • @ITkFunde
      @ITkFunde  3 роки тому +1

      Thank you MrBignate...The aim is to simplify these techie jargons for everyone to correlate and enjoy learning.

  • @lcsxwtian
    @lcsxwtian 3 роки тому +3

    Simply one of the best videos on data pipeline on UA-cam. Deserves so much more attention.

  • @jananisri6214
    @jananisri6214 2 роки тому +1

    One of the best tutorials in youtube so far which gives an overview of data engineering process and that too within 10 minutes. Really appreciate your effort and time you put into making this video. Thank you so much. Please keep doing more such tutorials.

  • @ericdasse8174
    @ericdasse8174 2 роки тому +57

    That was great! As a data engineer in the making, this is the first time I have understood the concept of data pipelines so clearly. Thank you very much

    • @lukmanaliyu7386
      @lukmanaliyu7386 Рік тому

      Hello Eric, I'd love to know how it's going for you at the moment with the DE track

  • @aitemiryeskenov355
    @aitemiryeskenov355 3 роки тому +4

    Wonderful explanation. Can't tell you how much I appreciate brilliant tutorials like this one.

  • @sy-vf4js
    @sy-vf4js 3 роки тому +34

    And again, another easy-to-digest video. Thumbs up!

    • @ITkFunde
      @ITkFunde  3 роки тому

      Thank you 🙏🙏☺️

  • @infinitereach46
    @infinitereach46 День тому

    Masterclass. Great job!

  • @mitchelleleeuw2266
    @mitchelleleeuw2266 Рік тому +2

    ☺️I’m new in Data Engineering and man you created a clear picture of what I’ve been learning and trying to understand 🙂love this… definitely subscribing 🤩

  • @jamesmcmurtry5351
    @jamesmcmurtry5351 3 роки тому +6

    Great visual layout. Would love to see this applied to an ELT model with Snowflake and it's advantages/disadvantages. Possibly a suggestion on ML complementary tools like Looker and Kraken.

  • @MrBignate12345
    @MrBignate12345 3 роки тому +30

    Would love to learn more about how to choose the right frameworks/technologies for data pipelines and data warehouses/lakes for differing requirements. It would be nice to see a playlist of you designing or comparing solutions for an analytic stack.

    • @ITkFunde
      @ITkFunde  3 роки тому +10

      Thanks MrBignate I have created various playlists one of which is " Crunching Data Series "...I will surely make more videos on similar topic. It is because of encouragement from audience like you which helps me move forward so thanks and really grateful for your positive feedback.

  • @ryunosukefuriya3748
    @ryunosukefuriya3748 15 днів тому

    This was so helpful, thank you very much! Subscribed and will keep learning from you!

  • @squarehead6c1
    @squarehead6c1 3 роки тому +23

    Great intro, just what I needed. I learned the distinction between ETL and general pipe lines, and Kafka's place in the architecture.

    • @ITkFunde
      @ITkFunde  3 роки тому

      Thanks Ronnie☺️

  • @chocochipbananasplit
    @chocochipbananasplit 3 роки тому +3

    I got more out of your video than reading 5 articles on the matter! Your content is great!

  • @ci2484
    @ci2484 3 роки тому +16

    This was extremely helpful in my new job working in data. Thank you for your approachable and easy to understand teaching style!

    • @ITkFunde
      @ITkFunde  3 роки тому +1

      Thanks a lot 🙏

  • @MegaCha29
    @MegaCha29 2 місяці тому

    Excellent. Thanks for your courses

  • @rajguru1998
    @rajguru1998 3 роки тому +1

    Finally understood the pipeline in 10 mints... thank u

  • @prabur4027
    @prabur4027 2 роки тому +24

    This would be the Best start for the Data Engineers.. A clear precise and short pictorial representation of Data Pipeline (Basics). Best video so far I had seen.. 😊 Thanks.. Much Appreciated.. 👍

    • @ITkFunde
      @ITkFunde  2 роки тому +1

      Thanks Prabu 👍☺️🙏

    • @vivekjoshi3769
      @vivekjoshi3769 2 роки тому

      Do data analysts also use data pipeline creation in their jobs ? Or are they expected to know it ?
      Asking as some companies write knowledge of ETL in JDs.

    • @prabur4027
      @prabur4027 2 роки тому +1

      @@vivekjoshi3769 knowing any of the ETL tools would help in constructing the pipelines and they can visualize data flow from source to target.. Yes mostly it is used..

  • @alexanderulloaopazo6275
    @alexanderulloaopazo6275 3 роки тому +11

    Thank you! I had read a lot of papers about Data Pipeline, but I couldn't get the main idea. However, your video was so easy to understand!! Now I have a better picture of the complete process. Thanks again.

    • @ITkFunde
      @ITkFunde  3 роки тому

      Thank you Alexander !!!

  • @GoldJSilverio
    @GoldJSilverio Рік тому

    Yes I must say this is very concise and how he names the commercial vendors as examples really augments the value further.

  • @chimmenegaius-assor3767
    @chimmenegaius-assor3767 2 роки тому +2

    Great explanation for introduction to data pipelines. Thanks for clarifying the distinction between ETL and data Pipelines.

  • @mardiidking4030
    @mardiidking4030 Рік тому +8

    This topic is so complex as a beginner, but I understand this explanation so well. I didn't even have to go back in the video or rewatch it to understand. This is beautiful.

    • @ITkFunde
      @ITkFunde  Рік тому +1

      Thank you so much for your kind words and support 🙏🙏♥♥

  • @bigglesharrumpher4139
    @bigglesharrumpher4139 Рік тому +6

    Great video - it seems while technology has advanced, the concepts of batch loads and real-time data is actually decades old. Back in early 2000's we controlled all ETL and real-time loads with Unix or DOS or SQL scripts that provided return codes for success/failure which triggered alert emails, and we had KPI's for Data quality, backing-out jobs for failed loads, and many other control systems. It just seems there are more 'out-of-the-box' software to handle these now as opposed to custom-built solutions. Great presentation!

  • @JoeASalter
    @JoeASalter 3 роки тому +2

    This is such a clear and useful explanation. Thank you!

  • @marianamoscosa9747
    @marianamoscosa9747 Місяць тому

    Awesome video, thanks so much !

  • @JibrilLamai
    @JibrilLamai 3 роки тому +3

    This is a very good explanation and the best I have seen so far in my quest to understand this concept. Thank you very much. Now I can confidently visualize and explain the same concept with ease and a great understanding of it.

    • @ITkFunde
      @ITkFunde  3 роки тому

      Thanks Jibril glad it helped 🙏☺️

  • @kenford3738
    @kenford3738 3 роки тому +6

    Great job explaining the difference between Data Pipelines and ETL.

    • @ITkFunde
      @ITkFunde  3 роки тому +1

      Thanks Ken 🙏☺️

  • @haydarissa9371
    @haydarissa9371 Рік тому +1

    Very elegant way to explain data pipelining and ETL approach. I appreciate the examples given especially the master data management. Well done.

  • @arsaikia02
    @arsaikia02 2 роки тому

    Hello Anshul Sir,
    I do not have any data engineering background, but I am very strong in visualization. You have aided me in finding the missing blocks about data processing and its treatment for final consumption. Keep doing your good job. Thank you again.

  • @ramakrishnachimmani7273
    @ramakrishnachimmani7273 3 роки тому +8

    Thank you. The best way of explanation. I was looking for this kind of video for long time. As a traditional ETL developer, I questioned my self, why people are using a term called 'Data pipeline' though we have ETL process and what is the exact difference between them. Thanks again.

    • @ITkFunde
      @ITkFunde  3 роки тому

      Thanks Rama for your positive feedback !!

  • @K0n5tant
    @K0n5tant 3 роки тому +5

    Your way of explaining these concepts is excellent, thank you!

  • @puneettiwari2251
    @puneettiwari2251 3 роки тому

    Great

  • @komalsharma3856
    @komalsharma3856 2 роки тому

    You beautifully explained the basics & placements of various technologies in an architectural diagram. Very useful. Pls make more videos clarifying the basics

  • @juliansihite1289
    @juliansihite1289 Рік тому +8

    This guy really explain everything clearly and simple!
    Good job brother, keep sharing and contributing! You're a great teacher :)

    • @ITkFunde
      @ITkFunde  Рік тому

      Thanks Julian 😊❤️🙏

  • @kalyanchakri5258
    @kalyanchakri5258 3 роки тому +4

    Love your way of teaching in a simple understandable concepts. Im mad of you..!

    • @ITkFunde
      @ITkFunde  3 роки тому

      Thanks Kalyan for your feeback it helps a lot..

  • @nitinbobde3170
    @nitinbobde3170 3 роки тому

    Crystal clear explanation. Simple but effective way of explaining concepts. Excellent teacher. You rock.

  • @RolandoLopezNieto
    @RolandoLopezNieto Місяць тому

    Awesome explanation, thanks

  • @dhritimanbnrj
    @dhritimanbnrj 3 роки тому +12

    best productive 10 minutes of my life.

    • @ITkFunde
      @ITkFunde  3 роки тому

      Thanks Dhritiman for this super comment you made my day 🙏☺️

  • @brentcos9370
    @brentcos9370 3 роки тому +4

    Very informative, especially for a non-computer science guy like myself. Thanks!

    • @ITkFunde
      @ITkFunde  3 роки тому

      Thanks Brent that is the essence of this channel - Making I.T. interesting for everyone.

  • @formulaRoot
    @formulaRoot Рік тому +1

    Beautiful! Thanks for this!

  • @djmenon21
    @djmenon21 Рік тому +1

    Superb video

  • @e2xsq
    @e2xsq Рік тому +4

    Excellent high level overview Anshul, I appreciate that you differentiated between batch data and real time data with the Lambda Architecture as it seems most applicable to modern organizations. Your explanation of dashboards as consumers was also very realistic. Your video helped me better understand the general steps in the process. +1 Subscriber.

    • @ITkFunde
      @ITkFunde  Рік тому

      Thanks Matthew for supporting ❤️

  • @Gridblue
    @Gridblue Рік тому +3

    Thank you for the video, I learnt what data lake hydration projects are, my previous company had no proper KT, I struggled to grasp what I was doing. This was very nicely explained and cleared the doubts that I had.

  • @gilbertsibomana3081
    @gilbertsibomana3081 3 роки тому +1

    Thanks for the video, I like how everything is clear.

  • @georgesmith9178
    @georgesmith9178 Рік тому

    That was incredible. Thank you so much for this crisp, yet encompassing explanation of a data pipeline. I feel I totally understand it after watching for only 10 min.

  • @mikebrooks4182
    @mikebrooks4182 2 роки тому +3

    Thanks for a great overview of how the Lambda architecture can expedite the delivery of data to data consumers. For future videos, it would be helpful to map this to the roles, responsibilities, and skill requirements needed to manage this environment.

    • @ITkFunde
      @ITkFunde  2 роки тому +1

      Thanks Mike for suggestion will try to add this

  • @Manoj419419
    @Manoj419419 3 роки тому +3

    Great explanation and examples used. Thanks a ton !!

  • @gurpreetsmatharu
    @gurpreetsmatharu 2 роки тому +1

    Very well explained session on Data Pipeline and comparison with traditional ETL. Thanks so much!

  • @tektauron
    @tektauron 3 роки тому +2

    GREAT explanation, wow. Thank you so much dude! This was incredibly useful. I've subscribed!

  • @vox-populi-vox-dei-
    @vox-populi-vox-dei- Рік тому +3

    I think, you'ver covered it all as part of Enterprise Data Architecture. This is from the technology view and bit of functional view as well. Good job!

  • @maelherbert321
    @maelherbert321 3 роки тому +4

    Really content. Bravo from France 👏👏👏

  • @sharmilanadgir5030
    @sharmilanadgir5030 2 роки тому

    Thank you for this simple and clear explanation of data pipeline. Now I have a clear picture of how data flows from consumer to producer

  • @jessicafb5398
    @jessicafb5398 3 роки тому +2

    This was very helpful, thank you!

  • @knorth2386
    @knorth2386 3 роки тому +3

    Hi Anshul, your video was helpful. I have experience with ETL but didn't know that it was a specific type of data pipeline. Thanks for showing the different type of systems and technologies used for the concept visual that you explained with.

    • @ITkFunde
      @ITkFunde  3 роки тому +1

      Thank you Kyle coming from an experienced guy means a lot. Hoping for continued support !!

  • @KolawoleAdekoya
    @KolawoleAdekoya Рік тому +3

    Simplified and clear explanation of the concepts. Great diction and presentation. Well done!

  • @obiradaniel7391
    @obiradaniel7391 2 роки тому +1

    Thank you very much, very elaborate and concise, this import for everyone in the technical data cycle, data engineer, analyst, administrator and data scientist.

  • @nu7493
    @nu7493 Рік тому +1

    Great job man! Very straight to the point and very informative. Thank you so much!

  • @ravinduabeygunasekara833
    @ravinduabeygunasekara833 2 роки тому +3

    This is superb!. I am very strange to Data Engineering, and this video gave me a super insight! Keep up the good work

    • @ITkFunde
      @ITkFunde  2 роки тому

      Thanks Ravindu ☺️

  • @hussamcheema
    @hussamcheema 3 роки тому +3

    Excellent Explanation. Keep making more videos regarding Data Engineering, AI, and Data Science.

    • @ITkFunde
      @ITkFunde  3 роки тому

      Thanks a lot mate for your feedback and suggestion!!

  • @bimpenwoko8229
    @bimpenwoko8229 Місяць тому

    Thank you, that was easy to digest

  • @sandeepbhardwa9719
    @sandeepbhardwa9719 2 місяці тому

    Excellent Video, clear all doubts in one go. Keep sharing.

  • @sourabhsuri8812
    @sourabhsuri8812 3 роки тому +12

    Thank you so much brother, for clarifying some of the concepts.. Truly appreciate it. Can you suggest - Which way is the Tech Heading now - Data Warehouse Vs. Data Lake? Are DWH a thing of past?

    • @ITkFunde
      @ITkFunde  3 роки тому +13

      Thanks Sourabh, DWH is here to stay its not going anywhere. Today data world has become enormously huge and there is space for DWH and DL to co exist also Datalake can not solve all business problem. There is a hybrid approach coming up wherein you have your DWH on top of your Datalake

    • @vivek1joshi
      @vivek1joshi 2 роки тому +1

      Data Mesh

  • @ChanceMinus
    @ChanceMinus Рік тому

    Brilliant! Thank you.

  • @rajivjani8594
    @rajivjani8594 Рік тому

    Superb! In 10 minutes, you have put such a clear picture of data pipeline in my mind that I will never forget! Many thanks for your time and sharing this valuable piece!

  • @sofiyaj.3011
    @sofiyaj.3011 2 місяці тому

    Such a good video, so well explained! Amazing job!

  • @edsonsabino
    @edsonsabino 2 роки тому +1

    Great! The part that I liked the most was the one in wich he explained the difference between ETL and data pipeline

  • @TheSocialDrone
    @TheSocialDrone 3 роки тому

    An excellent and yet simple explanation of the topic! Thanks a lot for creating and uploading!

  • @sam007sd
    @sam007sd 3 роки тому +1

    Very well explained with a perfect example in the end.

  • @eliaszeray7981
    @eliaszeray7981 2 роки тому

    clear! Thank you.

  • @vandanagupta2137
    @vandanagupta2137 4 місяці тому

    best video ever to learn from, it precisely helps me to understant this topic. just lovedd it. just go for it without a second thought... i can asure uh.

  • @vigneshvicky3481
    @vigneshvicky3481 3 роки тому

    Ua very clear wid d subject. No hurry, no messy... Content delivery is ultimate... Clarified all my misunderstandings ... kudos 😍😍

  • @darinacherepanova6930
    @darinacherepanova6930 Рік тому

    It’s eyes opening and matching pieces in my head into logic, really thankful !

  • @tulajagautam4284
    @tulajagautam4284 2 роки тому

    Amazing!!

  • @Nmmmmmmy
    @Nmmmmmmy 3 роки тому

    Thanks for the video - found the comparison to an actual water pipeline very intuitive. Really enjoyed the video !

  • @hasmilaomar5562
    @hasmilaomar5562 2 роки тому +1

    It is good that u explain the concept of data pipeline by referring to water pipeline. So much easier to understand and remember. Thank you for your video!!

  • @Troy20111000
    @Troy20111000 2 роки тому

    Phenomenal work

  • @gajini1001
    @gajini1001 2 роки тому

    Simple, easy introduction. Thanks for your patience to explain the concepts.

  • @Alebobb
    @Alebobb Рік тому +1

    That was just great, thanks

  • @TheGhumanz
    @TheGhumanz 2 роки тому

    Very neat and simplified approach. Cleared my doubts about the need for data pipeline vs ETL. Thanks for sharing!

  • @jackieyou445
    @jackieyou445 3 роки тому

    There is a lot of value in explaining technology in lay man terms like this. Thanks very much!

  • @benstirling9324
    @benstirling9324 2 роки тому

    Fantastic. Thanks a bunch!

  • @ajeet23790
    @ajeet23790 3 місяці тому

    Your teaching technique is amazing. Thank you for sharing the knowledge on data pipeline. My all doubts related to data pipeline is clear now.

  • @commonman1271
    @commonman1271 2 роки тому +1

    Excellent explaination

  • @Rick1972mx
    @Rick1972mx 10 місяців тому

    Great Video !

  • @davidwallace5814
    @davidwallace5814 Рік тому +1

    Thank you, well done.

  • @jessechichi5609
    @jessechichi5609 4 місяці тому

    This is great stuff, welldone, keep it up!!!

  • @empressbelless3232
    @empressbelless3232 2 роки тому

    This is meant to be a compliment. I appreciate how articulate your English is with each word you speak! Easy to listen to!

  • @terminatorgamer97
    @terminatorgamer97 Місяць тому

    Great Video

  • @EMPELZIP
    @EMPELZIP 2 роки тому

    Great vid!

  • @DanishAnsari-hw7so
    @DanishAnsari-hw7so 2 роки тому +1

    Such an awesome explanation, short, crisp and to the point. Great!

  • @arunachalampalani4321
    @arunachalampalani4321 10 місяців тому +1

    Couldn't have asked for more. Very well explained, Thank you mate.

  • @MarcosDF_DPF
    @MarcosDF_DPF 2 роки тому

    Thanks for share!

  • @polinemuthaiga2752
    @polinemuthaiga2752 9 місяців тому +1

    You are a wonderful teacher!!!!!!!

  • @talentzunlimited1398
    @talentzunlimited1398 Рік тому

    Awesome Video!

  • @issamfakhari3152
    @issamfakhari3152 3 роки тому +1

    Great explication!!!!

  • @parryismful
    @parryismful 2 місяці тому

    Super Awesome video. Very well done !

  • @fahimahyder2394
    @fahimahyder2394 3 роки тому

    Thanks so much! That was so thorough!