Learn Apache Spark in 10 Minutes | Step by Step Guide

Поділитися
Вставка
  • Опубліковано 26 гру 2024

КОМЕНТАРІ • 289

  • @AxelNtwari
    @AxelNtwari 4 місяці тому +55

    Bro cooked. From the history, to the technical design and demo! Hats off!

  • @DarshilParmar
    @DarshilParmar  Рік тому +106

    Don't forget to hit that Subscribe Button for more amazing content :)

  • @jjones40
    @jjones40 Рік тому +51

    Thanks for actually explaining spark, instead of making general comments or assuming we know the basics. Great video. Thumbs up, subscribed.

    • @Remoes02
      @Remoes02 4 місяці тому +1

      agreed. I watched like 5 videos prior to this one that made wild assumptions about what I knew

  • @thealbaniandude1997
    @thealbaniandude1997 Рік тому +32

    That was an extremely good explanation. Not only explained the theory but also practical examples.

  • @rajdeepnaha1242
    @rajdeepnaha1242 6 місяців тому +8

    00:00 Big Data and Hadoop
    01:25 Hadoop processed data in batches and was slower due to disk storage, Apache Spark solves these limitations.
    02:43 Apache Spark is a fast and efficient data processing framework.
    04:11 Apache Spark is a powerful tool for processing and analyzing Big Data.
    05:42 Apache Spark application consists of a driver process and executor processes.
    07:02 Spark data frames are distributed across multiple computers and require partitioning for parallel execution.
    08:24 Spark transformation block will give the final output.
    09:40 Spark allows the conversion of data frames and the execution of SQL queries on top of it.

  • @krneki6954
    @krneki6954 Рік тому +13

    best explanation on spark in 10 minutes. its like feynman explaining physics. excellent job!

  • @aparnapandian13
    @aparnapandian13 26 днів тому +1

    I had understood the concept clearly within 10 min . Now I had a great understanding and knowledge about Apache Spark . This is best Spark video I had gone through . Its clear and top notch Explanation about each of the topics .

  • @sureshlira6307
    @sureshlira6307 Рік тому +21

    I never knew I could recall so much in just under 10min...
    Wonderful content and well explained keeping it simple...

  • @Rahul-fq9kf
    @Rahul-fq9kf 9 місяців тому +1

    You are doing a fabulous job of making Data analytics so easy for everyone. Thank you so very much. God bless you!

  • @sajaljaiswal1659
    @sajaljaiswal1659 3 місяці тому +2

    This was insanely good. Thanks for explaining the basics so clearly. Now I can learn deeper more comfortably.

  • @fatihkeskin5867
    @fatihkeskin5867 Рік тому +22

    I was waiting for this. Please share an end to end project using Spark.

    • @DarshilParmar
      @DarshilParmar  Рік тому +4

      Yes

    • @Rider-jn6zh
      @Rider-jn6zh Рік тому +4

      Waiting for the same...right from spark installation on local as well as on cloud platform

    • @sumant542
      @sumant542 Рік тому

      Please upload ASAP.

    • @nirakarsahu4844
      @nirakarsahu4844 Рік тому

      Yes, if possible can you please also share using pyspark as well..

  • @kranthikumarnagothu3056
    @kranthikumarnagothu3056 Рік тому +16

    Such a nice content!
    What a man you are!
    You have covered everything in spark in just 10 mins. I wonder how you made this video and the effort u put in to make this video is wonderful. Thank you for sharing nice content in such a simple manner!!

  • @devarapallivamsi7064
    @devarapallivamsi7064 8 місяців тому +3

    I usually be off from content titled learn/master/excel X in Y minutes. would have definitely done the same had I came accross this by myself. Watched it only because my frd shared to me. Now I feel that I am lucky after watching this as I could wrap my head around SPARK.
    Subscribed.

  • @prateeksachdeva1611
    @prateeksachdeva1611 10 місяців тому +2

    The best Spark tutorial I have ever gone through. Thanks a lot Darshil.

  • @tortingseloc1810
    @tortingseloc1810 Місяць тому

    The first very clear video about spark that I have seen.

  • @mathewsignatious2838
    @mathewsignatious2838 4 місяці тому +1

    Excellent explanation-clear, concise, and straight to the point.

  • @shobogenzo
    @shobogenzo 10 днів тому

    As many other already said, tantastic and informative video on Spark. Nice context by providing the history of Hadoop. Nice pace too, not to fast, not too slow!

  • @youhavetodoit9070
    @youhavetodoit9070 4 місяці тому +3

    Thanks For Explaining in 10 Min 🙌

  • @hey_jaski
    @hey_jaski 25 днів тому

    crystal clear explanation! loved it❤

  • @manyumungara1081
    @manyumungara1081 10 місяців тому +1

    I didn't understand apache spark since my undergraduate until I found this gem.

  • @newbie8051
    @newbie8051 16 днів тому

    Saw a bunch of your roadmap videos back in my freshman year, and now back here prepping for my DS internship, thanks !
    The job description had spark/mapreduce which brings me here : )

    • @newbie8051
      @newbie8051 16 днів тому

      I tried to replicate the code block at 10:13,
      Can we use tips.filter(filterA & filterB), this applied both filters at the same time and does not create intermediate results
      tips.filter(filterA) will create some dataframe, which will be filtered by another filterB
      Please correct me if I'm wrong
      thanks !

  • @jyotikothari499
    @jyotikothari499 Рік тому +3

    Apache Spark -- explained core concept in such a simple language..
    Wonderful job 👍👍👍

  • @JacquelineSanchez-yh3dy
    @JacquelineSanchez-yh3dy 2 місяці тому

    I'm just getting started with creating a group CNN project with friends and we are dealing a huge dataset of mri scans so I was thinking about platforms that could deal with lots of data without having to deplete my disk lol. Thank you so much for breaking down how Apache works compared to Hadoop, I really appreciate it! 😊

  • @jeevanb8623
    @jeevanb8623 8 місяців тому

    superb man.. didn't waste the time.. great explaination..

  • @JaydeepRana-j1c
    @JaydeepRana-j1c 22 дні тому

    Thanks for this video , much informative and easy to understand using the examples you gave.

  • @elara5122
    @elara5122 2 місяці тому

    What a video really understood the apache spark that i could not in my university.

  • @njokiwambui3447
    @njokiwambui3447 Рік тому +3

    Thanks for this.Currently reading spark definitive guide.Looking forward to full tutorial

  • @noob523
    @noob523 23 дні тому +1

    Nice video

  • @vijaybhargavkarnati9926
    @vijaybhargavkarnati9926 26 днів тому

    Explained and the presentation is good.

  • @PriyanshuVerma-kv8lp
    @PriyanshuVerma-kv8lp Рік тому +1

    I really understand the software really quickly, thanks man

  • @lamiranda100
    @lamiranda100 Рік тому +1

    amazing explanation!! Thank you!

  • @asp1201
    @asp1201 Місяць тому

    You nailed it man! Amazing information that i am using for my DE interviews

  • @ag564
    @ag564 6 місяців тому

    Very good intro to Spark. I've started my data science journey and it really helps.

  • @xfarooqi
    @xfarooqi 3 місяці тому

    So in just 10 mins, I get to know about Big Data, Hadoop, Spark, Pyspark and how I can write code in Pyspark.
    Wow, that's what a good explanatory should be like!

  • @oo7ankit
    @oo7ankit 2 місяці тому

    Brief and informative . Thanks 👍

  • @krupakarjeeru1061
    @krupakarjeeru1061 7 місяців тому

    You nailed it Bro in just 10 mins 😊

  • @AnalyticsByHenry
    @AnalyticsByHenry 9 місяців тому

    Impressive explanation of spark. Making it easy for every beginner to understand.

  • @TahaSayyed-u2j
    @TahaSayyed-u2j 5 місяців тому

    One of the best video ! You really exxplained in very precise and esay way. Love it!

  • @JonathanBrune
    @JonathanBrune 8 місяців тому

    Great introduction. Thank you so much.

  • @FarooqKhan-d8c
    @FarooqKhan-d8c Рік тому

    Wonderfully explained in just 10 mins.

  • @VanshSingla-jp4jy
    @VanshSingla-jp4jy Рік тому +69

    Alright, but need a full tutorial on this topic, if you can.

    • @DarshilParmar
      @DarshilParmar  Рік тому +20

      Working on it!

    • @anasshaikh3885
      @anasshaikh3885 Рік тому +6

      ​@@DarshilParmarthank you please upload it ASAP

    • @ajinkyagaikwad4181
      @ajinkyagaikwad4181 Рік тому +3

      ​@@DarshilParmarplease upload

    • @adieka1633
      @adieka1633 6 місяців тому

      ​@@DarshilParmar this is a what the heroes did. Kudos to you Darshil

  • @kinjal_suryavanshi
    @kinjal_suryavanshi 3 місяці тому

    Thank you!! So helpful

  • @AayushSingh-gh5lm
    @AayushSingh-gh5lm 4 місяці тому

    Clear and concise explanation

  • @hobbyofbobby2973
    @hobbyofbobby2973 6 місяців тому

    Good job Darshil. Appreciate the work.

  • @Sky-2212
    @Sky-2212 6 місяців тому +1

    Amazing, You explained everything in detail with examples. Best video on UA-cam to know about Spark.👏

  • @ParthivShah
    @ParthivShah 3 місяці тому +1

    Nice Video, Thank You.

  • @krishkanojia2850
    @krishkanojia2850 Рік тому

    Understood video very well. Without any prior knowledge of apache spark

  • @skshareena5013
    @skshareena5013 9 місяців тому

    Super explanation bro, I got many answers in one vedio 🥳🥳

  • @mdmahirasefkabir3883
    @mdmahirasefkabir3883 4 місяці тому

    Very nice video. Thank you!

  • @sageevajoseph9579
    @sageevajoseph9579 7 місяців тому

    You explained the content simple and clear. Thank you for this video.

  • @himanshusekharmohanta2489
    @himanshusekharmohanta2489 4 місяці тому

    Really very nice explanations..

  • @darshakmehta
    @darshakmehta 5 місяців тому

    To the point, quick, simple and comprehensive knowledge sharing!

  • @Kondaranjith3
    @Kondaranjith3 Рік тому +1

    Waiting for full course from you apache spark

  • @christinachen9669
    @christinachen9669 9 місяців тому

    Wonderful summarize!

  • @TheBaBaLand
    @TheBaBaLand 9 місяців тому

    Awesome video mate! well done.

  • @sophiebarrie5749
    @sophiebarrie5749 2 місяці тому

    Thank you so much for this explanation, youve outlined it quite clearly before Ive even had any experience using Spark, so thank you! If you could slow down your explanation a bit though, that would be helpful

    • @DarshilParmar
      @DarshilParmar  2 місяці тому

      It's a 10min series, you can check out my courses for more in-depth guide

  • @imanelamnaoir590
    @imanelamnaoir590 5 місяців тому

    This explanation is very gooooooooooooooooooooooooood
    Thank u

  • @Taehyung-w3q
    @Taehyung-w3q 22 дні тому

    really good explanation

  • @Player18345
    @Player18345 Рік тому +1

    Super🎉
    Waiting for full tutorial

  • @prensudangol1547
    @prensudangol1547 4 місяці тому

    Thank u i got the basics

  • @msanghamitra
    @msanghamitra 10 місяців тому

    Very well explained! Thank you!

  • @omarcruz6326
    @omarcruz6326 3 місяці тому

    Hello Darshil,
    This is great content ! A little bit too much information, hehe. Now it should be digested :)

  • @arsh2489
    @arsh2489 2 місяці тому

    Thank you sir 👍

  • @rk-ej9ep
    @rk-ej9ep 6 місяців тому

    Nice explanation..plz do series on spark.

    • @DarshilParmar
      @DarshilParmar  6 місяців тому

      I have a course on Spark, please check description

  • @PranathiAnda
    @PranathiAnda 7 місяців тому

    Nice Explanation, Thank you

  • @ShrutiSharma-xu6qs
    @ShrutiSharma-xu6qs 3 місяці тому

    very nicely explained

  • @garimajain474
    @garimajain474 Рік тому

    Best tutorial ❤❤all in one

  • @prashantcloud
    @prashantcloud 7 місяців тому

    Very well explained , thank you very much

  • @balajirpi
    @balajirpi Рік тому

    As simple as that.. Liked

  • @lokeshnaidu6888
    @lokeshnaidu6888 Рік тому

    Very well explained😊

  • @nik54610
    @nik54610 Місяць тому

    Great video bro

  • @nirmalpandey600
    @nirmalpandey600 Рік тому

    Really productive video.

  • @hritikapal683
    @hritikapal683 Рік тому +3

    Superb one! Can we expect full tutorial over spark!?

  • @MrPavelber
    @MrPavelber Рік тому

    Great video! Thank you

  • @nancymaheshwari5421
    @nancymaheshwari5421 Рік тому

    Just Amazing😇Thank you

  • @mdaurangzebkhan8734
    @mdaurangzebkhan8734 Рік тому

    A excellent video on Apache Spark. Covered almost everything. Very helpful video to the beginners like me.

  • @kartikeyasingh2798
    @kartikeyasingh2798 7 місяців тому +1

    Very good video

  • @deepanshurathore9661
    @deepanshurathore9661 Рік тому +3

    You explained so many things in 10 minutes 🫡🫡🫡

  • @rishisingh2598
    @rishisingh2598 9 місяців тому

    Fantastic explanation… 👏👏 the way you take your audience through the flow of explaining these concepts is very effective👌

  • @vivekabhyankar5029
    @vivekabhyankar5029 Рік тому

    Wonderful video you explained everything perfectly

  • @2412_Sujoy_Das
    @2412_Sujoy_Das Рік тому +1

    Darshil Sir, I had a query regarding Memory Management concept of Spark.
    As per my understanding, Spark uses it Execution memory to store intermediate data in execution memory which it shares with storage memory too, if needed. It can also utilize the off-heap memory for storing extra data.
    1) Does it access the off heap memory after filling up storage memory?
    2) What if it fills up Off heap memory too? Does it wait till GC clears up on-heap part or spills the extra data to disc?
    Now, in a wide transformation, Spark either sends the data back to disc or transfer it over the network, say for a join operation.
    Is the part of data sending data back to disc same as above where Spark has the option to spill data to disc on filling up on-heap memory?
    Please do clarify my above queries, sir. I feel like breaking my head as I couldn't make a headway through it yet even after referring few materials.

    • @DarshilParmar
      @DarshilParmar  Рік тому +2

      In Spark, memory management involves both on-heap memory and off-heap memory. Let me address your queries regarding Spark's memory management:
      1. Off-heap memory usage: By default, Spark primarily uses on-heap memory for storing data and execution metadata. However, Spark can also utilize off-heap memory for certain purposes, such as caching and data serialization. Off-heap memory is typically used when the data size exceeds the available on-heap memory or when explicit off-heap memory is configured. It is not used as an overflow for storage memory.
      2. Filling up off-heap memory: If off-heap memory fills up, Spark does not automatically spill the data to disk. Instead, it relies on garbage collection (GC) to free up memory. Spark's memory management relies on the JVM's garbage collector to reclaim memory when it becomes necessary. When off-heap memory is full, Spark waits for the JVM's garbage collector to reclaim memory by cleaning up unused objects. Therefore, if off-heap memory fills up, Spark may experience performance degradation or even out-of-memory errors if the garbage collector cannot free enough memory.
      Thanks,
      ChatGPT

  • @shantanukulkarni8883
    @shantanukulkarni8883 8 місяців тому

    A very very good video. Thanks, you are doing a really great job!

  • @shankarchavhan375
    @shankarchavhan375 Рік тому +1

    Darshil I want to learn data engineering from scratch. I don't know anything about these changes, so where do I start? Which course should be taken.

    • @DarshilParmar
      @DarshilParmar  Рік тому

      My Python & SQL for Data Engineering is a good place to start - learn.datawithdarshil.com/

  • @vishalcrazy5121
    @vishalcrazy5121 6 місяців тому

    Really good content .

  • @raaj5779
    @raaj5779 4 місяці тому

    What is the difference between Apache spark and Kafka?? Which one to use for data analysis?

  • @paspas12
    @paspas12 3 місяці тому

    Amazing concise detailed explanation with great editing. Such a great way of presenting a hard topic in an easy manner. Love your comparisons with teamwork, puzzles etc. So impressed. Big thumbs up and subscribe from me. Eager to see your other videos. Thanks!

  • @hariramkm1677
    @hariramkm1677 Рік тому

    Excellent Explanation...

  • @InfinitesimallyInfinite
    @InfinitesimallyInfinite Рік тому

    Excellent video Darshil. Clear and concise! Subscribed!

  • @infotalkk
    @infotalkk 4 місяці тому

    So is pandas similar to spark where pandas is more suitable for for a single node data processing vs spark is for distributed data processing ?

  • @AlexFosterAI
    @AlexFosterAI 2 місяці тому

    can you make one of these vids on lakesail's pysail?

  • @rodrigomatos7686
    @rodrigomatos7686 8 місяців тому

    Great video, thanks :)

  • @venkatah9847
    @venkatah9847 Рік тому +2

    Thank you very much and it's a very nice primer to refresh once the concepts. Thank you for your contributions 👍

  • @AviralJain
    @AviralJain Рік тому

    It was really helpful. Thanks.

  • @amadoucisse2692
    @amadoucisse2692 5 місяців тому

    Nice video!

  • @kirtisoni6076
    @kirtisoni6076 Рік тому

    Amazing video. Please share the project doc😊

  • @ANKITASHARMA-ix9gt
    @ANKITASHARMA-ix9gt Рік тому

    Very brief and informative video

  • @jeanpeuplu3862
    @jeanpeuplu3862 11 місяців тому

    Thank you for this video, I liked it: simple, clear, and short! Perfect :)

  • @abhilashn2993
    @abhilashn2993 5 місяців тому

    Nice overview.

  • @aditya3david
    @aditya3david Рік тому

    This is a great explanation

  • @Pvtmovies4384
    @Pvtmovies4384 Рік тому

    Thanks Darshil