driver out of memory spark | spark memory management | Lec-18

  • Published 3 Jan 2025

COMMENTS • 54

  • @akhiladevangamath1277
    @akhiladevangamath1277 7 months ago +13

    That smile on your face when driver OOM error came😂, shows how much you enjoy teaching us🤗

  • @priyankasethi2922
    @priyankasethi2922 7 months ago +1

    I recently joined an MNC, and I must say this content is so accurate. Thank you so much for sharing your knowledge and experience.

  • @ShivrajSingh-x5j
    @ShivrajSingh-x5j 1 month ago

    Awesome video! The content quality is excellent, with a clear and concise explanation of the topic. The hands-on approach to Spark is incredibly helpful, and the teaching style makes it easy to follow along. Keep up the great work!

  • @coding_BeastMode_ON
    @coding_BeastMode_ON 1 year ago

    Brother, nobody explains things in such an easy and detailed way. Great work!!

  • @Tech_world-bq3mw
    @Tech_world-bq3mw 5 months ago +2

    5:16 literally first guy in my life who is happy to get error.😆

    • @manish_kumar_1
      @manish_kumar_1  5 months ago +1

      It's not the first time I was happy with an error. Whenever our code gives wrong output without any error, I want it to give me an error so that I know exactly where to fix it 😀

  • @abhijitganguly4836
    @abhijitganguly4836 2 months ago

    awesome tutorials brother! Really really simplified things! Absolutely brilliant!

  • @kunalnkalore
    @kunalnkalore 1 year ago

    This video is absolutely great, brother... keep going

  • @engineerbaaniya4846
    @engineerbaaniya4846 1 year ago

    Awesome. This is a most-asked interview question; this channel should get more subscribers and views.

    • @kunalnkalore
      @kunalnkalore 1 year ago +1

      Everyone is busy watching reels, brother

  • @dataplumberswithajay
    @dataplumberswithajay 1 year ago

    great video, brother

  • @anviscreations1265
    @anviscreations1265 1 year ago

    Awesome content please continue the series

  • @praveenkumarrai101
    @praveenkumarrai101 1 year ago

    hats off bro loving your channel

  • @08rajdeepsonawane94
    @08rajdeepsonawane94 1 year ago

    Thanks sir for your great video ❤

  • @tanushreenagar3116
    @tanushreenagar3116 7 months ago

    perfect video sir

  • @harshranglani9950
    @harshranglani9950 2 months ago

    Brother, you taught this really well, thanks a lot for that, but why did you make separate playlists for practical and theory? I can't figure out which one to watch when.

  • @raviyadav-dt1tb
    @raviyadav-dt1tb 11 months ago +1

    How do we know which file is small when we do a broadcast join? Please tell me.

  • @mallangivinaykumar9500
    @mallangivinaykumar9500 1 year ago +1

    Can you please make videos in English? It would be easier to understand.

  • @rekhasingh4945
    @rekhasingh4945 1 year ago

    Please continue the series

  • @Matrix_Mayhem
    @Matrix_Mayhem 11 months ago

    What do we mean by container here? Does it have any other name that we studied in earlier videos?

    • @younevano
      @younevano 1 month ago +1

      container is application master

  • @adityakvs3529
    @adityakvs3529 2 months ago

    Brother, in which memory is a DataFrame stored and processed: overhead or JVM heap?

    • @younevano
      @younevano 1 month ago

      Neither, dataframes are there in executors!

  • @sammail96
    @sammail96 1 year ago

    Hi Sir, I have come across some conflicting information regarding the show() method. Kindly confirm the following: the show method does not bring all the data from a single partition to the driver, but rather collects a sample of rows from each partition and displays them.

    • @akhiladevangamath1277
      @akhiladevangamath1277 7 months ago +1

      As per my understanding, the show method brings all the data from a single partition of a single executor to the driver, but displays a sample of rows from that partition.

  • @DpIndia
    @DpIndia 1 year ago

    nice video

  • @yashwantdhole7645
    @yashwantdhole7645 1 year ago +1

    What are the objects here that cause the driver overhead OOM error? Can you please explain?

  • @akumar2575.
    @akumar2575. 8 months ago

    day 6 done 👍

  • @eajazahmed948
    @eajazahmed948 2 months ago

    How do I check the driver logs in cluster mode using the UI?

    • @manish_kumar_1
      @manish_kumar_1  2 months ago

      It is the driver logs that are shown on the Spark UI

    • @eajazahmed948
      @eajazahmed948 2 months ago

      @manish_kumar_1 It only shows the executors' logs; the driver log section stays empty

  • @ashutoshkumarsingh3337
    @ashutoshkumarsingh3337 1 year ago

    OK, so does garbage collection come under overhead or JVM heap memory?

  • @prajwaljamunkar
    @prajwaljamunkar 1 year ago

    Hi Manish, when will you arrange a live chat?

  • @adityakvs3529
    @adityakvs3529 2 months ago

    Brother, the driver program runs on the driver node, so what is the application master?

    • @younevano
      @younevano 1 month ago

      There is no driver node; the driver program runs in the application master container, which runs on any worker node

    • @adityakvs3529
      @adityakvs3529 1 month ago +1

      @ The application master is also known as the driver; the terminology changes. In Databricks we call it the driver node

  • @tnmyk_
    @tnmyk_ 10 months ago

    Great video!
    Why does the description say "definitely don't buy the S22 Ultra"? 😂

  • @arnabghosh106
    @arnabghosh106 7 months ago

    My driver is stopping while writing a DataFrame. I didn't use any action other than write. We cannot increase the cluster configuration. How can I resolve the issue? Any suggestions?

    • @manish_kumar_1
      @manish_kumar_1  7 months ago

      This information is not sufficient to offer a suggestion

  • @PavanKumar-vi7hd
    @PavanKumar-vi7hd 9 months ago

    Hi Manish
    Could you please upload the same videos in English?

  • @ajaywade9418
    @ajaywade9418 1 year ago

    I saw a few videos that mention max(0.07 * memory, 384 MB)
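
A minimal sketch of the overhead formula quoted in the comment above, in plain Python. The helper name `memory_overhead_mb` is hypothetical. The 0.07 factor is the one the commenter quotes; recent Spark documentation lists 0.10 as the default for `spark.driver.memoryOverheadFactor`, so the factor is left as a parameter rather than treated as a fixed constant.

```python
def memory_overhead_mb(heap_mb: int, factor: float = 0.07, floor_mb: int = 384) -> int:
    """Return max(factor * heap, floor) in MB.

    Hypothetical helper for illustration only; 0.07 is the factor quoted
    in the comment, not necessarily the default of your Spark version.
    """
    return max(int(heap_mb * factor), floor_mb)


# For a small heap the 384 MB floor wins; for a large heap the factor wins.
print(memory_overhead_mb(1024))                  # 1 GB heap
print(memory_overhead_mb(10240))                 # 10 GB heap, factor 0.07
print(memory_overhead_mb(10240, factor=0.10))    # 10 GB heap, factor 0.10
```

The point of the floor is that very small drivers still get a fixed minimum of non-heap memory, while larger drivers scale their overhead with the heap size.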

  • @rekhasingh4945
    @rekhasingh4945 1 year ago

    👍

  • @ajr1791ze
    @ajr1791ze 1 year ago

    Hi Manish, any good resource to learn Scala?

  • @vishaljoshi1752
    @vishaljoshi1752 1 year ago

    Hi Manish, will the driver program run on the master node or on a worker node, given that you have written Application Master (worker node)?

    • @amazhobner
      @amazhobner 1 year ago

      The Application Master/Driver is the driver node, not a worker node.
      The master node is where your resource manager is, though DB now gives the option to have the RM and AM/D on the same machine.

  • @venkataramana-yh3th
    @venkataramana-yh3th 1 year ago

    Bro, please make videos in English

  • @rh334
    @rh334 1 year ago +1

    Let's say we read a 10.1 GB CSV file stored in a data lake and have to do some filtering of the data. How many tasks will run?
    Is there a possibility of an out-of-memory error in this scenario?

    • @manish_kumar_1
      @manish_kumar_1  1 year ago +3

      10100 MB / 128 MB ≈ 79 partitions, so 79 tasks will be created in total. Filtering does not depend on data from any other partition, which means it is a narrow-dependency transformation: every partition can do its own filtering. As long as you have more than 500 MB of executor memory, you will not face any OOM. But make sure that you are not calling collect, otherwise you may face a driver OOM.

    • @rajnarayanshriwas4653
      @rajnarayanshriwas4653 9 months ago

      @manish_kumar_1 Why 500 MB?
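
The partition arithmetic in this thread can be sketched in plain Python. This is a rough estimate under stated assumptions: the helper name `estimate_read_partitions` is hypothetical, the file is assumed to be splittable, and 128 MB is assumed as the partition size (the default of `spark.sql.files.maxPartitionBytes`). The number Spark actually produces can differ, since it also depends on other settings and on the file format.

```python
import math


def estimate_read_partitions(file_size_mb: float, max_partition_mb: float = 128) -> int:
    """Estimate input partitions as ceil(file size / partition size).

    Hypothetical helper for illustration; it ignores everything except
    the two sizes passed in.
    """
    return math.ceil(file_size_mb / max_partition_mb)


# The reply treats 10.1 GB as 10100 MB.
print(estimate_read_partitions(10100))
# Treating 1 GB as 1024 MB gives a slightly larger count.
print(estimate_read_partitions(10.1 * 1024))
```

With 10100 MB the estimate is 79, matching the reply; counting 1 GB as 1024 MB gives 81. Each partition is handled by one task in the filter stage, so the task count follows the partition count.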

  • @udittiwari8420
    @udittiwari8420 10 months ago

    I think this is Lec-15, but by mistake it is written as Lec-18, sir