RDD vs Dataframe vs Dataset | With sample code | Spark Interview Questions

Поділитися
Вставка
  • Опубліковано 27 сер 2024
  • Hi Friends,
    In this video, I have explained the differences between rdd, df and ds.
    github.com/sra...
    Please subscribe to my channel for more interesting learnings.

КОМЕНТАРІ • 11

  • @Jay-vh3gx
    @Jay-vh3gx Рік тому +2

    Thank you for clear cut explanation Sravana

  • @sravankumar1767
    @sravankumar1767 2 роки тому +1

    Nice explanation sravana, thanks for sharing 👍

  • @padmsrikanth1672
    @padmsrikanth1672 2 роки тому +1

    After long time you uploaded video, pls upload regularly.your explanation superb.

  • @anky986
    @anky986 2 роки тому +1

    Hi Lakshmi,
    Nice video. Can you please create a video on Spark submit and client and explain how these are work with different with example.

    • @sravanalakshmipisupati6533
      @sravanalakshmipisupati6533  2 роки тому +1

      Thanks Ankit. Please checkout the video for the differences between modes - ua-cam.com/video/BLut7duRhTM/v-deo.html and video for spark-submit - ua-cam.com/video/APb6vwQz1_M/v-deo.html

  • @kaladharnaidusompalyam851
    @kaladharnaidusompalyam851 2 роки тому +1

    Hi Dear Lakshmi, Thank you for video
    what is the diff b/w foreach() & collect() actions . I know we can't use collect for larger dataset it will spill over disk if we don't have prior experience.
    Than other doubt is.
    If we have two files a, b j need to know duplicates present in both files how do you do it with spark. Can I have overview for the approach .👍

    • @sravanalakshmipisupati6533
      @sravanalakshmipisupati6533  2 роки тому

      Sure, I will get back on the differences. you can use df.dropDuplicates() - ua-cam.com/video/Vrs011lwrmM/v-deo.html

  • @ShubhamRai06
    @ShubhamRai06 Рік тому

    Could you demonstrate , how df and ds handle runtime & compile time safety?