11. Databricks | Pyspark: Explode Function

  • Published 27 Jan 2025

COMMENTS • 34

  • @gulsahtanay2341
    @gulsahtanay2341 11 months ago +2

    Raja is a natural when it comes to teaching.

  • @ririraman7
    @ririraman7 2 years ago +1

    Amazing! Awesome tutorial

  • @ayushgupta1613
    @ayushgupta1613 25 days ago

    Please share the script for all the sample data in the description box so that we can copy it and practice in our own Databricks notebook. It would be a great help, not only for this video
    but for all the videos where you demonstrate things hands-on.

  • @bashask2121
    @bashask2121 3 years ago +4

    Nice explanation. Could you please provide the source code for practice?

  • @anthony281190
    @anthony281190 2 years ago +2

    Very Good Man!

  • @gurumoorthysivakolunthu9878
    @gurumoorthysivakolunthu9878 2 years ago +1

    Hi Sir, your videos are very simple and the explanations are great.
    My question: how do I use the explode function if I have more than one ordinary column and one list or map column? What is the syntax for this?
    Thank you, Sir.

  • @sumitchandwani9970
    @sumitchandwani9970 1 year ago

    Please try to attach the notebook exports for reference

  • @mohans3143
    @mohans3143 3 years ago +1

    How do I load CSV file data that contains array-type data into a PySpark DataFrame?

    • @rajasdataengineering7585
      @rajasdataengineering7585 3 years ago +1

      If you want to flatten the array, you can use the explode function.
      While reading the CSV file, you can supply a defined schema with an array type wherever needed.

  • @rabink.5115
    @rabink.5115 2 months ago

    Hi Raja, do you have this code stored on GitHub?

  • @mohanj6403
    @mohanj6403 10 months ago

    If I have 2 or more array columns, how do I use the explode function? Example: array = [(mohan,[tv,AC],[cricket,football])]

    • @telugucartoonchannel2916
      @telugucartoonchannel2916 10 months ago

      Bro, do it like this:
      select(col('col name'), explode(col name), explode(col name))

    • @supriyakoura7755
      @supriyakoura7755 6 months ago

      @telugucartoonchannel2916 not working

  • @pradipawasthi2883
    @pradipawasthi2883 2 years ago +1

    I need some serious help from you. I have sequence data, i.e., a single row of data, and I want to split it into multiple rows after every 5th delimiter ('|'). How can I do that?

    • @rajasdataengineering7585
      @rajasdataengineering7585 2 years ago

      Split the data at every 5th delimiter, convert it to a list using collect_set, then flatten the list using the explode function. It should work.

    • @pradipawasthi2883
      @pradipawasthi2883 2 years ago +1

      @rajasdataengineering7585 I am doing this in PySpark and Databricks.

    • @pradipawasthi2883
      @pradipawasthi2883 2 years ago

      @rajasdataengineering7585 It is not moving to the next line. I already followed this process,
      and it is not working.

    • @pradipawasthi2883
      @pradipawasthi2883 2 years ago +1

      @rajasdataengineering7585 Hi Raja, I have just mailed you my query with a screenshot. Can you look into that?

    • @rajasdataengineering7585
      @rajasdataengineering7585 2 years ago

      Hi Pradip, when I get time, I shall look into it and get back to you

  • @satyaraj1427
    @satyaraj1427 1 year ago

    Hello, can you please provide the code or a GitHub repo (if there is one) for reference? Thank you in advance :)

  • @ajaygundu_92
    @ajaygundu_92 1 year ago

    Hello Raja garu. It would be more helpful if you provided the source code.

  • @anthony281190
    @anthony281190 2 years ago +1

    Raja, how do I group by customer, find the row with the highest taxes, and put "Active" in EstatusColum for that row?
    ID Customer | Taxes | EstatusColum
    1245 | 520 | Inactive
    1245 | 600 | Inactive
    1245 | 700 | Active

    • @rajasdataengineering7585
      @rajasdataengineering7585 2 years ago

      The window function rank, or max_by, can be used for this requirement.

    • @anthony281190
      @anthony281190 2 years ago

      @rajasdataengineering7585 Can you send me an example?

  • @jalindarvapre3760
    @jalindarvapre3760 1 year ago +1

    Amazing! Awesome tutorial