Automatically Remove Top Junk Rows & Combine Data from Multiple Excel Files

Поділитися
Вставка
  • Опубліковано 15 жов 2024
  • Check out our newly launched M Language course ↗️ - goodly.co.in/l...
    In this video, we will learn to remove junk rows from the top and combine data from multiple Excel files. We will explore three Excel files, each containing a few unwanted rows at the top. Our goal is to remove this inconsistent number of junk rows and merge the data from all the files automatically.
    ===== ONLINE COURSES =====
    ✔️ Mastering DAX in Power BI -
    goodly.co.in/l...
    ✔️ Power Query Course-
    goodly.co.in/l...
    ✔️ Master Excel Step by Step-
    goodly.co.in/l...
    ✔️ Business Intelligence Dashboards-
    goodly.co.in/l...
    ===== LINKS 🔗 =====
    Blog 📰 - www.goodly.co....
    Corporate Training 👨‍🏫 - www.goodly.co....
    Need my help on a Project 💻- www.goodly.co....
    Download File ⬇️ - goodly.co.in/a...
    ===== CONTACT 🌐 =====
    Twitter - / chandeep2786
    LinkedIn - / chandeepchhabra
    Email - goodly.wordpress@gmail.com
    ===== WHO AM I? =====
    A lot of people think that my name is Goodly, it's NOT ;)
    My name is Chandeep. Goodly is my full-time venture where I share what I learn about Excel and Power BI.
    Please browse around, you'd find a ton of interesting videos that I have created :) Cheers!

КОМЕНТАРІ • 136

  • @sharadpunita
    @sharadpunita Місяць тому +2

    I am 70 having no use of learning Excel/Power query! I keep watching your videos as I like the way you solve problems. I have learnt so much from your teaching. Can I pay some Token for whatever I have learnt.

  • @nelson_k_d
    @nelson_k_d 4 місяці тому +4

    I saved the entire syntax as follows:
    Add Custome Column =Table.PromoteHeaders(Table.Skip([Data], each not List.ContainsAny( Record.ToList(_), {"Header1", "Header2"} )))
    Works wonders!!
    Thanks a ton bro!!

  • @nitish9111
    @nitish9111 9 днів тому

    Thanks Chandeep! I was using Index for this one, but you make it so easy. Learning from your videos is amazing! Keep up the good work!

  • @OmisileKehindeOlugbenga
    @OmisileKehindeOlugbenga Рік тому +5

    Pretty awesome! Thanks a lot for this. Record, Table and List object manipulation in one video for one task without using "Remove Other Columns"

  • @josepepe741
    @josepepe741 3 місяці тому

    It is a huge pleasure to look at your videos. Moving from excel advanced user to Power query person. Thanks a lot.

  • @retamapark
    @retamapark 10 місяців тому

    Thanks, Chandeep! I knew you had posted this, and I had this problem today. I was fooling around with different other approaches that were a mess. This was perfect!

  • @justinwduff
    @justinwduff Рік тому

    Thank you, SO MUCH! Had about 200 files to combine with various junk rows up top and now I can do it :D

    • @GoodlyChandeep
      @GoodlyChandeep  Рік тому

      Woah.. thats a lot of files.
      I am glad I could help

  • @tARasKoni
    @tARasKoni Рік тому

    M-Masterpiece!
    Packing a lot of slick tricks in one video.
    Thank you Chandeep!

  • @ramonillarramendi3191
    @ramonillarramendi3191 11 місяців тому

    Wow..Amazing. Been struggling with removing dynamically the junk and with custom headers for a while now. This works like a charm. Thanks a mill.

  • @Phippsy23
    @Phippsy23 Рік тому +1

    Amazing, I've done this with a fixed Skip value but this is on another level! Thanks

  • @JeevanC-l3k
    @JeevanC-l3k 8 місяців тому

    I am actually learning power query its Excellent. I like the way to teach. Thank u so, Much for this video.

  • @pmsocho
    @pmsocho 7 місяців тому

    Awesome. Great logic. Thanks for the video.

  • @goodnewskasparyaodzramedo9097

    This is wonderful @Goodly. I watch all your videos. From the logic to the solution of the problem and the actual solution. God continually bless you, you are a messiah!

  • @decentmendreams
    @decentmendreams Рік тому

    Your methods become so refined over time . Awesome job

  • @odallamico
    @odallamico 4 місяці тому

    Dear, you are a genius. You make M language look so easy. I appreciate your videos, my respects to you.

  • @inaction2024
    @inaction2024 11 місяців тому

    This is a lifesaving technique. Thank you for sharing with us.

  • @Chillman666
    @Chillman666 Рік тому

    You have no idea how much Power Query has helped me to automate my tasks. Also I have been struggling with this problem. A big THANK YOU ❤!

  • @vl21i
    @vl21i Рік тому

    Super Awesome, Chandeep.
    Very powerful formulas that you are teaching in a simple and easy understandable way !
    Power Query and DAX are having lot of hidden treasures

  • @Sarvabhauma
    @Sarvabhauma 3 місяці тому

    Thanks @Chandeep, I got it based on your trick im able to do it for removal of top rows. I also wanted to do for bottom rows, there is lot of junk bottom rows from my sheet, i applied same trick but i have added index row with descending order then applied this trick for to remove junk rows from sheet and sort ascending order index back.. Anyways your tricks are fantastic.

  • @santiagovillamoreno6821
    @santiagovillamoreno6821 Рік тому

    Beautiful Power Query techniques!!

  • @stephenandrews1291
    @stephenandrews1291 10 місяців тому

    this is awesome; thank you! one question - i need to add a column into the combined file that shows the original source filename for each record... where in the flow and how best to do that please?

  • @rhaps2008
    @rhaps2008 Рік тому

    Amazing, I have had to struggle through with this exact issue to manually remove those junk rows, your a life saver I will be using this in the future

  • @Adam_K_W
    @Adam_K_W 5 місяців тому

    This is great! What is the best way to do this when your source files are not formatted as Tables, but are simply Excel Worksheets?

  • @aahanavikram07
    @aahanavikram07 Рік тому +2

    You hear the problem it seems😂
    I was using filter method and removing null values and a lot of other filter method.
    Thanks for making the work easier and cleaner ❤

  • @joukenienhuis6888
    @joukenienhuis6888 Рік тому

    Thanks again for such a great clear videobabout the next step in PowerQuery. I am new in PowerQuery, but i am experimenting on DAX and you are giving a great explanation

  • @syrophenikan
    @syrophenikan Рік тому +1

    Fantastic! This is definitely going into my daily routine.

  • @roywilson9580
    @roywilson9580 Рік тому +1

    Thanks for the video. Much better than only skipping rows to one hardcoded value, makes sense to use if your column order is not the same across data tables.

  • @wensesvincen4877
    @wensesvincen4877 Рік тому

    Hi Goodly, Wonderful and Powerful Trick...Keep up the good work. What if you are importing from PDF files...Trying to convert the binary gives a different results.

  • @rohitwaradpande2492
    @rohitwaradpande2492 Рік тому

    This was awesome video. Thanks for the same. I liked the trick that you used to removed the blanks dynamically.

  • @rajanpradeepankarath8846
    @rajanpradeepankarath8846 Рік тому

    yeah that is pretty damn awesome, Chandeep. this is an everyday challenge

  • @SAIKISH2005
    @SAIKISH2005 2 місяці тому

    Marvelous work ji

  • @ShubhamSharma-ls6hj
    @ShubhamSharma-ls6hj Рік тому

    Very well explained thanku so much brother.

  • @ursvenky6394
    @ursvenky6394 Рік тому

    Awesome video. I was struggling earlier. I had work by using macro. This is very cool. Thanks Goodly

  • @chrism9037
    @chrism9037 Рік тому

    Fantastic Chandeep, thank you!

  • @setantadundalk
    @setantadundalk Рік тому

    Very sleek. I had this very same issue but I used List.Generate to loop through each record which suppose would take slightly more processing time but nothing you would ever notice.

  • @swathimanda5342
    @swathimanda5342 Рік тому

    Hi Goodly,
    That was an amazing video. I learned a lot from your videos for my daily tasks with excel. It saves lot of my time. God bless you.
    I have a question please if you can answer that, when I covert pdf to excel most of the column values are not aligned into 1 column but locate on either side.
    Ex: column B dates, should be in column B but on few rows it will be on A or C.
    How can I align them into just 1 column B.
    Please advise.
    Thank you for all your great videos. 🙏

  • @davidjosevarelagarcia7011
    @davidjosevarelagarcia7011 Рік тому

    Great , excellent. Simple like that. Thanks.

  • @manojsakthi6201
    @manojsakthi6201 Рік тому

    its very awesome, i hade a similar issue had to work around it, but this looks pretty good

  • @iankr
    @iankr Рік тому

    Brilliant! Many thanks, Mr Goodly.

  • @FarkadBarri
    @FarkadBarri 2 місяці тому

    This is awesome! I have one challenge, that one of the first rows contain the date of report and I want it to be within the data columns , how to do that :D!

  • @emilmubarakshin49
    @emilmubarakshin49 Рік тому

    Fantastic video and amazing explanation!

  • @Rice0987
    @Rice0987 Рік тому

    Magosh... It's just simple brilliant!💎
    Thanks a bunch for yor priceless help!🤗👦

  • @cesarsaldana3429
    @cesarsaldana3429 2 місяці тому

    Amazing!!!, Greetings from Mexico.

  • @cherianiype
    @cherianiype Рік тому

    ooff!! SHABASH! Terrific video Chandeep! Superb!! Sixer Maar diya!

  • @Ratnakumarwrites
    @Ratnakumarwrites Рік тому

    Super Video Chandeep.

  • @zahir585
    @zahir585 Рік тому

    Genius 🔥 thank you my friend sooo helpful ❤️❤️❤️

  • @rahulmahajan5932
    @rahulmahajan5932 Рік тому

    Awe Stucked... No Words to Express How Fantabulous It is

  • @noorbisharmohamed
    @noorbisharmohamed Рік тому

    Fantastic! But can we use "is blank" instead of "not Contain any" as the condition?
    Or promote headers if record contains any " Date", "Amount" etc?

  • @jianlinchen7978
    @jianlinchen7978 6 місяців тому

    really very helpful. Thanks . It is a good idea

  • @patrickharilantoraherinjat2994

    Great video Goodly ! what if I have a junk rows and also a junk columns. is it possible to combine? Thanks

    • @GoodlyChandeep
      @GoodlyChandeep  Рік тому

      Thanks Patrick.
      May this video will help ua-cam.com/video/1fn8fXYw6M4/v-deo.html

  • @excel-in-g
    @excel-in-g Рік тому

    As always, very neat & clear stuff. 👍
    I was wondering if one can't use Table.FindText?
    Like, each Table.Skip( _,Table.PositionOf(_, Table.FindText(_, "Profit"){0} )))
    But only testing for 1 column header here.

  • @MikeMcGlynn
    @MikeMcGlynn 3 місяці тому +2

    Hello, trying to see if any one else ran into this issue:
    -Followed all steps and worked as they should until ~7:00 when we're supposed to transform the content from Binary to a Table. My files are all CSV so the formula I ended up using was =Table.TransformColumns(Source,{"Content",Csv.Document}) instead of =Table.TransformColumns(Source,{"Content",Excel.Workbook}) that is used in the video. It converted to Tables fine.
    -Then on the next step (7:40) where you're supposed to expand the tables and see all of the sheets, when I do this...the header options are just Column 1, Column 2, etc. instead of the actual headers...AND every row of all the files shows up instead of just the sheet name.
    -When I follow the steps afterwards, I get the error: "dataformat.error there were more columns in the result than expected"
    Any idea what's going on?

  • @SandhyaSingh-qk6up
    @SandhyaSingh-qk6up Рік тому +1

    Can this also be solved by using index number and custom function?

  • @andreass.3130
    @andreass.3130 Рік тому +1

    Hi Goodly, thanks for all your great videos. Isn't there a simpler way to do it here? In the example file you create a conditional column (If Column1=Date Then True). Then you fill the conditional column downwards. Now you have a True for all rows you need and a null value above the desired header row. So you can filter for True. Shouldn't that be dynamic as well?

    • @DAXifiedSatish
      @DAXifiedSatish Рік тому

      No. In that way if u filter true it will be with junk rows and all data except headers and if u filter for false u will get only headers

  • @tiagocarvalhal4502
    @tiagocarvalhal4502 Рік тому

    Incredible. Thanks :)

  • @rogeriopsvalle
    @rogeriopsvalle Рік тому

    Hi Goodly.
    Thanks for all videos. They are just great.
    I need to combine two of your tricks in just one.
    I have many sheets with junk lines (same number of junk lines for all sheets) and these same sheets have inconsistent columns.
    How do I do that?
    Thanks in advance

    • @GoodlyChandeep
      @GoodlyChandeep  Рік тому

      You code to remove the junk headers can probably be
      Table.Skip ( Table, List.MatchesAny(Record.ToList(_), each _ "null" or each _ "")
      After this you can promote the headers and then follow the inconsistent header video.

  • @arbazahmad7177
    @arbazahmad7177 Рік тому

    Fantastic 🎉.. Thanks 😊

  • @MrKamranhaider0
    @MrKamranhaider0 Рік тому

    JUST AMAZAING SUPERB

  • @BharathiM-t8y
    @BharathiM-t8y 8 місяців тому

    admiring the brilliance

  • @bohdanduda652
    @bohdanduda652 Рік тому +4

    Another way, how we can do it, it is add additional column using List.PositionOf and due to that calculate in which position we have Date and Profit

  • @s1ngularityxd64
    @s1ngularityxd64 5 днів тому

    excellent🙂

  • @swapna_learner
    @swapna_learner 14 днів тому

    Could we have similar logic for bottom rows?

  • @mohamedheltoukhy2355
    @mohamedheltoukhy2355 Рік тому

    Amazing
    Thanks alot

  • @gennarocimmino
    @gennarocimmino Рік тому

    I would say simply amazing !!!!

  • @williamarthur4801
    @williamarthur4801 Рік тому

    This may sound like a stupid question, and I'm sure it's something basic, but why do you get
    Name = Sheet , Data = Table, but I always have Name= table , Name = Sheet and adjacent
    Data = table, Data = table ? Oh, and loved the use of a condition for skip which I'd never thought of,
    even though now having looked it does say count or condition.

  • @markdonaghy5397
    @markdonaghy5397 2 місяці тому

    I have an interesting use case. I have a base file with headers but the fifth file has had one additional column with headers added. As a result the process you have described breaks and for the last table I get the error "DataFormat.Error: There were more columns un the result than expected" in the CSV data column.
    I have been wrestling with this for a week and my research does not show any ways to manage this, although an append tables would seem to manage this.

    • @GoodlyChandeep
      @GoodlyChandeep  2 місяці тому

      send me your sample data and description of the problem - goodly.wordpress@gmail.com

  • @RogerStocker
    @RogerStocker Рік тому

    unbelievable crazy as usual.

  • @zdzislawkes
    @zdzislawkes Місяць тому

    Hi, It's very smart solution. I'm looking for instruction, how to combine tables and not lose the columns that exist in the previous steps, in this case I would like the Name column to remain?

    • @GoodlyChandeep
      @GoodlyChandeep  Місяць тому

      you'll find the answer in this video
      m.ua-cam.com/video/oExuBdnHtrk/v-deo.html

    • @zdzislawkes
      @zdzislawkes Місяць тому

      @@GoodlyChandeep Thank You very much. This is what I am looking. Can Yoy say what You think about this solution: ua-cam.com/video/rCYn_onMP0I/v-deo.htmlsi=QYmkwRM2Cl1FuoCu

  • @stefankirst3234
    @stefankirst3234 Рік тому

    Really awesome!

  • @cooolbreeze
    @cooolbreeze Рік тому

    Pretty cool. Seems the only thing that would limit what one can do with PQ is one's imagination. Question, why List.ContainsAny instead of List.ContainsAll?

  • @starprinceofficial9603
    @starprinceofficial9603 6 місяців тому

    For me my question if we need to bring the data before the columns as a new column before promoting headers, how do we go about it

  • @zeinh5139
    @zeinh5139 Рік тому

    Wow ! thank you

  • @babakmohammadi9931
    @babakmohammadi9931 Рік тому

    could you possibly tell me what do i have to do with CSV files for "Table.TransformColumns(Source, {"Content", Excel.Workbook})" as that dosent work for CSV

  • @TomStewart-x1t
    @TomStewart-x1t 3 місяці тому

    I have a similar problem with a CSV file, there are title characters before the Column headings. How could I remove those?

  • @itgyantricks7218
    @itgyantricks7218 Рік тому

    excellent

  • @abhijitmodak3461
    @abhijitmodak3461 9 місяців тому

    Thanks!

  • @j_baisley_
    @j_baisley_ 8 місяців тому

    Damn awesome is right 👍

  • @markp8600
    @markp8600 Рік тому

    Fabulous video , but it is difficult to see the "Applied Steps" at the end of the video. Question: what if the junk rows and "junk row data" are spread inconsistently through the spreadsheet. For example, the fund or department information may change resulting in blank rows between datasets and header rows when a new fund source or department is identified in the report. The example shows how to remove junk rows at the top of the report for 3 reports. Do you have any videos where the junk rows would be sporadically located throughout the report?

    • @GoodlyChandeep
      @GoodlyChandeep  Рік тому +1

      Mark, I'll have to take a look at the data to give you possible ways to solving it.
      See if you can pick any tricks from this long video - ua-cam.com/video/_ZKT1raC4P0/v-deo.html
      I've shared horizontal and vertical looping techniques in this video.

    • @markp8600
      @markp8600 Рік тому

      @@GoodlyChandeep - Hi Chandeep, I could share a file with you on your website. I could upload on the PQ training course site. Would that work? Thanks for the great insights!

    • @markp8600
      @markp8600 Рік тому

      @@GoodlyChandeep Perfect! Thank you so much Chandeep. Your help and guidance is greatly appreciated. Cheers!

  • @huyuc6614
    @huyuc6614 10 місяців тому

    Hi i stuck on the step transform column. My file is .csv file not xlsx so when i use transformcolumn, it show error on the content. Do you know how to fix it

  • @jimmckie3574
    @jimmckie3574 2 місяці тому

    Brilliant

  • @mohitchaturvedi8931
    @mohitchaturvedi8931 Рік тому

    Delighted, this is the problem of every hour.
    Many times data come with merged header, which you have sorted already,

  • @eslamfahmy87
    @eslamfahmy87 Рік тому

    If the name & kind of data which extracted into PQ it's inconsistent and i need to filter out for all non needed sheets,
    How can work with that sheet

  • @denissipchenko2455
    @denissipchenko2455 Рік тому +1

    Thanks for sharing!
    But looks like there should be List.ContainsAll instead of List.ContainsAny

  • @bulbulahmed4414
    @bulbulahmed4414 Рік тому

    Amazing

  • @hassanjatta4257
    @hassanjatta4257 Рік тому

    Awesome!!!

  • @devendrareddynamballa1053
    @devendrareddynamballa1053 5 місяців тому

    What if we want to add back the removed rows after promoting the headers

  • @oOBlindyOo
    @oOBlindyOo 3 місяці тому

    My source data is from pdf. Not excel.
    So the formula '=Table.TransformColumns(Source,{"Content", Excel.Workbook}) isn't working... any ideas?

  • @josealvesferreira1683
    @josealvesferreira1683 Рік тому

    very nice

  • @ericliu79
    @ericliu79 Місяць тому

    Great!

  • @carolshipley7903
    @carolshipley7903 4 місяці тому

    Hi. Can you give me away of doing this if there is more than one worksheet in the excel file and you only want to clean just one worksheet?

    • @GoodlyChandeep
      @GoodlyChandeep  4 місяці тому

      If you're connecting to a single xl file, you'll have to apply a filter before the Navigation Step to restrict the sheet names that you want.

  • @retamapark
    @retamapark 10 місяців тому

    Not as sophisticated, but say you had a table with two columns, Col1 and Col2. There are junk rows at the top. The row with headers has the values "Date" and "Amount". This seems for work and is easy to implement.
    let
    Source = Excel.CurrentWorkbook(){[Name="Table1"]}[Content],
    Custom1 = Table.Skip(Source, each [Col1] "Date" and [Col2] "Amount")
    in
    Custom1

    • @retamapark
      @retamapark 10 місяців тому

      fnTableSkipDynamic(Source, "Col1", "Date", "Col2", "Amount")
      Function to do this: Input column names as text, header values as text.
      (sourcetablename as table, col1name as text, header1value as text, col2name as text, header2value as text)=>
      let
      return = Table.Skip(sourcetablename,
      each
      ( Record.Field(_, col1name) header1value)
      and
      ( Record.Field(_, col2name) header2value)
      )

      in
      return

    • @GoodlyChandeep
      @GoodlyChandeep  10 місяців тому

      List.matchesall can look for both the headers. 👍

  • @ankursharma6157
    @ankursharma6157 Рік тому +1

    Token of Gratitude!

  • @udayteja6595
    @udayteja6595 Рік тому

    Great

  • @eslamfahmy87
    @eslamfahmy87 Рік тому

    Actually, I think you are the only youtuber instructor who is preparing depth & creative PQ examples.
    Really, fantastic 👏, elegance & more simple 😊

  • @PandaVlademirBerlin
    @PandaVlademirBerlin 6 місяців тому

    What if i need the source.file column?

  • @ajay249
    @ajay249 Рік тому

    Goodly is just too Godly.

  • @cdmkk2211
    @cdmkk2211 9 місяців тому

    How can I add files name as a column in this

  • @ankitachauhan5834
    @ankitachauhan5834 5 місяців тому

    How to do same thing but fro CSV file

  • @TiffanyYoung-z7s
    @TiffanyYoung-z7s Рік тому

    Hi Chandeep, i want to enroll your course about Dax and Data Modelling (Bundle), can you give me some discount? hehe it's too expensive for me :(

  • @MuridRealita
    @MuridRealita 29 днів тому

    amzing