Much Better Web Scraping with Pandas - Automatically Extract All Table Elements From a Web Page!

  • Published 3 Nov 2024

COMMENTS • 164

  • @peppespe
    @peppespe 2 years ago +9

    Could you explain the same technique on Facebook (e.g. friend list or Wall) or IG?

    • @PythonSimplified
      @PythonSimplified 2 years ago +12

      Yup! It's not gonna be exactly the same as Facebook and Instagram are heavy on JavaScript so they require a more powerful library, for example: Selenium! 😊
      I have a bunch of really cool tutorials about it:
      ⭐ web scraping entire Facebook image gallery with Selenium:
      ua-cam.com/video/SsXcyoevkV0/v-deo.html
      ⭐ web scraping Instagram images with Selenium:
      ua-cam.com/video/iJGvYBH9mcY/v-deo.html
      ⭐ automating LinkedIn contacts using Selenium:
      ua-cam.com/video/7aIb6iQZkDw/v-deo.html
      And I have some additional resources for Instagram scraping if you want to check them out:
      Detailed Instagram Web Scraping Blog:
      www.mariyasha.com/post/web-scraping-instagram-thumbnails-with-selenium
      Code for different types of Instagram bots:
      github.com/MariyaSha/WebscrapingInstagram
      Good Luck and I hope it helps! 😁

    • @JaveGeddes
      @JaveGeddes 2 years ago +1

      @PythonSimplified Please take the microphone off your desk. When you type it's like you're hitting the mic. Also thanks for the videos. Much love!

    • @smartyster
      @smartyster 2 years ago +2

      @JaveGeddes I like the sound of typing in her video as well! 🙈

    • @StanislavVasko
      @StanislavVasko 2 years ago +1

      She can't, because this is not an example of real-world scraping. On YT you can find pro tutorials with much better libraries to choose from.

    • @peppespe
      @peppespe 2 years ago +1

      @StanislavVasko can you provide any links? Please 🙏

  • @champnarratives
    @champnarratives 1 year ago +2

    I loved the way you taught; I was stuck for 2 hours with this problem before seeing your video, and you solved all of my problems in a simple manner. 💕💕💕💕

  • @DittoRahmat
    @DittoRahmat 2 years ago +1

    You're welcome Mariya !

  • @josecosta3878
    @josecosta3878 2 years ago +8

    Omg I loved this tutorial. I usually use Selenium with Beautiful Soup, but for scraping tables (static ones, I think) I would say that your method is far faster and better. Thank you

  • @Asman2021
    @Asman2021 5 months ago

    This coding tutorial just saved me 2 days of effort, thank you so much!

  • @wexleyyt5931
    @wexleyyt5931 2 years ago +2

    I just came back home and this was waiting for me in the youtube notifications! Thanks Mariya!

    • @PythonSimplified
      @PythonSimplified 2 years ago

      You're absolutely welcome! hope this shortcut saves you lots of time in your future projects! 😁

  • @saiyamarora3788
    @saiyamarora3788 2 years ago +25

    I have learned Python Flask from your videos and now I have made 2 complex apps (at least for me 😅) using Flask. Your videos are really very good. Keep it up😊😊😊😊😊

    • @Tobs_
      @Tobs_ 2 years ago +2

      great comment, there are lots of great dev jobs that need Flask 👍

    • @liftcarryfetish1296
      @liftcarryfetish1296 2 years ago +1

      @Tobs_ Flask is nothing in front of ASP.NET

  • @evenmoreevil
    @evenmoreevil 2 years ago +1

    Thank you, thank you, thank you for simplifying this! I've been looking everywhere. When the code worked on my end I yelled YES

    • @PythonSimplified
      @PythonSimplified 2 years ago +1

      Yeeey!! I'm super happy to help!! 😃
      Have fun with your new web scraping superpowers! 😉

  • @yhoff76
    @yhoff76 1 year ago

    BOOM!!!! This video was awesome and provided exactly what I was looking for: the best and fastest method I've encountered thus far.

  • @muhendisinkaresi6336
    @muhendisinkaresi6336 2 years ago +1

    The best simple tutorial video I have ever seen, and the content is so clear.
    Finally, your English is pretty good and quite understandable

  • @davidgm2821
    @davidgm2821 2 years ago

    Oaoh!! I always thought Pandas library is incredibly powerful 💪🏼Bravo!! 🤘🏻

  • @arijanrahimic3210
    @arijanrahimic3210 2 years ago

    Back to the LXML Roots! Great great great as always 💪💪💪 THX Mariya

  • @Tobs_
    @Tobs_ 2 years ago +2

    wow, that seems quite powerful for just a few lines of code! Awesome, good job 👍

  • @theRealtensigh
    @theRealtensigh 2 years ago +1

    Very cool, especially throwing in the SSL unverified cert issue.

    • @PythonSimplified
      @PythonSimplified 2 years ago

      hmmmm... that's a great question! I haven't had a chance to work with macport yet so I'm probably not the best person to ask 😊
      I would assume these two are competitors and they wouldn't want you to migrate so easily... but that's just a guess 😅 hahahaha
      Thanks for your lovely comment, and sorry I couldn't help much. Maybe some other folks here have some experience with it?

  • @wandevv
    @wandevv 2 years ago +3

    For a separator in Python you can use: print("*" * 30)
    With Python you can multiply strings by an integer to repeat them that many times 😁
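A minimal sketch of the string-repetition trick from the comment above. The `banner` helper and its parameter names are hypothetical, added only to show how the repeated string might be reused between printed tables.

```python
# Repeating a string: "*" * 30 builds one string made of thirty asterisks.
separator = "*" * 30
print(separator)


def banner(title: str, char: str = "*", width: int = 30) -> str:
    """Return the title framed by two separator lines (hypothetical helper)."""
    line = char * width
    return f"{line}\n{title}\n{line}"


# Example: a heading that could be printed before each scraped table.
print(banner("Table 0"))
```

Writing the separator this way keeps its width in one place, so it can be changed without counting characters by hand.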

  • @KacperSieradziński
    @KacperSieradziński 2 years ago +1

    Great example! I didn't know that I can scrape the data that way :-) Thanks for being an inspiration ;)

  • @herbyberson7797
    @herbyberson7797 2 years ago

    Awesome tutorial. I couldn't find a solution anywhere as to why I kept getting the SSL error... your video really simplified it...
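The exact fix shown in the video is not quoted in this thread. One commonly used workaround for certificate verification errors, assumed here for illustration, is to fetch the page with an SSL context that skips verification and then hand the HTML text to pandas. The function names below are hypothetical, and skipping verification removes protection against tampered responses, so it is probably only appropriate for sources you already trust.

```python
import ssl
import urllib.request


def unverified_context() -> ssl.SSLContext:
    """Build an SSL context that does NOT verify certificates (assumed workaround)."""
    context = ssl.create_default_context()
    # check_hostname must be switched off before verify_mode can be CERT_NONE.
    context.check_hostname = False
    context.verify_mode = ssl.CERT_NONE
    return context


def fetch_html_unverified(url: str, timeout: float = 10.0) -> str:
    """Download a page without certificate verification and return its text."""
    with urllib.request.urlopen(url, context=unverified_context(), timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


# The returned text could then be wrapped in io.StringIO and passed to pandas.read_html.
```

Scoping the relaxed context to a single request, rather than changing a process-wide default, limits how much of the program runs without verification.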

  • @espua72
    @espua72 2 years ago +1

    I really enjoy your teaching, it's fun, precise and short!

  • @vigneshsuresh6003
    @vigneshsuresh6003 2 years ago +1

    This should be useful for scraping wikipedia pages too

  • @aakos23
    @aakos23 2 years ago

    Nice Pandas shirt :) and brilliant tutorial as always :)

  • @JorgeEscobarMX
    @JorgeEscobarMX 2 years ago +1

    Amazing video, I just cloned the previous WebScrapingDatabases repository and it works wonders. Still, I want to scrape a grocery store catalog to keep a record of sale prices and discounts, and that's not tabular data, so no table elements there.

    • @fernandoflores4656
      @fernandoflores4656 2 years ago

      Personally working on this. It's not on a repo, at least just yet. Still trying to get past the headers to hit the site directly. I was building 1 for HEB Grocery store, since their digital receipts are stored on the account through the site. I had to manually copy the cookie that I received after making an initial response with the browser. I just get denied when I try to save the cookie on a return using Python and requests library. That is the challenge.

  • @watheusbr
    @watheusbr 2 years ago

    Thank you very much!!! The biggest problem I had with automating an economic report was finding an easy way to get the data before plotting the charts. Scraping with pandas solves this.

  • @Yeah_bruv
    @Yeah_bruv 2 years ago

    Awesome vid - currently documenting my journey on YouTube, been following yours for some time now. Thanks! 😀

  • @smalirizvi8026
    @smalirizvi8026 2 years ago +1

    Closed whatever I was watching and came here 😍😍😍😍 😀

    • @PythonSimplified
      @PythonSimplified 2 years ago

      hahahaha that's awesome!! thank you so much dear! 😁😁😁

  • @luis96xd
    @luis96xd 2 years ago

    Wow, this is awesome! New ways for making amazing things, thanks for this videos and your explanation 😁🙏

  • @feelthelo-fi9174
    @feelthelo-fi9174 2 years ago +1

    Hii
    its a good news to you , you got a new subscriber me😆😅
    thanks for helping me in python ...
    love from india ❤

  • @mibrahim4245
    @mibrahim4245 2 years ago

    You wrote panAdas for the notebook, just to let you know I focus on the video content more than before 😀 ..
    Great tutorial as usual ❤, thanks M ..

  • @luizwerneck9262
    @luizwerneck9262 2 years ago +1

    Well explained.
    Could you explain how we can get attributes from an XML file? Can we use pandas or another library?
    And also, haha, how can we alter some attributes and save the file?
    Thanks!!

  • @chalmerditmars7037
    @chalmerditmars7037 1 year ago

    Love the white background on this video. I have to get the black at just the right angle from the glare of the sun to see the print. Great video thank you. It does not work so well with my windows and visual studio, I do not get the cell at the bottom to select which table I want.

  • @Maximus8112
    @Maximus8112 2 years ago

    Thank you for your simple explanation.

  • @webslinger2011
    @webslinger2011 2 years ago

    Nice! very quick scrape n grab method.

  • @trtlphnx
    @trtlphnx 2 years ago +2

    Thanks So Much For This Incredible Information ~
    You're The Best; Love You And Your Channel ~

    • @PythonSimplified
      @PythonSimplified 2 years ago

      You're absolutely welcome! I hope it helps saving time on your future projects! 😊

  • @alwayslifted6667
    @alwayslifted6667 2 years ago +1

    So useful!

  • @CodePhiles
    @CodePhiles 2 years ago

    Again , amazing new trick, so now we can get any data faster and intelligently, Thank you..❤

  • @SuperCantillano
    @SuperCantillano 2 years ago

    Love you young lady. Your tutos are very good and easy 👍

  • @mohamadhabibhomeidi5634
    @mohamadhabibhomeidi5634 2 years ago +1

    Very nice .

  • @k.ballajiaxe6403
    @k.ballajiaxe6403 2 years ago

    Thalaivi you are great

  • @syoeye
    @syoeye 2 years ago

    Damn, this is amazing.. I'll give a try with my project! Awesome tutorial, really helpful 👍 thank you so much

  • @namancomtech
    @namancomtech 2 years ago

    Please Tell me about your Computer's Monitor. I also want to purchase this kind of large monitor.
    Because I am Also a Python Coding Learner....
    By the Way your Explanation Skills are Awesome. I LIKE IT SO MUCH.
    Thank You..
    Waiting for your Reply
    ..... ....... ....... ...............

  • @ayandas8757
    @ayandas8757 2 years ago

    Hi Mariya! Happy new year ❣️✌️🔥

  • @GiasoneP
    @GiasoneP 2 years ago +2

    Very cool. I had no idea this was possible. 👍🏼

    • @PythonSimplified
      @PythonSimplified 2 years ago +1

      Thank you, me neither!!! 😅
      I was working with Pandas for quite some time but never used it to scrape webpages!
      Apparently since it already has the ability to read from a csv file and automatically place the records in a data frame - it can apply very similar principles on web pages as well! 😊

    • @GiasoneP
      @GiasoneP 2 years ago

      @PythonSimplified Because of this video I was able to use Pandas' read_html for a project I'm working on. So much easier! Thanks for the video and a big shout out to Ditto for the recommendation 👏
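As a rough illustration of the `read_html` approach discussed in this thread, the sketch below parses a small inline HTML snippet instead of a live page, so the sample markup and its values are invented for the example. It assumes pandas is installed together with an HTML parser it can use (lxml, or BeautifulSoup with html5lib).

```python
from io import StringIO

import pandas as pd

# Invented sample markup standing in for a downloaded web page.
html = """
<table>
  <tr><th>Country</th><th>Capital</th></tr>
  <tr><td>Canada</td><td>Ottawa</td></tr>
  <tr><td>Japan</td><td>Tokyo</td></tr>
</table>
<table>
  <tr><th>Item</th><th>Price</th></tr>
  <tr><td>Tea</td><td>3.5</td></tr>
</table>
"""

# read_html returns a list with one DataFrame per <table> element it finds.
tables = pd.read_html(StringIO(html))

for index, table in enumerate(tables):
    print(f"Table {index}: {table.shape[0]} rows x {table.shape[1]} columns")
    print(table)
```

Passing a URL string instead of the `StringIO` object should work the same way for static pages, since the function accepts either.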

  • @sudzbyte2215
    @sudzbyte2215 2 years ago

    Really great video!
    Question: What is the extension that pops up in the browser "found solution on GitHub" at 2:48?
    Thank you!

  • @СергейЛасточкин-ы1р

    Amazing girl, amazing content 🔥🔥🔥

  • @MrKor4k
    @MrKor4k 2 years ago

    Would you consider making a video (or a whole series) about web scraping using Scrapy?

  • @rajappasubramanian
    @rajappasubramanian 2 years ago

    Wow... Such a simple way. Thanks a lot sister.
    Can you post a video on how to scrape and perform actions like selecting dates or choosing values from a drop-down on a JavaScript-enabled website in Python?

  • @capcrunch7838
    @capcrunch7838 2 years ago

    Such a great presentation.

  • @semtex2987
    @semtex2987 2 years ago

    nice, you did it - i'm proud of you 😘

  • @wragabrr
    @wragabrr 2 years ago

    I‘m impressed. Thanks 🙏

  • @dandano
    @dandano 2 years ago

    Thank you! I had been doing this the hard way! XD

  • @taschfogster223
    @taschfogster223 2 years ago

    Panadas - sounds like icecream 🤤

  • @elvinrk
    @elvinrk 2 years ago

    Awesome!! Thank you!

  • @janduplessis1357
    @janduplessis1357 2 years ago

    love your videos

  • @jonatthansiancassoto2866
    @jonatthansiancassoto2866 2 years ago +2

    What monitor do you use in your videos? PS: I love your videos

    • @PythonSimplified
      @PythonSimplified 2 years ago +1

      Thank you so much Jonatthan! 😊
      I'm using the Samsung Odyssey G9 ultra-wide 49" monitor, I got it on an amazing deal (it's refurbished certified, so I'm saving 20%-30% just with that 😉)

    • @iberar
      @iberar 2 years ago

      I have this same exact monitor. I love it. If you could do a video on how best to utilize the space on this monitor for coding programs, that would be great! Thank you for your lessons.

  • @ehushubhamgaming3129
    @ehushubhamgaming3129 2 years ago

    Heyy u are a good explainer🎉

  • @good-lychik
    @good-lychik 2 years ago

    nice job as always ;) big like for you!

  • @rrahll
    @rrahll 2 years ago

    Thank you! Interesting )

  • @yoniziv
    @yoniziv 2 years ago

    Wow, this is amazing, thanks!

  • @francisconegrete6056
    @francisconegrete6056 2 years ago

    Great video!!!

  • @waldemarmalecki9050
    @waldemarmalecki9050 2 years ago

    Amazing 😇 I love your way 😁

  • @sudhakarank2474
    @sudhakarank2474 2 years ago +2

    Cool stuff Mariya. Keep going. And shall we expect any Deep learning models with Keras in the future???
    Since I am gonna work on that for my Projects...

  • @cryptolicious3738
    @cryptolicious3738 2 years ago

    awesome video !

  • @miguelangelsalomehernandez8370
    @miguelangelsalomehernandez8370 2 years ago +1

    Very good for complementing my first steps in Python

  • @lakshminarasimhanadimoorth3945
    @lakshminarasimhanadimoorth3945 2 years ago

    Will you web scrape an option chain using pandas, which is in an HTML pattern, for any stock market, say the Indian NSE or NASDAQ?

  • @karthipalani7373
    @karthipalani7373 2 years ago

    Simply cool like you 😃😉

  • @Chuckk12
    @Chuckk12 2 years ago

    Very nice. Thanks

  • @1rudedudette1
    @1rudedudette1 2 years ago

    Can you scrape dynamically generated, Ajax pages?

  • @kokorospirit5006
    @kokorospirit5006 2 years ago +1

    How to avoid 403 Forbidden with some URLs using this code?
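The thread does not answer this question. One thing that sometimes helps, offered here only as an assumption, is to download the page yourself with a browser-like `User-Agent` header and pass the resulting HTML to pandas. The header value and function names below are hypothetical, and a site may still refuse the request for other reasons or forbid scraping in its terms.

```python
import urllib.request

# Hypothetical User-Agent value; some servers reject the default Python one.
USER_AGENT = "Mozilla/5.0 (compatible; table-scraper-example/0.1)"


def build_request(url: str) -> urllib.request.Request:
    """Create a request object that carries a browser-like User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": USER_AGENT})


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Download the page text using the custom header."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


# The text returned by fetch_html could be wrapped in io.StringIO
# and passed to pandas.read_html in place of the raw URL.
```

Separating the download from the parsing also makes it easier to inspect the raw response when a request is refused.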

  • @luferov
    @luferov 2 years ago

    Which monitor is on your desk?)

  • @gavixtech7906
    @gavixtech7906 2 years ago

    Hey Mariya, could you please do an OOP part 2 tutorial covering all the OOP concepts that you didn't cover in part 1?

  • @JohnDoe-hp5kw
    @JohnDoe-hp5kw 2 years ago +1

    Wow, I do it 👍😂
    How can I save it, for example, in CSV?

    • @PythonSimplified
      @PythonSimplified 2 years ago +1

      What a great timing for your question! 😀
      I've just posted a new Pandas tutorial where I show you how to do it:
      ua-cam.com/video/zN2Hua6oII0/v-deo.html
      It will go over many useful Pandas commands and tricks, but if you only need the to_csv() method you can find it at minute: 07:30
      Good luck and I hope it helps! 😁
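For readers who only need the `to_csv()` step mentioned above, here is a minimal sketch. The sample data, the `save_table` helper and the output file name are invented for the example; in practice the DataFrame would be one of the tables returned by `read_html`.

```python
import pandas as pd

# Invented sample data standing in for a scraped table.
df = pd.DataFrame({"name": ["tea", "coffee"], "price": [3.5, 4.0]})

# With no path given, to_csv returns the CSV text instead of writing a file.
csv_text = df.to_csv(index=False)
print(csv_text)


def save_table(table: pd.DataFrame, path: str) -> None:
    """Write a DataFrame to a CSV file without the index column (hypothetical helper)."""
    table.to_csv(path, encoding="utf-8", index=False)


# Example call (not executed here): save_table(df, "my_file.csv")
```

Setting `index=False` leaves out the row numbers, which is usually what you want when the file will be opened in a spreadsheet.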

    • @JohnDoe-hp5kw
      @JohnDoe-hp5kw 2 years ago

      @PythonSimplified I will watch it 👍
      Thank you 👍👍👍

  • @digvijaygujale9433
    @digvijaygujale9433 2 years ago

    Hey Honey, there are a few modules available for extracting tables in 2 lines.

  • @indianameregone
    @indianameregone 2 years ago

    you are awesome!!! thanks a lot

  • @chaghlarblabla5157
    @chaghlarblabla5157 2 years ago

    Hello. I want to extract full articles from WordPress-based sites. The quickest method is supposed to be parsing the RSS feed, but I'm unsure whether you can get the full article that way. I'm sure it's doable somehow, because they do it using PHP, so why not with Python? Would you like to put up a video about harvesting articles?

  • @physclubuz
    @physclubuz 2 years ago

    ❤cool, thank you for your tutorial

  • @udbhav3760
    @udbhav3760 2 years ago

    This is very huge name of a variable but it looks good. That is the new thing. I didn't know that.
    thanks for the video [:-0
    Extra awesome 😎😂👏

  • @dayanandaksh
    @dayanandaksh 2 years ago

    I always learn so much from you.
    thank you 🌹 very much.
    by the way you look lovely in that dress!!! 🥰😍🥰😘

  • @njmanga617
    @njmanga617 2 years ago

    Thanks for video liked it

  • @ingluissantana
    @ingluissantana 2 years ago

    Nice Video!!! Thanks!!!!!

  • @williamweidner5425
    @williamweidner5425 2 years ago

    is there a way to do this without jupyter notebook? just curious, please don't hate

  • @WendyMoule
    @WendyMoule 2 years ago

    I'm looking for something to scrape genealogy DNA web sites. Now to figure out how to do button presses and logins

  • @siamahmed8287
    @siamahmed8287 2 years ago +1

    Heyy.. Have you tried selenium for scraping websites?

    • @PythonSimplified
      @PythonSimplified 2 years ago

      Selenium is actually my all-time favorite! I've used it to download my entire image gallery from Facebook (before permanently erasing my account 😅 hahahaha), I've used it to spam an imposter of mine on Instagram until they gave up, automatically adding an unlimited amount of connections on LinkedIn and even used it to find out if PS5 are back in stock on Best Buy 😊
      I have a bunch of Selenium tutorials if you want to check them out:
      ⭐ interacting with LinkedIn using Selenium:
      ua-cam.com/video/7aIb6iQZkDw/v-deo.html
      ⭐ web scraping entire Facebook image gallery with Selenium:
      ua-cam.com/video/SsXcyoevkV0/v-deo.html
      ⭐ web scraping Instagram images with Selenium:
      ua-cam.com/video/iJGvYBH9mcY/v-deo.html
      (As well as a few on Wayscript's channel - the PS5 restock bot for example)
      It's a very fun and intuitive tool to play with, hope it helps! 😁

    • @siamahmed8287
      @siamahmed8287 2 years ago

      @PythonSimplified oh yeah. I've also watched them. Stupid me forgot about them Hahaha. Btw thanks for the references. And yeah my notes?🙂

  • @sshroot5565
    @sshroot5565 2 years ago

    She is so beautiful as always 🥰😍

  • @krishnagroupgzp2641
    @krishnagroupgzp2641 2 years ago

    Hello miss,,,
    please teach how to Make bulk id card software in python Tkinter GUI,

  • @jayprajapati2208
    @jayprajapati2208 2 years ago

    I'm waiting Django videos 😊...

  • @onlinestore5427
    @onlinestore5427 2 years ago +1

    Nice video

  • @karthikbhandary879
    @karthikbhandary879 2 years ago

    Can you make a video on how to scrape YouTube data??

  • @MonirHossain-zl1yl
    @MonirHossain-zl1yl 2 years ago

    You are awesome

  • @danqiu6413
    @danqiu6413 2 years ago

    very nice and intelligent girl!👍

  • @smalirizvi8026
    @smalirizvi8026 2 years ago +1

    Mariya do you recommend doing Google's tensorflow cert. exam from google developers?
    Are they in demand?

    • @PythonSimplified
      @PythonSimplified 2 years ago +1

      hmmmm... it's a tough question since I don't know much about this program or what exactly this certification exam grants you 😅 In general - I stay away from online certifications because I don't believe employers recognize anything other than BSc or in some cases Boot Camp completions. But when it comes to Google - is there a chance they are recruiting candidates who successfully passed this exam? I'm not sure... 🤷‍♀️
      I did a quick check of the first row of certified developers:
      developers.google.com/certification/directory/tensorflow
      It seems that none of them is working for Google but they do work in the data science industry.
      If you want to make sure you're not missing anything - reach out to a few of these certified developers on LinkedIn and check if they're happy with this program and if it helped them in some way. This will also be very handy in terms of networking - it's always important to make new connections as they sometimes may result in a career opportunity or really good advice! 😁
      So my official recommendation (if it even counts hahahaha, I'm not really an expert) is to do this exam only as an additional certification on top of a BSc or any other widely recognized academic program. However, I don't believe this is an alternative to 4 years of school... if it was I'd quit university right now and go do that instead!! hahaha 🤣🤣🤣
      Good luck, and let me know if you end up taking it, I'm curious to see what it's all about! 😊

  • @return_1101
    @return_1101 2 years ago

    Love your video... But I am very beginner in this field. 😑

  • @jopa7683
    @jopa7683 2 years ago

    How to write code in selenium for WebS Facebook to Open Groups in order

  • @RapperKingable
    @RapperKingable 2 years ago +2

    Hi Mariya. Well done. If possible, Python + Kivy + pandas with saving the data to Excel. Please make a video. Thanks.

    • @PythonSimplified
      @PythonSimplified 2 years ago

      Thank you so much! 😁
      Try the .to_csv() method:
      df.to_csv("my_file.csv", encoding="utf-8", index=False)
      This will save any Data Frame to a CSV file (which can be opened with Excel)
      Good luck! And I hope it helps! 😀

    • @RapperKingable
      @RapperKingable 2 years ago

      @PythonSimplified Thanks. If it doesn't work out, I'll write. OK.

    • @RapperKingable
      @RapperKingable 2 years ago

      @PythonSimplified Hello. This approach didn't work. I solved the problem, but in a different way.

  • @ryd3v
    @ryd3v 2 years ago

    Love it

  • @tonupif
    @tonupif 2 years ago

    Special thanks for the scraping.

  • @diwakar_tsn
    @diwakar_tsn 2 years ago +2

    You're looking soo beautiful today💖❤️❤️🇳🇵😅

  • @bitterbob30
    @bitterbob30 2 years ago

    It's Rudolph the White Nose reindeer.

  • @peaceangell
    @peaceangell 2 years ago

    Nice videos, maybe we can use print("*" * 50) instead of print("****************").

  • @liamhecht1092
    @liamhecht1092 2 years ago

    Can you make a tutorial on Flask?

  • @gtakadze1247
    @gtakadze1247 2 years ago

    Could you share a link to your screen?

  • @ZurioSi
    @ZurioSi 2 years ago

    I feel like Web Scraping with Pandas is simpler than with selenium, isn't it?. Btw what's your opinion on log4j Java exploit? hahaha. Literally, you could hack someone's computer through Minecraft.