Selenium Web Scraping is too Slow. Try This.

  • Published 7 Jan 2025

COMMENTS • 43

  • @parisneto
    @parisneto 2 months ago +11

    I have been scraping for 6 months, and since YouTube helped me find you (I was not searching!) I have learned so much and evolved my work so far in the past 2 weeks that it feels like years of experience shared. Thanks for the generosity and dedication.

    • @ibekwevictor1158
      @ibekwevictor1158 1 month ago

      What kind of jobs can you get as a scraper sir?

  • @vladgri7754
    @vladgri7754 2 months ago +2

    You always bring fresh ideas!

  • @DanKayser
    @DanKayser 2 months ago +5

    Cool! I've always scraped data on a single driver and yeah, the process turns slow quickly... This is awesome, but I'm not very familiar with async/await, so I'm going to do my research!
    Thanks!

  • @BillyMcCord
    @BillyMcCord 5 days ago

    Can the browser be headless so you can use a docker container?

  • @StanHordon
    @StanHordon 1 month ago

    Amazing for a beginner in scraping. Your video is a lifesaver. Thank you.

  • @attifbhuttoa3384
    @attifbhuttoa3384 2 months ago +1

    I love your videos. They are amazing 💪

  • @quintinsweeney2927
    @quintinsweeney2927 19 days ago

    I am currently working on a similar project and this tutorial has helped me so much! Quick question, if I am interested in gathering the data from a grid similarly to this, is it necessary to open all the links to the items? I want to scrape the price, item name, category etc. and that can be found directly on the grid. Would the downside be that you won't have access to the data in JSON format?

  • @GouravMittal-fi4vg
    @GouravMittal-fi4vg 17 days ago

    Hey,
    Thanks for the video and guidance.
    Can we do the same with a dynamic website for live sports data that updates every second?

  • @admaxcool
    @admaxcool 1 month ago

    Is there a keyword utility library on top of Selenium, like SeleniumBase but without the recorder or demo modes (a slimmed-down version with handy utilities)?

  • @ebukaume
    @ebukaume 1 month ago

    Nice approach! What do you think about using a Semaphore instead of a temporal rate limiter?
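
A minimal sketch of the semaphore idea raised in this comment, not the code from the video. It assumes plain `asyncio`; the names `fetch_page` and `scrape_all`, the example URLs, and the `asyncio.sleep` call standing in for a real browser page load are all hypothetical. The point it illustrates: a semaphore caps how many tasks are in flight at once, whereas a time-based rate limiter caps how many start per interval.

```python
import asyncio


async def fetch_page(url, sem, state):
    # Only `limit` coroutines can be inside this block at the same time.
    async with sem:
        state["active"] += 1
        state["peak"] = max(state["peak"], state["active"])
        await asyncio.sleep(0.01)  # placeholder for the real page load
        state["active"] -= 1
        return f"done:{url}"


async def scrape_all(urls, limit=3):
    sem = asyncio.Semaphore(limit)
    state = {"active": 0, "peak": 0}  # tracks observed concurrency
    results = await asyncio.gather(*(fetch_page(u, sem, state) for u in urls))
    return results, state["peak"]


def demo():
    # Hypothetical URLs, used only to drive the sketch.
    urls = [f"https://example.com/item/{i}" for i in range(10)]
    return asyncio.run(scrape_all(urls, limit=3))
```

Calling `demo()` returns the results in input order along with the highest concurrency observed, which should never exceed `limit`. A semaphore alone does not space requests out in time, so the two approaches can also be combined.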

  • @stephena8965
    @stephena8965 1 month ago

    Great tutorial as always! How do you add headers to the request? The docs are incomplete and I've never used Selenium, so it's a little confusing with network interceptions. The reason I'm asking is that the page isn't loading fully: it always times out after loading the navbar, and I think it's because I need some headers, or maybe because I'm not using any proxies. Thanks in advance! I'll post my solution if I find one.

  • @hamed6899
    @hamed6899 2 months ago +1

    Great video, but please make a video about how to find hidden APIs.

  • @switch8291
    @switch8291 2 months ago +1

    Hey John,
    We've seen how you use a list of selectors in a JSON file to scrape multiple websites.
    Is there a library to get the selectors automatically, or is this part manual for each website? Or is there a way to automate it using JSON schemas for each website?
    Stay golden!

    • @voinywolnyprod3046
      @voinywolnyprod3046 1 month ago

      You can automate it yourself by collecting selectors individually (in a set, for example) for each website, because each website will have different selectors.

  • @lunaticberserker5869
    @lunaticberserker5869 2 months ago +1

    NoDriver lacks documentation and methods that undetected-chromedriver used to have, but it's faster than its predecessor.

  • @viratchoudhary6827
    @viratchoudhary6827 2 months ago

    Hi John, my question is whether using a self-hosted proxy with multiple ports necessitates a Proxyscrape subscription, e.g. localhost:2001, *:2002, *:2003.

  • @adhamkhaled8687
    @adhamkhaled8687 2 months ago +1

    Which is better to use, SeleniumBase or driverless?

    • @JohnWatsonRooney
      @JohnWatsonRooney 2 months ago +1

      Both seem to work well, but I haven't used either enough to say. Right now I am using selenium driverless more.

  • @towhidurrahman8961
    @towhidurrahman8961 2 months ago +1

    Make a video on selenium grid.

  • @adhishtanaka
    @adhishtanaka 2 months ago

    What Linux do you use? Can you make a video about your web scraper PC setup that explains the OS, tools, and IDE you currently use day to day?

    • @wiresploit
      @wiresploit 2 months ago +1

      He uses Fedora Linux with i3wm (a tiling window manager).

  • @zaidyousaf4449
    @zaidyousaf4449 2 months ago

    Make a video on some online scraping tools like Phantom Buster. How do they do it? They also handle platforms like LinkedIn, where login is needed and JS rendering is heavily involved. How do they do it from their cloud services? I want to know the technique so that we can replicate at least 10% of what they do, instead of just using Selenium, Scrapy or Puppeteer.

  • @MyKnighty
    @MyKnighty 2 months ago +1

    Hey John, can I run it in headless mode?

  • @Volkskomissar
    @Volkskomissar 2 months ago +7

    Wow, never seen a 22-minute ad/commercial disguised as a tutorial before.

    • @truthwillout1980
      @truthwillout1980 2 months ago

      You haven't? That's every video.

    • @kareemyoussef2304
      @kareemyoussef2304 2 months ago +2

      Isn't this the entire channel now? Every time I see his title saying "something is too slow/detectable, try this instead" I know it's a 20-minute waste of time.

    • @genericname1296
      @genericname1296 2 months ago

      I don't know what you are so upset by here. He clearly states in the description who he is affiliated with and spends the entire video teaching you how to do what his title says. Nowhere does he say only one proxy service can do this.

  • @atulraaazzz2931
    @atulraaazzz2931 2 months ago

    Can you share the code base?

  • @Aidas_Li
    @Aidas_Li 2 months ago +1

    Nice video.

  • @saadkhan883
    @saadkhan883 2 months ago

    Can you share the code with us?

  • @bakasenpaidesu
    @bakasenpaidesu 2 months ago +2

    .

  • @bluekeybo
    @bluekeybo 2 months ago +3

    It's very distracting to watch so many typos as you type, and then the deleting and correcting of them. Not sure what the best way to fix that is. Maybe show the code chunks already typed and explain them instead of typing? Thank you!

    • @JohnWatsonRooney
      @JohnWatsonRooney 2 months ago +3

      I understand, and that's come up before. I can copy/paste chunks, which I have done in the recent past, but I wanted to show my working, how I got there, etc. I should just practice typing more…

    • @ScalpersInc
      @ScalpersInc 2 months ago +6

      @JohnWatsonRooney no, you're good, brother

    • @profbiyi
      @profbiyi 2 months ago

      @JohnWatsonRooney you are doing an excellent job. Thanks so much.

    • @cybern9ne
      @cybern9ne 2 months ago

      Then don't watch the channel. You just want to copy and not learn.

    • @carlos-ferreira
      @carlos-ferreira 2 months ago +1

      @JohnWatsonRooney I disagree with the person above. I actually like that you're coding while recording and showing how you actually work.