This is the ONLY way I'll use Selenium now

  • Published 15 Dec 2024

COMMENTS • 32

  • @MakeDataUseful
    @MakeDataUseful 1 year ago +2

    Fantastic tutorial John, I’m looking forward to having a play with remote and grid. Keep the tutorials coming 🎉

  • @DiegoHerrero-we5qc
    @DiegoHerrero-we5qc 9 months ago +1

    I love the content. I'm just starting out with scraping, and I'm grateful to have come across your channel. At work they are asking me to automate the checking of some websites, which is done manually today. My question is: how do I present the scraping results in a report that says whether each website is OK or down, etc.? Thanks

  • @vaviloffx
    @vaviloffx 1 year ago +3

    Great video as always, John! I think you didn't address why you use one browser for one page only and then close it right away. It would be interesting to know. Don't fresh browsers without any history or cookies seem more suspicious to target sites?

  • @999hypnos
    @999hypnos 9 months ago +3

    Does anyone know if it's possible to use undetected_chromedriver with Selenium Grid?

    • @JohnWatsonRooney
      @JohnWatsonRooney 9 months ago

      Not that I am aware of - although I wouldn’t be surprised if someone has put a solution together

  • @awais.shorts
    @awais.shorts 10 months ago

    Hi Sir, how can we scrape a webpage or website that returns status code 403?
    (Not by saving the HTML.) Kindly suggest another method.

  • @yellowboat8773
    @yellowboat8773 5 months ago

    Is it one core per instance?

  • @searchenginemaster
    @searchenginemaster 4 months ago

    That's awesome!
    I have two questions:
    1. Why don't you use threads for running Firefox?
    2. Your .vim setups :)

  • @MotuDaaduBhai
    @MotuDaaduBhai 4 months ago

    Do you have any paid course on Selenium or Software testing anywhere?

  • @cenanium
    @cenanium 11 months ago +1

    What about launching multiple tabs inside a single browser window for concurrency, instead of launching multiple browser windows that each have a single tab?

    • @JohnWatsonRooney
      @JohnWatsonRooney 11 months ago

      Yes, you can do that as you normally would, but over multiple browsers with Grid. Grid lets you connect remotely to concurrent browsers and manages them for you; inside each one you could have multiple tabs.
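
The setup described in this reply could be sketched roughly as follows. This is a minimal sketch and not the code from the video: it assumes a Grid listening at `http://localhost:4444` with Firefox nodes, and the helper names (`split_batches`, `titles_for_batch`, `scrape_titles`) are hypothetical. Each thread opens one remote browser through Grid and loads its share of the URLs in separate tabs.

```python
from concurrent.futures import ThreadPoolExecutor

GRID_URL = "http://localhost:4444"  # assumed Grid address, adjust to your setup


def split_batches(urls, n_browsers):
    """Deal URLs round-robin into at most n_browsers non-empty batches."""
    batches = [urls[i::n_browsers] for i in range(n_browsers)]
    return [batch for batch in batches if batch]


def titles_for_batch(batch, grid_url=GRID_URL):
    """Open one remote browser and load each URL of the batch in its own tab."""
    # Imported here so the batching helper works without Selenium installed.
    from selenium import webdriver

    driver = webdriver.Remote(
        command_executor=grid_url,
        options=webdriver.FirefoxOptions(),
    )
    try:
        titles = {}
        for i, url in enumerate(batch):
            if i > 0:
                # Selenium 4: open a new tab and switch to it.
                driver.switch_to.new_window("tab")
            driver.get(url)
            titles[url] = driver.title
        return titles
    finally:
        driver.quit()  # ends the session so Grid can reuse the slot


def scrape_titles(urls, n_browsers=2):
    """Run one Grid session per batch, each in its own thread."""
    batches = split_batches(urls, n_browsers)
    if not batches:
        return {}
    results = {}
    with ThreadPoolExecutor(max_workers=len(batches)) as pool:
        for batch_titles in pool.map(titles_for_batch, batches):
            results.update(batch_titles)
    return results
```

Within one session the tabs are still driven one command at a time, so in this sketch the parallelism comes from the separate browsers that Grid hands out, not from the tabs themselves.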

  • @999hypnos
    @999hypnos 9 months ago +2

    nice content

  • @Yagnik_Sojitra
    @Yagnik_Sojitra 1 year ago

    How do I scrape table data with BS4 in Python?
    The table data has nested ul and li tags, so selecting all li tags repeats the data. I couldn't find a method to get only the top-level li tags, from which the nested tags and data can then be obtained. All the li tags have everything in common, and the top-level li tags have no class names.

  • @Aidas_Li
    @Aidas_Li 11 months ago

    John, absolutely no idea on how to, however where can I get your tuition?

  • @nachoeigu
    @nachoeigu 1 year ago

    Great video. One question: what paid proxy do you recommend for web scraping? Which one do you usually use? Thanks, you are a mentor for me :)

  • @Septumsempra8818
    @Septumsempra8818 1 year ago

    Anyone got an alternative to Selenium Wire? I have a site that makes the API call I want to intercept when the page loads, but it's IMPOSSIBLE to replicate the API call. Selenium Wire solves this: I scan the requests made in the background and get what I want. But now I'm stuck with Selenium. Does anyone know reliable alternatives to Selenium Wire?

    • @JohnWatsonRooney
      @JohnWatsonRooney 1 year ago +2

      I've done something similar with Playwright. You can get into the network events from there too, but it wasn't any easier than Selenium Wire.

    • @adnantaufique68
      @adnantaufique68 1 year ago +2

      Playwright is the best tool for intercepting network events. My only gripe is that you need to store the intercepted data in a global data structure rather than just returning it. If that were solved, there would be no alternative to Playwright in terms of both speed and stability.
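
One way to get the intercepted data back as a return value, without a global, is Playwright's `page.expect_response`, which waits for a response matching a URL or predicate. The following is a hedged sketch of that approach rather than what either commenter used: the function names, the Chromium choice, and the `api_fragment` argument are assumptions for illustration, and it presumes the background call returns JSON.

```python
def url_contains(fragment):
    """Build a predicate matching responses whose URL contains `fragment`."""
    def matches(response):
        return fragment in response.url
    return matches


def fetch_api_payload(page_url, api_fragment, timeout_ms=30_000):
    """Load page_url and return the JSON body of the first matching response."""
    # Imported here so the predicate helper works without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch()
        try:
            page = browser.new_page()
            # The block exits once a response satisfying the predicate arrives,
            # or raises a timeout error after timeout_ms milliseconds.
            with page.expect_response(
                url_contains(api_fragment), timeout=timeout_ms
            ) as response_info:
                page.goto(page_url)
            return response_info.value.json()
        finally:
            browser.close()
```

A call might look like `fetch_api_payload("https://example.com/products", "/api/products")`, where both the page URL and the fragment are placeholders.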

    • @piaizao1405
      @piaizao1405 1 year ago

      Why can't you replicate the API call?

    • @anirbanpatra3017
      @anirbanpatra3017 1 year ago +1

      @JohnWatsonRooney Please make a more detailed video on Selenium Wire and its alternatives, including the best practices.

    • @Septumsempra8818
      @Septumsempra8818 1 year ago

      @adnantaufique68 thanks for this

  • @bakasenpaidesu
    @bakasenpaidesu 1 year ago +1

    Cool

  • @shubhamghag1704
    @shubhamghag1704 1 year ago +1

    Hey, I love your videos. Can you make a video on how to use the new chromedrivers for web scraping with Selenium on Mac and Windows? That would be really helpful, as I am planning to scrape a lot of data for a machine learning project.

  • @Septumsempra8818
    @Septumsempra8818 1 year ago +1

    Yes!!!

  • @oribrosh6458
    @oribrosh6458 1 year ago +1

    When I'm doing that I get "Max. Concurrency: 1". How do I change it to 12?

    • @JohnWatsonRooney
      @JohnWatsonRooney 1 year ago

      It depends on CPU cores, and if you're running it via Docker Compose you need the SE max nodes line in the yml file.
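
To check what concurrency a running Grid actually reports, its `/status` endpoint can be queried. The sketch below assumes a Grid 4 hub at `http://localhost:4444` whose status payload lists nodes with a `maxSessions` field; the function names are hypothetical. The "SE max nodes line" in the reply most likely refers to the `SE_NODE_MAX_SESSIONS` environment variable read by the official docker-selenium node images (with `SE_NODE_OVERRIDE_MAX_SESSIONS=true` needed to go above the detected core count), but that reading is an assumption, not something the reply states.

```python
import json
from urllib.request import urlopen


def total_max_sessions(status):
    """Sum maxSessions over every node listed in a Grid /status payload."""
    nodes = status.get("value", {}).get("nodes", [])
    return sum(node.get("maxSessions", 0) for node in nodes)


def grid_max_sessions(grid_url="http://localhost:4444"):
    """Fetch /status from the Grid and return its total session capacity."""
    with urlopen(f"{grid_url}/status", timeout=10) as resp:
        return total_max_sessions(json.load(resp))
```

If the number printed by `grid_max_sessions()` is still 1 after editing the compose file, the node containers probably did not pick up the new environment and need to be recreated.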