ScrapeGraphAI - REVOLUTION in WEB SCRAPING!!!

  • Published 3 Oct 2024

COMMENTS • 28

  • @SynergyOfTwo 3 months ago +2

    I found ScrapeGraphAI pretty useless; it doesn't work except for the simplistic examples.

    • @TomsTechAcademy 3 months ago

      It’s a great tool for people without Python / BeautifulSoup skills. For most cases that are a bit more complex, the conventional approach is probably the best ;)

  • @MrMoonsilver 4 months ago +5

    It extracted the titles, not the articles themselves.

    • @raunaqss 4 months ago

      Thanks for writing this

    • @ShihChiehLee 1 month ago

      Can you do a walkthrough that reads a news web page from a given URL, works out the context of the news, and produces a summary through an LLM, ideally using Ollama?

  • @userou-ig1ze 4 months ago

    Thanks for demoing the tool

    • @TomsTechAcademy 4 months ago

      Thank you for your comment, much appreciated!

  • @Bbyanar 2 months ago +1

    Great video, thx!

  • @256Services 1 month ago

    The video could have been more helpful if it were about scraping useful marketing information like emails and phone numbers.

    • @TomsTechAcademy 1 month ago

      Thanks for your comment. It’s impossible for me to make a video about every possible use case. Instead, I try to make videos that can be easily applied to other scenarios. In this specific case, it’s just a matter of modifying the prompt and the URL and you’re good to go 😉

  • @amandamate9117 4 months ago

    This video tutorial was surprisingly beginner friendly.

    • @TomsTechAcademy 4 months ago

      Thank you for your comment, Amanda. Glad my video was useful for you :) I always try to make my videos as beginner friendly as possible.

  • @aleksdeveloper698 1 month ago

    Can you scrape data from Google Maps?

  • @iamrrricardo 3 months ago

    Thank you so much for showing the demo! I still have a question: does ScrapeGraphAI support full stack crawling, or does it only work for one page?

  • @zemmouripopodori 3 months ago +1

    I get an API error.

    • @TomsTechAcademy 3 months ago

      If you receive any error, feel free to join my Discord (link in description) and post your question there 😀

  • @anishmanandhar1203 4 months ago

    Whenever I install scrapegraphai, I am hit by: AttributeError: module 'pkgutil' has no attribute 'ImpImporter'. Did you mean: 'zipimporter'?
    [end of output]

    • @TomsTechAcademy 4 months ago +1

      Hi, I think I had this exact same error - are you running Python in a virtual environment? If you run Python 3.12 you can try switching to 3.11
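
      The reply above suggests moving from Python 3.12 to 3.11. For context, `pkgutil.ImpImporter` was removed in Python 3.12, so older packaging code that still refers to it fails there. The snippet below is only a sketch for checking which interpreter is active before installing; the 3.11 cutoff is taken from this thread and `is_known_good` is a hypothetical helper name, not something scrapegraphai provides.

      ```python
      import sys

      # Assumption taken from the thread: 3.11 installs cleanly, 3.12 does not.
      LAST_KNOWN_GOOD = (3, 11)


      def is_known_good(version=None):
          """Return True if the (major, minor) version is at or below 3.11."""
          major, minor = (version or sys.version_info)[:2]
          return (major, minor) <= LAST_KNOWN_GOOD


      if __name__ == "__main__":
          current = "%d.%d" % sys.version_info[:2]
          if is_known_good():
              print(f"Python {current}: at or below 3.11, the version suggested above.")
          else:
              print(f"Python {current}: newer than 3.11; consider a 3.11 virtual environment.")
      ```

      Running this with the same interpreter you use for `pip install` shows whether the virtual environment is really on 3.11 or has silently picked up a newer system Python.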

  • @s6yx 3 months ago

    I just give ChatGPT the HTML data as an example, tell it what I want scraped, and it generates the code for me.

    • @TomsTechAcademy 3 months ago

      Thanks for sharing your expertise, that’s another way of doing it 😀

  • @DuhaMasoodi 3 months ago

    When I try this, I get the error: asyncio error cannot be called

    • @TomsTechAcademy 3 months ago

      Hi, thanks for your comment. I have seen this error before, is it possible you're running Python 3.12? If so, try running it in a venv with 3.11. Good luck! If you need more help, you can reach me on my Discord (link in description).

  • @metazoo4407 4 months ago

    I keep getting this error: "PermissionError: [Errno 13] Permission denied: 'wired.xlsx'". I've typed the code exactly as shown in your video and I still get that error. Any help would be greatly appreciated.

    • @TomsTechAcademy 4 months ago +2

      Hi Meta, it looks like you're trying to write a file to a directory where you don't have write permission. Assuming you're using Windows, can you try the following?
      Navigate to your Desktop and right-click any of your files there. Click Properties, go to the "General" tab, and read the Location field. It should be something like "C:\Users\thoma\Desktop" (where thoma is your username). Copy this location into VS Code, add \wired.xlsx, and replace every backslash with a double backslash. You should now have something like this:
      df.to_excel("C:\\Users\\thoma\\Desktop\\wired.xlsx", index=False)
      Run the code again. Let me know whether this works for you. If not, feel free to join my Discord if you need more help (link in video description).
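
      As a variation on the advice above, the output path can be built with `pathlib` so that no backslashes need escaping by hand. This is a minimal sketch, assuming the Desktop folder sits directly under the user's home directory (not true on every machine, for example when the Desktop is redirected to a cloud folder); `desktop_output_path` is a hypothetical helper, and `df` stands for the DataFrame from the video.

      ```python
      from pathlib import Path


      def desktop_output_path(filename, home=None):
          """Build <home>/Desktop/<filename>; `home` defaults to the current user's home."""
          base = Path(home) if home is not None else Path.home()
          return base / "Desktop" / filename


      out_path = desktop_output_path("wired.xlsx")
      print(out_path)

      # With the DataFrame from the video (pandas accepts path-like objects):
      # df.to_excel(out_path, index=False)
      ```

      Printing the path first makes it easy to confirm the folder exists and is writable before the export runs.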

    • @metazoo4407 4 months ago

      @TomsTechAcademy Hi, I managed to get it to produce two results, but now I'm having issues again. It is now giving me this message:
      "Traceback (most recent call last):
      File "c:\scrapperai\main.py", line 26, in
      df = pd.DataFrame(result['pages'])
      ~~~~~~^^^^^^^^^"
      Any help would be greatly appreciated!
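
      The traceback above is cut off before the error type, but the markers sit under `result['pages']`, which suggests the lookup itself failed, for instance because the model's output did not contain a `pages` key on that run. The sketch below is one way to check the result before building the DataFrame; `rows_from_result` is a hypothetical helper, and the possible causes are assumptions rather than something confirmed in the thread.

      ```python
      def rows_from_result(result, key="pages"):
          """Return the value stored under `key`, or raise an error that shows what came back."""
          if not isinstance(result, dict):
              # Covers the case where the scraper returned a string, a list, or None.
              raise TypeError(f"expected a dict, got {type(result).__name__}: {result!r}")
          if key not in result:
              # Shows which keys are present, so the code or the prompt can be adjusted.
              raise KeyError(f"{key!r} not found; available keys: {list(result)}")
          return result[key]


      # Usage with the code from the video (pd is pandas, result is the scraper output):
      # df = pd.DataFrame(rows_from_result(result))
      ```

      Because an LLM-backed scraper may name its top-level key differently from run to run, printing `result` once before indexing into it is often the quickest way to see what needs changing.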