François from Octoparse
François from Octoparse
  • 22
  • 151 868
Scrape Data from Yelp (including phone numbers)
Yelp is one of the largest business directory websites on the Internet. This tutorial will show you how to collect business information on Yelp.
Downloading Octoparse:
www.octoparse.com/download/
To get your first 100 B2B leads for free:
scrap.id/s/1xY
For a Web Scraping request:
fdufaurboidin@gmail.com
XPaths used:
Pagination
//a[@aria-label="Next"]
Loop_Item (replace by the correct sign):
//li[descendant::h2[starts-with(text(), "All")]]/following-sibling::li[position()INFERIOR_THAN_SIGN=10]
Click_Item:
//a[ancestor::h3]
Extract_Data:
Store_Name:
//h1
Rating:
//span[a[@href="#reviews"]]/preceding-sibling::span[1]
Reviews:
//a[@href="#reviews"]
Type:
//span[a[starts-with(@href, "/search?find_desc")]]/..
Address:
//p[a[text()="Get Directions"]]/following-sibling::p[1]
Website:
//p[text()="Business website"]/following-sibling::p[1]
Phone_Number:
//p[text()="Phone number"]/following-sibling::p[1]
Переглядів: 1 272

Відео

Scrape Data from Google Maps (in 2024)
Переглядів 18 тис.7 місяців тому
How to Scrape Phone Numbers from Google Maps using Octoparse. Downloading Octoparse: www.octoparse.com/download/ To get your first 100 B2B leads for free: scrap.id/s/1xY For a Web Scraping request: fdufaurboidin@gmail.com XPaths used: Scroll: //div[@role="feed"] Loop Item: //div[@role="feed"]//a[@aria-label and starts-with(@href, "www.google.com/maps")] Title: (//div[@role="main" and @aria-labe...
Scrape Competitor Prices from eBay
Переглядів 38110 місяців тому
This tutorial will show you how to scrape product data from eBay. Download Octoparse: www.octoparse.com/download/ To get your first 100 B2B leads for free: scrap.id/s/1xY For a Web Scraping request: fdufaurboidin@gmail.com XPaths used: Loop item: //li[contains(@class, "item-card list")] Pagination: //a[@aria-label="Go to next search page"] Click_Item: //a[h3[contains(@class,"title")]] Regex: \d...
Scrape Professional Details from Houzz (Interior designers, Architects, Painters)
Переглядів 42310 місяців тому
How to Scrape Business Names, Addresses, Phone Numbers and Websites of professionals working for homeowners. Downloading Octoparse: www.octoparse.com/download/ To get your first 100 B2B leads for free: scrap.id/s/1xY For a Web Scraping request: fdufaurboidin@gmail.com XPaths used: Pagination: (//a[contains(@class, "next")])[1] Loop Item: //li[contains(@class, "pro-search-results")] Click Item: ...
How to Automatically Scrape Data Every Day
Переглядів 1,5 тис.Рік тому
Downloading Octoparse: www.octoparse.com/download/ To get your first 100 B2B leads for free: scrap.id/s/1xY For a Web Scraping request: fdufaurboidin@gmail.com
XPath Tutorial (and How to Use them for Web Scraping)
Переглядів 4,3 тис.Рік тому
XPaths are essential when scraping data (even if you use No-Code tools). Downloading Octoparse: www.octoparse.com/download/ To get your first 100 B2B leads for free: scrap.id/s/1xY For a Web Scraping request: fdufaurboidin@gmail.com 01:39 How to Read an HTML Document 05:42 The Different Kinds of XPaths 08:40 The "Cheat Sheet" 29:15 Writing your XPaths for Web Scraping
Scrape DuckDuckGo Search Results
Переглядів 868Рік тому
Downloading Octoparse: www.octoparse.com/download/ To get your first 100 B2B leads for free: scrap.id/s/1xY For a Web Scraping request: fdufaurboidin@gmail.com 01:19 Create a Pagination 03:01 Create a Loop Item 03:29 Extract Data 04:51 Adjust the Timeouts 05:13 Run your Task XPaths used: - Pagination: //a[@class="result more btn btn btn full"] - Loop item: //div[@id="links"]//article - URL: //h...
Scrape Property Data (Price per Square Foot, Facilities, Adress...)
Переглядів 372Рік тому
Scrape property data from Realtor.com Downloading Octoparse: www.octoparse.com/download/ To get your first 100 B2B leads for free: scrap.id/s/1xY For a Web Scraping request: fdufaurboidin@gmail.com 01:01 Start a New Task 01:06 Create a Pagination 04:43 Set Up your Proxy 06:07 Create a Loop Item 07:17 Click on Each Element 08:05 Extract Data 10:11 Adjust the Settings 11:38 Run the Task XPaths us...
Scrape Data from Yellow Pages (in 2023)
Переглядів 4 тис.Рік тому
How to Scrape Phone Numbers & Emails from the US Yellow Pages using Octoparse. Downloading Octoparse: www.octoparse.com/download/ To get your first 100 B2B leads for free: scrap.id/s/1xY For a Web Scraping request: fdufaurboidin@gmail.com XPaths used: Pagination: //a[@class="next ajax-page"] Loop Item: //div[@class="result"] Click Item: //a[@class="business-name"] Title: //h1 Phone: //p[@class=...
Scrape Hotel Information from Trip.com
Переглядів 1,4 тис.Рік тому
How to Scrape Hotel Data with Octoparse? Downloading Octoparse: www.octoparse.com/download/ To get your first 100 B2B leads for free: scrap.id/s/1xY For a Web Scraping request: fdufaurboidin@gmail.com XPaths used: Pagination: //div[@class="list-btn-more_content"] Loop Item: //div[@class="hotel-info"] Name: //span[@class="name"] Location: //p[@class="transport"] Rating: //div[@class="score"]/spa...
Scrape Footbal Data (from WhoScored)
Переглядів 4 тис.Рік тому
Scrape Footbal Data (from WhoScored)
Scrape Reviews from Google Maps (with Octoparse)
Переглядів 4,1 тис.Рік тому
Scrape Reviews from Google Maps (with Octoparse)
Scrape Twitter Followers of a public account
Переглядів 4,1 тис.Рік тому
Scrape Twitter Followers of a public account
Scrape Videos from TikTok (No Code)
Переглядів 2,4 тис.Рік тому
Scrape Videos from TikTok (No Code)
Scrape Data from Google Maps (in 2023)
Переглядів 90 тис.Рік тому
Scrape Data from Google Maps (in 2023)
No Code Web Scraping Tool: How to Extract Data with Octoparse
Переглядів 13 тис.Рік тому
No Code Web Scraping Tool: How to Extract Data with Octoparse
I leave Octoparse
Переглядів 1,1 тис.Рік тому
I leave Octoparse

КОМЕНТАРІ

  • @egs4975
    @egs4975 3 години тому

    It's censorship of police reviews.

  • @tranquillity425
    @tranquillity425 14 годин тому

    how to scrape only prices products name and hyper link and store name ?

  • @phoenixgaming231
    @phoenixgaming231 День тому

    I have a question

  • @AkshayDeoke
    @AkshayDeoke 10 днів тому

    Why this has to be so difficult and such a steep learning curve. I follwed your video but page scrolling thing didnt work.

  • @FabricioBertassi
    @FabricioBertassi 18 днів тому

    Thanks for the video! I was helpful to get the affiliates list in twitter, but my Scroll doesn't work right, because it goes to the end of the list, and only capture the last users of it... any guess of what I'm doing wrong?

  • @marsdiggy8
    @marsdiggy8 19 днів тому

    How do you save a workflow? I checked the documentation on OP but didn't find anything. Is this a premium feature? Thank you.

  • @bungaazizah2802
    @bungaazizah2802 20 днів тому

    Hi, thankyou for sharing. But I think I followed all the steps and still can’t scrape a data. I think there’s something wrong with the scroll xpath, but I already watched your xpath tutorial and use many tools to find the right xpath, still it just doesn’t working out. Do you have any idea about this?

    • @francoisfromoctoparse
      @francoisfromoctoparse 20 днів тому

      Hi, If you can't scrape a data, maybe the problem is coming from the loop item and not from the scroll.

  • @philtarry2558
    @philtarry2558 26 днів тому

    Hi - just starting out with Octoparse and this is really helpful. Are you able to share a copy of the word doc with all the different xpath types? Thanks!

  • @slimshadeless
    @slimshadeless Місяць тому

    Solid tutorial, helped me a lot! Adding chapter breaks to the time bar to divide up the topics in the video would be a great addition. It would make this tutorial a little more navigation friendly for people like me who constantly come back to the video for refreshers. ** Edit - There are markers in the description! Exactly what I needed, missed them earlier **

  • @topfoodlind260
    @topfoodlind260 Місяць тому

    need your help

  • @topfoodlind260
    @topfoodlind260 Місяць тому

    How can I export locations codes plus from my map to Excel?

  • @dhara3987
    @dhara3987 Місяць тому

    if we have multiple variants in details page then it is possible to scrape data ?

  • @cr1730
    @cr1730 Місяць тому

    very helpful. i really needed this video. thanks alot

  • @DCVisuals974
    @DCVisuals974 2 місяці тому

    Salut François, Une question. Quand tu obtiens tes 120 leads, comment fais-tu pour générer les leads suivants après le 120 eme ? Merci

    • @francoisfromoctoparse
      @francoisfromoctoparse 2 місяці тому

      Salut, Tu ne peux pas obtenir plus de 120 leads à la fois avec cette méthode. La seule façon de faire serait de segmenter la recherche.

  • @gloration972
    @gloration972 2 місяці тому

    I just found your videos. I think you have great content, thank you for creating the tutorials and sharing your knowledge. English is my second language, I think you did great! It’s not easy to explain things in another language, especially when it comes to software 😅. Thank you for your videos 🙏🏻

  • @chankk6038
    @chankk6038 2 місяці тому

    Hi, why is the data extracted stop at the 7th row? it should have more rows..

    • @francoisfromoctoparse
      @francoisfromoctoparse 2 місяці тому

      Hi, It is likely because the scrolling process doesn't work in your case.

  • @danielaamaro5210
    @danielaamaro5210 2 місяці тому

    Hi François, what's the extension you're using on your browser to see the results of the xpath query? Thanks! Great video

  • @e.v.vasilkova
    @e.v.vasilkova 2 місяці тому

    Thank you very much! This is what I was looking for

  • @MediaMusicManiaM
    @MediaMusicManiaM 2 місяці тому

    very nice article thanks

  • @bolumekenejoan6360
    @bolumekenejoan6360 2 місяці тому

    Hi @francoisfromoctoparse thank you very much for this video tutorial but can I ask you go more details using case scenarios and possible recommend course to take for those looking to further into scrapping. I am very much interested in learning scrapping especially with other tools. Thank you.

  • @DiegoNovello86
    @DiegoNovello86 2 місяці тому

    Hi, my Titles is going wrong, its getting Sponsered, a think google changed something. Title: (//div[@role="main" and @aria-label]//h1)[last()] Could you help how identify the new path?

  • @johnnyescondido7430
    @johnnyescondido7430 2 місяці тому

    Has something changed in the last year with G or this software? Everything works great in test mode, but when running the software it isn't clicking "More" when it appears. If I click the 3 dots within the "if long reviews" box it says "text not found, condition is false," which make no sense because I'm looking right at it. Then when selecting click item it does click More. Very puzzling. Shouldn't it use the long reviews branch when more is displayed?

  • @biswajitmohanty8532
    @biswajitmohanty8532 2 місяці тому

    Is it free?

  • @TotalFootball365
    @TotalFootball365 2 місяці тому

    How would I scrape data if I have to click a tab within a table first? For example if I was on player statistics for a league but wanted to scrape offensive stats instead of the summary which is the table which initially loads?

  • @rezekihomemart5823
    @rezekihomemart5823 2 місяці тому

    Everything works fine except phone number. Whats wrong with it?

  • @latlov
    @latlov 2 місяці тому

    How about scraping some reviews for each business? reviews based on a specific rating (worse and best)? how to achieve that?

  • @Visualife
    @Visualife 2 місяці тому

  • @feritkardal
    @feritkardal 3 місяці тому

    Thanks for the video! Very nice! But i couldnt scrape the review texts (the comments) can you help 😢

  • @cjt208
    @cjt208 3 місяці тому

    This is so cool. Sound like Jean Girard from Talladega Nights.

  • @AbeikuDearGod
    @AbeikuDearGod 3 місяці тому

    This is good. Can you build it to scrap more and more data from sofascore please?

  • @IamTechAlchemist
    @IamTechAlchemist 3 місяці тому

    Hi Francois, Thank you very much for this video😊, i would like to ask is this method possible without xpath, with only pointing and clicking.

    • @francoisfromoctoparse
      @francoisfromoctoparse 3 місяці тому

      Hi, Of course , it is possible ;) But it will be more likely that there is an error. And the accuracy will be lower.

    • @IamTechAlchemist
      @IamTechAlchemist 3 місяці тому

      Can you make a video on it please if possible

  • @matinshoon
    @matinshoon 3 місяці тому

    This was an amazing tutorial, thank you!

  • @Iamhershey69
    @Iamhershey69 3 місяці тому

    New my favorite information station

  • @deborahhudareal
    @deborahhudareal 3 місяці тому

    thank you

  • @garymoore1288
    @garymoore1288 3 місяці тому

    the xpath for the loop item doesn't seem to work for me, could it have changed? Also when I try clicking it only seems to return the result I click (only one A in the list)

    • @francoisfromoctoparse
      @francoisfromoctoparse 3 місяці тому

      Did you run your task? It is the most concrete way to indentify a potentiel mistake.

  • @imtime22
    @imtime22 3 місяці тому

    Thank you

  • @yassa.9624
    @yassa.9624 4 місяці тому

    Bonjour Francois, How can i extract the speed limit data from a circuit on google maps please ?

    • @francoisfromoctoparse
      @francoisfromoctoparse 3 місяці тому

      Hello, I think you should take a look at the Google Maps API. Maybe they have a feature for this.

  • @TaranpreetKaur-mw7je
    @TaranpreetKaur-mw7je 4 місяці тому

    Hii I have some issues regarding this tool and i am trying to get help can u please help me.. Please let me know how can i contact you.?

  • @muratkekec5994
    @muratkekec5994 4 місяці тому

    how to scrap pages that is redirecting to another page

    • @francoisfromoctoparse
      @francoisfromoctoparse 4 місяці тому

      If the page is from another website, I don't think you can do that with Octoparse.

    • @muratkekec5994
      @muratkekec5994 4 місяці тому

      @@francoisfromoctoparse what do you mean.? I have the link and when I enter url it directs to another website. Original website has all the information I want to scrab. can you try on your end?

  • @muratkekec5994
    @muratkekec5994 4 місяці тому

    how to scrap pages that is redirecting to another page?

  • @youngpark5286
    @youngpark5286 4 місяці тому

    thanks for the amazing video, somehow I do not get website and phone number. but your xpath worked with the xpath helper.

  • @Scuurpro
    @Scuurpro 4 місяці тому

    Have a big issue with Locate web element via nearby text. When I enter the Xpath I see the DIvs and everything below but won't let me apply

  • @salpro22
    @salpro22 4 місяці тому

    Where there any duplicate entries when you ran the test? I tested this out on a few GMB. First one had 960 reviews. I stopped it when I came back and saw it had parsed over 6k reviews. Tons of duplicate entries, It looks like it parses the same entry twice depending on whether it has to click MORE in the review. Another sample test with only 5 reviews. The output had 25 entries. Same issues though. I thought the branch was supposed to work to do A if xyz, if not do B. Is there something I'm missing?

    • @francoisfromoctoparse
      @francoisfromoctoparse 4 місяці тому

      Like that, I cannot really tell what the problem is. It might look like a "loop item" problem to me.

    • @salpro22
      @salpro22 3 місяці тому

      @@francoisfromoctoparse I figured it out. Had the loop messed up. Do you have any idea how to get the unix timestamp for the date a review was submitted? I know it's in the code, but I can't figure it out. So instead of getting, "a month ago" you get the exact date?

  • @salpro22
    @salpro22 4 місяці тому

    Very helpful video. Much appreciated! It worked perfectly.

  • @valentinthalmann361
    @valentinthalmann361 4 місяці тому

    Hey all :) I found out that sometimes the script gets confused about the website_url because there are two aria labels with the "website" tag - one in the list of the companies and one in the side tile. This could result in displaying a striped square and only the word "Website". To fix this, right-click the website row > customize field > select other fields > href. It won't change the preview, but by running the loop, it will only take the URLs. I'm not sure if this is a bug or if it's a problem with the two elements.

    • @youngpark5286
      @youngpark5286 4 місяці тому

      I am facing to the same issue for placeid, website, phone number. Do you know how to solve this? and I am not sure how to get lat and long too.

    • @thegundamkingr
      @thegundamkingr 2 місяці тому

      Thanks for sharing this tip, you're a star

  • @MrMattden
    @MrMattden 4 місяці тому

    Thanks for the video, do you know if it's possible to extract mail address ?

  • @solinari60
    @solinari60 4 місяці тому

    Hi. Is this totally legal and legit? I need a tool to collect comments from Facebook, Instagram to conduct behavioural analysis of consumers. It will be for my business so it has to be compliant to regulations

    • @francoisfromoctoparse
      @francoisfromoctoparse 4 місяці тому

      Hi, Better not to scrape Facebook related websites. I think you cannot do it through Octoparse anyway.

    • @solinari60
      @solinari60 4 місяці тому

      ​@@francoisfromoctoparsethank you!

  • @user-bo8ls6uc3c
    @user-bo8ls6uc3c 5 місяців тому

    I am Unable to scarape playstore reviews

  • @Retorneadordeceramica
    @Retorneadordeceramica 5 місяців тому

    Hi, I am following your tutorial but the second Loop Item with the Variable List did not work for me? Do you know why?

    • @francoisfromoctoparse
      @francoisfromoctoparse 4 місяці тому

      Hi, You mean the loop item with the different companies? In that case, did you try to rewrite the XPath?

    • @n1ghttra1n
      @n1ghttra1n Місяць тому

      @@francoisfromoctoparse you have a space after maps( could be because chrome checkmarks safe links) if he removes the space it, works

  • @cinematicworld4874
    @cinematicworld4874 5 місяців тому

    I have made the whole workflow and when I run it, it makes the fields but extracts the data for same one business again and again. Please help me

    • @francoisfromoctoparse
      @francoisfromoctoparse 5 місяців тому

      Probably a Loop Item error.

    • @cinematicworld4874
      @cinematicworld4874 5 місяців тому

      @@francoisfromoctoparse how can i resolve this😢

    • @francoisfromoctoparse
      @francoisfromoctoparse 5 місяців тому

      ​@@cinematicworld4874 Unfortunately, I lack of time to answer specific requests for free. If you need any web scraping service: fdufaurboidin@gmail.com