Thanks a lot, great video! I don't understand why your videos are so underrated. By the way, is there a way to get the Request URL from the website's source code?
you can get the request URL from the response object, e.g.
# make HTTP request (the URL needs a scheme, otherwise requests raises MissingSchema)
import requests
response = requests.get('https://google.com')
# print the final URL of the response
print(response.url)
# output (after redirect)
https://www.google.com/
My website makes its requests via POST 😭
then just use requests.post(url, data=payload), e.g.
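(a minimal sketch - the endpoint URL and form fields below are made-up placeholders, copy the real ones from the request you see in your browser's Network tab)
# mimic the site's AJAX POST request
import requests
# hypothetical endpoint and payload
url = 'https://example.com/api/items'
payload = {'page': 1, 'sort': 'price'}
# use json=payload instead of data=payload if the site sends a JSON body
response = requests.post(url, data=payload)
print(response.status_code)
print(response.text)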
ChatGPT would be super helpful
Thank you very much. Very awesome.
When I tried the code, I encountered an error like this:
Traceback (most recent call last):
File "demo.py", line 31, in
scraper.start_me()
File "demo.py", line 27, in start_me
self.to_csv()
File "demo.py", line 22, in to_csv
writer.writerow(row)
File "C:\Users\Future\AppData\Local\Programs\Python\Python36\lib\csv.py", line 155, in writerow
return self.writer.writerow(self._dict_to_list(rowdict))
File "C:\Users\Future\AppData\Local\Programs\Python\Python36\lib\encodings\cp1256.py", line 19, in encode
return codecs.charmap_encode(input,self.errors,encoding_table)[0]
UnicodeEncodeError: 'charmap' codec can't encode character '\u2161' in position 247: character maps to <undefined>
But I still got table.csv despite the error.
This is a Windows-specific Unicode character encoding issue. To get rid of the error, add this keyword argument on the line where you open the file stream:
with open('your.csv', 'w', encoding='utf-8')
@monkey_see_monkey_do Thank you very much. Now it works. But when I open the CSV file I find a lot of empty rows in the output (rows 2, 4, 6, 8, 10, and so on).
By the way, I found a way to use Pandas to export to CSV, and I think it's easier than using the csv module.
@KhalilYasser your issue is OS-specific. I don't have Windows and can't reproduce it on my side. People use the csv module the same way everywhere; the skipped rows are a matter of how your platform handles text encoding and newlines, not a flaw in the csv package. The fact that pandas handles it implicitly doesn't make the standard csv module bad.
@monkey_see_monkey_do Thank you very much for the fast reply. Is there any workaround, on the Unicode point, to get rid of the empty rows?
@KhalilYasser You don't need to get rid of them, you need to handle them properly instead. Just google "python csv unicode error windows" - there are lots of Windows-specific solutions. I've had similar questions from subscribers before, and all of them managed to find a way forward quite easily. They didn't mention the exact solutions though, so I can't help in particular.
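For reference, the blank rows on Windows typically come from newline translation, and the csv module docs say to open the file with newline=''. A minimal sketch (the field names and data are placeholders, not from the video):
# newline='' stops the csv writer from emitting blank rows on Windows,
# encoding='utf-8' avoids the charmap UnicodeEncodeError
import csv
rows = [{'name': 'Sample \u2161', 'price': '10'}]  # placeholder data
with open('table.csv', 'w', newline='', encoding='utf-8') as f:
    writer = csv.DictWriter(f, fieldnames=['name', 'price'])
    writer.writeheader()
    for row in rows:
        writer.writerow(row)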
Anyway, I think you should focus on the technique of fetching data by faking the AJAX request and apply it to the domain of your interest. 90% of the data I've been scraping didn't have any Unicode characters (you usually encounter them when, say, a special character for square meters/feet or a currency symbol occurs). Unicode characters are often the same every time, so you can track and replace them, e.g.
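(a rough sketch of the track-and-replace idea - the characters in the mapping are just examples, like the '\u2161' from the traceback above)
# map known troublesome unicode characters to ASCII stand-ins before writing
replacements = {'\u2161': 'II', '\u00b2': '2'}  # Roman numeral two, superscript two
def clean(text):
    # apply every replacement in turn
    for char, repl in replacements.items():
        text = text.replace(char, repl)
    return text
print(clean('Area: 50 m\u00b2'))  # output: Area: 50 m2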