How to go about it if there are a lot of tabs and you need to fetch all the data from each tab? Example: A grocery site where there are several tabs such as "Fruits", "Vegetables", "Dairy", "Eggs, Meat", "Snacks" and so on and you need to fetch all the products from every tab?
If those tabs have a dedicated url you’ll need some way to find those url’s. Without dedicated url you’ll need to simulate a click on the tab most likely
What about just feeding entire html page into LLM and making it extract the data? 🙂 Or to optimize this a little bit, instead of pushing entire page's html, just get eg. wrapping element's html like #productList or sth.
Great video. Few things: 1. You're repeating the same info multiple times. First 4 minutes of the video can be skipped. 2. This is good to show case what OpenAI apis can do, but the application is completely unnecessary and much would be more efficient to use just Playwright.
short and precise
3rd comment😇, always good infromative video as usual.❤
How to go about it if there are a lot of tabs and you need to fetch all the data from each tab? Example: A grocery site where there are several tabs such as "Fruits", "Vegetables", "Dairy", "Eggs, Meat", "Snacks" and so on and you need to fetch all the products from every tab?
If those tabs have a dedicated url you’ll need some way to find those url’s. Without dedicated url you’ll need to simulate a click on the tab most likely
What about just feeding entire html page into LLM and making it extract the data? 🙂 Or to optimize this a little bit, instead of pushing entire page's html, just get eg. wrapping element's html like #productList or sth.
Possible, but can be costly
do a video on how to scrap authenticated websites
1st comment
Nice!
@ really can’t wait to see some react 19 / next 15 / tailwind videos for a real world project . Thanks for all
Great video. Few things:
1. You're repeating the same info multiple times. First 4 minutes of the video can be skipped.
2. This is good to show case what OpenAI apis can do, but the application is completely unnecessary and much would be more efficient to use just Playwright.