The power of Headless Chrome and browser automation (Google I/O '18)
- Published 6 Oct 2024
- The headless browser revolution has arrived! Headless browsers are powerful tools that all developers can adopt in their workflow. This session will showcase examples of the amazing things that Chrome can do without a UI: write programs to control the browser; test a site; automate UI tasks; integrate into a CI system; set up A/B perf monitoring; prerender a client-side app for SEO; and more. The focus will be on using Puppeteer, Google's Node library for controlling headless Chrome.
Rate this session by signing in on the I/O website here → goo.gl/PEoVtB
Getting Started with Headless Chrome → goo.gl/wg3u1W
Puppeteer → goo.gl/YXrgqX
Examples → goo.gl/G4QcBC
Demo sites:
→ pptraas.com
→ try-puppeteer....
Watch more Chrome and Web sessions from I/O '18 here → goo.gl/5fgXhX
See all the sessions from Google I/O '18 here → goo.gl/q1Tr8x
Subscribe to the Chrome Developers channel → goo.gl/LLLNvf
#io18 event: Google I/O 2018; re_ty: Publish; product: Chrome - Web - Puppeteer; fullname: Eric Bidelman; event: Google I/O 2018;
*Timestamps*
0:07 introduction (Eric Bidelman)
1:17 agenda
1:38 Intro to Headless Chrome
3:00 What does it unlock for us?
4:54 [Pyramid of Puppeteer] Chrome DevTools Protocol (CDP)
5:45 [Pyramid of Puppeteer] Puppeteer w/ usage examples
10 Things you can do (your scripts at the top of Pyramid of Puppeteer)
9:02 0. Quick Intro
9:30 1. *Pre-rendering JS sites*
12:46 1.1 Performance Comparison Demo
13:51 1.2 4 Pre-rendering Optimization tips
17:29 2. *Is lazy-loading paying off?*
19:22 3. *A/B testing*
21:09 4. *Catch issues for the Google crawler*
22:54 5. *Create custom PDFs*
24:20 6. *Make your browser talk*
26:28 7. *Test a Chrome extension*
28:10 8. *Crawl a SPA*
29:13 9. *Verify service worker offline caching*
30:52 10. *Procrastinate* (showcase touch/keyboard API)
31:39 Closing Remarks
Thank youu :)
You're doing God's work here
awesome
I have been using Puppeteer to automate the process of finding a new apartment and it has been such a great experience. The API is really well documented and it's really straightforward to use.
Having a hard time with this. I need to do the same thing. Are you able to give a step by step please?
24:55 "The rise of the machines has begun" _me nervously laughing_
This is cool and all, but at 9:43 he put his words where his mouth is, and I think that's the greatest thing of all.
In case someone is trying the 2nd example of server.mjs: the example is missing a line of code.
If your index.html is located in the root folder of the server, add: app.use(express.static('.'))
Such a genius idea to SSR with Puppeteer. Great work!
I don't understand, though... why SSR with Puppeteer when you can SSR the whole thing?
Headless Chrome with Puppeteer is bliss.
Headless Chrome is the best. I have used a lot of headless browsers, and many of them don't support some JavaScript; sometimes I couldn't load a page at all. With Chromium you can load any website: I have never had an issue loading pages or running their JavaScript. You can do everything, even watch a YouTube video ^^. Everything you do with Chrome you can do with headless Chrome, and you can run it on Linux. I have been using Puppeteer for a while and it's the best headless browser. Thanks a lot for making this.
That's not a pyramid, mate; that's a triangle.
it can still be a pyramid from an orthographic point of view :p
ROFL
14:35 “How to wait until pages get partly or completely loaded”
Very cool tech that I didn’t know existed. Thanks for sharing. Now to learn more about this
Headless Chrome + Karate + Gatling is the bomb.
"Kinda cool"..."Really Awesome !!!"
22:54 *Creating custom PDFs* I would very much like support for adding metadata to PDFs, like you can with PDFKit. For storing PDF page snapshots it would be great to have them associated with searchable metadata (like Title, Author, Subject, Keywords, Comments). Where should I create the feature request? Puppeteer? SkiaPDF? Chromium?
How does it compare to Nightmare?
apparently it made it obsolete
What about concurrency? Is it really OK to call a Puppeteer browser in an endpoint that can be called by multiple clients at the same time?
Why wouldn't it be? Just create a module which exposes one instance of headless Chrome. Now you can create as many tabs as your system can handle. Or, more realistically, you can create a queue for rendering tasks and reuse, let's say, 10 tabs to render the queued requests.
I don't think it is a major issue if you put caching in place.
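The queue-and-reuse idea from this thread could look roughly like the sketch below. It is only an illustration: `TabPool` and its methods are hypothetical names, and the only thing assumed about `browser` is that it has Puppeteer's `newPage()` method. The pool opens at most `size` tabs and makes further tasks wait until a tab is handed back.

```javascript
// Sketch: a fixed-size pool of tabs shared by all incoming requests.
// `browser` is assumed to be a Puppeteer Browser; TabPool is a hypothetical helper.
class TabPool {
  constructor(browser, size = 10) {
    this.browser = browser;
    this.size = size;
    this.idle = [];     // open tabs that are free to use
    this.opened = 0;    // number of tabs created so far
    this.waiters = [];  // resolvers for tasks waiting on a tab
  }

  async acquire() {
    if (this.idle.length > 0) return this.idle.pop();
    if (this.opened < this.size) {
      this.opened += 1;
      try {
        return await this.browser.newPage();
      } catch (err) {
        this.opened -= 1; // the tab was never created, free the slot
        throw err;
      }
    }
    // All tabs are busy: wait until release() hands one over.
    return new Promise((resolve) => this.waiters.push(resolve));
  }

  release(page) {
    const next = this.waiters.shift();
    if (next) next(page);       // pass the tab straight to a waiting task
    else this.idle.push(page);  // nobody is waiting, park the tab
  }

  // Run task(page) on a pooled tab and always give the tab back.
  async run(task) {
    const page = await this.acquire();
    try {
      return await task(page);
    } finally {
      this.release(page);
    }
  }
}
```

In a request handler this would be used as `await pool.run((page) => renderSomething(page))`, where `renderSomething` stands for whatever the endpoint does with the tab.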
Am I the only one feeling that web developers spent 20 years slowing down the internet, only to realize that all of this wasn't necessary in the first place?
And now the pyromaniac is the fireman. We'll soon have to pay Google to generate HTML. Just like Maps. 😔
I can't come up with a reason why all this shit is needed when you have Varnish and a random scripting language like PHP 7.2, or HELL.. server-side NODEJS even... just sitting there that can render this stuff at 5000 hits per second instead of 4...
Why didn't he use a Chromebook?
I do not understand why this approach does not wait for the whole page to load. After all, the server side has to do all the work before res.send to the client.
Nicely explained.
Still can't easily download files with Puppeteer without workarounds.
Nice video. Is Puppeteer SSR more effective than template-engine SSR? My point is, template engines do not require you to take care of filtering requests, and you can also inject data into templates without calling external services.
I like the Puppeteer SSR approach but am just asking about any advantages over template-engine SSR.
Thank you
So we've gone from PHP or server-side Node.js that renders all this directly to HTML, to a client-side language that should create this, but instead we do this server side using a headless browser, which scales in much the same way that a banana doesn't, all instead of a fast script interpreter, and then send it to the client to be handled as static HTML... only to have it all in one language? Is that really the only reason?! The laziness of being able to use one language?
Hi, can I insert a script that uses the Puppeteer library in a script tag of a web page?
Sad thing is, this is only for node.js
First of all, many thanks for uploading the info about headless Chrome.
Problem:
I am trying to access a password-protected URL which loads a PDF. This works absolutely fine in the normal Chrome browser; however, in Chrome headless mode the PDF never gets loaded for some reason.
I need your help to approach this problem.
It would really be a BIG help if you could help me in any way…
Many thanks in advance.
I tried this the other day, but I couldn't bypass Cloudflare DDoS protection, don't know why...
I wonder why we even care about 3G for the most part. It’s like the same old discussion about supporting IE, but now it’s the network.
If your page requires animations and high performance, seriously, don’t even think about what is better for the first paint. Think of what is the best overall.
This first paint strategy is Google dogma
"Basically"
How can I set a background image and use @page margins for each page in a PDF file? I can set top and bottom margins for each page but can't set a background image. Please help me, thanks!
10:38 First code sample
I would like to run it with its "head full", in incognito mode, and would like to save the interactive timeline report from the Performance tab as is. Can this be done? Where can I find a video that shows this?
Unable to use download options in headless Chrome.
Help me...
I want to use Puppeteer to fetch a page and press a specific button on the page. How can I do that?
WOW! WOW! WOW!
So far ahead of me automation
Can we use Puppeteer across browsers, like Selenium?
Puppeteer works only with Chromium or Chrome. However, many teams only run unit tests with a single browser (e.g. PhantomJS). In non-testing use cases, Puppeteer provides a powerful but simple API because it's only targeting one browser that enables you to rapidly develop automation scripts.
Puppeteer bundles the latest versions of Chromium.
I want to import expression in control panel is not it?
nothing works -
Can you simulate mouse clicks at a specific location?
With Puppeteer you can click on CSS selectors (pptr.dev/#?product=Puppeteer&version=v1.13.0&show=api-pageclickselector-options) or you can use the DevTools Protocol Input.dispatchMouseEvent method to specify an (x, y) location. See Puppeteer's CDPSession class for examples of how to use the DevTools Protocol with Puppeteer.
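Both options from this reply can be sketched as below. This is an illustration rather than the documented recipe: the function names `clickSelector` and `clickAtPoint` are made up here, and `page` is assumed to be a Puppeteer Page. The coordinate version sends a press and a release through a raw DevTools Protocol session.

```javascript
// Sketch: click by CSS selector, or at an (x, y) viewport coordinate.
// `page` is assumed to be a Puppeteer Page; both function names are hypothetical.

async function clickSelector(page, selector) {
  // Puppeteer finds the element, scrolls it into view and clicks its center.
  await page.click(selector);
}

async function clickAtPoint(page, x, y) {
  // Open a raw DevTools Protocol session for this page.
  const client = await page.target().createCDPSession();
  const base = { x, y, button: 'left', clickCount: 1 };
  // A click is a mouse press followed by a release at the same point.
  await client.send('Input.dispatchMouseEvent', { type: 'mousePressed', ...base });
  await client.send('Input.dispatchMouseEvent', { type: 'mouseReleased', ...base });
  await client.detach();
}
```

If the raw protocol is not needed, Puppeteer's own `page.mouse.click(x, y)` covers the coordinate case at a higher level.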
Nice ...
Yes. Let's use eager Promises instead of lazy Futures. And, wow, we also get a then then then then. Bravo.
good
LUV!
Great
If Puppeteer is the way, then why not use PHP, as it does server-side rendering? Personally I love JS, but are we not reinventing the wheel in another language here?
You can also control headless Chrome with PHP, via WebSockets. So someone could reimplement Puppeteer in PHP, or is there a reason not to?
Because you can re-use the JS that you wrote for the client to work for server-side rendering. You don't have to duplicate the logic in two places.
Furthermore, Puppeteer gives you more than just SSR - you can take screenshots, make PDFs. Those things you cannot do with just PHP (of course you could use PHP and websockets to control a headless chrome, but I sense that's not the point you're trying to make, correct me if I'm wrong).
That said, using a bunch of headless chrome instances might use more computing resources than you want, at which point PHP or Go or any other language would work. Tons of ways to do this, it's all dependent on your requirements.
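As a rough sketch of the SSR use discussed in this reply, assuming `browser` is a Puppeteer Browser: the function name `ssr` and the in-memory `cache` are illustrative choices, not necessarily what the talk's demo uses. The page is loaded in a tab, the client-side JS is given time to render, and the serialized DOM is returned and cached so repeat requests skip the browser.

```javascript
// Sketch: server-side render a client-side app with a headless browser.
// `browser` is assumed to be a Puppeteer Browser; `ssr` and `cache` are hypothetical names.
const cache = new Map(); // url -> rendered HTML

async function ssr(browser, url) {
  // Serve repeat requests from memory instead of rendering again.
  if (cache.has(url)) return cache.get(url);

  const page = await browser.newPage();
  try {
    // Wait until the network has gone idle so client-side JS has had time to render.
    await page.goto(url, { waitUntil: 'networkidle0' });
    const html = await page.content(); // serialized DOM of the rendered page
    cache.set(url, html);
    return html;
  } finally {
    await page.close(); // always close the tab, even if rendering failed
  }
}
```

The same tab could produce the other outputs mentioned above with `page.screenshot()` or `page.pdf()` in place of `page.content()`. The cache here never expires, which is only reasonable for a demo.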
Stephan Bijzitter I guess the benefit here is utilizing one language. Thanks
Nope. The benefit is that you can build apps with a great user experience (= pure JS frontend) without the disadvantages (in terms of SEO, SSR complexity and initial page load).
@rckd5903 but wait... isn't all the JavaScript removed from the page when it's sent to the browser... so the next request HAS to go through that headless monstrosity again?
ua-cam.com/video/lhZOFUY1weo/v-deo.html
The presentation mentioned a protocol monitor panel there. It's not in the menu in Chrome 72, even when I'm in the right context (localhost:9222).
Did it get removed?
Yes, it was removed. I saw this protocol monitor panel for a while in an early version. Don't remember exactly when... maybe 8 months ago... maybe...
still too slow in x2...
How do I download files in Chrome headless mode in automation? Can you please help me with this?
too long...
Puppeteer is opening doors to a lot of hacking methods and will be banned, I think...
You can do all this stuff in normal Chrome.
good