project information
website-scraper-puppeteer Is a Plug-in based on Puppeteer, designed for website-scraper The main library is designed and can be used to crawl dynamically rendered websites (i.e. websites that load content through JavaScript) and return full HTML.
Its purpose is to launch a headless Chrome instance, load the target page, wait for the page content to finish rendering, and then save the rendered HTML. This is more reliable than just grabbing HTTP responses, and is especially suitable for websites that need to wait for JS to execute or lazy loading content.
Usage and Function
According to the code example provided by README, the plug-in supports the following configuration items:
- launchOptions: Puppeteer startup options (such as headless mode)
- gotoOptions: Used to control
page.gotoLoad behavior of (such aswaitUntil: "networkidle0") - scrollToBottom: Support automatic scrolling to load lazy content, you can set
timeoutandviewportN - blockNavigation: Whether to disable page navigation to avoid jumping to other pages (default false)
In short, it helps you use Puppeteer to crawl web pages more gracefully, including those that load content dynamically via AJAX or scrolling.
Update status and recent submissions
From the overall overview of the GitHub repository (the list of projects under the organization “website-scraper”), we can see that website-scraper-puppeteer The latest update is in September 1, 2025。
In addition, judging from the release version, the most recent release version of this project is v1.1.0, published in July 25, 2023。
To summarize:
- Projects still remain actively submitted (with updates at least as of September 1, 2025)
- The latest version is v1.1.0 released on July 25, 2023
summary
| project name | profile |
|---|---|
| website-scraper-puppeteer | on website-scraper Puppeteer plug-in for grabbing dynamic rendered HTML |
| latest version | v1.1.0, released on 2023-07-25 |
| recently submitted | 2025-09-01(Still under maintenance) |
Overall, this project is used to crawl dynamic web content and is still being maintained (at least until September 2025), but the version has been updated relatively long (2023). It is still a available and stable option in the current state.
Github:https://github.com/website-scraper/website-scraper-puppeteer
Oil tubing: