Scrape a dynamically rendered website and return its full HTML

Project information

website-scraper-puppeteer is a Puppeteer-based plugin for the website-scraper library. It is designed to crawl dynamically rendered websites (i.e. websites that load content through JavaScript) and return the full HTML.

It works by launching a headless Chrome instance, loading the target page, waiting for the page content to finish rendering, and then saving the rendered HTML. This is more reliable than capturing the raw HTTP response, and is especially suited to websites that depend on JavaScript execution or lazy-loaded content.
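The launch, load, wait, and save sequence described above can be sketched directly against Puppeteer's public API. This is a minimal illustration of the idea, not the plugin's actual source: renderHtml is a hypothetical helper name, and the Puppeteer module is passed in as a parameter (an assumption made here so the function can be exercised without installing a browser).

```javascript
// Sketch of the render-then-capture flow. `renderHtml` is a hypothetical
// helper for illustration; it is not part of website-scraper-puppeteer.
async function renderHtml(url, puppeteer) {
  // Launch a headless Chrome instance.
  const browser = await puppeteer.launch({ headless: true });
  try {
    const page = await browser.newPage();
    // Resolve only once the network has gone idle, so that content
    // fetched by JavaScript has had a chance to arrive.
    await page.goto(url, { waitUntil: 'networkidle0' });
    // Serialize the DOM as it stands after rendering.
    return await page.content();
  } finally {
    // Always release the browser, even if loading the page failed.
    await browser.close();
  }
}
```

With Puppeteer installed, this could be called as renderHtml('https://example.com', (await import('puppeteer')).default), where the URL is only a placeholder.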

Usage and features

According to the code example in the README, the plugin supports the following configuration options:

  • launchOptions: Puppeteer launch options (such as headless mode)
  • gotoOptions: controls the loading behavior of page.goto (such as waitUntil: "networkidle0")
  • scrollToBottom: scrolls the page automatically so that lazy-loaded content is fetched; accepts a timeout and viewportN
  • blockNavigation: whether to block navigation away from the page, so the scraper does not jump to other pages (default: false)
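The four options listed above could be wired into a scrape roughly as follows. This is a sketch under assumptions, not a definitive setup: buildPluginOptions and scrapeRendered are hypothetical helper names, the option values are illustrative rather than recommended defaults, and the timeout is assumed to be in milliseconds. Both packages are loaded lazily, so they are only needed when scrapeRendered is actually called.

```javascript
// Hypothetical helper: gathers the four documented options in one object.
// The values below are illustrative, not recommended defaults.
function buildPluginOptions(overrides = {}) {
  return {
    launchOptions: { headless: true },                 // passed to Puppeteer at launch
    gotoOptions: { waitUntil: 'networkidle0' },        // passed to page.goto
    scrollToBottom: { timeout: 10000, viewportN: 10 }, // timeout assumed to be ms
    blockNavigation: false,                            // the documented default
    ...overrides,                                      // caller-supplied values win
  };
}

// Hypothetical wrapper; assumes website-scraper and
// website-scraper-puppeteer are installed in the project.
async function scrapeRendered(url, directory, overrides) {
  const { default: scrape } = await import('website-scraper');
  const { default: PuppeteerPlugin } = await import('website-scraper-puppeteer');
  return scrape({
    urls: [url],
    directory, // where the rendered files are saved
    plugins: [new PuppeteerPlugin(buildPluginOptions(overrides))],
  });
}
```

For instance, scrapeRendered('https://example.com', '/path/to/save', { blockNavigation: true }) would scrape one page with navigation blocked; both the URL and the path are placeholders.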

In short, it lets you use Puppeteer to scrape web pages more cleanly, including pages that load content dynamically via AJAX or on scroll.

Update status and recent commits

The overview of the GitHub repositories under the "website-scraper" organization shows that website-scraper-puppeteer was last updated on September 1, 2025.

In addition, the release list shows that the most recent release of this project is v1.1.0, published on July 25, 2023.

To summarize:

  • The project is still receiving commits (with updates at least as of September 1, 2025)
  • The latest release is v1.1.0, published on July 25, 2023

Summary

Project name               Description
website-scraper-puppeteer  Puppeteer plugin for website-scraper, for grabbing dynamically rendered HTML
Latest version             v1.1.0, released on 2023-07-25
Latest commit              2025-09-01 (still under maintenance)

Overall, this project is used to crawl dynamic web content and is still being maintained (at least as of September 2025), although its latest release is fairly old (2023). It remains a usable and stable option in its current state.

GitHub: https://github.com/website-scraper/website-scraper-puppeteer
