Lightweight, cross-platform web archiving and sharing tool

Introduction: Web Archive is a web archiving tool made up of the following parts:

  • Browser plug-in: Saves a web page as a snapshot and uploads it to the server.
  • Server: Receives the snapshots uploaded by the browser plug-in and stores them in a database and storage buckets.
  • Web client: Queries the snapshots and displays them.
  • The server is built entirely on Cloudflare Workers services, including the D1 database and R2 storage buckets.

This project, called Web Archive, is a free web archiving and sharing tool built on Cloudflare.

Project information

Web Archive is a full-stack web page archiving tool. It is designed to provide a cross-platform, lightweight way to preserve and share web pages without requiring users to run their own servers.

Core components

  1. Browser plug-in: Supports Chrome and Firefox. Users can save the current web page as a single HTML file through the plug-in and upload it to the server.

  2. Server side: Receives HTML files from the plug-in and stores them in the database and storage buckets.

  3. Web client: Provides a web interface for users to query and display saved web pages.
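
The division of labor among these three components can be sketched roughly as follows. This is a minimal TypeScript illustration under stated assumptions, not the project's actual code: the `Snapshot` and `PageRecord` shapes, the `handleUpload` and `loadPage` functions, and the `snapshots/` key layout are all hypothetical, and in-memory structures stand in for the storage bucket and the database so the sketch runs without Cloudflare.

```typescript
// Hypothetical shape of a snapshot sent by the browser plug-in.
interface Snapshot {
  title: string;
  url: string;
  html: string; // the page saved as a single HTML file
}

// Hypothetical metadata row; a real deployment would keep this in a database.
interface PageRecord {
  id: number;
  title: string;
  url: string;
  contentKey: string; // where the HTML body lives in the bucket
}

// In-memory stand-ins for the storage bucket and the database (assumptions).
const bucket = new Map<string, string>();
const pages: PageRecord[] = [];

// Server side: store the HTML in the bucket and the metadata in the database.
function handleUpload(snapshot: Snapshot): PageRecord {
  const id = pages.length + 1;
  const contentKey = `snapshots/${id}.html`;
  bucket.set(contentKey, snapshot.html);
  const record: PageRecord = {
    id,
    title: snapshot.title,
    url: snapshot.url,
    contentKey,
  };
  pages.push(record);
  return record;
}

// Web client side: look up a record and fetch its HTML for display.
function loadPage(id: number): string | undefined {
  const record = pages.find((p) => p.id === id);
  return record ? bucket.get(record.contentKey) : undefined;
}

// Usage: the plug-in uploads a page, then the web client reads it back.
const saved = handleUpload({
  title: "Example",
  url: "https://example.com",
  html: "<html><body>Hello</body></html>",
});
console.log(saved.contentKey, loadPage(saved.id));
```

Keeping the large HTML body in a bucket and only small metadata in the database matches the split the list above describes. Everything else here is illustrative.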

Main functions

  • Web archiving, searching and sharing: Users can save web pages, search them by keyword, and share them with others.

  • Folder classification: Archived web pages can be organized into folders.

  • Mobile adaptation: The interface adapts to mobile devices, so archives are easy to access on phones and tablets.

  • AI-generated tags: Uses artificial intelligence to automatically generate tags for archived web pages, making them easier to classify and retrieve.

  • Reading mode: Provides a clean reading interface for a better reading experience.
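
To make the search and folder features concrete, here is a small TypeScript sketch of keyword search restricted to a folder. It assumes a simple case-insensitive substring match over title, text and tags. How Web Archive actually indexes and searches pages is not described here, and the `ArchivedPage` shape and `searchPages` function are hypothetical.

```typescript
// Hypothetical record for an archived page.
interface ArchivedPage {
  title: string;
  folder: string;
  tags: string[]; // for example, tags generated automatically
  text: string;   // plain text extracted from the saved page
}

// Return pages that contain the keyword, optionally limited to one folder.
function searchPages(
  library: ArchivedPage[],
  keyword: string,
  folder?: string,
): ArchivedPage[] {
  const needle = keyword.trim().toLowerCase();
  return library.filter((page) => {
    if (folder !== undefined && page.folder !== folder) {
      return false;
    }
    if (needle === "") {
      return true; // an empty keyword matches everything in scope
    }
    const haystack = [page.title, page.text, ...page.tags]
      .join("\n")
      .toLowerCase();
    return haystack.includes(needle);
  });
}

// Usage with made-up data.
const library: ArchivedPage[] = [
  { title: "Rust ownership", folder: "programming", tags: ["rust"], text: "Borrowing rules" },
  { title: "Sourdough basics", folder: "cooking", tags: ["bread"], text: "Starter and flour" },
  { title: "Go channels", folder: "programming", tags: ["go"], text: "Concurrency patterns" },
];
console.log(searchPages(library, "rust").map((p) => p.title));
console.log(searchPages(library, "", "programming").length);
```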

Deployment

The project provides detailed deployment documentation that users can follow to deploy the service in their own environment. After deployment is complete, enter the service address and key in the browser plug-in to start using it.
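
The service address and key are the two values that tie the plug-in to a deployment. As a rough TypeScript illustration, a client could turn them into an authenticated upload request as shown below. The `/api/upload` path, the `Bearer` authorization scheme, and the `buildUploadRequest` function are assumptions made for this sketch. The project's deployment documentation defines the real endpoint and authentication details.

```typescript
// Settings a user enters in the plug-in after deployment.
interface PluginSettings {
  serviceAddress: string; // URL of the deployed service
  key: string;            // access key configured for the service
}

// Plain description of an HTTP request, so the sketch needs no network access.
interface UploadRequest {
  method: "POST";
  url: string;
  headers: Record<string, string>;
  body: string;
}

// Hypothetical: the "/api/upload" path and Bearer header are assumptions,
// not the project's documented API.
function buildUploadRequest(
  settings: PluginSettings,
  title: string,
  html: string,
): UploadRequest {
  // Normalize the address so a trailing slash does not produce "//api".
  const address = settings.serviceAddress.trim().replace(/\/+$/, "");
  if (!/^https?:\/\//.test(address)) {
    throw new Error("service address must start with http:// or https://");
  }
  const key = settings.key.trim();
  if (key === "") {
    throw new Error("key must not be empty");
  }
  return {
    method: "POST",
    url: `${address}/api/upload`,
    headers: {
      "Authorization": `Bearer ${key}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ title, html }),
  };
}

// Usage with placeholder values.
const exampleRequest = buildUploadRequest(
  { serviceAddress: "https://archive.example.workers.dev/", key: "my-secret-key" },
  "Example",
  "<html></html>",
);
console.log(exampleRequest.url);
```

Validating both values up front means a mistyped address or an empty key fails immediately with a clear message, before any upload is attempted.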

Project background

Developer Ray-D-Song was once a loyal user of ArchiveBox. However, ArchiveBox requires a self-hosted server, consumes considerable system resources, and does not handle websites that require login well. He therefore developed Web Archive as a cross-platform, lightweight web archiving tool that does not require running your own server.

Project address

  • GitHub repository: https://github.com/Ray-D-Song/web-archive

  • Documentation website: link

If you are interested in this project, you can visit the link above for more information.

Website: https://github.com/Ray-D-Song/web-archive