Paperless-ngx: an open-source document management system,

Convert your physical documents into searchable online archives, reducing paper usage.

It has a built-in OCR feature that automatically performs OCR on uploaded scanned documents. Ability to recognize text in documents and convert them into editable and searchable text formats.

The documents are then categorized and indexed, and you can search and consult them at any time.

Key features:

1. Organize and index documents: Categorize and index documents using tags, correspondents, types, etc.
2. Perform OCR: Perform optical character recognition (OCR) on documents, adding searchable and selectable text even to documents with only images. Multilingual Support: Utilize the open-source Tesseract engine to recognize over 100 languages.
3. Document Saving Format: Documents are saved in PDF/A format, which is designed for long-term storage while retaining the original unmodified file.
4. Machine learning automatic tagging: Use machine learning to automatically add tags, correspondents, and document types to documents.
5. Support multiple file types: PDF documents, images, plain text files, Office documents (Word, Excel, Powerpoint and LibreOffice equivalent), etc.
6. Intuitive Web Application: Provides customized dashboards, filters, batch editing, drag-and-drop uploads, customized views, custom fields, shared public links, and more.
7. Support full-text search: Provide search functions such as auto-completion, relevance sorting, and highlighting the document section of the matching query. You can search using keywords, tags, or other metadata.

GitHub:https://github.com/paperless-ngx/paperless-ngx
Online Demo: https://demo.paperless-ngx.com
Official website: https://docs.paperless-ngx.com

Scroll to Top