In this repository is the source code of Papermerge DMS backend core, REST API server, and frontend UI
Updated May 24, 2024 - Python
Dedoc is a library (service) for automating document parsing and converting documents to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser)
Evaluate OMR sheets fast and accurately using a scanner 🖨 or your phone 🤳.
A curated list of awesome projects to simplify and improve paper and document scanning.
This repository contains automation solutions that efficiently extract text from scanned PDF documents with consistent layouts. Using the Tesseract OCR engine, the UiPath RPA robot achieves nearly 90% accuracy, streamlining the process and significantly reducing manual workload.
Open Source Document Management System for Digital Archives (Scanned Documents)
Documentation for Papermerge DMS - Installation, Help, User Manual, REST API
Papermerge DMS command line utility
Automatic scan server software for scanners with a document feeder. It creates multi-page PDFs with selectable text (OCR) at the press of a single button.
Emacs-assisted PDF document filing
The web UI for Facile Search. Together with DocIndex, this UI can help you search the myriad of scanned documents you have been accumulating over the years. Using the power of Docker & Elasticsearch you can run a powerful search engine that lets you convert scanned (image-based) PDFs to searchable text, group documents by letterhead, run fuzzy s…
Small utility to prepare scanned documents. Supports separating PDF files by separator pages and removing blank pages.
Text search using OCR, plus detection and recognition of tables in scanned documents.
A document indexing daemon that can populate Elasticsearch indexes with the contents and metadata of a number of document types, including PDFs, image scans, etc. It is used to power Facile Search, but can be reused for anything that requires search indexing for scanned documents.
ScanTailor Advanced merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, and adds new features and fixes.
For Windows developers who need to capture images from a scanner, digital camera, or capture card that has a TWAIN device driver, using C++, C#, VB.NET, VB, Delphi, Vfp, or MS Access.
A program by Dallin Stewart that automates simple and repetitive tasks while scanning documents.
A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.
BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on scanned forms.