scanned-documents

Here are 47 public repositories matching this topic...

alucic2 / cluster_htrc

Identifying the boundaries of the main content of fiction and non-fiction works in the HathiTrust Extracted Features dataset.

scanned-documents extracting-features clustering-algorithm digital-libraries clustering-analysis smoothing-methods detecting-paratext-boundaries

Updated May 10, 2022
Jupyter Notebook

Viscomsoft / Scanner-Pro-SDK-ActiveX-x64

TWAIN scanning SDK for 64-bit and 32-bit MS Access, VB.NET, C#, Delphi, and Visual C++, and for 32-bit Visual Basic 6 and VFP.

pdf sdk csharp vb6 vba scanned-documents vbnet twain msaccess msaccess-vba imaging-solutions vbtwain

Updated Jul 24, 2022
C#

Viscomsoft / Scanner-TWAIN-SDK-ActiveX

For Windows developers who need to capture images from a scanner, digital camera, or capture card that has a TWAIN device driver, using C++, C#, VB.NET, VB, Delphi, VFP, or MS Access.

sdk csharp dotnet scanner activex scanned-documents scannerbarcode twain twain-operation vbtwain

Updated May 22, 2023
Visual Basic .NET

rohanrav / document-scanner

Document scanner created using OpenCV and Python.

python3 scanned-documents opencv-python

Updated Mar 13, 2019
Python

bearrundr / scantailor-custom

A ScanTailor customization that adds some new functions.

image-processing djvu scanned-documents book-scanning digitization

Updated Oct 5, 2019
C++

legenscandary / scan

Automatic scan server software for scanners with a document feeder. It creates multi-page PDFs with selectable text (OCR) at the press of a single button.

pdf ocr samba scanned-documents shell-scripts pdf-generation scanning

Updated Feb 19, 2024
Shell

simranbiswas / Textract

Text extractor for scanned images and documents. It scans and extracts the content of a file, saving time and, according to the author, reducing the chance of typographical errors to 0%.

scanned-documents ocr-text-reader text-extractor

Updated Jul 28, 2021
HTML

drxwat / scanned_digits_recognition

Scanned digits detector and classifier (CNN, OpenCV)

classifier opencv ocr keras scanned-documents

Updated May 4, 2017
Jupyter Notebook

hacker-or-id / docs

This is the open-source repo for docs.github.com.

typescript actions scan open-data scan-tool scanned-documents ubuntu-server linux-server sistem scancode

Updated Oct 16, 2020
JavaScript

deckerego / docidx

A document indexing daemon that can populate Elasticsearch indexes with the contents and metadata of a number of document types, including PDFs and image scans. It is used to power Facile Search, but can be reused for anything that requires search indexing of scanned documents.

search-engine elasticsearch full-text-search scanned-documents pdf-search

Updated Oct 17, 2023
Java

paulocressoni / scanned_pdf_ocr

Apply OCR to scanned PDF files to extract text from the PDF images.

imagemagick ocr linux-shell tesseract-ocr scanned-documents image-to-text

Updated Jan 13, 2020
Shell

MaxineXiong / Scraping-Scanned-PDF-Docs-using-OCR-with-RPA

This repository contains automation solutions that efficiently extract text from scanned PDF documents with consistent layouts. Using the Tesseract OCR engine, the UiPath RPA robot achieves nearly 90% accuracy, streamlining the process and significantly reducing manual workload.

ocr scanned-documents optical-character-recognition screen-scraping rpa robotic-process-automation uipath uipath-studio scanned-receipts uipath-modern-design uipath-classic-design

Updated Apr 17, 2024

rbrito / pkg-pdfbeads

Debian packaging of pdfbeads

pdf pdf-converter scanned-documents pdf-generation scanning scanned-image-pdfs

Updated May 11, 2020
Ruby

paulveillard / cybersecurity-internet-scanning

An ongoing, curated collection of awesome software best practices and techniques, libraries and frameworks, e-books and videos, websites, blog posts, links to GitHub repositories, technical guidelines, and important resources about Internet scanning in cybersecurity.

scanner scanned-documents scanning scanning-tool

Updated Jun 14, 2022

hacker-or-id / scan

{{scan|tools|software|headware|progress|open|template|log|log|log|softwaretool|}}{[[:wikt:Scan|log scan]]}. #[[:wikt:log scan|log copyright]]. *[[:wikt:log is log|log]]. *[[:wikt:log scan|txt]]. *[[:wikt:log scan|png]]. *[[:wikt:log scan|image image image/category user/category is /category talkname/category username/category done/category in pr…

linux-kernel scans scanned-documents ubuntu-server linux-server scancode unixporn scansnap-organizer scans-xhr-requests scans-directories

Updated Oct 23, 2020

Hawk453 / OCR_FOR_PDFS

Optical Character Recognition for Scanned Documents

opencv ocr scanned-documents optical-character-recognition pdfs

Updated Nov 15, 2020
Python

hnjm / papermerge

Open Source Document Management System for Digital Archives (Scanned Documents)

python pdf django ocr archives scan scanned-documents dms document-management paperless hnjm

Updated Jan 5, 2023
Python

deckerego / docmag

The web UI for Facile Search. Together with DocIndex, this UI can help you search the myriad of scanned documents you have been accumulating over the years. Using the power of Docker & Elasticsearch you can run a powerful search engine that lets you convert scanned (image-based) PDFs to searchable text, group documents by letterhead, run fuzzy s…

docker kubernetes pdf elasticsearch full-text-search scanned-documents

Updated Oct 26, 2023
Groovy

milahu / document-photo-auto-threshold

Auto-correct the contrast and brightness of a photographed document.

image-processing contrast brightness scan-tool scanned-documents postprocessing contrast-enhancement brightness-adjustment

Updated Oct 12, 2021
Python

svitlana1209 / OCR-search

Text search using OCR, plus detection and recognition of tables in scanned documents.

python pdf opencv image ocr computer-vision pandas-dataframe tesseract text-recognition scanned-documents hough-transform contour-detection pytesseract angle-rotation detect-table-struct

Updated Oct 23, 2023
Python
