pdfOCR is an iText 7 add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving
pdf
data
image
ocr
recognition
glyphs
tesseract
scan
character
spanish
searchable
ligatures
hindi
portuguese
optical
archival
mandarin
extractable
iso-compliant
diacritic
-
Updated
Apr 18, 2024 - C#