An Image-to-Text OCR Extraction Tool built in Java with Tess4J, a JNA wrapper class for Tesseract.

incubated-geek-cc/Tess4JOcrApp

🔎 Tess4JOcrApp

🛠️ Extracts text from image and PDF files using the open-source Tesseract OCR engine through Tess4J, a JNA-based Java wrapper.

A native desktop application built in Java with Tess4J.

📌 Features

  • Multiple image/PDF file uploads
  • Text extraction from image/PDF files
  • Export to text file
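
The three features above can be pictured as one small pipeline: accept several files, run OCR on each, and write the combined text to a file. The following is only a minimal sketch of that flow, not the application's actual code. The class `OcrBatchSketch`, the `OcrEngine` interface, and the list of accepted extensions are all hypothetical names and assumptions introduced for illustration.

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

class OcrBatchSketch {

    /** Hypothetical abstraction over the OCR engine; not part of Tess4J. */
    @FunctionalInterface
    interface OcrEngine {
        String extract(File input) throws Exception;
    }

    // Assumed set of accepted extensions; the real application may differ.
    private static final Set<String> SUPPORTED = new HashSet<>(
            Arrays.asList("png", "jpg", "jpeg", "tif", "tiff", "bmp", "gif", "pdf"));

    /** Returns true when the file name ends in one of the assumed extensions. */
    public static boolean isSupported(Path file) {
        String name = file.getFileName().toString().toLowerCase(Locale.ROOT);
        int dot = name.lastIndexOf('.');
        return dot >= 0 && SUPPORTED.contains(name.substring(dot + 1));
    }

    /** Runs the engine over every supported input and joins the results. */
    public static String extractAll(List<Path> inputs, OcrEngine engine) {
        StringBuilder out = new StringBuilder();
        for (Path input : inputs) {
            if (!isSupported(input)) {
                continue; // silently skip unsupported file types
            }
            out.append("=== ").append(input.getFileName()).append(" ===\n");
            try {
                out.append(engine.extract(input.toFile()).trim()).append("\n\n");
            } catch (Exception e) {
                // One failing file should not abort the whole batch.
                out.append("[extraction failed: ").append(e.getMessage()).append("]\n\n");
            }
        }
        return out.toString();
    }

    /** Writes the extracted text to a UTF-8 text file. */
    public static void export(String text, Path target) throws IOException {
        Files.write(target, text.getBytes(StandardCharsets.UTF_8));
    }
}
```

With Tess4J on the classpath, the engine argument could plausibly be a method reference such as `tesseract::doOCR`, where `tesseract` is a `net.sourceforge.tess4j.Tesseract` instance whose data path has been set with `setDatapath(...)`. Keeping the engine behind a small interface also makes the batch and export logic testable without native libraries installed.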

✍ Read related posts here

Article One :: Link :: Build a Portable OCR Tool in 4 Steps with Tess4J — A Tesseract Wrapper for Java


Article Two :: Link :: Building an OCR Native Application Tool with Tess4J — Extract Text from PDF in just 3 steps


Application GUI as of Sep 2022

🌟 Application GUI as of Feb 2024

Note: The latest version is currently available in the v4x folder.


License

Both Tesseract and this Software are licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Join me on 📝 Medium at ~ ξ(🎀˶❛◡❛) @geek-cc


🌮 Please buy me a Taco! 😋