tika
Here are 141 public repositories matching this topic...
A doc searcher of the documents on the local host that is based on: Tika+OCR, ElasticSearch and Kibana
-
Updated
Jan 23, 2021 - Java
WORK IN PROGRESS - Dataiku DSS plugin to extract text data from documents
-
Updated
Jan 11, 2021 - Makefile
The simple monolithic application demonstrates: the extraction of the images of the PDF document pages using Apache Tika, the storage of the images files into the local filesystem, the display of the pages using the ngx-swiper-wrapper library.
-
Updated
May 9, 2023 - Java
Information retrieval system for documents.
-
Updated
Feb 15, 2022 - HTML
Early Buddhist texts from the Tipitaka (Tripitaka). Suttas (sutras) with the Buddha's teachings on mindfulness, insight, wisdom, and meditation.
-
Updated
Jul 6, 2023 - JavaScript
Directory tree metadata parser using Apache Tika
-
Updated
May 3, 2024 - Python
A windows service wrapper for the tika JSR 311 network server.
-
Updated
Jan 29, 2024 - Batchfile
Extracts GPS coordinates from pdf files and Points/Polygons from kmz files to create a master kml file. 🌎
-
Updated
Jul 7, 2021 - HTML
A Java application that uses Lucene and Tika to search document and display the document part in which the document is found.Along with precision and recall value
-
Updated
Aug 20, 2017 - Java
A Windows Installer (MSI) for the windows service wrapper of the tika JSR 311 network server.
-
Updated
Feb 15, 2022 - C#
Container-ized (Docker) GeoTopicParser-Enabled Apache Tika Server with Lucene Geo Gazetteer.
-
Updated
Apr 5, 2021 - Dockerfile
DocClusterizer is a Java desktop application designed to analyze and cluster documents based on their content similarity. The application utilizes Lucene and Tika libraries to process various file extensions such as txt, pdf, docx, and pptx.
-
Updated
Apr 6, 2024 - Java
The Information Retrieval Labolatories
-
Updated
Apr 16, 2018 - Java
Improve this page
Add a description, image, and links to the tika topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the tika topic, visit your repo's landing page and select "manage topics."