Issues: Unstructured-IO/unstructured

Issues list

bug/some tables in PDF not getting recognized bug Something isn't working pdf
#2997 opened May 9, 2024 by Ritesh1137
feat/ocr_layer_to_pdf enhancement New feature or request ocr Related to optical character recognition (OCR).
#2991 opened May 8, 2024 by punjabdhaputar
Import is very time-consuming: from unstructured.partition.pdf import partition_pdf investigating Issues that require more information before they are actionable
#2983 opened May 8, 2024 by peanutpaste
Lightweight installation unstructured[pdf] ????? investigating Issues that require more information before they are actionable
#2976 opened May 7, 2024 by liturrig
feat/chunking_by_title_tokens enhancement New feature or request
#2967 opened May 3, 2024 by erik-squared
partition_pdf: no text orientation detection? bug Something isn't working pdf
#2966 opened May 3, 2024 by vivien000
CCT measure-table-structure-accuracy-command doesn't drop index bug Something isn't working
#2962 opened May 2, 2024 by mallorih
docs: Add docs showing users can set which OCR agent to use documentation Improvements or additions to documentation
#2961 opened May 2, 2024 by Coniferish
chore/Allow python 3.12
#2959 opened May 2, 2024 by NikolasWolke
feat/docx-field-codes docx Related to Microsoft Word (.docx) file format enhancement New feature or request
#2944 opened Apr 27, 2024 by erik-squared
Text Extraction Issue: Greek Language PDFs Rendered with Incorrect Alphabet bug Something isn't working ocr Related to optical character recognition (OCR).
#2939 opened Apr 26, 2024 by DarioBernardo
feat/partition_metadata enhancement New feature or request html
#2933 opened Apr 25, 2024 by Falven
Clarify orig_elements documentation documentation Improvements or additions to documentation enhancement New feature or request
#2929 opened Apr 25, 2024 by Marcell-Balint
chore: Update unstructured-client bug Something isn't working
#2924 opened Apr 23, 2024 by Coniferish
infer_table_structure lead Failed to initialize the model bug Something isn't working pdf
#2923 opened Apr 23, 2024 by spongxin
bug/Execution speed is very slow in AWS LAMBDA environment investigating Issues that require more information before they are actionable
#2916 opened Apr 22, 2024 by cds-code
Doc/Docx with Checkboxes docx Related to Microsoft Word (.docx) file format enhancement New feature or request
#2912 opened Apr 19, 2024 by Rob-Smith-HDT
Documentation for Partitioning table for email has wrong class type documentation Improvements or additions to documentation
#2907 opened Apr 19, 2024 by debasisdwivedy
bug: TesseractError: Estimating resolution as X bug Something isn't working ocr Related to optical character recognition (OCR).
#2900 opened Apr 17, 2024 by qued