Issues: Unstructured-IO/unstructured
Issues list
bug/some tables in PDF not getting recognized
bug
Something isn't working
pdf
#2997
opened May 9, 2024 by
Ritesh1137
feat/ocr_layer_to_pdf
enhancement
New feature or request
ocr
Related to optical character recognition (OCR).
#2991
opened May 8, 2024 by
punjabdhaputar
feat/ Param to control the behavior of chunking when encountering Table
enhancement
New feature or request
#2990
opened May 8, 2024 by
LucasOliveira44
Following dependencies are missing: pikepdf. Please install them using pip install pikepdf.
#2984
opened May 8, 2024 by
OLR-Nadia
Import is very time-consuming: from unstructured.partition.pdf import partition_pdf
investigating
Issues that require more information before they are actionable
#2983
opened May 8, 2024 by
peanutpaste
Lightweight installation of unstructured[pdf]?
investigating
Issues that require more information before they are actionable
#2976
opened May 7, 2024 by
liturrig
feat/chunking_by_title_tokens
enhancement
New feature or request
#2967
opened May 3, 2024 by
erik-squared
partition_pdf: no text orientation detection?
bug
Something isn't working
pdf
#2966
opened May 3, 2024 by
vivien000
CCT measure-table-structure-accuracy-command doesn't drop index
bug
Something isn't working
#2962
opened May 2, 2024 by
mallorih
docs: Add docs showing users can set which OCR agent to use
documentation
Improvements or additions to documentation
#2961
opened May 2, 2024 by
Coniferish
feat: enable users to define retry logic when using partition_via_api
enhancement
New feature or request
ingest
#2948
opened Apr 29, 2024 by
Coniferish
feat/docx-field-codes
docx
Related to Microsoft Word (.docx) file format
enhancement
New feature or request
#2944
opened Apr 27, 2024 by
erik-squared
Text Extraction Issue: Greek Language PDFs Rendered with Incorrect Alphabet
bug
Something isn't working
ocr
Related to optical character recognition (OCR).
#2939
opened Apr 26, 2024 by
DarioBernardo
Clarify orig_elements documentation
documentation
Improvements or additions to documentation
enhancement
New feature or request
#2929
opened Apr 25, 2024 by
Marcell-Balint
chore: Update unstructured-client
bug
Something isn't working
#2924
opened Apr 23, 2024 by
Coniferish
infer_table_structure leads to "Failed to initialize the model"
bug
#2923
opened Apr 23, 2024 by
spongxin
infer_table_structure in partition_pdf function causes CUDA RuntimeError
bug
#2922
opened Apr 22, 2024 by
naity2
bug/Execution speed is very slow in AWS LAMBDA environment
investigating
Issues that require more information before they are actionable
#2916
opened Apr 22, 2024 by
cds-code
Doc/Docx with Checkboxes
docx
Related to Microsoft Word (.docx) file format
enhancement
New feature or request
#2912
opened Apr 19, 2024 by
Rob-Smith-HDT
Documentation for Partitioning table for email has wrong class type
documentation
Improvements or additions to documentation
#2907
opened Apr 19, 2024 by
debasisdwivedy
bug: TesseractError: Estimating resolution as X
bug
Something isn't working
ocr
Related to optical character recognition (OCR).
#2900
opened Apr 17, 2024 by
qued