New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PDF only partially parsed #163
Labels
Comments
I can confirm this issue, LLamaParse misses a lot of text in the documents. On comparing the results of Llamaparse with Marker I noticed that LLamaparse doesn't parse around 40-60% of texts in PDF depending on the file. I must say, whatever llamaparse parses is superior to any other pdf to markdown converter out there but this issue makes it unusable. Look forward to a quick resolution from the team. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I parsed the below PDF using llama parse:
Allianz_2017_CbCR_7.pdf
Unfortunately, on page 1, only the left column got parsed:
The text was updated successfully, but these errors were encountered: