Figures and tables in the back / annex section ignored #737
Comments
This seems to be due to FullTextParser processing figures and tables from the body only.
This was referenced Apr 14, 2021
@de-code do you have a PDF for testing?
One example is DOI 10.1101/306803 or 306803v1 (from the bioRxiv 10k validation dataset).
This is related to #698.
Some documents have main figures and supplementary figures.
If, in those cases, the segmentation model labels the supplementary figures as `annex`, then the content is passed separately to the `fulltext` model. If the `fulltext` model then correctly labels it as `figure`, the figures from the `annex` are not included in the output.