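Suppose you manage many different kinds of documents and reports (doc, docx, Excel, ppt, pdf, jpg, and so on), with many templates and many document types. How can I classify them by content or by layout, using NLP or layout analysis?
My current idea is to convert every document to images page by page, then use PaddleNLP for annotation and information extraction, and train a model by category. This still needs time for experiments and validation.
PaddleNLP is becoming increasingly mature, but I have not been able to find a better approach. I would appreciate pointers from anyone with experience, and a demo would be even better.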
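To make the content-based half of the question concrete, here is a minimal baseline sketch I could compare any trained model against. It is not PaddleNLP and not layout analysis: it assumes the text of each page has already been extracted (for example by OCR), and it classifies a page by cosine similarity between character-bigram counts and one centroid per category. All function names and the sample labels are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n=2):
    """Split text into character n-grams after removing whitespace.

    Character n-grams need no tokenizer, so the same code handles
    Chinese and English pages.
    """
    cleaned = "".join(text.lower().split())
    return [cleaned[i:i + n] for i in range(len(cleaned) - n + 1)]


def vectorize(text):
    """Bag of character bigrams as a sparse count vector."""
    return Counter(char_ngrams(text))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def train_centroids(labeled_pages):
    """Sum the bigram counts of all pages that share a label."""
    centroids = defaultdict(Counter)
    for label, text in labeled_pages:
        centroids[label].update(vectorize(text))
    return dict(centroids)


def classify(text, centroids):
    """Return the label whose centroid is most similar to the page."""
    vec = vectorize(text)
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))


# Hypothetical labeled pages; real input would be the extracted page text.
labeled_pages = [
    ("invoice", "Invoice number total amount due payment"),
    ("report", "Quarterly report summary of findings and analysis"),
]
centroids = train_centroids(labeled_pages)
predicted = classify("amount due on this invoice", centroids)
```

A baseline like this only looks at text content, so it cannot separate two templates that share wording but differ in layout; that part would still need the page images.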