We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
1)对第一张图片直接用paddleocr识别的坐标为[[2530.0, 105.0], [2654.0, 105.0], [2654.0, 171.0], [2530.0, 171.0]],可以看出右上角的文字能正常框出,即
2)但根据样例得到的demo_ocr_res.json文件里面发现的坐标确是这样:"document_bbox": [[2530, 2592, 105, 175], [2530, 2592, 105, 175], [2592, 2654, 105, 175], [284, 463, 294, 386], [463, 552, 294, 386], ...., (内容已省略)
不太明白document_bbox的数值指的是文字的坐标么,但直接将坐标在图片显示出来又不是文字方框,请问这些数值的具体含义是什么
The text was updated successfully, but these errors were encountered:
请参考aistudio教程:
https://aistudio.baidu.com/projectdetail/4049663?channelType=0&channel=0
Sorry, something went wrong.
KB-Ding
No branches or pull requests
请提出你的问题
1)对第一张图片直接用paddleocr识别的坐标为[[2530.0, 105.0], [2654.0, 105.0], [2654.0, 171.0], [2530.0, 171.0]],可以看出右上角的文字能正常框出,即
2)但根据样例得到的demo_ocr_res.json文件里面发现的坐标确是这样:"document_bbox": [[2530, 2592, 105, 175], [2530, 2592, 105, 175], [2592, 2654, 105, 175], [284, 463, 294, 386], [463, 552, 294, 386], ...., (内容已省略)
不太明白document_bbox的数值指的是文字的坐标么,但直接将坐标在图片显示出来又不是文字方框,请问这些数值的具体含义是什么
The text was updated successfully, but these errors were encountered: