AMR parsing gets some numbers wrong #1721
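Thanks for the feedback. There is indeed a problem with parsing Chinese numerals. I tried the Microsoft library, but it cannot handle some cases that mix decimals with units either, so I ended up fixing it myself. Please apply the patch:
As for the values that are missing entirely, that happens because the model did not predict them, not because they were predicted and then converted incorrectly. There is no good solution for now; it may require joint learning with NER.
Looking forward to it.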
Describe the bug
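Example 1: 我给了他15万元。 ("I gave him 150,000 yuan.")
The AMR parse result is shown in the figure below:
"15万" is not parsed correctly.
Example 2: 我给了他十五点八万元。 ("I gave him 158,000 yuan.")
"十五点八万" is not parsed correctly.
Example 3: 我给了他十元三角八分钱。 ("I gave him 10 yuan, 3 jiao and 8 fen.")
"十元三角八分" is not parsed correctly.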
Code to reproduce the issue
Provide a reproducible test case that is the bare minimum necessary to generate the problem.
Describe the current behavior
After changing "15万" to "十五万", it is parsed correctly as "150000".
The error most likely comes from the number conversion step. For reference, see https://github.com/microsoft/Recognizers-Text
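To make the three failing inputs concrete, here is a minimal sketch of a converter that accepts Arabic digits mixed with Chinese units, decimals written with 点, and yuan/jiao/fen amounts. It is not the patch mentioned in the comments and not HanLP's actual conversion code; the names `parse_number` and `parse_amount` are hypothetical, and it assumes each section below 10,000 is written either entirely in Arabic digits or entirely in Chinese numerals.

```python
import re
from decimal import Decimal

# Lookup tables (assumed coverage: common simplified-Chinese numerals only).
DIGITS = {"零": 0, "〇": 0, "一": 1, "二": 2, "两": 2, "三": 3, "四": 4,
          "五": 5, "六": 6, "七": 7, "八": 8, "九": 9}
SMALL_UNITS = {"十": 10, "百": 100, "千": 1000}
BIG_UNITS = {"万": 10**4, "亿": 10**8}
CURRENCY_UNITS = [("元", Decimal("1")), ("角", Decimal("0.1")), ("分", Decimal("0.01"))]


def _parse_section(text: str) -> int:
    """Parse an integer below 10,000: all Arabic digits or all Chinese numerals."""
    if re.fullmatch(r"[0-9]+", text):
        return int(text)
    total, current = 0, 0
    for ch in text:
        if ch in DIGITS:
            current = DIGITS[ch]
        elif ch in SMALL_UNITS:
            # A bare unit such as the leading 十 in 十五 counts as 1 x unit.
            total += (current or 1) * SMALL_UNITS[ch]
            current = 0
        else:
            raise ValueError(f"unexpected character {ch!r} in {text!r}")
    return total + current


def _parse_chunk(chunk: str) -> Decimal:
    """Parse a section with an optional decimal part, e.g. 十五点八 or 3.5."""
    whole, _, frac = chunk.replace("点", ".").partition(".")
    value = Decimal(_parse_section(whole)) if whole else Decimal(0)
    if frac:
        digits = "".join(str(DIGITS.get(ch, ch)) for ch in frac)
        if not re.fullmatch(r"[0-9]+", digits):
            raise ValueError(f"bad fractional part {frac!r}")
        value += Decimal("0." + digits)
    return value


def parse_number(text: str) -> Decimal:
    """Parse numbers such as 15万, 十五点八万 or 一亿二千万."""
    if not text:
        raise ValueError("empty number")
    total, chunk = Decimal(0), ""
    for ch in text:
        if ch in BIG_UNITS:
            if chunk:
                total += _parse_chunk(chunk) * BIG_UNITS[ch]
            else:
                total *= BIG_UNITS[ch]  # stacked units, e.g. 一万亿
            chunk = ""
        else:
            chunk += ch
    if chunk:
        total += _parse_chunk(chunk)
    return total


def parse_amount(text: str) -> Decimal:
    """Parse a yuan/jiao/fen amount such as 十元三角八分钱 into yuan."""
    if text.endswith("钱"):
        text = text[:-1]
    total, rest = Decimal(0), text
    for unit, factor in CURRENCY_UNITS:
        head, sep, tail = rest.partition(unit)
        if sep:
            total += parse_number(head) * factor
            rest = tail
    if rest:
        raise ValueError(f"trailing text {rest!r}")
    return total
```

With this sketch, `parse_number("15万")` and `parse_number("十五万")` both compare equal to 150000, `parse_number("十五点八万")` compares equal to 158000, and `parse_amount("十元三角八分钱")` equals `Decimal("10.38")`. `Decimal` is used instead of `float` so that amounts such as 0.38 stay exact.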
Expected behavior
The label should be displayed correctly.
That said, the anchors in the output data preserve the position in the original text, so the problem is not that serious 😄
System information
Other info / logs
Include any logs or source code that would be helpful to diagnose the problem. If including tracebacks, please include the full traceback. Large logs and files should be attached.