You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Doing some large scale testing, so noting general errors in POS tagging and dependencies observed in the most recent en_core_web_sm that should likely be trained out. We train these out on our own models, but if it is useful I'll pass them on here if I feel like they should be part of the core models.
Short expressions of times in the format "1am","2am", etc., are being split into the numeral and "am", with the "am" then tagged as a verb or noun with an inappropriate dependency. Correct parse should likely be the "am" tagged as an adjective with a num_mod dependency on the numeral. FYI - looks like a lot of inconsistency on tagging and parsing of "a.m." and "p.m." as well.
The word "spanish" is being tagged as an adjective in all contexts, e.g. in "does she speak spanish", spanish is returned as an adjective instead of a noun
spaCy version: 2.0.5
Platform: Darwin-16.7.0-x86_64-i386-64bit
Python version: 3.6.1
Models: en
The text was updated successfully, but these errors were encountered:
Doing some large scale testing, so noting general errors in POS tagging and dependencies observed in the most recent en_core_web_sm that should likely be trained out. We train these out on our own models, but if it is useful I'll pass them on here if I feel like they should be part of the core models.
Short expressions of times in the format "1am","2am", etc., are being split into the numeral and "am", with the "am" then tagged as a verb or noun with an inappropriate dependency. Correct parse should likely be the "am" tagged as an adjective with a num_mod dependency on the numeral. FYI - looks like a lot of inconsistency on tagging and parsing of "a.m." and "p.m." as well.
The word "spanish" is being tagged as an adjective in all contexts, e.g. in "does she speak spanish", spanish is returned as an adjective instead of a noun
The text was updated successfully, but these errors were encountered: