Tutorial results in obscure errors. #40
Comments
Thanks for the catch! The tutorial is outdated and incomplete, though: the recommended way is to use crfsuite, not wapiti, and the tutorial should show how to use Pattern features, as they are important for getting good quality.
Hey @kmike, thanks for letting me know. I did manage to get it working with crfsuite and it's doing pretty well for my use case! I'd update the docs, but I feel my knowledge of this subject is a bit limited for the time being. Could you elaborate on Pattern features, or point me to some material?
Hey @Granitosaurus! I've added a complete example here: https://github.com/scrapinghub/webstruct/tree/master/example; it'd be nice to move some parts of it to the tutorial.
I've been following the webstruct tutorial and I'm getting a few peculiar errors.
From the tutorial I end up with code along the lines of this:
The first error I get is a TypeError when trying to extract something with ner:
It seems like a Python 3 support issue, as it expects bytes but gets a string?
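For reference, here is a minimal sketch of the general kind of bytes-vs-str mismatch described above. It is not webstruct's code and not the actual traceback: `strip_tags` and `ensure_bytes` are hypothetical names made up for illustration, and the assumption is simply that some function expects raw bytes while the caller passes a Python 3 str.

```python
import re

# Hypothetical stand-in for a library function that only accepts bytes:
# it is built on a bytes regex pattern, so passing str raises TypeError.
_TAG_RE = re.compile(rb"<[^>]+>")


def strip_tags(html):
    """Remove tags from raw HTML given as bytes."""
    return _TAG_RE.sub(b"", html)


def ensure_bytes(data, encoding="utf-8"):
    """Encode str to bytes; pass bytes through unchanged."""
    return data.encode(encoding) if isinstance(data, str) else data


page = "<p>Hello</p>"  # a str, e.g. what response.text would give

raised = False
try:
    strip_tags(page)  # str passed where bytes are expected
except TypeError:
    raised = True

# Encoding the input first avoids the mismatch.
cleaned = strip_tags(ensure_bytes(page))
```

If the real cause is similar, encoding the input (or passing the raw response body instead of decoded text) before handing it over may be enough to get past the error.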
The second error occurs when trying to build a ner straight from the model without fitting it first:
This results in:
The errors seem to be very vague and I don't even know where to start debugging this. Am I missing something?
I'm running:
webstruct - 0.5
scikit-learn - 0.18.2
scipy - 0.19
libwapiti - 0.2.1