Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Text Unit Classifer Suggestion Appears Empty #25

Open
dwmcqueen opened this issue Feb 9, 2018 · 3 comments
Open

Text Unit Classifer Suggestion Appears Empty #25

dwmcqueen opened this issue Feb 9, 2018 · 3 comments

Comments

@dwmcqueen
Copy link
Contributor

This may be a user issue, but I have classified several text units and the suggestion list continues to be empty. Is it functional?

@reddalexx
Copy link
Contributor

Hi.

Text Unit Classifier Suggestions mainly depend on 4 things:

  • how you marked text units using "class name" and "class value"
  • which item you choose in "classify by" list (terms, geo entities, parties)
  • classifier algorithm
  • and confidence value

So doing a classification you say: let's classify all text units
using specific terms from text (geo entities, party names) like anchors and
specified class [class name] values [class value] as labels for class groups.
And then set classifier method and Max confidence value.
Consider to coordinate marked text units with selected "Cluster By" form field
and may be decrease "Max Confidence" value.

@dwmcqueen
Copy link
Contributor Author

That makes sense - so it is functional. As I understand it, the "class name" for a similar text unit would be the same but the "class value" could differ, correct? For example, I could have a class name for "Assignability" but then several different values based on how the text unit reads? Then, it is important to have enough samples so the classifier runs?

@reddalexx
Copy link
Contributor

Yes, you are right.
F.e. if a user wants to classify by geo entities,
class_name could be "Case1", class_values could be "northern" text units with geo entities like Alaska and "southern" for text units with Florida or Texas
and then run classification with classify_by="geo entities"

And yes, there should be as much as possible samples to make better classification.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

2 participants