-
Notifications
You must be signed in to change notification settings - Fork 171
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
分词结果过滤单个字符 #16
Comments
这个需求可以使用solr自带的 示例如下: <analyzer>
<tokenizer class="solr.StandardTokenizerFactory"/>
<filter class="solr.LengthFilterFactory" min="2" max="7"/>
</analyzer>
将该过滤器配置在 ik 分词器的过滤器列表里即可。 |
十分感谢🙏 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
请问分词结果怎么过滤单个字符呢?如果源词就只有一个字符那么就直接返回源词,如果原来的词是多个字符例如 “我是中国人”, 那么分词结果只保留 “我是中国人”, “我是”,“中国人”, “中国”,不再要“人”
The text was updated successfully, but these errors were encountered: