-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
如何把hanlp 与 solr的DataImportHandler对接? #13
Labels
Comments
你好, 各司其职,DataImportHandler由solr提供,Processor由tika提供,Analyzer由hanlp(或类似ik等分词器)提供。 |
谢谢你的说明, 我有成功作到data import了! 另外一个初学者的问题 已经尝试放再多个地方 可是切词看起来没有根据CustomDictionary.txt里面的关键字来切词 |
謝謝! |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
你好:
本人solr新手,麻烦详细指教
我已经导入hanlp-sor-plugin并且在solr里面运行analysis正常了
之后我想从PDF, word,... 等档案格式中提取数据导入索引
请参考 http://iamyida.iteye.com/blog/2214600
请问hanlp是否也有同tika一样(TikaEntityProcessor)提供相应的processor?
或者有其他方法呢?
The text was updated successfully, but these errors were encountered: