Update how Baidu Baike results are retrieved #174

Open
wants to merge 19 commits into base: dev

Conversation

yuyijiong

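What did you do in this request?
Updated how Baidu Baike results are retrieved.
The Baidu Baike entry that Baidu search now ranks first is usually the one with tpl="sg_kg_entity_san", and in that case its content is often better than the entry with tpl="bk_polysemy".
So I made tpl="sg_kg_entity_san" the first-priority condition when selecting the Baidu Baike result, with tpl="bk_polysemy" as the fallback.
After this update, the Baidu Baike search results should better match what users need.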
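
The priority rule described above can be sketched as follows. This is a minimal illustration, not the code in this PR: it assumes each search result has already been reduced to a dict carrying its `tpl` attribute, and the names `pick_baike_result` and `BAIKE_TPL_PRIORITY` are hypothetical.

```python
from typing import Iterable, Mapping, Optional, Sequence

# Preference order described in the PR text: the "sg_kg_entity_san" card first,
# then the "bk_polysemy" card. (Constant name is hypothetical.)
BAIKE_TPL_PRIORITY: Sequence[str] = ("sg_kg_entity_san", "bk_polysemy")


def pick_baike_result(
    results: Iterable[Mapping[str, str]],
    priority: Sequence[str] = BAIKE_TPL_PRIORITY,
) -> Optional[Mapping[str, str]]:
    """Return the first result whose tpl matches the highest-priority template.

    Assumes each result is a mapping with an optional "tpl" key.
    Returns None when no result matches any template in `priority`.
    """
    candidates = list(results)  # allow iterating once per template
    for tpl in priority:
        for result in candidates:
            if result.get("tpl") == tpl:
                return result
    return None
```

Scanning the whole result list once per template, rather than taking the first match in page order, is what makes `sg_kg_entity_san` win even when a `bk_polysemy` entry appears earlier on the page.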

Update Baidu Baike filtering mechanism
Update Baidu Baike parsing
Update Baidu Baike parsing method
yuyijiong and others added 16 commits December 10, 2023 19:49
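Fix bug where the domain could not be parsed
Add function search_baidu_normal, which treats all pages as ordinary web pages
Add normal_all parameter to search_web; normal_all=True treats all pages as ordinary web pages
Fix bug where a missing time raised an error
Fix error in parse_web when the search result count is not found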
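
The effect of a `normal_all` switch might look roughly like the sketch below. Only the parameter name `normal_all` comes from the commit messages above. The function `label_results`, the `kind` field, and the set of special templates are assumptions made for illustration, and the real `search_web` signature is not shown on this page.

```python
from typing import Dict, Iterable, List

# Templates that would normally get special (Baike) handling.
# Assumed here from the PR description; the project may recognize others.
SPECIAL_TPLS = {"sg_kg_entity_san", "bk_polysemy"}


def label_results(
    results: Iterable[Dict[str, str]],
    normal_all: bool = False,
) -> List[Dict[str, str]]:
    """Tag each result as "baike" or "normal" (hypothetical helper).

    With normal_all=True every result is tagged "normal", regardless of
    its tpl, so all pages go through the ordinary-page code path.
    """
    labeled = []
    for result in results:
        if not normal_all and result.get("tpl") in SPECIAL_TPLS:
            kind = "baike"
        else:
            kind = "normal"
        # Copy the result so the caller's dicts are left unmodified.
        labeled.append({**result, "kind": kind})
    return labeled
```

Keeping the flag's default at `False` preserves the existing behavior for callers that do not pass it.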