Skip to content
This repository has been archived by the owner on Nov 10, 2022. It is now read-only.

Fuzzy-matching via ElasticSearch #80

Open
wetneb opened this issue May 19, 2020 · 1 comment
Open

Fuzzy-matching via ElasticSearch #80

wetneb opened this issue May 19, 2020 · 1 comment

Comments

@wetneb
Copy link
Owner

wetneb commented May 19, 2020

The Wikidata search service supports fuzzy search by adding ~ to some words:

So, how should the reconciliation service use this? Add ~ to all the words in the query to enable fuzzy search everywhere? Only do it if the exact query did not return any candidate? Just let users add ~ to their own queries when they want to enable the feature? Any other strategy?

Cc @darecoder1999 who brought up the issue

@thadguidry
Copy link
Contributor

There's quite a bit more exposed even beyond ~ from the ElasticSearch-backed CirrusSearch as documented here: https://www.mediawiki.org/wiki/Help:CirrusSearch

Extra Specific Wikibase features are mentioned:

Wikibase search
The Wikibase extension defines some search keywords in order to make it easier to search for certain Wikibase items. This is useful on Wikidata and other Wikibase sites, including to search for images with Structured data on Wikimedia Commons. See Help:WikibaseCirrusSearch for details.

And documented here:
https://www.mediawiki.org/wiki/Help:Extension:WikibaseCirrusSearch

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants