-
-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add new GREL function to normalize characters #6608
Comments
A Separately, we probably also want to enable access to locale sensitive string comparisons of various strengths as provided by |
It would be easier if OpenRefine allows searching for words with diacritical marks or extended western characters.
For example, if I have the name
Björn Borg
in a data set, if you try to useText filter
and writeBjorn Borg
, you won't get any results.Proposed solution
Transform the cells value to its normalization. This could happen by providing a new builtin GREL function, perhaps called
normalize()
, that do the following :For example:
"gödel".normalize()
->godel
"Villazón".normalize()
->Villazon
Alternatives considered
Installing Jython 2.7 + unidecode library (like here)
Additional context
Look at the
normalize()
function inFingerprintKeyer
class hereThe text was updated successfully, but these errors were encountered: