Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Updated Jun 1, 2022 (Java)
A powerful and modular toolkit for record linkage and duplicate detection in Python
Java implementation of fuzzy string matching, ported from the well-known Python fuzzywuzzy library. Fuzzy search for Java
A .NET port of java-string-similarity
📚 String comparison and edit distance algorithms library, featuring: Levenshtein, LCS, Hamming, Damerau-Levenshtein (OSA and adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc.
Fuzzy string matching for PHP
String Distances in Julia
Golang metrics for calculating string similarity and other string utility functions
Natural Language Processing (NLP) library for Crystal
An attempt at the quickest and most memory-efficient implementation of Levenshtein distance, with SIMD and threading support
A privacy-focused, easily shareable, open-source, tracking-free diff viewer.
📐 Hidden alignment conditional random field for classifying string pairs.
Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.
A Java library for computation on permutations and sequences
Lexicographically subdivide the “space” between strings by defining an alternate non-base-ten number system using a pre-defined dictionary of symbol↔︎number mappings. Handy for ordering NoSQL keys.
String similarity functions and string distances: Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS (Longest Common Subsequence), cosine similarity...
📐 A Cython implementation of the affine gap string distance
String trie that supports wildcard search
Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.
A project for string similarities.
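
Most of the projects listed above implement Levenshtein distance in some form. As a rough illustration of what that metric computes, here is a minimal dynamic-programming sketch in Python. It is not taken from any of the listed repositories; the function name `levenshtein` and its cost parameters are hypothetical, chosen only for this example.

```python
def levenshtein(a: str, b: str,
                insert_cost: int = 1,
                delete_cost: int = 1,
                substitute_cost: int = 1) -> int:
    """Minimum total cost of edits that turn string a into string b.

    Hypothetical helper for illustration; the cost parameters are an
    assumption, not the API of any library listed on this page.
    """
    # prev[j] is the cost of turning the already-processed prefix of a into b[:j].
    # Initially that prefix is empty, so the cost is j insertions.
    prev = [j * insert_cost for j in range(len(b) + 1)]
    for i, ca in enumerate(a, start=1):
        # Turning a[:i] into the empty string takes i deletions.
        curr = [i * delete_cost]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + delete_cost,        # drop ca
                curr[j - 1] + insert_cost,    # add cb
                prev[j - 1] + (0 if ca == cb else substitute_cost),  # match or replace
            ))
        prev = curr
    return prev[-1]
```

The sketch keeps only two rows of the dynamic-programming table, so memory use grows with the length of `b` rather than with the product of both lengths. The SIMD and threaded libraries above target the same recurrence with far more aggressive optimizations.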