Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity...
levenshtein-distance cosine-similarity string-distance damerau-levenshtein distance distance-measure jaro-winkler similarity-measures shingles algorithm jvmJellyfish is a python library for doing approximate and phonetic matching of strings. Written by James Turk <james.p.turk@gmail.com> and Michael Stephens.
levenshtein soundex hamming metaphone jaro-winkler fuzzy-search"Natural" is a general natural language facility for nodejs. Tokenizing, stemming, classification, phonetics, tf-idf, WordNet, string similarity, and some inflections are currently supported.
natural language porter lancaster stemmer bayes classifier phonetic metaphone inflector wordnet tf-idf logistic regression doublemetaphone double jaro-winkler levenshtein distance taggerGolang string comparison and edit distance algorithms library featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...
unicode algorithms edit-distance levenshtein jaro-winkler levenshtein-distance similarity-measures string-distance cosine string-matching damerau-levenshtein lcs lcs-distance hamming string-comparison golang-string-comparison edit-distance-algorithmsA collection of string matching algorithms built with React Select in mind. They're well documented in main.js.
distance filter fuzzy jaro-winkler levenshtein match options proximity search similar similarity string subsequence substitutions text typeahead variationsjaro_winkler is an implementation of Jaro-Winkler distance algorithm which is written in C extension and will fallback to pure Ruby version in platforms other than MRI/KRI like JRuby or Rubinius. Both of C and Ruby implementation support any kind of string encoding, such as UTF-8, EUC-JP, Big5, etc. There is no JaroWinkler.jaro_winkler_distance, it's tediously long.
algorithm jaro-winkler jaro-winkler-distanceA string similarity function using the Jaro-Winkler distance metric. Returns a number between 0 and 1. A 0 being no similarity and a 1 being an exact match. Read more about it on Wikipedia.
distance metric jaro winkler jaro-winkler compareclj-fuzzy is a native Clojure library providing a collection of famous algorithms dealing with fuzzy strings and phonetics. It can be used in Clojure, ClojureScript, client-side JavaScript and Node.js.
fuzzy strings metrics matching clojurescript caverphone dice hamming jaccard jaro-winkler lancaster levenshtein nysiis metaphone porter schinke stemmer soundex tverskySimetric provides facilities to perform approximate string matching and measurement of string similarity/distance. The library is focusing on speed and completeness. Then, run mix deps.get in your shell to fetch the new dependency.
elixir jaro-winkler levenshteinThe plan for this package is to have Go implementation of different string distance/similarity functions, like Levenshtein (normalized, weighted, Damerau), Jaro-Winkler, Jaccard index, Euclidean distance, Hamming distance... Work in progress...
jaro-winkler cosine-distance jaccard levenshtein string-distance comparison jaro-distance distanceMaking similarity functions and phonetic algorithms readily available for fuzzy matching analyses in Spark. Update your build.sbt file to import the libraries.
cosine-distance spark fuzzy-score hamming-distance jaccard-similarity jaro-winkler double-metaphone nysiis refined-soundexYou can change the version in the url to see the documentation for an older version in the changelog. If you don't want to install Rust itself, you can run $ ./dev for a development CLI if you have Docker installed.
jaro-winkler levenshtein jaro damerau-levenshtein edit-distancestrutil provides string metrics for calculating string similarity as well as other string utility functions. Full documentation can be found at: https://pkg.go.dev/github.com/adrg/strutil. The package defines the StringMetric interface, which is implemented by all the string metrics. The interface is used with the Similarity function, which calculates the similarity between the specified strings, using the provided string metric.
string smith-waterman levenshtein jaro-winkler string-metrics string-distance jaccard-similarity jaccard string-matching string-similarity hamming-distance jaro n-gram jaccard-index overlap-coefficient dice-coefficient smith-waterman-gotoh sorensen-dice n-gram-intersection strutilStringComparison is a library developed for reconciling naming conventions between different models of the electric grid. I have stripped off the power system specific code and put together what can effectively be used as a string extension for determining approximate equality between two strings. All of the algorithms used here have been pulled from online resources, translated into C#, and compiled into this library. I found several other similar open-source implementations around but nothing for .NET/C#. Adding the *.dll to your project will give you access to this extension and the individual extensions under the hood of the IsSimilarity() extension. While all of the algorithms are exposed and can be used and can provide their raw results, they have been conveniently combined in a way that they can selectively be used to judge the approximate equality of two strings. This is done through the IsSimilar extension and by setting the desired StringComparisonOptions and StringComparisonTolerance.
string comparison jaro-winkler levenshtein-distance longest-common-subsequence jaccard-distance hamming-distance jaro-distance longest-common-substring overlap-coefficient ratcliff-obershelp-similarity sorensen-dice-distance tanimoto-coefficientTySug is collection of packages, together they form a keyboard layout aware alternative word suggester. It can be used as both a library and a webservice. The primary supported use-case is to help with spelling mistakes against short popular word lists (e.g. domain names). Which is useful in helping to prevent typos in e.g. e-mail addresses, detect spam, phishing (Typosquatting), etc.
docker toml keyboard cors webservice library algorithm keyboard-layout typing jaro-winkler words typo string-distance suggestions spelling-errors jaroRun pip install ceja to install the library. Import the functions with import ceja. After importing the code you can run functions like ceja.nysiis, ceja.jaro_winkler_similarity, etc.
pyspark jaro-winkler nysiis metaphone damerau-levenshtein hamming-distance porter-stemmer jaro-similarity match-rating-comparisons
We have large collection of open source products. Follow the tags from
Tag Cloud >>
Open source products are scattered around the web. Please provide information
about the open source projects you own / you use.
Add Projects.