The library's full documentation can be found here. Be sure to lint your code and pass the unit tests before submitting a pull request.
natural-language-processing machine-learning fuzzy-matching clustering record-linkage bayes bloom-filter canberra caverphone chebyshev cologne cosine classifier daitch-mokotoff dice fingerprint fuzzy hamming k-means jaccard jaro lancaster levenshtein lig metaphone mra ngrams nlp nysiis perceptron phonetic porter punkt schinke sorensen soundex stats tfidf tokenizer tversky vectorizer winkler
This is the R package to support phonetic spelling algorithms in R. Several packages provide the Soundex algorithm. However, other algorithms developed since Soundex can also provide phonetic spelling and test phonetic similarity. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI-1548562. In particular, it used the Comet system at the San Diego Supercomputer Center (SDSC) through allocations TG-DBS170012 and TG-ASC150024.
phonetic-spelling-algorithms soundex phonics nysiis metaphone text-processing linguistics record-linkage
clj-fuzzy is a native Clojure library providing a collection of famous algorithms dealing with fuzzy strings and phonetics. It can be used in Clojure, ClojureScript, client-side JavaScript and Node.js.
fuzzy strings metrics matching clojurescript caverphone dice hamming jaccard jaro-winkler lancaster levenshtein nysiis metaphone porter schinke stemmer soundex tversky
Making similarity functions and phonetic algorithms readily available for fuzzy matching analyses in Spark. Update your build.sbt file to import the libraries.
cosine-distance spark fuzzy-score hamming-distance jaccard-similarity jaro-winkler double-metaphone nysiis refined-soundex
Run pip install ceja to install the library. Import the functions with import ceja. After importing the code you can run functions like ceja.nysiis, ceja.jaro_winkler_similarity, etc.
pyspark jaro-winkler nysiis metaphone damerau-levenshtein hamming-distance porter-stemmer jaro-similarity match-rating-comparisons
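To give a feel for what two of the tagged measures compute, here is a minimal plain-Python sketch of Hamming distance and Jaro similarity. It is an illustration only and is not taken from ceja or any other library listed here; the function names hamming_distance and jaro_similarity are hypothetical, and the listed libraries expose their own APIs (ceja, for instance, targets PySpark).

```python
def hamming_distance(s1: str, s2: str) -> int:
    """Count positions at which two equal-length strings differ."""
    if len(s1) != len(s2):
        raise ValueError("Hamming distance needs strings of equal length")
    return sum(a != b for a, b in zip(s1, s2))


def jaro_similarity(s1: str, s2: str) -> float:
    """Jaro similarity in [0, 1]; 1.0 means the strings are identical."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0

    # Characters match only if equal and within this distance of each other.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0

    # Walk both match sequences in order; out-of-order pairs are
    # half-transpositions, so the count is halved afterwards.
    k = 0
    half_transpositions = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                half_transpositions += 1
            k += 1
    t = half_transpositions // 2

    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3


if __name__ == "__main__":
    print(hamming_distance("karolin", "kathrin"))
    print(round(jaro_similarity("MARTHA", "MARHTA"), 4))
```

Hamming distance only applies to strings of equal length, which is why the sketch raises an error otherwise; Jaro similarity tolerates different lengths and small transpositions, which makes it a common choice for name matching.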