Jellyfish is a python library for doing approximate and phonetic matching of strings. Written by James Turk <james.p.turk@gmail.com> and Michael Stephens.
levenshtein soundex hamming metaphone jaro-winkler fuzzy-searchThe library's full documentation can be found here. Be sure to lint & pass the unit tests before submitting your pull request.
natural-language-processing machine-learning fuzzy-matching clustering record-linkage bayes bloom-filter canberra caverphone chebyshev cologne cosine classifier daitch-mokotoff dice fingerprint fuzzy hamming k-means jaccard jaro lancaster levenshtein lig metaphone mra ngrams nlp nysiis perceptron phonetic porter punkt schinke sorensen soundex stats tfidf tokenizer tversky vectorizer winklerThe quickest way to get the latest release of Phonix is to add it to your project using NuGet (http://nuget.org/List/Packages/Phonix).
phonetic-algorithms algorithm soundex metaphone caverphone c-sharpThis is the R package to support phonetic spelling algorithms in R. Several packages provide the Soundex algorithm. However, other algorithms have been developed since Soundex that can also provide phonetic spelling and test phonetic similarity. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI-1548562. In particular, it used the Comet system at the San Diego Supercomputing Center (SDSC) through allocations TG-DBS170012 and TG-ASC150024.
phonetic-spelling-algorithms soundex phonics nysiis metaphone text-processing linguistics record-linkageclj-fuzzy is a native Clojure library providing a collection of famous algorithms dealing with fuzzy strings and phonetics. It can be used in Clojure, ClojureScript, client-side JavaScript and Node.js.
fuzzy strings metrics matching clojurescript caverphone dice hamming jaccard jaro-winkler lancaster levenshtein nysiis metaphone porter schinke stemmer soundex tverskyThis project is Unlicensed. In other words, I don't care what you do with it. However, if you make something interesting, I would like to check it out. Well, the wiki article probably says enough. Find it here.
soundex index text phonetic algorithmMore will be added in the future. The module is available in PyPI, just use pip install pyphonetics.
phonetic-algorithms soundex levenshtein-distance nlp metaphone text-miningThis library contains implementations of the Levenshtein distance, Jaro-Winkler and Soundex algorithms written in Go (golang). Other algorithms related with string metrics (or string similarity, whatever) are welcome. The Wagner-Fischer algorithm for calculating the Levenshtein distance. It runs on O(mn) and needs O(2m) space where m is the size of the smallest string. This is kinda optimized so it should be used in most cases.
levenshtein-distance distance jaro soundex winkler jarowinkler ukkonen string-metrics algorithmMetaphone3 is a more accurate version of the original Soundex algorithm. It's designed so that similar-sounding words in American English share the same keys. For example Smith, Smyth, Smithe, Smythe all encode to SM0 primary and XMT alt. Whereas Schmidt encodes to XMT primary with no secondary. Searching for matches where either primary or secondary matches will give the best results.
golang-library soundex metaphone
We have large collection of open source products. Follow the tags from
Tag Cloud >>
Open source products are scattered around the web. Please provide information
about the open source projects you own / you use.
Add Projects.