
SecondString - Java-based package of approximate string-matching techniques

  •    Java

SecondString is an open-source Java-based package of approximate string-matching techniques. The code was developed by researchers at Carnegie Mellon University from the Center for Automated Learning and Discovery, the Department of Statistics, and the Center for Computer and Communications Security.

closestmatch - Golang library for fuzzy matching within a set of strings

  •    Go

closestmatch is a simple and fast Go library for fuzzy matching an input string to a list of target strings. It is useful for handling user input (which could be misspelled or out of order) that needs to match a key in a database. closestmatch uses a bag-of-words approach and precomputes character n-grams to represent each possible target string. The closest matches are the targets whose n-gram sets overlap most with those of the input. The precomputation scales well, and for long strings (like those in the test corpus) the library is much faster and more accurate than Levenshtein.
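The n-gram overlap idea described above can be sketched in a few lines of Python. This is a minimal illustration, not closestmatch's implementation or API: the function names `ngrams`, `build_index`, and `closest` are hypothetical, a single n-gram size of 3 is assumed, and the score is a plain count of shared n-grams.

```python
def ngrams(text, n=3):
    """Return the set of character n-grams of text (lowercased)."""
    text = text.lower()
    if len(text) < n:
        return {text}
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def build_index(targets, n=3):
    """Precompute the n-gram set of every target string once."""
    return {target: ngrams(target, n) for target in targets}


def closest(query, index, n=3):
    """Return the target sharing the most n-grams with query, or None."""
    query_grams = ngrams(query, n)
    best, best_overlap = None, 0
    for target, grams in index.items():
        overlap = len(query_grams & grams)
        if overlap > best_overlap:
            best, best_overlap = target, overlap
    return best


index = build_index(["war and peace", "pride and prejudice", "moby dick"])
print(closest("prejudice and pride", index))  # word order differs
print(closest("moby dik", index))             # misspelled input
```

Because the score depends only on which n-grams are shared, reordered words and small misspellings still overlap heavily with the intended target.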

levenshtein.c - Levenshtein algorithm in C

  •    C

Vladimir Levenshtein’s edit distance algorithm as a C library. There’s also a CLI, levenshtein(1), and a JavaScript version.
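For readers unfamiliar with edit distance, here is a minimal Python sketch of the standard dynamic-programming formulation, keeping only two rows of the table. It is an illustration of the algorithm, not a port of levenshtein.c, and the function name is chosen for this example.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string as the row to save memory
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("kitten", "sitting"))  # 3
```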




fsed - Aho-Corasick string replacement utility

  •    Perl

Licensed under the MIT License (see file LICENSE.rst for details). Search and replace on file(s), with matching on fixed strings.

strsim - string similarity based on Dice's coefficient in Go

  •    Go

Finds degree of similarity between two strings, based on Dice's Coefficient. Returns a fraction between 0 and 1, which indicates the degree of similarity between the two strings. 0 indicates completely different strings, 1 indicates identical strings. The comparison is case-sensitive.
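A minimal Python sketch of Dice's coefficient follows. It assumes the strings are compared as multisets of adjacent character pairs (bigrams), which is the usual formulation; whether strsim tokenizes exactly this way is an assumption, and the function names are hypothetical.

```python
from collections import Counter


def bigrams(text):
    """Count the adjacent character pairs in text (case-sensitive)."""
    return Counter(text[i:i + 2] for i in range(len(text) - 1))


def dice_coefficient(a, b):
    """Return 2 * |shared bigrams| / (|bigrams of a| + |bigrams of b|)."""
    if a == b:
        return 1.0  # also covers two empty or one-character strings
    grams_a, grams_b = bigrams(a), bigrams(b)
    total = sum(grams_a.values()) + sum(grams_b.values())
    if total == 0:
        return 0.0  # neither string is long enough to have a bigram
    shared = sum((grams_a & grams_b).values())  # multiset intersection
    return 2.0 * shared / total


print(dice_coefficient("night", "nacht"))  # 0.25
print(dice_coefficient("Night", "night"))  # 0.75
```

The second call shows the case sensitivity: "Ni" and "ni" count as different bigrams, so the score drops below 1.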

ahocorasick - Aho-Corasick for JavaScript.

  •    JavaScript

Implementation of the Aho-Corasick string searching algorithm, as described in the paper "Efficient string matching: an aid to bibliographic search". Check test/basic.js for more examples.
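Since several entries on this page build on Aho-Corasick, a compact Python sketch of the algorithm may help. It is not the API of any library listed here: `build_automaton` and `search` are names chosen for this example, and non-empty patterns are assumed.

```python
from collections import deque


def build_automaton(patterns):
    """Build the trie, failure links, and output lists for the patterns."""
    goto = [{}]  # goto[state][char] -> next state
    fail = [0]   # fail[state] -> state of the longest proper suffix in the trie
    out = [[]]   # out[state] -> patterns that end at this state
    for pattern in patterns:
        state = 0
        for ch in pattern:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].append(pattern)
    # Breadth-first pass: depth-1 states keep fail = 0, deeper ones inherit.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def search(text, automaton):
    """Return (start_index, pattern) for every occurrence in text."""
    goto, fail, out = automaton
    state, matches = 0, []
    for i, ch in enumerate(text):
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for pattern in out[state]:
            matches.append((i - len(pattern) + 1, pattern))
    return matches


automaton = build_automaton(["he", "she", "his", "hers"])
print(search("ushers", automaton))  # [(1, 'she'), (2, 'he'), (2, 'hers')]
```

The automaton is built once and can then scan any number of texts in a single pass each, which is why the approach pays off when the pattern set is large.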


php_aho_corasick - Aho-Corasick string search algorithm PHP extension implementation.

  •    C

PHP extension implementing the Aho-Corasick pattern matching algorithm (more on wiki). It is especially effective when there is a large database of needles (strings to be searched for, such as virus signatures). Another advantage is that the search structure is built in a separate call before searching, so it can be reused with different haystacks, saving time.

fast-simple-lcsk - Fast and simple algorithms for computing both LCSk and LCSk+

  •    C++

DISCLAIMER: This is not an officially supported Google product. This repository contains an implementation of the algorithms for computing LCSk and LCSk+, described in [1].

ahocorasick - A fast, compact and low-memory implementation of the Aho-Corasick algorithm based on double-array trie (cedar), supports visualizing inner structure via graphviz

  •    Go

cedar-go is a Golang port of cedar, which is written in C++ by Naoki Yoshinaga. cedar-go currently implements the reduced version of cedar. This package is not thread safe if there is one goroutine doing insertions or deletions.

pcre-ocaml - OCaml bindings to PCRE (Perl Compatible Regular Expressions)

  •    OCaml

This OCaml library interfaces with the C library PCRE (Perl Compatible Regular Expressions). It can be used for string matching with Perl-style regular expressions. The PCRE library by Philip Hazel has been under development for many years and is fairly advanced and stable. It implements just about all of the functionality that can be found in Perl regular expressions. The higher-level functions written in OCaml (split, replace, etc.) are also compatible with the corresponding Perl functions to the extent that OCaml allows. Most people find the syntax of Perl-style regular expressions more straightforward and powerful than the Emacs-style regular expressions used in the Str module in the standard OCaml distribution.

multi_string_replace - A fast multiple string replace library for Ruby

  •    C

A fast multiple string replace library for Ruby. Uses a C implementation of the Aho–Corasick algorithm based on https://github.com/morenice/ahocorasick, adding a few performance enhancements and on-the-fly multiple string replacement. If regular expressions are not needed, this library offers significant performance advantages over String.gsub() for large strings and a large number of tokens.

ATGValidator - iOS validation framework with form validation support

  •    Swift

ATGValidator is a validation framework written to address the most common issues faced while verifying user input data. You can use it to validate different data types directly, to validate UI components like UITextField or UITextView, or even to add validation support to your custom UI elements. You don't need to subclass native components to get validation support. UITextField and UITextView have support out of the box, and adding support for any other element is as simple as adding an extension with protocol conformance to ValidatableInterface.

strutil - Golang metrics for calculating string similarity and other string utility functions

  •    Go

strutil provides string metrics for calculating string similarity as well as other string utility functions. Full documentation can be found at: https://pkg.go.dev/github.com/adrg/strutil. The package defines the StringMetric interface, which is implemented by all the string metrics. The interface is used with the Similarity function, which calculates the similarity between the specified strings, using the provided string metric.

wildmatch - Simple string matching with question-mark and star wildcard operators

  •    Rust

Match strings against a simple wildcard pattern. Tests a wildcard pattern p against an input string s. Returns true only when p matches the entirety of s. See also the example described on Wikipedia for matching wildcards.
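A minimal Python sketch of whole-string wildcard matching, in the spirit of the description above. It assumes `?` matches exactly one character and `*` matches any run of characters, including none; the function name is hypothetical and this is not the wildmatch crate's API.

```python
def wildcard_match(pattern, text):
    """Return True only if pattern matches the entirety of text."""
    p = t = 0
    star = -1  # index of the most recent '*' in pattern, if any
    mark = 0   # text index where that '*' currently stops consuming
    while t < len(text):
        if p < len(pattern) and pattern[p] == "*":
            star, mark = p, t  # remember the star; try matching zero chars
            p += 1
        elif p < len(pattern) and pattern[p] in ("?", text[t]):
            p += 1
            t += 1
        elif star != -1:
            mark += 1          # backtrack: let the star absorb one more char
            p, t = star + 1, mark
        else:
            return False
    while p < len(pattern) and pattern[p] == "*":
        p += 1                 # trailing stars match the empty remainder
    return p == len(pattern)


print(wildcard_match("*.txt", "notes.txt"))  # True
print(wildcard_match("c?t", "cart"))         # False
```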





