
did_you_mean - The gem that has been saving people from typos since 2014

  •    Ruby

Ruby 2.3 and later ship with this gem, and it is required automatically when a Ruby process starts up, so no special setup is needed. Aside from its basic features, the did_you_mean gem comes with experimental features. They can be enabled by calling require 'did_you_mean/experimental'.

SymSpell - SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm

  •    CSharp

The Symmetric Delete spelling correction algorithm reduces the complexity of edit candidate generation and dictionary lookup for a given Damerau-Levenshtein distance. It is six orders of magnitude faster than the standard approach (which generates deletes + transposes + replaces + inserts) and is language independent. Lookup provides very fast spelling correction of single words.
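
The idea behind symmetric deletes can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not SymSpell's actual implementation: the function names (deletes, osa_distance, build_index, lookup) are hypothetical, and the optimal string alignment distance is used here as a simple stand-in for Damerau-Levenshtein. Only deletions are generated, both for dictionary words (at index time) and for the input term (at lookup time), and the surviving candidates are then verified with a real distance computation.

```python
from collections import defaultdict
from itertools import combinations


def deletes(word, max_distance):
    """Return word plus every string made by deleting up to max_distance chars."""
    results = {word}
    for k in range(1, min(max_distance, len(word)) + 1):
        for idxs in combinations(range(len(word)), k):
            drop = set(idxs)
            results.add("".join(c for i, c in enumerate(word) if i not in drop))
    return results


def osa_distance(a, b):
    """Optimal string alignment distance (a restricted Damerau-Levenshtein)."""
    prev2, prev = None, list(range(len(b) + 1))
    for i in range(1, len(a) + 1):
        cur = [i] + [0] * len(b)
        for j in range(1, len(b) + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            cur[j] = min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + cost)
            # Adjacent transposition counts as a single edit.
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                cur[j] = min(cur[j], prev2[j - 2] + 1)
        prev2, prev = prev, cur
    return prev[len(b)]


def build_index(words, max_distance=2):
    """Map every delete variant to the dictionary words that produce it."""
    index = defaultdict(set)
    for w in words:
        for d in deletes(w, max_distance):
            index[d].add(w)
    return index


def lookup(term, index, max_distance=2):
    """Return (distance, word) pairs within max_distance, closest first."""
    candidates = set()
    for d in deletes(term, max_distance):
        candidates |= index.get(d, set())
    scored = [(osa_distance(term, w), w) for w in candidates]
    return sorted((dist, w) for dist, w in scored if dist <= max_distance)


index = build_index(["hello", "help", "world"])
print(lookup("helo", index))   # candidates for a missing letter
print(lookup("wrold", index))  # candidates for a transposition
```

The trade-off in this sketch is memory for speed: the index stores many delete variants per dictionary word, but a lookup only needs the deletes of the input term plus a handful of distance checks.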

sheldon - Very Simple Erlang Spell Checker

  •    Erlang

A very simple Erlang spell checker. Sheldon also suggests corrections when a word is misspelled. That functionality was heavily inspired by the Elixir project spell_check.

JamSpell - Modern spell checking library - accurate, fast, multi-language

  •    C++

The model was trained on 300K Wikipedia sentences + 300K news sentences (English). 95% was used for training and 5% for evaluation. An error model was used to generate errored text from the original. The JamSpell corrector was compared with Norvig's corrector, Hunspell, and a dummy one (no corrections). More details on reproducing the results are available in the "Train" section.




gecco - Generic Environment for Context-Aware Correction of Orthography

  •    Python

Gecco is a generic, modular, and distributed framework for spelling correction. It aims to build a complete context-aware spelling correction system from your own data set. Most modules will be language-independent and trainable from a source corpus. Training is explicitly included in the framework. The framework aims to be easily extensible: modules can be written in Python 3. Moreover, the framework is scalable and can be distributed over multiple servers. Given an input text, Gecco will add various suggestions for correction.

spacy_hunspell - :pencil2: Hunspell extension for spaCy 2.0.

  •    Python

This package uses the spaCy 2.0 extensions to add Hunspell support for spellchecking. To use it, add spaCyHunSpell to the spaCy pipeline.

misspell-fixer - Simple tool for fixing common misspellings, typos in source code

  •    sed

A utility to fix common misspellings and typos in source code. Program code contains many typical misspellings. They tend to be more eye-catching in live code, but they can easily hide in comments, examples, samples, notes, and documentation. With this utility you can fix a large number of them very quickly. Be aware that the utility does not check or fix file names: a misspelled word may be fixed where a file name appears in a program's code, but the file itself will not be renamed by this utility.

LinSpell - Fast approximate strings search & spelling correction

  •    CSharp

The LinSpell spelling correction algorithm does not require edit candidate generation (as Norvig's algorithm does) or specialized data structures (such as a BK-tree). In most cases LinSpell is faster and requires less memory than a BK-tree or Norvig's algorithm. LinSpell is language and character set independent. The word frequency list was created by intersecting two lists: through reciprocal filtering, only those words that appear in both lists are used. Additional filters were applied, and the resulting list was truncated to the ≈ 80,000 most frequent words.
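
A lookup that needs neither candidate generation nor a special index can be sketched as a plain linear scan over a word frequency list. The Python below is an illustration of that general idea only, not LinSpell's code: the names levenshtein and linear_lookup, the sample frequencies, and the length-difference shortcut are all assumptions made for the example.

```python
def levenshtein(a, b):
    """Classic Levenshtein distance using two rolling rows."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]


def linear_lookup(term, frequencies, max_distance=2):
    """Scan every dictionary word; return matches sorted by distance,
    then by descending frequency."""
    matches = []
    for word, count in frequencies.items():
        # Cheap filter: a length gap larger than max_distance can never match.
        if abs(len(word) - len(term)) > max_distance:
            continue
        dist = levenshtein(term, word)
        if dist <= max_distance:
            matches.append((dist, -count, word))
    return [word for _, _, word in sorted(matches)]


# Hypothetical frequency list, for illustration only.
frequencies = {"spelling": 50, "spilling": 10, "spell": 80, "apple": 30}
print(linear_lookup("speling", frequencies))
```

The scan touches every word, so its cost grows with dictionary size, but it needs no memory beyond the word list itself.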


SymSpellCompound - SymSpellCompound: compound aware automatic spelling correction

  •    

SymSpellCompound supports compound-aware automatic spelling correction of multi-word input strings. It is built on top of SymSpell's 1 million times faster spelling correction algorithm. Splitting errors, concatenation errors, substitution errors, transposition errors, deletion errors, and insertion errors can be mixed within the same word.

WordSegmentationTM - Fast Word Segmentation with Triangular Matrix

  •    CSharp

Fast word segmentation using a triangular matrix approach. It is 2x faster, with lower memory consumption (constant O(1) vs. linear O(n)), better scaling, and more GC-friendly behavior. For word segmentation using a dynamic programming approach, have a look at WordSegmentationDP.
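
For orientation, the dynamic programming style of word segmentation mentioned above can be sketched generically in Python. This is not the triangular matrix method and not code from WordSegmentationTM or WordSegmentationDP: the function name segment, the max_word_len parameter, and the word probabilities are assumptions for illustration. The sketch keeps, for each prefix of the input, the best-scoring split found so far.

```python
import math


def segment(text, word_probs, max_word_len=20):
    """Split text into the most probable sequence of known words.

    best[end] holds (log-probability, start index of the last word) for the
    best segmentation of text[:end]. Returns None if no segmentation exists.
    """
    n = len(text)
    best = [(-math.inf, 0)] * (n + 1)
    best[0] = (0.0, 0)
    for end in range(1, n + 1):
        for start in range(max(0, end - max_word_len), end):
            piece = text[start:end]
            if piece in word_probs and best[start][0] > -math.inf:
                score = best[start][0] + math.log(word_probs[piece])
                if score > best[end][0]:
                    best[end] = (score, start)
    if best[n][0] == -math.inf:
        return None
    # Walk the back-pointers from the end of the text to recover the words.
    words, end = [], n
    while end > 0:
        start = best[end][1]
        words.append(text[start:end])
        end = start
    return words[::-1]


# Hypothetical unigram probabilities, for illustration only.
probs = {"the": 0.05, "quick": 0.001, "brown": 0.001, "fox": 0.001,
         "he": 0.01, "t": 0.0001}
print(segment("thequickbrownfox", probs))
```

Note that this version stores one table entry per input position, which is the linear O(n) memory use that the description contrasts with a constant-memory approach.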

spellingBee - :honeybee: :heavy_plus_sign: :octocat: Correct spelling of README's.

  •    Python

For persistence across terminal sessions, consider adding both export statements to your ~/.bashrc, ~/.zshrc, etc. You will need to source ~/.bashrc, or similar rc file in order to reload these export statements the first time you configure the environment variables. In the case where a spelling correction can be found, a fork and pull request from the authorized account will be triggered upon execution of ./main.py.




