JamSpell - Modern spell checking library - accurate, fast, multi-language

  •        590

The model was trained on 300K Wikipedia sentences plus 300K news sentences (English). 95% of the data was used for training and 5% for evaluation. An error model was used to generate errored text from the original text. The JamSpell corrector was compared with Norvig's corrector, Hunspell, and a dummy corrector (no corrections). More details on reproducing the results are available in the "Train" section.




Related Projects

SymSpell - 1 million times faster through Symmetric Delete spelling correction algorithm

  •    CSharp

Spelling correction & fuzzy search: 1 million times faster through the Symmetric Delete spelling correction algorithm. The Symmetric Delete spelling correction algorithm reduces the complexity of edit candidate generation and dictionary lookup for a given Damerau-Levenshtein distance. It is six orders of magnitude faster than the standard approach (deletes + transposes + replaces + inserts) and is language independent.
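The idea behind symmetric deletes can be illustrated with a minimal Python sketch. This is not SymSpell's own code: the function names (`deletes`, `build_index`, `candidates`) are hypothetical, and the final Damerau-Levenshtein verification that a real implementation would run on each candidate is omitted. Only deletions are generated, both for dictionary terms (once, at index time) and for the input word (at lookup time).

```python
def deletes(word, max_distance):
    """Return word plus every string made by deleting up to max_distance characters."""
    results = {word}
    frontier = {word}
    for _ in range(max_distance):
        # Delete one more character from every string produced in the previous round.
        frontier = {w[:i] + w[i + 1:] for w in frontier for i in range(len(w))}
        results |= frontier
    return results


def build_index(dictionary, max_distance):
    """Map each delete-variant to the dictionary terms that produce it."""
    index = {}
    for term in dictionary:
        for variant in deletes(term, max_distance):
            index.setdefault(variant, set()).add(term)
    return index


def candidates(word, index, max_distance):
    """Collect dictionary terms sharing at least one delete-variant with word.

    The result may contain terms farther away than max_distance, so a real
    implementation would still verify each candidate's edit distance.
    """
    found = set()
    for variant in deletes(word, max_distance):
        found |= index.get(variant, set())
    return found


index = build_index(["hello", "help", "world"], max_distance=1)
print(candidates("helo", index, 1))   # missing letter and wrong letter
print(candidates("wrold", index, 1))  # transposed letters
```

Because inserts, replaces, and transposes are never generated, the number of variants per word stays small and does not depend on the alphabet size, which is where the language independence comes from.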

SharePoint 2010 SpellCheck


The SharePoint 2010 SpellCheck project lets you enable spell-checking functionality in SharePoint 2010 using SpellCheck.asmx.

did_you_mean - The gem that has been saving people from typos since 2014

  •    Ruby

Ruby 2.3 and later ships with this gem and it will automatically be required when a Ruby process starts up. No special setup is required. Aside from the basic features above, the did_you_mean gem comes with experimental features. They can be enabled by calling require 'did_you_mean/experimental'.

SpellO - Spell Checker for Website


SpellO crawls a website and finds typos and spelling errors. It provides a user-friendly interface and supports multiple dictionaries.


  •    Javascript

A lightweight jQuery plugin that can check the spelling of text within a form field or DOM tree.

Spell Checking (C#, VB, HTML5/JavaScript)


This implements a WinRT component wrapper over the C++ SpellChecker library, which can be used for basic spelling checks and for providing suggestions in Windows Store apps.

bounter - Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.

  •    Python

Bounter is a Python library, written in C, for extremely fast probabilistic counting of item frequencies in massive datasets, using only a small fixed memory footprint. Unlike dict or Counter, Bounter can process huge collections where the items would not even fit in RAM. This commonly happens in machine learning and NLP, with tasks like dictionary building or collocation detection that need to estimate counts of billions of items (token n-grams) for their statistical scoring and subsequent filtering.
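To make "probabilistic counting in fixed memory" concrete, here is a minimal count-min sketch in pure Python. It is one well-known technique for this kind of bounded-memory counting and is offered only as an illustration; it is not Bounter's code or API, and the class name, `width`, and `depth` parameters are assumptions chosen for the example.

```python
import hashlib


class CountMinSketch:
    """Fixed-size frequency table: estimates never undercount, but may overcount."""

    def __init__(self, width=1024, depth=4):
        self.width = width
        self.depth = depth
        # Memory use is width * depth counters, regardless of how many items are added.
        self.table = [[0] * width for _ in range(depth)]

    def _buckets(self, item):
        # One independent hash per row, obtained by salting BLAKE2b with the row number.
        for row in range(self.depth):
            digest = hashlib.blake2b(
                item.encode("utf-8"),
                digest_size=8,
                salt=row.to_bytes(8, "little"),
            ).digest()
            yield row, int.from_bytes(digest, "little") % self.width

    def add(self, item, count=1):
        for row, col in self._buckets(item):
            self.table[row][col] += count

    def estimate(self, item):
        # Collisions only inflate counters, so the minimum across rows is the best estimate.
        return min(self.table[row][col] for row, col in self._buckets(item))


sketch = CountMinSketch()
for token in "the cat sat on the mat the end".split():
    sketch.add(token)
print(sketch.estimate("the"))
```

The trade-off is accuracy for space: a wider table lowers the chance of collisions, and more rows lower the chance that every row collides for the same item.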


  •    Ruby

A GTK run dialog box with tab completion and a simple UI. Supports common features as well as history, files, directories, URLs, and spelling correction.

mafsa - Package mafsa implements Minimal Acyclic Finite State Automata in Go, essentially a high-speed, memory-efficient, Unicode-friendly set of strings

  •    Go

Package mafsa implements Minimal Acyclic Finite State Automata (MA-FSA) with Minimal Perfect Hashing (MPH). Basically, it's a set of strings that lets you test for membership, do spelling correction (fuzzy matching) and autocomplete, but with higher memory efficiency than a regular trie. With MPH, you can associate each entry in the tree with data from your own application. A BuildTree is used to build data from scratch. Once all the elements have been inserted, the BuildTree can be serialized into a byte slice or written to a file directly. It can then be decoded into a MinTree, which uses significantly less memory. MinTrees are read-only, but this greatly improves space efficiency.

Ghalatawi, Arabic AutoCorrect


This project aims to construct a word list and a list of regular expressions for Arabic autocorrection (automatic spelling correction).


  •    Java

This project provides implementations of spellcheckers in Java: spellchecker implementations for TinyMCE based on Jazzy and google-spellchecker-service. Authors: Rich Irwin, Andrey Chorniy. Integration details are available at https://achorniy.wordpress.com/2009/08/11/tinymce-spellchecker-in-java/ and https://achorniy.wordpress.com/tag/spellchecker/

fuck - Fix command line spelling errors like it's 1999.

  •    CSharp

Fix command line spelling errors like it's 1999. I ordered my lunch at work late so I decided to spike out a program to fix command line spelling errors in my newfound free time.

NLP-progress - Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks

  •    HTML

This document aims to track the progress in Natural Language Processing (NLP) and give an overview of the state-of-the-art (SOTA) across the most common NLP tasks and their corresponding datasets. It aims to cover both traditional and core NLP tasks such as dependency parsing and part-of-speech tagging as well as more recent ones such as reading comprehension and natural language inference. The main objective is to provide the reader with a quick overview of benchmark datasets and the state-of-the-art for their task of interest, which serves as a stepping stone for further research. To this end, if there is a place where results for a task are already published and regularly maintained, such as a public leaderboard, the reader will be pointed there.

RakNet - RakNet is a cross platform, open source, C++ networking engine for game programmers.

  •    C

See Help\swigtutorial.html

Upgrading from version 3: see 3.x_to_4.x_upgrade.txt

Windows users (Visual Studio 2008 and 2010): load RakNet_VS2008.sln and convert if necessary. After the project conversion, if you encounter error MSB4006, follow the steps below to fix it:
1. Open project properties
2. Click on "Common Properties"
3. Click on "Framework and References"
4. Look

Squiggly Spell Check


A better Notepad++ spell check plug-in. Adds:
- Visualize your document's spelling errors with red squiggly underlines
- Allows you to choose whether or not to underline spelling mistakes
- Select pre-defined document types for spelling.

JaSpell - Java spelling checking package

  •    Java

JaSpell is a Java spelling checking package. It is of particular interest for developers, since it provides a set of APIs (Application Programming Interfaces) that make it easy to add spelling checking to any Java application. JaSpell's implementation is based on the ternary search tree.
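For readers unfamiliar with the data structure, a ternary search tree can be sketched in a few lines of Python. This is a generic illustration and not JaSpell's Java implementation; the class and attribute names are assumptions. Each node holds one character and three children: `lo` for smaller characters, `hi` for larger ones, and `eq` for the next character of the word.

```python
class Node:
    __slots__ = ("char", "lo", "eq", "hi", "is_word")

    def __init__(self, char):
        self.char = char
        self.lo = self.eq = self.hi = None
        self.is_word = False  # True if a stored word ends at this node


class TernarySearchTree:
    def __init__(self):
        self.root = None

    def insert(self, word):
        if word:  # empty strings are not stored
            self.root = self._insert(self.root, word, 0)

    def _insert(self, node, word, i):
        ch = word[i]
        if node is None:
            node = Node(ch)
        if ch < node.char:
            node.lo = self._insert(node.lo, word, i)
        elif ch > node.char:
            node.hi = self._insert(node.hi, word, i)
        elif i + 1 < len(word):
            node.eq = self._insert(node.eq, word, i + 1)
        else:
            node.is_word = True
        return node

    def __contains__(self, word):
        node, i = self.root, 0
        while node is not None and word:
            ch = word[i]
            if ch < node.char:
                node = node.lo
            elif ch > node.char:
                node = node.hi
            elif i + 1 < len(word):
                node, i = node.eq, i + 1  # matched this character, move to the next
            else:
                return node.is_word
        return False


tree = TernarySearchTree()
for w in ["spell", "spelling", "speak"]:
    tree.insert(w)
print("spell" in tree, "spel" in tree)
```

Compared with a trie that keeps a full child table per node, this layout stores only three pointers per node, which is one reason it is a common choice for dictionary lookups.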

Typo.js - A client-side JavaScript spellchecker that uses Hunspell-style dictionaries.

  •    Javascript

Typo.js is a JavaScript spellchecker that uses Hunspell-style dictionaries. To use Typo in a standard web application you need to pass a settings object that provides a path to the folder containing the desired dictionary.

pynlpl - PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing

  •    Python

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation). The library is divided into several packages and modules. It works on Python 2.7, as well as Python 3.

PyTorch-NLP - Supporting Rapid Prototyping with a Toolkit (incl. Datasets and Neural Network Layers)

  •    Python

PyTorch-NLP, or torchnlp for short, is a library of neural network layers, text processing modules and datasets designed to accelerate Natural Language Processing (NLP) research. Join our community, add datasets and neural network layers! Chat with us on Gitter and join the Google Group, we're eager to collaborate with you.