TextCat - Perl Program helps to identify natural language

  •        4301

TextCat written in Perl helps to identify 69 natural langauge.

http://www.let.rug.nl/~vannoord/TextCat/

Tags
Implementation
License
Platform

   




Related Projects

UIMA - Unstructured information management architecture


UIMA analyzes large volumes of unstructured information in order to discover knowledge that is relevant to an end user. It is a framework with different set of components. The components include Language Identification, Language specific segmentation, Sentence boundary detection, Entity detection (person/place names) etc. The framework manages these components and the data flows between them.

NTextCat


NTextCat is text classification utility. Primary target is language identification. So it helps you to recognize (identify) the language of text (or binary) snippet. Pure .net application (C#).

NLangDetect


C# port of a language detection library. Tags: language detection, language identification, language guessing.

ImageMagick


ImageMagick is a software suite to create, edit, and compose bitmap images. It can read, convert and write images in a variety of formats (over 100) including DPX, EXR, GIF, JPEG, JPEG-2000, PDF, PhotoCD, PNG, Postscript, SVG, and TIFF. Use ImageMagick to translate, flip, mirror, rotate, scale, shear and transform images, adjust image colors, apply various special effects, or draw text, lines, polygons, ellipses and Bézier curves.



TextCat - Simple and lightweight library to classify text using N-Grams (useful to detect language)


Simple and lightweight library to classify text using N-Grams (useful to detect language)

multi-lid - proof of concept multilingual text language identification


proof of concept multilingual text language identification

whatlang-rs - Natural language detection library for Rust


Natural language detection for Rust with focus on simplicity and performance.For more details (e.g. how to blacklist some languages) please check the documentation.

lidruby - Language Identification with Ruby: probabilistic language identification with ruby1.9


Language Identification with Ruby: probabilistic language identification with ruby1.9

whatlanggo - Natural language detection library for Go


Natural language detection for Go.Thanks to greyblake Potapov Sergey for creating whatlang-rs from where I got the idea and logic.

fmi.ldet - Proiect Text Mining - Language Detection


Proiect Text Mining - Language Detection

language-detection - Use ngrams to work out what language a given text is


Use ngrams to work out what language a given text is

unsupervised-language-identification


An unsupervised language identification algorithm in Ruby, built originally for detecting English-language tweets.

language-identifier - N-gram-based JavaScript language identification


N-gram-based JavaScript language identification

crodas-TextCat


Simple and lightweight library to classify text using N-Grams (useful to detect language)

twilio-translator


A Twilio service that translates back an SMS message. It also calls you back with a pronunciation in the local language. Useful if you're traveling - especially in occasions where your telco signal is a better option than wifi. Uses node.js and Mashape APIs (language detection, language translation, and text-to-voice)

detection-language


Translate and detect the language of blocks of text within a web page using Google Language API in pure JavaScript.

language-detection - Language detection library for Android


Language detection library for Android

textcat-sr


Serbian Cyrillic and Latin language models for libexttextcat, a free software n-gram based language guessing library

flyspell-textcat - Switch flyspell language according to


Switch flyspell language according to