TextTeaser - Automatic Summarization Algorithm

  •        0

TextTeaser is an automatic summarization algorithm that combines the power of natural language processing and machine learning to produce good results. It can provide provide a gist of an article, Better previews in news readers.

https://github.com/MojoJolo/textteaser
http://www.textteaser.com/

Tags
Implementation
License
Platform

   




Related Projects

Gate - General Architecture for Text Engineering


GATE excels at text analysis of all shapes and sizes. It provides support for diverse language processing tasks such as parsers, morphology, tagging, Information Retrieval tools, Information Extraction components for various languages, and many others. It provides support to measure, evaluate, model and persist the data structure. It could analyze text or speech. It has built-in support for machine learning and also adds support for different implementation of machine learning via plugin.

nlp - Natural language processing tools for text generation, search and analysis.


Natural language processing tools for text generation, search and analysis.

SWING


The Summarizer from the Web IR / NLP Group (WING), hence SWING, is a modular, state-of-the-art automatic extractive text summarization system. It is used as the basis for summarization research at the National University of Singapore. It performs as one of the leading automatic summarization systems in the international TAC competition, getting high marks for the ROUGE evaluation measure

node-summary - Node module that summarizes text using a naive summarization algorithm


Node module that summarizes text using a naive summarization algorithm

divijvaidya-iIntelli


We developed a generic interactive framework based on human cognition, where the system can learn continuously from the Internet and from its interaction with the users. To show the utilization of this framework, iIntelli, an agent based application for multiple text document summarization was developed and compared with the MEAD on the Cran Data Set. Mead is a natural language processing based summarizer, which provides summary by extracting sentences from a cluster of related documents and Cra

OpenPipe - Document Pipeline


OpenPipe is an open source scalable platform for manipulating a stream of documents. A pipeline is an ordered set of steps / operations performed on a document to convert from its raw form to something ready to be put into the index. The operations performed on documents include language detection, field manipulation, POS tagging, entity extraction or submitting the document to a search engine.

japanese-nlptools - Tools for NLP-related analysis of Japanese text


Tools for NLP-related analysis of Japanese text

ArabicNLP - Collection of various Arabic NLP and Text Processing Scripts and Utilities


Collection of various Arabic NLP and Text Processing Scripts and Utilities

tif - Text Interchange Formats


This package describes and validates formats for storing common object arising in text analysis as native R objects. Representations of a text corpus, document term matrix, and tokenized text are included. The tokenized text format is extensible to include other annotations. There are two versions of the corpus and tokens objects; packages should accept both and return or coerce to at least one of these.corpus (data frame) - A valid corpus data frame object is a data frame with at least two columns. The first column is called doc_id and is a character vector with UTF-8 encoding. Document ids must be unique. The second column is called text and must also be a character vector in UTF-8 encoding. Each individual document is represented by a single row in the data frame. Addition document-level metadata columns and corpus level attributes are allowed but not required.

Utils - Common routines in Java for basic data processing in network or text analysis


Common routines in Java for basic data processing in network or text analysis

Text-NLP - Release history of Text-NLP


Release history of Text-NLP

text-analysis-toolkit - Collection of tools for specific text processing that I needed.


Collection of tools for specific text processing that I needed.

text_summarization - Python text summarization project for computer science M.Sc.


Python text summarization project for computer science M.Sc.

suzgec - Text Summarization System for Turkish Language, Senior project at Atilim University


Text Summarization System for Turkish Language, Senior project at Atilim University

posthoc - Functions for ML/NLP experiment post-processing and analysis.


Functions for ML/NLP experiment post-processing and analysis.

bogofilter -- Fast Bayesian Spam Filter


Bogofilter is a mail filter that classifies mail as spam or ham (non-spam) by a statistical analysis of the message's header and content (body). The program is able to learn from the user's classifications and corrections. Bogofilter provides processing for plain text and HTML. It supports multi-part MIME messages with decoding of base64, quoted-printable, and uuencoded text and ignores attachments, such as images.

whatlang-rs - Natural language detection library for Rust


Natural language detection for Rust with focus on simplicity and performance.For more details (e.g. how to blacklist some languages) please check the documentation.

NTextCat


NTextCat is text classification utility. Primary target is language identification. So it helps you to recognize (identify) the language of text (or binary) snippet. Pure .net application (C#).