node-summary - Node module that summarizes text using a naive summarization algorithm

  •        89

Summarizes text using a naive summarization algorithm, based on the Python implementation by shlomibabluki. It now supports UTF-8, thanks to xissy.
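As a rough illustration of what a naive, overlap-based summarizer can look like, here is a minimal Python sketch. It is not the code of node-summary or of the Python implementation it is based on; the function names (`split_sentences`, `overlap`, `summarize`), the regex sentence splitter, and the scoring rule are assumptions chosen for illustration. Each sentence is scored by how many words it shares with every other sentence, and the highest-scoring sentences are returned in their original order.

```python
import re


def split_sentences(text):
    # Naive splitter (assumption): break after '.', '!' or '?' followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def overlap(a, b):
    # Shared words divided by the average number of distinct words in the two sentences.
    # \w+ matches Unicode word characters in Python 3, so non-ASCII text is handled.
    wa = set(re.findall(r"\w+", a.lower()))
    wb = set(re.findall(r"\w+", b.lower()))
    if not wa or not wb:
        return 0.0
    return len(wa & wb) / ((len(wa) + len(wb)) / 2)


def summarize(text, n=1):
    # Score each sentence by its total overlap with all other sentences,
    # then return the n best sentences in document order.
    sentences = split_sentences(text)
    scores = [
        sum(overlap(s, other) for j, other in enumerate(sentences) if j != i)
        for i, s in enumerate(sentences)
    ]
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:n]
    return " ".join(sentences[i] for i in sorted(top))


if __name__ == "__main__":
    text = (
        "Cats chase mice. "
        "Cats chase birds and mice in the garden. "
        "The garden is nice."
    )
    print(summarize(text, n=1))
```

The middle sentence shares words with both of its neighbours, so it receives the highest score and is selected when `n=1`.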


babel-core : ^6.24.0
babel-preset-es2015 : ^6.24.0
cheerio : ^0.22.0
html-to-text : ^3.2.0
lodash : ^4.17.4
request : ^2.81.0
sbd : ^1.0.12



Related Projects

TextTeaser - Automatic Summarization Algorithm

  •    Scala

TextTeaser is an automatic summarization algorithm that combines natural language processing and machine learning to produce good results. It can provide a gist of an article and better previews in news readers.


  •    Python

This is an implementation of a sequence-to-sequence model using a bidirectional GRU encoder and a GRU decoder. This project aims to help people start working on Abstractive Short Text Summarization immediately. Hopefully, it may also work on machine translation tasks. Please check harvardnlp/sent-summary.

prose - :book: A Golang library for text processing, including tokenization, part-of-speech tagging, and named-entity extraction

  •    Go

prose is a Go library for text processing (primarily English at the moment) that supports tokenization, part-of-speech tagging, named-entity extraction, and more. The library's functionality is split into subpackages designed for modular use. See the GoDoc documentation for more information.




sumy - Module for automatic summarization of text documents and HTML pages.

  •    Python

Sumy contains a command-line utility for quick summarization of documents. You can also use sumy as a library in your project. Create file (don't name it with the code below to test it.


  •    Python

CVS-Summary is a program that generates an HTML summary of CVS activity, very similar to that provided by the popular ViewCVS script. CVS-Summary differs from ViewCVS in that it dumps the summary information to static HTML files which can be served by n

neural-summary-tensorflow - In progress

  •    Python

TensorFlow implementation of A Neural Attention Model for Abstractive Summarization. The author's original code can be found here.

text-analytics-with-python - Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer

  •    Python

Derive useful insights from your data using Python. Learn the techniques related to natural language processing and text analytics, and gain the skills to know which technique is best suited to solve a particular problem. A structured and comprehensive approach is followed in this book so that readers with little or no experience do not find themselves overwhelmed. You will start with the basics of natural language and Python and move on to advanced analytical and machine learning concepts. You will look at each technique and algorithm with both a bird's eye view to understand how it can be used as well as with a microscopic view to understand the mathematical concepts and to implement them to solve your own problems.

RLSeq2Seq - Deep Reinforcement Learning For Sequence to Sequence Models

  •    Python

NOTE: THE CODE IS UNDER DEVELOPMENT, PLEASE ALWAYS PULL THE LATEST VERSION FROM HERE. In recent years, sequence-to-sequence (seq2seq) models have been used in a variety of tasks, from machine translation, headline generation, text summarization, and speech-to-text to image caption generation. The underlying framework of all these models is usually a deep neural network that contains an encoder and a decoder. The encoder processes the input data, and the decoder receives the output of the encoder and generates the final output. Although simply using an encoder/decoder model would, most of the time, produce better results than traditional methods on the above-mentioned tasks, researchers have proposed additional improvements over these sequence-to-sequence models, such as attention-based models over the input, pointer-generation models, and self-attention models. However, all these seq2seq models suffer from two common problems: 1) exposure bias and 2) inconsistency between train/test measurement. Recently, a completely fresh point of view has emerged for solving these two problems in seq2seq models by using methods from Reinforcement Learning (RL). In this new line of research, we look at seq2seq problems from the RL point of view and try to come up with a formulation that combines the power of RL methods in decision-making with the ability of sequence-to-sequence models to retain long-term memory. In this paper, we summarize some of the most recent frameworks that combine concepts from the RL world with the deep neural network area and explain how these two areas can benefit from each other in solving complex seq2seq tasks. In the end, we provide insights on some of the problems of the current existing models and how we can improve them with better RL models. We also provide the source code for implementing most of the models discussed in this paper on the complex task of abstractive text summarization.

Effective-JAVA-Summary - Summary of the book Effective Java 2nd Edition by Joshua Bloch


This is my summary of Effective Java 2nd Edition by Joshua Bloch. I use it while learning and as a quick reference. It is not intended to be a standalone substitute for the book, so if you really want to learn the concepts presented here, buy and read the book and use this repository as a reference and guide. If you are the publisher and think this repository should not be public, just write me an email at hugomatilla [at] gmail [dot] com and I will make it private.

skimr - A frictionless, pipeable approach to dealing with summary statistics

  •    HTML

skimr provides a frictionless approach to summary statistics which conforms to the principle of least surprise, displaying summary statistics the user can skim quickly to understand their data. It handles different data types and returns a skim_df object which can be included in a pipeline or displayed nicely for the human reader. The APIs for this branch should be considered reasonably stable but still subject to change if an issue is discovered.

pytorch-summary - Model summary in PyTorch similar to `model.summary()` in Keras

  •    Python

Keras has a neat API for viewing a summary of the model, which is very helpful while debugging your network. Here is bare-bones code that tries to mimic the same in PyTorch. The aim is to provide information that complements what print(your_model) shows in PyTorch.

soft-eng-interview-prep - Everything you need to know for a Software Engineering interview


The goal of this summary is to contain all the required theoretical material needed to pass a Facebook/Google software engineering interview, but (hopefully) no more than that. It originated out of my own personal notes while preparing for such interviews. I view it as an executive summary, that should ideally take a few hours to read, and that you should read multiple times while preparing for the interview. See this post for more background and tips on preparing for an interview. Start from SUMMARY (or see below). It's also available in GitBook format for easier reading and navigation.

Reductio - Automatic summarizer text in Swift

  •    Swift

Reductio is a tool used to extract keywords and phrases using an implementation of the algorithm TextRank. Simply add Reductio as a dependency to your project's Package.swift.

Open Text Summarizer


The open text summarizer is an open source tool for summarizing texts. The program reads a text and generates a summary.

Text Precis Utility

  •    C++

A text passage that runs to several sentences will contain implicit forward and backward references. A passable summary can be made by listing the three sentences with the best balance of forward and backward references.

pytextrank - Python implementation of TextRank for text document NLP parsing and summarization

  •    Jupyter

Python implementation of TextRank, based on the Mihalcea 2004 paper. The results produced by this implementation are intended for use as feature vectors in machine learning rather than as academic paper summaries.
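To make the idea behind TextRank concrete, here is a minimal sentence-ranking sketch in plain Python. It is not pytextrank's API or implementation; the function names (`similarity`, `textrank`), the regex tokenizer, the damping factor of 0.85, and the fixed iteration count are assumptions for illustration. Sentences are graph nodes, edge weights are word overlap normalized by the log of the sentence lengths, and scores are computed with a weighted PageRank-style iteration.

```python
import math
import re


def similarity(a, b):
    # a and b are token lists. Overlap is normalized by the sum of the
    # log sentence lengths; guard against empty input and a zero denominator.
    if not a or not b:
        return 0.0
    denom = math.log(len(a)) + math.log(len(b))
    if denom == 0:
        return 0.0
    return len(set(a) & set(b)) / denom


def textrank(sentences, damping=0.85, iterations=50):
    # Build a weighted, undirected sentence graph and run a fixed number
    # of PageRank-style updates. Returns one score per sentence.
    tokens = [re.findall(r"\w+", s.lower()) for s in sentences]
    n = len(sentences)
    weights = [
        [similarity(tokens[i], tokens[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    out_sum = [sum(row) for row in weights]
    scores = [1.0] * n
    for _ in range(iterations):
        scores = [
            (1 - damping)
            + damping * sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n)
                if out_sum[j] > 0
            )
            for i in range(n)
        ]
    return scores


if __name__ == "__main__":
    sentences = [
        "Cats chase mice.",
        "Cats chase birds and mice in the garden.",
        "The garden is nice.",
        "Stocks fell sharply today.",
    ]
    for score, sentence in sorted(zip(textrank(sentences), sentences), reverse=True):
        print(f"{score:.3f}  {sentence}")
```

A sentence that shares no words with any other sentence receives only the baseline score of `1 - damping`, while a sentence connected to several others accumulates score from its neighbours.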

Conditional-PixelCNN-decoder - Tensorflow implementation of Gated Conditional Pixel Convolutional Neural Network

  •    Python

This is a TensorFlow implementation of Conditional Image Generation with PixelCNN Decoders, which introduces the Gated PixelCNN model based on the PixelCNN architecture originally described in Pixel Recurrent Neural Networks. The model can be conditioned on a latent representation of labels or images to generate images accordingly. Images can also be modelled unconditionally. It can also act as a powerful decoder and can replace deconvolution (transposed convolution) in autoencoders and GANs. A detailed summary of the paper can be found here. The gating lets the model remember context and capture more complex interactions, as in an LSTM. The network stack on the left is the vertical stack, which takes care of blind spots that occur during convolution because of the masking layer (refer to the Pixel RNN paper to learn more about masking). The use of residual connections significantly improves model performance.
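For reference, the gated activation unit at the heart of the Gated PixelCNN can be written as follows. The notation is the one commonly used for this model and is an assumption here, not taken from this repository: $x$ is the layer input, $h$ the conditioning vector, $k$ the layer index, $*$ a masked convolution, $\odot$ element-wise multiplication, and $\sigma$ the logistic sigmoid.

```latex
% Unconditional gated activation
\mathbf{y} = \tanh\left(W_{k,f} * \mathbf{x}\right) \odot \sigma\left(W_{k,g} * \mathbf{x}\right)

% Conditioned on a latent vector h (e.g. a label embedding)
\mathbf{y} = \tanh\left(W_{k,f} * \mathbf{x} + V_{k,f}^{\top}\mathbf{h}\right)
             \odot \sigma\left(W_{k,g} * \mathbf{x} + V_{k,g}^{\top}\mathbf{h}\right)
```

The $\tanh$ branch proposes feature values and the sigmoid branch acts as a gate on them, which is the multiplicative interaction that the LSTM comparison refers to.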

LintCode - :pencil2: C++ 11 Solutions of All 289 LintCode Problems

  •    C++

As of 2016-08-22, there are 289 problems on LintCode Online Judge. The number of problems has been increasing recently. Here is the classification of all 289 problems. For more problems and solutions, you can see my LeetCode repository. I'll keep updating for a full summary and better solutions. Stay tuned for updates.