Displaying 1 to 17 of 17 results

natural - general natural language facilities for node

  •    Javascript

"Natural" is a general natural language facility for nodejs. Tokenizing, stemming, classification, phonetics, tf-idf, WordNet, string similarity, and some inflections are currently supported.

pattern - Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization

  •    Python

Pattern is well documented, thoroughly tested with 350+ unit tests, and comes bundled with 50+ examples. The source code is licensed under BSD and available from http://www.clips.ua.ac.be/pages/pattern. This example trains a classifier on adjectives mined from Twitter using Python 3. First, tweets that contain the hashtag #win or #fail are collected. For example: "$20 tip off a sweet little old lady today #win". The words are then tagged by part of speech, keeping only the adjectives. Each tweet is transformed into a vector, a dictionary of adjective → count items, labeled WIN or FAIL. The classifier uses the vectors to learn which other tweets look more like WIN or more like FAIL.
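The adjective-vector idea described above can be sketched in plain Python. This is a minimal illustration that does not use Pattern's own API. The ADJECTIVES set is a hypothetical stand-in for a part-of-speech tagger, every training tweet except the quoted one is made up, and a cosine-similarity nearest-neighbour lookup is assumed here in place of whatever classifier the original example uses.

```python
from collections import Counter
from math import sqrt

# Hypothetical adjective lexicon; a real pipeline would use a POS tagger instead.
ADJECTIVES = {"sweet", "little", "old", "great", "awesome",
              "broken", "late", "terrible"}

# Illustrative tweets; only the first one is quoted from the description above.
TWEETS = [
    "$20 tip off a sweet little old lady today #win",
    "Awesome concert, great seats #win",
    "Broken phone and a late train #fail",
    "Terrible service, broken printer #fail",
]

def adjective_vector(tweet):
    """Turn a tweet into a dictionary of adjective -> count."""
    words = (w.strip(".,!?\"'").lower() for w in tweet.split())
    return Counter(w for w in words if w in ADJECTIVES)

def label_of(tweet):
    """Derive the training label from the hashtag, or None if there is none."""
    text = tweet.lower()
    if "#win" in text:
        return "WIN"
    if "#fail" in text:
        return "FAIL"
    return None

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def train(tweets):
    """Build (vector, label) pairs from tweets that carry #win or #fail."""
    return [(adjective_vector(t), label_of(t)) for t in tweets if label_of(t)]

def classify(training, tweet):
    """Return the label of the most similar training vector, or None."""
    vector = adjective_vector(tweet)
    best_label, best_score = None, 0.0
    for known_vector, label in training:
        score = cosine(vector, known_vector)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

training = train(TWEETS)
print(classify(training, "what a great day"))
print(classify(training, "my broken laptop"))
```

A tweet containing none of the known adjectives yields an empty vector, so classify returns None rather than guessing a label.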

pywsd - Python Implementations of Word Sense Disambiguation (WSD) Technologies.

  •    Python

Python Implementations of Word Sense Disambiguation (WSD) Technologies.

WordNet::SQLConverter

  •    CSharp

SQLConverter is a program which can convert the WordNet 3.0 text files into a robust and well-designed SQL database. Its aim is to allow all WordNet users to use the powerful features of the SQL language on the WordNet thesaurus. The app is developed in C# and has a WinForms GUI.




wordpos - Part-of-speech utilities for node.js based on the WordNet database.

  •    Javascript

wordpos is a set of fast part-of-speech (POS) utilities for Node.js using fast lookup in the WordNet database. Version 1.x is a major update with no direct dependence on natural's WordNet module, support for Promises, and a roughly 5x speed improvement over the previous version.

ws4j - WordNet Similarity for Java provides an API for several Semantic Relatedness/Similarity algorithms

  •    Java

This project was exported from its original Google Code location in order to publish an artifact to Maven Central. The repository has been changed to build with sbt instead of Maven. To run the tests, download http://nlpwww.nict.go.jp/wn-ja/data/1.1/wnjpn.db.gz and unzip the .db file into the config directory.

rwordnet - A pure Ruby interface to the WordNet database

  •    Ruby

This library implements a pure Ruby interface to the WordNet lexical/semantic database. Unlike existing Ruby bindings, this one doesn't require you to convert the original WordNet database into a new database format; instead it works directly on the database that comes with WordNet. If you're doing something data-intensive, you will achieve much better performance with Michael Granger's Ruby-WordNet, since it converts the WordNet database into a BerkeleyDB file for quicker access. rwordnet has a much smaller footprint, with no gem or native dependencies, and requires about a third of the disk space of Ruby-WordNet + DB. In writing rwordnet, I've focused more on usability and ease of installation (gem install rwordnet) at the expense of some performance. Use at your own risk, etc.

wordnet-db - WordNet Database files (previously WNdb)

  •    Javascript

A package to install WordNet database files for Node.js modules. Used by the wordpos project, which is based on natural's WordNet module. Note: This repository was previously named WNdb. Its new name complies with npm naming rules. Although old links will still work, it is recommended to change references to the new name.


wordnet - A Wordnet API in pure JavaScript

  •    CoffeeScript

This is an implementation of a Wordnet API in pure JavaScript. It was initially adapted from NaturalNode/natural, which had the original core implementation, but which was very basic and hard to use for higher-level tasks. This is a drop-in replacement for the Wordnet access in NaturalNode/natural, but with additional methods that make it easier to use for other tasks, and probably higher in performance too. For example, the original implementation opens file handles for more or less each individual low-level query.

wordnut - Emacs major mode interface to WordNet lexical database

  •    Emacs

Uses wn(1) to search the local WordNet database and inserts the results into the *WordNut* buffer. If you have adaptive-wrap mode installed, wordnut will automatically use it to improve the text formatting (don't forget to add (require 'adaptive-wrap) in ~/.emacs).

languagecrunch - LanguageCrunch NLP server docker image

  •    Python

sentence: The new twitter is so weird. Seriously. Why is there a new twitter? What was wrong with the old one? Fix it now.

node-wordnet-magic - tools for working with Princeton's lexical database WordNet

  •    Javascript

A node.js module for working with Princeton's WordNet lexical database for the English language. Since natural language processing (NLP) has to deal with a lot of non-digital features (comprising ambivalence, dubiety of language, not to mention irony, lies or rhetorical tricks and ruses) it always has been a great challenge, confronting the programmer with a constant need of sense disambiguation.

pygermanet - GermaNet API for Python

  •    Python

GermaNet API for Python. Copyright (c) 23 March, 2014 Will Roberts <wildwilhelm@gmail.com>.

dict-definition - Preprocessing scripts to read definitions and other information from dictionaries

  •    Python

Preprocessing scripts to read definitions and other information from dictionaries. This repository accompanies the AAAI 2017 paper "Definition Modeling: Learning to define word embeddings in natural language".

biosemble - AI assembly of biological wordnets

  •    Python

biosemble is a Python natural language processing (NLP) software program for assembling biological wordnets from structured and unstructured biological text. Structured text includes resources like biologically relevant dictionaries and encyclopedias, while unstructured text includes biologically relevant textbooks. biosemble uses part-of-speech (POS) tagging to assemble similar words across a wide array of biologically relevant dictionaries and encyclopedias.

wordnet-sqlite - A node package exposing an SQLite database of the Princeton University WordNet database

  •    Javascript

Note that installing the module will install the SQLite3 module as a dependency, which requires compilation using node-gyp, so a working toolchain is required to install this module. On requiring the module, an instance of an sqlite3 client is created and connected to the local WordNet database. This client is then returned, and can be used according to the sqlite3 API. The returned object is an instance of Database, so methods like #run and #each can be called directly on it.

gently-js - Module which returns the offensive words in a string

  •    Javascript

gently-js is a module which helps your online presence be more gentle. The module exports a function called validate({ text, strict = false, spellcheck = true }), which accepts a sentence as a string and returns whether any of its words is abusive. The strict flag indicates whether gently-js should also check the direct synonyms of each word.





