Samudra-Manthan

  •        0

Samudra Manthan uses C and MPI for finding interesting n-grams(terms) in a large corpus of data. We use the GigaWord corpus to find top m interesting n-grams using TF*IDF measure.

http://popular-terms.sourceforge.net

Tags
Implementation
License
Platform

   

comments powered by Disqus


Related Projects

Pytwitter-client - A twitter client built on urwid and python-twitter


A twitter client that aims to be as minimalistic and complete (in terms of twitter API implementation) as possible. Interface principles are largely influenced by irssi, the popular IRC client. Features currently include simply receiving updates and posting updates using plain text authentication.

Monad-tutorial - The Answer To The Ultimate Question of Monads, Programming and Everything


A tutorial essay on the monad concept. What does the word monad mean? The reader will have a concrete understanding of the concept that the term monad denotes. The reader will be able to confidentally use the term in discussion and recognise episodes where the term is being used improperly. What does the word monad not mean? This essay will address many of the popular misunderstandings around the term monad. Readers will be equipped to determine if they are observing an incident of misappropriat

Outpost13 - Outpost 13 Multiplayer RPG


Outpost 13Outpost 13 is inspired by the game Space Station 13 a popular RPG for the BYOND engine. This is a long term project of mine, Programming is being done in Java. Current News17th June 2010 - Working with some other people on Cellngine which will form the base of Outpost 13. Follow me on Twitter @Chryseus8086 for the latest news.

Tweet-detector - Query expansions for capturing and relating concepts and trends in social media


OutlineWith the rapid growth in the popularity of social media and micro-blogs, health information users are often overwhelmed by the amount of information or are unable to find the relevant information. This research project aims to make information retrieval related to Twitter tweets faster and more accurate. Our approach uses query expansion techniques to associate query terms with the most similar terms and to find the most related tweets. Our overall purpose is to develop a Twitter search e

Sip-communicator - The Java VoIP and Instant Messaging Client


SIP Communicator - the Java VoIP and Instant Messaging client. SIP Communicator is an audio/video Internet phone and instant messenger that supports some of the most popular instant messaging and telephony protocols such as SIP, Jabber, AIM/ICQ, MSN, Yahoo! Messenger, Bonjour, IRC, RSS and soon others like IAX. SIP Communicator is completely Open Source / Free Software, and is freely available under the terms of the GNU Lesser General Public License.

Raidattendancemanager - Easy to use WoW raid attendance management system


The goal is to create an easy to use attendance management programm with general focus on providing overview and analysis of raid-attendance of individual persons in the popular MMO RPG World of Warcraft (tm). Frequent use of the programm will allow to decide whether a member is a reliable person in terms of raid presence and put them in different categories (raider, member, newcomer). It will also be capable of providing raid-specific details like exchanges or substitutes.

Rsbot-client-german - RSBot - the open source modified game client.


RSBot is an immensely popular and successful bot for a Java MMORPG. The program source code is completely open and free - whether you're a Java expert or just keen to explore, the bot is a valuable learning resource. Legal note: the entire source code is freely available under the GPLv3 terms. Contrary to bogus and automated DMCA claims, no content is under the copyright of Jagex Ltd. The code is provided purely for educational purposes by the authors as freedom of expression.

Fast4j - Fast & Agile Service Tools for Java


Welcome to the fast4j projectCopyright (c) 2007 Alexandre ROMAN <alexandre.roman@gmail.com> The goal of the fast4j project is to provide tools to ease business services development. Being based on common libraries such as Spring, Hibernate or SLF4J, fast4j enables you to quickly create SOA applications, ready to use and deploy on popular JEE containers (JOnAS, JBoss, GlassFish). As fast4j is an open source project (published under the terms of the GNU LGPL), this project will only use open sourc

Wiklite - A very tiny and lightweight wiki script based in PHP and MySQL/Sqlite


Wiklite is a fork of MicroWiki by Owen Winkler, as it follows some of its ideas and paradigms. Its code, however, is slowly being rewritten to meet today's standards of quality. Some of Wiklite features are: Extensible through plugins Produces XHTML 1.0 Strict valid markup. Customizable style and templates. Integrates with Wordpress Some of Wiklite goals are: To provide integration with as many popular scripts as possible. To provide Javascript framework independence. Remain safe, lightweight bu

Skbot-client - SKBot - the open source modified game client


SKBot is an immensely popular and successful bot for a Java MMORPG. The program source code is completely open and free - whether you're a Java expert or just keen to explore, the bot is a valuable learning resource. Legal note: the entire source code is freely available under the GPLv3 terms. Contrary to bogus and automated DMCA claims, no content is under the copyright of Jagex Ltd. The code is provided purely for educational purposes by the authors as freedom of expression. Terms and Conditio