UIMA Framework

  •        0

The Unstructured Information Management Architecture (UIMA) Framework is an open, scalable and extensible platform for building analytic solutions that process unstructured information to find latent meaning, relationships and relevant facts.




Related Projects

UIMA - Unstructured information management architecture

UIMA analyzes large volumes of unstructured information in order to discover knowledge that is relevant to an end user. It is a framework with different set of components. The components include Language Identification, Language specific segmentation, Sentence boundary detection, Entity detection (person/place names) etc. The framework manages these components and the data flows between them.

Zanzibar - UIMA library to make regular expression over UIMA Annotation

Zanzibar is a regular expression engine over feature structures based on the UIMA framework, developed at the University of Rome “Tor Vergata�. At this moment its main purpose is to offer a way to match a sequence of UIMA Annotation and then aggregate them in some way that user can specify. Actually Zanzibar allow to specify rules in a user-friendly language to match a particular sequence.

Uimafit - uimaFIT is a library that provides factories, injection, and testing utilities for UIMA

version 1.4.0 is tentatively scheduled for release by mid 2012 (Roadmap) version 1.3.1 is released! (Release Notes, Changes) version 1.3.0 is released! (Release Notes, Changes) To see how uimaFIT works please see the getting started page, conceptual overview, or the documentation. uimaFIT requires Java 1.5 or higher and UIMA 2.4.0 or higher. Configuring UIMA components is generally achieved by creating XML descriptor files which tell the framework at runtime how components should be instantiat

Uima-type-mapper - Tackling two issues: mapping UIMA objects (annotations from distinct type systems

OverviewDespite the current url, this project is also named uima-annotation-mapper, uima-annotation-pattern-mapper, and lately uima-mapper. This project offers a software solution to handle two issues the users of the Apache UIMA architecture are faced. Analysis is one of the major NLP tasks. A semantic rule-based analyser should enable to create (we say also mapping), update or delete annotations according to rules expressed over other annotation patterns. One of the major issues dealing with a

Cleartk - ClearTK is a toolkit for developing statistical natural language processing components in

ClearTKClearTK provides a framework for developing statistical natural language processing (NLP) components in Java and is built on top of Apache UIMA. It is developed by the Center for Computational Language and Education Research (CLEAR) at the University of Colorado at Boulder. Please see the conceptual overview for a broad introduction to ClearTK. FeaturesA common interface and wrappers for popular machine learning libraries such as SVMlight, LIBSVM, OpenNLP MaxEnt, and Mallet. A rich featur

Uima-shell - This UIMA Analysis Engine offers a way to process Shell command over a CAS element, vie

OVERVIEWThis UIMA Analysis Engine (AE) allows you to run Shell command over a Common Analysis Structure (CAS) element (view or annotation) and to store the result either as a new view or a new annotation. It mainly aims at running within a UIMA aggregate processing chain some external tools available via command line. These tools should perform their processing by taking the input as a file name parameter or a standard input ( stdin ) and produce the result via the standard output ( stdout ). Wi

Mavericarc - The Automated Retrieval Console - An open source clinical information retrieval tool

ARC 2.0 Installer with LVG and UMLS pre-bundledValidate your UMLS license at http://nlp-ecosystem.ucsd.edu/vm-download.html Once verified download http://nlp-ecosystem.ucsd.edu/nlm-auth/MAVERIC/ARC-2.0-installer.jar md5sum: 6e09e91e859abb35ce64ff5f7618110d 332 MB If you are not set to launch JARs on double-click, run by going to a command line and typing: java -jar ARC-2.0-installer.jar Reduce custom software and rules development ARC just needs to be provided examples of the documents or concep

Graph-expression - High level automaton library for information extraction

GExpHigh level automaton library for information extraction. Usage: Named Entity Recognition(NER) patterns Optimal match finding (for ambiguous grammars) Relation and fact extraction Structure parsing (like document structure, sententce parsing) Search problem solving (like Suduku) GExp Features:all standard reg exp operators: +, {n,n}, {n,}, .... much more: java predicates on groups, reluctant, greedy, cut operator, inner match... structure of match -it is possible to build syntax tree based on

Ne-ushi - Official page for NeUshi or "�丑", API of Natural Language Processors using A

AbstractNeUshi is an programming interface specification and its implementation which enables programmers to handle easily the output of Morphological Analysis(MA) modules. Using MA modules through NeUshi, a programmer receives the module's process result encapsulated in Apache UIMA objects, which enables to get the elements contained in the result just by calling functions that correspond. NeUshi will contribute to reduce the cost of time to develop applications since a programmer doesn't need

Ytex - Yale cTAKES Extensions

IntroductionYTEX is: Clinical NLP: A robust, general-purpose clinical natural language processing system based on the Clinical Text Analysis and Knowledge Extraction System (cTAKES). Semantic Similarity: YTEX provides a framework for computing the similarity between pairs of concepts; this is integrated with clinical NLP, Data Mining, and Feature Engineering tools. Data Mining: YTEX provides tools to export UIMA annotations to machine learning packages, including Weka, R, Matlab, SAS, Libsvm, SV