
Pan Gu Segment is a library that can segment Chinese and English words from a sentence. Its author, eaglet, previously developed the KTDictSeg word segmentation component.





Related Projects

Insight Segmentation and Registration Toolkit

ITK is an open-source, cross-platform system that provides developers with an extensive suite of software tools for image analysis. Developed through extreme programming methodologies, ITK employs leading-edge algorithms for registering and segmenting multidimensional data.

Python - Programming Language

Python is a remarkably powerful dynamic programming language that is used in a wide variety of application domains. It is embeddable within applications as a scripting interface. Build flexible data-driven code using Python's powerful and dynamic introspection capabilities and advanced language features such as meta-classes, duck typing and decorators.

Ruby - Programming Language

A dynamic, open source programming language with a focus on simplicity and productivity. It has an elegant syntax that is natural to read and easy to write. Its syntax is inspired by Perl, with Smalltalk-like features, and it was also influenced by Eiffel and Lisp. It is a scripting language that is more powerful than Perl, and more object-oriented than Python.

R Language - Project for Statistical Computing

R is a language and environment for statistical computing and graphics. It is a GNU project which is similar to the S language. R provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, ...) and graphical techniques, and is highly extensible.

Ghostscript - Document Rendering and Conversion

Ghostscript is a rendering and conversion engine for page description languages, including PostScript and PDF. It can convert PostScript language files to many raster formats, view them on displays, and print them on printers that don't have PostScript language capability built in.

ANTLR - ANother Tool for Language Recognition

ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files. It's widely used to build languages, tools, and frameworks. From a grammar, ANTLR generates a parser that can build and walk parse trees. Twitter search uses ANTLR for query parsing, with over 2 billion queries a day.

GNU Octave - Language for Numerical Computations

GNU Octave is a high-level interpreted language, primarily intended for numerical computations. It provides capabilities for the numerical solution of linear and nonlinear problems, and for performing other numerical experiments. It also provides extensive graphics capabilities for data visualization and manipulation. Octave Forge is the place where its packages are developed, covering areas from bioinformatics and fuzzy logic to mechanics and instrument control.

Moodle - Course Management System

Moodle is a Course Management System (CMS) that can be used to create an effective online learning portal. Each Moodle site can support thousands of courses. Courses can be categorised and searched. A range of course activities is supported, including Forums, Quizzes, Glossaries, Resources, Choices, Surveys, Assignments, Chats and Workshops. It supports all major database systems.


Avro is a data serialization system. It is a subproject of Apache Hadoop.

zlib - A Massively Spiffy Yet Delicately Unobtrusive Compression Library

zlib is a general purpose data compression library. All the code is thread safe. It has been ported to other programming languages such as Java, C#, Python and Perl.
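
As a rough illustration of what using zlib from one of those languages can look like, the sketch below compresses and restores a byte string with the `zlib` module from Python's standard library. The helper name `roundtrip` and the `sample` data are hypothetical, chosen only for this example.

```python
import zlib


def roundtrip(data, level=6):
    """Compress bytes with zlib, then decompress them again.

    Returns a (compressed, restored) pair so the caller can compare sizes
    and check that nothing was lost.
    """
    compressed = zlib.compress(data, level)  # level ranges from 0 (none) to 9 (best)
    restored = zlib.decompress(compressed)
    return compressed, restored


# Hypothetical sample input: repetitive text compresses well.
sample = b"zlib is a general purpose data compression library. " * 20
compressed, restored = roundtrip(sample)

print("original bytes:  ", len(sample))
print("compressed bytes:", len(compressed))
print("lossless:        ", restored == sample)
```

Repetitive input like this shrinks considerably, while already-compressed or random data may not shrink at all.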