Julius - Large Vocabulary CSR Engine


"Julius" is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) decoder for speech researchers and developers. Based on word N-grams and context-dependent HMMs, it can decode in near real time on most current PCs in a 60k-word dictation task.

It fully incorporates major search techniques such as tree lexicon, N-gram factoring, cross-word context dependency handling, enveloped beam search, Gaussian pruning, and Gaussian selection. It supports Windows SAPI.
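
As a rough illustration of the beam search idea named in the description, the following Python sketch runs a toy Viterbi search and, after each frame, drops hypotheses whose log score falls too far below the best one. It is not Julius's implementation: the function names (`beam_prune`, `viterbi_beam`), the dictionary-based model tables, and the fixed `beam_width` are all assumptions made for this example, and a real decoder works over a tree lexicon with far more elaborate pruning.

```python
import math


def beam_prune(hypotheses, beam_width):
    """Keep only hypotheses whose log score is within beam_width of the best."""
    if not hypotheses:
        return {}
    best = max(hypotheses.values())
    return {state: score for state, score in hypotheses.items()
            if score >= best - beam_width}


def viterbi_beam(observations, states, log_start, log_trans, log_emit, beam_width):
    """Toy Viterbi search that prunes the active state set after every frame.

    All tables are hypothetical nested dicts of log probabilities:
    log_start[state], log_trans[prev][next], log_emit[state][observation].
    Returns the best final state and the surviving hypotheses.
    """
    # Score the first frame for every state, then prune.
    active = beam_prune(
        {s: log_start[s] + log_emit[s][observations[0]] for s in states},
        beam_width,
    )
    for obs in observations[1:]:
        candidates = {}
        # Only states that survived pruning are expanded.
        for prev, prev_score in active.items():
            for nxt in states:
                score = prev_score + log_trans[prev][nxt] + log_emit[nxt][obs]
                if score > candidates.get(nxt, -math.inf):
                    candidates[nxt] = score
        active = beam_prune(candidates, beam_width)
    return max(active, key=active.get), active
```

A narrow beam keeps the active set small and the search fast, at the risk of pruning the path that would have won in the end. A wide beam approaches the exhaustive Viterbi search.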




Related Projects

Simon - Speech Recognition and Dictation System

Simon is an open source speech recognition program that can replace your mouse and keyboard. The system is designed to be as flexible as possible and will work with any language or dialect. It is a real dictation system.


Voice Enabled Interactive Learning

Made by Ajay Kumar. This project has three parts: (1) Speech Synthesis, which takes text as input and produces voice as output; you can open any text file or doc file and it will read it to you. (2) Speech Recognition, which takes speech as input and produces text as output; whatever you speak is printed on the screen. (3) Speech Analysis, which has two parts: a. waveform creati

pyjulius - Python interface to Julius speech recognition engine


bots - Android Speech Recognition and Text-To-Speech - How to build a voice controlled assistant.


Voxx Speech Recognition Project

Written in VB 6 for Win98 and up. Our goal is to provide speech recognition and text-to-speech unlike any software currently on the market. Features include TTS and dictation using Microsoft SAPI 5.1 engines. Visit our Home Page for more info.

The "SHoUT" speech recognition toolkit

SHoUT is a toolkit for performing research on large vocabulary continuous speech recognition (LVCSR). The toolkit contains applications for training statistical models and for speech/non-speech detection, speaker diarization and decoding.

Google Speech Recognition Example

Google Speech Recognition contains a working example of an application that uses the Google speech recognition API. The app contains all the DLLs needed to record, decode, and send your voice request to the Google service and receive a text representation of what you've said. It's developed i...

Speech Saver - A utility to automatically turn off Vista's Speech Recognition

A very simple VB.Net system tray utility that automatically turns off Vista's Speech Recognition after a user-determined number of minutes with no voice activity.

HTK - Speech Recognition Toolkit

The Hidden Markov Model Toolkit (HTK) is a portable toolkit for building and manipulating hidden Markov models. HTK is primarily used for speech recognition research although it has been used for numerous other applications including research into speech synthesis, character recognition and DNA sequencing. HTK is in use at hundreds of sites worldwide.

CMU Sphinx - Toolkit For Speech Recognition

The CMU Sphinx toolkit is a speech recognition toolkit with various tools used to build speech applications. It has a number of packages for different tasks: Pocketsphinx (lightweight recognizer library written in C), Sphinxbase (support library required by Pocketsphinx), Sphinx4 (adjustable, modifiable recognizer written in Java), CMUclmtk (language model tools), Sphinxtrain (acoustic model training tools), and Sphinx3 (decoder for speech recognition research written in C).

Festvox - Builds New Synthetic Voices

The Festvox project aims to make the building of new synthetic voices more systemic and better documented, making it possible for anyone to build a new voice. Festvox is the base for most of the Speech Synthesis libraries.

Voice Conference Manager

Voice Conference Manager (VCM) uses VoiceXML and CCXML to control speech recognition, text-to-speech, and voice biometrics for a telephone conference service. Say the names or numbers of people and VCM places them into the call. It can be hosted on public servers.

SpeakRight Framework - Helps to build Speech Recognition Applications

SpeakRight is a Java framework for writing speech recognition applications in VoiceXML. Dynamic generation of VoiceXML is done using the popular StringTemplate templating framework. Although VoiceXML uses a web architecture similar to HTML's, the needs of a speech app are very different. SpeakRight lives in the application code layer, typically in a servlet. The SpeakRight runtime dynamically generates VoiceXML pages, one per HTTP request.
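
To give a feel for what generating one VoiceXML page per request from a template might look like, here is a minimal Python sketch. It does not use SpeakRight or StringTemplate: Python's standard `string.Template` is swapped in for the templating step, and the page layout, the `render_page` function, and its parameters are assumptions made for illustration only.

```python
from string import Template
from xml.sax.saxutils import escape

# Hypothetical page template: a single form that speaks one prompt.
VXML_PAGE = Template("""<?xml version="1.0" encoding="UTF-8"?>
<vxml version="2.0" xmlns="http://www.w3.org/2001/vxml">
  <form id="$form_id">
    <block>
      <prompt>$prompt</prompt>
    </block>
  </form>
</vxml>
""")


def render_page(form_id, prompt):
    """Fill the template for one request, escaping XML special characters."""
    return VXML_PAGE.substitute(
        form_id=escape(form_id, {'"': "&quot;"}),  # also escape quotes in the attribute
        prompt=escape(prompt),
    )


if __name__ == "__main__":
    # In a web application this string would be the body of the HTTP response.
    print(render_page("greeting", "Welcome. Please say a name."))
```

Escaping the dynamic values before substitution keeps the generated page well-formed even when a prompt contains characters such as `&` or `<`.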

Flite - Fast Run time Synthesis Engine

Flite (festival-lite) is a small, fast run-time synthesis engine developed at CMU and primarily designed for small embedded machines and/or large servers. Flite is designed as an alternative synthesis engine to Festival for voices built using the FestVox suite of voice building tools.

dictation-api - Node.JS WAV-Text dictation using the Google Speech API



Tools for viewing data charts and preparing voice patterns for a speech recognition engine

Kinect-Voice - Based off the first set of speech recognition from Microsoft.


Concrete Voice - Complete Text to Speech System

Concrete Voice is a text-to-speech solution using Microsoft text-to-speech technologies. I started this project because I could not find a quality text-to-speech program to use. The commercial products are embarrassing to think they would ask money for something I would not ev...

Kaldi - Speech Recognition Toolkit

Kaldi is a speech recognition research toolkit. It is similar in aims and scope to HTK. The goal is to have modern and flexible code, written in C++, that is easy to modify and extend.