SpeakRight Framework - Helps to build Speech Recognition Applications

  •        6754

SpeakRight is an Java framework for writing speech recognition applications in VoiceXML. Dynamic generation of VoiceXML is done using the popular StringTemplate templating framework. Although VoiceXML uses a similar web architecture as HTML, the needs of a speech app are very different. SpeakRight lives in application code layer, typically in a servlet. The SpeakRight runtime dynamically generates VoiceXML pages, one per HTTP request.

Applications are written in Java using SpeakRight's extensible classes.

http://speakrightframework.blogspot.com/

Tags
Implementation
License
Platform

   




Related Projects

VoiceEnabledInteractiveLearning-VoiceEnabledInteractiveLearning


Project Name: VOICE ENABLED INTERACTIVE LEARNING MADE BY: AJAY KUMAR PROJECT DESCRIPTION: This project includes 3 parts:- 1:Speech Synthesis 2:Speech Recognition 3:Speech Analysis 1: Speech Synthesis: In this part it takes text as input and voice as output.You can open any text file or doc file,it will read for u. 2: Speech Recognition: In this part it takes speech as input and text as an ouput.Whatever you speak it will print on the screen. 3.Speech Analysis: It has two parts a. waveform creati

Festival - Speech Synthesis System


Festival offers a general framework for building speech synthesis systems as well as including examples of various modules. It offers full text to speech through a APIs via shell and though a Scheme command interpreter. It has native support for Apple OS. It supports English and Spanish languages.

CMU Sphinx - Toolkit For Speech Recognition


CMUSphinx toolkit is a speech recognition toolkit with various tools used to build speech applications. CMU Sphinx toolkit has a number of packages for different tasks. Pocketsphinx — lightweight recognizer library written in C, Sphinxbase — support library required by Pocketsphinx, Sphinx4 — adjustable, modifiable recognizer written in Java, CMUclmtk — language model tools, Sphinxtrain — acoustic model training tools, Sphinx3 — decoder for speech recognition research written in C.

HTK - Speech Recognition Toolkit


The Hidden Markov Model Toolkit (HTK) is a portable toolkit for building and manipulating hidden Markov models. HTK is primarily used for speech recognition research although it has been used for numerous other applications including research into speech synthesis, character recognition and DNA sequencing. HTK is in use at hundreds of sites worldwide.

FreeTTS - Speech Synthesizer in Java


FreeTTS is a speech synthesis system written entirely in the Java. It is based upon Flite, a small run-time speech synthesis engine developed at Carnegie Mellon University. Flite is derived from the Festival Speech Synthesis System from the University of Edinburgh and the FestVox project from Carnegie Mellon University. FreeTTS supports a subset of the JSAPI 1.0 java speech synthesis specification.



MARY - Text-to-Speech System


MARY is an open-source, multilingual Text-to-Speech Synthesis platform written in Java. It supports German, British and American English, Telugu, Turkish, and Russian.

Speech Server .NET


Speech Server .NET aims to add functionalities of Text-To-Speech (TTS) and Automatic Speech Recnognition (ASR) to handheld devices like Pocket PC and Smartphone, running Windows Mobile, that are wirelessly connected to a server. This server is able to generate a speech stream ...

bots - Android Speech Recognition and Text-To-Speech - How to build a voice controlled assistant.


Android Speech Recognition and Text-To-Speech - How to build a voice controlled assistant.

Voice Conference Manager


Voice Conference Manager uses VoiceXML and CCXML to control speech recognition, text to speech, and voice biometrics for a telephone conference service. Say the names or numbers of people and VCM places them into the call. Can be hosted on public servers

eSpeak - Text to Speech


eSpeak is a compact open source software speech synthesizer for English and other languages. eSpeak uses a formant synthesis method. This allows many languages to be provided in a small size. It supports SAPI5 version for Windows, so it can be used with screen-readers and other programs that support the Windows SAPI5 interface. It can translate text into phoneme codes, so it could be adapted as a front end for another speech synthesis engine.

Kaldi - Speech Recognition Toolkit


Kaldi is a Speech recognition research toolkit. It is similar in aims and scope to HTK. The goal is to have modern and flexible code, written in C++, that is easy to modify and extend.

SpeakEasy - Kinect Speech Recognition Framework


Kinect Speech Recognition Framework

Modular Audio Recognition Framework


MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.

DeepSpeech - A TensorFlow implementation of Baidu's DeepSpeech architecture


Project DeepSpeech is an open source Speech-To-Text engine. It uses a model trained by machine learning techniques, based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow project to make the implementation easier.Pre-built binaries can be found on TaskCluster. You'll need to download native_client.tar.xz and the appropriate Python wheel package.

Google Speech Recognition Example


Google Speech Recognition contains a working example of application that uses google speech recognition API. App contains all necessary dlls to record, decode and send your voice request to google service and recieve a text representation of what you've said. It's developed i...

Speech - Speech to text and text to speech via google speech api


Speech to text and text to speech via google speech api

Festvox - Builds New Synthetic Voices


The Festvox project aims to make the building of new synthetic voices more systemic and better documented, making it possible for anyone to build a new voice. Festvox is the base for most of the Speech Synthesis libraries.

Speect - Multilingual text-to-speech (TTS) system


Speect is a multilingual text-to-speech (TTS) system. It offers a full TTS system (text analysis which decodes the text, and speech synthesis, which encodes the speech) with various API’s, as well as an environment for research and development of TTS systems and voices.

Voxx Speech Recognition Project


Written in VB 6 for Win98 and up. Our goal is to provide speech recognition and text to speech unlike any software currently in the market. Some features include TTS, Dictation using Microsoft SAPI 5.1 engines. Visit our Home Page for more info.