MARY is an open-source, multilingual Text-to-Speech Synthesis platform written in Java. It supports German, British and American English, Telugu, Turkish, and Russian.
text-to-speech speech-recognition speech
See http://gtts.readthedocs.org/ for documentation and examples.
speech tts text-to-speech gtts
Node.js client library to use the Watson APIs. The examples folder has basic and advanced examples. The examples within each service assume that you already have service credentials.
ibm-watson-services language-translation conversation-service watson tone-analyzer natural-language visual-recognition personality-insights typescript conversation dialog discovery ibm natural-language-classifier natural-language-understanding speech-to-text text-to-speech tone_analyzer watson-developer-cloud wdc
Leon is an open-source personal assistant who can live on your server. He does things when you ask him to. You can talk to him and he can talk to you. You can also text him and he can text you. If you want, Leon can communicate with you offline to protect your privacy.
personal-assistant artificial-intelligence speech-to-text text-to-speech speech-recognition speech-synthesis deepspeech flite
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.
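To make the idea of a synchronization map concrete, here is a minimal sketch in plain Python. It does not use the aeneas API: the `Fragment` class, the helper functions, and the timings are all hypothetical and hand-written for illustration. It only shows what the output of forced alignment looks like (each text fragment paired with a begin and end time) and how such a map could be rendered as SRT subtitles.

```python
from dataclasses import dataclass


@dataclass
class Fragment:
    """One entry of a sync map: a text fragment and its time span in seconds."""
    begin: float
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(sync_map: list[Fragment]) -> str:
    """Render a sync map as SRT text: numbered blocks separated by blank lines."""
    blocks = []
    for index, frag in enumerate(sync_map, start=1):
        span = f"{srt_timestamp(frag.begin)} --> {srt_timestamp(frag.end)}"
        blocks.append(f"{index}\n{span}\n{frag.text}\n")
    return "\n".join(blocks)


# Hypothetical timings written by hand; a real aligner would compute them
# from the audio file and the list of text fragments.
sync_map = [
    Fragment(0.0, 2.5, "First fragment of the narrated text."),
    Fragment(2.5, 5.12, "Second fragment of the narrated text."),
]
print(to_srt(sync_map))
```

The point of the sketch is the data shape: once every fragment has a time span, exporting to subtitle or other timed-text formats is a formatting step, while computing the spans is the actual alignment problem.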
speech alignment tts nlp espeak espeak-ng festival cli dtw ffmpeg forced-alignment text audio srt smil text-to-speech
Coqui TTS is a library for advanced Text-to-Speech generation. It is built on the latest research and designed to achieve the best trade-off among ease of training, speed, and quality. Coqui TTS comes with pretrained models and tools for measuring dataset quality, and is already used in 20+ languages for products and research projects.
text-to-speech deep-learning speech pytorch tts vocoder tacotron speaker-encodings tensorflow2 melgan speaker-encoder melgan-stft multi-speaker-tts glow-tts hifigan align-tts tts-model
🤪 TensorFlowTTS provides real-time state-of-the-art speech synthesis architectures such as Tacotron-2, Melgan, Multiband-Melgan, FastSpeech, and FastSpeech2, based on TensorFlow 2. With TensorFlow 2, training and inference can be sped up and further optimized with fake-quantization-aware training and pruning, so that TTS models run faster than real time and can be deployed on mobile devices or embedded systems. Other TensorFlow versions should work but have not been tested yet. This repo will try to work with the latest stable TensorFlow version. We recommend installing TensorFlow 2.3.0 for training if you want to use multiple GPUs.
text-to-speech real-time tts speech-synthesis vocoder tflite voice-cloning tensorflow2 fastspeech tacotron2 melgan multi-speaker-tts multiband-melgan fastspeech2 parallel-wavegan mobile-tts zh-tts chinese-tts korea-tts german-tts
The Festvox project aims to make the building of new synthetic voices more systematic and better documented, making it possible for anyone to build a new voice. Festvox is the base for most of the speech synthesis libraries.
text-to-speech speech-recognition speech
FreeTTS is a speech synthesis system written entirely in Java. It is based upon Flite, a small run-time speech synthesis engine developed at Carnegie Mellon University. Flite is derived from the Festival Speech Synthesis System from the University of Edinburgh and the FestVox project from Carnegie Mellon University. FreeTTS supports a subset of the JSAPI 1.0 Java speech synthesis specification.
text-to-speech speech-recognition speech
Festival offers a general framework for building speech synthesis systems, and includes examples of various modules. It offers full text-to-speech through several APIs, including from the shell and through a Scheme command interpreter. It has native support for Apple OS. It supports English and Spanish.
text-to-speech speech-recognition speech
SpeakRight is a Java framework for writing speech recognition applications in VoiceXML. Dynamic generation of VoiceXML is done using the popular StringTemplate templating framework. Although VoiceXML uses a web architecture similar to that of HTML, the needs of a speech app are very different. SpeakRight lives in the application code layer, typically in a servlet. The SpeakRight runtime dynamically generates VoiceXML pages, one per HTTP request.
text-to-speech speech-recognition speech voicexml java-framework framework
This repository contains the Neural Network (NN) based Speech Synthesis System developed at the Centre for Speech Technology Research (CSTR), University of Edinburgh. Merlin is a toolkit for building Deep Neural Network models for statistical parametric speech synthesis. It must be used in combination with a front-end text processor (e.g., Festival) and a vocoder (e.g., STRAIGHT or WORLD).
merlin speech-synthesis text-to-speech voice-conversion deep-learning theano tensorflow keras neural-networks
Kaldi is a speech recognition research toolkit. It is similar in aims and scope to HTK. The goal is to have modern and flexible code, written in C++, that is easy to modify and extend.
speech-recognition text-to-speech
Quickly add voice to your app with the Alan Platform. Create an in-app voice assistant to enable human-like conversations and provide a personalized voice experience for every user. It includes a powerful web-based IDE where you can write, test, and debug dialog scenarios for your voice assistant or chatbot.
machine-learning text-to-speech sdk chatbot voice voice-commands speech-recognition flutter voice-control voice-assistant conversational-ai voice-ai alan-voice alan-sdk alan-studio
Quickly add voice to your app with the Alan Platform. Create an in-app voice assistant to enable human-like conversations and provide a personalized voice experience for every user. It includes a powerful web-based IDE where you can write, test, and debug dialog scenarios for your voice assistant or chatbot.
ios machine-learning text-to-speech sdk chatbot voice voice-commands speech-recognition voice-control voice-assistant conversational-ai voice-ai alan-voice alan-studio alan-ios-sdk
Quickly add voice to your app with the Alan Platform. Create an in-app voice assistant to enable human-like conversations and provide a personalized voice experience for every user. It includes a powerful web-based IDE where you can write, test, and debug dialog scenarios for your voice assistant or chatbot.
machine-learning text-to-speech sdk chatbot voice voice-commands speech-recognition voice-control voice-assistant conversational-ai voice-ai alan-voice alan-ai alan-studio alan-web-sdk alan-sdk-web ai conversational nlp natural-language dialogs dialog-flow enterprise-ai voice-development add-voice-to-your-app voice-interface
Speect is a multilingual text-to-speech (TTS) system. It offers a full TTS system (text analysis, which decodes the text, and speech synthesis, which encodes the speech) with various APIs, as well as an environment for research and development of TTS systems and voices.
text-to-speech text analysis speech
Flite (festival-lite) is a small, fast run-time synthesis engine developed at CMU and primarily designed for small embedded machines and/or large servers. Flite is designed as an alternative synthesis engine to Festival for voices built using the FestVox suite of voice building tools.
text-to-speech speech-recognition speech
eSpeak is a compact open source software speech synthesizer for English and other languages. eSpeak uses a formant synthesis method, which allows many languages to be provided in a small size. A SAPI5 version is available for Windows, so it can be used with screen readers and other programs that support the Windows SAPI5 interface. It can translate text into phoneme codes, so it could be adapted as a front end for another speech synthesis engine.
text-to-speech speech-recognition speech ssml
Google has since launched its official Google Cloud Speech API. I strongly recommend looking over there. Output: json; xml is not supported.
audio text-to-speech