Speech recognition using google's tensorflow deep learning framework, sequence-to-sequence neural networks. Replaces caffe-speech-recognition, see there for some background.
tensorflow speech-recognition neural-network deep-learning stt speech-to-textThe deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
deep-learning tensorflow automatic-speech-recognition speech-to-text stt asr coqui-aiSonus lets you quickly and easily add a VUI (Voice User Interface) to any hardware or software project. Just like Alexa, Google Now, and Siri, Sonus is always listening offline for a customizable hotword. Once that hotword is detected your speech is streamed to the cloud recognition service of your choice - then you get the results. Generally, running npm install should suffice. This module however, requires you to install SoX.
speech speech-recognition speech-to-text voice-control stt node hotword-detection keyword-spotting alexa voice-recognition keyword spotting hotword detection voiceJavaScript modules for Mozilla's cloud speech recognition API
mozilla speech speech-recognition sttSpeech recognition in node and the browser using Electron.It seems that Google has shut down the Chrome Speech API for use in shell environments like Electron, which electron-speech relies on.
web speech electron recognition sttNode.js module for SpeakToMe, Mozilla's Speech-to-text REST API. Supports recording of audio on local system, encoding and sending the recording to Mozilla's service for processing, and retrieval of results.
mozilla speech speech-recognition sttThis app was an initial prototype to test the quality of IBM STT, and is no longer activly supported I am now working on a more full fledge version at https://github.com/OpenNewsLabs/autoEdit_2 ( http://www.autoedit.io ). If you clone the repo you can start the app with npm start.
nwjs ibm watson speech to text sttlibdvbtee is a stream parser and service information aggregator library for MPEG2 transport streams. The library includes a program service information (PSI) parser and support for various network streaming methods and is aware of the linux-dvb kernel API as well as HDHomeRun network streaming APIs. The library contains enough functionality to power a full featured television middleware application, including the ability to acquire and stream data through UDP, TCP, HTTP, DMA and various other mechanisms.
network-streams udp dvb mpegts atsc psip streaming tv tv-apps parser transport-stream mpeg2 linuxtv hdhomerun dvb-t network-stream parse pat pmt eit dvb-psi dvbt psi m2ts ts transport stream dtv digital television sdt vct mgt nit ett stt tdt tot epg table tables descriptor descriptorsAny standard MPEG2TS stream is supported, with additional specific support for broadcast television transport streams containing PSIP tables and descriptors. These tables and descriptors contain information about the stream, such as broadcast info, program info, and electronic program guide (EPG).
atsc dvb dvb-psi mpegts parser psip transport-stream tv pat pmt eit vct sdt mgt epg tables descriptors psip-tables m2ts dvbt dvb-t psi mpeg2 ts transport stream dtv digital television nit ett stt tdt tot table descriptorRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
stt openstt voice nlu nlp nlp-machine-learning speech-recognition speech-to-text speech-processingFired whenever there is an error related to speech recognition. Fired when the user agent has started to capture audio.
choo stt text-to-speech webA Microsoft Bing Speech API client written in node.js. Official documentation for Bing Speech API service.
stt tts text-to-speech speech-to-text bing-speech bing speech microsoftVoice overlay helps you turn your user's voice into text, providing a polished UX while handling for you the necessary permission. See it implemented in the demo app.
voice overlay input speech-to-text stt voice-recognition speech-recognition voice-assistant conversation conversational-ui conversational-interface conversational-bots chatbots permission permissions permissions-android instant-search instantsearch search androidAn application to make it faster, easier and more accessible to edit audio and video interviews using automatically generated transcriptions form STT service. See intro and slides for more info on the project and user journey for a high level overview of the user journey.
audio video interviews transcript speech-to-text video-editing stt video-editing-software bbc-news-labs news-labs transcript-editor newslabs paper-edit paperedit digital-paper-edit paper-editingRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
nlp nlu voice speech-recognition speech-to-text stt speech-processing nlp-machine-learning openstt
We have large collection of open source products. Follow the tags from
Tag Cloud >>
Open source products are scattered around the web. Please provide information
about the open source projects you own / you use.
Add Projects.