119 Open Source Asr Software Projects
Free and open source asr code projects including engines, APIs, generators, and tools.
Pytorch Kaldi 1793 ⭐
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Vosk API 770 ⭐
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Speech Transformer 489 ⭐
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Tensorflow_end2end_speech_recognition 294 ⭐
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Athena Team Athena 382 ⭐
an open-source implementation of sequence-to-sequence based speech processing engine
Self Supervised Speech Pretraining And Representation Learning 243 ⭐
The S3PRL speech toolkit: self-supervised pre-training and representation learning of Mockingjay, TERA, A-ALBERT, APC, and more to come. With easy-to-use standard downstream evaluation scripts including phone classification, speaker recognition, and ASR. (All in Pytorch!)
Asr Evaluation 170 ⭐
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Vosk Server 189 ⭐
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Mrcp Plugin With Freeswitch 151 ⭐
Py Kaldi Asr 146 ⭐
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Online Speech Recognition 151 ⭐
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Listen Attend Spell 127 ⭐
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Rnn Transducer 108 ⭐
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Asr_audio_data_links 104 ⭐
A list of publically available audio data that anyone can download for ASR or other speech activities
Ctc Asr 101 ⭐
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
At16k 99 ⭐
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Ktspeechcrawler 81 ⭐
Automatically constructing corpus for automatic speech recognition from YouTube videos
Learnedvector Wav2letter 72 ⭐
Speech Recognition model based off of FAIR research paper built using Pytorch.
Bigcidian 80 ⭐
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
Rustfst 58 ⭐
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs).
Syn Speech 52 ⭐
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Keras Sincnet 42 ⭐
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Hms Ml Demo 69 ⭐
HMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts.
Baiduasrandtts 33 ⭐
Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech; 百度语音识别、语音合成API使用。
Kolm 29 ⭐
Korean text normalization and language preparation package for LM in Kaldi-based ASR system
Pytorch_mlp_for_asr 28 ⭐
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
Mspeech 27 ⭐
Program for speech recognition using the Google Speech API, voice commands, control your computer.
Myg2p 25 ⭐
Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).
Audioku Meta Transfer Learning 29 ⭐
Implementation of meta-transfer-learning for ASR and LM (ACL 2020)
Contextnet 32 ⭐
Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recognition using global context
Syn Speech Samples 22 ⭐
An application that demostrate the usage of Syn.Speech library for Speech Recognition
Kaldi Alligner 18 ⭐
scripts to align a given wave to its transcription using trained models by Kaldi
Arabic Speech Recognition 13 ⭐
This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"