87 Open Source Kaldi Software Projects
Free and open source Kaldi code projects including engines, APIs, generators, and tools.
Pytorch Kaldi 2113 ⭐
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
Vosk API 2890 ⭐
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Awesome Kaldi 476 ⭐
A list of features, scripts, blogs, and resources for making better use of Kaldi (http://kaldi-asr.org/)
React Transcript Editor 410 ⭐
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Speech Aligner 300 ⭐
speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.
Self Supervised Speech Pretraining And Representation Learning 1042 ⭐
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
Vosk Server 440 ⭐
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Py Kaldi Asr 166 ⭐
Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible.
Kaldi Active Grammar 263 ⭐
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Factorized Tdnn 122 ⭐
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Pytorch Kaldi Neural Speaker Embeddings 129 ⭐
A lightweight neural speaker embedding extractor based on Kaldi and PyTorch.
Yh1008 Speech To Text 59 ⭐
Mixed-language speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Rustfst 89 ⭐
Rust re-implementation of OpenFst, a library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs).
Tf_kaldi_io 40 ⭐
A Python package that lets TensorFlow read Kaldi scp/ark files in an elegant way, intended to make it easy for Kaldi users to move into the TensorFlow world.
Theano Kaldi Rnn 32 ⭐
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano code is coupled with the Kaldi decoder.
Pytorch_mlp_for_asr 34 ⭐
This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
Srvk Eesen Offline Transcriber 22 ⭐
Top-level code to transcribe English audio/video files into text/subtitles
Kaldi Alligner 22 ⭐
Scripts to align a given WAV file with its transcription using models trained with Kaldi
Dropclass_speaker 20 ⭐
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Nn Similarity Diarization 24 ⭐
Neural-network-based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
Arabic Speech Recognition 20 ⭐
This repository contains my attempt to use two well-known speech recognition frameworks (Kaldi, CMU Sphinx4) for the Arabic language using the publicly available dataset "Arabic Corpus of Isolated Words"
Kaldi_helpers 13 ⭐
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Lattice_combination 14 ⭐
Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices
Research_speech_speaker_verification_nist_sre2010 12 ⭐
Kaldi Timit Sre Ivector 16 ⭐
Develops a speaker recognition model based on i-vectors using the TIMIT database
Funcwj Aps 87 ⭐
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
Vosk Browser 63 ⭐
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
Kaldi_nl 47 ⭐
Code related to the Dutch instance and user groups of the KALDI speech recognition toolkit
Kaldifeat 55 ⭐
Kaldi-compatible feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd. Provides C++ and Python APIs.
Kaldi_ag_training 14 ⭐
Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.
Voice Privacy Challenge 2020 41 ⭐
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf