Idiap Research Institute
The Kaldi Speech Recognition Toolkit
Type of publication: Idiap-RR
Citation: Povey_Idiap-RR-04-2012
Number: Idiap-RR-04-2012
Year: 2012
Month: 1
Institution: Idiap
Address: Rue Marconi 19, Martigny
Abstract: We describe the design of Kaldi, a free, open-source toolkit for speech recognition research. Kaldi provides a speech recognition system based on finite-state automata (using the freely available OpenFst), together with detailed documentation and a comprehensive set of scripts for building complete recognition systems. Kaldi is written in C++, and the core library supports modeling of arbitrary phonetic-context sizes, acoustic modeling with subspace Gaussian mixture models (SGMM) as well as standard Gaussian mixture models, together with all commonly used linear and affine transforms. Kaldi is released under the Apache License v2.0, which is highly nonrestrictive, making it suitable for a wide community of users.
Keywords: ASR, Automatic Speech Recognition, GMM, HTK, SGMM
Projects Idiap
Authors Povey, Daniel
Ghoshal, Arnab
Boulianne, Gilles
Burget, Lukas
Glembek, Ondrej
Goel, Nagendra
Hannemann, Mirko
Motlicek, Petr
Qian, Yanmin
Schwarz, Petr
Silovsky, Jan
Stemmer, Georg
Vesely, Karel
Crossref by Povey_ASRU2011_2011
Added by: [ADM]
Attachments
  • Povey_Idiap-RR-04-2012.pdf (MD5: c0ac211369f43cedb861ce945afb1cbf)