The Kaldi Speech Recognition Toolkit

Type of publication:	Conference paper
Citation:	Povey_ASRU2011_2011
Publication status:	Published
Booktitle:	IEEE 2011 Workshop on Automatic Speech Recognition and Understanding
Year:	2011
Month:	December
Publisher:	IEEE Signal Processing Society
Location:	Hilton Waikoloa Village, Big Island, Hawaii, US
Note:	IEEE Catalog No.: CFP11SRW-USB
ISBN:	978-1-4673-0366-8
Crossref:	Povey_Idiap-RR-04-2012: The Kaldi Speech Recognition Toolkit, Povey, Daniel, Ghoshal, Arnab, Boulianne, Gilles, Burget, Lukas, Glembek, Ondrej, Goel, Nagendra, Hannemann, Mirko, Motlicek, Petr, Qian, Yanmin, Schwarz, Petr, Silovsky, Jan, Stemmer, Georg and Vesely, Karel, Idiap-RR-04-2012
Abstract:	We describe the design of Kaldi, a free, open-source toolkit for speech recognition research. Kaldi provides a speech recognition system based on finite-state transducers (using the freely available OpenFst), together with detailed documentation and scripts for building complete recognition systems. Kaldi is written is C++, and the core library supports modeling of arbitrary phonetic-context sizes, acoustic modeling with subspace Gaussian mixture models (SGMM) as well as standard Gaussian mixture models, together with all commonly used linear and affine transforms. Kaldi is released under the Apache License v2.0, which is highly nonrestrictive, making it suitable for a wide community of users.
Keywords:	ASR, Automatic Speech Recognition, GMM, HTK, SGMM
Projects:	Idiap IM2
Authors:	Povey, Daniel Ghoshal, Arnab Boulianne, Gilles Burget, Lukas Glembek, Ondrej Goel, Nagendra Hannemann, Mirko Motlicek, Petr Qian, Yanmin Schwarz, Petr Silovsky, Jan Stemmer, Georg Vesely, Karel
Added by:	[UNK]
Total mark:	0
Attachments
Povey_ASRU2011_2011.pdf
Notes

processing time: 0.0003 seconds.