The Kaldi Speech Recognition Toolkit

Type of publication:	Idiap-RR
Citation:	Povey_Idiap-RR-04-2012
Number:	Idiap-RR-04-2012
Year:	2012
Month:	1
Institution:	Idiap
Address:	Rue Marconi 19, Martigny
Abstract:	We describe the design of Kaldi, a free, open-source toolkit for speech recognition research. Kaldi provides a speech recognition system based on finite-state automata (using the freely available OpenFst), together with detailed documentation and a comprehensive set of scripts for building complete recognition systems. Kaldi is written in C++, and the core library supports modeling of arbitrary phonetic-context sizes, acoustic modeling with subspace Gaussian mixture models (SGMM) as well as standard Gaussian mixture models, together with all commonly used linear and affine transforms. Kaldi is released under the Apache License v2.0, which is highly nonrestrictive, making it suitable for a wide community of users.
Keywords:	ASR, Automatic Speech Recognition, GMM, HTK, SGMM
Projects	Idiap
Authors	Povey, Daniel; Ghoshal, Arnab; Boulianne, Gilles; Burget, Lukas; Glembek, Ondrej; Goel, Nagendra; Hannemann, Mirko; Motlicek, Petr; Qian, Yanmin; Schwarz, Petr; Silovsky, Jan; Stemmer, Georg; Vesely, Karel
Crossref by	Povey_ASRU2011_2011
Added by:	[ADM]
Attachments
Povey_Idiap-RR-04-2012.pdf (MD5: c0ac211369f43cedb861ce945afb1cbf)