Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Conference paper
Citation:	stephenson02a
Booktitle:	International Conference on Pattern Recognition (ICPR~2002)
Volume:	4
Year:	2002
Month:	8
Address:	Quebec City, PQ, Canada
Crossref:	stephenson01c: Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, Stephenson, Todd Andrew, Magimai-Doss, Mathew and Bourlard, Hervé, Idiap-RR-45-2001
Abstract:	Standard hidden Markov models (HMMs,',','), as used in automatic speech recognition (ASR,',','), calculate their emission probabilities by an artificial neural network (ANN) or a Gaussian distribution conditioned on the hidden state variable, considering the emissions independent of any other variable in the model. Recent work showed the benefit of conditioning the emission distributions on a discrete auxiliary variable, which is observed in training and hidden in recognition. Related work has shown the utility of conditioning the emission distributions on a continuous auxiliary variable. We apply mixed Bayesian networks (BNs) to extend these works by introducing a continuous auxiliary variable that is observed in training but is hidden in recognition. We find that an auxiliary pitch variable conditioned itself upon the hidden state can degrade performance unless the auxiliary variable is also hidden. The performance, furthermore, can be improved by making the auxiliary pitch variable independent of the hidden state.
Userfields:	ipdmembership={speech},
Keywords:
Projects	Idiap
Authors	Stephenson, Todd Andrew Magimai-Doss, Mathew Bourlard, Hervé
Added by:	[UNK]
Total mark:	0
Attachments
icpr2002.pdf icpr2002.ps.gz
Notes

processing time: 0.0003 seconds.