Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables


Type of publication:	Conference paper
Citation:	stephenson02c
Booktitle:	2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP 2002)
Year:	2002
Month:	9
Address:	Martigny, Switzerland
Crossref:	stephenson02b: Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, Stephenson, Todd Andrew, Escofet, Jaume, Magimai-Doss, Mathew and Bourlard, Hervé, Idiap-RR-24-2002
Abstract:	Pitch and energy are two fundamental features describing speech, having importance in human speech recognition. However, when incorporated as features in automatic speech recognition (ASR), they usually result in a significant degradation in recognition performance due to the noise inherent in estimating or modeling them. In this paper, we show experimentally how this can be corrected by either conditioning the emission distributions upon these features or by marginalizing out these features in recognition. Since this is not straightforward to do with standard hidden Markov models (HMMs), this work has been performed in the framework of dynamic Bayesian networks (DBNs), resulting in more flexibility in defining the topology of the emission distributions and in specifying whether variables should be marginalized out.
Userfields:	ipdmembership={speech},
Keywords:
Projects	Idiap
Authors	Stephenson, Todd Andrew; Escofet, Jaume; Magimai-Doss, Mathew; Bourlard, Hervé
Attachments
nnsp2002.pdf nnsp2002.ps.gz
