CONF wisp2001/IDIAP From missing data to maybe useful data: soft data modelling for noise robust ASR Morris, Andrew Barker, Jon Bourlard, Hervé Bayesian recognition data utility HMMs missing data robust ASR EXTERNAL https://publications.idiap.ch/attachments/reports/2001/morris-2001-wisp.pdf PUBLIC https://publications.idiap.ch/index.php/publications/showcite/morris-rr-01-06 Related documents Proc. WISP 06 2001 Stratford-upon-Avon, England Much research has been focused on the problem of achieving automatic speech recognition (ASR) which approaches human recognition performance in its level of robustness to noise and channel distortion. We present here a new approach to data modelling which has the potential to combine complementary existing state-of-the-art techniques for speech enhancement and noise adaptation into a single process. In the "missing feature theory" (MFT) based approach to noise robust ASR, misinformative spectral data is detected and then ignored. Recent work has shown that MFT ASR greatly improves when the usual hard decision to exclude data features is softened by a continuous weighting between the likelihood contributions normally used with MFT for "clean" and "missing" data. The new model presented here can be seen as a generalisation of this "soft missing data" approach, in which the mixture pdf which is implicitly used to model clean or missing observation data is recognised as the data posterior pdf, and modelled accordingly. Initial "soft data" experiments compare the performance of different soft missing data models against baseline Gaussian mixture HMM performance. The test used is the Aurora 2.0 task for speaker independent continuous digits recognition. REPORT morris-RR-01-06/IDIAP From missing data to maybe useful data: soft data modelling for noise robust ASR Morris, Andrew Barker, Jon Bourlard, Hervé Bayesian recognition data utility HMMs missing data robust ASR EXTERNAL https://publications.idiap.ch/attachments/reports/2001/rr01-06.pdf PUBLIC Idiap-RR-06-2001 2001 IDIAP Much research has been focused on the problem of achieving automatic speech recognition (ASR) which approaches human recognition performance in its level of robustness to noise and channel distortion. We present here a new approach to data modelling which has the potential to combine complementary existing state-of-the-art techniques for speech enhancement and noise adaptation into a single process. In the "missing feature theory" (MFT) based approach to noise robust ASR, misinformative spectral data is detected and then ignored. Recent work has shown that MFT ASR greatly improves when the usual hard decision to exclude data features is softened by a continuous weighting between the likelihood contributions normally used with MFT for "clean" and "missing" data. The new model presented here can be seen as a generalisation of this "soft missing data" approach, in which the mixture pdf which is implicitly used to model clean or missing observation data is recognised as the data posterior pdf, and modelled accordingly. Initial "soft data" experiments compare the performance of different soft missing data models against baseline Gaussian mixture HMM performance. The test used is the Aurora 2.0 task for speaker independent continuous digits recognition.