Handling acoustic variation in dysarthric speech recognition systems through model combination

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Conference paper
Citation:	Hermann_INTERSPEECH_2021
Publication status:	Accepted
Booktitle:	Proceedings of Interspeech
Year:	2021
Abstract:	Developing automatic speech recognition (ASR) systems that recognise dysarthric speech as well as control speech from unimpaired speakers remains challenging. Including more highly variable dysarthric speech during training can also negatively affect the performance on control speakers, which is not desirable when developing speech recognisers for a wider audience. In this work, we analyse how the acoustic variability of dysarthric speech affects ASR systems and propose the combination of multiple acoustic models trained on different subsets of speakers to mitigate this effect. This approach shows improvements for both dysarthric and control speakers on the Torgo and UA-Speech corpora.
Keywords:	Dysarthria, Pathological speech, speech recognition
Projects	Idiap TAPAS
Authors	Hermann, Enno Magimai-Doss, Mathew
Added by:	[UNK]
Total mark:	0
Attachments
Hermann_INTERSPEECH_2021.pdf
Notes

processing time: 0.0004 seconds.