logo Idiap Research Institute        
 [BibTeX] [Marc21]
Dysarthric Speech Recognition with Lattice-Free MMI
Type of publication: Conference paper
Citation: Hermann_ICASSP_2020
Publication status: Published
Booktitle: International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Year: 2020
Pages: 6109-6113
URL: https://ieeexplore.ieee.org/do...
DOI: 10.1109/ICASSP40776.2020.9053549
Abstract: Recognising dysarthric speech is a challenging problem as it differs in many aspects from typical speech, such as speaking rate and pronunciation. In the literature the focus so far has largely been on handling these variabilities in the framework of HMM/GMM and cross-entropy based HMM/DNN systems. This paper focuses on the use of state-of-the-art sequence-discriminative training, in particular lattice-free maximum mutual information (LF-MMI), for improving dysarthric speech recognition. Through a systematic investigation on the Torgo corpus we demonstrate that LF-MMI performs well on such atypical data and compensates much better for the low speaking rates of dysarthric speakers than conventionally trained systems. This can be attributed to inherent aspects of current speech recognition training regimes, like frame subsampling and speed perturbation, which obviate the need for some techniques previously adopted specifically for dysarthric speech.
Keywords: Automatic Speech Recognition, Dysarthria, Pathological Speech Processing
Projects Idiap
TAPAS
Authors Hermann, Enno
Magimai.-Doss, Mathew
Added by: [UNK]
Total mark: 0
Attachments
  • Hermann_ICASSP_2020.pdf
Notes