CONF Hermann_ICASSP_2020/IDIAP Dysarthric Speech Recognition with Lattice-Free MMI Hermann, Enno Magimai-Doss, Mathew Automatic Speech Recognition Dysarthria Pathological Speech Processing EXTERNAL https://publications.idiap.ch/attachments/papers/2020/Hermann_ICASSP_2020.pdf PUBLIC International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020 6109-6113 https://ieeexplore.ieee.org/document/9053549 URL 10.1109/ICASSP40776.2020.9053549 doi Recognising dysarthric speech is a challenging problem as it differs in many aspects from typical speech, such as speaking rate and pronunciation. In the literature the focus so far has largely been on handling these variabilities in the framework of HMM/GMM and cross-entropy based HMM/DNN systems. This paper focuses on the use of state-of-the-art sequence-discriminative training, in particular lattice-free maximum mutual information (LF-MMI), for improving dysarthric speech recognition. Through a systematic investigation on the Torgo corpus we demonstrate that LF-MMI performs well on such atypical data and compensates much better for the low speaking rates of dysarthric speakers than conventionally trained systems. This can be attributed to inherent aspects of current speech recognition training regimes, like frame subsampling and speed perturbation, which obviate the need for some techniques previously adopted specifically for dysarthric speech.