CONF
Rasipuram_INTERSPEECH_2016/IDIAP
HMM-based Non-native Accent Assessment using Posterior Features
Rasipuram, Ramya
Cernak, Milos
Magimai-Doss, Mathew
Automatic accent assessment
KL-divergence
lexical model
non-native speech
Posterior features
https://publications.idiap.ch/index.php/publications/showcite/Rasipuram_Idiap-RR-32-2015
Related documents
Proceedings of Interspeech
San Francisco, USA
2016
Automatic non-native accent assessment has potential benefits in language learning and speech technologies. The three fundamental challenges in automatic accent assessment are to characterize, model and assess individual variation in speech of the non-native speaker. In our recent work, accentedness score was automatically obtained by comparing two phone probability sequences obtained through instances of non-native and native speech. Although automatic accentedness ratings of the approach correlated well with human accent ratings, the approach is critically constrained because of the requirement of native speech instance. In this paper, we build on the previous work and obtain the native latent symbol probability sequence through the word hypothesis modeled as a hidden Markov model (HMM). The latent symbols are either context-independent phonemes or clustered context-dependent phonemes. The advantage of the proposed approach is that it requires just reference text transcription instead of native speech recordings. Using the HMMs trained on an auxiliary native speech corpus, the proposed approach achieves a correlation of 0.68 with human accent ratings on the ISLE corpus. This is further interesting considering that the approach does not use any non-native data and human accent ratings at any stage of the system development.
REPORT
Rasipuram_Idiap-RR-32-2015/IDIAP
HMM-based Non-native Accent Assessment using Posterior Features
Rasipuram, Ramya
Cernak, Milos
Magimai-Doss, Mathew
Automatic accent assessment
KL-divergence
KL-HMM
lexical model
non-native speech
Posterior features
EXTERNAL
https://publications.idiap.ch/attachments/reports/2015/Rasipuram_Idiap-RR-32-2015.pdf
PUBLIC
Idiap-RR-32-2015
2015
Idiap
October 2015
Automatic non-native accent assessment has many potential benefits in language learning and speech technologies. The three fundamental challenges in automatic accent assessment are to characterize, model and assess individual variation in speech of the non-native speaker. In our recent work, accentedness score was automatically obtained by comparing two phone probability sequences obtained through instances of non-native and native speech. In this paper, we build on the previous work and obtain the native latent symbol probability sequence through the word hypothesis modeled as a hidden Markov model (HMM). The approach overcomes the necessity for a native human reference speech of the same sentence. Using the HMMs trained on an auxiliary native speech corpus, the proposed approach achieves a correlation of 0.68 with the human accent ratings on the ISLE corpus. This is further interesting considering that the approach does not use any non-native data and human accent ratings at any stage of the system development.