CONF Rasipuram_INTERSPEECH_2016/IDIAP HMM-based Non-native Accent Assessment using Posterior Features Rasipuram, Ramya Cernak, Milos Magimai-Doss, Mathew Automatic accent assessment KL-divergence lexical model non-native speech Posterior features http://publications.idiap.ch/index.php/publications/showcite/Rasipuram_Idiap-RR-32-2015 Related documents Proceedings of Interspeech San Francisco, USA 2016 Automatic non-native accent assessment has potential benefits in language learning and speech technologies. The three fundamental challenges in automatic accent assessment are to characterize, model and assess individual variation in speech of the non-native speaker. In our recent work, accentedness score was automatically obtained by comparing two phone probability sequences obtained through instances of non-native and native speech. Although automatic accentedness ratings of the approach correlated well with human accent ratings, the approach is critically constrained because of the requirement of native speech instance. In this paper, we build on the previous work and obtain the native latent symbol probability sequence through the word hypothesis modeled as a hidden Markov model (HMM). The latent symbols are either context-independent phonemes or clustered context-dependent phonemes. The advantage of the proposed approach is that it requires just reference text transcription instead of native speech recordings. Using the HMMs trained on an auxiliary native speech corpus, the proposed approach achieves a correlation of 0.68 with human accent ratings on the ISLE corpus. This is further interesting considering that the approach does not use any non-native data and human accent ratings at any stage of the system development. REPORT Rasipuram_Idiap-RR-32-2015/IDIAP HMM-based Non-native Accent Assessment using Posterior Features Rasipuram, Ramya Cernak, Milos Magimai-Doss, Mathew Automatic accent assessment KL-divergence KL-HMM lexical model non-native speech Posterior features EXTERNAL http://publications.idiap.ch/attachments/reports/2015/Rasipuram_Idiap-RR-32-2015.pdf PUBLIC Idiap-RR-32-2015 2015 Idiap October 2015 Automatic non-native accent assessment has many potential benefits in language learning and speech technologies. The three fundamental challenges in automatic accent assessment are to characterize, model and assess individual variation in speech of the non-native speaker. In our recent work, accentedness score was automatically obtained by comparing two phone probability sequences obtained through instances of non-native and native speech. In this paper, we build on the previous work and obtain the native latent symbol probability sequence through the word hypothesis modeled as a hidden Markov model (HMM). The approach overcomes the necessity for a native human reference speech of the same sentence. Using the HMMs trained on an auxiliary native speech corpus, the proposed approach achieves a correlation of 0.68 with the human accent ratings on the ISLE corpus. This is further interesting considering that the approach does not use any non-native data and human accent ratings at any stage of the system development.