CONF Rasipuram_INTERSPEECH_2015/IDIAP Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities Rasipuram, Ramya Cernak, Milos Nanchen, Alexandre Magimai-Doss, Mathew Automatic accent evaluation dynamic programming KL-divergence non-native speech phonetic representation Posterior features EXTERNAL https://publications.idiap.ch/attachments/papers/2015/Rasipuram_INTERSPEECH_2015.pdf PUBLIC https://publications.idiap.ch/index.php/publications/showcite/Rasipuram_Idiap-RR-12-2015 Related documents Proceedings of Interspeech 2015 Automatic evaluation of non-native speech accentedness has potential implications for not only language learning and accent identification systems but also for speaker and speech recognition systems. From the perspective of speech production, the two primary factors influencing the accentedness are the phonetic and prosodic structure. In this paper, we propose an approach for automatic accentedness evaluation based on comparison of instances of native and non-native speakers at the acoustic-phonetic level. Specifically, the proposed approach measures accentedness by comparing phone class conditional probability sequences corresponding to the instances of native and non-native speakers, respectively. We evaluate the proposed approach on the EMIME bilingual and EMIME Mandarin bilingual corpora, which contains English speech from native English speakers and various non-native English speakers, namely Finnish, German and Mandarin. We also investigate the influence of the granularity of the phonetic unit representation on the performance of the proposed accentedness measure. Our results indicate that the accentedness ratings by the proposed approach correlate consistently with the human ratings of accentedness. In addition, our studies show that the granularity of the phonetic unit representation that yields the best correlation with the human accentedness ratings varies with respect to the native language of the non-native speakers. REPORT Rasipuram_Idiap-RR-12-2015/IDIAP Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities Rasipuram, Ramya Cernak, Milos Nanchen, Alexandre Magimai-Doss, Mathew Automatic accent evaluation dynamic programming KL-divergence non-native speech phonetic representation Posterior features EXTERNAL https://publications.idiap.ch/attachments/reports/2015/Rasipuram_Idiap-RR-12-2015.pdf PUBLIC Idiap-RR-12-2015 2015 Idiap June 2015 Automatic evaluation of non-native speech accentedness has potential implications for not only language learning and accent identification systems but also for speaker and speech recognition systems. From the perspective of speech production, the two primary factors influencing the accentedness are the phonetic and prosodic structure. In this paper, we propose an approach for automatic accentedness evaluation based on comparison of instances of native and non-native speakers at the acoustic-phonetic level. Specifically, the proposed approach measures accentedness by comparing phone class conditional probability sequences corresponding to the instances of native and non-native speakers, respectively. We evaluate the proposed approach on the EMIME bilingual and EMIME Mandarin bilingual corpora, which contains English speech from native English speakers and various non-native English speakers, namely Finnish, German and Mandarin. We also investigate the influence of the granularity of the phonetic unit representation on the performance of the proposed accentedness measure. Our results indicate that the accentedness ratings by the proposed approach correlate consistently with the human ratings of accentedness. In addition, our studies show that the granularity of the phonetic unit representation that yields the best correlation with the human accentedness ratings varies with respect to the native language of the non-native speakers.