CONF
Rasipuram_INTERSPEECH_2015/IDIAP
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities
Rasipuram, Ramya
Cernak, Milos
Nanchen, Alexandre
Magimai-Doss, Mathew
Automatic accent evaluation
dynamic programming
KL-divergence
non-native speech
phonetic representation
Posterior features
EXTERNAL
https://publications.idiap.ch/attachments/papers/2015/Rasipuram_INTERSPEECH_2015.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/Rasipuram_Idiap-RR-12-2015
Related documents
Proceedings of Interspeech
2015
Automatic evaluation of non-native speech accentedness has potential implications not only for language learning and accent identification systems but also for speaker and speech recognition systems. From the perspective of speech production, the two primary factors influencing accentedness are the phonetic and prosodic structure. In this paper, we propose an approach for automatic accentedness evaluation based on comparison of instances of native and non-native speakers at the acoustic-phonetic level. Specifically, the proposed approach measures accentedness by comparing phone class conditional probability sequences corresponding to the instances of native and non-native speakers, respectively. We evaluate the proposed approach on the EMIME bilingual and EMIME Mandarin bilingual corpora, which contain English speech from native English speakers and from non-native English speakers whose native languages are Finnish, German and Mandarin. We also investigate the influence of the granularity of the phonetic unit representation on the performance of the proposed accentedness measure. Our results indicate that the accentedness ratings produced by the proposed approach correlate consistently with human ratings of accentedness. In addition, our studies show that the granularity of the phonetic unit representation that yields the best correlation with the human accentedness ratings varies with the native language of the non-native speakers.
REPORT
Rasipuram_Idiap-RR-12-2015/IDIAP
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities
Rasipuram, Ramya
Cernak, Milos
Nanchen, Alexandre
Magimai-Doss, Mathew
Automatic accent evaluation
dynamic programming
KL-divergence
non-native speech
phonetic representation
Posterior features
EXTERNAL
https://publications.idiap.ch/attachments/reports/2015/Rasipuram_Idiap-RR-12-2015.pdf
PUBLIC
Idiap-RR-12-2015
2015
Idiap
June 2015
Automatic evaluation of non-native speech accentedness has potential implications not only for language learning and accent identification systems but also for speaker and speech recognition systems. From the perspective of speech production, the two primary factors influencing accentedness are the phonetic and prosodic structure. In this paper, we propose an approach for automatic accentedness evaluation based on comparison of instances of native and non-native speakers at the acoustic-phonetic level. Specifically, the proposed approach measures accentedness by comparing phone class conditional probability sequences corresponding to the instances of native and non-native speakers, respectively. We evaluate the proposed approach on the EMIME bilingual and EMIME Mandarin bilingual corpora, which contain English speech from native English speakers and from non-native English speakers whose native languages are Finnish, German and Mandarin. We also investigate the influence of the granularity of the phonetic unit representation on the performance of the proposed accentedness measure. Our results indicate that the accentedness ratings produced by the proposed approach correlate consistently with human ratings of accentedness. In addition, our studies show that the granularity of the phonetic unit representation that yields the best correlation with the human accentedness ratings varies with the native language of the non-native speakers.