logo Idiap Research Institute        
 [BibTeX] [Marc21]
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis
Type of publication: Idiap-RR
Citation: Liang_Idiap-RR-05-2010
Number: Idiap-RR-05-2010
Year: 2010
Month: 2
Institution: Idiap
Abstract: The EMIME project aims to build a personalized speech-to-speech translator, such that spoken input of a user in one language is used to produce spoken output that still sounds like the user's voice however in another language. This distinctiveness makes unsupervised cross-lingual speaker adaptation one key to the project's success. So far, research has been conducted into unsupervised and cross-lingual cases separately by means of decision tree marginalization and HMM state mapping respectively. In this paper we combine the two techniques to perform unsupervised cross-lingual speaker adaptation. The performance of eight speaker adaptation systems (supervised vs. unsupervised, intra-lingual vs. cross-lingual) are compared using objective and subjective evaluations. Experimental results show the performance of unsupervised cross-lingual speaker adaptation is comparable to that of the supervised case in terms of spectrum adaptation in the EMIME scenario, even though automatically obtained transcriptions have a very high phoneme error rate.
Crossref: Liang_ICASSP_2010:
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Liang, Hui, Dines, John and Saheer, Lakshmi, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010
Keywords:
Projects Idiap
EMIME
Authors Liang, Hui
Dines, John
Saheer, Lakshmi
Crossref by Liang_ICASSP_2010
Added by: [ADM]
Total mark: 0
Attachments
  • Liang_Idiap-RR-05-2010.pdf (MD5: 36a006edffb35066596f7191cbb2c265)
Notes