Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Journal paper
Citation:	Dines_CSL_2011
Publication status:	Accepted
Journal:	Computer Speech and Language
Year:	2011
URL:	http://www.sciencedirect.com/s...
DOI:	doi:10.1016/j.csl.2011.08.003
Abstract:	In this paper we present results of unsupervised cross-lingual speaker adaptation applied to text-to-speech synthesis. The application of our research is the personalisation of speech-to-speech translation in which we employ a HMM statistical framework for both speech recognition and synthesis. This framework provides a logical mechanism to adapt synthesised speech output to the voice of the user by way of speech recognition. In this work we present results of several different unsupervised and cross-lingual adaptation approaches as well as an end-to-end speaker adaptive speech-to-speech translation system. Our experiments show that we can successfully apply speaker adaptation in both unsupervised and cross-lingual scenarios and our proposed algorithms seem to generalise well for several language pairs. We also discuss important future directions including the need for better evaluation metrics.
Keywords:	cross-lingual speaker adaptation, Machine Translation, speech recognition, speech synthesis
Projects	Idiap EMIME
Authors	Dines, John Liang, Hui Saheer, Lakshmi Gibson, Matthew Byrne, William Oura, Keiichiro Tokuda, Keiichi Yamagishi, Junichi King, Simon Wester, Mirjam Hirsimäki, Teemu Karhila, Reima Kurimo, Mikko
Added by:	[UNK]
Total mark:	0
Attachments
Dines_CSL_2011.pdf (Version prior to proofing.)
Notes

processing time: 0.0002 seconds.