Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis
Type of publication: | Idiap-RR |
Citation: | vepa_icslp04 |
Number: | Idiap-RR-26-2004 |
Year: | 2004 |
Institution: | IDIAP |
Abstract: | In our previous papers, we have proposed join cost functions derived from spectral distances, which have good correlations with perceptual scores obtained for a range of concatenation discontinuities. To further validate their ability to predict concatenation discontinuities, we have chosen the best three spectral distances and evaluated them subjectively in a listening test. The unit sequences for synthesis stimuli are obtained from a state-of-the-art unit selection text-to-speech system: rVoice from Rhetorical Systems Ltd. In this paper, we report listeners' preferences for each of the three join cost functions. |
Userfields: | ipdmembership={speech}, |
Keywords: | |
Projects |
Idiap |
Authors | |
Crossref by |
vepa_icslp04_p |
Added by: | [UNK] |
Total mark: | 0 |
Attachments
|
|
Notes
|
|
|