Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis
| Type of publication: | Idiap-RR |
| Citation: | vepa_icslp04 |
| Number: | Idiap-RR-26-2004 |
| Year: | 2004 |
| Institution: | IDIAP |
| Abstract: | In our previous papers, we have proposed join cost functions derived from spectral distances, which have good correlations with perceptual scores obtained for a range of concatenation discontinuities. To further validate their ability to predict concatenation discontinuities, we have chosen the best three spectral distances and evaluated them subjectively in a listening test. The unit sequences for synthesis stimuli are obtained from a state-of-the-art unit selection text-to-speech system: rVoice from Rhetorical Systems Ltd. In this paper, we report listeners' preferences for each of the three join cost functions. |
| Userfields: | ipdmembership={speech}, |
| Keywords: | |
| Projects: |
Idiap |
| Authors: | |
| Crossref by |
vepa_icslp04_p |
| Added by: | [UNK] |
| Total mark: | 0 |
|
Attachments
|
|
|
Notes
|
|
|
|
|