CONF
Goldman_IS2016/IDIAP
The SIWIS database: a multilingual speech database with acted emphasis
Goldman, Jean-Philippe
Honnet, Pierre-Edouard
Clark, Rob
Garner, Philip N.
Ivanova, Maria
Lazaridis, Alexandros
Liang, Hui
Macedo, Tiago
Pfister, Beat
Ribeiro, Manuel Sam
Wehrli, Eric
Yamagishi, Junichi
EXTERNAL
https://publications.idiap.ch/attachments/papers/2016/Goldman_IS2016.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/Goldman_Idiap-RR-13-2016
Related documents
Proceedings of Interspeech
San Francisco, USA
2016
1532--1535
10.21437/Interspeech.2016-1003
doi
REPORT
Goldman_Idiap-RR-13-2016/IDIAP
The SIWIS database: a multilingual speech database with acted emphasis
Goldman, Jean-Philippe
Honnet, Pierre-Edouard
Clark, Rob
Garner, Philip N.
Ivanova, Maria
Lazaridis, Alexandros
Liang, Hui
Macedo, Tiago
Pfister, Beat
Ribeiro, Manuel Sam
Wehrli, Eric
Yamagishi, Junichi
bilingual speakers
emphasis
speech corpus
speech-to-speech translation
EXTERNAL
https://publications.idiap.ch/attachments/reports/2016/Goldman_Idiap-RR-13-2016.pdf
PUBLIC
Idiap-RR-13-2016
2016
Idiap
May 2016
We describe here a collection of speech data of bilingual and trilingual speakers of English, French, German and Italian. In the context of speech to speech translation (S2ST), this database is designed for several purposes and studies: training CLSA systems (cross-language speaker adaptation), conveying emphasis through S2ST systems, and evaluating TTS systems. More precisely, 36 speakers judged as accentless (22 bilingual and 14 trilingual speakers) were recorded for a set of 171 prompts in two or three languages, amounting to a total of 24 hours of speech. These sets of prompts include 100 sentences from news, 25 sentences from Europarl, the same 25 sentences with one acted emphasised word, 20 semantically unpredictable sentences, and finally a 240-word long text. All in all, it yielded 64 bilingual session pairs of the six possible combinations of the four languages. The database is freely available for non-commercial use and scientific research purposes.