Keywords:
- All pass warp
- Digital IIR Filters
- Emotion Recognition
- emotional speech synthesis
- emotional TTS
- End-to-End Speech synthesis
- Fujisaki Model
- neural networks
- Prosody Modelling
- recurrent neural network
- Saliency Mapping
- Speech enhancement
- speech synthesis
- TTS
- unit selection
- VAE
- Voice Conversion
- VTLN
- WaveNet
- zero-shot speaker adaptation
Publications of Bastian Schnell
2022
Controllability and Interpretability in Affective Speech Synthesis, École polytechnique fédérale de Lausanne, 2022
[DOI] [URL]
Investigating a neural all pass warp in modern TTS applications, in: Speech Communication, 138:26-37, 2022
[DOI]
2021
Improving Emotional TTS with an Emotion Intensity Input from Unsupervised Extraction, in: 11th ISCA Speech Synthesis Workshop, 2021
[URL]
2019
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, Idiap-RR-05-2019
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pages 7040-7044, IEEE, 2019
[DOI] [URL]
Neural VTLN for Speaker Adaptation in TTS, in: Proc. 10th ISCA Speech Synthesis Workshop, ISCA, Vienna, Austria, pages 6, 2019
[DOI]
2018
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, Idiap-RR-10-2018
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, in: Proc. Interspeech 2018, pages 3147-3151, 2018
[DOI]
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, in: MLSLP-18 Proceedings, Hyderabad, 2018
[URL]