An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model


Type of publication:	Conference paper
Citation:	Marelli_ICASSP2019_2019
Booktitle:	ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Year:	2019
Month:	May
Pages:	7040-7044
Publisher:	IEEE
Location:	Brighton, United Kingdom
Crossref:	Marelli_Idiap-RR-05-2019: An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model; Marelli, François; Schnell, Bastian; Bourlard, Hervé; Dutoit, T.; Garner, Philip N.; Idiap-RR-05-2019
URL:	https://ieeexplore.ieee.org/do...
DOI:	10.1109/ICASSP.2019.8683815
Abstract:	The generalized command response (GCR) model represents intonation as a superposition of muscle responses to spike command signals. We have previously shown that the spikes can be predicted by a two-stage system, consisting of a recurrent neural network and a post-processing procedure, but the responses themselves were fixed dictionary atoms. We propose an end-to-end neural architecture that replaces the dictionary atoms with trainable second-order recurrent elements analogous to recursive filters. We demonstrate gradient stability under modest conditions, and show that the system can be trained by imposing temporal sparsity constraints. Subjective listening tests demonstrate that the system can synthesize intonation with high naturalness, comparable to state-of-the-art acoustic models, and retains the physiological plausibility of the GCR model.
Keywords:	Digital IIR Filters, Fujisaki Model, neural networks, Prosody Modelling, speech synthesis
Projects:	Idiap
Authors:	Marelli, François; Schnell, Bastian; Bourlard, Hervé; Dutoit, T.; Garner, Philip N.
Attachments
Marelli_ICASSP2019_2019.pdf (Paper)
Notes
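The abstract describes intonation as a superposition of muscle responses to spike commands, with each response produced by a second-order recurrent element analogous to a recursive filter. The following is only an illustrative sketch of that idea, not the paper's implementation: it assumes a critically damped filter (a double real pole), and the function names, pole values, and spike trains are all hypothetical. In the paper the filter parameters are trained end to end; here they are fixed by hand and only the forward pass is shown.

```python
import numpy as np


def second_order_response(spikes, pole):
    """Filter a spike train with a second-order recursive (IIR) element.

    Assumed form (critically damped, double real pole at `pole`):
        y[t] = x[t] + 2*pole*y[t-1] - pole**2 * y[t-2]
    Its impulse response is (n + 1) * pole**n.
    """
    spikes = np.asarray(spikes, dtype=float)
    y = np.zeros(len(spikes))
    for t in range(len(spikes)):
        y1 = y[t - 1] if t >= 1 else 0.0  # previous output
        y2 = y[t - 2] if t >= 2 else 0.0  # output two steps back
        y[t] = spikes[t] + 2.0 * pole * y1 - pole ** 2 * y2
    return y


def synthesize_contour(spike_trains, poles):
    """Superpose the responses of several elements, one per spike train."""
    contour = np.zeros(len(spike_trains[0]))
    for spikes, pole in zip(spike_trains, poles):
        contour += second_order_response(spikes, pole)
    return contour


# Hypothetical example: two elements with different time constants.
n_frames = 50
slow = np.zeros(n_frames)
slow[5] = 1.0    # one positive spike command
fast = np.zeros(n_frames)
fast[20] = -0.5  # one negative spike command
contour = synthesize_contour([slow, fast], poles=[0.9, 0.6])
```

A pole magnitude below 1 keeps each response bounded and decaying, which is the kind of condition under which a recurrence like this stays stable. The exact constraints used in the paper are not reproduced here.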
