Publications of NAST sorted by journal and type
Frontiers in Neuroscience
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks, and , in: Frontiers in Neuroscience, 18(1449181), 2024 |
[DOI] |
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting, and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 |
[DOI] |
arXiv
HyperMixer: An MLP-based Green AI Alternative to Transformers, , , , , , and , in: arXiv, 2022 |
Frontiers in Neuroscience
A surrogate gradient spiking baseline for speech command recognition, and , in: Frontiers in Neuroscience, 2022 |
[DOI] [URL] |
Speech Communication
Investigating a neural all pass warp in modern TTS applications, and , in: Speech Communication, 138:26-37, 2022 |
[DOI] |
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Bayesian Approach to Recurrence in Neural Networks, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(8):2527-2537, 2021 |
[DOI] |
Proc. 12th ISCA Speech Synthesis Workshop (SSW 12) (2023)
Diffusion Transformer for Adaptive Text-to-Speech, and , in: Proc. 12th ISCA Speech Synthesis Workshop (SSW 12), 2023 |
[DOI] |
Proc. of the 61st Annual Meeting of the Association for Computational Linguistics (2023)
HyperMixer: An MLP-based Low Cost Alternative to Transformers, , , , , , and , in: Proc. of the 61st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Toronto, Canada, pages 15632-15654, 2023 |
[DOI] |
Proc. 18th Blizzard Challenge Workshop (2023)
The Idiap Speech Synthesis System for the Blizzard Challenge 2023, , , and , in: Proc. 18th Blizzard Challenge Workshop, 2023 |
[DOI] |
IEEE International Joint Conference on Biometrics (2023)
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes, , , and , in: IEEE International Joint Conference on Biometrics, 2023 |
Proc. Interspeech 2022 (2022)
Bayesian Recurrent Units and the Forward Backward Algorithm, and , in: Proc. Interspeech 2022, pages 4137-4141, 2022 |
[DOI] |
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2021)
A Bayesian Interpretation of the Light Gated Recurrent Unit, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
[DOI] |
11th ISCA Speech Synthesis Workshop (2021)
Improving Emotional TTS with an Emotion Intensity Input from Unsupervised Extraction, and , in: 11th ISCA Speech Synthesis Workshop, 2021 |
[URL] |
Publications of type Phdthesis
2022
Controllability and Interpretability in Affective Speech Synthesis, , École polytechnique fédérale de Lausanne, 2022 |
[DOI] [URL] |