Publications of NAST sorted by first author
B
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks, in: Frontiers in Neuroscience, 18(1449181), 2024 |
[DOI] |
A surrogate gradient spiking baseline for speech command recognition, in: Frontiers in Neuroscience, 2022 |
[DOI] [URL] |
Bayesian Recurrent Units and the Forward Backward Algorithm, in: Proc. Interspeech 2022, pages 4137-4141, 2022 |
[DOI] |
A Bayesian Interpretation of the Light Gated Recurrent Unit, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
[DOI] |
C
Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 |
[DOI] |
Diffusion Transformer for Adaptive Text-to-Speech, in: Proc. 12th ISCA Speech Synthesis Workshop (SSW 12), 2023 |
[DOI] |
The Idiap Speech Synthesis System for the Blizzard Challenge 2023, in: Proc. 18th Blizzard Challenge Workshop, 2023 |
[DOI] |
G
A Bayesian Approach to Recurrence in Neural Networks, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(8):2527-2537, 2021 |
[DOI] |
K
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes, in: IEEE International Joint Conference on Biometrics, 2023 |
M
HyperMixer: An MLP-based Low Cost Alternative to Transformers, in: Proc. of the 61st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Toronto, Canada, pages 15632-15654, 2023 |
[DOI] |
HyperMixer: An MLP-based Green AI Alternative to Transformers, in: arXiv, 2022
S
Controllability and Interpretability in Affective Speech Synthesis, École polytechnique fédérale de Lausanne, 2022 |
[DOI] [URL] |
Investigating a neural all pass warp in modern TTS applications, in: Speech Communication, 138:26-37, 2022 |
[DOI] |
Improving Emotional TTS with an Emotion Intensity Input from Unsupervised Extraction, in: 11th ISCA Speech Synthesis Workshop, 2021 |
[URL] |