Publications of IICT sorted by journal and type
Publications of type Idiap-Internal-RR
2024
SSL-TTS: Leveraging Self-Supervised Embeddings and kNN Retrieval for Zero-Shot Multi-speaker TTS, , , and , Idiap-Internal-RR-38-2024 |
[URL] |
Interspeech (2024)
Cross-transfer Knowledge between Speech and Text Encoders to Evaluate Customer Satisfaction, , , , and , in: Interspeech, Kos Island, Greece, ISCA, 2024 |
ISCA Proceedings (2024)
Exploring generalization to unseen audio data for spoofing: insights from SSL models, , , , , and , in: ISCA Proceedings, Greece, 2024 |
[DOI] [URL] |
Proceedings of Interspeech (2024)
Towards interfacing large language models with ASR systems using confidence measures and prompting, , , , and , in: Proceedings of Interspeech, pages 2980-2984, 2024 |
[DOI] |
ISCA proceedings (2024)
Unveiling Biases while Embracing Sustainability: Assessing the Dual Challenges of Automatic Speech Recognition Systems, , , and , in: ISCA proceedings, Greece, pages 4, 2024 |
[DOI] [URL] |
What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark, , , , and , in: ISCA proceedings, Greece, 2024 |
[DOI] [URL] |
International Conference on Acoustics, Speech and Signal Processing (2024)
Comparing Data-Driven and Handcrafted Features for Dimensional Emotion Recognition, , and , in: International Conference on Acoustics, Speech and Signal Processing, 2024 |
Proceedings of Interspeech (2023)
Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, and , in: Proceedings of Interspeech, pages 156-160, 2023 |
[DOI] [URL] |
Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report, , , , , and , in: Proceedings of Interspeech, pages 4573-4577, 2023 |
[DOI] [URL] |
Publications of type Phdthesis
2023
On matching data and model in LF-MMI-based dysarthric speech recognition, , École polytechnique fédérale de Lausanne, 2023 |
[DOI] [URL] |