Keywords:
- Automatic Speech Recognition
- Children speech recognition
- Data Augmentation
- Dysarthria
- Few-shot learning
- Large Language Models
- Lattice-Free MMI
- multilingual bottleneck features
- Objective Evaluation
- Pathological speech
- Pathological Speech Processing
- Reading Assessment
- speech recognition
- speech synthesis
- subword modeling
- unsupervised feature extraction
- Voice Conversion
- zero-resource speech technology
Publications of Enno Hermann sorted by journal and type
Publications of type Idiap-Internal-RR
2024
SSL-TTS: Leveraging Self-Supervised Embeddings and kNN Retrieval for Zero-Shot Multi-speaker TTS, , , and , Idiap-Internal-RR-38-2024 |
[URL] |
Computer Speech and Language
Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages, , and , in: Computer Speech and Language, 65, 2021 |
[DOI] [URL] |
Proceedings of Interspeech (2024)
Towards interfacing large language models with ASR systems using confidence measures and prompting, , , , and , in: Proceedings of Interspeech, pages 2980-2984, 2024 |
[DOI] |
Proceedings of Interspeech (2023)
Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, and , in: Proceedings of Interspeech, pages 156-160, 2023 |
[DOI] [URL] |
Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report, , , , , and , in: Proceedings of Interspeech, pages 4573-4577, 2023 |
[DOI] [URL] |
Proceedings of ITG Conference on Speech Communication (2021)
An Objective Evaluation Framework for Pathological Speech Synthesis, , , , , and , in: Proceedings of ITG Conference on Speech Communication, 2021 |
|
Proceedings of Interspeech (2021)
Handling acoustic variation in dysarthric speech recognition systems through model combination, and , in: Proceedings of Interspeech, 2021 |
|
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2020)
Dysarthric Speech Recognition with Lattice-Free MMI, and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020 |
[DOI] [URL] |
Proc. Interspeech (2018)
Multilingual bottleneck features for subword modeling in zero-resource languages, and , in: Proc. Interspeech, pages 2668-2672, 2018 |
[DOI] |
Publications of type Phdthesis
2023
On matching data and model in LF-MMI-based dysarthric speech recognition, , École polytechnique fédérale de Lausanne, 2023 |
[DOI] [URL] |