Keywords:
- Automatic Speech Recognition
- Children speech recognition
- Data Augmentation
- Dysarthria
- Few-shot learning
- Large Language Models
- Lattice-Free MMI
- multilingual bottleneck features
- Objective Evaluation
- Pathological speech
- Pathological Speech Processing
- Reading Assessment
- speech recognition
- speech synthesis
- subword modeling
- unsupervised feature extraction
- Voice Conversion
- zero-resource speech technology
Publications of Enno Hermann sorted by first author
E
SSL-TTS: Leveraging Self-Supervised Embeddings and kNN Retrieval for Zero-Shot Multi-speaker TTS, Idiap-Internal-RR-38-2024
H
An Objective Evaluation Framework for Pathological Speech Synthesis, in: Proceedings of ITG Conference on Speech Communication, 2021
On matching data and model in LF-MMI-based dysarthric speech recognition, École polytechnique fédérale de Lausanne, 2023
Multilingual bottleneck features for subword modeling in zero-resource languages, in: Proceedings of Interspeech, pages 2668-2672, 2018
Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages, in: Computer Speech and Language, 65, 2021
Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, in: Proceedings of Interspeech, pages 156-160, 2023
Handling acoustic variation in dysarthric speech recognition systems through model combination, in: Proceedings of Interspeech, 2021
Dysarthric Speech Recognition with Lattice-Free MMI, in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020
N
Towards interfacing large language models with ASR systems using confidence measures and prompting, in: Proceedings of Interspeech, pages 2980-2984, 2024
P
Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report, in: Proceedings of Interspeech, pages 4573-4577, 2023