Keywords:
- Acoustic model adaptation
- acoustic modeling
- Afrikaans
- audiobook
- Automatic Speech Recognition
- Deep learning for speech
- deep MLPs
- deep neural networks
- dnn
- exemplar-based modeling
- fast adaptation
- fast training
- fmllr
- Gaussian Mixture Models
- Hidden Markov Model
- hidden variable
- hybrid system
- KL-HMM
- Kullback-Leibler divergence
- lan- guage identification
- multilayer perceptron
- multilingual acoustic modeling
- Multilingual automatic speech recognition
- multilingual speech recognition
- neural network
- neural network features
- non-native speech
- parametric synthesis
- posterior feature
- Posterior features
- Prosodic features
- Semi-supervised training
- sparse representation
- speaker adaptation
- Speaker Diarization
- speech recognition
- speech synthesis
- Subs-ace Gaussian Mixture Models
- Tandem
- text to speech
- text-to-speech
- text-to-speech synthesis
- triphone mapping
- under-resourced languages
- under-resourced speech recognition
- universal phoneme set
Publications of David Imseng sorted by first author
| 1 | 2 |
I
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , Idiap-RR-01-2012 |
|
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011 |
|
M
Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, , and , Idiap-RR-18-2015 |
|
Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, , , and , Idiap-RR-20-2012 |
|
Development of Bilingual ASR System for MediaParl Corpus, , , and , Idiap-RR-21-2014 |
|
Development of Bilingual ASR System for MediaParl Corpus, , , and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014 |
|
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , Idiap-RR-39-2013 |
|
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013 |
|
Exploiting foreign resources for DNN-based ASR, , , , and , Idiap-RR-27-2015 |
|
Exploiting foreign resources for DNN-based ASR, , , , and , in: EURASIP Journal on Audio, Speech, and Music Processing(2015:17), 2015 |
[DOI] |
P
Preliminary Work on Speaker Adaptation for DNN-Based Speech Synthesis, , and , Idiap-RR-02-2015 |
|
S
Leveraging speaker diarization for meeting recognition from distant microphones, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390--4393, 2010 |
V
Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation, , , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, pages 7639-7643, IEEE, 2014 |
[DOI] |
W
Comparative Study on Sentence Boundary Prediction for German and English Broadcast News, , , , and , Idiap-RR-18-2017 |
|
| 1 | 2 |