Keywords:
- Acoustic model adaptation
- acoustic modeling
- Afrikaans
- audiobook
- Automatic Speech Recognition
- Deep learning for speech
- deep MLPs
- deep neural networks
- dnn
- exemplar-based modeling
- fast adaptation
- fast training
- fmllr
- Gaussian Mixture Models
- Hidden Markov Model
- hidden variable
- hybrid system
- KL-HMM
- Kullback-Leibler divergence
- language identification
- multilayer perceptron
- multilingual acoustic modeling
- Multilingual automatic speech recognition
- multilingual speech recognition
- neural network
- neural network features
- non-native speech
- parametric synthesis
- posterior feature
- Posterior features
- Prosodic features
- Semi-supervised training
- sparse representation
- speaker adaptation
- Speaker Diarization
- speech recognition
- speech synthesis
- Subspace Gaussian Mixture Models
- Tandem
- text to speech
- text-to-speech
- text-to-speech synthesis
- triphone mapping
- under-resourced languages
- under-resourced speech recognition
- universal phoneme set
Publications of David Imseng sorted by title
P
| Posterior-based Sparse Representation for Automatic Speech Recognition, , , and , in: Proceedings of Interspeech, 2014 |
|
| Preliminary Work on Speaker Adaptation for DNN-Based Speech Synthesis, , and , Idiap-RR-02-2015 |
|
R
| Robust Speaker Diarization for Short Speech Recordings, and , Idiap-RR-26-2009 |
|
| Robust Speaker Diarization for Short Speech Recordings, and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, pages 432-437, 2009 |
|
| Robust triphone mapping for acoustic modeling, , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
| Robust triphone mapping for acoustic modeling, , and , Idiap-RR-02-2013 |
|
S
| Speaker adaptive Kullback-Leibler divergence based hidden Markov models, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
| Statistical models for HMM/ANN hybrids, and , Idiap-RR-11-2013 |
|
T
| The ICSI RT-09 Speaker Diarization System, , , , , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371-381, 2012 |
|
| Towards mixed language speech recognition systems, , and , Idiap-RR-15-2010 |
|
| Towards mixed language speech recognition systems, , and , in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010 |
|
| Tuning-Robust Initialization Methods for Speaker Diarization, and , Idiap-RR-35-2010 |
|
| Tuning-Robust Initialization Methods for Speaker Diarization, and , in: IEEE Transactions on Audio, Speech, and Language Processing, 18(8):2028-2037, 2010 |
|
U
| Using KL-divergence and multilingual information to improve ASR for under-resourced languages, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869-4872, 2012 |
|
| Using out-of-language data to improve an under-resourced speech recognizer, , , and , in: Speech Communication, 2013 |
|
| Using out-of-language data to improve an under-resourced speech recognizer, , , and , Idiap-RR-09-2013 |
|