Keywords:
- Acoustic model adaptation
- acoustic modeling
- Afrikaans
- audiobook
- Automatic Speech Recognition
- Deep learning for speech
- deep MLPs
- deep neural networks
- dnn
- exemplar-based modeling
- fast adaptation
- fast training
- fmllr
- Gaussian Mixture Models
- Hidden Markov Model
- hidden variable
- hybrid system
- KL-HMM
- Kullback-Leibler divergence
- language identification
- multilayer perceptron
- multilingual acoustic modeling
- Multilingual automatic speech recognition
- multilingual speech recognition
- neural network
- neural network features
- non-native speech
- parametric synthesis
- posterior feature
- Posterior features
- Prosodic features
- Semi-supervised training
- sparse representation
- speaker adaptation
- Speaker Diarization
- speech recognition
- speech synthesis
- Subspace Gaussian Mixture Models
- Tandem
- text to speech
- text-to-speech
- text-to-speech synthesis
- triphone mapping
- under-resourced languages
- under-resourced speech recognition
- universal phoneme set
Publications of David Imseng sorted by journal and type
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2014)
Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322-2326, IEEE, 2014
Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, pages 7639-7643, IEEE, 2014
Proceeding of Interspeech (2014)
Posterior-based Sparse Representation for Automatic Speech Recognition, in: Proceeding of Interspeech, 2014
Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013) (2013)
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013
Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (2013)
Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2013)
Speaker adaptive Kullback-Leibler divergence based hidden Markov models, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013
Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages (2012)
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60-67, 2012
Proceedings of Interspeech (2012)
Comparing different acoustic modeling techniques for multilingual boosting, in: Proceedings of Interspeech, Portland, Oregon, 2012
Proceedings of the 2012 IEEE Workshop on Spoken Language Technology (2012)
MediaParl: Bilingual mixed language accented speech database, in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263-268, 2012
Proceedings of Interspeech (2012)
Robust triphone mapping for acoustic modeling, in: Proceedings of Interspeech, Portland, Oregon, 2012
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2012)
Using KL-divergence and multilingual information to improve ASR for under-resourced languages, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869-4872, 2012
Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (2011)
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011
Proceedings of Interspeech (2011)
Improving non-native ASR through stochastic multilingual phoneme space transformations, in: Proceedings of Interspeech, Florence, Italy, pages 537-540, 2011
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2011)
Language dependent universal phoneme posterior estimation for mixed language speech recognition, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prague, CZ, pages 5012-5015, 2011
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010
Proceedings of Interspeech (2010)
Hierarchical Multilayer Perceptron based Language Identification, in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
Leveraging speaker diarization for meeting recognition from distant microphones, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390-4393, 2010
Proceedings of Interspeech (2010)
Towards mixed language speech recognition systems, in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010
Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (2009)
Robust Speaker Diarization for Short Speech Recordings, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, pages 432-437, 2009
Publications of type Phdthesis
2013
Multilingual speech recognition: A posterior based approach, École Polytechnique Fédérale de Lausanne (EPFL), 2013