Keywords:
- Acoustic model adaptation
- acoustic modeling
- Afrikaans
- audiobook
- Automatic Speech Recognition
- Deep learning for speech
- deep MLPs
- deep neural networks
- dnn
- exemplar-based modeling
- fast adaptation
- fast training
- fmllr
- Gaussian Mixture Models
- Hidden Markov Model
- hidden variable
- hybrid system
- KL-HMM
- Kullback-Leibler divergence
- lan- guage identification
- multilayer perceptron
- multilingual acoustic modeling
- Multilingual automatic speech recognition
- multilingual speech recognition
- neural network
- neural network features
- non-native speech
- parametric synthesis
- posterior feature
- Posterior features
- Prosodic features
- Semi-supervised training
- sparse representation
- speaker adaptation
- Speaker Diarization
- speech recognition
- speech synthesis
- Subs-ace Gaussian Mixture Models
- Tandem
- text to speech
- text-to-speech
- text-to-speech synthesis
- triphone mapping
- under-resourced languages
- under-resourced speech recognition
- universal phoneme set
Publications of David Imseng sorted by journal and type
| 1 | 2 |
Publications of type Idiap-RR
2017
Comparative Study on Sentence Boundary Prediction for German and English Broadcast News, , , , and , Idiap-RR-18-2017 |
|
2016
Feature mapping using far-field microphones for distant speech recognition, , , and , Idiap-RR-20-2016 |
|
2015
Exploiting foreign resources for DNN-based ASR, , , , and , Idiap-RR-27-2015 |
|
Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, , and , Idiap-RR-18-2015 |
|
Preliminary Work on Speaker Adaptation for DNN-Based Speech Synthesis, , and , Idiap-RR-02-2015 |
|
2014
Development of Bilingual ASR System for MediaParl Corpus, , , and , Idiap-RR-21-2014 |
|
2013
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , Idiap-RR-01-2013 |
|
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , Idiap-RR-39-2013 |
|
MediaParl: Bilingual mixed language accented speech database, , , , , and , Idiap-RR-03-2013 |
|
Robust triphone mapping for acoustic modeling, , and , Idiap-RR-02-2013 |
|
Statistical models for HMM/ANN hybrids, and , Idiap-RR-11-2013 |
|
Using out-of-language data to improve an under-resourced speech recognizer, , , and , Idiap-RR-09-2013 |
|
2012
Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, , , and , Idiap-RR-20-2012 |
|
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, , and , Idiap-RR-15-2012 |
|
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , Idiap-RR-01-2012 |
|
2011
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , Idiap-RR-19-2011 |
|
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , Idiap-RR-13-2011 |
|
2010
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, and , Idiap-RR-02-2010 |
|
Hierarchical Multilayer Perceptron based Language Identification, , and , Idiap-RR-14-2010 |
|
Towards mixed language speech recognition systems, , and , Idiap-RR-15-2010 |
|
Tuning-Robust Initialization Methods for Speaker Diarization, and , Idiap-RR-35-2010 |
|
2009
Novel initialization methods for Speaker Diarization, , Idiap-RR-07-2009 |
|
Robust Speaker Diarization for Short Speech Recordings, and , Idiap-RR-26-2009 |
|
Publications of type Idiap-Com
2012
Decision tree clustering for KL-HMM, and , Idiap-Com-01-2012 |
|
EURASIP Journal on Audio, Speech, and Music Processing
Exploiting foreign resources for DNN-based ASR, , , , and , in: EURASIP Journal on Audio, Speech, and Music Processing(2015:17), 2015 |
[DOI] |
IEEE Transactions on Audio, Speech, and Language Processing
Tuning-Robust Initialization Methods for Speaker Diarization, and , in: IEEE Transactions on Audio, Speech, and Language Processing, 18(8):2028-2037, 2010 |
[DOI] |
IEEE Transactions on Audio, Speech, and Language Processing
Applying multi- and cross-lingual stochastic phone space transformations to non-native speech recognition, , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 2013 |
[DOI] |
The ICSI RT-09 Speaker Diarization System, , , , , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371--381, 2012 |
[DOI] |
Sadhana
Current trends in multilingual speech processing, , , , , , , , and , in: Sadhana, 36(5):885–915, 2011 |
[DOI] [URL] |
Speech Communication
Feature mapping using far-field microphones for distant speech recognition, , , and , in: Speech Communication, 83:1-9, 2016 |
[DOI] [URL] |
Using out-of-language data to improve an under-resourced speech recognizer, , , and , in: Speech Communication, 2013 |
[DOI] [URL] |
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2015)
Building context-dependent DNN acoustic models using Kullback-Leibler divergence-based state tying, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2015 |
|
IEEE International Conference on Acoustics, Speech, and Signal Processing (2015)
Learning Feature Mapping using Deep Neural Network Bottleneck Features for Distant Large Vocabulary Speech Recognition, , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4540-4544, 2015 |
[DOI] |
Proceedings of Interspeech (2014)
Automatic Speech Recognition and Translation of a Swiss German Dialect: Walliserdeutsch, , and , in: Proceedings of Interspeech, 2014 |
|
Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014) (2014)
Development of Bilingual ASR System for MediaParl Corpus, , , and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014 |
|
| 1 | 2 |