Keywords:
- All-pass filter based bilinear transformations
- Constrained likelihood linear regression
- constrained structural maximum a posteriori linear regression
- cross-lingual speaker adaptation
- decision tree marginalization
- decision trees
- Expectation maximization
- hidden Markov models
- HMM state mapping
- HMM-based automatic speech recognition (ASR)
- HMM-based statistical parametric speech synthesis (HTS)
- Machine Translation
- Mel-generalized cepstral features
- Model transformations
- Rapid feature adaptation
- speaker adaptation
- speech recognition
- speech synthesis
- Statistical parametric speech synthesis
- Unified modeling and adaptation of ASR and TTS
- unified models
- unsupervised cross-lingual speaker adaptation
- vocal tract length normalization
Publications of Lakshmi Saheer sorted by title
A
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Idiap-RR-05-2010
Automatic Staging of Audio with Emotions, in: International Conference on Affective Computing and Intelligent Interaction, 2013
B
Bias Adaptation for Vocal Tract Length Normalization, Idiap-RR-12-2013
C
Combining Vocal Tract Length Normalization with Hierarchial Linear Transformations, in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012
Combining Vocal Tract Length Normalization with Hierarchical Linear Transformations, in: IEEE Journal of Selected Topics in Signal Processing - Special Issue on Statistical Parametric Speech Synthesis, 8(2):262-272, 2014
Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, Idiap-RR-11-2012
Current trends in multilingual speech processing, in: Sadhana, 36(5):885–915, 2011
I
Implementation of VTLN for Statistical Speech Synthesis, Idiap-RR-32-2010
Implementation of VTLN for Statistical Speech Synthesis, in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010
P
Personalising speech-to-speech translation in the EMIME project, in: Proceedings of the ACL 2010 System Demonstrations, Association for Computational Linguistics, Uppsala, Sweden, 2010
Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis, in: Computer Speech and Language, 2011
S
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project, in: Proceedings of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010
Speech recognition with speech synthesis models by marginalising over decision tree leaves, Idiap-RR-17-2009
Speech recognition with speech synthesis models by marginalising over decision tree leaves, in: Proceedings of Interspeech, Brighton, U.K., 2009
Study of Jacobian Normalization for VTLN, Idiap-RR-25-2010
Syllabic Pitch Tuning for Neutral-to-Emotional Voice Conversion, Idiap-RR-31-2015
U
Understanding Factors in Emotion Perception, in: ISCA Speech Synthesis Workshop, 2013
Understanding Factors in Emotion Perception, Idiap-RR-28-2013
Unified Framework Of Feature Based Adaptation For Statistical Speech Synthesis And Recognition, Ecole Polytechnique Federale de Lausanne (EPFL), 2012
V
Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis, in: IEEE Transactions on Audio, Speech and Language Processing, 2012
VTLN Adaptation for Statistical Speech Synthesis, in: Proceedings of ICASSP, Dallas, Texas, 2010
VTLN Adaptation for Statistical Speech Synthesis, Idiap-RR-41-2009
VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, Idiap-RR-12-2012