Keywords:
- ASR
- audio processing
- Automatic Speech Recognition
- binary masking
- constrained structural maximum a posteriori linear regression
- cross-lingual speaker adaptation
- data-driven enhancement
- decision tree marginalization
- decision trees
- dialogue
- Discourse Annotation
- domain adaptation
- hidden Markov models
- HMM state mapping
- HMM-based TTS
- Language Models
- Machine Translation
- microphone array
- minimum generation error
- multilingual acoustic modeling
- neural network
- overlapping speech recognition
- pattern matching
- personality impressions
- phonological constraints
- phonological knowledge
- regression class tree
- reliability estimation
- Representation and Processing
- speaker adaptation
- speech recognition
- speech separation
- speech synthesis
- Statistical parametric speech synthesis
- supervision
- temporal alignment
- time synchronisation
- time synchronization
- time-frequency analysis
- under-resourced languages
- unified models
- universal phoneme set
- unsupervised cross-lingual speaker adaptation
- verbal analysis
- vlogs
- vocal tract length normalization
- Web data
- youtube
Publications of John Dines sorted by first author
| 1 | 2 |
L
Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation, and , in: Proceedings of Interspeech, Florence, Italy, 2011 |
|
Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation, and , Idiap-RR-17-2011 |
|
An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , Idiap-RR-16-2010 |
|
An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010 |
|
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , Idiap-RR-05-2010 |
|
M
Improving Continuous Speech Recognition System Performance with Grapheme Modelling, , , and , Idiap-RR-16-2005 |
|
Phoneme vs Grapheme Based Automatic Speech Recognition, , , and , Idiap-RR-48-2004 |
|
On the Use of Information Retrieval Measures for Speech Recognition Evaluation, , , , , , and , Idiap-RR-73-2004 |
|
Juicer: A Weighted Finite-State Transducer speech decoder, , , , , and , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine LEarning Algorithms MLMI'06, 2006 |
|
Juicer: A Weighted Finite-State Transducer speech decoder, , , , , and , Idiap-RR-21-2006 |
|
P
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues, , , , , , , , , , , , , and , in: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, European Language Resources Association (ELRA), 2014 |
[URL] |
S
Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis, , and , in: IEEE Transactions on Audio, Speech and Language Processing, 2012 |
|
Implementation of VTLN for Statistical Speech Synthesis, , , and , Idiap-RR-32-2010 |
|
Implementation of VTLN for Statistical Speech Synthesis, , , and , in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
|
Study of Jacobian Normalization for VTLN, , and , Idiap-RR-25-2010 |
|
VTLN Adaptation for Statistical Speech Synthesis, , , and , in: Proceedings of ICASSP, Dallas, Texas, 2010 |
|
VTLN Adaptation for Statistical Speech Synthesis, , , and , Idiap-RR-41-2009 |
|
VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, , , and , Idiap-RR-12-2012 |
|
Combining Vocal Tract Length Normalization with Hierarchical Linear Transformations, , , and , in: IEEE Journal of Selected Topics in Signal Processing - Special Issue on Statistical Parametric Speech Synthesis, 8(2):262 - 272, 2014 |
[DOI] |
Bias Adaptation for Vocal Tract Length Normalization, , , and , Idiap-RR-12-2013 |
|
Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, , , and , Idiap-RR-11-2012 |
|
COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH HIERARCHIAL LINEAR TRANSFORMATIONS, , , and , in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012 |
|
W
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project, , , , , , , , , , , , , , , , , , and , in: Proceedings of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
|
| 1 | 2 |