Keywords:
- ASR
- audio processing
- Automatic Speech Recognition
- binary masking
- constrained structural maximum a posteriori linear regression
- cross-lingual speaker adaptation
- data-driven enhancement
- decision tree marginalization
- decision trees
- dialogue
- Discourse Annotation
- domain adaptation
- hidden Markov models
- HMM state mapping
- HMM-based TTS
- Language Models
- Machine Translation
- microphone array
- minimum generation error
- multilingual acoustic modeling
- neural network
- overlapping speech recognition
- pattern matching
- personality impressions
- phonological constraints
- phonological knowledge
- regression class tree
- reliability estimation
- Representation and Processing
- speaker adaptation
- speech recognition
- speech separation
- speech synthesis
- Statistical parametric speech synthesis
- supervision
- temporal alignment
- time synchronisation
- time synchronization
- time-frequency analysis
- under-resourced languages
- unified models
- universal phoneme set
- unsupervised cross-lingual speaker adaptation
- verbal analysis
- vlogs
- vocal tract length normalization
- Web data
- youtube
Publications of John Dines sorted by journal and type
| 1 | 2 |
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) (2014)
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues, , , , , , , , , , , , , and , in: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, European Language Resources Association (ELRA), 2014 |
[URL] |
Proceedings in International conference on Speech and Signal processing (2012)
COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH HIERARCHIAL LINEAR TRANSFORMATIONS, , , and , in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012 |
|
Proceedings of Interspeech (2012)
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
Actes de la conférence conjointe JEP-TALN-RECITAL 2012 (2012)
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , in: Actes de la conference conjointe JEP-TALN-RECITAL 2012, Grenoble, France, pages 193-200, ATALA/AFCP, 2012 |
|
Proceedings of Interspeech (2012)
Supervised and unsupervised Web-based language model domain adaptation, , , and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
|
Proceedings of Interspeech (2011)
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , in: Proceedings of Interspeech, Florence, Italy, pages 537-540, 2011 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2011)
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prag, CZ, pages 5012-5015, 2011 |
|
Proceedings of Interspeech (2011)
Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation, and , in: Proceedings of Interspeech, Florence, Italy, 2011 |
|
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (2010)
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010 |
|
Proceedings of Interspeech (2010)
An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
Proceedings of ISCA Speech Synthesis Workshop (2010)
Implementation of VTLN for Statistical Speech Synthesis, , , and , in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
|
Proceedings of the ACL 2010 System Demonstrations (2010)
Personalising speech-to-speech translation in the EMIME project, , , , , , , , , , , , , , , , , , and , in: Proceedings of the ACL 2010 System Demonstrations, Association for Computational Linguistics, Uppsala, Sweden, 2010 |
[URL] |
Proceedings of the 7th ISCA Speech Synthesis Workshop (2010)
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project, , , , , , , , , , , , , , , , , , and , in: Proceedings of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
|
Proceedings of Interspeech (2010)
The AMIDA 2009 Meeting Transcription System, , , , , , , , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Tracter: A Lightweight Dataflow Framework, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Proceedings of ICASSP (2010)
VTLN Adaptation for Statistical Speech Synthesis, , , and , in: Proceedings of ICASSP, Dallas, Texas, 2010 |
|
Proceedings of Interspeech (2009)
Measuring the gap between HMM-based ASR and TTS, , and , in: Proceedings of Interspeech, Brighton, U.K., 2009 |
|
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2009)
Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009 |
|
Proceedings of Interspeech (2009)
Real-Time ASR from Meetings, , , , , , , , and , in: Proceedings of Interspeech, Brighton, UK., 2009 |
|
Speech recognition with speech synthesis models by marginalising over decision tree leaves, , and , in: Proceedings of Interspeech, Brighton, U.K., 2009 |
|
Proceedings of INTERSPEECH, September 2008 (2008)
Maximum kurtosis beamforming with the generalized sidelobe canceller, , , , , and , in: Proceedings of INTERSPEECH, September 2008, Brisbane, Australia, 2008 |
|
International Conference on Multimodal Interfaces (2008)
Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, , , and , in: International Conference on Multimodal Interfaces, Chania, Greece, 2008 |
|
3rd Joint Workshop on Multimodal Interaction and Related Machine LEarning Algorithms {MLMI'06} (2006)
Juicer: A Weighted Finite-State Transducer speech decoder, , , , , and , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine LEarning Algorithms MLMI'06, 2006 |
|
Int. Conf. on Spoken Language Processing ({Interspeech ICSLP}) (2006)
The segmentation of multi-channel meeting recordings for automatic speech recognition, , and , in: Int. Conf. on Spoken Language Processing (Interspeech ICSLP), 2006 |
|
Proceedings of ICSLP, 2004 (2004)
Using RASTA in task independent TANDEM feature extraction, , and , in: Proceedings of ICSLP, 2004, 2004 |
|
| 1 | 2 |