Keywords:
Publications of Hynek Hermansky sorted by first author
M
Phoneme vs Grapheme Based Automatic Speech Recognition, , , and , Idiap-RR-48-2004 |
|
Spectral Entropy Based Feature for Robust ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2004 |
|
Spectral Entropy Based Feature for Robust ASR, , , and , Idiap-RR-56-2003 |
|
Automatic Speech Recognition: an Auditory Perspective, , and , Idiap-RR-17-1998 |
Automatic Speech Recognition: an Auditory Perspective, , and , in: Speech Processing in the Auditory System, Springer Verlag, New York, 2000 |
Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, , and , in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, ISCA 2009, 2009 |
|
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , and , Idiap-RR-32-2009 |
|
Entropy coding of Quantized Spectral Components in FDLP audio codec, , and , Idiap-RR-71-2008 |
|
Non-uniform QMF Decomposition for Wide-band Audio Coding based on Frequency Domain Linear Prediction, , , and , Idiap-RR-43-2007 |
|
Scalable Wide-band Audio Codec based on Frequency Domain Linear Prediction, , , and , Idiap-RR-16-2007 |
|
Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, , , , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , , and , in: EURASIP Journal on Audio Speech and Music Processing, 2010(856280), 2010 |
[DOI] [URL] |
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding, , , and , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007 |
|
Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes, , , and , in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2007 |
|
Audio Coding Based on Long Temporal Contexts, , , and , Idiap-RR-30-2006 |
|
Speech Coding based on Spectral Dynamics, , , and , in: Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2006 |
|
Speech Coding based on Spectral Dynamics, , , and , Idiap-RR-05-2006 |
|
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , Idiap-RR-01-2020 |
|
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019 |
[URL] |
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , Idiap-RR-58-2006 |
|
P
Exploiting contextual information for speech/non-speech detection, and , Idiap-RR-22-2008 |
|
A Data-driven Approach to Speech/Non-speech Detection, and , Idiap-RR-23-2008 |
|
Exploiting temporal context for speech/non-speech detection, , and , Idiap-RR-21-2008 |
|
Exploiting Contextual Information for Speech/Non-Speech Detection, , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Comparing Different Word Lattice Rescoring Approaches Towards Keyword Spotting, , , and , Idiap-RR-32-2007 |
|
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , Idiap-RR-20-2008 |
|
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , in: Proceedings of Interspeech, 2008 |
|
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , in: "IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)", 2008 |
|
Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting, , and , Idiap-RR-11-2007 |
|
Significance of Contextual Information in Phoneme Recognition, , , and , Idiap-RR-28-2007 |
|
Reverse Correlation for analyzing MLP Posterior Features in ASR, , and , Idiap-RR-13-2008 |
|
Reverse Correlation for analyzing MLP Posterior Features in ASR, , and , in: 11th International Conference on Text, Speech, and Dialogue, 2008 |
|
Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009 |
|
Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, , , and , Idiap-RR-69-2008 |
|
Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, , , , and , in: IEEE Transcations on Audio, Speech, and Language Processing, 19(2):225-241, 2011 |
|
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , Idiap-RR-45-2008 |
|
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , in: Workshop on Searching Spontaneous Conversational Speech at SIGIR, 2008 |
|
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , Idiap-RR-65-2007 |
|
Analysis of Confusion Matrix to Combine Evidence for Phoneme Recognition, , , and , Idiap-RR-27-2007 |
|
S
On Use of Task Independent Training Data in Tandem Feature Extraction, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
On Use of Task Independent Training Data in Tandem Feature Extraction, and , Idiap-RR-57-2003 |
|
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , Idiap-RR-24-2008 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , Idiap-RR-25-2008 |
|
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , in: Proc. 16th European Signal Processing Conference (EUSIPCO), 2008 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , in: Interspeech 2008, 2008 |
|
T
Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, , and , Idiap-RR-04-2009 |
|
Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, , and , Idiap-RR-05-2008 |
|