Keywords:
Publications of Hynek Hermansky sorted by first author
A
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , Idiap-RR-41-2010 |
|
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , in: Proceedings of the International Conference on Multimodal Interfaces, 2008 |
|
LP-TRAP: Linear predictive temporal patterns, , and , 2004 |
|
PLP$^2$: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns, , and , Idiap-RR-60-2004 |
|
LP-TRAP: Linear predictive temporal patterns, , and , Idiap-RR-59-2004 |
|
F
Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition, and , Idiap-RR-64-2005 |
|
New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, , , and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP), 2004 |
|
New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, , , and , Idiap-RR-29-2004 |
|
G
Autoregressive Models of Amplitude Modulations in Audio Compression, , and , in: IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2010 |
[URL] |
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
Autoregressive Models of Amplitude Modulations in Audio Compression, , and , Idiap-RR-33-2009 |
|
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , Idiap-RR-34-2009 |
|
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , in: Audio Engineering Society (AES,',','), 127th Convention, Audio Engineering Society (AES), Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA;, 2009 |
[URL] |
MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, , and , Idiap-RR-74-2008 |
|
Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , Idiap-RR-75-2008 |
|
Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , Idiap-RR-16-2008 |
|
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , in: INTERSPEECH 2008, 2008 |
|
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , in: AES 124th Convention, Audio Engineering Society, 2008 |
|
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , Idiap-RR-40-2008 |
|
Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , Idiap-RR-48-2007 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , Idiap-RR-17-2008 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , in: Interspeech 2008, 2008 |
|
Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , Idiap-RR-70-2008 |
|
Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , in: Journal of Acoustical Society of America - Express Letters, 2008 |
|
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , Idiap-RR-35-2009 |
|
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009 |
[URL] |
H
Stochastic techniques in deriving perceptual knowledge, , Idiap-RR-84-2004 |
|
TRAP-TANDEM: Data-driven extraction of temporal features from speech, , in: large part published in Proceedings of ASRU-2003, 2003 |
|
TRAP-TANDEM: Data-driven extraction of temporal features from speech, , Idiap-RR-50-2003 |
|
Some Emerging Concepts in Speech Recognition., and , Idiap-RR-82-2003 |
|
Multi-resolution RASTA filtering for TANDEM-based ASR, and , in: Proceedings of Interspeech 2005, 2005 |
|
Multi-resolution RASTA filtering for TANDEM-based ASR, and , Idiap-RR-18-2005 |
|
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), , and , in: Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005, 2005 |
|
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), , and , Idiap-RR-63-2005 |
|
Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research, and , Idiap-RR-81-2003 |
|
I
Nonlinear Spectral Transformations for Robust Speech Recognition, , and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop 2003, 2003 |
|
Nonlinear Spectral Transformations for Robust Speech Recognition, , and , Idiap-RR-36-2003 |
|
Phase AutoCorrelation (PAC) Features for Noise Robust ASR, , , and , Idiap-RR-40-2004 |
Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, , , and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, , , and , Idiap-RR-54-2003 |
|
Phase AutoCorrelation (PAC) features for noise robust speech recognition, , , and , in: Speech Communication, 54(7):867–880, 2012 |
[DOI] |
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , in: Proceedings of the INTERSPEECH-ICSLP-04, 2004 |
|
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , Idiap-RR-19-2004 |
|
K
Identifying unexpected words using in-context and out-of-context phoneme posteriors, and , Idiap-RR-68-2006 |
|
L
On Confusions in a Phoneme Recognizer, , and , 2007 |
|
On Confusions in a Phoneme Recognizer, , and , Idiap-RR-10-2007 |
|
M
Improving Continuous Speech Recognition System Performance with Grapheme Modelling, , , and , Idiap-RR-16-2005 |
|