Keywords:
Publications of Hynek Hermansky
2020
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , Idiap-RR-01-2020 |
|
2019
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019 |
[URL] |
2012
Phase AutoCorrelation (PAC) features for noise robust speech recognition, , , and , in: Speech Communication, 54(7):867–880, 2012 |
[DOI] |
2011
Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, , , , and , in: IEEE Transcations on Audio, Speech, and Language Processing, 19(2):225-241, 2011 |
|
2010
Autoregressive Models of Amplitude Modulations in Audio Compression, , and , in: IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2010 |
[URL] |
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , Idiap-RR-41-2010 |
|
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , , and , in: EURASIP Journal on Audio Speech and Music Processing, 2010(856280), 2010 |
[DOI] [URL] |
2009
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , Idiap-RR-35-2009 |
|
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009 |
[URL] |
Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, , and , in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, ISCA 2009, 2009 |
|
Autoregressive Models of Amplitude Modulations in Audio Compression, , and , Idiap-RR-33-2009 |
|
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , Idiap-RR-34-2009 |
|
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , in: Audio Engineering Society (AES,',','), 127th Convention, Audio Engineering Society (AES), Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA;, 2009 |
[URL] |
Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, , and , Idiap-RR-04-2009 |
|
Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009 |
|
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , and , Idiap-RR-32-2009 |
|
2008
A Data-driven Approach to Speech/Non-speech Detection, and , Idiap-RR-23-2008 |
|
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , in: AES 124th Convention, Audio Engineering Society, 2008 |
|
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , Idiap-RR-40-2008 |
|
Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree, , , , , , and , in: Advances in Neural Information Processing Systems 21, 2008 |
|
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , Idiap-RR-20-2008 |
|
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , in: Proceedings of Interspeech, 2008 |
|
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , Idiap-RR-24-2008 |
|
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , in: Proc. 16th European Signal Processing Conference (EUSIPCO), 2008 |
|
Entropy coding of Quantized Spectral Components in FDLP audio codec, , and , Idiap-RR-71-2008 |
|
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , in: "IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)", 2008 |
|
Exploiting contextual information for speech/non-speech detection, and , Idiap-RR-22-2008 |
|
Exploiting Contextual Information for Speech/Non-Speech Detection, , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Exploiting temporal context for speech/non-speech detection, , and , Idiap-RR-21-2008 |
|
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , Idiap-RR-45-2008 |
|
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , in: Workshop on Searching Spontaneous Conversational Speech at SIGIR, 2008 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , Idiap-RR-17-2008 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , in: Interspeech 2008, 2008 |
|
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , Idiap-RR-42-2008 |
|
Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , in: MLMI 2008, 2008 |
|
Hilbert Envelope Based Specto-Temporal Features for Phoneme Recognition in Telephone Speech, , and , Idiap-RR-18-2008 |
|
Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, , and , in: Interspeech 2008, 2008 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , Idiap-RR-25-2008 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , in: Interspeech 2008, 2008 |
|
Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , Idiap-RR-75-2008 |
|
MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, , and , Idiap-RR-74-2008 |
|
Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , Idiap-RR-70-2008 |
|
Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , in: Journal of Acoustical Society of America - Express Letters, 2008 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , Idiap-RR-12-2008 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , in: Interspeech 2008, 2008 |
|
Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, , , , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|