Publications of Hynek Hermansky sorted by journal and type
Publications of type Idiap-RR
2005
Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition, and , Idiap-RR-64-2005 |
|
2004
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , Idiap-RR-19-2004 |
|
LP-TRAP: Linear predictive temporal patterns, , and , Idiap-RR-59-2004 |
|
New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, , , and , Idiap-RR-29-2004 |
|
Phase AutoCorrelation (PAC) Features for Noise Robust ASR, , , and , Idiap-RR-40-2004 |
Phoneme vs Grapheme Based Automatic Speech Recognition, , , and , Idiap-RR-48-2004 |
|
PLP$^2$: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns, , and , Idiap-RR-60-2004 |
|
Stochastic techniques in deriving perceptual knowledge, , Idiap-RR-84-2004 |
|
2003
Nonlinear Spectral Transformations for Robust Speech Recognition, , and , Idiap-RR-36-2003 |
|
On Use of Task Independent Training Data in Tandem Feature Extraction, and , Idiap-RR-57-2003 |
|
Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, , , and , Idiap-RR-54-2003 |
|
Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research, and , Idiap-RR-81-2003 |
|
Some Emerging Concepts in Speech Recognition., and , Idiap-RR-82-2003 |
|
Spectral Entropy Based Feature for Robust ASR, , , and , Idiap-RR-56-2003 |
|
TRAP-TANDEM: Data-driven extraction of temporal features from speech, , Idiap-RR-50-2003 |
|
1998
Automatic Speech Recognition: an Auditory Perspective, , and , Idiap-RR-17-1998 |
2009
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , and , Idiap-RR-32-2009 |
|
EURASIP Journal on Audio Speech and Music Processing
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , , and , in: EURASIP Journal on Audio Speech and Music Processing, 2010(856280), 2010 |
[DOI] [URL] |
IEEE Signal Processing Letters
Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, , and , in: IEEE Signal Processing Letters, 2008 |
|
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING
Autoregressive Models of Amplitude Modulations in Audio Compression, , and , in: IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2010 |
[URL] |
IEEE Transactions on Audio, Speech, and Language Processing
Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(2):225-241, 2011 |
|
Journal of Acoustical Society of America - Express Letters
Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , in: Journal of Acoustical Society of America - Express Letters, 2008 |
|
Speech Communication
Phase AutoCorrelation (PAC) features for noise robust speech recognition, , , and , in: Speech Communication, 54(7):867–880, 2012 |
[DOI] |
Speech Processing in the Auditory System (2000)
Automatic Speech Recognition: an Auditory Perspective, , and , in: Speech Processing in the Auditory System, Springer Verlag, New York, 2000 |
11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (2019)
AM-FM Decomposition of Speech Signal: Applications for Speech Privacy and Diagnosis, , , , and , in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019 |
[URL] |
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09. (2009)
Applications of Signal Analysis Using Autoregressive Models for Amplitude Modulation, , , and , in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009 |
[URL] |
10th Annual Conference of the International Speech Communication Association (2009)
Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, , and , in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, ISCA 2009, 2009 |
|
12th International Conference on Text, Speech and Dialogue, TSD 2009 (2009)
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
Audio Engineering Society (AES), 127th Convention (2009)
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , in: Audio Engineering Society (AES), 127th Convention, Audio Engineering Society (AES), Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA, 2009 |
[URL] |
Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2009)
Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009 |
|
AES 124th Convention, Audio Engineering Society (2008)
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , in: AES 124th Convention, Audio Engineering Society, 2008 |
|
Advances in Neural Information Processing Systems 21 (2008)
Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree, , , , , , and , in: Advances in Neural Information Processing Systems 21, 2008 |
|
Proceedings of Interspeech (2008)
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , in: Proceedings of Interspeech, 2008 |
|
Proc. 16th European Signal Processing Conference (EUSIPCO) (2008)
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , in: Proc. 16th European Signal Processing Conference (EUSIPCO), 2008 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2008)
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Text, Speech and Dialogue (2008)
Exploiting Contextual Information for Speech/Non-Speech Detection, , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Workshop on Searching Spontaneous Conversational Speech at SIGIR (2008)
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , in: Workshop on Searching Spontaneous Conversational Speech at SIGIR, 2008 |
|
Interspeech 2008 (2008)
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , in: Interspeech 2008, 2008 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2008)
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
MLMI 2008 (2008)
Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , in: MLMI 2008, 2008 |
|
Interspeech 2008 (2008)
Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, , and , in: Interspeech 2008, 2008 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , in: Interspeech 2008, 2008 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , in: Interspeech 2008, 2008 |
|
Text, Speech and Dialogue (2008)
Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, , , , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
11th International Conference on Text, Speech, and Dialogue (2008)
Reverse Correlation for analyzing MLP Posterior Features in ASR, , and , in: 11th International Conference on Text, Speech, and Dialogue, 2008 |
|
INTERSPEECH 2008 (2008)
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , in: INTERSPEECH 2008, 2008 |
|
EUSIPCO 2008 (2008)
Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, , and , in: EUSIPCO 2008, 2008 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2008)
Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Proceedings of the International Conference on Multimodal Interfaces (2008)
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , in: Proceedings of the International Conference on Multimodal Interfaces, 2008 |
|