Keywords:
Publications of Hynek Hermansky sorted by journal and type
Publications of type Idiap-RR
2020
| AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , Idiap-RR-01-2020 |
|
2010
| The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , Idiap-RR-41-2010 |
|
2009
| APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , Idiap-RR-35-2009 |
|
| Autoregressive Models of Amplitude Modulations in Audio Compression, , and , Idiap-RR-33-2009 |
|
| MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , Idiap-RR-34-2009 |
|
| Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, , and , Idiap-RR-04-2009 |
|
2008
| A Data-driven Approach to Speech/Non-speech Detection, and , Idiap-RR-23-2008 |
|
| Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , Idiap-RR-40-2008 |
|
| Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , Idiap-RR-20-2008 |
|
| Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , Idiap-RR-24-2008 |
|
| Entropy coding of Quantized Spectral Components in FDLP audio codec, , and , Idiap-RR-71-2008 |
|
| Exploiting contextual information for speech/non-speech detection, and , Idiap-RR-22-2008 |
|
| Exploiting temporal context for speech/non-speech detection, , and , Idiap-RR-21-2008 |
|
| Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , Idiap-RR-45-2008 |
|
| Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , Idiap-RR-17-2008 |
|
| Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , Idiap-RR-42-2008 |
|
| Hilbert Envelope Based Specto-Temporal Features for Phoneme Recognition in Telephone Speech, , and , Idiap-RR-18-2008 |
|
| Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , Idiap-RR-25-2008 |
|
| Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , Idiap-RR-75-2008 |
|
| MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, , and , Idiap-RR-74-2008 |
|
| Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , Idiap-RR-70-2008 |
|
| On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , Idiap-RR-12-2008 |
|
| Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, , and , Idiap-RR-41-2008 |
|
| Reverse Correlation for analyzing MLP Posterior Features in ASR, , and , Idiap-RR-13-2008 |
|
| Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , Idiap-RR-16-2008 |
|
| Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, , and , Idiap-RR-05-2008 |
|
| Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, , , and , Idiap-RR-69-2008 |
|
2007
| Analysis of Confusion Matrix to Combine Evidence for Phoneme Recognition, , , and , Idiap-RR-27-2007 |
|
| Comparing Different Word Lattice Rescoring Approaches Towards Keyword Spotting, , , and , Idiap-RR-32-2007 |
|
| Exploiting Contextual Information for Improved Phoneme Recognition, , , and , Idiap-RR-65-2007 |
|
| Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting, , and , Idiap-RR-11-2007 |
|
| Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , Idiap-RR-45-2007 |
|
| Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , Idiap-RR-08-2007 |
|
| Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, , and , Idiap-RR-09-2007 |
|
| Non-uniform QMF Decomposition for Wide-band Audio Coding based on Frequency Domain Linear Prediction, , , and , Idiap-RR-43-2007 |
|
| On Confusions in a Phoneme Recognizer, , and , Idiap-RR-10-2007 |
|
| Scalable Wide-band Audio Codec based on Frequency Domain Linear Prediction, , , and , Idiap-RR-16-2007 |
|
| Significance of Contextual Information in Phoneme Recognition, , , and , Idiap-RR-28-2007 |
|
| Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , Idiap-RR-48-2007 |
|
2006
| Audio Coding Based on Long Temporal Contexts, , , and , Idiap-RR-30-2006 |
|
| Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , Idiap-RR-61-2006 |
|
| Discriminant linear processing of time-frequency plane, and , Idiap-RR-20-2006 |
|
| Identifying unexpected words using in-context and out-of-context phoneme posteriors, and , Idiap-RR-68-2006 |
|
| Speech Coding based on Spectral Dynamics, , , and , Idiap-RR-05-2006 |
|
| Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , Idiap-RR-58-2006 |
|
2005
| Improving Continuous Speech Recognition System Performance with Grapheme Modelling, , , and , Idiap-RR-16-2005 |
|
| Multi-resolution RASTA filtering for TANDEM-based ASR, and , Idiap-RR-18-2005 |
|
| The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), , and , Idiap-RR-63-2005 |
|