Keywords:
- acoustic modeling
- Ad hoc array calibration
- Ad hoc microphone array calibration
- Ad-hoc microphone array
- Ad-hoc microphone calibration
- adaptive training
- Adequacy of diffuseness
- AFC
- Afrikaans
- Artificial Neural Networks
- assessment method
- association rules
- audio–visual speech synchrony
- Auto-associative multilayer perceptrons
- autoencoders
- automatic disambiguation
- Automatic prosodic event detection
- Automatic Speech Recognition
- automatic speech recognition (ASR)
- Autoregressive modeling
- Bayesian recognition
- binary masking
- Binary pattern matching
- bottleneck
- Broadband beam-pattern
- Cadzow algorithm
- canonical correlation analysis
- Cerebral Palsy
- chain models
- Channel selection
- clinical application
- Compressive Acoustic Measurements
- Compressive sampling
- Compressive Sensing
- computer vision
- connectionist temporal classification (ctc)
- Conversational technologies
- Convex optimization
- Convolutive source separation
- crosslingual adaptation
- ctc
- data analysis
- data utility
- deep MLPs
- Deep neural network
- Deep neural network (DNN)
- Deep neural network posterior features
- Deep neural network posterior probabilities
- deep neural networks
- Delay-and-sum beamformer
- Dictionary learning
- difference features
- Diffuse field coherence model
- Diffuse noise coherence
- Diffuse sound coherence model
- Diffuse sound field
- Digital IIR Filters
- Digital IIR Filters
- Directivity
- discourse connectives
- Distant speech recognition
- Distributed source localization.
- dnn
- dnn-based speech recognition
- duration models
- Dynamic Bayesian Network
- Dysarthria
- e2e-lfmmi
- error correction
- Euclidean distance matrix
- evidence combination
- exemplar-based modeling
- far-field asr
- far-field speech
- Fast $k$NN
- fast adaptation
- fast training
- FC
- floor control
- Fujisaki Model
- full combination
- gaming
- Generalized cross correlation
- Generalized Trust Region Subproblem (GTRS).
- Graphemes
- Grassmannian discriminant analysis
- hidden variable
- high-dimensional sparse representations
- HMM/ANN-Hybrid
- HMMs
- human behaviour analysis
- hybrid system
- Image Model
- information bottleneck
- Information Bottleneck clustering
- Information Retrieval
- intelligibility
- Joint sparse recovery
- k-nearest neighbor (kNN) search
- Keyword Detection
- Keyword spotting detection
- KL-divergence
- KL-HMM
- kNN classifier
- Kullback-Leibler divergence
- Kullback-Leibler divergence based hidden Markov model
- Laplacian speech modeling
- Linguistic parsing
- Low bit rate speech vocoding
- low-rank representation (LRR)
- low-rank sparsity
- Matrix completion
- microphone array
- Microphone array calibration
- missing data
- missing features
- ML-adaptation
- Model-Based Compressive Sensing
- Model-based sparse recovery
- models
- multi-band
- multi-band combination
- Multi-party Speech Recognition
- Multi-speaker Localization
- multi-stream
- multi-stream processing
- multiband
- multilayer perceptron
- multilingual acoustic modeling
- multilingual ASR
- multilingual speech recognition
- Multimodal interaction
- multimodal signal processing
- multimodal speaker diarisation
- Multiparty Conversation
- multiparty meetings
- multiple time scales
- mutual information
- nearest neighbour rule of classification.
- neural network
- neural network features
- neural networks
- Noise
- noise adaptation
- noise annoyance
- noise intrusiveness
- noise reduction
- noise robust ASR
- Noisy Text
- Non-negative matrix factorization
- non-verbal features
- Objective intelligibility
- objective measures
- open-architecture distributed system
- Overlap speech
- Overlapping Speech
- overlapping speech recognition
- P-ESTOI
- Pairwise distance estimation
- PCA
- perceptual quality assessment
- Phase transform
- Phone posterior
- Phoneme classification
- phonemes
- Phonological features
- Phonological posterior
- phonological posteriors
- posterior feature
- Posterior feature space
- Posterior features
- Posterior hashing
- posterior probability
- Posterior probability structures
- Posterior representatives
- posterior space properties
- posterior-based metrics
- Principle component analysis
- Pronunciation dewarping
- Prosody Modelling
- Psychoacoustics
- Quantized posterior hashing
- query by example
- real-time processing
- reliability
- representation learning
- Reverberant enclosure
- Reverberation
- robust ASR
- Robust microphone placement
- robust recognition
- Room acoustic characterization
- Room acoustic estimation
- Room Geometry
- Room geometry estima- tion
- S-stress
- Semidefinite programming
- sensor fusion
- Single-channel source localization
- Singular value decomposition
- Social Behaviour Analysis
- Social Interactions
- Social Signal Processing
- Social signals
- soft targets
- sound source localization
- Source localization
- sparse autoencoder
- sparse coding
- Sparse Component Analysis
- Sparse modeling
- sparse overcomplete autoencoder
- Sparse Recovery
- sparse representation
- Sparse Signal Recovery
- Sparse word posterior probabilities
- sparsity
- Speaker Diarization
- Speaker localization
- speaker turn
- spectral amplitude estimation
- Spectral subspace
- Speech
- Speech Analysis
- Speech dereverberation
- Speech enhancement
- Speech intelligibility
- speech modeling
- speech processing
- speech quality
- speech recognition
- speech separation
- Speech source localization
- speech sparsity
- Speech spectral structures
- speech synthesis
- Spoken Documents Retrieval
- spoken term detection
- spontaneous meeting recordings
- Statistical Machine Translation
- Structural similarity measure
- Structured Sparse Coding
- Structured sparse representation
- Structured sparsity
- structured sparsity models
- subbands
- subjective evaluation
- subjective testing
- Subspace detection
- Superdirective beamformer
- SVD
- synchronisation
- synthetic reference templates.
- Tandem
- template-based approach
- temporal modulations
- temporal subspace
- text-to-speech synthesis
- triphone mapping
- TTS
- un- derdetermined convolutive speech separation
- under-resourced languages
- under-resourced speech recognition
- universal phoneme set
- utterance verification
- verb tense
- wav2vec 2.0
- weighting
- word emphasis
Publications of Hervé Bourlard sorted by recency
Connectionist speech recognition, , in: Proceedings of IK'98, Interdisziplinares Kolleg, Spring Scholl, Gunne am Mohnessee, Germany, March 7--14, 1998 |
Confidence Measures in Hybrid HMM/ANN Speech Recognition, and , in: Proceedings of Workshop on Text, Speech and Dialog (TSD'98) Brno, Czech Republic, 1998 |
Automatic Speech Recognition: an Auditory Perspective, , and , in: Speech Processing in the Auditory System, Springer Verlag, New York, 2000 |
The full combination sub-bands approach to noise robust HMM/ANN based ASR, , and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
Non-Stationary Multi-Channel (Multi-Stream) Processing Towards Robust and Adaptive ASR, , in: Proc. of the ESCA Workshop on Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
Multi-stream adaptive evidence combination for noise robust ASR, , , and , Idiap-RR-26-1999 |
|
Iterative Posterior-Based Keyword Spotting Without Filler Models: Iterative Viterbi Decoding and One-Pass Approach, and , Idiap-RR-27-1999 |
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU'99) Workshop, 1999 |
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , Idiap-RR-16-1999 |
INtegrating SPEech acoustic and linguistic Constraints: Baseline System Development, , , and , Idiap-RR-21-1999 |
|
Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR, , and , in: Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
|
Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, and , in: ICSLP, 2000 |
|
Traitement de la Parole, , , , and , Presses Polytechniques Universitaires Romandes, 2000 |
Recent Developments in Speaker Verification at IDIAP, and , Idiap-RR-26-2000 |
|
Neural Networks in Automatic Speech Recognition, , , and , in: to be published in The Handbook of Brain Theory and Neural Networks, Bradford Books, The MIT Press, 2000 |
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , in: Proceedings of the IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 2000 |
HMM2- A Novel Approach to HMM Emission Probability Estimation, , and , in: International Conference on Spoken Langugae Processing (ICSLP 2000), 2000 |
|
From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, , and , in: ISCA ITRW ASR2000, 2000 |
|
Automatic Speech Recognition using Pitch Information in Dynamic Bayesian Networks, , and , Idiap-RR-41-2000 |
|
Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, , , and , in: 6th International Conference on Spoken Language Processing: ICSLP~2000 (Interspeech~2000), 2000 |
|
Auto-Association by Multilayer Perceptrons and Singular Value Decomposition, , Idiap-RR-16-2000 |
|
An EM Algorithm for HMMs with Emission Distributions Represented by HMMs, , and , Idiap-RR-11-2000 |
|
A neural network for classification with incomplete data: application to robust ASR, , , , and , in: Proc. ICSLP, 2000 |
|
Video OCR for Sport Video Annotation and Retrieval, , and , in: Proceedings of the 8th IEEE International Conference on Mechatronics and Machine Vision in Practice, 2001 |
|
User Customized HMM/ANN Based Speaker Verification, and , Idiap-RR-32-2001 |
|
Text Identification in Complex Background using SVM, , and , in: Proceedings of the Int. Conf. on computer vision and pattern recognition, 2001 |
Text Enhancement with Asymmetric Filter for Video OCR, , and , in: Proceedings of the 11th International Conference on Image Analysis and Processing, 2001 |
|
Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework, , and , in: Speech Communication, 40, 2003 |
|
Speech Recognition Using Advanced HMM2 Features, , and , in: Automatic Speech Recognition and Understanding Workshop, 2001 |
|
Speaker Verification Based On User-Customized Password, , and , Idiap-RR-13-2001 |
|
Robust Speech Recognition and Feature Extraction Using HMM2, , , and , in: Computer Speech & Language, 17(2-3), 2003 |
Pronunciation models and their evaluation using confidence measures, and , Idiap-RR-29-2001 |
|
New Approaches Towards Robust and Adaptive Speech Recognition, , and , in: Advances in Neural Information Processing Systems 13, MIT Press, 2001 |
|
Multi-stream adaptive evidence combination for noise robust ASR, , , and , in: Speech Communication, 2001 |
Modeling Auxiliary Information in Bayesian Network Based ASR, , and , in: 7th European Conference on Speech Communication and Technology (Eurospeech~2001), 2001 |
|
Microphone Array Post-filter for Diffuse Noise Field, and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2002 |
|
Microphone Array Post-filter based on Noise Field Coherence, and , in: IEEE Transactions on Speech and Audio Processing, 11(6), 2003 |
|
MAP Combination of Multi-Stream HMM or HMM/ANN Experts, , and , in: Proc. Eurospeech, 2001 |
|
IDIAP HMM/HMM2 System: Theoretical Basis and Software Specifications, , , and , Idiap-RR-27-2001 |
|
HMM2- Extraction of Formant Features and their Use for Robust ASR, , and , in: European Conference on Speech Communication and Technology (Eurospeech 2001), 2001 |
|
From missing data to maybe useful data: soft data modelling for noise robust ASR, , and , in: Proc. WISP, 2001 |
|
Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, and , in: EUROSPEECH, 2001 |
|
Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model, and , in: Speech Communication, 2002 |
|
Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, , and , in: ICASSP, 2001 |
|
A Pragmatic View of the Application of HMM2 for ASR, , and , Idiap-RR-23-2001 |
|
User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, and , in: International Conference on Spoken Language Processing (ICSLP~2002), 2002 |
|
User-Customized Password HMM Based Speaker Verification, and , in: Proceedings of the COST275 Workshop on the Advent of Biometrics on the Internet, 2002 |
|
Unknown-Multiple Speaker clustering using HMM, , , and , in: ICSLP, 2002 |
|
Towards Robust and Adaptive Speech Recognition Models, , and , Idiap-RR-47-2002 |
|
Towards Robust and Adaptive Speech Recognition Models, , and , in: Mathematical Foundations of Speech Processing and Recognition, Springer-Verlag, 2002 |
|