Keywords:
- acoustic modeling
- AMI Meetings
- Automatic Speech Recognition
- Channel selection
- Confidence Measure (CM)
- far-field speech
- information bottleneck
- Information Bottleneck clustering
- KeyWord Spotting (KWS)
- Language IDentification (LID)
- Large Vocabulary Continuous Speech Recognition (LVCSR)
- LVCSR
- meetings
- mutual information
- Out- Of-Language (OOL) detection
- Out-Of-Language (OOL) detection
- Overlap speech
- Paralinguistic
- Prosodic features
- Speaker Diarization
- Speaker Role Labeling
- speech recognition
- speech recognition.
- Spoken Language Understanding
- Spoken Term Detection (STD)
- Spontaneous Conversation
- Subs-ace Gaussian Mixture Models
- System Combination
- TANDEM features
- temporal modulations
- Turn-taking features
Publications of Fabio Valente
| 1 | 2 |
2009
An Information Theoretic Approach to Speaker Diarization of Meeting Data, , and , in: IEEE Transactions on Audio Speech and Language Processing, 17(7), 2009 |
[DOI] |
Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, , , and , in: Proceedings of the 10thAnnual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009 |
|
KL Realignment for Speaker Diarization with Multiple Feature Streams, , and , in: 10th Annual Conference of the International Speech Communication Association, 2009 |
MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009 |
|
Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, , and , in: Proceedings of International conference on acoustics speech and signal processing, 2009 |
2008
An Information Theoretic Approach to Speaker Diarization of Meeting Data, , and , Idiap-RR-58-2008 |
|
COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, , and , Idiap-RR-26-2008 |
|
Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, , and , in: Interspeech 2008, 2008 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , Idiap-RR-12-2008 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , in: Interspeech 2008, 2008 |
|
2007
AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2007 |
|
AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , Idiap-RR-31-2007 |
|
Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, , and , Idiap-RR-51-2007 |
|
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , Idiap-RR-45-2007 |
|
Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , in: Interspeech 2007, 2007 |
|
Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , Idiap-RR-08-2007 |
|
Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, , and , in: Interspeech 2007, 2007 |
|
Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, , and , Idiap-RR-09-2007 |
|
2006
Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , Idiap-RR-61-2006 |
|
Discriminant linear processing of time-frequency plane, and , in: International Conference on Spoken Language Processing, 2006 |
|
Discriminant linear processing of time-frequency plane, and , Idiap-RR-20-2006 |
|
Infinite Models for Speaker Clustering, , in: International Conference on Spoken Language Processing, 2006 |
|
Infinite Models for Speaker Clustering, , Idiap-RR-19-2006 |
|
| 1 | 2 |