Keywords:
- acoustic modeling
- AMI Meetings
- Automatic Speech Recognition
- Channel selection
- Confidence Measure (CM)
- far-field speech
- information bottleneck
- Information Bottleneck clustering
- KeyWord Spotting (KWS)
- Language IDentification (LID)
- Large Vocabulary Continuous Speech Recognition (LVCSR)
- LVCSR
- meetings
- mutual information
- Out-Of-Language (OOL) detection
- Overlap speech
- Paralinguistic
- Prosodic features
- Speaker Diarization
- Speaker Role Labeling
- speech recognition
- Spoken Language Understanding
- Spoken Term Detection (STD)
- Spontaneous Conversation
- Subspace Gaussian Mixture Models
- System Combination
- TANDEM features
- temporal modulations
- Turn-taking features
Publications of Fabio Valente sorted by journal and type
Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing (2012)
Improving Acoustic Based Keyword Spotting Using LVCSR Lattices, , and , in: Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, Japan, pages 4413-4416, 2012 |
ACM Multimedia (2012)
Predicting the Conflict Level in Television Political Debates: an Approach Based on Crowdsourcing, Nonverbal Communication and Gaussian Processes, , , and , in: ACM Multimedia, 2012 |
Proceedings of International Conference on Acoustic, Speech and Signal Processing (2012)
Speaker Diarization of Meetings based on large TDOA feature vectors, and , in: Proceedings of International Conference on Acoustic, Speech and Signal Processing, 2012 |
|
INTERSPEECH (2012)
Speaker diarization of overlapping speech based on silence distribution in meeting recordings, and , in: INTERSPEECH, Portland, Oregon, USA, 2012 |
|
Proceedings of Interspeech 2011 (2011)
Analysis and Comparison of Recent MLP Features for LVCSR Systems, , and , in: Proceedings of Interspeech 2011, 2011 |
|
Interspeech (2011)
Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings, and , in: Interspeech, Florence, Italy, pages 953-956, 2011 |
|
Proceedings of Interspeech (2011)
Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus, and , in: Proceedings of Interspeech, 2011 |
|
Proceedings of International Conference on Acoustics, Speech and Signal Processing (2011)
Multistream Speaker Diarization through Information Bottleneck System Outputs Combination, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011 |
|
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2011)
Speaker Diarization of Meetings based on Speaker Role N-gram Models, , and , in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011 |
|
Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (2011)
Understanding Social Signals in Multi-party Conversations: Automatic Recognition of Socio-Emotional Roles in the AMI Meeting Corpus, , , and , in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 374-379, 2011 |
Proceedings of Interspeech, Japan (2010)
A Comparative Study of MLP Front-ends for Mandarin ASR, , , , and , in: Proceedings of Interspeech, Japan, 2010 |
|
Proceedings of Interspeech (2010)
Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , in: Proceedings of Interspeech, 2010 |
2010 IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
Application of Out-Of-Language Detection To Spoken-Term Detection, and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
Proceedings of Interspeech, Makuhari, Japan, 2010 (2010)
English Spoken Term Detection in Multilingual Recordings, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010 |
|
Proceedings of ACM Multimedia Workshop on Social Signal Processing (2010)
Improving Speech Processing through Social Signals: Automatic Speaker Segmentation of Political Debates using Role based Turn-Taking Patterns, and , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010 |
|
International Conference on Acoustics, Speech, and Signal Processing (2010)
Multistream Speaker Diarization beyond Two Acoustic Feature Streams, , and , in: International Conference on Acoustics, Speech, and Signal Processing, 2010 |
|
Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands) (2010)
Social Signal Processing: Understanding Nonverbal Communication in Social Interactions, and , in: Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands), 2010 |
|
Proceedings of ICASSP (2010)
Variational Bayesian Speaker Diarization of Meeting Recordings, , and , in: Proceedings of ICASSP, 2010 |
|
Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech) (2009)
Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, , , and , in: Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009 |
|
10th Annual Conference of the International Speech Communication Association (2009)
KL Realignment for Speaker Diarization with Multiple Feature Streams, , and , in: 10th Annual Conference of the International Speech Communication Association, 2009 |
Proceedings of International Conference on Acoustics, Speech and Signal Processing (2009)
Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2008)
Combination of Agglomerative and Sequential Clustering for Speaker Diarization, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Interspeech 2008 (2008)
Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, , and , in: Interspeech 2008, 2008 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , in: Interspeech 2008, 2008 |
|
IEEE Automatic Speech Recognition and Understanding Workshop (2007)
Agglomerative Information Bottleneck for Speaker Diarization of Meetings Data, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2007 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2007)
Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Interspeech 2007 (2007)
Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , in: Interspeech 2007, 2007 |
|
Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, , and , in: Interspeech 2007, 2007 |
|
International Conference on Spoken Language Processing (2006)
Discriminant linear processing of time-frequency plane, and , in: International Conference on Spoken Language Processing, 2006 |
|
Infinite Models for Speaker Clustering, , in: International Conference on Spoken Language Processing, 2006 |
|