Update cookies preferences
 logo Idiap Research Institute        
All publications sorted by author
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 |


H

Towards utterance-based neural network adaptation in acoustic modeling, Ivan Himawan, Petr Motlicek, Marc Ferras and Srikanth Madikeri, in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015
attachment
Blind acoustic source separation for cocktail party speech recognition, H. Hong, Seunjin Choi, Hervé Glotin and Frédéric Berthommier, in: ICONIP, 7th IEEE Int. Conf. on Neural Information Processing, 2000
Emphasis Recreation for TTS using Intonation Atoms, Pierre-Edouard Honnet and Philip N. Garner, in: 9th ISCA Speech Synthesis Workshop, pages 14--20, 2016
attachment
[DOI]
Atom Decomposition-based Intonation Modelling, Pierre-Edouard Honnet, Branislav Gerazov and Philip N. Garner, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4744--4748, IEEE, 2015
attachment
[DOI]
GENERALIZABILITY OF PREDICTIVE AND GENERATIVE SPEECH ENHANCEMENT MODELS TO PATHOLOGICAL SPEAKERS, Mingchi Hou, Ante Jukic and Ina Kodrasi, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026
attachment
INFLUENCE OF CLEAN SPEECH CHARACTERISTICS ON SPEECH ENHANCEMENT PERFORMANCE, Mingchi Hou and Ina Kodrasi, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026
attachment
Which private attributes do VLMs agree on and predict well?, Olena Hrynenko, Darya Baranouskaya, Alina Elena Baia and Andrea Cavallaro, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026
attachment
Identifying Privacy Personas, Olena Hrynenko and Andrea Cavallaro, in: Proceedings on Privacy Enhancing Technologies, 2025
attachment
Towards Audio-Visual On-line Diarization Of Participants In Group Meetings, Hayley Hung and Gerald Friedland, in: European Conference on Computer Vision Workshop on Multi-camera and Multi-modal Sensor Fusion, 2008
attachment
Estimating Cohesion in Small Groups Using Audio-Visual Nonverbal Behavior, Hayley Hung and Daniel Gatica-Perez, in: IEEE Trans. on Multimedia, Special Issue on Multimodal Affective Interaction, 12(6):563 - 575, 2010
attachment
Identifying Dominant People in Meetings from Audio-Visual Sensors, Hayley Hung and Daniel Gatica-Perez, in: International Conference on Automatic Face and Gesture Recognition, Amsterdam, The Netherlands, 2008
attachment
Estimating Dominance in Multi-Party Meetings Using Speaker Diarization, Hayley Hung, Yan Huang, Gerald Friedland and Daniel Gatica-Perez, in: IEEE Transactions on Audio, Speech, and Language Processing, 19(4):847-860, 2011
attachment
Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, Hayley Hung, Yan Huang, Chuohao Yeo and Daniel Gatica-Perez, in: First IEEE Workshop on CVPR for Human Communicative Behavior Analysis, 2008
attachment
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 |