logo Idiap Research Institute        
All publications sorted by author
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 |


H

The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation, Mutian He and Philip N. Garner, in: Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 4408-4423, Association for Computational Linguistics, 2023
[DOI]
Deep Learning Approaches for Auditory Perception in Robotics, Weipeng He, École polytechnique fédérale de Lausanne, 2021
attachment
Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks, Weipeng He, Lu Lu, Biqiao Zhang, Jay Mahadeokar, Kaustubh Kalgaonkar and Christian Fuegen, in: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, pages 7499-7503, 2020
[DOI]
Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:1303-1317, 2021
[DOI]
[URL]
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019
attachment
[DOI]
Deep Neural Networks for Multiple Speaker Detection and Localization, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, AUSTRALIA, pages 74-79, 2018
attachment
[DOI]
Human Tracking and Pose Estimation in Open Spaces, Alexandre Heili, École Polytechnique Fédérale de Lausanne (EPFL), 2014
attachment
Detection-Based Multi-Human Tracking Using a CRF Model, Alexandre Heili, Cheng Chen and Jean-Marc Odobez, in: The Eleventh IEEE International Workshop on Visual Surveillance, 2011
attachment
Parameter Estimation and Contextual Adaptation for a Multi-Object Tracking CRF Model, Alexandre Heili and Jean-Marc Odobez, in: IEEE Workshop on Performance Evaluation of Tracking and Surveillance, 2013
attachment
Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety, Hartmut Helmke, Matthias Kleinert, Shruthi Shetty, Oliver Ohneiser, heiko Ehr, Hörður Arilíusson, Teodor S. Simiganoschi, Amrutha Prasad, Petr Motlicek, Karel Vesely, Karel Ondřej, Pavel Smrz, Julia Harfmann and Christian Windisch, in: Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021), The United States Federal Aviation Administration (FAA), EUROCONTROL, pages 10, 2021
attachment
[URL]
The Unstoppable Rise of Computational Linguistics in Deep Learning, James Henderson, in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Online, pages 6294-6306, Association for Computational Linguistics, 2020
[DOI]
[URL]
A VAE for Transformers with Nonparametric Variational Information Bottleneck, James Henderson and Fabio Fehr, in: The Eleventh International Conference on Learning Representations, 2023
attachment
[URL]
Sparse Probabilistic Classifiers, Romain Hérault and Yves Grandvalet, in: International Conference on Machine Learning (ICML), 2007
attachment
On matching data and model in LF-MMI-based dysarthric speech recognition, Enno Hermann, École polytechnique fédérale de Lausanne, 2023
attachment
[DOI]
[URL]
Dysarthric Speech Recognition with Lattice-Free MMI, Enno Hermann and Mathew Magimai.-Doss, in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020
attachment
[DOI]
[URL]
TRAP-TANDEM: Data-driven extraction of temporal features from speech, Hynek Hermansky, in: large part published in Proceedings of ASRU-2003, 2003
attachment
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), Hynek Hermansky, Petr Fousek and Mikko Lehtonen, in: Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005, 2005
attachment
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 |