Keywords:
- acoustic generators
- Artificial Neural Networks
- deep neural networks
- Delays
- direction-of-arrival estimation
- DOA estimation
- domain adaptation
- Encoding
- Estimation
- Feature extraction
- human-robot interaction
- likelihood-based encoding
- microphone arrays
- Microphones
- multiple sound sources
- multiple speaker detection
- network output
- neural nets
- neural network-based sound source localization methods
- neural networks
- Position measurement
- Robots
- simultaneous detection
- single sound source
- sound mixtures
- sound source localization
- spatial spectrum-based approaches
- speaker recognition
- training
- weakly-supervised learning
Publications of Weipeng He sorted by title
A
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, Idiap-Com-01-2019
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019
D
Deep Learning Approaches for Auditory Perception in Robotics, École polytechnique fédérale de Lausanne, 2021
Deep Neural Networks for Multiple Speaker Detection and Localization, Idiap-RR-02-2018
Deep Neural Networks for Multiple Speaker Detection and Localization, in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia, pages 74-79, 2018
J
Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, in: Proceedings of Interspeech, pages 312-316, 2018
Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, Idiap-RR-17-2018
M
Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction, in: Proceedings of Interspeech 2021, 2021
N
Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:1303-1317, 2021
S
Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks, in: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, pages 7499-7503, 2020
T
The MuMMER data set for Robot Perception in multi-party HRI Scenarios, in: Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, 2020