All publications sorted by author
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 |
L
Observations on Multi-Band Asynchrony in Distant Speech Recordings, , Idiap-RR-74-2006 |
![]() |
Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, , Ecole Polytechnique Fédérale de Lausanne, 2006 |
![]() |
Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, , Idiap-RR-77-2006 |
![]() |
Sector-Based Detection for Hands-Free Speech Enhancement in Cars, , and , in: EURASIP Journal on Applied Signal Processing, Special Issue on Advances in Multimicrophone Speech Processing, 2006 |
![]() |
Multichannel Speech Enhancement in Cars: Explicit vs. Implicit Adaptation Control, , and , in: Proceedings of HSCMA 2005, 2005 |
![]() |
Sector-Based Detection for Hands-Free Speech Enhancement in Cars, , and , Idiap-RR-67-2004 |
![]() |
A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, and , in: Proceedings of ICASSP 2005, 2005 |
![]() |
A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, and , Idiap-RR-54-2004 |
![]() |
Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels, , and , Idiap-RR-09-2006 |
![]() |
Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, , and , in: Proceedings of ICASSP 2006, 2006 |
![]() |
A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR, , and , in: Proceedings of INTERSPEECH 2005, 2005 |
![]() |
A Frequency-Domain Silence Noise Model, , and , Idiap-RR-13-2005 |
![]() |
Unsupervised Spectral Subtraction for Noise-Robust ASR, , , and , in: Proceedings of the 2005 IEEE ASRU Workshop, 2005 |
![]() |
Unsupervised Spectral Substraction for Noise-Robust ASR, , , and , Idiap-RR-42-2005 |
![]() |
Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, , , and , Idiap-RR-52-2005 |
![]() |
A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, and , in: Proceedings of the 2004 SAPA Workshop, 2004 |
![]() |
A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, and , Idiap-RR-15-2004 |
![]() |
Location Based Speaker Segmentation, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
![]() |
Location Based Speaker Segmentation, and , Idiap-RR-43-2002 |
![]() |
Segmenting Multiple Concurrent Speakers Using Microphone Arrays, , and , in: Proceedings of Eurospeech 2003, 2003 |
![]() |
Segmenting Multiple Concurrent Speakers Using Microphone Arrays, , and , Idiap-RR-21-2003 |
![]() |
Unsupervised Location-Based Segmentation of Multi-Party Speech, , and , in: Proceedings of the 2004 ICASSP-NIST Meeting Recognition Workshop, 2004 |
![]() |
Short-Term Spatio-Temporal Clustering of Sporadic and Concurrent Events, , and , Idiap-RR-14-2004 |
![]() |
Short-Term Spatio-Temporal Clustering Applied to Multiple Moving Speakers, and , in: IEEE Transactions on Audio, Speech and Language Processing, 15(5):15, 2007 |
![]() |
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , in: Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag, 2005 |
![]() |
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , Idiap-RR-28-2004 |
![]() |
The Mobile Data Challenge: Big Data for Mobile Computing Research, , , , , , , , and , in: Pervasive Computing, Newcastle, 2012 |
![]() |
From Big Smartphone Data to Worldwide Research: The Mobile Data Challenge, , , , , , , and , in: Pervasive and Mobile Computing, 9(6):752–771, 2013 |
![]() |
International Conference on the Voynich Manuscript 2022, , , , , , and , in: Proceedings of the International Conference on Historical Cryptology, 2023 |
Probabilistic Amplitude Demodulation features in Speech Synthesis for Improving Prosody, , and , Idiap-RR-12-2016 |
![]() |
Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
![]() |
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , Idiap-RR-22-2016 |
![]() |
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , in: 9th ISCA Speech Synthesis Workshop, 2016 |
![]() |
Syllable-based Regional Swiss French Accent Identification using Prosodic Features, , , and , in: Nouveaux cahiers de linguistique francaise, 2014 |
![]() |
Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages, , , , and , in: Proceedings of the International Workshop on Spoken Language Translation, Seattle, WA, USA, 2016 |
![]() |
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , Idiap-RR-03-2014 |
![]() |
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , in: Speech Prosody, 2014 |
![]() |
SWISS FRENCH REGIONAL ACCENT IDENTIFICATION, , , , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
![]() |
DNN-based Speech Synthesis: Importance of input features and training data, , and , in: International Conference on Speech and Computer , SPECOM, pages 193-200, Springer Berlin Heidelberg, 2015 |
![]() [DOI] |
Multimodal Person Recognition in Audio-Visual Streams, , EPFL, 2019 |
![]() [DOI] |
Long-Term Time-Sensitive Costs for CRF-Based Tracking by Detection, , and , in: 2nd Workshop on Benchmarking Multi-target Tracking: MOTChallenge 2016, Amsterdam, 2016 |
![]() |
Temporally Subsampled Detection for Accurate and Efficient Face Tracking and Diarization, , , and , in: International Conference on Pattern Recognition, Cancun, Mexico, IEEE, 2016 |
![]() |
EUMSSI team at the MediaEval Person Discovery Challenge 2016, , and , in: MediaEval Benchmarking Initiative for Multimedia Evaluation, Hilversum, Netherlands, 2016 |
![]() |
Improving speech embedding using crossmodal transfer learning with audio-visual data, and , in: Multimedia Tools and Applications, 78(11):15681-15704, 2019 |
[DOI] |
Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization, and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 2257-2261, 2018 |
![]() [DOI] |
Improving speaker turn embedding by crossmodal transfer learning from face embedding, and , in: ICCV Workshop on Computer Vision for Audio-Visual Media, 2017 |
![]() |
A Domain Adaptation Approach to Improve Speaker Turn Embedding Using Face Representation, and , in: ACM International Conference on Multimodal Interaction, Glasgow, Scotland, ACM, 2017 |
![]() |
Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media, and , in: ACM Multimedia, Amsterdam, ACM, 2016 |
![]() |
Towards large scale multimedia indexing: A case study on person discovery in broadcast news, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
![]() |
EUMSSI team at the MediaEval Person Discovery Challenge, , , and , in: Working Notes Proceedings of the MediaEval 2015 Workshop, Wurzen, Germany, 2015 |
![]() [URL] |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 |