All publications sorted by author
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |
M
IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, , and , Idiap-RR-36-2012 |
|
IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, , and , in: Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, Japan, pages 4413-4416, 2012 |
Plucking Motions for Tea Harvesting Robots Using Probabilistic Movement Primitives, , , and , in: IEEE International Conference on Robotics and Automation, 2020 |
Evolution of the Mental States Operating a Brain-Computer Interface, , , and , in: Proceedings of the International Federation for Medical and Biological Engineering, 2002 |
|
Using Modality Replacement to Facilitate Communication between Visually and Hearing-Impaired People, , , , and , in: IEEE Multimedia, 18(2):26-37, 2011 |
[DOI] |
Adaptive Ensemble-based Optimisation for Petrophysical Inversion, and , in: Mathematical Geosciences, 2020 |
[DOI] [URL] |
Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models, , and , Idiap-RR-26-2017 |
|
Trustworthy speaker recognition with minimal prior knowledge using neural networks, , Ecole polytechnique fédérale de Lausanne (EPFL), 2019 |
[DOI] [URL] |
Understanding and Visualizing Raw Waveform-based CNNs, , , and , in: Proceedings of Interspeech, 2019 |
|
Gradient-based spectral visualization of CNNs using raw waveforms, , , and , Idiap-RR-11-2018 |
|
Long Term Spectral Statistics for Voice Presentation Attack Detection, , , and , Idiap-RR-11-2017 |
|
Long-Term Spectral Statistics for Voice Presentation Attack Detection, , , and , in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 25(11):2098-2111, 2017 |
|
Towards directly modeling raw speech signal for speaker verification using CNNs, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, CANADA, pages 4884-4888, 2018 |
|
On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 1116-1120, 2018 |
|
End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, , and , in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017 |
|
Towards directly modeling raw speech signal for speaker verification using CNNs, , and , Idiap-RR-30-2017 |
|
Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification, , and , in: International Conference of the Biometrics Special Interest Group (BIOSIG), 2016 |
|
A Probabilistic Approach to Multi-Modal Adaptive Virtual Fixtures, , , , , , and , in: IEEE Robotics and Automation Letters (RA-L), 2024 |
|
On Job Training: Automated Interpersonal Behavior Assessment & Real-Time Feedback, , in: Proceedings of the 25th ACM International Conference on Multimedia, 2017, 2017 |
|
Dites-Moi: Wearable Feedback on Conversational Behavior, , , and , in: Proceedings of the 15th International Conference on Mobile and Ubiquitous Multimedia, 2016 |
|
Examining Linguistic Content and Skill Impression Structure for Job Interview Analytics in Hospitality, and , in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017 |
|
Understanding Applicants' Reactions to Asynchronous Video Interviews through Self-Reports and Nonverbal Cues, , , , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction (ICMI), Utrecht, 2020 |
|
Training on the Job: Behavioral Analysis of Job Interviews in Hospitality, , , , , and , in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 84-91, 2016 |
|
Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes, , and , in: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2018 |
|
A Tale of Two Interactions: Inferring Performance in Hospitality Encounters from Cross-Situation Social Sensing, , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2(129), 2018 |
|
How May I Help You? Behavior and Impressions in Hospitality Service Encounters, , and , in: Proceddings of 19th ACM International Conference on Multimodal Interaction, 2017 |
|
Facing Employers and Customers: What Do Gaze and Expressions Tell About Soft Skills?, , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 121-126, ASSOC COMPUTING MACHINERY, 2018 |
[DOI] |
Socially Intelligent Surveillance and Monitoring: Analysing Social Dimensions of Physical Space, , and , in: Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, San Francisco, pages 51-58, 2010 |
|
Reliability and Validity of Nonverbal Thin Slices in Social Interactions, , , , , , , and , in: Personality and Social Psychology Bulletin, 41(2):199-213, 2014 |
[DOI] |
N
Convolutional Pitch Target Approximation Model for Speech Synthesis, and , Idiap-RR-05-2013 |
|
The Simulation of Mean Radiant Temperature in Outdoor Conditions: A Review of Architectural Tools Calculation Assumptions, , , and , in: Proceedings of Building Simulation 2019: 16th Conference of IBPSA, 2019 |
Learning Optimal Impedance Control During Complex 3D Arm Movements, , , , and , in: IEEE Robotics and Automation Letters (RA-L), 6(2):1248-1255, 2021 |
[DOI] [URL] |
Integrating large language models and ASR systems using confidence measures and prompting, , Idiap-Com-02-2024 |
|
Towards interfacing large language models with ASR systems using confidence measures and prompting, , , , and , in: Proceedings of Interspeech, pages 2980-2984, 2024 |
[DOI] |
Overview of the 8th Workshop on Asian Translation, , , , , , , , , , , , , , , and , in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 1--45, Association for Computational Linguistics, 2021 |
[URL] |
Overview of the 7th Workshop on Asian Translation, , , , , , , , , , , and , in: Proceedings of the 7th Workshop on Asian Translation, Association for Computational Linguistics, 2020 |
[URL] |
Phoneme based Respiratory Analysis of Read Speech, , , and , in: Proceedings of European Signal Processing Conference (EUSIPCO), 2021 |
|
Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings, , , , and , in: Neural Networks, 141:211--224, 2021 |
[DOI] |
EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, and , Idiap-RR-01-2019 |
|
EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, and , in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019 |
|
Keep Sensors in Check: Disentangling Country-Level Generalization Issues in Mobile Sensor-Based Models with Diversity Scores, , , and , in: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023 |
|
Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement, , , and , in: Tenth Italian Conference on Computational Linguistics, 2024 |
Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification, and , in: Proceedings of the 9th Workshop on Representation Learning for NLP, 2024 |
[URL] |
Are there identifiable structural parts in the sentence embedding whole?, and , in: Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2024 |
Exploring Italian sentence embeddings properties through multi-tasking, , , and , in: Tenth Italian Conference on Computational Linguistics, 2024 |
Transferring Activities: Updating Human Behavior Analysis, , , , and , in: Visual Surveillance Workshop at ICCV, 2011 |
|
Learning to learn new models of human activities in indoor settings1, , , and , in: Interactive Multimodal Information Management, EPFL Press, 2013 |
Learning to learn new models of human activities in indoor settings1, , , and , in: Interactive Multimodal Information Management, EPFL Press, 2013 |
|
Detecting queues at vending machines: a statistical layered approach, and , Idiap-RR-04-2008 |
|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |