Publication list - Idiap Publications

Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models, Khalil Mrini, Nikolaos Pappas and Andrei Popescu-Belis, Idiap-RR-26-2017

Trustworthy speaker recognition with minimal prior knowledge using neural networks, Hannah Muckenhirn, Ecole polytechnique fédérale de Lausanne (EPFL), 2019

Understanding and Visualizing Raw Waveform-based CNNs, Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai-Doss and Sébastien Marcel, in: Proceedings of Interspeech, 2019

Gradient-based spectral visualization of CNNs using raw waveforms, Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-11-2018

Long Term Spectral Statistics for Voice Presentation Attack Detection, Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-11-2017

Long-Term Spectral Statistics for Voice Presentation Attack Detection, Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 25(11):2098-2111, 2017

Towards directly modeling raw speech signal for speaker verification using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, Canada, pages 4884-4888, 2018

On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: Proceedings of Interspeech, Hyderabad, India, pages 1116-1120, 2018

End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017

Towards directly modeling raw speech signal for speaker verification using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-30-2017

Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: International Conference of the Biometrics Special Interest Group (BIOSIG), 2016

A Probabilistic Approach to Multi-Modal Adaptive Virtual Fixtures, M. Mühlbauer, T. Hulin, B. Weber, Sylvain Calinon, F. Stulp, A. Albu-Schäffer and J. Silverio, in: IEEE Robotics and Automation Letters (RA-L), 2024

Effects of cool coatings on urban microclimate and outdoor thermal Comfort: A CFD–CitySim pro coupled simulation study, Da-Som Mun, Jérôme Kämpf and Jae-Jin Kim, in: Energy and Buildings, 2026

Social Sensing Methods for Analysis of Dyadic Hospitality Encounters, Skanda Muralidhar, EPFL, 2019

On Job Training: Automated Interpersonal Behavior Assessment & Real-Time Feedback, Skanda Muralidhar, in: Proceedings of the 25th ACM International Conference on Multimedia, 2017

Dites-Moi: Wearable Feedback on Conversational Behavior, Skanda Muralidhar, Jean M R Costa, Laurent Son Nguyen and Daniel Gatica-Perez, in: Proceedings of the 15th International Conference on Mobile and Ubiquitous Multimedia, 2016

Examining Linguistic Content and Skill Impression Structure for Job Interview Analytics in Hospitality, Skanda Muralidhar and Daniel Gatica-Perez, in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017

Understanding Applicants' Reactions to Asynchronous Video Interviews through Self-Reports and Nonverbal Cues, Skanda Muralidhar, Emmanuelle Patricia Kleinlogel, Eric Mayor, Adrian Bangerter, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimodal Interaction (ICMI), Utrecht, 2020

Training on the Job: Behavioral Analysis of Job Interviews in Hospitality, Skanda Muralidhar, Laurent Son Nguyen, Denise Frauendorfer, Jean-Marc Odobez, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 84-91, 2016

Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes, Skanda Muralidhar, Laurent Son Nguyen and Daniel Gatica-Perez, in: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2018

A Tale of Two Interactions: Inferring Performance in Hospitality Encounters from Cross-Situation Social Sensing, Skanda Muralidhar, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2(129), 2018

How May I Help You? Behavior and Impressions in Hospitality Service Encounters, Skanda Muralidhar, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Facing Employers and Customers: What Do Gaze and Expressions Tell About Soft Skills?, Skanda Muralidhar, Remy Siegfried, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 121-126, Association for Computing Machinery, 2018

Socially Intelligent Surveillance and Monitoring: Analysing Social Dimensions of Physical Space, V. Murino, M. Cristani and Alessandro Vinciarelli, in: Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, San Francisco, pages 51-58, 2010

Reliability and Validity of Nonverbal Thin Slices in Social Interactions, Nora A Murphy, Judith A Hall, Marianne Schmid Mast, Mollie A. Ruben, Denise Frauendorfer, Danielle Blanch-Hartigan, Debra L. Roter and Laurent Son Nguyen, in: Personality and Social Psychology Bulletin, 41(2):199-213, 2014

Convolutional Pitch Target Approximation Model for Speech Synthesis, Xingyu Na and Philip N. Garner, Idiap-RR-05-2013

The Simulation of Mean Radiant Temperature in Outdoor Conditions: A Review of Architectural Tools Calculation Assumptions, Emanuele Naboni, Marco Meloni, Chris Makey and Jérôme Kämpf, in: Proceedings of Building Simulation 2019: 16th Conference of IBPSA, 2019

Learning Optimal Impedance Control During Complex 3D Arm Movements, A. Naceri, T. Schumacher, Q. Li, Sylvain Calinon and H. Ritter, in: IEEE Robotics and Automation Letters (RA-L), 6(2):1248-1255, 2021

Integrating large language models and ASR systems using confidence measures and prompting, Maryam Naderi, Idiap-Com-02-2024

Towards interfacing large language models with ASR systems using confidence measures and prompting, Maryam Naderi, Enno Hermann, Alexandre Nanchen, Sevada Hovsepyan and Mathew Magimai-Doss, in: Proceedings of Interspeech, pages 2980-2984, 2024

Overview of the 8th Workshop on Asian Translation, Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda and Sadao Kurohashi, in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 1-45, Association for Computational Linguistics, 2021

Overview of the 7th Workshop on Asian Translation, Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar and Sadao Kurohashi, in: Proceedings of the 7th Workshop on Asian Translation, Association for Computational Linguistics, 2020

Phoneme based Respiratory Analysis of Read Speech, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Proceedings of European Signal Processing Conference (EUSIPCO), 2021

Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings, Venkata Srikanth Nallanthighal, Zohreh Mostaani, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Neural Networks, 141:211-224, 2021

Empirical Evaluation and Combination of Punctuation Prediction Models Applied to Broadcast News, Alexandre Nanchen and Philip N. Garner, Idiap-RR-01-2019

Empirical Evaluation and Combination of Punctuation Prediction Models Applied to Broadcast News, Alexandre Nanchen and Philip N. Garner, in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019

Keep Sensors in Check: Disentangling Country-Level Generalization Issues in Mobile Sensor-Based Models with Diversity Scores, Alexandre Nanchen, Lakmal Buddika Meegahapola, William Droz and Daniel Gatica-Perez, in: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023

Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement, Vivi Nastase, Chunyang Jiang, Giuseppe Samo and Paola Merlo, in: Tenth Italian Conference on Computational Linguistics, 2024

Testing the assumptions about the geometry of sentence embedding spaces: the cosine measure need not apply, Vivi Nastase and Paola Merlo, in: arXiv, 2025

Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification, Vivi Nastase and Paola Merlo, in: Proceedings of the 9th Workshop on Representation Learning for NLP, 2024

Are there identifiable structural parts in the sentence embedding whole?, Vivi Nastase and Paola Merlo, in: Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2024

Multilingual vs. monolingual transformer models in encoding linguistic structure and lexical abstraction, Vivi Nastase, Giuseppe Samo, Chunyang Jiang and Paola Merlo, in: CLiC-it 2025: Eleventh Italian Conference on Computational Linguistics, September 24-26, 2025, Cagliari, Italy, 2025

Exploring Italian sentence embeddings properties through multi-tasking, Vivi Nastase, Giuseppe Samo, Chunyang Jiang and Paola Merlo, in: Tenth Italian Conference on Computational Linguistics, 2024

Transferring Activities: Updating Human Behavior Analysis, Fabian Nater, Tatiana Tommasi, Helmut Grabner, Luc Van Gool and Barbara Caputo, in: Visual Surveillance Workshop at ICCV, 2011

Learning to learn new models of human activities in indoor settings, Fabian Nater, Tatiana Tommasi, Luc Van Gool and Barbara Caputo, in: Interactive Multimodal Information Management, EPFL Press, 2013

Detecting queues at vending machines: a statistical layered approach, Xavier Naturel and Jean-Marc Odobez, Idiap-RR-04-2008

Detecting queues at vending machines: a statistical layered approach, Xavier Naturel and Jean-Marc Odobez, in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008

The ELISA Systems for the NIST'99 Evaluation in Speaker Detection and Tracking, B. Nedic, Frédéric Bimbot, Raphaël Blouet, Jean-François Bonastre, Gilles Caloz, Jan Cernocky, Gérard Chollet, G. Durou, Corinne Fredouille, Dominique Genoud, Guillaume Gravier, Jean Hennebert, Jamal Kharroubi, I. Magrin-Chagnolleau, Teva Merlin, Chafic Mokbel, Dijana Petrovska-Delacretaz, S. Pigeon, M. Seck, Patrick Verlinde and M. Zouhal, in: DSP Journal (Special Issue on the NIST Speaker Recognition Workshop), 1999

Recent Developments in Speaker Verification at IDIAP, B. Nedic and Hervé Bourlard, Idiap-RR-26-2000