Conference papers list - Idiap Publications

Benchmarking Multimodal Large Language Models for Face Recognition, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2026

[URL]

Building A Civic Tool for Community-Police Engagement to Adapt Neighborhood Policing, Ravinithesh Annapureddy, Staņislavs Šeiko, Natalie Higham-James, William Droz, Alessandro Fornaroli, Sarah Vollmer, Britta Elena Hecking and Daniel Gatica-Perez, in: Designing Interactive Systems Conference (DIS '26), June 13--17, 2026, Singapore, Singapore, 2026

[DOI]

Comparing Natural and Synthetic Structured Data: A Study of the Passive Verb Alternation in French and Italian, Giuseppe Samo and Paola Merlo, in: Proceedings of the Workshop on Structured Linguistic Data and Evaluation (SLiDE), 2026

[DOI]

Datasets for Verb Alternations across Languages: BLM Templates and Data Augmentation Strategies, Giuseppe Samo and Paola Merlo, in: Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026), 2026

[DOI]

DDialogue: A Collaborative Framework for Cross-Sectoral Dialogue through Data, Alessandro Fornaroli, Ravinithesh Annapureddy and Daniel Gatica-Perez, in: Participatory Design Conference 2026, , June 15--19, 2026, Milan, Italy, ACM, 2026

[DOI]

DriveFace: A Cross-Spectral Through-Glass Face Dataset for On-the-Move Vehicular Border Control, Anjith George, Luis S. Luevano, Alain Komaty, Vidit Vidit and Sébastien Marcel, in: IJCB, 2026

Évaluation de la reconnaissance automatique de la parole par les grands modèles de langage génératifs, Thibault Bañeras-Roux, Shashi Kumar, Driss Khalil, Petr Motlicek, Sergio Burdisso, Shiran Liu, Mickael Rouvier, Jane Wottawa and Richard Dufour, in: EvalLLM2026 : Atelier sur l'evaluation des modeles generatifs (LLM), le RAG et challenges, 2026

Exploratory analysis of yellow mongoose vocalization: detection from in-the-wild recordings and call classification, Sevada Hovsepyan, Imen Ben Mahmoud, Vanessa Rüegg, Marta Manser and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2026

FAccT-Checked: A Narrative Review of Authority Reconfigurations and Retention in AI-Mediated Journalism, Matilde Barbini, Daniel Gatica-Perez and Stefano Sorrentino, in: Proc. ACM Conference on Fairness, Accountability, and Transparency, Montreal, 2026

Framing Migration News with LLMs: Structured CoT as a Support for Human Interpretation, David Alonso del Barrio, Jing Wen and Daniel Gatica-Perez, in: COMPASS'26, 2026

GaitFace: A Multimodal Dataset for Long-Range Person Identification, Alain Komaty, Luis S. Luevano, Vidit Vidit, Anjith George and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics (IJCB), 2026

GENERALIZABILITY OF PREDICTIVE AND GENERATIVE SPEECH ENHANCEMENT MODELS TO PATHOLOGICAL SPEAKERS, Mingchi Hou, Ante Jukic and Ina Kodrasi, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026

Geometry-aware Policy Imitation, Yiming Li, Nael Darwiche, Amirreza Razmjoo, sichao Liu, Yilun Du, Auke Ijspeert and Sylvain Calinon, in: International Conference on Learning Representations, 2026

INFLUENCE OF CLEAN SPEECH CHARACTERISTICS ON SPEECH ENHANCEMENT PERFORMANCE, Mingchi Hou and Ina Kodrasi, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026

Learning Ego-Exo Visual Representations for Conversational Gaze Estimation, Anshul Gupta, Yijun Qian, Ruohan Gao, Ishwarya Ananthabhotla, Jean-Marc Odobez, Vamsi Krishna Ithapu and Calvin Murdock, in: Conference on Computer Vision and Pattern Recognition Workshops, 2026

Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study, Yingji Zhang, Marco Valentino, Danilo Carvalho and Andre Freitas, in: The 40th Annual AAAI Conference on Artificial Intelligence, 2026

Meta-RL Induces Exploration in Language Agents, Yulun Jiang, Liangze Jiang, Damien Teney, Michael Moor and Maria Brbić, in: The Fourteenth International Conference on Learning Representations, 2026

Migrant Voices, Local News: Insights on Bridging Community Needs with Media Content, David Alonso del Barrio, Paula Dolores Rescala, Victor Bros and Daniel Gatica-Perez, in: ACM International Conference on Interactive Media Experiences, 2026

[DOI]

Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering, Marco Valentino, Geonhee Kim, Dhairya Dalal, Zhixue Zhao and Andre Freitas, in: The 40th Annual AAAI Conference on Artificial Intelligence, 2026

Modelling the Morphology of Verbal Paradigms: A Case Study in the Tokenization of Turkish and Hebrew, Giuseppe Samo and Paola Merlo, in: Proceedings of the Second Workshop Natural Language Processing for Turkic Languages (SIGTURK 2026), 2026

[DOI]

Optimizing Supply Temperature Control in District Heating Networks via Differentiable Dynamic Simulation and Gradient Descent, Roberto Boghetti and Jérôme Kämpf, in: Construction, Energy, Environment and Sustainability. Proceedings of CEES 2025 (Volume 2: Energy), Springer Singapore, 2026

[DOI]
[URL]

Politics of Questions in News: A Mixed-Methods Study of Interrogative Stances as Markers of Voice and Power, Victor Bros, Matilde Barbini, Patrick Gerard and Daniel Gatica-Perez, in: Vol. 20 (2026): Proceedings of the Twentieth International AAAI Conference on Web and Social Media, 2026

PrivLEX: Detecting legal concepts in images through Vision-Language Models, Darya Baranouskaya and Andrea Cavallaro, in: arXiv, 2026

[DOI]
[URL]

RAG as a Content-Analysis Assistant: Auditing SDG Discourse in Online Videos, Victor Bros, Daniel Gatica-Perez and Cristian Safta, in: Proceedings of the Workshops and Tutorials of the ACM International Conference on Multimedia Retrieval (ICMR 2026), 2026

Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection, Sergio Burdisso, Esaú Villatoro-Tello, Shashi Kumar, Srikanth Madikeri, Andrés Carofilis, Pradeep Rangappa, Manjunath K E, Kadri Hacioğlu, Petr Motlicek and Andreas Stolcke, in: ICASSP 2026, 2026

Rethinking the Role of Collaborative Robots in Rehabilitation, Vivek Gupte, Shalutha Rajapakshe and Emmanuel Senft, in: Companion Proceedings of the 21st ACM/IEEE International Conference on Human-Robot Interaction (HRI Companion '26), March 16--19, 2026, Edinburgh, Scotland Uk, 2026

Skill Extraction from Resumes and Job Offers across Six Languages, Laura Vásquez-Rodríguez, Bertrand Audrin, Samuel Michel, Samuele Galli, Julneth Rogenhofer, Jacopo Negro Cusa and Lonneke van der Plas, in: Proceedings of the 11th edition of the Swiss Text Analytics Conference, 2026

Sparse Neuron Ablation Triggers Catastrophic Collapse of the Language Core in Large Vision-Language Models, Cen Lu, Yung-Chen Tang and Andrea Cavallaro, in: Mechanistic Interpretability Workshop at the 43rd International Conference on Machine Learning, 2026

[URL]

Text-only adaptation in LLM-based ASR through text denoising, Sergio Burdisso, Esaú Villatoro-Tello, Andrés Carofilis, Shashi Kumar, Kadri Hacioğlu, Srikanth Madikeri, Pradeep Rangappa, Manjunath K E, Petr Motlicek, Shankar Venkatesan and Andreas Stolcke, in: ICASSP, 2026

The impact of abstract and object tags on image privacy classification, Darya Baranouskaya and Andrea Cavallaro, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026

[DOI]
[URL]

Triangulating Temporal Dynamics in Multilingual Swiss Online News, Victor Bros, Evan Dufraisse, Adrian Popescu and Daniel Gatica-Perez, in: Vol. 20 (2026): Proceedings of the Twentieth International AAAI Conference on Web and Social Media, 2026

When Specialization Helps (and Hurts): Cross-Modality Transfer in Ophthalmic Imaging with Foundation Models, Roberto Pulvirenti, Oscar Jimenez-del-Toro, Mattia Tomasoni, Florence Hoogewoud and André Anjos, in: 2026 IEEE 23rd International Symposium on Biomedical Imaging, 2026

[DOI]
[URL]

Which private attributes do VLMs agree on and predict well?, Olena Hrynenko, Darya Baranouskaya, Alina Elena Baia and Andrea Cavallaro, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026

Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets, Lei Hsiung, Tianyu Pang, Yung-Chen Tang, Linyue Song, Tsung-Yi Ho, Pin-Yu Chen and Yaoqing Yang, in: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

[URL]

Zero frequency resonator based extraction of R-peaks in ECG signals, RaviShankar Prasad, Gürkan Yilmaz and Mathew Magimai-Doss, in: Proceedings of EUSIPCO, 2026

3D Face Morph Generation Using Geometry-Aware Template Inversion, Hatef Otroshi Shahreza, Laurent Colbois and Sébastien Marcel, in: 2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP), 2025

[DOI]
[URL]

A Bayesian Interpretation of Adaptive Low-Rank Adaptation, Haolin Chen and Philip N. Garner, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

[DOI]

A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models, Geonhee Kim, Marco Valentino and Andre Freitas, in: Findings of the ACL, 2025

A Smooth Analytical Formulation of Collision Detection and Rigid Body Dynamics with Contact, Onur Beker, Nico Gürtler, Ji Shi, Andreas René Geist, Amirreza Razmjoo, George Martius and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

Accelerating Antibiotic Discovery with Large Language Models and Knowledge Graphs, Maxime Delmas, Magdalena Wysocka, Danilo Gusicuma and Andre Freitas, in: 63rd Annual Meeting of the Association for Computational Linguistics, Vienna, pages 693–705, Association for Computational Linguistics, 2025

[DOI]
[URL]

Accelerating Criminal Investigations with TRACY, Pradeep Rangappa, Petr Motlicek, Dairazalia Sanchez-Cortes, Alejandra Sanchez Lara, Michaela Antonopoulou, Ioannis Fourfouris, Nikos Avgerinos and Manolis Tsangaris, in: 16th EAI International Conference on Digital Forensics & Cyber Crime, 2025

An evidence-based guidance framework for neural network system diagrams, Guy Marshall, Andre Freitas and Caroline Jay, in: PLOS One, 2025

Analogical Structure, Minimal Contextual Cues and Contrastive Distractors: Input Design for Sample-Efficient Linguistic Rule Induction, Chunyang Jiang and Paola Merlo, in: arXiv cs.CL.2511.10441, 2025

ArtFace: Towards Historical Portrait Face Identification via Model Adaptation, Francois Poh, Anjith George and Sébastien Marcel, in: (Non-Archival), pages 4, 2025

[URL]

Assessing the reliability of archetype-based Urban Building Energy Simulations: A case study analysis in Turin (Italy), Matteo Piro, Jérôme Kämpf, Ilaria Ballarini and Vincenzo Corrado, in: Journal of Physics: Conference Series, pages 062028, IOP Publishing, 2025

[DOI]
[URL]

AugGen: Synthetic Augmentation using Diffusion Models Can Improve Recognition, Parsa Rahimi, Damien Teney and Sébastien Marcel, in: The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025

Automatic detection of the visual gaze components of joint attention in observational, naturalistic child language acquisition data, Miranda Dickerman, Anshul Gupta, Samy Tafasca, Xiaocheng Zhang, Jean-Marc Odobez and Sabine Stoll, in: Boston University Conference on Language Development, 2025

Automatic Parkinson’s disease detection from speech: Layer selection vs adaptation of foundation models, Tilak Purohit, Barbara Ruvolo, Juan Rafael Orozco-Arroyave and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

Bayesian low-rank learning (Bella): A practical approach to bayesian neural networks, Bao Gia Doan, Afshar Shamsi, Xiao-Yu Guo, Arash Mohammadi, Hamid Alinejad-Rokny, Damien Teney, Damith Ranasinghe and Ehsan Abbasnejad, in: Proceedings of the AAAI Conference on Artificial Intelligence, 2025

Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering, Andrés Carofilis, Pradeep Rangappa, Srikanth Madikeri, Shashi Kumar, Sergio Burdisso, Jeena Prakash, Esaú Villatoro-Tello, Petr Motlicek, Bidisha Sharma, Kadri Hacioğlu, Shankar Venkatesan, Saurabh Vyas and Andreas Stolcke, in: Interspeech 2025, Rotterdam, The Netherlands, pages 3618--3622, 2025

[DOI]
[URL]

CARMA: Enhanced Compositionality in LLMs via Advanced Regularisation and Mutual Information Alignment, Nura Aljaafari, Danilo Carvalho and Andre Freitas, in: The 2025 Conference on Empirical Methods in Natural Language Processing, 2025

CCDP: Composition of Conditional Diffusion Policies with Guided Sampling, Amirreza Razmjoo, Sylvain Calinon, Michael Gienger and Fan Zhang, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

CCDP: Model-free Failure Recovery via Guided Diffusion Sampling, Amirreza Razmjoo, Sylvain Calinon, Michael Gienger and Fan Zhang, in: Workshop on The Art of Robustness: Surviving Failures in Robotics, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2025

Chain-of-Model Learning for Language Model, Kaitao Song, Xiaohua Wang, Xu Tan, Huiqiang Jiang, Chengruidong Zhang, Yongliang Shen, Cen Lu, Zihao Li, Zifan Song, Caihua Shan, Yansen Wang, Kan Ren, Xiaoqing Zheng, Tao Qin, Yuqing Yang, Dongsheng Li and Lili Qiu, in: 39th Conference on Neural Information Processing Systems, 2025

Children's Voice Privacy: First Steps and Emerging Challenges, Ajinkya Kulkarni, Francisco Teixeira, Enno Hermann, Thomas Rolland, Isabel Trancoso and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2025

Co-Designing with Multiple Stakeholders and Datasets: A Community-Centered Process to Understand Youth Deviance in the Italian City of Turin, Ravinithesh Annapureddy, Alessandro Fornaroli, Massimo Fattori, Valeria Lacovara, Eleonora Fiori, Sarah Vollmer, Moritz Konradi, Britta Elena Hecking, Gianfranco Todesco and Daniel Gatica-Perez, in: Proceedings of the 12th International Conference on Communities & Technologies, pages 81-97, Association for Computing Machinery, 2025

[DOI]
[URL]

Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing, Eklavya Sarkar and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech and Signal Processing, 2025

Controlling Equational Reasoning in Large Language Models with Prompt Interventions, Jordan Meadows, Marco Valentino and Andre Freitas, in: The 39th Annual AAAI Conference on Artificial Intelligence, 2025

CoRet: Improved Retriever for Code Editing, Fabio Fehr, in: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

[DOI]
[URL]

DeepID Challenge of Detecting Synthetic Manipulations in ID Documents, Pavel Korshunov, Vidit Vidit, Amir Mohammadi, Christophe Ecabert, Nevena Shamoska, Sébastien Marcel, Zeqin Yu, Ye Tian, Jiangqun Ni, Lazar Lazarevic, Renat Khizbullin, Anastasiia Evteeva, Alexey Tochin, Aleksei Grishin, Anjith George, Daniel DeAlcala, Tamas Endrei, Javier Munoz-Haro, Ruben Tolosana, Ruben Vera-Rodriguez, Aythami Morales, Julian Fierrez, Gyorgy Cserey, Hardik Sharma, Sachin Chaudhary, Akshay Dudhane, Praful Hambarde, Amit Shukla, Prateek Shaily, Jayant Kumar, Ajinkya Hase, Satish Maurya, Mridul Sharma and Pallav Dwivedi, in: International Conference on Computer Vision (ICCV), 2025

Detecting Text Manipulation in Images using Vision Language Models, Vidit Vidit, Pavel Korshunov, Amir Mohammadi, Christophe Ecabert, Ketan Kotwal and Sébastien Marcel, in: 36th British Machine Vision Conference 2025, 2025

Differentiable rasterization of minimum-time sigma-lognormal trajectories, D. Berio, Sylvain Calinon, R. Plamondon and F. F. Leymarie, in: In Proc. 22nd Conference of the International Graphonomics Society (IGS), 2025

Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models, Anjith George and Sébastien Marcel, in: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW), 2025

Distilling Contact Planning for Fast Trajectory Optimization in Robot Air Hockey, Julius Jankowski, Ante Marić, Puze Liu, Davide Tateo, Jan Peters and Sylvain Calinon, in: Proceedings of Robotics: Science and Systems, 2025

[DOI]
[URL]

Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild, Damien Teney, Liangze Jiang, Florin Gogianu and Ehsan Abbasnejad, in: The IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

EdgeDoc: Hybrid CNN-Transformer Model for Accurate Forgery Detection and Localization in ID Documents, Anjith George and Sébastien Marcel, in: ICCV, 2025

Effective Graph and Rank-based Contextual Embeddings for Textual and Multimedia Data, Thiago Almeida, Gustavo Leticio, Lucas Pascotti, Andre Freitas and Daniel Pedronette, in: International Joint Conference on Neural Networks, 2025

Efficient and Real-Time Motion Planning for Robotics Using Projection-Based Optimization, Xuemin Chi, Hakan Girgin, Tobias Löw, Yangyang Xie, Teng Xue, Jihao Huang, Zhitao Liu and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering, Pradeep Rangappa, Andrés Carofilis, Jeena Prakash, Shashi Kumar, Sergio Burdisso, Srikanth Madikeri, Esaú Villatoro-Tello, Bidisha Sharma, Petr Motlicek, Kadri Hacioğlu, Shankar Venkatesan, Saurabh Vyas and Andreas Stolcke, in: Proc. Interspeech, 2025

Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations, Leonardo Ranaldi, Marco Valentino and Andre Freitas, in: Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025

Emotion information recovery potential of wav2vec2 network fine-tuned for speech recognition task, Tilak Purohit and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels, Pierre Vuillecard and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025

Enhancing Domain Diversity in Synthetic Data Face Recognition with Dataset Fusion, Anjith George and Sébastien Marcel, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2025

Exploring auditory feedback mechanisms in speech recognition, Louise Coppieters de Gibson and Philip N. Garner, in: Proceedings of Interspeech 2025, pages 4743-4747, 2025

[DOI]

Exploring ChatGPT for Face Presentation Attack Detection in Zero and Few-Shot in-Context Learning, Alain Komaty, Hatef Otroshi Shahreza, Anjith George and Sébastien Marcel, in: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025

Exploring In-Context Learning Capabilities of ChatGPT for Pathological Speech Detection, Mahdi Amiri, Hatef Otroshi Shahreza and Ina Kodrasi, in: ITG Conference on Speech Communication, IEEE, 2025

Exploring the Complexity of Parkinson’s Patient Speech for Depression Detection task: A Qualitative Analysis, Barbara Ruvolo, Tilak Purohit, Bogdan Vlasenko, Juan Rafael Orozco-Arroyave and Mathew Magimai-Doss, in: Proceedings of Workshop on Speech Pathology Analysis and DEtection (SPADE), Hyderabad, India, IEEE, 2025

Face Reconstruction from Face Embeddings using Adapter to a Face Foundation Model, Hatef Otroshi Shahreza, Anjith George and Sébastien Marcel, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

FaceLLM: A Multimodal Large Language Model for Face Understanding, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2025

[URL]

Faithful and Robust LLM-Driven Theorem Proving for NLI Explanations, Xin Quan, Marco Valentino, Louise Dennis and Andre Freitas, in: 63rd Annual Meeting of the Association for Computational Linguistics, 2025

FantasyID: A dataset for detecting digital manipulations of ID-documents, Pavel Korshunov, Amir Mohammadi, Vidit Vidit, Christophe Ecabert and Sébastien Marcel, in: Proceedings of IEEE International Joint Conference on Biometrics, 2025

Fast-and-Frugal Text-Graph Transformers are Effective Link Predictors, Andrei Catalin Coman, Christos Theodoropoulos, Marie-Francine Moens and James Henderson, in: Findings of the Association for Computational Linguistics, 2025

[URL]

Fine-Tuning Pretrained Models with NVIB for Improved Generalisation, Fabio Fehr, Alina Elena Baia, Xiaoguang Chang, Andrei Catalin Coman, Karl El Hajal, Dina El Zein, Shashi Kumar, Juan Zuluaga-Gomez, Andrea Cavallaro, Damien Teney and James Henderson, in: Workshop on Spurious Correlation and Shortcut Learning: Foundations and Solutions, 2025

[URL]

Formalizing Complex Mathematical Statements with LLMs: A Study on Mathematical Definitions, Lan Zhang, Marco Valentino and Andre Freitas, in: The 2025 Conference on Empirical Methods in Natural Language Processing (best resource paper award), 2025

Gem: Gaussian Mixture Model Embeddings for Numerical Feature Distributions, Hafiz Rauf, Alex Bogatu, Norman Paton and Andre Freitas, in: 8th International Conference on Extending Database Technology, 2025

Generating Synthetic Face Recognition Datasets Using Brownian Identity Diffusion and a Foundation Model, Hatef Otroshi Shahreza and Sébastien Marcel, in: 2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP), 2025

[DOI]
[URL]

Giving Sense to Inputs: Toward an Accessible Control Framework for Shared Autonomy, Shalutha Rajapakshe, Jean-Marc Odobez and Emmanuel Senft, in: Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, Melbourne, Australia, ACM, 2025

[URL]

Graph Neural Networks for Parkinson's Disease Detection, Sheikh Shakeel, Yacouba Kaloga, Md Sahidullah and Ina Kodrasi, in: International Conference on Acoustics, Speech and Signal Processing, Hyderabad, India, IEEE, 2025

HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims, Michiel van der Meer, Pavel Korshunov, Sébastien Marcel and Lonneke van der Plas, in: The 63rd Annual Meeting of the Association for Computational Linguistics, 2025

HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere, Hatef Otroshi Shahreza and Sébastien Marcel, in: The Thirteenth International Conference on Learning Representations, 2025

[URL]

Identity-Preserving Aging and De-Aging of Faces in the StyleGAN Latent Space, Luis S. Luevano, Pavel Korshunov and Sébastien Marcel, in: 2025 IEEE International Joint Conference on Biometrics (IJCB), IEEE, 2025

Idiap kNN-TTS System for the Blizzard Challenge 2025, Enno Hermann, Karl El Hajal, Ajinkya Kulkarni and Mathew Magimai-Doss, in: Blizzard Challenge Workshop, 2025

Image-driven robot drawing with rapid lognormal movements, D. Berio, G. Clivaz, M. Stroh, O. Deussen, R. Plamondon, Sylvain Calinon and F. F. Leymarie, in: In Proc. IEEE Intl Symp. on Robot and Human Interactive Communication (Ro-Man), 2025

Improving chain-of-thought reasoning via quasi-symbolic abstractions, Leonardo Ranaldi, Marco Valentino, Alexander Polonsky and Andre Freitas, in: 63rd Annual Meeting of the Association for Computational Linguistics, 2025

Inductive Learning of Logical Theories with LLMs: A Complexity-graded Analysis, João Gandarela, Danilo Carvalho and Andre Freitas, in: The 39th Annual AAAI Conference on Artificial Intelligence, 2025

Investigation of accuracy and bias in face recognition trained with synthetic data, Pavel Korshunov, Ketan Kotwal, Christophe Ecabert, Vidit Vidit, Amir Mohammadi and Sébastien Marcel, in: Proceedings of IEEE International Joint Conference on Biometrics, 2025

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity, Mutian He and Philip N. Garner, in: 13th International Conference on Learning Representations (ICLR), 2025

[URL]

kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech, Karl El Hajal, Ajinkya Kulkarni, Enno Hermann and Mathew Magimai-Doss, in: Proceedings of the Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), Albuquerque, New Mexico, ACL, 2025

[URL]

LangVAE and LangSpace: Building and Probing for Language Model VAEs, Danilo Carvalho, Yingji Zhang, Harriet Unsworth and Andre Freitas, in: Demonstration at the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025

Latent Space Factorization in LoRA, Shashi Kumar, Yacouba Kaloga, John Mitros, Petr Motlicek and Ina Kodrasi, in: 39th Conference on Neural Information Processing Systems, 2025

[URL]

Leveraging Untranscribed Data for End-to-End Speech and Callsign Recognition in Air-Traffic Communication, Petr Motlicek, Shashi Kumar, Driss Khalil, Amrutha Prasad and Schüpbach Christof, in: SESAR Innovation Days 2025 (https://www.sesarju.eu/SIDS2025), Eurocontrol, Bled, Slovenia, 2025

[URL]

Loose Social-Interaction Recognition in Real-world Therapy Scenarios, Abid Ali, Rui Dai, Ashish Marisetty, Guillaume Astruc, Monique Thonnat, Jean-Marc Odobez, Suzanne Thümmler and Francois Bremond, in: IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

ManiDP: Manipulability-Aware Diffusion Policy for Posture-Dependent Bimanual Manipulation, Z. Li, J. Liu, D. Li, T. Teng, M. Li, Sylvain Calinon, D. G. Caldwell and F. Chen, in: In Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

MASA: A Modular Framework for LLM-Driven Multi-Agent Systems for Autoformalization, Lan Zhang, Marco Valentino and Andre Freitas, in: Demonstration at the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Meaningful Pose-Based Sign Language Evaluation, Zifan Jiang, Colin Leong, Amit Moryossef, Oliver Cory, Maksym Ivashechkin, Neha Tarigopula, Biao Zhang, Anne Göhring, Annette Rios, Rico Sennrich and Sarah Ebling, in: Proceedings of the Tenth Conference on Machine Translation (WMT), 2025

[DOI]
[URL]

MM-HSD: Multi-Modal Hate Speech Detection in Videos, Berta Céspedes-Sarrias, Carlos Collado-Capell, Pablo Rodenas-Ruiz, Olena Hrynenko and Andrea Cavallaro, in: Proceedings of the 33rd ACM International Conference on Multimedia (MM'25), October 27-31, 2025, Dublin, Ireland., 2025

[DOI]

Montague semantics and modifier consistency measurement in neural language models, Danilo Carvalho, Edoardo Manino, Julia Rozanova, Lucas Cordeiro and Andre Freitas, in: 31st International Conference on Computational Linguistics, 2025

Movement Generation and Drawing in Robotics, Sylvain Calinon, in: In Proc. 22nd Conference of the International Graphonomics Society (IGS), 2025

Multilingual vs. monolingual transformer models in encoding linguistic structure and lexical abstraction, Vivi Nastase, Giuseppe Samo, Chunyang Jiang and Paola Merlo, in: CLiC-it 2025: Eleventh Italian Conference on Computational Linguistics, September 24 ? 26, 2025, Cagliari, Italy, 2025

[URL]

Multimodal Prosody Modeling: A Use Case for Multilingual Sentence Mode Prediction, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2025

Multiview Canonical Correlation Analysis for Automatic Pathological Speech Detection, Yacouba Kaloga, Sheikh Shakeel and Ina Kodrasi, in: International Conference on Acoustics, Speech and Signal Processing, Hyderabad, India, IEEE, 2025

Mysteries of the Deep: Role of Intermediate Representations in Out of Distribution Detection, Ignacio Meza De la Jara, Cristian Rodriguez-Opazo, Damien Teney, Damith Ranasinghe and Ehsan Abbasnejad, in: Advances in neural information processing systems, 2025

Nexus: An Omni-Perceptive And-Interactive Model for Language, Audio, And Vision, Che Liu, Yingji Zhang, Dong Zhang, Weijie Zhang, Chenggong Gong, Haohan Li, Yu Lu, Shilin Zhou, Yue Lu, Ziliang Gan, Ziao Wang, Junwei Liao, Haipang Wu, Ji Liu, Andre Freitas, Qifan Wang, Zenglin Xu, Rongjunchen Zhang and Yong Dai, in: ACM Multimedia, 2025

OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable?, Liangze Jiang and Damien Teney, in: Forty-Second International Conference on Machine Learning, 2025

Open Challenge: Exploring People's Everyday Life Behavior with Mobile Data, Andrea Bontempelli, Matteo Busso, Lakmal Buddika Meegahapola, Amalia de Götzen, Fausto Giunchiglia and Daniel Gatica-Perez, in: Companion of the 2025 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2025

OpenBEERS: A digital platform for urban scale simulation of building energy efficiency, David Geissbuhler, Alejandro Pena-Bello, Jérôme Kämpf and Jakob Rager, in: Journal of Physics: Conference Series, pages 042013, IOP Publishing, 2025

[DOI]
[URL]

PEIRCE: Unifying Material and Formal Reasoning via LLM-Driven Neuro-Symbolic Refinement, Xin Quan, Marco Valentino, Danilo Carvalho, Dhairya Dalal and Andre Freitas, in: Demonstration at 63rd Annual Meeting of the Association for Computational Linguistics, 2025

Performance Evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward, Shashi Kumar, Iuliia Thorbecke, Sergio Burdisso, Esaú Villatoro-Tello, Manjunath K E, Kadri Hacioğlu, Pradeep Rangappa, Petr Motlicek, Aravind Ganapathiraju and Andreas Stolcke, in: SALMA Workshop, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

[URL]

Privacy-enhancing Sclera Segmentation Benchmarking Competition: SSBC 2025, Matej Vitek, Darian Tomašević, Abhijit Das, Sabari Nathan, Gökhan Özbulak, Gözde Ayşe Tataroğlu Özbulak, Jean-Paul Calbimonte, André Anjos, Hariohm Hemant Bhatt, Dhruv Dhirendra Premani, Jay Chaudhari, Caiyong Wang, Jian Jiang, Chi Zhang, Qi Zhang, Iyyakutti Iyappan Ganapathi, Syed Sadaf Ali, Divya Velayudan, Maregu Assefa, Naoufel Werghi, Zachary A Daniels, Leeon John, Ritesh Vyas, Jalil Nourmohammadi Khiarak, Taher Akbari Saeed, Mahsa Nasehi, Ali Kianfar, Mobina Pashazadeh Panahi, Geetanjali Sharma, Pushp Raj Panth, Raghavendra Ramachandra, Aditya Nigam, Umapada Pal, Helio Pedrini and Vitomir Struc, in: International Joint Conference on Biometrics, IEEE, 2025

Quasi-symbolic Semantic Geometry over Transformer-based Variational AutoEncoders, Yingji Zhang, Danilo Carvalho and Andre Freitas, in: 29th Conference on Computational Natural Language Learning (nominated for a best paper award), 2025

RAGferee: Building Contextual Reward Models for Retrieval-Augmented Generation, Andrei Catalin Coman, Ionuț-Teodor Sorodoc, Leonardo F. R. Ribeiro, Bill Byrne, James Henderson and Adrià de Gispert, in: Empirical Methods in Natural Language Processing, 2025

[URL]

Second Competition on Presentation Attack Detection on ID Card, Juan E. Tapia, Nieto Mario, Juan Espin, Alvaro Sanchez, Naser Damer, Christoph Busch, Marija Ivanovska, Leon Todorov, Renat Khizbullin, Aleksei Grishin, Lazar Lazarevic, Daniel Schulz, Sebastian Gonzalez, Amir Mohammadi, Ketan Kotwal, Sébastien Marcel, Raghavendra Mudgalgundurao, Kiran B. Raja, Patrick Schuch, Pedro Couto, Joao Pinto, Mariana Xavier, Andres Valenzuela, Borut Batagelj, Javier Barrachina, Marko Peterlin, Peter Peer, Ajnas Muhammed, Diogo Nunes, Nuno Gonçalves, Sushrut Patwardhan and Raghavendra Ramachandra, in: IEEE International Joint Conference on Biometrics (IJCB), 2025

Securing Face and Fingerprint Templates in Humanitarian Biometric Systems, Vedrana Krivokuca, Giuseppe Stragapede, Sam Merrick, Justin Sukaitis and Vincent Graf Narbel, in: Proceedings of the International Joint Conference on Biometrics (IJCB 2025), Osaka, Japan, IEEE, 2025

Soft Skills in the Wild: Challenges in Multilingual Classification, Laura Vásquez-Rodríguez, Bertrand Audrin, Samuel Michel, Samuele Galli, Julneth Rogenhofer, Jacopo Negro Cusa and Lonneke van der Plas, in: Proceedings of the 10th edition of the Swiss Text Analytics Conference, 2025

Specializing General-purpose LLM Embeddings for Implicit Hate Speech Detection across Datasets, Vassiliy Cheremetiev, Quang Long Ho Ngo, Chau Ying Kot, Alina Elena Baia and Andrea Cavallaro, in: Proceedings of the 2nd International Workshop on Diffusion of Harmful Content on Online Web (DHOW '25), October 27--28, 2025, Dublin, Ireland, 2025

Speech Data Selection for Efficient ASR Fine-Tuning using Domain Classifier and Pseudo-Label Filtering, Pradeep Rangappa, Juan Zuluaga-Gomez, Srikanth Madikeri, Andrés Carofilis, Jeena Prakash, Sergio Burdisso, Shashi Kumar, Esaú Villatoro-Tello, Nigmatulina Iuliia, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), 2025

[DOI]
[URL]

Speech power spectra: a window into neural oscillations in Parkinson’s disease, Sevada Hovsepyan and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2025

[DOI]

STM-GNN: Space-Time-and-Memory Graph Neural Networks for Predicting Multi-Drug Resistance Risks in Dynamic Patient Networks, Damien Geissbuhler, Alban Bornet, Catarina Marques, André Anjos, Sónia Pereira and Douglas Teodoro, in: International Conference on Artificial Intelligence in Medicine, Pavia, Italy, 2025

[DOI]

SylloBio-NLI: Evaluating Large Language Models on Biomedical Syllogistic Reasoning, Magdalena Wysocka, Danilo Carvalho, Oskar Wysocki, Marco Valentino and Andre Freitas, in: Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025

Synergy and diversity in CLIP: Enhancing performance through adaptive backbone ensembling, Cristian Rodriguez-Opazo, Ehsan Abbasnejad, Damien Teney, Hamed Damirchi, Edison Marrese-Taylor and Anton van den Hengel, in: International Conference on Learning Representations, 2025

Synthetic Face Datasets Generation via Latent Space Exploration from Brownian Identity Diffusion, David Geissbuhler, Hatef Otroshi Shahreza and Sébastien Marcel, in: The Forty-second International Conference on Machine Learning (ICML), 2025

[URL]

TableDC: Deep Clustering for Tabular Data, Hafiz Rauf, Andre Freitas and Norman Paton, in: ACM SIGMOD International Conference on Management of Data, 2025

Testing the assumptions about the geometry of sentence embedding spaces: the cosine measure need not apply, Vivi Nastase and Paola Merlo, in: arXiv, 2025

[URL]

The Greatest Challenge For Startups: Computational Text Analysis on Swiss Ventures, Takahiro Inada, Esaú Villatoro-Tello, Jung Park, Jim Pulcrano and Benoit F. Leleux, in: Academy of Management Proceedings 2025., 2025

[URL]

The Invisible Threat: Evaluating the Vulnerability of Cross-Spectral Face Recognition to Presentation Attacks, Anjith George and Sébastien Marcel, in: International Joint Conference on Biometrics (IJCB 2025), IEEE., 2025

The Suisse Romande Local News Dataset, Victor Bros and Daniel Gatica-Perez, in: Proceedings of the Nineteenth International AAAI Conference on Web and Social Media, 2025

TokenVerse++: Towards Flexible Multitask Learning with Dynamic Task Activation, Shashi Kumar, Srikanth Madikeri, Esaú Villatoro-Tello, Sergio Burdisso, Pradeep Rangappa, Andrés Carofilis, Petr Motlicek, Karthik Pandia D S, Shankar Venkatesan, Kadri Hacioğlu and Andreas Stolcke, in: 2025 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), IEEE, 2025

Towards Accessible and Intuitive Shared Autonomy, Shalutha Rajapakshe, in: Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, 2025

[URL]

Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, Sandrine Tornay and Mathew Magimai-Doss, in: Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025

Towards interpretable emotion recognition: Identifying key features with machine learning, Yacouba Kaloga and Ina Kodrasi, in: Forum Acusticum/EuroNoise, Malaga, Spain, 2025

Towards Leveraging Sequential Structure in Animal Vocalizations, Eklavya Sarkar and Mathew Magimai-Doss, in: Neural Information Processing Systems workshop: AI for Non-Human Animal Communication, 2025

TRACE: Training and Inference-Time Interpretability Analysis for Language Models, Nura Aljaafari, Danilo Carvalho and Andre Freitas, in: Demonstration at the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Transformers Pretrained on Procedural Data Contain Modular Structures for Algorithmic Reasoning, Zachary Shinnick, Liangze Jiang, Hemanth Saratchandran, Anton van den Hengel and Damien Teney, in: ICML 2025 Workshop on Methods and Opportunities at Small Scale, 2025

Unifying Global and Near-Context Biasing in a Single Trie Pass., Thorbecke Iuliia, Esaú Villatoro-Tello, Juan Zuluaga-Gomez, Shashi Kumar, Sergio Burdisso, Pradeep Rangappa, Andrés Carofilis, Srikanth Madikeri, Petr Motlicek, Karthik Pandia D S, Kadri Hacioğlu and Andreas Stolcke, in: Text, Speech, and Dialogue. TSD 2025. Lecture Notes in Computer Science, Springer, Springer, 2025

[DOI]
[URL]

Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR, Karl El Hajal, Enno Hermann, Ajinkya Kulkarni and Mathew Magimai-Doss, in: Proceedings of Workshop on Speech Pathology Analysis and DEtection (SPADE), Hyderabad, India, IEEE, 2025

[URL]

Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech, Karl El Hajal, Enno Hermann, Sevada Hovsepyan and Mathew Magimai-Doss, in: Proceedings of Interspeech, Rotterdam, Netherlands, ISCA, 2025

[URL]

Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion, Ajinkya Kulkarni, Dowerah Sandipana, Mathew Magimai-Doss and Tanel alumae, in: Proceedings of Interspeech, 2025

Validation of two distinct simulation models of district heating networks: application to efficient looping analysis, Dubon Rodrigue, Roberto Boghetti, Jérôme Kämpf, Bastien Pasdeloup, Mohamed T. Mabrouk, Patrick Meyer and Bruno Lacarrière, in: Journal of Physics: Conference Series, pages 042021, IOP Publishing, 2025

[DOI]
[URL]

Variational Autoencoder for Personalized Pathological Speech Enhancement, Mingchi Hou and Ina Kodrasi, in: European Signal Processing Conference, 2025

Whole-Body Impedance Control of a Humanoid Robot Based on Human-Human Demonstration for Human-Robot Collaboration, C. Li, J. Liu, T. Teng, S. Wang, Sylvain Calinon and F. Chen, in: In Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

xEdgeFace: Efficient Cross-Spectral Face Recognition for Edge Devices, Anjith George and Sébastien Marcel, in: International Joint Conference on Biometrics (IJCB 2025), IEEE., 2025

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Iuliia Thorbecke, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

[DOI]
[URL]

A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

A Human Perspective to AI-based Candidate Screening, Laura Vásquez-Rodríguez, Bertrand Audrin, Samuel Michel, Samuele Galli, Julneth Rogenhofer, Jacopo Negro Cusa and Lonneke van der Plas, in: Proceedings of the 58th Hawaii International Conference on System Sciences (HICSS), 2024

A Novel and Responsible Dataset for Face Presentation Attack Detection on Mobile Devices, Nathan Ramoly, Alain Komaty, Vedrana Krivokuca, Lara Younes, Ahmad-Montaser Awal and Sébastien Marcel, in: The IEEE International Joint Conference on Biometrics, Buffalo, New York, pages 8, 2024

A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics, Puze Liu, Jonas Günster, Niklas Funk, Simon Gröger, Dong Chen, Haitham Bou-Ammar, Julius Jankowski, Ante Marić, Sylvain Calinon, Andrej Orsula, Miguel Olivares-Mendez, Hongyi Zhou, Rudolf Lioutikov, Gerhard Neumann, Amarildo Likmeta, Amirhossein Zhalehmehrabi, Thomas Bonenfant, Marcello Restelli, Davide Tateo, Ziyuan Liu and Jan Peters, in: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS Datasets and Benchmarks 2024), 2024

A Symbolic Framework for Systematic Evaluation of Mathematical Reasoning with Transformers, Jordan Meadows, Marco Valentino, Damien Teney and Andre Freitas, in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024

[URL]

A System for Human-Robot Teaming through End-User Programming and Shared Autonomy, Michael Hagenow, Emmanuel Senft, Robert Radwin, Michael Gleicher, Michael Zinn and Bilge Mutlu, in: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pages 231-239, 2024

[DOI]
[URL]

A Unified Model for Gaze Following and Social Gaze Prediction, Anshul Gupta, Samy Tafasca, Naravich Chutisilp and Jean-Marc Odobez, in: The 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

Adversarial Robustness Analysis in Automatic Pathological Speech Detection Approaches, Mahdi Amiri and Ina Kodrasi, in: Interspeech, 2024

Aligning Large and Small Language Models via Chain-of-Thought Reasoning, Leonardo Ranaldi and Andre Freitas, in: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

An LLM-based Knowledge Synthesis and Scientific Reasoning Framework for Biomedical Discovery, Oskar Wysocki, Magdalena Wysocka, Danilo Carvalho, Alex Bogatu, Danilo Gusicuma, Maxime Delmas, Harriet Unsworth and Andre Freitas, in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL, Bangkok, Thailand, pages 355-364, 2024

[DOI]
[URL]

Annotator-centric Active Learning for Subjective NLP Tasks, Michiel van der Meer, Neele Falk, Pradeep K. Murukannaiah and Enrico Liscio, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2024

Are there identifiable structural parts in the sentence embedding whole?, Vivi Nastase and Paola Merlo, in: Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2024

Assessing the Reliability of Biometric Authentication on Virtual Reality Devices, Ketan Kotwal, Gökhan Özbulak and Sébastien Marcel, in: Proceedings of IEEE International Joint Conference on Biometrics, 2024

Bi-directional Training for Composed Image Retrieval via Text Prompt Learning, Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney and Stephen Gould, in: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024

[URL]

BLM-It - Blackbird Language Matrices for Italian: A CALAMITA Challenge, Chunyang Jiang, Giuseppe Samo, Vivi Nastase and Paola Merlo, in: Proceedings of the 10th Italian Conference on Computational Linguistics, 2024

Breaking Template Protection: Reconstruction of Face Images from Protected Facial Templates, Hatef Otroshi Shahreza and Sébastien Marcel, in: 18th International Conference on Automatic Face and Gesture Recognition (FG), 2024

Can We Learn to Select the Right Algorithm for OOD Generalization?, Liangze Jiang and Damien Teney, in: Out Of Distribution Generalization in Computer Vision, Workshop at ECCV, 2024

CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition, Pierre Vuillecard, Arya Farkhondeh, Michael Villamizar and Jean-Marc Odobez, in: 18th IEEE Int. Conference on Automatic Face and Gesture Recognition (FG), Istanbul,, 2024

ChatGPT and biometrics: an assessment of face recognition, gender detection, and age estimation capabilities, Ahmad Hassanpour, Yasamin Kowsari, Hatef Otroshi Shahreza, Bian Yang and Sébastien Marcel, in: 2024 IEEE International Conference on Image Processing (ICIP), 2024

[DOI]
[URL]

ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild, Arya Farkhondeh, Samy Tafasca and Jean-Marc Odobez, in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024

COMPARING DATA-DRIVEN AND HANDCRAFTED FEATURES FOR DIMENSIONAL EMOTION RECOGNITION, Bogdan Vlasenko, Sargam Vyas and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech and Signal Processing, 2024

Comparing Stability and Discriminatory Power of Hand-crafted Versus Deep Radiomics: A 3D-Printed Anthropomorphic Phantom Study, Oscar Jimenez-del-Toro, Christoph Aberle, Roger Schaer, Michael Bach, Kyriakos Flouris, Ender Konukoglu, Bram Stieltjes, Markus M. Obmann, André Anjos, Henning Müller and Adrien Depeursinge, in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024

Configuration Space Distance Fields for Manipulation Planning, Yiming Li, Xuemin Chi, Amirreza Razmjoo and Sylvain Calinon, in: Robotics: Science and Systems (RSS), 2024, 2024

Consistent Autoformalization for Constructing Mathematical Libraries, Lan Zhang, Xin Quan and Andre Freitas, in: The 2024 Conference on Empirical Methods in Natural Language Processing, 2024

CONTENT-BASED OBJECTIVE EVALUATION OF ARTIFICIALLY GENERATED SIGN LANGUAGE VIDEOS, Neha Tarigopula, Preyas Garg, Skanda Muralidhar, Sandrine Tornay, Dinesh Babu Jayagopi and Mathew Magimai-Doss, in: ICASSP, 2024

CONTEXTUAL BIASING METHODS FOR IMPROVING RARE WORD DETECTION IN AUTOMATIC SPEECH RECOGNITION, Mrinmoy Bhattacharjee, Nigmatulina Iuliia, Amrutha Prasad, Pradeep Rangappa, Srikanth Madikeri, Petr Motlicek, Hartmut Helmke and Matthias Kleinert, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024

Cross-transfer Knowledge between Speech and Text Encoders to Evaluate Customer Satisfaction, Luis Felipe Parra-Gallego, Tilak Purohit, Bogdan Vlasenko, Juan Rafael Orozco-Arroyave and Mathew Magimai-Doss, in: Proceedings of Interspeech, Kos Island, Greece, ISCA, 2024

D-LGP: Dynamic Logic-Geometric Program for Reactive Task and Motion Planning, Teng Xue, Razmjoo Amirreza and Sylvain Calinon, in: IEEE International Conference on Robotics and Automation, 2024

DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews, Sergio Burdisso, Ernesto A. Reyes-Ramírez, Esaú Villatoro-Tello, Fernando Sánchez-Vega, A. Pastor López-Monroy and Petr Motlicek, in: Proceedings of the 6th Clinical Natural Language Processing Workshop, Mexico City, Mexico, pages 82–90, Association for Computational Linguistics, 2024

[DOI]
[URL]

Deep Clustering for Data Cleaning and Integration, Hafiz Rauf, Andre Freitas and Norman Paton, in: 27th International Conference on Extending Database Technology, 2024

Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition, Behrooz Razeghi, Parsa Rahimi and Sébastien Marcel, in: 49th IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2024

[DOI]
[URL]

Demographic Fairness Transformer for Bias Mitigation in Face Recognition, Ketan Kotwal and Sébastien Marcel, in: Proceedings of IEEE International Joint Conference on Biometrics (IJCB2024), 2024

Detecting Criminal Networks via Non-Content Communication Data Analysis Techniques from the TRACY Project, Pradeep Rangappa, Muscat Amand, Alejandra Sanchez Lara, Petr Motlicek, Michaela Antonopoulou, Ioannis Fourfouris, Antonios Skarlatos, Nikos Avgerinos, Manolis Tsangaris and Kasia Kostka, in: Digital Forensics and Cyber Crime. ICDF2C 2024, Dubrovnik, Croatia, 2024

[DOI]
[URL]

Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction, Sergio Burdisso, Srikanth Madikeri and Petr Motlicek, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, Florida, USA, pages 5421–5440, Association for Computational Linguistics, 2024

[URL]

DiffuCOMET: Contextual Commonsense Knowledge Diffusion, Silin Gao, Mete Ismayilzada, Mengjie Zhao, Hiromi Wakaki, Yuki Mitsufuji and Antoine Bosselut, in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Bangkok, Thailand, pages 4809–4831, Association for Computational Linguistics, 2024

[DOI]
[URL]

Diffusion Twigs with Loop Guidance for Conditional Graph Generation, Giangiacomo Mercatali, Yogesh Verma, Andre Freitas and Vikas Garg, in: Thirty-Eighth Annual Conference on Neural Information Processing Systems, 2024

Does Data-Efficient Generalization Exacerbate Bias in Foundation Models?, Dilermando Queiroz Neto, Anderson Carlos, Maíra Fatoretto, Luis Filipe Nakayama, André Anjos and Lilian Berton, in: Proceedings of the 18th European Conference on Computer Vision, 2024

Empowering Cross-lingual Abilities of Instruction-tuned Large Language Models by Translation-following demonstrations, Leonardo Ranaldi, Giulia Pucci and Andre Freitas, in: Findings of the ACL, 2024

Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement, Xin Quan, Marco Valentino, Louise A Dennis and Andre Freitas, in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024

Entity Matching Across Small Networks Using Node Attributes, Zahra Ahmadi, Zijian Zhang, Hoang H. Nguyen, Sergio Burdisso, Srikanth Madikeri, Petr Motlicek, Erinc Dikici, Gerhard Backfried, Marek Kovac and Daniel Kudenko, in: ECAI 2024 - 27th European Conference on Artificial Intelligence, October 19-24, 2024, Santiago de Compostela, Spain - Including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings, 2024

[DOI]

Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models, Julia Rozanova, Marco Valentino and Andre Freitas, in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection, Laurent Colbois and Sébastien Marcel, in: International Joint Conference on Biometrics, 2024

Explaining models relating objects and privacy, Alessio Xompero, Myriam Bontonou, Jean-Michel Arbona, Emmanouil Benetos and Andrea Cavallaro, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024

[URL]

Exploring generalization to unseen audio data for spoofing: insights from SSL models, Atharva Kulkarni, Hoan My Tran, Ajinkya Kulkarni, Dowerah Sandipana, Damien Lolive and Mathew Magimai-Doss, in: ISCA Proceedings, Greece, 2024

[DOI]
[URL]

Exploring Italian sentence embeddings properties through multi-tasking, Vivi Nastase, Giuseppe Samo, Chunyang Jiang and Paola Merlo, in: Tenth Italian Conference on Computational Linguistics, 2024

Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement, Vivi Nastase, Chunyang Jiang, Giuseppe Samo and Paola Merlo, in: Tenth Italian Conference on Computational Linguistics, 2024

Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions, Jordan Meadows, Tamsin James and Andre Freitas, in: Findings of EMNLP, 2024

Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following, Anshul Gupta, Pierre Vuillecard, Arya Farkhondeh and Jean-Marc Odobez, in: Int. Conf. Computer Vision and Pattern Recognition (CVPR), Workshop on Gaze Estimation and Prediction in the Wild, 2024

Extending the Cooperative Dual-Task Space in Conformal Geometric Algebra, Tobias Löw and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Robotics and Automation, 2024

Face Liveness Detection Competition (LivDet-Face) - 2024, Lambert Igene, Afzal Hossain, Mohammad Zahir Uddin Chowdhury, Humaira Rezaie, Ayden Rollins, Jesse Dykes, Rahul Vijaykumar, Alain Komaty, Sébastien Marcel, Stephanie Schuckers, Juan E. Tapia, Carlos Aravena, Daniel Schulz, Banafsheh Adami, Nima Karimian, Diogo Nunes, João Marcos, Nuno Gonçalves, Lovro Sikosek, Borut Batagelj, Aleksandr Alenin, Alhasan Alkhaddour, Anton Pimenov, Artem Tregubov, Igor Avdonin, Maxim Kazantsev, Mikhail Pozigun, Vasiliy Pryadchenko, Nima Schei, David Pabon and Manuela Tiedemann, in: IEEE International Joint Conference on Biometrics, 2024

Face Recognition Using Lensless Camera, Hatef Otroshi Shahreza, Alexandre Veuthey and Sébastien Marcel, in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024

[DOI]
[URL]

Face Reconstruction from Partially Leaked Facial Embeddings, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024

[DOI]
[URL]

Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, Iuliia Thorbecke, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Shashi Kumar, Pradeep Rangappa, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: Findings of the Association for Computational Linguistics: EMNLP 2024, pages 16747–16762, Association for Computational Linguistics (ACL), 2024

[DOI]
[URL]

Feature Representations for Automatic Meerkat Vocalization Classification, Imen Ben Mahmoud, Eklavya Sarkar, Marta Manser and Mathew Magimai-Doss, in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024

Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, Amrutha Prasad, Andrés Carofilis, Geoffroy Vanderreydt, Driss Khalil, Srikanth Madikeri, Petr Motlicek and Schüpbach Christof, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), pages 11921-11925, 2024

[DOI]

Formal Semantic Controls over Language Models, Danilo Carvalho, Yingji Zhang and Andre Freitas, in: LREC-COLING, 2024

FRCSyn Challenge at WACV 2024: Face Recognition Challenge in the Era of Synthetic Data, Alexander Unnervik, Anjith George, Christophe Ecabert, Parsa Rahimi, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, pages 892-901, 2024

[URL]

GADePo: Graph-Assisted Declarative Pooling Transformers for Document-Level Relation Extraction, Andrei Catalin Coman, Christos Theodoropoulos, Marie-Francine Moens and James Henderson, in: Proceedings of the 3rd Workshop on Knowledge Augmented Methods for NLP, Association for Computational Linguistics, 2024

[DOI]
[URL]

Generalized Policy Iteration using Tensor Approximation for Hybrid Control, Suhan Shetty, Teng Xue and Sylvain Calinon, in: International Conference on Learning Representations (ICLR), 2024

GLoFool: global enhancements and local perturbations to craft adversarial images, Mirko Agarla and Andrea Cavallaro, in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024

Graph Neural Flows for Unveiling Systemic Interactions Among Irregularly Sampled Time Series, Giangiacomo Mercatali, Andre Freitas and Jie Chen, in: Thirty-Eighth Annual Conference on Neural Information Processing Systems, 2024

Graph-Induced Syntactic-Semantic Spaces in Transformer-Based Variational AutoEncoders, Yingji Zhang, Marco Valentino, Danilo Carvalho, Ian Pratt-Hartmann and Andre Freitas, in: In Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Hardware-effective Approaches for Skill Extraction in Job Offers and Resumes, Laura Vásquez-Rodríguez, Bertrand Audrin, Samuel Michel, Samuele Galli, Julneth Rogenhofer, Jacopo Negro Cusa and Lonneke van der Plas, in: The 4th Workshop on Recommender Systems for Human Resources, in conjunction with the 18th ACM Conference on Recommender Systems, 2024

[URL]

Heterogeneous Face Recognition Using Domain Invariant Units, Anjith George and Sébastien Marcel, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

Human Interest or Conflict? Leveraging LLMs for Automated Framing Analysis in TV Shows, David Alonso del Barrio, Max Tiel and Daniel Gatica-Perez, in: ACM International Conference on Interactive Media Experiences, 2024

HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere, Hatef Otroshi Shahreza and Sébastien Marcel, in: NeurIPS Safe Generative AI Workshop 2024, 2024

[URL]

Image-guided topic modeling for interpretable privacy classification, Alina Elena Baia and Andrea Cavallaro, in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024

Impact of Speech Mode in Automatic Pathological Speech Detection, Sheikh Shakeel and Ina Kodrasi, in: EUSIPCO, IEEE, 2024

[URL]

Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders, Yingji Zhang, Danilo Carvalho, Marco Valentino, Ian Pratt-Hartmann and Andre Freitas, in: Findings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024

Inference to the Best Explanation in Large Language Models, Dhairya Dalal, Marco Valentino, Andre Freitas and Paul Buitelaar, in: In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Investigating Semantic Segmentation Models to Assist Visually Impaired People, Michael Villamizar, Olivier Canévet and Jean-Marc Odobez, in: European Conference on Computer Vision - Workshops, 2024

Latent Enhancing AutoEncoder for Occluded Image Classification, Ketan Kotwal, in: Proceedings of International Conference on Image Processing, 2024

Learning About Social Context from Smartphone Data: Generalization Across Countries and Daily Life Moments, Aurel Ruben Mader, Lakmal Buddika Meegahapola and Daniel Gatica-Perez, in: Proc. ACM Conference on Human Factors in Computing Systems, 2024

Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks, Yingji Zhang, Danilo Carvalho and Andre Freitas, in: The 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Learning Goal-oriented Bimanual Dough Rolling Using Dynamic Heterogeneous Graph Based on Human Demonstration, J. Liu, C. Li, S. Wang, Z. Dong, Z. Tang, Sylvain Calinon, M. Li and F. Chen, in: In Proc. IEEE Intl Conf. on Robotics and Biomimetics (ROBIO), 2024

Logic-Skill Programming: An Optimization-based Approach to Sequential Skill Planning, Teng Xue, Amirreza Razmjoo, Suhan Shetty and Sylvain Calinon, in: Proc. Robotics: Science and Systems (RSS), 2024

Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions, Dairazalia Sanchez-Cortes, Sergio Burdisso, Esaú Villatoro-Tello and Petr Motlicek, in: Proceedings of the 15th International Conference of the CLEF Association: Experimental IR Meets Multilinguality, Multimodality, and Interaction, Grenoble, France, pages 127-138, Springer Nature Switzerland, 2024

[DOI]
[URL]

Mitigating Demographic Bias in Face Recognition via Regularized Score Calibration, Ketan Kotwal and Sébastien Marcel, in: IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, IEEE/CVF, 2024

Modality Agnostic Heterogeneous Face Recognition with Switch Style Modulators, Anjith George and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics, 2024

Model Pairing Using Embedding Translation for Backdoor Attack Detection on Open-Set Classification Tasks, Alexander Unnervik, Hatef Otroshi Shahreza, Anjith George and Sébastien Marcel, in: NeurIPS Safe Generative AI Workshop 2024, 2024

MTGS: A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction, Anshul Gupta, Samy Tafasca, Arya Farkhondeh, Pierre Vuillecard and Jean-Marc Odobez, in: 38th Conf. on Neural Information Processing System, 2024

Multi-Operational Mathematical Derivations in Latent Space, Marco Valentino, Jordan Meadows, Lan Zhang and Andre Freitas, in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions, Marco Valentino, Danilo Carvalho and Andre Freitas, in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024

Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers, Shashi Kumar, Srikanth Madikeri, Nigmatulina Iuliia, Esaú Villatoro-Tello, Petr Motlicek, Karthik Pandia D S, S. Pavankumar Dubagunta and Aravind Ganapathiraju, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12592-12596, IEEE, 2024

[DOI]
[URL]

Neural Redshift: Random Networks are not Random Functions, Damien Teney, Armand Mihai Nicolicioiu, Valentin Hartmann and Ehsan Abbasnejad, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

Neurocomputational model of speech recognition for pathological speech detection: a case study on Parkinson’s disease speech detection, Sevada Hovsepyan and Mathew Magimai-Doss, in: Proceedings of Interspeech, Kos Island, Greece, pages 3590-3594, 2024

[DOI]
[URL]

Nonparametric Variational Regularisation of Pretrained Transformers, Fabio Fehr and James Henderson, in: First conference on Language Modelling, 2024

[URL]

Normalizing Flows for Speaker and Language Recognition Backend, Aleix Espuña, Amrutha Prasad, Petr Motlicek, Srikanth Madikeri and Schüpbach Christof, in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis, Eklavya Sarkar and Mathew Magimai-Doss, in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024

Open-Vocabulary Object 6D Pose Estimation, Jaime Corsetti, Davide Boscaini, Changjae Oh, Andrea Cavallaro and Fabio Poiesi, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

[URL]

OptoMechanical Modulation Tomography for Ungated Compressive Cardiac Light Sheet Microscopy, François Marelli and Michael Liebling, in: 2024 IEEE International Symposium on Biomedical Imaging (ISBI), Athens, Greece, pages 1--4, 2024

[DOI]
[URL]

OptoMechanical Modulation Tomography for Ungated Compressive Cardiac Light Sheet Microscopy, François Marelli and Michael Liebling, in: 2024 IEEE International Symposium on Biomedical Imaging (ISBI), Athens, Greece, pages 1--4, 2024

[DOI]
[URL]

Parametric point spread function estimation for thermal imaging systems using easy-to-manufacture random pattern targets, Florian Piras, Edouard De Moura Presa, Peter Wellig and Michael Liebling, in: Target and Background Signatures X: Traditional Methods and Artificial Intelligence, pages 1319905-(1-9), SPIE, 2024

[DOI]
[URL]

Predicting Heart Activity from Speech using Data-driven and Knowledge-based features, Gasser Elbanna, Zohreh Mostaani and Mathew Magimai-Doss, in: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2024

Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification, Esaú Villatoro-Tello, Srikanth Madikeri, Bidisha Sharma, Driss Khalil, Shashi Kumar, Nigmatulina Iuliia, Petr Motlicek and Aravind Ganapathiraju, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12617-12621, IEEE, 2024

[DOI]
[URL]

ProGAP: Progressive Graph Neural Networks with Differential Privacy Guarantees, Sina Sajadmanesh and Daniel Gatica-Perez, in: The 17th ACM International Conference on Web Search and Data Mining, 2024

Reasoning with Natural Language Explanations, Marco Valentino and Andre Freitas, in: In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts, 2024

Recursive Forward Dynamics for Serial Kinematic Chains using Conformal Geometric Algebra, Tobias Löw and Sylvain Calinon, in: In Proc. Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2024

Refining Tuberculosis Detection in CXR Imaging: Addressing Bias in Deep Neural Networks via Interpretability, Özgür Güler, Manuel Günther and André Anjos, in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024

[DOI]
[URL]

Reliability Estimation of News Media Sources: Birds of a Feather Flock Together, Sergio Burdisso, Dairazalia Sanchez-Cortes, Esaú Villatoro-Tello and Petr Motlicek, in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Mexico City, Mexico, pages 6900–6918, Association for Computational Linguistics, 2024

[DOI]
[URL]

Representing Robot Geometry as Distance Fields: Applications to Whole-body Manipulation, Yiming Li, Yan Zhang, Amirreza Razmjoo and Sylvain Calinon, in: IEEE International Conference on Robotics and Automation, 2024

Robust Manipulation Primitive Learning via Domain Contraction, Teng Xue, Amirreza Razmjoo Fard, Suhan Shetty and Sylvain Calinon, in: Proceedings of Conference on Robot Learning, 2024

ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, Petr Motlicek, Erinc Dikici, Srikanth Madikeri, Pradeep Rangappa, Miroslav Janosik, Gerhard Backfried, Dorothea Thomas-Aniola, Maximilian Schurz, Johan Rohdin, Petr Schwarz, Marek Kovac, Květoslav Malý, Dominik Boboš, Mathias Leibiger, Costas Kalogiros, Andreas Alexopoulos, Daniel Kudenko, Zahra Ahmadi, Hoang H. Nguyen, Aravind Krishnan, Dawei Zhu, Dietrich Klakow, Maria Jofre, Francesco Calderoni, Denis Marraud, Nikolaos Koutras, Nikos Nikolau, Christiana Apostiki, Panagiotis Douris, Konstantinos Gkountas, Eleni Sergidou, Wauter Bosma, Joshua Hughues and Hellenic Police Team, in: Odyssey 2024: The Speaker and Language Recognition Workshop, pages 17-24, 2024

[DOI]
[URL]

σ-GPTs: A New Approach to Autoregressive Models., Arnaud Pannatier, Evann Courdier and Francois Fleuret, in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024

Score Normalization for Demographic Fairness in Face Recognition, Yu Linghu, Tiago de Freitas Pereira, Christophe Ecabert, Sébastien Marcel and Manuel Günther, in: IEEE International Joint Conference on Biometrics (IJCB 2024), 2024

SDFR: Synthetic Data for Face Recognition Competition, Hatef Otroshi Shahreza, Christophe Ecabert, Anjith George, Alexander Unnervik and Sébastien Marcel, in: 2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG), IEEE, 2024

[DOI]
[URL]

SDFR: Synthetic Data for Face Recognition Competition, Hatef Otroshi Shahreza, Christophe Ecabert, Anjith George, Alexander Unnervik and Sébastien Marcel, in: IEEE FG 2024 : 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data, Hatef Otroshi Shahreza, Anjith George, Alexander Unnervik, Parsa Rahimi and Sébastien Marcel, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pages 3173-3183, 2024

[URL]

Segmenting Object Affordances: Reproducibility and Sensitivity to Scale, Tommaso Apicella, Alessio Xompero, Paolo Gastaldo and Andrea Cavallaro, in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024

Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup, Damien Teney, Jindong Wang and Ehsan Abbasnejad, in: International Conference on Machine Learning (ICML), 2024

[URL]

Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models, Andre Freitas and Leonardo Ranaldi, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials, Mael Jullien, Marco Valentino and Andre Freitas, in: In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), 2024

Sharingan: A Transformer Architecture for Multi-Person Gaze Following, Samy Tafasca, Anshul Gupta and Jean-Marc Odobez, in: Int. Conference Computer Vision and Pattern Recognition (CVPR), Seatle, 2024

Sparse multi-view hand-object reconstruction for unseen environments, Yik Lung Pang, Changjae Oh and Andrea Cavallaro, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024

[URL]

Sparse Optical Sampling in the Close Proximity of a Robotic Arm, Martin Laurenzis, Ante Marić, Emmanuel Bacher, Mateusz Pietrzak, Stéphane Schertzer, Francesco Grella and Sylvain Calinon, in: Springer Proceedings in Advanced Robotics, 2024

Speech and Language Recognition with Low-rank Adaptation of Pretrained Models, Amrutha Prasad, Srikanth Madikeri, Driss Khalil, Petr Motlicek and Schüpbach Christof, in: Interspeech 2024, pages 2825--2829, 2024

[DOI]
[URL]

Suppressing Noise Disparity in Training Data for Automatic Pathological Speech Detection, Mahdi Amiri and Ina Kodrasi, in: IWAENC, 2024

SYLLABLE LEVEL FEATURES FOR PARKINSON'S DISEASE DETECTION FROM SPEECH, Sevada Hovsepyan and Mathew Magimai-Doss, in: ICASSP, 2024

Synergizing Natural Language Towards Enhanced Shared Autonomy, Shalutha Rajapakshe, Atharva Dastenavar and Emmanuel Senft, in: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024

[URL]

Synthetic to Authentic: Transferring Realism to 3D Face Renderings for Boosting Face Recognition, Parsa Rahimi, Behrooz Razeghi and Sébastien Marcel, in: European Conference on Computer Vision Workshops, 2024

Temporal fine-tuning for early risk detection, Horacio Thompson, Esaú Villatoro-Tello, Manuel Montes-y-Gómez and Marcelo Errecalde, in: Memorias De Las JAIIO, Argentina, pages 137-149, 2024

[URL]

TESS: Text-to-text selfconditioned simplex diffusion, Rabeeh Karimi Mahabadi, Hamish Ivison, Jaesung Tae, James Henderson, Iz Beltagy, Matthew Peters and Arman Cohan, in: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2347–2361, Association for Computational Linguistics, 2024

Test-time adaptation for automatic pathological speech detection in noisy environments, Mahdi Amiri and Ina Kodrasi, in: EUSIPCO, 2024

TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 20988–20995, Association for Computational Linguistics (ACL), 2024

[DOI]
[URL]

Toward Semantic Gaze Target Detection, Samy Tafasca, Anshul Gupta, Victor Bros and Jean-Marc Odobez, in: 38th Conf. on Neural Information Processing System, 2024

Towards interfacing large language models with ASR systems using confidence measures and prompting, Maryam Naderi, Enno Hermann, Alexandre Nanchen, Sevada Hovsepyan and Mathew Magimai-Doss, in: Proceedings of Interspeech, pages 2980-2984, 2024

[DOI]

Towards Robo-Coach: Robot Interactive Stiffness/Position Adaptation for Human Strength and Conditioning Training, C. Li, X. Wu, T. Teng, Sylvain Calinon and F. Chen, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2024

Towards Wine Tasting Activity Recognition for a Digital Sommelier, Mario Parra, Jesus Favela, Luis Castro and Daniel Gatica-Perez, in: Proc. ACM Workshop on Exploring Innovative Technology for Commensality and Human-Food Interaction, 2024

Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification, Vivi Nastase and Paola Merlo, in: Proceedings of the 9th Workshop on Representation Learning for NLP, 2024

[URL]

Understanding the effects of language-specific class imbalance in multilingual fine-tuning, Vincent Jung and Lonneke van der Plas, in: Findings of the European chapter of Association for Computational Linguistics, 2024, 2024

Unveiling Biases while Embracing Sustainability: Assessing the Dual Challenges of Automatic Speech Recognition Systems, Ajinkya Kulkarni, Atharva Kulkarni, Miguel Couceiro and Isabel Trancoso, in: ISCA proceedings, Greece, pages 4, 2024

[DOI]
[URL]

Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities, Hatef Otroshi Shahreza and Sébastien Marcel, in: NeurIPS Workshop on New Frontiers in Adversarial Machine Learning, 2024

Using Backbone Foundation Model for Evaluating Fairness in Chest Radiography Without Demographic Data, Dilermando Queiroz Neto, André Anjos and Lilian Berton, in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention, 2024

[DOI]
[URL]

Vascular Biometrics Experiments on Candy -- A New Contactless Finger-Vein Dataset, Sushil Bhattacharjee, David Geissbuhler, G. Clivaz, Ketan Kotwal and Sébastien Marcel, in: Proceedings of the International Conference on Pattern Recognition (ICPR), Calcutta (India), 2024

Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving, Xin Quan, Marco Valentino, Louise A Dennis and Andre Freitas, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Vulnerability of Face Age Verification to Replay Attacks, Pavel Korshunov, Anjith George, Gökhan Özbulak and Sébastien Marcel, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024

Weakly-supervised Autism Severity Assessment in Long Videos, Abid Ali, Mahmoud Ali, Camilla Barbini, Séverine Dubuisson, Jean-Marc Odobez, Francois Bremond and Suzanne Thümmler, in: International Conference on Content-based Multimedia Indexing, 2024

What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark, Adham Ibrahim, Shady Shehata, Ajinkya Kulkarni, Mukhtar Mohamed and Muhammad Abdul-Mageed, in: ISCA proceedings, Greece, 2024

[DOI]
[URL]

A benchmark for the simulation of meshed district heating networks based on anonymised monitoring data, Roberto Boghetti and Jérôme Kämpf, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023

[DOI]
[URL]

A Machine Learning Model for the Prediction of Building Hourly Heating Demand from CityGML Files: Training Workflow and Deployment as an API, Marco Tognoli, Giuseppe Peronato and Jérôme Kämpf, in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, pages 2932 - 2939, 2023

[DOI]
[URL]

A Multitask and Kernel approach for Learning to Push Objects with a Task-Parameterized Deep Q-Network, Marco Ewerton, Michael Villamizar, Julius Jankowski, Sylvain Calinon and Jean-Marc Odobez, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023

A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence, Siddarth Chandrasekar, Arvind Ramesh, Tilak Purohit and Prasanta Kumar Ghosh, in: Proceedings of Interspeech, Dublin, Ireland, ISCA, 2023

A VAE for Transformers with Nonparametric Variational Information Bottleneck, James Henderson and Fabio Fehr, in: The Eleventh International Conference on Learning Representations, 2023

[URL]

Affordance segmentation of hand-occluded containers from exocentric images, Tommaso Apicella, Alessio Xompero, Edoardo Ragusa, Riccardo Berta, Andrea Cavallaro and Paolo Gastaldo, in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023

[DOI]
[URL]

Approximating Optimal Morphing Attacks using Template Inversion, Laurent Colbois, Hatef Otroshi Shahreza and Sébastien Marcel, in: IEEE International Joint Conference on Biometric, 2023

[DOI]

Automatic Speech Analysis Framework for ATC Communication in HAAWAII, Petr Motlicek, Amrutha Prasad, Nigmatulina Iuliia, Hartmut Helmke, Oliver Ohneiser and Matthias Kleinert, in: 13th SESAR Innovation Days, 2023

Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers’ Workload, Hartmut Helmke, Matthias Kleinert, Nils Ahrenhold, heiko Ehr, Thorsten Mühlhausen, Oliver Ohneiser, Petr Motlicek, Amrutha Prasad, Juan Zuluaga-Gomez, Lucas Klamert, Jelena Dokic and Ella Pinska Chauvin, in: Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023), Eurocontrol (Europe), FAA (U.S.), Savannah, Georgia, USA, 2023

[URL]

BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek, Karel Ondřej and Oliver Ohneiser, in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023

[URL]

Black-box Attacks on Image Activity Prediction and its Natural Language Explanations, Alina Elena Baia, Valentina Poggioni and Andrea Cavallaro, in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023

[DOI]
[URL]

Blackbird Language Matrices Tasks for Generalization, Paola Merlo, Chunyang Jiang, Giuseppe Samo and Vivi Nastase, in: Proceedings of the 1st GenBench Workshop on (Benchmarking) Generalisation in NLP, ACL, 2023

Blackbox Face Reconstruction from Deep Facial Embeddings Using A Different Face Recognition Model, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the IEEE International Conference on Image Processing (ICIP), Kuala Lumpur, Malaysia, pages 2435-2439, 2023

[DOI]
[URL]

BLESS: Benchmarking Large Language Models on Sentence Simplification, Tannon Kew, Alison Chi, Laura Vásquez-Rodríguez, Sweta Agrawal, Dennis Aumiller, Fernando Alva-Manchego and Matthew Shardlow, in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023

BLM-AgrF: A New French Benchmark to Investigate Generalization of Agreement in Neural Networks., Aixiu An, Chunyang Jiang, Maria A. Rodriguez, Vivi Nastase and Paola Merlo, in: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

BLM-s/lE: A structured dataset of English spray-load verb alternations for testing generalization in LLMs., Giuseppe Samo, Vivi Nastase, Chunyang Jiang and Paola Merlo, in: Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, Anjith George and Sébastien Marcel, in: IJCB, 2023

Building Structured Synthetic Datasets: The Case of Blackbird Language Matrices (BLMs), Paola Merlo, Giuseppe Samo, Vivi Nastase and Chunyang Jiang, in: Proceedings of the 9th Italian Conference on Computational Linguistics, 2023

Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding, Mutian He and Philip N. Garner, in: Proc. INTERSPEECH 2023, pages 1109-1113, 2023

[DOI]

Can Language Models Learn Analogical Reasoning? Investigating Training Objectives and Comparisons to Human Performance, Molly R. Petersen and Lonneke van der Plas, in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, Association for Computational Linguistics, 2023

Can personalised hygienic masks be used to attack face recognition systems?, Alain Komaty, Vedrana Krivokuca, Christophe Ecabert and Sébastien Marcel, in: Proceedings of IEEE International Joint Conference on Biometrics (IJCB2023), 2023

Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?, Eklavya Sarkar and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2023

ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour, Samy Tafasca, Anshul Gupta and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023

CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice, Juan Zuluaga-Gomez, Ahmed Sara, Visockas Danielius and Subakan Cem, in: Proc. Interspeech 2023, 2023

[URL]

Complex Daily Activities, Country-Level Diversity, and Smartphone Sensing: A Study in Denmark, Italy, Mongolia, Paraguay, and UK, Karim Assi, Lakmal Buddika Meegahapola, William Droz, PETER KUN, Amalia de Götzen, Miriam Bidoglia, Sally Stares, George Gaskell, Altangerel Chagnaa, Amarsanaa Ganbold, Tsolmon Zundui, Carlo Caprini, Daniele Miorandi, José Luis Zarza, Alethia Hume, Luca Cernuzzi, Ivano Bison, Marcelo Rodas Britez, Matteo Busso, Ronald Chenu-Abente, Fausto Giunchiglia and Daniel Gatica-Perez, in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI), Association for Computing Machinery, 2023

[DOI]

Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training, Mrinmoy Bhattacharjee, Petr Motlicek, Nigmatulina Iuliia, Hartmut Helmke, Oliver Ohneiser, Matthias Kleinert and heiko Ehr, in: Proc. 13th SESAR Innovation Days, Seville, Spain, 2023

[DOI]
[URL]

Data-driven Urban Building Energy Modeling with Machine Learning in Satom (CH), Ahad Montazeri, Jérôme Kämpf and Guglielmina Mutani, in: 6th International IEEE Conference AND Workshop in Obuda on Electrical and Power Engineering, 2023

Demonstration-guided Optimal Control for Long-term Non-prehensile Planar Manipulation, Teng Xue, Hakan Girgin, Teguh Santoso Lembono and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 4999-5005, 2023

[DOI]

Diffusion Transformer for Adaptive Text-to-Speech, Haolin Chen and Philip N. Garner, in: Proc. 12th ISCA Speech Synthesis Workshop (SSW 12), 2023

[DOI]

Document-level Text Simplification with Coherence Evaluation, Laura Vásquez-Rodríguez, Matthew Shardlow, Piotr Przybyla and Sophia Ananiadou, in: Proceedings of the Second Workshop on Text Simplification, Accessibility and Readability, 2023

EFaR 2023: Efficient Face Recognition Competition, Anjith George, Christophe Ecabert, Hatef Otroshi Shahreza, Ketan Kotwal and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, Esaú Villatoro-Tello, Srikanth Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia, Petr Motlicek, Alexei V. Ivanov and Aravind Ganapathiraju, in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Efficient Grapevine Structure Estimation in Vineyards Conditions, Théophile Gentilhomme, Michael Villamizar, Jérome Corre and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, pages 712--720, 2023

[URL]

End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation, Juan Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Sundararajan Srinavasan, Prashant Mathur, Brian Thompson and Marcello Federico, in: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, 2023

[URL]

Energy assessment of a district by integrating solar thermal in district heating network: a dynamic analysis approach, Matteo Bilardo, Jérôme Kämpf and Enrico Fabrizio, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023

[DOI]
[URL]

Enhancing Multi-modal Classification of Violent Events using Image Captioning, Daniel Vallejo-Aldana, A. Pastor López-Monroy and Esaú Villatoro-Tello, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), CEUR Workshop Proceedings, Jaén, Spain, 2023

[URL]

Enhancing user acceptance in automated systems with human-centric lighting: the role of visual comfort, personality, and preference, Michael Papinutto, Moreno Colombo, Roberto Boghetti, Chantal Basurto, Kornelius Reutter, Denis Lalanne, Jérôme Kämpf and Julien Nembrini, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023

[DOI]
[URL]

ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields, Mohammad Mahdi Johari, Camilla Carta and Francois Fleuret, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), pages 17408-17419, 2023

[DOI]

Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework, David Alonso del Barrio and Daniel Gatica-Perez, in: 2nd ACM International Workshop on Multimedia AI against Disinformation (MAD '23), June 12, 2023, Thessaloniki, Greece, 2023

Face Reconstruction from Facial Templates by Learning Latent Space of a Generator Network, Hatef Otroshi Shahreza and Sébastien Marcel, in: Thirty-seventh Conference on Neural Information Processing Systems, 2023

[URL]

Factors that Affect Personalization of Robots for Older Adults, Laura Stegner, Emmanuel Senft and Bilge Mutlu, in: CONCATENATE Workshop at HRI 2023 in Stockholm, Sweden, 2023

[URL]

Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, Enno Hermann and Mathew Magimai-Doss, in: Proceedings of Interspeech, pages 156-160, 2023

[DOI]
[URL]

Findings of the IWSLT 2023 evaluation campaign, Milind Agarwal, Sweta Agarwal, Antonios Anastasopoulos, Luisa Bentivogli, Ondrej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Esteve, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, David Javorsky, John Judge, Yasumasa Kano, Tom Ko, Rishu Kumar, Pengwei Li, Xutai Ma, Prashant Mathur, Evgeny Matusov, Paul McNamee, John P. McCrae, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Ha Nguyen, Jan Niehues, Xing Niu, Atul Kr. Ojha, John E. Ortega, Proyag Pal, Juan Pino, Lonneke van der Plas, Peter Polak, Elijah Rippeth, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stuker, Katsuhito Sudoh, Yun Tang, Brian Thompson, Kevin Tran, Marco Turchi, Alex Waibe, Mingxuan Wang, Shinji Watanabe and Rodolfo Zevallos, in: Proceedings of the IWSLT conference, 2023

Framing the News: From Human Perception to Large Language Model Inferences, David Alonso del Barrio and Daniel Gatica-Perez, in: International Conference on Multimedia Retrieval (ICMR '23), June 12--15, 2023, Thessaloniki, Greece, 2023

Fully Automatic Grading of Retinal Vasculitis on Fluorescein Angiography Time-lapse from Real-world Data in Clinical Settings, Victor Amiot, Oscar Jimenez-del-Toro, Pauline Eyraud, Yan Guex-Crosier, Ciara Bergin, André Anjos, Florence Hoogewoud and Mattia Tomasoni, in: 2023 IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS), L'Aquila, Italy, 2023, pages 689-693, 2023

[DOI]
[URL]

GAP: Differentially Private Graph Neural Networks with Aggregation Perturbation, Sina Sajadmanesh, Ali Shahin Shamsabadi, Aurélien Bellet and Daniel Gatica-Perez, in: 32nd USENIX Security Symposium (USENIX Security 23), 2023

How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, Juan Zuluaga-Gomez, Amrutha Prasad, Nigmatulina Iuliia, Seyyed Saeed Sarfjoo, Petr Motlicek, Matthias Kleinert, Hartmut Helmke, Oliver Ohneiser and Qingran Zhan, in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023

[URL]

Human-Robot Collaboration in a Sanding Task, Anna Konstant, Nitzan Orr, Michael Hagenow, Emmanuel Senft, Isabelle Gundrum, Bilge Mutlu, Michael Zinn, Michael Gleicher and Robert Radwin, in: Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 2023

HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition, Florian Mai, Juan Zuluaga-Gomez, Titouan Parcollet and Petr Motlicek, in: Proc. Interspeech 2023, Ireland, 2023

HyperMixer: An MLP-based Low Cost Alternative to Transformers, Florian Mai, Arnaud Pannatier, Fabio Fehr, Haolin Chen, François Marelli, Francois Fleuret and James Henderson, in: Proc. of the 61st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Toronto, Canada, pages 15632-15654, 2023

[DOI]

ID and OOD performance are sometimes inversely correlated on real-world datasets, Damien Teney, Yong Lin, Seong Joon Oh and Ehsan Abbasnejad, in: Advances in Neural Information Processing Systems (NeurIPS), 2023

Implementing contextual biasing in GPU decoder for online ASR, Nigmatulina Iuliia, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motlicek, Juan Zuluaga-Gomez, Karthik Pandia D S and Aravind Ganapathiraju, in: Proc. Interspeech 2023, pages 4494--4498, 2023

[DOI]
[URL]

Implicit phonetic information modeling for speech emotion recognition, Tilak Purohit, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of Interspeech, Dublin, Ireland, ISCA, 2023

International Conference on the Voynich Manuscript 2022, Colin Layfield, René Zandbergen, Lisa Fagin Davis, John Abela, Claire Bowern, Michael Rosner and Lonneke van der Plas, in: Proceedings of the International Conference on Historical Cryptology, 2023

Inversion of Deep Facial Templates using Synthetic Data, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the IEEE International Joint Conference on Biometric, 2023

[DOI]
[URL]

Keep Sensors in Check: Disentangling Country-Level Generalization Issues in Mobile Sensor-Based Models with Diversity Scores, Alexandre Nanchen, Lakmal Buddika Meegahapola, William Droz and Daniel Gatica-Perez, in: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023

Learning Disentangled Representations for Natural Language Definitions, Danilo Carvalho, Giangiacomo Mercatali, Yingji Zhang and Andre Freitas, in: In Findings of the European chapter of Association for Computational Linguistics, 2023

Learning diverse features in vision transformers for improved generalization, Armand Mihai Nicolicioiu, Andrei Liviu Nicolicioiu, Bogdan Alexe and Damien Teney, in: ICML 2023: The Second Workshop on Spurious Correlations, Invariance and Stability, 2023

[URL]

Learning Joint Space Reference Manifold for Reliable Physical Assistance, Amirreza Razmjoo, Tilen Brecelj, Kristina Savevska, Ales Ude, Tadej Petric and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 10412-10417, 2023

[DOI]

Learning to Abstract with Nonparametric Variational Information Bottleneck, Melika Behjati, Fabio Fehr and James Henderson, in: The 2023 Conference on Empirical Methods in Natural Language Processing, 2023

[URL]

Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks, Luca Scimeca, Alexander Rubinstein, Armand Mihai Nicolicioiu, Damien Teney and Yoshua Bengio, in: NeurIPS Workshop on Diffusion Models, 2023

[URL]

MLP-Hash: Protecting Face Templates via Hashing of Randomized Multi-Layer Perceptron, Hatef Otroshi Shahreza, Vedrana Krivokuca and Sébastien Marcel, in: Proceedings of the 31st European Signal Processing Conference, Helsinki, Finland, 2023

[DOI]
[URL]

Multi-image deconvolution of thermal images with a boundary condition weighting scheme, Florian Piras, François Marelli, Edouard De Moura Presa, Peter Wellig and Michael Liebling, in: Target and Background Signatures IX, International Society for Optics and Photonics, Amsterdam, pages 149-158, SPIE, 2023

[DOI]
[URL]

Multi-IVE: Privacy Enhancement of Multiple Soft-Biometrics in Face Embeddings, Pietro Melzi, Hatef Otroshi Shahreza, Christian Rathgeb, Ruben Tolosana, Ruben Vera-Rodriguez, Julian Fierrez, Sébastien Marcel and Christoph Busch, in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

[URL]

NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports, Mael Jullien, Marco Valentino, Hannah Frost, Paul O'Reagan, Donal Landers and Andre Freitas, in: Proceedings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023

Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, Sergio Burdisso, Esaú Villatoro-Tello, Srikanth Madikeri and Petr Motlicek, in: Proceedings of Interspeech, 2023

On Interventional Probing in High Dimensions: An NLI Case Study, Julia Rozanova, Marco Valentino, Lucas Cordeiro and Andre Freitas, in: Findings of the 17th European Chapter of the Association for Computational Linguistics, 2023

Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, Geoffroy Vanderreydt, Amrutha Prasad, Driss Khalil, Srikanth Madikeri, Kris Demuynck and Petr Motlicek, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023

[DOI]

Potential for district heating networks from waste heat: an assessment tool and its application to sewage treatment plants in the Canton of Zurich, Giuseppe Peronato and Jérôme Kämpf, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023

[DOI]
[URL]

Quantified Canine: Inferring Dog Personality From Wearables, Lakmal Buddika Meegahapola, Marios Constantinides, Zoran Radivojevic, Hongwei Li, Daniele Quercia and Michael S. Eggleston, in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI) 2023, Association for Computing Machinery, 2023

[DOI]

Referencing in YouTube Knowledge Communication Videos, Haeeun Kim and Daniel Gatica-Perez, in: ACM International Conference on Interactive Media Experiences (IMX '23), June 2023, Nantes, France, 2023

Remote Cancelable Biometric System for Verification and Identification Applications, Hatef Otroshi Shahreza, Amina Bassit, Sébastien Marcel and Raymond Veldhuis, in: Proceedings of the International Conference of the Biometrics Special Interest Group (BIOSIG), 2023

[DOI]
[URL]

Robust Execution of Assembly Policies Using a Pose Invariant Task Representation, Bojan Nemec, Matevz Hrovat, Mihael Simonič, Suhan Shetty, Sylvain Calinon and Ales Ude, in: 20th International Conference on Ubiquitous Robots (UR), Honolulu, HI, USA, IEEE, 2023

RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question, Alireza Mohammadshahi, Thomas Scialom, Majid Yazdani, Pouya Yanki, Angela Fan, James Henderson and Marzieh Saeidi, in: Association for Computational Linguistics: ACL 2023, Toronto, Canada, 2023

[URL]

SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data, Mael Jullien, Marco Valentino, Hannah Frost, Paul O'Reagan, Donal Landers and Andre Freitas, in: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Association for Computational Linguistics, 2023

[DOI]
[URL]

Shortcut Bias Mitigation via Ensemble Diversity Using Diffusion Probabilistic Models, Luca Scimeca, Alexander Rubinstein, Damien Teney, Seong Joon Oh, Armand Mihai Nicolicioiu and Yoshua Bengio, in: Under review, 2023

[URL]

Situated Participatory Design: A Method for In Situ Design of Robotic Interaction with Older Adults, Laura Stegner, Emmanuel Senft and Bilge Mutlu, in: CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

[DOI]
[URL]

SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph Transformer, J. Liu, Z. Li, Sylvain Calinon and F. Chen, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023

Strong and Efficient Baselines for Open Domain Conversational Question Answering, Andrei Catalin Coman, Gianni Barlacchi and Adrià de Gispert, in: Findings of EMNLP, Association for Computational Linguistics, 2023

[DOI]
[URL]

Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling, Alireza Mohammadshahi and James Henderson, in: Procceedings of 8th Workshop on Representation Learning for NLP, 2023

[URL]

SynthDistill: Face Recognition with Knowledge Distillation from Synthetic Data, Hatef Otroshi Shahreza, Anjith George and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023

[DOI]

Template Inversion Attack against Face Recognition Systems using 3D Face Reconstruction, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 19662-19672, 2023

[DOI]
[URL]

The AI4Autism Project: A Multimodal and Interdisciplinary Approach to Autism Diagnosis and Stratification, Samy Tafasca, Anshul Gupta, Nada Kojovic, Mirko Gelsomini, Thomas Maillart, Michela Papandrea, Marie Schaer and Jean-Marc Odobez, in: Companion Publication of the 25th International Conference on Multimodal Interaction, Paris, France, pages 414–425, Association for Computing Machinery, 2023

[DOI]
[URL]

The Idiap Speech Synthesis System for the Blizzard Challenge 2023, Haolin Chen, Mutian He, Louise Coppieters de Gibson and Philip N. Garner, in: Proc. 18th Blizzard Challenge Workshop, 2023

[DOI]

The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation, Mutian He and Philip N. Garner, in: Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 4408-4423, Association for Computational Linguistics, 2023

[DOI]

The Unconstrained Ear Recognition Challenge 2023: Maximizing Performance and Minimizing Bias, Anjith George and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023

Toward responsible face datasets: modeling the distribution of a disentangled latent space for sampling face images from demographic groups, Parsa Rahimi, Christophe Ecabert and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics, 2023

Towards Improved Replicability of Human Studies in Human-Robot Interaction: Recommendations for Formalized Reporting, Shelly Bagshy, Patrick Holthaus, Gloria Beraldo, Emmanuel Senft, Daniel Hernandez Garcia, Zhao Han, Suresh Kumaar Jayaraman, Alessandra Rossi, Connor Esterwood, Antonio Andriella and Paul Pridham, in: Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, pages 629-633, 2023

Towards learning emotion information from short segments of speech, Tilak Purohit, Sarthak Yadav, Bogdan Vlasenko, S. Pavankumar Dubagunta and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes Island, Greece, IEEE, 2023

Transformers as Graph-to-Graph Models, James Henderson, Alireza Mohammadshahi, Andrei Catalin Coman and Lesly Miculicich, in: Big Picture Workshop at EMNLP 2023, 2023

Transformers, Tables and Frame Semantics, Mario Ramirez, Alex Bogatu, Norman Paton and Andre Freitas, in: International Conference on Semantic Computing, 2023

Understanding the Social Context of Eating with Multimodal Smartphone Sensing: The Role of Country Diversity, Nathan Kammoun, Lakmal Buddika Meegahapola and Daniel Gatica-Perez, in: 25th ACM International Conference on Multimodal Interaction, 2023

[DOI]
[URL]

Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report, Timothy Piton, Enno Hermann, Angela Pasqualotto, Marjolaine Cohen, Mathew Magimai-Doss and Daphné Bavelier, in: Proceedings of Interspeech, pages 4573-4577, 2023

[DOI]
[URL]

Verification of PyDHN - a Python library for the thermo-hydraulic simulation of district heating networks - through the DESTEST, Roberto Boghetti, Giuseppe Peronato and Jérôme Kämpf, in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, IBPSA, IBPSA, 2023

[DOI]
[URL]

VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, Julius Jankowski, Lara Brudermuller, Nick Hawes and Sylvain Calinon, in: Proc. IEEE International Conference on Robotics and Automation (ICRA), 2023

Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes, Pavel Korshunov, Haolin Chen, Philip N. Garner and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics, 2023

Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines, Hamed Damirchi, Cristian Rodriguez-Opazo, Ehsan Abbasnejad, Damien Teney, Javen Qinfeng Shi, Stephen Gould and Anton van den Hengel, in: Under review, 2023

[URL]

A Corpus and Evaluation for Predicting Semi-Structured Human Annotations, Andreas Marfurt, Ashley Thornton, David Sylvan, Lonneke van der Plas and James Henderson, in: Workshop on Generation, Evaluation and Metrics (GEM), 2022

A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings, Anshul Gupta, Samy Tafasca and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

A two-step approach to leverage contextual data: speech recognition in air-traffic communications, Nigmatulina Iuliia, Juan Zuluaga-Gomez, Amrutha Prasad, Seyyed Saeed Sarfjoo and Petr Motlicek, in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6282-6286, IEEE, 2022

[DOI]
[URL]

Active Learning by Feature Mixing, Amin Parvaneh, Ehsan Abbasnejad, Damien Teney, Reza Haffari, Anton van den Hengel and Javen Qinfeng Shi, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Adversarial-free speaker identity-invariant representation learning for automatic dysarthric speech classification, Parvaneh Janbakhshi and Ina Kodrasi, in: Annual Conference of the International Speech Communication Association, 2022

An anomaly detection approach for backdoored neural networks: face recognition as a case study, Alexander Unnervik and Sébastien Marcel, in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), Darmstadt, Germany, 2022

An Empirical Comparison of Semantic Similarity Methods for Analyzing down-streaming Automatic Minuting task, Aditya Upadhyay, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In ACL Anthology Proceedings, 2022

An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36) , In proceedings of ACL Anthology, 2022

An exploratory interplay between daylight, general and task lighting for visual comfort and electricity savings in a personal office space, Chantal Basurto, Michael Papinutto, Moreno Colombo, Roberto Boghetti, Kornelius Reutter, Julien Nembrini and Jérôme Kämpf, in: Proceedings of ISES and IEA SHC International Conference on Solar Energy for Buildings and Industry, Kassel, Germany, 2022

Are GAN-based Morphs Threatening Face Recognition?, Eklavya Sarkar, Pavel Korshunov, Laurent Colbois and Sébastien Marcel, in: International Conference on Acoustics, Speech and Signal Processing, 2022

Automatic Minuting: A Pipeline Method for Generating Minutes, Kartik Shinde, Tirthankar Ghosal, Muskaan Singh and Ondrej Bojar, in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology, 2022

Automatic Summarization for Creative Writing: Denoising Auto-Encoder based Pipeline Method for Generating Summary of Movie Scripts, Aditya Upadhyay, Nidhir Bhavsar, Aakash Bhatnagar, Muskaan Singh and Petr Motlicek, in: Automatic Summarization for Creative Writing, International Conference on Computational Linguistics (COLING 2022), 2022

Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation, Florian Mai and James Henderson, in: Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Association for Computational Linguistics, Online, pages 468–488, 2022

[URL]

Bayesian Recurrent Units and the Forward Backward Algorithm, Alexandre Bittar and Philip N. Garner, in: Proc. Interspeech 2022, pages 4137-4141, 2022

[DOI]

Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, Deepanshu Khanna, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022

Borrowing from yourself: Faster future video segmentation with partial channel update, Evann Courdier and Francois Fleuret, in: International Conference on Pattern Recognition, 2022

Case-Based Abductive Natural Language Inference, Marco Valentino, Mokanarangan Thayaparan and Andre Freitas, in: Proceedings of the 29th International Conference on Computational Linguistics, 2022

[URL]

Comparing Biosignal and Acoustic feature Representation for Continuous Emotion Recognition, Sarthak Yadav, Tilak Purohit, Zohreh Mostaani, Bogdan Vlasenko and Mathew Magimai-Doss, in: International Multimodal Sentiment Analysis Workshop and Challenge, 2022

Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track, Tilak Purohit, Imen Ben Mahmoud, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of the ICML Expressive Vocalizations Workshop held in conjunction with the 39th International Conference on Machine Learning, Maryland, USA, 2022

Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech, Cecile Fougeron, Nicolas Audibert, Ina Kodrasi, Parvaneh Janbakhshi, Michaela Pernon, Nathalie Leveque, Stephanie Borel, Marina Laganaro, Hervé Bourlard and Frederic Assal, in: Annual Conference of the International Speech Communication Association, pages 2188-2192, 2022

[DOI]

Conversational Speech Recognition Needs Data? Experiments with Austrian German, Julian Linke, Philip N. Garner, Gernot Kubin and Barbara Schuppler, in: Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, pages 4684--4691, 2022

[URL]

Custom attribution loss for improving generalization and interpretability of deepfake detection, Pavel Korshunov, Anubhav Jain and Sébastien Marcel, in: International Conference on Acoustics, Speech, and Signal Processing, 2022

Decomposing Natural Logic Inferences for Neural NLI, Julia Rozanova, Deborah Mendes, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: BlackBoxNLP: Workshop on analyzing and interpreting neural networks for NLP, 2022

DeepCon: An End-to-End Multilingual Toolkit for Automatic Minuting of Multi-Party Dialogues, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: Special Interest Group on Discourse and Dialogue (SIGDIAL 2022), 2022

Demystifying the Scribes behind the Voynich Manuscript using Computational Linguistic Techniques, Kevin Farrugia, Colin Layfield and Lonneke van der Plas, in: Proceedings of the 1st International Conference on the Voynich Manuscript, 2022

DHgeN: Automated Generation of District Heating Network Layouts for Feasibility Studies, Giuseppe Peronato and Jérôme Kämpf, in: -, 2022

EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering, Violetta Shevchenko, Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel and Damien Teney, in: arXiv, 2022

Efficient Training of Low-Curvature Neural Networks, Suraj Srinivas, Kyle Matoba, Himabindu Lakkaraju and Francois Fleuret, in: NeurIPS 2022, 2022

[URL]

Efficient Wind Speed Nowcasting with GPU-Accelerated Nearest Neighbors Algorithm, Arnaud Pannatier, Ricardo Picatoste and Francois Fleuret, in: Proceedings of SIAM Data Mining, Virginia US and Virtual, 2022

Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization, Damien Teney, Ehsan Abbasnejad, Simon Lucey and Anton van den Hengel, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, Esaú Villatoro-Tello, Srikanth Madikeri, Petr Motlicek, Aravind Ganapathiraju and Alexei V. Ivanov, in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022

[DOI]

Experimental investigation on STFT phase Representations for deep learning-based dysarthric speech detection, Parvaneh Janbakhshi and Ina Kodrasi, in: International Conference on Acoustics, Speech, and Signal Processing, 2022

Face Anthropometry Aware Audio-visual Age Verification, Pavel Korshunov and Sébastien Marcel, in: ACM Multimedia, 2022

Face Reconstruction from Deep Facial Embeddings using a Convolutional Neural Network, Hatef Otroshi Shahreza, Vedrana Krivokuca and Sébastien Marcel, in: Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, IEEE, 2022

[DOI]
[URL]

Fairness Index Measures to Evaluate Bias in Biometric Recognition, Ketan Kotwal and Sébastien Marcel, in: International Conference on Pattern Recognition Workshops, 2022

From Undercomplete to Sparse Overcomplete Autoencoders to Improve LF-MMI Speech Recognition, Selen Hande Kabil and Hervé Bourlard, in: Proceedings of Interspeech Conference, 2022

GeoNeRF: Generalizing NeRF with Geometry Priors, Mohammad Mahdi Johari, Yann Lepoittevin and Francois Fleuret, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2022

[URL]

Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia, Oliver Ohneiser and Hartmut Helmke, in: 12th SESAR Innovation Days, 2022

Graph Refinement for Coreference Resolution, Lesly Miculicich and James Henderson, in: Findings of Association for >Computational Linguistics: ACL 2022, 2022

Health Talk: Understanding Practices of Popular Professional YouTubers, Thanh-Trung Phan, Chloé Michoud, Lucia Volpato, María del Río Carral and Daniel Gatica-Perez, in: Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia, 2022

Hierarchical Multi-task learning framework for Isometric-Speech Language Translation, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: ACL, 2022

HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing, Nidhir Bhavsar, Aakash Bhatnagar, Muskaan Singh and Petr Motlicek, in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022

How Did Europe’s Press Cover Covid-19 Vaccination News? A Five-Country Analysis, David Alonso del Barrio and Daniel Gatica-Perez, in: MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, 2022

[DOI]
[URL]

Hybrid Autoregressive Inference for Scalable Multi-hop Explanation Regeneration, Marco Valentino, Mokanarangan Thayaparan, Deborah Mendes and Andre Freitas, in: Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Hybrid Protection of Biometric Templates by Combining Homomorphic Encryption and Cancelable Biometrics, Hatef Otroshi Shahreza, Christian Rathgeb, Dailé Osorio-Roig, Vedrana Krivokuca, Sébastien Marcel and Christoph Busch, in: Proceedings of the 2022 International Joint Conference on Biometrics (IJCB), Abu Dhabi, United Arab Emirates (UAE), IEEE, 2022

[DOI]
[URL]

IDIAP Submission@LT-EDI-ACL2022 : Hope Speech Detection for Equality, Diversity and Inclusion, Muskaan Singh and Petr Motlicek, in: ACL, 2022

IDIAP Submission@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text, Muskaan Singh and Petr Motlicek, in: ACL, 2022

IDIAP Submission@LT-EDI-ACL2022: Homophobia/Transphobia Detection in social media comments, Muskaan Singh and Petr Motlicek, in: ACL Proceedings, 2022

IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, Sergio Burdisso, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Martin Fajcik, Muskaan Singh, Pavel Smrz and Petr Motlicek, in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022

[URL]

IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, Martin Fajcik, Muskaan Singh, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek and Pavel Smrz, in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022

[URL]

IDIAP_TIET@LT-EDI-ACL2022 : Hope Speech Detection in Social Media using Contextualized BERT with Attention Mechanism, Deepanshu Khanna, Muskaan Singh and Petr Motlicek, in: ACL, 2022

Imitation of Manipulation Skills Using Multiple Geometries, Boyang Ti, Yongsheng Gao, Jie Zhao and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022

Indexing Protected Deep Face Templates by Frequent Binary Patterns, Dailé Osorio-Roig, Christian Rathgeb, Hatef Otroshi Shahreza, Christoph Busch and Sébastien Marcel, in: Proceedings of the 2022 International Joint Conference on Biometrics (IJCB), Abu Dhabi, United Arab Emirates (UAE), IEEE, 2022

[DOI]
[URL]

Innovators@SMM4H'22: An Ensembles Approach for self-reporting of COVID-19 Vaccination Status Tweets, Mohammad Zohair, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: International Conference on Computational Linguistics (COLING 2022), 2022

Innovators@SMM4H'22: An Ensembles Approach for Stance and Premise Classification of COVID-19 Health Mandates Tweets, Vatsal Savaliya, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: International Conference on Computational Linguistics (COLING 2022), 2022

Learning to Guide Online Multi-Contact Receding Horizon Planning, Jiayi Wang, Teguh Santoso Lembono, Sanghyun Kim, Sylvain Calinon, Sethu Vijayakumar and Steve Tonneau, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022

Leveraging Events Sub-Categories for Violent-Events Detection in Social Media, Daniel Vallejo-Aldana, A. Pastor López-Monroy and Esaú Villatoro-Tello, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022

[URL]

Local estimation of parametric point spread functions in thermal images via convolutional neural networks, Florian Piras, Edouard De Moura Presa, Peter Wellig and Michael Liebling, in: SPIE sensors + imaging, Target and Background Signatures VIII, Berlin, Germany, pages 1227009 1--8, SPIE, 2022

[DOI]
[URL]

Low-Level Physiological Implications of End-to-End Learning for Speech Recognition, Louise Coppieters de Gibson and Philip N. Garner, in: Proc. Interspeech 2022, pages 749--753, 2022

[DOI]

Modeling Of Pre-trained Neural Network Embeddings Learned From Raw Waveform For Covid-19 Infection Detection, Zohreh Mostaani, RaviShankar Prasad, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of ICASSP, 2022

Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers, Luis Espinosa Anke, Alexander Shvets, Alireza Mohammadshahi, James Henderson and Leo Wanner, in: Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, Seattle, USA, pages 89–100, 2022

Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers, Seema Wazarkar, muskan garg, Muskaan Singh and Ondrej Bojar, in: International Conference on Language Resources and Evaluation (LREC 2022), 2022

On Breathing Pattern Information in Synthetic Speech, Zohreh Mostaani and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2022

On the detection of morphing attacks generated by GANs, Laurent Colbois and Sébastien Marcel, in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), 2022

On-demand compute reduction with stochastic wav2vec 2.0, Apoorv Vyas, Wei-Ning Hsu, Michael Auli and Alexei Baevski, in: Proceedings of Interspeech, 2022

Paumer: Patch Pausing Transformer for Semantic Segmentation, Evann Courdier, Prabhu Teja Sivaprasad and Francois Fleuret, in: 33th British Machine Vision Conference 2022, London, UK, 21 - 24 November 2022, 2022

PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models, Rabeeh Karimi Mahabadi, Luke Zettlemoyer, James Henderson, saeidi marzieh, lambert mathias, Veselin Stoyanov and Majid Yazdani, in: ACL, 2022

Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese, Kurt Micallef, Albert Gatt, Marc Tanti, Lonneke van der Plas and Claudia Borg, in: Proceedings of the workshop on Deep Learning for Low-Resource NLP, 2022

[URL]

Predicting is not understanding: Recognizing and addressing underspecification in machine learning, Damien Teney, Ehsan Abbasnejad and Maxime Peyrard, in: European Conference on Computer Vision, pages 458-476, Springer, 2022

Pulmonary Tuberculosis Screening from Radiological Signs on Chest X-Ray Images Using Deep Models, Geoffrey Raposo, Anete Trajman and André Anjos, in: Union World Conference on Lung Health, The Union, 2022

Reactive Anticipatory Robot Skills with Memory, Hakan Girgin, Julius Jankowski and Sylvain Calinon, in: The International Symposium on Robotics Research, 2022

Readback Error Detection by Automatic Speech Recognition and Understanding -- Results of HAAWAII Project for Isavia’s Enroute Airspace, Hartmut Helmke, Karel Ondřej, Shruthi Shetty, Hörður Arilíusson, Teodor S. Simiganoschi, Matthias Kleinert, Oliver Ohneiser, heiko Ehr, Juan Zuluaga-Gomez and Pavel Smrz, in: 11th SESAR Innovation Days, SESAR, pages 9, 2022

Reasoning over vision and language: Exploring the benefits of supplemental knowledge, Violetta Shevchenko, Damien Teney, Anthony Dick and Anton van den Hengel, in: arXiv, 2022

Residual Feature Pyramid Network for Enhancement of Vascular Patterns, Ketan Kotwal and Sébastien Marcel, in: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

SelecMix: Debiased Learning by Contradicting-pair Sampling, Inwoo Hwang, Sangjun Lee, Yunhyeok Kwak, Seong Joon Oh, Damien Teney, Jin-Hwa Kim and Byoung-Tak Zhang, in: Advances in Neural Information Processing Systems, 2022

SelecMix: Debiased Learning by Mixing up Contradicting Pairs, Inwoo Hwang, Sangjun Lee, Yunhyeok Kwak, Seong Joon Oh, Damien Teney, Jin-Hwa Kim and Byoung-Tak Zhang, in: ICML Workshop on Spurious Correlations, Invariance and Stability, 2022

Shallow Discourse Parsing for Open Information Extraction and Text Simplification, Christina Niklaus, Andre Freitas and Siegfried Handschuh, in: 3rd Workshop on Computational Approaches to Discourse (CODI) @ COLING, 2022

SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages, Alireza Mohammadshahi, Vassilina Nikoulina, Alexandre Berard, Caroline Brun, James Henderson and Laurent Besacier, in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Speaker recognition on mono-channel telephony recordings, Yosef Solewicz, Noa Cohen, Johan Rohdin, Srikanth Madikeri and Honza Cernocky, in: The Speaker and Language Recognition Workshop, 2022

Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia and Karel Vesely, in: 12th SESAR Innovation Days, 2022

Symmetry-induced Disentanglement on Graphs, Giangiacomo Mercatali, Vikas Garg and Andre Freitas, in: Advances in Neural Information Processing Systems 35, 2022

Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective, Edoardo Manino, Julia Rozanova, Danilo Carvalho, Andre Freitas and Lucas Cordeiro, in: Findings of the ACL, 2022

Team Innovators at SemEval-2022 for Task 8: Multi-Task Training with Hyperpartisan and Semantic Relation for Multi-Lingual News Article Similarity, Nidhir Bhavsar, Aakash Bhatnagar, Muskaan Singh, Petr Motlicek and Tirthankar Ghosal, in: Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), NAACL 2022, 2022

TextGraphs 2022 Shared Task on Natural Language Premise Selection, Marco Valentino, Deborah Mendes, Mokanarangan Thayaparan, Andre Freitas and Dmitry Ustalov, in: Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing, 2022

[URL]

The Winning Approach for the Recommendation Systems Shared Task @REST_MEX 2022, Cipriano Callejas-Hernández, Erika Rivadeneira-Pérez, Fernando Sánchez-Vega, A. Pastor López-Monroy and Esaú Villatoro-Tello, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022

[URL]

To be or not to be an Integer? Encoding Variables for Mathematical Text, Deborah Mendes, Mokanarangan Thayaparan, Marco Valentino, Julia Rozanova and Andre Freitas, in: Findings of the ACL, 2022

Towards Accessible Sign Language Learning and Assessment, Neha Tarigopula, Sandrine Tornay, Skanda Muralidhar and Mathew Magimai-Doss, in: ACM International Conference on Multimodal Interaction, Bangalore, INDIA, pages 626-631, 2022

[DOI]

Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, S. Pavankumar Dubagunta, Edoardo Moneta, Eleni Theocharopoulos and Mathew Magimai-Doss, in: ACM International Conference on Multimodal Interaction (ICMI Companion), 2022

[DOI]

Towards energy hubs: an innovative Geographic Information System based approach for cluster definition, Giacomo Cillari, Fabio Fantozzi, Alessandro Franco and Jérôme Kämpf, in: ICREC 2022 Conference Proceedings, 2022

UM-DFKI Maltese Speech Translation, Aiden Williams, Kurt Abela, Rishu Kumar, Martin Bär, Hannah Billinghurst, Kurt Micallef, Ahnaf Mozib Samin, Andrea DeMarco, Lonneke van der Plas and Claudia Borg, in: Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), 2022

UNSL at eRisk 2022: Decision policies with history for early classification, Juan Martín Loyola, Horacio Thompson, Sergio Burdisso and Marcelo Errecalde, in: CEUR Workshop Proceedings, 2022

[URL]

Unsupervised Token-level Hallucination Detection from Summary Generation By-products, Andreas Marfurt and James Henderson, in: Workshop on Generation, Evaluation and Metrics (GEM), 2022

Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering, Eklavya Sarkar, RaviShankar Prasad and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2022

Vision-Language Pretraining: Current Trends and the Future, Aishwarya Agrawal, Damien Teney and Aida Nematzadeh, in: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2022

[URL]

Visually Grounded Interpretation of Noun-Noun Compounds in English, Inga Lang, Lonneke van der Plas, Malvina Nissim and Albert Gatt, in: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, Association for Computational Linguistics, 2022

Voyager: Data Discovery for Onboarding in Data Science, Alex Bogatu, Norman Paton, Mark Douthwaite and Andre Freitas, in: 37th IEEE International Conference on Data Engineering (ICDE), 2022

What Do Compressed Multilingual Machine Translation Models Forget?, Alireza Mohammadshahi, Vassilina Nikoulina, Alexandre Berard, Caroline Brun, James Henderson and Laurent Besacier, in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Why Scholars Are Diagramming Neural Network Models, Guy Marshall, Caroline Jay and Andre Freitas, in: 13th International Conference on the Theory and Application of Diagrams, 2022

Your Day in Your Pocket: Complex Activity Recognition from Smartphone Accelerometers, Emma Bouton--Bessac, Daniel Gatica-Perez and Lakmal Buddika Meegahapola, in: EAI Pervasive Health, 2022

A Bayesian Interpretation of the Light Gated Recurrent Unit, Alexandre Bittar and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

[DOI]

A Comparative Study Of Simulation Tools To Model The Solar Irradiation On Building Façades, Martin Thebault, Benjamin Govehovitch, Karine Bouty, Cyril Caliot, Raphaël Compagnon, Gilles Desthieux, Matteo Formolli, Stéphanie Giroux-Julien, Victor Guillot, Ellis Herman, Jérôme Kämpf, Jouri Kanters, Gabriele Lobaccaro, Christophe Ménézo, Giuseppe Peronato and Arnkell Jonas Petersen, in: Proceedings of SWC 2021: ISES Solar World Congress, ISES, 2021

[DOI]
[URL]

A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET, Rudolf Braun, Srikanth Madikeri and Petr Motlicek, in: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Toronto, Ontario, Canada, 2021

A Laser-based Dual-arm System for Precise Control of Collaborative Robots, J. Silverio, G. Clivaz and Sylvain Calinon, in: IEEE International Conference on Robotics and Automation, 2021

A machine-learning model for the prediction of aggregated building heating demand from pan-European land-use maps, Giuseppe Peronato, Roberto Boghetti and Jérôme Kämpf, in: Journal of Physics: Conference Series, 2021

[DOI]

An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, Marco Ewerton, Sylvain Calinon and Jean-Marc Odobez, in: Proc. of Workshop on Emerging paradigms for robotic manipulation: from the lab to the productive world, ICRA, 2021

An Efficient Image-to-Image Translation HourGlass-based Architecture for Object Pushing Policy Learning, Marco Ewerton, Angel Martínez-González and Jean-Marc Odobez, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021

An Objective Evaluation Framework for Pathological Speech Synthesis, Bence Halpern, Julian Fritsch, Enno Hermann, Rob Van Son, Odette Scharenborg and Mathew Magimai-Doss, in: Proceedings of ITG Conference on Speech Communication, 2021

Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, Esaú Villatoro-Tello, Gabriela Ramírez-de-la-Rosa, Daniel Gatica-Perez, Mathew Magimai-Doss and Héctor Jiménez-Salazar, in: Proceedings of the 2021 International Conference on Multimodal Interaction, ACM, 2021

[DOI]

Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning, Matthias Kleinert, Hartmut Helmke, Shruthi Shetty, Oliver Ohneiser, heiko Ehr, Amrutha Prasad, Petr Motlicek and Julia Harfmann, in: 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, pages 1-9, IEEE, 2021

[DOI]

Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech, Ina Kodrasi, Michaela Pernon, Marina Laganaro and Hervé Bourlard, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Automatic Dialect Detection for Low Resource Santali Language, Sunil Kumar Sahoo, Brojo Kishore Mishra, Shantipriya Parida, Satya Ranjan Dash, Jatindra Nath Besra and Esaú Villatoro-Tello, in: Proceeding of International Conference on Information Technology (OCIT), 2021

AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: 45th International Conference on Acoustics, Speech, and Signal Processing, Toronto, Canada, pages 7328–7332, 2021

Automatic processing pipeline for collecting and annotating air-traffic voice communication data, Martin Kocour, Karel Vesely, Igor Szoke, Santosh Kesiraju, Juan Zuluaga-Gomez, Blatt Alexander, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek and et al., in: Proceedings of 9th OpenSky Symposium 2020, OpenSky Network, Brussels, Belgium, pages 1-9, MDPI, 2021

Beyond question-based biases: Assessing multimodal shortcut learning in visual question answering, Corentin Dancette, Remi Cadene, Damien Teney and Matthieu Cord, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021

Boosting of contextual information in ASR for air-traffic call-sign recognition, Martin Kocour, Karel Vesely, Blatt Alexander, Juan Zuluaga-Gomez, Igor Szoke, Jan Cernocky, Dietrich Klakow and Petr Motlicek, in: Interspeech 2021, 2021

Challenges for Using Impact Regularizers to Avoid Negative Side Effects, David Lindner, Kyle Matoba and Alexander Meulemans, in: SafeAI 2021 - AAAI's Workshop on Artificial Intelligence Safety, 2021

Compacter: Efficient Low-Rank Hypercomplex Adapter Layers, Rabeeh Karimi Mahabadi, James Henderson and Sebastian Ruder, in: NeurIPS, 2021

Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of Interspeech, 2021

[URL]

Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Karel Vesely, Martin Kocour and Igor Szoke, in: Interspeech 2021, 2021

[URL]

Cost–effective Variational Active Entity Resolution, Alex Bogatu, Norman Paton, Mark Douthwaite, Stuart Davie and Andre Freitas, in: 37th IEEE International Conference on Data Engineering (ICDE), 2021

[URL]

Cross Modal Focal Loss for RGBD Face Anti-Spoofing, Anjith George and Sébastien Marcel, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

Deep Auto-Encoding and Biohashing for Secure Finger Vein Recognition, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the 2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Toronto, Canada, 2021

[DOI]
[URL]

DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation, Mohammad Mahdi Johari, Camilla Carta and Francois Fleuret, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6039-6048, 2021

[URL]

Development of a lung segmentation algorithm for analog imaged chest X-Ray: preliminary results, Matheus A. Renzo, Natália Fernandez, André A. Baceti, Natanael Nunes de Moura Junior and André Anjos, in: XV Brazilian Congress on Computational Intelligence, Joinville, Brazil, 2021

[URL]

Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders, Giangiacomo Mercatali and Andre Freitas, in: The 2021 Conference on Empirical Methods in Natural Language Processing, 2021

District heating network modelling for future integration of solar thermal energy, Clément Dromart, Loïc Puthod, Jérôme Kämpf and Diane von Gunten, in: Journal of Physics: Conference Series, pages 012089, IOP Publishing, 2021

[DOI]

Do Natural Language Explanations Represent Valid Logical Arguments? Verifying Entailment in Explainable NLI Gold Standards, Marco Valentino, Ian Pratt-Hartmann and Andre Freitas, in: 14th International Conference on Computational Semantics, 2021

[URL]

Does My Representation Capture X? Probe-Ably, Deborah Mendes, Julia Rozanova, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: 59th Annual Meeting of the Association for Computational Linguistics (Demonstration track), 2021

[URL]

Encoding Explanatory Knowledge for Zero-shot Science Question Answering, Zili Zhou, Marco Valentino, Donal Landers and Andre Freitas, in: 14th International Conference on Computational Semantics, 2021

[URL]

Estimating Nonplanar Flow from 2D Motion-blurred Widefield Microscopy Images via Deep Learning, Adrian Shajkofci and Michael Liebling, in: International Symposium on Biomedical Imaging, 2021, 2021

Explainable Inference Over Grounding-Abstract Chains for Science Questions, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: 59th Annual Meeting of the Association for Computational Linguistics (ACL Findings), 2021

Explainable Natural Language Reasoning via Conceptual Unification, Marco Valentino, Mokanarangan Thayaparan and Andre Freitas, in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

[URL]

Face Liveness Detection Competition (LivDet-Face) - 2021, Sandip Purnapatra, Nic Smalt, Keivan Bahmani, Priyanka Das, David Yambay, Amir Mohammadi, Anjith George, Thirimachos Bourlai, Sébastien Marcel and Stephanie Schuckers, in: International Joint Conference on Biometrics, 2021

Fusion of Acoustic and Linguistic Information Using Supervised Autoencoder for Improved Emotion Recognition, Bogdan Vlasenko, RaviShankar Prasad and Mathew Magimai-Doss, in: 2nd Multimodal Sentiment Analysis Challenge (MuSe '21), October 24, 2021, Virtual Event, China, 2021

[DOI]

Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, Mael Fabien, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10--13, 2021

[DOI]

Handling acoustic variation in dysarthric speech recognition systems through model combination, Enno Hermann and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2021

Identification of F1 and F2 in speech using modified zero frequency filtering, RaviShankar Prasad and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2021

Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models, Zheyuan Liu, Cristian Rodriguez-Opazo, Damien Teney and Stephen Gould, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021

Implementation of machine learning techniques for the quasi real-time blind and electric lighting optimization in a controlled experimental facility, Chantal Basurto, Roberto Boghetti, Moreno Colombo, Michael Papinutto, Julien Nembrini and Jérôme Kämpf, in: Journal of Physics: Conference Series, IOP Publishing, 2021

[DOI]
[URL]

Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning, Christos Theodoropoulos, James Henderson, Andrei Catalin Coman and Marie-Francine Moens, in: Proceedings of the 25th Conference on Computational Natural Language Learning, Online, pages 337-348, Association for Computational Linguistics, 2021

Improving Emotional TTS with an Emotion Intensity Input from Unsupervised Extraction, Bastian Schnell and Philip N. Garner, in: 11th ISCA Speech Synthesis Workshop, 2021

[URL]

Improving Generalization of Deepfake Detection by Training for Attribution, Anubhav Jain, Pavel Korshunov and Sébastien Marcel, in: International Workshop on Multimedia Signal Processing, 2021

Intrinsically-Motivated Robot Learning of Bayesian Probabilistic Movement Primitives, Thibaut Kulak and Sylvain Calinon, in: ICRA workshop: "Towards Curious Robots: Modern Approaches for Intrinsically-Motivated Intelligent Behavior", 2021

Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, Esaú Villatoro-Tello, S. Pavankumar Dubagunta, Julian Fritsch, Gabriela Ramírez-de-la-Rosa, Petr Motlicek and Mathew Magimai-Doss, in: Proceedings of Interspeech 2021, ISCA-International Speech Communication Association 2021, 2021

LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021

[URL]

Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: IEEE Automatic Speech Recognition and Understanding Workshop, Colombia, Cartagena, IEEE, 2021

Locally Private Graph Neural Networks, Sina Sajadmanesh and Daniel Gatica-Perez, in: ACM Conference on Computer and Communications Security (CCS), 2021

Machine learning techniques for the daylight and electric lighting performance predictions, Chantal Basurto, Oliver Paul and Jérôme Kämpf, in: Proceedings of Building Simulation 2021, 2021

Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates, Hartmut Helmke, Shruthi Shetty, Matthias Kleinert, Oliver Ohneiser, heiko Ehr, Amrutha Prasad, Petr Motlicek, Cerna Aneta and Christian Windisch, in: 11th SESAR Innovation Days, 2021

Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: Proceedings of Interspeech, 2021

[DOI]

Multi-Adversarial Learning for Cross-Lingual Word Embeddings, Haozhou Wang, James Henderson and Paola Merlo, in: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Online, pages 463-472, 2021

Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: Proceedings of Interspeech 2021, 2021

Multi-task Single Channel Speech Enhancement Using Speech Presence Probability As A Secondary Task Training Target, Lei Wang, Jie Zhu and Ina Kodrasi, in: European Signal Processing Conference, EUSIPCO 2021, 2021

Multimodal Neural Machine Translation System for English to Bengali, Shantipriya Parida, Subhadarshi Panda, Satya Prakash Biswal, Ketan Kotwal, Arghyadeep Sen, Satya Ranjan Dash and Petr Motlicek, in: Proceedings of the First Workshop on Multimodal Machine Translation for Low Resource Languages (MMTLRL 2021), Online (Virtual Mode), pages 31--39, INCOMA Ltd., 2021

[URL]

Multitask adaptation with Lattice-Free MMI for multi-genre speech recognition of low resource languages, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of Interspeech 2021, 2021

Natural Language Inference over Tables: Enabling Explainable Data Exploration on Data Lakes, Mario Ramirez, Alex Bogatu, Norman Paton and Andre Freitas, in: 18th Extended Semantic Web Conference (ESWC), 2021

[URL]

NLPHut's Participation at WAT2021, Shantipriya Parida, Subhadarshi Panda, Ketan Kotwal, Amulya Ratna Dash, Satya Ranjan Dash, Yashvardhan Sharma, Petr Motlicek and Ondrej Bojar, in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 146--154, Association for Computational Linguistics, 2021

[URL]

On Modeling Glottal Source Information for Phonation Assessment in Parkinson’s Disease, Juan Camilo Vasquez-Correa, Julian Fritsch, Juan Rafael Orozco-Arroyave, Elmar Nöth and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2021

On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, Anjith George and Sébastien Marcel, in: International Joint Conference on Biometrics (IJCB 2021), 2021

On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning, Marc Tanti, Lonneke van der Plas, Claudia Borg and Albert Gatt, in: Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

On the Recognition Performance of BioHashing on state-of-the-art Face Recognition models, Hatef Otroshi Shahreza, Vedrana Krivokuca and Sébastien Marcel, in: Proceedings of the 13th IEEE International Workshop on Information Forensics and Security (WIFS), Montpellier, France, IEEE, 2021

[DOI]
[URL]

On The Relationship Between Speech-based Breathing Signal Prediction Evaluation Measures And Breathing Parameters Estimation, Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Proc. of ICASSP, 2021

On the use of automatically generated synthetic image datasets for benchmarking face recognition, Laurent Colbois, Tiago de Freitas Pereira and Sébastien Marcel, in: International Joint Conference on Biometrics (IJCB 2021), 2021

Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), Shantipriya Parida, Subhadarshi Panda, Amulya Ratna Dash, Esaú Villatoro-Tello, A. Seza Dogruöz, Rosa M. Ortega-Mendoza, Amadeo Hernández, Yashvardhan Sharma and Petr Motlicek, in: Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, pages 218–223, Association for Computational Linguistics, 2021

[DOI]
[URL]

Open-Set Speaker Identification pipeline in live criminal investigations, Mael Fabien and Petr Motlicek, in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021

Optics Versus Computation: Influence of Illumination and Reconstruction Model Accuracy in Focal-Plane-Scanning Optical Projection Tomography, François Marelli and Michael Liebling, in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), Nice, France, pages 567-570, IEEE, 2021

[DOI]

Optimal Control Combining Emulation and Imitation to Acquire Physical Assistance Skills, Amirreza Razmjoo, Teguh Santoso Lembono and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021

Optimization of robot configurations for motion planning in industrial riveting, Hakan Girgin, Teguh Santoso Lembono, Radu Cirligeanu and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021

Overview of the 8th Workshop on Asian Translation, Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda and Sadao Kurohashi, in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 1--45, Association for Computational Linguistics, 2021

[URL]

Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks, Rabeeh Karimi Mahabadi, Sebastian Ruder, Dehghani Mostafa and James Henderson, in: ACL, 2021

Perspectives and limitations of visible-thermal image pair synthesis via generative adversarial networks, Danick Panchard, François Marelli, Edouard De Moura Presa, Peter Wellig and Michael Liebling, in: Security + Defence, Target and Background Signatures VII, Proc. of SPIE, online only, pages 1186509-1--1186509-8, SPIE, 2021

[DOI]
[URL]

Phoneme based Respiratory Analysis of Read Speech, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Proceedings of European Signal Processing Conference (EUSIPCO), 2021

Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers, Angel Martínez-González, Michael Villamizar and Jean-Marc Odobez, in: International Conference in Computer Vision - Workshops, 2021

Probabilistic Iterative LQR for Short Time Horizon MPC, Teguh Santoso Lembono and Sylvain Calinon, in: International Conference on Intelligent Robots and Systems, pages 579-585, 2021

[DOI]

PROMPT: Probabilistic Motion Primitives based Trajectory Planning, Tobias Löw, Tirthankar Bandyopadhyay, Jason Williams and Paulo Borges, in: Proceedings of Robotics: Science and Systems, 2021

[DOI]
[URL]

Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety, Hartmut Helmke, Matthias Kleinert, Shruthi Shetty, Oliver Ohneiser, heiko Ehr, Hörður Arilíusson, Teodor S. Simiganoschi, Amrutha Prasad, Petr Motlicek, Karel Vesely, Karel Ondřej, Pavel Smrz, Julia Harfmann and Christian Windisch, in: Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021), The United States Federal Aviation Administration (FAA), EUROCONTROL, pages 10, 2021

[URL]

Rethinking the Role of Gradient-based Attribution Methods for Model Interpretability, Suraj Srinivas and Francois Fleuret, in: International Conference on Learning Representations, 2021

Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances, Oliver Ohneiser, Seyyed Saeed Sarfjoo, Hartmut Helmke, Shruthi Shetty, Petr Motlicek, Matthias Kleinert, heiko Ehr and Šarūnas Murauskas, in: Interspeech, 2021

ROXANNE Research Platform: Automate criminal investigations, Mael Fabien, Shantipriya Parida, Dawei Zhu, Petr Motlicek, Aravind Krishnan and Hoang H. Nguyen, in: Interspeech Show and Tell 2021, 2021

ROXSD: a Simulated Dataset of Communication in Organized Crime, Hoang H. Nguyen, Mael Fabien, Petr Motlicek, Shantipriya Parida and Kvetoslav Maly, in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021

Scholarly AI system diagrams as an access point to mental models, Guy Marshall, Caroline Jay and Andre Freitas, in: Diagrams, 2021

Sentence-level Planning for Especially Abstractive Summarization, Andreas Marfurt and James Henderson, in: Proceedings of the Third Workshop on New Frontiers in Summarization, pages 1--14, Association for Computational Linguistics, 2021

[URL]

Speech Activity Detection Based on Multilingual Speech Recognition System, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, in: Interspeech, 2021

STAR: Cross-modal Statement Representation for Selecting Relevant Mathematical Premises, Deborah Mendes and Andre Freitas, in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

Structuralist analysis for neural network system diagrams, Guy Marshall, Caroline Jay and Andre Freitas, in: Diagrams, 2021

Subjective and objective evaluation of deepfake videos, Pavel Korshunov and Sébastien Marcel, in: The international Conference on Acoustics, Speech, and Signal Processing, 2021

Supervised Speech Representation Learning for Parkinson's Disease Classification, Parvaneh Janbakhshi and Ina Kodrasi, in: ITG Conference on Speech Communication, 2021

Supporting Context Monotonicity Abstractions in Neural NLI Models, Julia Rozanova, Deborah Mendes, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: Natural Logic Meets Machine Learning Workshop, 2021

[URL]

Switching Contexts: Transportability Measures for NLP, Guy Marshall, Mokanarangan Thayaparan, Philip Osborne and Andre Freitas, in: 14th International Conference on Computational Semantics, 2021

[URL]

Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling, Alireza Mohammadshahi and James Henderson, in: Arxiv, 2021

Test time Adaptation through Perturbation Robustness, Prabhu Teja Sivaprasad and Francois Fleuret, in: Workshop on Distribution Shifts, 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021

The DCU-EPFL Enhanced Dependency Parser at the IWPT 2021 Shared Task, James Barry, Alireza Mohammadshahi, Joachim Wagner, Jennifer Foster and James Henderson, in: Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies, Online, pages 204-212, Association for Computational Linguistics, 2021

The Theory, Practice, and Ethical Challenges of Designing a Diversity-Aware Platform for Social Relations, Laura Schelenz, Ivano Bison, Matteo Busso, Amalia de Götzen, Daniel Gatica-Perez, Fausto Giunchiglia, Lakmal Buddika Meegahapola and Salvador Ruiz-Correa, in: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, pages 11, ACM, 2021

[DOI]

Trajectory Prediction with Compressed 3D Environment Representation using Tensor Train Decomposition, Lara Brudermuller, Teguh Santoso Lembono, Suhan Shetty and Sylvain Calinon, in: International Conference on Advanced Robotics, 2021

Trust indicators and explainable AI: A study on user perceptions, Delphine Ribes Lemay, Nicolas Henchoz, Hélène Portier, Lara Defayes, Thanh-Trung Phan, Daniel Gatica-Perez and Andreas Sondereger, in: Proc. Int. Conf. on Human-Computer Interaction, Bari, Italy, 2021

Uncertainty Reduction for Model Adaptation in Semantic Segmentation, Prabhu Teja Sivaprasad and Francois Fleuret, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

Unequivocal cardiac phase sorting from alternating ramp- and pulse-illuminated microscopy image sequences, Olivia Mariani, François Marelli, Christian Jaques, Alexander Ernst and Michael Liebling, in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), pages 868-872, 2021

[DOI]
[URL]

Unification-based Reconstruction of Multi-hop Explanations for Science Questions, Marco Valentino, Mokanarangan Thayaparan and Andre Freitas, in: 16th conference of the European Chapter of the Association for Computational Linguistics, 2021

[URL]

Unshuffling data for improved generalization in visual question answering, Damien Teney, Ehsan Abbasnejad and Anton van den Hengel, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021

Variational Information Bottleneck for Effective Low-Resource Fine-Tuning, Rabeeh Karimi Mahabadi, yonatan belinkov and James Henderson, in: ICLR, 2021

Vein Enhancement with Deep Auto-Encoders to improve Finger Vein Recognition, Victor Bros, Ketan Kotwal and Sébastien Marcel, in: Biometrics Special Interest Group (BIOSIG 2021), 2021

Visual Focus of Attention Estimation in 3D Scene with an Arbitrary Number of Targets, Remy Siegfried and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 9, IEEE, 2021

What is SemEval evaluating? A Systematic Analysis of Evaluation Campaigns in NLP, Oskar Wysocki, Malina Florea, Donal Landers and Andre Freitas, in: EMNLP, 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

Whole Body Model Predictive Control with a Memory of Motion:Experiments on a Torque-Controlled Talos, Ewen Dantec, Rohan Budhiraja, Adria Roig, Teguh Santoso Lembono, Guilhem Saurel, Olivier Stasse, Pierre Fernbach, Steve Tonneau, Sethu Vijayakumar, Sylvain Calinon, Michel Taix and Nicolas Mansard, in: IEEE International Conference on Robotics and Automation, 2021

Zurich Like New: Analyzing Open Urban Multimodal Data, Marcel Granero-Moya, Thanh-Trung Phan and Daniel Gatica-Perez, in: Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data, 2021

A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition, Nicholas Cummins, Yilin Pan, Zhao Ren, Julian Fritsch, Venkata Srikanth Nallanthighal, Heidi Christensen, Daniel Blackburn, Björn Schuller, Mathew Magimai-Doss, Helmer Strik and Aki Härmä, in: Proceedings of Interspeech, pages 2182-2186, 2020

A memory of motion for visual predictive control tasks, Antonio Paolillo, Teguh Santoso Lembono and Sylvain Calinon, in: International Conference on Robotics and Automation, 2020

A Phonology-based Approach for Isolated Sign Production Assessment in Sign Language, Sandrine Tornay, Necati Cihan Camgoz, Richard Bowden and Mathew Magimai-Doss, in: Companion Publication of the 2020 International Conference on Multimodal Interaction (ICMI '20 Companion), 2020

Active Improvement of Control Policies with Bayesian Gaussian Mixture Model, Hakan Girgin, E. Pignat, N. Jaquier and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020

Alone or With Others? Understanding Eating Episodes of College Students with Mobile Sensing, Lakmal Buddika Meegahapola, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: 19th International Conference on Mobile and Ubiquitous Multimedia, ACM, Essen, Germany, pages 162–166, Association for Computing Machinery, 2020

[DOI]
[URL]

An HMM Approach with Inherent Model Selection for Sign Language and Gesture Recognition, Sandrine Tornay, Oya Aran and Mathew Magimai-Doss, in: Proceedings of the International Conference on Language Resources and Evaluation LREC 2020, 2020

An Integrated and strategic evaluation of automatic blind controls to achieve energy and occupant's comfort objectives, Chantal Basurto and Jérôme Kämpf, in: Proceedings of the 5th IBPSA-England Conference on Building Simulation and Optimization (Virtual), Loughborough, UK, 2020

[URL]

Analysis and Transfer of Human Movement Manipulability in Industry-like Activities, N. Jaquier, L. Rozo and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020

Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, Juan Zuluaga-Gomez, Karel Vesely, Blatt Alexander, Petr Motlicek, Dietrich Klakow, Allan Tart, Igor Szoke, Amrutha Prasad, Seyyed Saeed Sarfjoo, Pavel Kolcarek, Martin Kocour, Honza Cernocky, Claudia Cevenini, Khalid Choukri, Mickael Rigault and Fabian Landis, in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020

[DOI]
[URL]

Automatic Discrimination of Apraxia of Speech and Dysarthria using a Minimalistic Set of Handcrafted Features, Ina Kodrasi, Michaela Pernon, Marina Laganaro and Hervé Bourlard, in: Interspeech, 2020

Automatic Speech Recognition Benchmark for Air-Traffic Communications, Juan Zuluaga-Gomez, Petr Motlicek, Qingran Zhan, Rudolf Braun and Karel Vesely, in: Proc. Interspeech 2020, pages 2297-2301, 2020

[DOI]

BertAA: BERT fine-tuning for Authorship Attribution, Mael Fabien, Esaú Villatoro-Tello, Petr Motlicek and Shantipriya Parida, in: Proceedings of the 17th International Conference on Natural Language Processing, 2020

CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR, Ketan Kotwal and Sébastien Marcel, in: IEEE International Conference on Image Processing, 2020

DeepFocus: a Few-shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function, Adrian Shajkofci and Michael Liebling, in: International Symposium on Biomedical Imaging, 2020

Detection of S1 and S2 locations in phonocardiogram signals using zero frequency filter, RaviShankar Prasad, Gürkan Yilmaz, Olivier Chetelat and Mathew Magimai-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Detection of Similar Languages and Dialects Using Deep Supervised Autoencoders, Shantipriya Parida, Esaú Villatoro-Tello, Sajit Kumar, Mael Fabien and Petr Motlicek, in: Proceedings of the 17th International Conference on Natural Language Processing, 2020

DOMAIN ADAPTATION FOR GENERALIZATION OF FACE PRESENTATION ATTACK DETECTION IN MOBILE SETTINGS WITH MINIMAL INFORMATION, Amir Mohammadi, Sushil Bhattacharjee and Sébastien Marcel, in: 45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, IEEE, 2020

[URL]

Dysarthric Speech Recognition with Lattice-Free MMI, Enno Hermann and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020

[DOI]
[URL]

End-to-End Bias Mitigation by Modelling Biases in Corpora, Rabeeh Karimi Mahabadi, yonatan belinkov and James Henderson, in: ACL, 2020

Estimating The Degree of Sleepiness by Integrating Articulatory Feature Knowledge In Raw Waveform Based CNNs, Julian Fritsch, S. Pavankumar Dubagunta and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020

Exact Preimages of Neural Network Aircraft Collision Avoidance Systems, Kyle Matoba and Francois Fleuret, in: Machine Learning for Engineering Modeling, Simulation, and Design Workshop at Neural Information Processing Systems 2020, 2020

Fast Transformers with Clustered Attention, Apoorv Vyas, Angelos Katharopoulos and Francois Fleuret, in: Proceedings of the International Conference on Neural Information Processing Systems, 2020

Fourier movement primitives: an approach for learning rhythmic robot skills from demonstrations, Thibaut Kulak, J. Silverio and Sylvain Calinon, in: Robotics: Science and Systems, 2020

Generating Master Faces for Use in PerformingWolf Attacks on Face Recognition Systems, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen and Sébastien Marcel, in: International Join Conference on Biometrics, 2020

Generative adversarial training of product of policies for robust and adaptive movement primitives, Emmanuel Pignat, Hakan Girgin and Sylvain Calinon, in: In Proc. Conference on Robot Learning (CoRL), 2020

Graph-to-Graph Transformer for Transition-based Dependency Parsing, Alireza Mohammadshahi and James Henderson, in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2020

[URL]

Graph-to-Graph Transformer for Transition-based Dependency Parsing, Alireza Mohammadshahi and James Henderson, in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, ACL, Online, pages 3278–3289, Association for Computational Linguistics, 2020

[URL]

Idiap & UAM participation at GermEval 2020: Classification and Regression of Cognitive and Motivational Style from Text, Esaú Villatoro-Tello, Shantipriya Parida, Sajit Kumar, Petr Motlicek and Qingran Zhan, in: Proceedings of the GermEval 2020 Shared Task on the Classification and Regression of Cognitive and Motivational style from Text, 2020

[URL]

Idiap and UAM Participation at MEX-A3T Evaluation Campaign, Esaú Villatoro-Tello, Gabriela Ramírez-de-la-Rosa, Shantipriya Parida, Sajit Kumar and Petr Motlicek, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020) co-located with 36th Conference of the Spanish Society for Natural Language Processing (SEPLN 2020), pages 6, CEUR Workshop Proceedings, 2020

[URL]

Idiap Submission to Swiss-German Language Detection Shared Task, Shantipriya Parida, Esaú Villatoro-Tello, Sajit Kumar, Petr Motlicek and Qingran Zhan, in: Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), CEUR Workshop Proceedings, 2020

[URL]

IMPROVING CROSS-DATASET PERFORMANCE OF FACE PRESENTATION ATTACK DETECTION SYSTEMS USING FACE RECOGNITION DATASETS, Amir Mohammadi, Sushil Bhattacharjee and Sébastien Marcel, in: 45th International Conference on Acoustics, Speech, and Signal Processing, IEEE, 2020

[URL]

INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION, Banriskhem Khonglah, Srikanth Madikeri, Subhadeep Dey, Hervé Bourlard, Petr Motlicek and Jayadev Billa, in: Proceedings of ICASSP 2020, 2020

Iris Liveness Detection Competition (LivDet-Iris) – The 2020 Edition, Priyanka Das, Joseph McGrath, Zhaoyuan Fang, Aidan Boyd, Ganghee Jang, Amir Mohammadi, Sandip Purnapatra, David Yambay, Sébastien Marcel, Mateusz Trokielewicz, Piotr Maciejewicz, Kevin Bowyer, Adam Czajka and Stephanie Schuckers, in: INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2020), 2020

[URL]

Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System, Srikanth Madikeri, Banriskhem Khonglah, Sibo Tong, Petr Motlicek, Hervé Bourlard and Daniel Povey, in: In Proceedings of Interspeech 2020, pages 4746--4750, ISCA, 2020

Learning How to Walk: Warm-starting Optimal Control Solver with Memory of Motion, Teguh Santoso Lembono, Carlos Mastalli, Pierre Fernbach, Nicolas Mansard and Sylvain Calinon, in: International Conference on Robotics and Automation, 2020

Learning Urban Nightlife Routines from Mobile Data, Ada Pozo, Thanh-Trung Phan and Daniel Gatica-Perez, in: Proc. Int. Conf. on Mobile and Ubiquitous Multimedia, Essen, Germany, 2020

Learning, Generating and Adapting Wave Gestures for Expressive Human-Robot Interaction, M. Panteris, S. Manschitz and Sylvain Calinon, in: Proc. ACM/IEEE Intl Conf. on Human-Robot Interaction (HRI), pages 386-388, 2020

[DOI]
[URL]

ManiGaze: a Dataset for Evaluating Remote Gaze Estimator in Object Manipulation Situations, Remy Siegfried, Bozorgmehr Aminian and Jean-Marc Odobez, in: Symposium on Eye Tracking Research and Applications, Stuttgart, Germany, pages 5, ACM, 2020

[DOI]

ODIANLP's Participation in WAT2020, Shantipriya Parida, Petr Motlicek, Amulya Ratna Dash, Satya Ranjan Dash, Debasish Kumar Mallick, Satya Prakash Biswal, Priyanka Pattnaik, Biranchi Narayan Nayak and Ondrej Bojar, in: Proceedings of the 7th Workshop on Asian Translation, ACL Anthology, 2020

Optimizer Benchmarking Needs to Account for Hyperparameter Tuning, Prabhu Teja Sivaprasad, Florian Mai, Thijs Vogels, Martin Jaggi and Francois Fleuret, in: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, 2020

[URL]

Overview of the 7th Workshop on Asian Translation, Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar and Sadao Kurohashi, in: Proceedings of the 7th Workshop on Asian Translation, Association for Computational Linguistics, 2020

[URL]

Partially-supervised Mention Detection, Lesly Miculicich and James Henderson, in: Proceedings of the Third Workshop on Computational Models of Reference, Anaphora and Coreference, 2020

Plucking Motions for Tea Harvesting Robots Using Probabilistic Movement Primitives, Kurena Motokura, Masaki Takahashi, Marco Ewerton and Jan Peters, in: IEEE International Conference on Robotics and Automation, 2020

Plug and Play Autoencoders for Conditional Text Generation, Florian Mai, Nikolaos Pappas, Ivan Montero, Noah A. Smith and James Henderson, in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Online, 2020

Protecting Mobile Food Diaries from Getting too Personal, Lakmal Buddika Meegahapola, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: 19th International Conference on Mobile and Ubiquitous Multimedia, Essen, Germany, pages 212–222, Association for Computing Machinery, 2020

[DOI]
[URL]

pyannote.audio: neural building blocks for speaker diarization, Herve Bredin, Ruiqing Yin, Juan Manuel Coria, Pavel Korshunov, Marvin Lavechin, Diego Fustes, Hadrien Titeux, Wassim Bouaziz and Marie-Philippe Gill, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020

[URL]

Real-Time Segmentation Networks should be Latency Aware, Evann Courdier and Francois Fleuret, in: Asian Conference on Computer Vision, 2020

Residual Pose: A Decoupled Approach for Depth-based 3D Human Pose Estimation, Angel Martínez-González, Michael Villamizar, Olivier Canévet and Jean-Marc Odobez, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks, Weipeng He, Lu Lu, Biqiao Zhang, Jay Mahadeokar, Kaustubh Kalgaonkar and Christian Fuegen, in: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, pages 7499-7503, 2020

[DOI]

Supervised domain adaptation for text-independent speaker verification using limited data, Seyyed Saeed Sarfjoo, Srikanth Madikeri, Petr Motlicek and Sébastien Marcel, in: Interspeech, pages 3815-3819, 2020

[URL]

SYNTHETIC SPEECH REFERENCES FOR AUTOMATIC PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020

The MuMMER data set for Robot Perception in multi-party HRI Scenarios, Olivier Canévet, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, 2020

The societal and ethical relevance of computational Creativity, Michele Loi, Eleonora Viganò and Lonneke van der Plas, in: Proceedings of the International Conference on Computational Creativity, 2020

The Unstoppable Rise of Computational Linguistics in Deep Learning, James Henderson, in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Online, pages 6294-6306, Association for Computational Linguistics, 2020

[DOI]
[URL]

Towards Multilingual Sign Language Recognition, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention, Angelos Katharopoulos, Apoorv Vyas, Nikolaos Pappas and Francois Fleuret, in: Proceedings of International Conference on Machine Learning, 2020

Understanding Applicants' Reactions to Asynchronous Video Interviews through Self-Reports and Nonverbal Cues, Skanda Muralidhar, Emmanuelle Patricia Kleinlogel, Eric Mayor, Adrian Bangerter, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimodal Interaction (ICMI), Utrecht, 2020

Understanding Heavy Drinking at Night through Smartphone Sensing and Active Human Engagement, Thanh-Trung Phan, Florian Labhart, Skanda Muralidhar and Daniel Gatica-Perez, in: Proceedings of the 14th EAI International Conference on Pervasive Computing Technologies for Healthcare, 2020

Unsupervised Representation Learning for Gaze Estimation, Yu Yu and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020

Variational Inference with Mixture Model Approximation for Applications in Robotics, Emmanuel Pignat, Teguh Santoso Lembono and Sylvain Calinon, in: International Conference on Robotics and Automation, 2020

#Drink Or #Drunk: Multimodal Signals and Drinking Practices on Instagram, Thanh-Trung Phan, Skanda Muralidhar and Daniel Gatica-Perez, in: Proceedings of the 13th EAI International Conference on Pervasive Computing Technologies for Healthcare, Trento, Italy, 2019

A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION, Srikanth Madikeri, Subhadeep Dey and Petr Motlicek, in: In Proceedings of ICASSP 2019, Brighton, ENGLAND, pages 5786-5790, 2019

A Deep Learning Approach for Robust Head Pose Independent Eye Movements Recognition from Videos, Remy Siegfried, Yu Yu and Jean-Marc Odobez, in: 2019 ACM Symposium on Eye Tracking Research and Applications, pages 5, ACM, 2019

[DOI]

A Learning-Based Framework for Quantized Compressed Sensing, Rabeeh Karimi Mahabadi, Junhong lin and Volkan Cevher, in: A Learning-Based Framework for Quantized Compressed Sensing, 2019

A morphological based PV generation and energy consumption predictive model for Singapore neighbourhood, Kin Ho Poon and Jérôme Kämpf, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

A smart luminaire in an office environment: impact on light distribution, user interactions and comfort, Julien Nembrini, Jérôme Kämpf, Michael Papinutto and Denis Lalanne, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

Abstract Text Summarization: A Low Resource Challenge, Shantipriya Parida and Petr Motlicek, in: In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), HongKong, China, pages 5, Association for Computational Linguistics (ACL), 2019

Adaptation of Assistant Based Speech Recognition to New Domains and Its Acceptance by Air Traffic Controllers, Matthias Kleinert, Hartmut Helmke, Gerald Siol, heiko Ehr, Dietrich Klakow, Mittul Singh, Petr Motlicek, Kern Christian, Cerna Aneta and Hlousek Petr, in: Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems, San Diego, California, USA, pages 820 - 826, 2019

[DOI]

Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019

[DOI]

Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task, Alireza Mohammadshahi, Karl Aberer and Rémi Lebret, in: Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), Hong Kong, pages 27-33, Association for Computational Linguistics, 2019

[DOI]
[URL]

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019

[URL]

An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, François Marelli, Bastian Schnell, Hervé Bourlard, T. Dutoit and Philip N. Garner, in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pages 7040-7044, IEEE, 2019

[DOI]
[URL]

AN INVESTIGATION OF MULTILINGUAL ASR USING END-TO-END LF-MMI, Sibo Tong, Philip N. Garner and Hervé Bourlard, in: International Conference on Acoustics, Speech and Signal Processing, 2019

ANALYZING UNCERTAINTIES IN SPEECH RECOGNITION USING DROPOUT, Apoorv Vyas, Pranay Dighe, Sibo Tong and Hervé Bourlard, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Automatic Diagnosis of Alzheimer's Disease Using Neural Network Language Models, Julian Fritsch, Sebastian Wankerl and Elmar Nöth, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Bayesian Optimization Meets Riemannian Manifolds in Robot Learning, N. Jaquier, L. Rozo, Sylvain Calinon and M. Buerger, in: Conference on Robot Learning, 2019

BookTubing Across Regions: Examining Differences based on Nonverbal and Verbal Cues, Chinchu Thomas, Dinesh Jayagopi and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Interactive Experiences for Television and Online Video (TVX), Salford, ENGLAND, 2019

[DOI]

Building energy models with Morphological urban-scale parameters: a case study in Turin, Roberto Boghetti, Fabio Fantozzi, Jérôme Kämpf, Guglielmina Mutani, Giacomo Salvadori and Valeria Todeschi, in: Proceedings of 4th Building Simulation Applications Conference - BSA 2019, 2019

[URL]

CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, Florian Mai, Lukas Galke and Ansgar Scherp, in: International Conference on Learning Representations, New Orleans, Louisiana, USA, 2019

[URL]

CityLearn v1.0: An OpenAI Gym Environment for Demand Response with Deep Reinforcement Learning, José Vázquez-Canteli, Jérôme Kämpf, Gregor Henze and Zoltán Nagy, in: Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, New-York, USA, pages 356-357, ACM, 2019

[DOI]

CO2 experimental measurements towards the development of a predictive framework using user actions in smart buildings, Rui Oliveira, Jérôme Kämpf, Romeu Vicente, Ricardo Almeida and António Figueiredo, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, Qingran Zhan, Petr Motlicek, Shixuan Du, Yahui Shan, Xiang Xie and Sifan Ma, in: Proceedings of APSIPA ASC 2019, 2019

Custom Silicone Face Masks - Vulnerability of Commercial Face Recognition Systems & Presentation Attack Detection, Ramachandra Raghavendra, Sushma Venkatesh, Kiran B. Raja, Sushil Bhattacharjee, Pankaj Wasnik, Sébastien Marcel and Christoph Busch, in: Proceedings of 7th IAPR/IEEE International Workshop on Biometrics and Forensics, 2019

Daylight regulated by automated external Venetian blinds based on HDR sky luminance mapping in winter, Yujie Wu, Jérôme Kämpf and J. -L. Scartezzini, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

Deep Pixel-wise Binary Supervision for Face Presentation Attack Detection, Anjith George and Sébastien Marcel, in: International Conference on Biometrics, 2019

Deep Residual Output Layers for Neural Language Generation, Nikolaos Pappas and James Henderson, in: Proceedings of the 36th International Conference on Machine Learning (ICML), 2019

Discovering Eating Routines in Context with a Smartphone App, Daniel Gatica-Perez, Joan-Isaac Biel, David Labbe and Nathalie Martin, in: Ubicomp/Iswc'19 Adjunct: Proceedings Of The 2019 Acm International Joint Conference On Pervasive And Ubiquitous Computing And Proceedings Of The 2019 Acm International Symposium On Wearable Computers, London, pages 422-429, 2019

[DOI]

Domain Adaptation in Multi-Channel Autoencoder based Features for Robust Face Anti-Spoofing, Olegs Nikisins, Anjith George and Sébastien Marcel, in: International Conference on Biometrics 2019, IEEE, 2019

EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, Alexandre Nanchen and Philip N. Garner, in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019

End-to-End Accented Speech Recognition, Thibault Viglino, Petr Motlicek and Milos Cernak, in: International Conference on Speech and Language Processing, Interspeech, ISCA, Graz, Austria, pages 2140-2144, 2019

[DOI]

Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition, Subhadeep Dey, Petr Motlicek, Trung Bui and Franck Dernoncourt, in: Proc. of Interspeech 2019, 2019

Full-Gradient Representation for Neural Network Visualization, Suraj Srinivas and Francois Fleuret, in: Advances in Neural Information Processing Systems, 2019

[URL]

Generalized temporal sampling with active illumination in optical microscopy, Christian Jaques and Michael Liebling, in: Proceeding of the SPIE Conference Optics and Photonics, Wavelets and Sparsity XVIII, SPIE, San Diego, California, United States, SPIE, 2019

HMM-based Approaches to Model Multichannel Information in Sign Language inspired from Articulatory Features-based Speech Processing, Sandrine Tornay, Marzieh Razavi, Necati Cihan Camgoz, Richard Bowden and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Idiap Abstract Text Summarization System for German Text Summarization Task, Shantipriya Parida and Petr Motlicek, in: Proceedings of the 4th edition of the Swiss Text Analytics Conference, 2019

[URL]

Idiap NMT System for WAT 2019 Multimodal Translation Task, Shantipriya Parida and Petr Motlicek, in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 175–180, Association for Computational Linguistics, 2019

[DOI]
[URL]

Implicit discourse relation classification with syntax-aware contextualized word representations, D. N. Popa, J. Perez, James Henderson and E. Gaussier, in: Proceedings of the 32nd International Florida Artificial Intelligence Research Society Conference, 2019

Improving Children Speech Recognition through Feature Learning from Raw Speech Signal, S. Pavankumar Dubagunta, Selen Hande Kabil and Mathew Magimai-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Improving dual-arm assembly by master-slave compliance, M. Suomalainen, Sylvain Calinon, E. Pignat and V. Kyrki, in: Proc. IEEE Intl Conf. on Robotics and Automation, pages 8676-8682, 2019

Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis, Yu Yu, Gang Liu and Jean-Marc Odobez, in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019

INCREMENTAL TRANSFER LEARNING IN TWO-PASS INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, Nauman Dawalatabad, Srikanth Madikeri, Hema A Murthy and C Chandra Sekhar, in: Proceedings of ICASSP 2019, pages 6291-6295, 2019

Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares, Marvin Tammen, Ina Kodrasi and Simon Doclo, in: IEEE International Conference on Acoustics, Speech and Signal Processing, pages 795--799, 2019

Learning an event sequence embedding for event-based deep stereo, Stepan Tulyakov, Francois Fleuret, Martin Kiefel, Peter Gehler and Michael Hirsch, in: Proceedings of the IEEE International Conference on Computer Vision, 2019

Learning from demonstration with model-based Gaussian process, N. Jaquier, David Ginsbourger and Sylvain Calinon, in: Conference on Robot Learning, 2019

Learning voice source related information for depression detection, S. Pavankumar Dubagunta, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Multi-agent reinforcement learning for adaptive demand response in smart cities, José Vázquez-Canteli, Thomas Detjeen, Gregor Henze, Jérôme Kämpf and Zoltán Nagy, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

Multi-Spectral Widefield Microscopy of the Beating Heart through Post-Acquisition Synchronization and Unmixing, Christian Jaques, Linda Bapst-Wicht, Daniel F. Schorderet and Michael Liebling, in: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), Venice, Italy, pages 1382-1385, 2019

[DOI]

Multilingual Bottleneck Features for Query by Example Spoken Term Detection, Dhananjay Ram, Lesly Miculicich and Hervé Bourlard, in: IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Neural VTLN for Speaker Adaptation in TTS, Bastian Schnell and Philip N. Garner, in: Proc. 10th ISCA Speech Synthesis Workshop, ISCA, Vienna, Austria, pages 6, 2019

[DOI]

Open-Vocabulary Keyword Spotting With Audio And Text Embeddings, Niccolò Sacchi, Alexandre Nanchen, Martin Jaggi and Milos Cernak, in: Proceedings of Interspeech 2019, 2019

[DOI]

Overview of the 6th Workshop on Asian Translation, Shantipriya Parida, in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 1–35, Association for Computational Linguistics, 2019

[DOI]
[URL]

PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT BASED ON THE SHORT-TIME OBJECTIVE INTELLIGIBILITY MEASURE, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, UK, pages 6405--6409, 2019

Processing Megapixel Images with Deep Attention-Sampling Models, Angelos Katharopoulos and Francois Fleuret, in: Proceedings of International Conference on Machine Learning, 2019

[URL]

Reducing Noise in GAN Training with Variance Reduced Extragradient, Tatjana Chavdarova, Gauthier Gidel, Francois Fleuret and Simon Lacoste-Julien, in: Proceedings of the international conference on Neural Information Processing Systems, 2019

Reinforcement learning of trajectory distributions: Applications in assisted teleoperation and motion planning, Marco Ewerton, Guilherme Maeda, Dorothea Koert, Zlatko Kolev, Masaki Takahashi and Jan Peters, in: IEEE International Conference on Intelligent Robots and Systems, 2019

Retrofitting, district heating and energy storage: neighborhood energy planning, Diane von Gunten, Jakob Rager, Jérôme Kämpf, Fabien Kuchler and Fabien Poumadère, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

SARAL: A Low-Resource Cross-Lingual Domain-Focused Information Retrieval System for Effective Rapid Document Triage, Elizabeth Boschee, Joel Barry, Jayadev Billa, Marjorie Freedman, Thamme Gowda, Constantine Lignos, Chester Palen-Michel, Michael Pust, Banriskhem Khonglah, Srikanth Madikeri, Jonathan May and Scott Miller, in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. 2019, pages 19-24, 2019

SATokE: How can Syntax-Aware Contextualized Word Representations Benefit Implicit Discourse Relation Classification?, D. N. Popa, J. Perez, James Henderson and E. Gaussier, in: Ptroc. 2019 Conference sur l'Apprentissage automatique, 2019

Segment-level training of ANNs based on acoustic confidence measures for hybrid HMM/ANN Speech Recognition, S. Pavankumar Dubagunta and Mathew Magimai-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Selecting, Planning, and Rewriting: A Modular Approach for Data-to-Document Generation and Translation, Lesly Miculicich, Marc Marone and Hany Hassan, in: WNGT EMNLP, 2019

Self-attention for Speech Emotion Recognition, Lorenzo Tarantino, Philip N. Garner and Alexandros Lazaridis, in: Proc. Interspeech 2019, 2019

[DOI]

Social Multimedia, Diversity, and Global South Cities: A Double Blind Side, Daniel Gatica-Perez, Darshan Santani, Joan-Isaac Biel and Thanh-Trung Phan, in: Proc. ACM Workshop on Fairness, Accountability, and Transparency in Multimedia (FAT/MM), Nice, 2019

Spectral Subspace Analysis for Automatic Assessment of Pathological Speech Intelligibility, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: Proceedings of Interspeech, Graz, Austria, pages 3038--3042, 2019

Spoken language identification using language bottleneck features, Malo Grisard, Petr Motlicek, Wissem Allouchi, Michael Baeriswyl, Alexandros Lazaridis and Qingran Zhan, in: Proceedings of TSD, 2019

Super-gaussianity of Speech Spectral Coefficients as a Potential Biomarker for Dysarthric Speech Detection, Ina Kodrasi and Hervé Bourlard, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

Tampered Speaker Inconsistency Detection with Phonetically Aware Audio-visual Features, Pavel Korshunov, Michael Halstead, Diego Castan, Martin Graciarena, Mitchell McLaren, Brian Burns, Aaron Lawson and Sébastien Marcel, in: International Conference on Machine Learning, 2019

The Simulation of Mean Radiant Temperature in Outdoor Conditions: A Review of Architectural Tools Calculation Assumptions, Emanuele Naboni, Marco Meloni, Chris Makey and Jérôme Kämpf, in: Proceedings of Building Simulation 2019: 16th Conference of IBPSA, 2019

Unbiased semi-supervised LF-MMI training using dropout, Sibo Tong, Apoorv Vyas, Philip N. Garner and Hervé Bourlard, in: Proceedings of Interspeech 2019, 2019

[DOI]

Uncertainty-aware imitation learning using kernelized movement primitives, J. Silverio, Y. Huang, F. J. Abu-Dakka, L. Rozo and D. G. Caldwell, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Understanding and Visualizing Raw Waveform-based CNNs, Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai-Doss and Sébastien Marcel, in: Proceedings of Interspeech, 2019

Understanding the performance gap: a machine learning approach on residential buildings in Turin, Italy, Roberto Boghetti, Fabio Fantozzi, Jérôme Kämpf and Giacomo Salvadori, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

Using Speech Production Knowledge for Raw Waveform Modelling based Styrian Dialect Identification, S. Pavankumar Dubagunta and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2019

Virtual High-Framerate Microscopy of the Beating Heart via Sorting of Still Images, Olivia Mariani, Kevin G. Chan, Alexander Ernst, Nadia Mercader and Michael Liebling, in: 2019 IEEE 16th International Symposium on Biomedical Imaging, pages 312--315, 2019

Vulnerability assessment and detection of Deepfake videos, Pavel Korshunov and Sébastien Marcel, in: IAPR International Conference on Biometrics, 2019

Vulnerability of Face Recognition to Deep Morphing, Pavel Korshunov and Sébastien Marcel, in: International Conference on Biometrics for Borders, 2019

Weakly-Supervised Concept-based Adversarial Learning for Cross-lingual Word Embeddings, Haozhou Wang, James Henderson and Paola Merlo, in: Proc. 2019 Conference on Empirical Methods in Natural Language Processing, 2019

A Differential Approach for Gaze Estimation with Calibration, Gang Liu, Yu Yu, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: 29TH BRITISH MACHINE VISION CONFERENCE, 2018

A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, Bastian Schnell and Philip N. Garner, in: Proc. Interspeech 2018, pages 3147-3151, 2018

[DOI]

A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, Bastian Schnell and Philip N. Garner, in: MLSLP-18 Proceedings, Hyderabad, 2018

[URL]

Ambiance in Social Media Venues: Visual Cue Interpretation by Machines and Crowds, Gulcan Can, Yassir Benkhedda and Daniel Gatica-Perez, in: IEEE CVPR Workshop on Visual Understanding of Subjective Attributes, 2018

Analysis of Language Dependent Front-End for Speaker Recognition, Srikanth Madikeri, Subhadeep Dey and Petr Motlicek, in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 1101-1105, 2018

[DOI]

Beyond Weight Tying: Learning Joint Input-Output Embeddings for Neural Machine Translation, Nikolaos Pappas, Lesly Miculicich and James Henderson, in: Proceedings of the Third Conference on Machine Translation (WMT), 2018

Bimanual Skill Learning with Pose and Joint Space Constraints, J. Silverio, Sylvain Calinon, L. Rozo and D. G. Caldwell, in: Proc. of the IEEE-RAS Intl Conf. on Humanoid Robots (Humanoids), 2018

Building Blocks of Assistant Based Speech Recognition for Air Traffic Management Applications, Matthias Kleinert, Hartmut Helmke, heiko Ehr, Kern Christian, Dietrich Klakow, Petr Motlicek, Mittul Singh and Gerald Siol, in: Conference: SESAR Innovation Days 2018, European Union, Eurocontrol, Salzburg, Austria, SESARJU, 2018

[URL]

CNN based Query by Example Spoken Term Detection, Dhananjay Ram, Lesly Miculicich and Hervé Bourlard, in: Proceedings of Interspeech, 2018

Complexity reduction of eigenvalue decomposition-based diffuse power spectral density estimators using the power method, Marvin Tammen, Ina Kodrasi and Simon Doclo, in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 451-455, 2018

CONTEXT-AWARE ATTENTION MECHANISM FOR SPEECH EMOTION RECOGNITION, Gaetan Ramet, Philip N. Garner, Michael Baeriswyl and Alexandros Lazaridis, in: IEEE Workshop on Spoken Language Technology, Athens, Greece, pages 126-131, 2018

[URL]

Deep Multitask Gaze Estimation with a Constrained Landmark-Gaze Model, Yu Yu, Gang Liu and Jean-Marc Odobez, in: European Conference on Computer Vision Workshop, 2018

Deep Neural Networks for Multiple Speaker Detection and Localization, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, AUSTRALIA, pages 74-79, 2018

[DOI]

Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech, Jilt Sebastian, Manoj Kumar, D S Pavan Kumar, Mathew Magimai-Doss, Hema A Murthy and Shrikanth Narayanan, in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 292-296, 2018

[DOI]

DNN based speaker embedding using content information for text-dependent speaker verification, Subhadeep Dey, Takafumi Koshinaka, Petr Motlicek and Srikanth Madikeri, in: Proceedings of 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2018

Document-Level Neural Machine Translation with Hierarchical Attention Networks, Lesly Miculicich, Dhananjay Ram, Nikolaos Pappas and James Henderson, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018

Embedding Context-Dependent Variations of Prosodic Contours using Variational Encoding for Decomposing the Structure of Speech Prosody, Branislav Gerazov, Gérard Bailly, Omar Mohammed, Yi Xu and Philip N. Garner, in: Workshop on Prosody and Meaning: Information Structure and Beyond, Aix-en-Provence, France, 2018

[URL]

End-to-end text-dependent speaker verification using novel distance measures, Subhadeep Dey, Srikanth Madikeri and Petr Motlicek, in: Proceedings of Interspeech 2018, Hyderabad, INDIA, Aug 02-Sep 06, 2018, pages 3598-3602, 2018

[DOI]

Enhancing Trust in eAssessment - the TeSLA System Solution, Malinka Ivanova, Sushil Bhattacharjee, Sébastien Marcel, Anna Rozeva and Mariana Durcheva, in: Technology Enhanced Assessment Conference., 2018

Experimental evaluation of speech enhancement methods in remote microphone systems for hearing aids, Gilles Curtois, Vincent Grimaldi, Hervé Lissek, Ina Kodrasi and Eleftheria Georganti, in: Proc. EuroNoise 2018, Crete, Greece, pages 351-358, 2018

Facing Employers and Customers: What Do Gaze and Expressions Tell About Soft Skills?, Skanda Muralidhar, Remy Siegfried, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 121-126, ASSOC COMPUTING MACHINERY, 2018

[DOI]

Far-field ASR Using Low-rank and Sparse Soft Targets from Parallel Data, Pranay Dighe, Hervé Bourlard and Afsaneh Asaei, in: IEEE Workshop on Spoken Language Technology, Athens, GREECE, pages 581-587, IEEE, 2018

Fast cross-correlation based wrist vein recognition algorithm with rotation and translation compensation, Olegs Nikisins, Teodors Eglitis, André Anjos and Sébastien Marcel, in: Sixth International Workshop on Biometrics and Forensics, 2018

Fast Language Adaptation Using Phonological Information, Sibo Tong, Philip N. Garner and Hervé Bourlard, in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 2459-2463, 2018

[DOI]

Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models, A. K. Tanwani, J. Lee, B. Thananjeyan, M. Laskey, S. Krishnan, R. Fox, K. Goldberg and Sylvain Calinon, in: 13th Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018

Geodesic Convolutional Shape Optimization, Pierre Baqué, Edoardo Remelli, Francois Fleuret and Pascal Fua, in: Proceedings of the International Conference on Machine Learning, 2018

Geometry-aware Control and Learning in Robotics, N. Jaquier and Sylvain Calinon, in: R:SS Pioneers Workshop, 2018

Geometry-aware Robot Manipulability Transfer, N. Jaquier, L. Rozo and Sylvain Calinon, in: R:SS Workshop on Learning and Inference in Robotics: Integrating Structure, Priors and Models, 2018

Geometry-aware Tracking of Manipulability Ellipsoids, N. Jaquier, L. Rozo, D. G. Caldwell and Sylvain Calinon, in: Robotics: Science and Systems, Pittsburgh, USA, 2018

Implementing Fusion Techniques for the Classification of Paralinguistic Information, Bogdan Vlasenko, Jilt Sebastian, D S Pavan Kumar and Mathew Magimai-Doss, in: Proceedings of Interspeech 2018, pages 526-530, 2018

Investigating Depth Domain Adaptation for Efficient Human Pose Estimation, Angel Martínez-González, Michael Villamizar, Olivier Canévet and Jean-Marc Odobez, in: European Conference on Computer Vision - Workshops, 2018

Iterative alternating least-aquares approach to jointly estimate the RETFs and the diffuse PSD, Marvin Tammen, Ina Kodrasi and Simon Doclo, in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018

Iterative Learning of Speech Recognition Models for Air Traffic Control, Ajay Srinivasamurthy, Petr Motlicek, Mittul Singh, Youssef Oualil, Matthias Kleinert, heiko Ehr and Hartmut Helmke, in: Proceedings of Interspeech 2018, ISCA, Hyderabad, India, pages 3519-3523, 2018

[DOI]

Joining high-level symbolic planning with low-level motion primitives in adaptive HRI: application to dressing assistance, G. Canal, E. Pignat, G. Alenya, Sylvain Calinon and C. Torras, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2018

Joint late reverberation and noise power spectral density estimation in a spatially homogeneous noise field, Ina Kodrasi and Simon Doclo, in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 441-445, 2018

Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: Proceedings of Interspeech, pages 312--316, 2018

[DOI]

Knowledge Transfer with Jacobian Matching, Suraj Srinivas and Francois Fleuret, in: Proceedings of the International Conference on Machine Learning, 2018

[URL]

Kronecker Recurrent Units, Cijo Jose, Moustapha Cisse and Francois Fleuret, in: Proceedings of the International Conference on Machine Learning, 2018

Leveraging Convolutional Pose Machines for Fast and Accurate Head Pose Estimation, Yuanzhouhan Cao, Olivier Canévet and Jean-Marc Odobez, in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, SPAIN, pages 1089-1094, IEEE, 2018

Low-latency speaker spotting with online diarization and detection, Jose Patino, Ruiqing Yin, Hector Delgado, Herve Bredin, Alain Komaty, Guillaume Wisniewski, Claude Barras, Nicholas Evans and Sébastien Marcel, in: The Speaker and Language Recognition Workshop (Odyssey), 2018

Multilingual bottleneck features for subword modeling in zero-resource languages, Enno Hermann and Sharon Goldwater, in: Proc. Interspeech, pages 2668-2672, 2018

[DOI]

NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL, Milos Cernak and Sibo Tong, in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

Not All Samples Are Created Equal: Deep Learning with Importance Sampling, Angelos Katharopoulos and Francois Fleuret, in: Proceedings of International Conference on Machine Learning, 2018

On Effectiveness of Anomaly Detection Approaches against Unseen Presentation Attacks in Face Anti-Spoofing, Olegs Nikisins, Amir Mohammadi, André Anjos and Sébastien Marcel, in: The 11th IAPR International Conference on Biometrics (ICB 2018), 2018

On Learning to Identify Genders from Raw Speech Signal Using CNNs, Selen Hande Kabil, Hannah Muckenhirn and Mathew Magimai-Doss, in: Proceedings of Interspeech, Hyderabad, INDIA, pages 287-291, 2018

[DOI]

On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: Proceedings of Interspeech, Hyderabad, INDIA, pages 1116-1120, 2018

On the Use of Convolutional Neural Networks for Speech Presentation Attack Detection, Pavel Korshunov, Andreé R. Goncalves, Ricardo P. V. Violato, Flávio O. Simões and Sébastien Marcel, in: International Conference on Identity, Security and Behavior Analysis, 2018

Phonological Posterior Hashing for Query by Example Spoken Term Detection, Afsaneh Asaei, Dhananjay Ram and Hervé Bourlard, in: Proceedings of Interspeech, 2018

Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching, Stepan Tulyakov, Anton Ivanov and Francois Fleuret, in: Proceedings of the international conference on Neural Information Processing Systems, 2018

Probabilistic Learning of Torque Controllers from Kinematic and Force Constraints, J. Silverio, Y. Huang, L. Rozo, Sylvain Calinon and D. G. Caldwell, in: Proc. of the IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 6552-6559, 2018

Pulse-based Features for Face Presentation Attack Detection, Guillaume Heusch and Sébastien Marcel, in: Proceedings of BTAS 2018, special session on Image and Video Forensics in Biometrics, 2018

Real-time Convolutional Networks for Depth-based Human Pose Estimation, Angel Martínez-González, Michael Villamizar, Olivier Canévet and Jean-Marc Odobez, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Real-Time DCT Learning-based Reconstruction of Neural Signals, Rabeeh Karimi Mahabadi, Cosimo Aprile and Volkan Cevher, in: EUSIPCO, 2018

Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization, Nam Le and Jean-Marc Odobez, in: Proceedings of Interspeech, Hyderabad, INDIA, pages 2257-2261, 2018

[DOI]

SCALAR - Simultaneous Calibration of 2D Laser And Robot's Kinematic Parameters Using Three Planar Constraints, Teguh Santoso Lembono, Francisco Suarez-Ruiz and Quang-Cuong Pham, in: International Conference on Intelligent Robots, 2018

Self-Attentive Residual Decoder for Neural Machine Translation, Lesly Miculicich, Nikolaos Pappas, Dhananjay Ram and Andrei Popescu-Belis, in: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018

Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation by Use of Convolutional Neural Networks, Adrian Shajkofci and Michael Liebling, in: 2018 25th IEEE International Conference on Image Processing (ICIP), pages 3818-3822, IEEE, 2018

[DOI]

Semi-supervised Adaptation of Assistant Based Speech Recognition Models for different Approach Areas, Matthias Kleinert, Hartmut Helmke, Gerald Siol, heiko Ehr, Cerna Aneta, Kern Christian, Dietrich Klakow, Petr Motlicek, Youssef Oualil, Mittul Singh and Ajay Srinivasamurthy, in: 37th AIAA/IEEE Digital Avionics Systems Conference, AIAA/IEEE, London, 2018

[URL]

SGAN: An Alternative Training of Generative Adversarial Networks, Tatjana Chavdarova and Francois Fleuret, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 9407-9415, IEEE, 2018

[DOI]

SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, Khaled Khelif, yann Mombrun, Gideon Hazzani, Petr Motlicek, Srikanth Madikeri, Farhan Sahito, Damien Kelly, Luca Scarpatto, Emmanouil Chatzigavriil and Gerhard Backfried, in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018

[DOI]
[URL]

Single-channel late reverberation power spectral density estimation using denoising autoencoders, Ina Kodrasi and Hervé Bourlard, in: Proc. Annual Conference of the International Speech Communication Association, Hyderabad, India, 2018

SMILE Swiss German Sign Language Dataset, Sarah Ebling, Necati Cihan Camgoz, Penny Boyes Braem, Katja Tissi, Sandra Sidler-Miserez, Stephanie Stoll, Simon Hadfield, Tobias Haug, Richard Bowden, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss, in: Language Resources and Evaluation Conference, 2018

Speaker Inconsistency Detection in Tampered Video, Pavel Korshunov and Sébastien Marcel, in: European Signal Processing Conference, 2018

Spoofing Deep Face Recognition With Custom Silicone Masks, Sushil Bhattacharjee, Amir Mohammadi and Sébastien Marcel, in: Proceedings of BTAS2018, 2018

Statistical modeling of speech spectral coefficients in patients with Parkinson's disease, Ina Kodrasi and Hervé Bourlard, in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018

Stochastic Variance Reduced Gradient Optimization of Generative Adversarial Networks, Tatjana Chavdarova, Sebastian Stich, Martin Jaggi and Francois Fleuret, in: International Conference on Machine Learning (ICML) workshop on Theoretical Foundations and Applications of Deep Generative Models, 2018

Towards directly modeling raw speech signal for speaker verification using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, CANADA, pages 4884-4888, 2018

Towards Quantifying the Entropy of Fingervein Patterns across Different Feature Extractors, Vedrana Krivokuca and Sébastien Marcel, in: 2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis (ISBA), 2018

UNICITY: A depth maps database for people detection in security airlocks, Joël Dumoulin, Olivier Canévet, Michael Villamizar, Hugo Nunes, Omar Abou Khaled, Elena Mugellini, Fabrice Moscheni and Jean-Marc Odobez, in: IEEE International Conference on Advanced Video and Signal-based Surveillance Workshop, 2018

Vlogging Over Time: Longitudinal Impressions and Behavior in YouTube, Daniel Gatica-Perez, Dairazalia Sanchez-Cortes, Trinh-Minh-Tri Do, Dinesh Babu Jayagopi and Kazuhiro Otsuka, in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, EGYPT, pages 37-46, 2018

[DOI]

WatchNet: Efficient and Depth-based Network for People Detection in Video Surveillance Systems, Michael Villamizar, Angel Martínez-González, Olivier Canévet and Jean-Marc Odobez, in: IEEE International Conference on Advanced Video and Signal-based Surveillance, Auckland, NEW ZEALAND, pages 109-114, 2018

WILDTRACK: A Multi-camera HD Dataset for Dense Unscripted Pedestrian Detection, Tatjana Chavdarova, Pierre Baqué, Andrii Maksai, Stéphane Bouquet, Cijo Jose, Louis Lettry, Francois Fleuret, Pascal Fua and Luc Van Gool, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 5030-5039, 2018

[DOI]

Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes, Skanda Muralidhar, Laurent Son Nguyen and Daniel Gatica-Perez, in: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2018

#Healthy #Fondue #Dinner: Analysis and Inference of Food and Drink Consumption Patterns on Instagram, Thanh-Trung Phan and Daniel Gatica-Perez, in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017

A Competition on Generalized Software-based Face Presentation Attack Detection in Mobile Scenarios, Z. Boulkenafet, J. Komulainen, Zahid Akhtar, A. Benlamoudi, SE. Bekhouche, F. Dornaika, A. Ouafi, Amir Mohammadi, Sushil Bhattacharjee and Sébastien Marcel, in: Proceedings of the International Joint Conference on Biometrics, 2017, 2017

A Context-Aware Speech recognition and Understanding System for Air Traffic Control Domain, Youssef Oualil, Dietrich Klakow, Gyorgy Szaszak, Ajay Srinivasamurthy, Hartmut Helmke and Petr Motlicek, in: Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, Okinawa, Japan, 2017

A Domain Adaptation Approach to Improve Speaker Turn Embedding Using Face Representation, Nam Le and Jean-Marc Odobez, in: ACM International Conference on Multimodal Interaction, Glasgow, Scotland, ACM, 2017

A Generative Model for Intention Recognition and Manipulation Assistance in Teleoperation, Ajay Kumar Tanwani and Sylvain Calinon, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

A Sub-Quadratic Exact Medoid Algorithm, James Newling and Francois Fleuret, in: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

Active Online Anomaly Detection using Dirichlet Process Mixture Model and Gaussian Process Classification, Jagannadan Varadarajan, R Subramanian, Narendra Ahuja, Pierre Moulin and Jean-Marc Odobez, in: IEEE Winter Conference on Applications of Computer Vision (WACV), Washington, 2017

An Investigation of Deep Neural Networks for Multilingual Speech Recognition Training and Adaptation, Sibo Tong, Philip N. Garner and Hervé Bourlard, in: Proc. of Interspeech, 2017

BEAT: An Open-Science Web Platform, André Anjos, Laurent El Shafey and Sébastien Marcel, in: Thirty-fourth International Conference on Machine Learning, Sydney, Australia, 2017

[URL]

Bob Speaks Kaldi, Milos Cernak, Alain Komaty, Amir Mohammadi, André Anjos and Sébastien Marcel, in: Proc. of Interspeech, 2017

Boosted Exudate Segmentation in Retinal Images using Residual Nets, Samaneh Abbasi-Sureshjani, Behdad Dasht Bozorg, Bart ter Haar Romeny and Francois Fleuret, in: Proceedings of the MICCAI Workshop on Ophthalmic Medical Image Analysis, 2017

Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, Xiao Pu, Laura Mascarell and Andrei Popescu-Belis, in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017

Content Normalization for Text-dependent Speaker Verification, Subhadeep Dey, Srikanth Madikeri, Petr Motlicek and Marc Ferras, in: Proc. of Interspeech, 2017

Continuously Reproducing Toolchains in Pattern Recognition and Machine Learning Experiments, André Anjos, Manuel Günther, Tiago de Freitas Pereira, Pavel Korshunov, Amir Mohammadi and Sébastien Marcel, in: Thirty-fourth International Conference on Machine Learning, Sidney, Australia, 2017

[URL]

Cross-Eyed 2017: Cross-Spectral Iris/Periocular Recognition Competition., Ana Sequeira, Lulu Chen, James Ferryman, Peter Wild, Fernando Alonso-Fernandez, Josef Bigün, Kiran B. Raja, R. Raghavendra, Christoph Busch, Tiago de Freitas Pereira, Sébastien Marcel, Sushree Sangeeta Behera, Mahesh Gour and Vivek Kanhangad, in: IEEE/IAPR International Joint Conference on Biometrics, Denver, Colorado, USA, IEEE, 2017

Deep Multi-Camera People Detection, Tatjana Chavdarova and Francois Fleuret, in: Proceedings of the IEEE International Conference on Machine Learning and Applications, 2017

Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection, Pierre Baqué, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2017

Dynamic Graffiti Stylisation with Stochastic Optimal Control, D. Berio, Sylvain Calinon and F. F. Leymarie, in: Intl Workshop on movement and computing (MOCO), London, UK, pages 1-8, ACM, 2017

[DOI]
[URL]

Elderly People Living Alone: Detecting Home Visits with Ambient and Wearable Sensing, Rui Hu, Hieu Pham, Philipp Buluschek and Daniel Gatica-Perez, in: In Proceedings of MMHealth, 2017

End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017

Examining Linguistic Content and Skill Impression Structure for Job Interview Analytics in Hospitality, Skanda Muralidhar and Daniel Gatica-Perez, in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017

Exploiting Eigenposteriors for Semi-supervised Training of DNN Acoustic Models with Sequence Discrimination, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, in: Proceedings of Interspeech, 2017

EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, pages 5370-5374, 2017

Exploratory Study on Direct Prediction of Diabetes using Deep Residual Networks, Samaneh Abbasi-Sureshjani, Behdad Dasht Bozorg, Bart ter Haar Romeny and Francois Fleuret, in: Proceedings of the thematic conference on computational vision and medical image processing, 2017

Gaussian Mixture Regression on Symmetric Positive Definite Matrices Manifolds: Application to Wrist Motion Estimation with sEMG, N. Jaquier and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 59-64, 2017

[URL]

Generating Calligraphic Trajectories with Model Predictive Control, D. Berio, Sylvain Calinon and F. F. Leymarie, in: Proc. 43rd Conf. on Graphics Interface, Edmonton, AL, Canada, pages 132-139, 2017

[DOI]

How May I Help You? Behavior and Impressions in Hospitality Service Encounters, Skanda Muralidhar, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proceddings of 19th ACM International Conference on Multimodal Interaction, 2017

Implementing gender-dependent vowel-level analysis for boosting speech-based depression recognition, Bogdan Vlasenko, Hesam Sagha, Nicholas Cummins and Björn Schuller, in: Proceedings of Interspeech 2017, 2017

Improving hand and wrist activity detection using tactile sensors and tensor regression methods on Riemannian manifolds, N. Jaquier, C. Castellini and Sylvain Calinon, in: Proc. of the Myoelectric Control Symposium, 2017

[URL]

Improving speaker turn embedding by crossmodal transfer learning from face embedding, Nam Le and Jean-Marc Odobez, in: ICCV Workshop on Computer Vision for Audio-Visual Media, 2017

Insiders and Outsiders: Comparing Urban Impressions between Population Groups, Darshan Santani, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: International Conference on Multimedia Retrieval, ACM, 2017

[DOI]

INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION, Srikanth Madikeri, Marc Ferras, Petr Motlicek and Subhadeep Dey, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, pages 5365-5369, 2017

[DOI]

K-Medoids For K-Means Seeding, James Newling and Francois Fleuret, in: Proceedings of the international conference on Neural Information Processing Systems, 2017

Learning Manipulability Ellipsoids for Task Compatibility in Robot Manipulation, L. Rozo, N. Jaquier, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3183-3189, 2017

[URL]

Learning Task-Space Synergies using Riemannian Geometry, M. Zeestraten, I. Havoutis, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Vancouver, Canada, pages 73-78, IEEE, 2017

[URL]

Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017

Machine Learning of Controller Command Prediction Models from Recorded Radar Data and Controller Speech Utterances, Matthias Kleinert, Hartmut Helmke, Gerald Siol, heiko Ehr, Michael Finke, Youssef Oualil and Ajay Srinivasamurthy, in: Proceedings of the 7th SESAR Innovation Days (SID), University of Belgrade, Belgrade, Serbia, 2017

Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, Ngoc-Quang Luong and Andrei Popescu-Belis, in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017

Multi-Modal Mean-Fields via Cardinality-Based Clamping, Pierre Baqué, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017

Multi-view Representation Learning Via GCCA for Multimodal Analysis of Parkinson's Disease, Juan Camilo Vasquez-Correa, Juan Rafael Orozco-Arroyave, Raman Arora, Elmar Nöth, Najim Dehak, Heidi Christensen, Frank Rudzicz, Tobias Bocklet, Milos Cernak, Hamidreza Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Maria Yancheva, Alyssa Vann and Nikolai Vogler, in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017

Multilingual Hierarchical Attention Networks for Document Classification, Nikolaos Pappas and Andrei Popescu-Belis, in: Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP), pages 1015-1025, 2017

Non-Markovian Globally Consistent Multi-Object Tracking, Andrii Maksai, Xinchao Wang, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2017

Non-parametric warping via local scale estimation for non-stationary Gaussian process modelling, Sébastien Marmin, Jean Baccou, Jacques Liandrat and David Ginsbourger, in: Wavelets and Sparsity XVII, pages 1039421, International Society for Optics and Photonics, 2017

[DOI]
[URL]

On Job Training: Automated Interpersonal Behavior Assessment & Real-Time Feedback, Skanda Muralidhar, in: Proceedings of the 25th ACM International Conference on Multimedia, 2017, 2017

On the Generalization of Fused Systems in Voice Presentation Attack Detection, Andreé R. Goncalves, Pavel Korshunov, Ricardo P. V. Violato, Flávio O. Simões and Sébastien Marcel, in: 16th International Conference of the Biometrics Special Interest Group, 2017

On the Impact of Non-modal Phonation On Phonological Features, Milos Cernak, Elmar Nöth, Frank Rudzicz, Heidi Christensen, Juan Rafael Orozco-Arroyave, Raman Arora, Tobias Bocklet, Hamidreza Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Juan Camilo Vasquez, Maria Yancheva, Alyssa Vann and Nikolai Vogler, in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017

Planification adaptative d'expériences numériques par paquets en contexte non stationnaire pour une étude de fissuration mécanique, Sébastien Marmin, Jean Baccou, Frédéric Perales, David Ginsbourger and Jacques Liandrat, in: 23eme Congres Francais de Mecanique, 28 aout - 1er septembre 2017, Lille, France (FR), AFM, 2017

[URL]

Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction, Yu Yu, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: Proceedings of the 12th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2017), 2017

Semi-supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control, Ajay Srinivasamurthy, Petr Motlicek, Ivan Himawan, Gyorgy Szaszak, Youssef Oualil and Hartmut Helmke, in: Proceedings of Interspeech 2017, Stockholm, Sweden, pages 2406-2410, 2017

[DOI]

Sense-Aware Statistical Machine Translation using Adaptive Context-Dependent Clustering, Xiao Pu, Nikolaos Pappas and Andrei Popescu-Belis, in: Proceedings of Second Conference on Machine Translation (WMT17), 2017

Shape Representations for Maya Codical Glyphs: Knowledge-driven or Deep?, Gulcan Can, Jean-Marc Odobez and Daniel Gatica-Perez, in: 15th International Workshop on Content-Based Multimedia Indexing, 2017

Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition, Timur Bagautdinov, Alexandre Alahi, Francois Fleuret, Pascal Fua and Sylvio Savarese, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017

Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment, Afsaneh Asaei, Milos Cernak, Hervé Bourlard and Dhananjay Ram, in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017

Subspace Regularized Dynamic Time Warping for Spoken Query Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017

Supervised Gaze Bias Correction for Gaze Coding in Interactions, Remy Siegfried and Jean-Marc Odobez, in: ECEM COGAIN Symposium, pages 3, 2017

Supervisory teleoperation with online learning and optimal control, I. Havoutis and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1534-1540, IEEE, 2017

[URL]

The SUMMA Platform Prototype, Renars Liepins and et al., in: Proceedings of the EACL 2017 Software Demonstrations, Valencia, Spain, pages 116--119, 2017

[URL]

Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, Khaled Khelif, yann Mombrun, Gerhard Backfried, Farhan Sahito, Luca Scarpatto, Petr Motlicek, Damien Kelly, Gideon Hazzani, Emmanouil Chatzigavriil and Srikanth Madikeri, in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athenes, Greece, pages 32-39, IEEE Computer Society, 2017

[DOI]
[URL]

Towards large scale multimedia indexing: A case study on person discovery in broadcast news, Nam Le, Jean-Marc Odobez and et al., in: 15th International Workshop on Content-Based Multimedia Indexing, 2017

Towards the Use of Social Interaction Conventions as Prior for Gaze Model Adaptation, Remy Siegfried, Yu Yu and Jean-Marc Odobez, in: Proceedings of 19th ACM International Conference on Multimodal Interaction, pages 9, ACM, 2017

[DOI]

Trajectory and Foothold Optimization using Low-Dimensional Models for Rough Terrain Locomotion, C. Mastalli, M. Focchi, I. Havoutis, A. Radulescu, Sylvain Calinon, J. Buchli, D. G. Caldwell and C. Semini, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1096-1103, IEEE, 2017

[URL]

Using Coreference Links to Improve Spanish-to-English Machine Translation, Lesly Miculicich and Andrei Popescu-Belis, in: Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), Valencia, Spain, pages 30-40, Association for Computational Linguistics (ACL), 2017

Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), Lesly Miculicich and Andrei Popescu-Belis, in: Proceedings of the Third Workshop on Discourse in Machine Translation (DiscoMT), Denmark, Copenhagen, Association for Computational Linguistics (ACL), 2017

Venues in Social Media: Examining Ambiance Perception Through Scene Semantics, Yassir Benkhedda, Darshan Santani and Daniel Gatica-Perez, in: Proceedings of the 25th ACM International Conference on Multimedia, ACM, 2017, 2017

Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction, Stepan Tulyakov, Anton Ivanov and Francois Fleuret, in: Proceedings of the IEEE International Conference on Computer Vision, 2017

What you can't see can help you -- extended-range imaging for 3D-mask presentation attack detection, Sushil Bhattacharjee and Sébastien Marcel, in: Proceedings of the 16th International Conference on Biometrics Special Interest Group., Darmstadt (Germany), Gesellschaft fuer Informatik e.V. (GI), 2017

A Contextual Language Model to Improve Machine Translation of Pronouns by Re-ranking Translation Hypotheses, Ngoc-Quang Luong and Andrei Popescu-Belis, in: European Association for Machine Translation, 2016

A MultiPath Network for Object Detection, Sergey Zagoruyko, Adam Lerer, Tsung-Yi Lin, Pedro H. O. Pinheiro, Sam Gross, Soumith Chintala and Piotr Dollar, in: Proceedings of the British Machine Vision Conference, BMVA Press, 2016

[URL]

A Point-Spread-Function-Aware Filtered Backprojection Algorithm for Focal-Plane-Scanning Optical Projection Tomography, Kevin G. Chan and Michael Liebling, in: 2016 IEEE International Symposium on Biomedical Imaging, 2016

An agonist-antagonist pitch production model, Branislav Gerazov and Philip N. Garner, in: Lecture Notes in Artificial Intelligence: 18th International Conference, SPECOM 2016, Budapest, Hungary, pages 84--91, 2016

Ancient Maya Writings as High-Dimensional Data: a Visualization Approach, Gulcan Can, Jean-Marc Odobez, Carlos Pallan Gayol and Daniel Gatica-Perez, in: Digital Humanities (DH), Krakow, 2016

Anomaly detection in elderly daily behavior in ambient sensing environments, Oya Aran, Dairazalia Sanchez-Cortes, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: Proceedings of the 7th Int. Workshop on Human Behavior Understanding, ACM Multimedia, 2016, Amsterdam, Netherlands, 2016

Assessing a Shape Descriptor for Analysis of Mesoamerican Hieroglyphics: A View Towards Practice in Digital Humanities, Rui Hu, Jean-Marc Odobez and Daniel Gatica-Perez, in: Digital Humanities Conference (DH), Krakow, 2016

Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph, Chidansh A. Bhatt, Andrei Popescu-Belis and Matthew Cooper, in: Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), ACM, New York, NY, ACM Press, 2016

Comparing Two Strategies for Query Expansion in a News Monitoring System, Parvaz Mahdabi and Andrei Popescu-Belis, in: Natural Language Processing and Information Systems: 21st International Conference on Applications of Natural Language to Information Systems, NLDB 2016, pages 267-275, Springer-Verlag, 2016

[DOI]

Cross-database evaluation of audio-based spoofing detection systems, Pavel Korshunov and Sébastien Marcel, in: Interspeech, San Francisco, USA, 2016

[URL]

DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Srikanth Madikeri, Marc Ferras and Petr Motlicek, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5050-5054, IEEE, 2016

Deep Neural Networks for Syntactic Parsing of Morphologically Rich Languages, Joël Legrand and Ronan Collobert, in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer, Milan Secujski, Branislav Gerazov, Tamas Gabor Csapo, Vlado Delic, Philip N. Garner, Aleksandar Gjoreski, David Guennec, Zoran Ivanovski, Aleksandar Melov, Geza Nemeth, Ana Stojković and Gyorgy Szaszak, in: Lecture Notes in Artificial Intelligence: 18th International Conference, SPECOM 2016, Budapest, Hungary, pages 199--206, 2016

Dexterous Undersea Interventions with Far Distance Onshore Supervision: the DexROV Project, J. Gancet, P. Weiss, G. Antonelli, M. F. Pfingsthorn, Sylvain Calinon, A. Turetta, C. Walen, D. Urbina, S. Govindaraj, P. Letier, X. Martinez, J. Salini, B. Chemisky, G. Indiveri, G. Casalino, P. Di Lillo, E. Simetti, D. De Palma, A. Birk, A. K. Tanwani, I. Havoutis, A. Caffaz and L. Guilpain, in: IFAC Conference on Control Applications in Marine Systems (CAMS), Trondheim, Norway, pages 414-419, 2016

[DOI]
[URL]

Dites-Moi: Wearable Feedback on Conversational Behavior, Skanda Muralidhar, Jean M R Costa, Laurent Son Nguyen and Daniel Gatica-Perez, in: Proceedings of the 15th International Conference on Mobile and Ubiquitous Multimedia, 2016

Efficient Posterior Exemplar Search Space Hashing Exploiting Class-Specific Sparsity Structures, Afsaneh Asaei, Gil Luyet, Milos Cernak and Hervé Bourlard, in: Interspeech, San Francisco, CA, 2016

Emphasis Recreation for TTS using Intonation Atoms, Pierre-Edouard Honnet and Philip N. Garner, in: 9th ISCA Speech Synthesis Workshop, pages 14--20, 2016

[DOI]

EUMSSI team at the MediaEval Person Discovery Challenge 2016, Nam Le, Sylvain Meignier and Jean-Marc Odobez, in: MediaEval Benchmarking Initiative for Multimedia Evaluation, Hilversum, Netherlands, 2016

Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition, Pranay Dighe, Gil Luyet, Afsaneh Asaei and Hervé Bourlard, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5690-5694, IEEE, 2016

Fast K-Means with Accurate Bounds, James Newling and Francois Fleuret, in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016

Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction, Liane Guillou, Christian Hardmeier, Preslav Nakov, Sara Stymne, Jorg Tiedemann, Yannick Versley, Mauro Cettolo, Bonnie Webber and Andrei Popescu-Belis, in: Proceedings of WMT 2016 (First Conference on Machine Translation), Association for Computational Linguistics, Berlin, Germany, pages 525–542, 2016

[URL]

Heterogeneous Face Recognition using Inter-Session Variability Modelling, Tiago de Freitas Pereira and Sébastien Marcel, in: IEEE Computer Society Workshop on Biometrics, Las Vegas - USA, IEEE, 2016

Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences, Carlos Mastalli, I. Havoutis, Michele Focchi, Claudio Semini and D. G. Caldwell, in: Proceedings of the IEEE International Conference of Robotics and Automation, 2016

HMM-based Non-native Accent Assessment using Posterior Features, Ramya Rasipuram, Milos Cernak and Mathew Magimai-Doss, in: Proceedings of Interspeech, San Francisco, USA, 2016

Human versus Machine Attention in Document Classification: A Dataset with Crowdsourced Annotations, Nikolaos Pappas and Andrei Popescu-Belis, in: Proceedings of the EMNLP 2016 Workshop on Natural Language Processing for Social Media, Austin, USA, 2016

Importance Sampling Tree for Large-scale Empirical Expectation, Olivier Canévet, Cijo Jose and Francois Fleuret, in: Proceedings of the International Conference on Machine Learning (ICML), New-York, 2016

Improving Pronoun Translation by Modeling Coreference Uncertainty, Ngoc-Quang Luong and Andrei Popescu-Belis, in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, 2016

Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery, Marzieh Razavi and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2016

INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, Subhadeep Dey, Srikanth Madikeri and Petr Motlicek, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5580-5584, IEEE, 2016

InnerView: Learning Place Ambiance from Social Media Images, Darshan Santani, Rui Hu and Daniel Gatica-Perez, in: Proceedings of the 24th ACM International Conference on Multimedia, ACM, 2016

[DOI]

Inter-task System Fusion for Speaker Recognition, Marc Ferras, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Hervé Bourlard, in: Proceeedings of the INTERSPEECH, 2016

Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages, Alexandros Lazaridis, Ivan Himawan, Petr Motlicek, Iosif Mporas and Philip N. Garner, in: Proceedings of the International Workshop on Spoken Language Translation, Seattle, WA, USA, 2016

Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet and Philip N. Garner, in: 9th ISCA Speech Synthesis Workshop, 2016

Joint Operation of Voice Biometrics and Presentation Attack Detection, Pavel Korshunov and Sébastien Marcel, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016

[URL]

Large Scale Hard Sample Mining with Monte Carlo Tree Search, Olivier Canévet and Francois Fleuret, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016

Learning assistive teleoperation behaviors from demonstration, I. Havoutis and Sylvain Calinon, in: Proc. IEEE International Symposium on Safety, Security and Rescue Robotics, pages 258-263, 2016

Learning dynamic graffiti strokes with a compliant robot, D. Berio, Sylvain Calinon and F. F. Leymarie, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3981-3986, 2016

[URL]

Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media, Nam Le and Jean-Marc Odobez, in: ACM Multimedia, Amsterdam, ACM, 2016

Learning to Refine Object Segments, Pedro H. O. Pinheiro, Tsung-Yi Lin, Ronan Collobert and Piotr Dollar, in: Computer Vision - ECCV 2016, Amsterdam, pages 75-91, Springer, 2016

[DOI]
[URL]

Long-Term Time-Sensitive Costs for CRF-Based Tracking by Detection, Nam Le, Alexandre Heili and Jean-Marc Odobez, in: 2nd Workshop on Benchmarking Multi-target Tracking: MOTChallenge 2016, Amsterdam, 2016

Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, Gil Luyet, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, in: Interspeech, 2016

Manual and automatic labeling of discourse connectives for machine translation (Keynote paper), Andrei Popescu-Belis, in: TextLink: Structuring Discourse in Multilingual Europe (Handbook of the Second Action Conference), Budapest, Hungary, pages 16-20, 2016

[URL]

Modeling Unvoiced Sounds In Statistical Parametric Speech Synthesis with a Continuous Vocoder, Tamas Gabor Csapo, Geza Nemeth, Milos Cernak and Philip N. Garner, in: Proc. of EUSIPCO, Budapest, Hungary, 2016

Multilingual Visual Sentiment Concept Matching, Nikolaos Pappas, Mercan Topkara, Miriam Redi, Brendan Jou, Tao Chen, Hongyi Liu and Shih-Fu Chang, in: Proceedings of the International Conference on Multimedia Retrieval (ICMR), 2016

Nested Mini-Batch K-Means, James Newling and Francois Fleuret, in: Proceedings of NIPS, 2016

Neural Network-based Word Alignment through Score Aggregation, Joël Legrand, Michael Auli and Ronan Collobert, in: Proceedings of the ACL 1st Conference on Machine Translation, 2016

Online Inference in Bayesian Non-Parametric Mixture Models under Small Variance Asymptotics, Ajay Kumar Tanwani and Sylvain Calinon, in: NIPS workshop on Advances in Approximate Bayesian Inference, Barcelona, Spain, pages 1-5, 2016

[URL]

Online motion synthesis with minimal intervention control and formal safety guarantees, M. Zeestraten, A. Pereira, M. Althoff and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Systems, Man, and Cybernetics, Budapest, Hungary, 2016

Overview of BTAS 2016 Speaker Anti-spoofing Competition, Pavel Korshunov, Sébastien Marcel, Hannah Muckenhirn, A. R. Gonçalves, A. G. Souza Mello, R. P. Velloso Violato, F. O. Simões, M. U. Neto, M. de Assis Angeloni, J. A. Stuchi, H. Dinkel, N. Chen, Y. Qian, D. Paul, G. Saha and Md Sahidullah, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016

[URL]

PAoS Markers: Trajectory Analysis of Selective Phonological Posteriors for Assessment of Progressive Apraxia of Speech, Afsaneh Asaei, Milos Cernak and Marina Laganaro, in: Proceeding on the 7th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), 2016

PhonVoc: A Phonetic and Phonological Vocoding Toolkit, Milos Cernak and Philip N. Garner, in: Interspeech, San Francisco, USA, 2016

Phrase Representations for Multiword Expressions, Joël Legrand and Ronan Collobert, in: Proceedings of the 12th Workshop on Multiword Expressions, 2016

Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: International Conference of the Biometrics Special Interest Group (BIOSIG), 2016

Principled Parallel Mean-Field Inference for Discrete Random Fields, Pierre Baqué, Timur Bagautdinov, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016

Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, Alexandros Lazaridis, Milos Cernak and Philip N. Garner, in: Proceedings of Interspeech, San Francisco, USA, 2016

Pronoun Language Model and Grammatical Heuristics for Aiding Pronoun Prediction, Ngoc-Quang Luong and Andrei Popescu-Belis, in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, ACL, 2016

Scalable Metric Learning via Weighted Approximate Rank Component Analysis, Cijo Jose and Francois Fleuret, in: ECCV 2016, 2016

Sound Pattern Matching for Automatic Prosodic Event Detection, Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner and Hervé Bourlard, in: Interspeech, San Francisco, USA, 2016

Stochastic learning and control in multiple coordinate systems, Sylvain Calinon, in: Intl Workshop on Human-Friendly Robotics, Genoa, Italy, pages 1-5, 2016

Stressful First Impressions in Job Interviews, Ailbhe Finnerty, Skanda Muralidhar, Laurent Son Nguyen, Fabio Pianesi and Daniel Gatica-Perez, in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 325-332, 2016

Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, in: Interspeech, 2016

SYSTEM FUSION AND SPEAKER LINKING FOR LONGITUDINAL DIARIZATION OF TV SHOWS, Marc Ferras, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5495-5499, IEEE, 2016

Temporally Subsampled Detection for Accurate and Efficient Face Tracking and Diarization, Nam Le, Alexandre Heili, Di Wu and Jean-Marc Odobez, in: International Conference on Pattern Recognition, Cancun, Mexico, IEEE, 2016

The Night is Young: Urban Crowdsourcing of Nightlife Patterns, Darshan Santani, Joan-Isaac Biel, Florian Labhart, Jasmine Truong, Sara Landolt, Emmanuel Kuntsche and Daniel Gatica-Perez, in: Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, ACM, 2016

[DOI]

The REPLAY-MOBILE Face Presentation-Attack Database, Artur Costa-Pazo, Sushil Bhattacharjee, Esteban Vazquez-Fernandez and Sébastien Marcel, in: Proceedings of the International Conference on Biometrics Special Interests Group, 2016

The SIWIS database: a multilingual speech database with acted emphasis, Jean-Philippe Goldman, Pierre-Edouard Honnet, Rob Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli and Junichi Yamagishi, in: Proceedings of Interspeech, San Francisco, USA, pages 1532--1535, 2016

[DOI]

Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens, Catharine Oertel, José David Lopes, Yu Yu, Kenneth Alberto Funes Mora, Joakim Gustafson, Alan Black and Jean-Marc Odobez, in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, Tokyo, Japan, pages 21-28, ACM, 2016

[DOI]

Training on the Job: Behavioral Analysis of Job Interviews in Hospitality, Skanda Muralidhar, Laurent Son Nguyen, Denise Frauendorfer, Jean-Marc Odobez, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 84-91, 2016

Transferring Neural Representations for Low-dimensional Indexing of Maya Hieroglyphic Art, Edgar Roman-Rangel, Gulcan Can, Stephane Marchand-Maillet, Rui Hu, Carlos Pallan Gayol, Guido Krempel, Jakub Spotak, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proc. ECCV Workshop on Computer Vision for Art Analysis, Amsterdam, pages 842-855, Springer, 2016

[DOI]
[URL]

Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, in: Proceedings of Interspeech 2016, pages 2199-2203, 2016

Unified Prosody Model based on Atom Decomposition for Emphasis Detection, Branislav Gerazov, Aleksandar Gjoreski, Aleksandar Melov, Pierre-Edouard Honnet, Zoran Ivanovski and Philip N. Garner, in: Proceedings of ETAI, 2016

Unsupervised Interpretable Pattern Discovery in Time Series Using Autoencoders, Kevin Bascol, Remi Emonet, Elisa Fromont and Jean-Marc Odobez, in: IAPR Int. Workshops on Structural and Syntactic Pattern Recognition (SSPR), 2016

Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation, Jeevanthi Liyanapathirana and Andrei Popescu-Belis, in: Proceedings of the 10th Language Resources and Evaluation Conference (LREC), Portoroz, Slovenia, 2016

Variable Duration Movement Encoding with Minimal Intervention Control, M. Zeestraten, Sylvain Calinon and D. G. Caldwell, in: Proc. of the IEEE Intl Conf. on Robotics and Automation (ICRA), pages 497-503, 2016

When Naïve Bayes Nearest Neighbors Meet Convolutional Neural Networks, Ilja Kuzborskij, Fabio M. Carlucci and Barbara Caputo, in: Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, 2016

Wiki-LDA: A Mixed-Method Approach for Effective Interest Mining on Twitter Data, Xiao Pu, Mohamed Amine Chatti, Hendrik Thues and Ulrik Schroeder, in: Proceedings of CSEDU 2016, 2016

A Deeper Look at Dataset Bias, Tatiana Tommasi, Novi Patricia, Barbara Caputo and Tinne Tuytelaars, in: German Conference on Pattern Recognition, Aachen, Germany, pages 504–516, Springer International Publishing, 2015

[DOI]

An Empirical Model of Emphatic Word Detection, Milos Cernak and Pierre-Edouard Honnet, in: Proc. of Interspeech, Dresden, Germany, pages 573-577, ISCA, 2015

An HMM-Based Formalism for Automatic Subword Unit Derivation and Pronunciation Generation, Marzieh Razavi and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech and Signal Processing, pages 4639-4643, IEEE, 2015

[DOI]

An Investigation of Muscle Models for Physiologically Based Intonation Modelling, Branislav Gerazov and Philip N. Garner, in: Proceedings of the 23rd Telecommunications Forum, pages 468--471, 2015

[DOI]

Analysis of CNN-based Speech Recognition System using Raw Speech as Input, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, in: Proceedings of Interspeech, ISCA, Dresden, pages 11-15, ISCA, 2015

Annotators' agreement and spontaneous emotion classification performance, Bogdan Vlasenko and Andreas Wendemuth, in: Proceedings of Interspeech, pages 1546-1550, 2015

Atom Decomposition-based Intonation Modelling, Pierre-Edouard Honnet, Branislav Gerazov and Philip N. Garner, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4744--4748, IEEE, 2015

[DOI]

Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, Ramya Rasipuram, Milos Cernak, Alexandre Nanchen and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2015

Building context-dependent DNN acoustic models using Kullback-Leibler divergence-based state tying, Gábor Gosztolya, Tamás Grósz, László Tóth and David Imseng, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2015

Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition, Ivan Himawan, Petr Motlicek, Sridha Sridharan, David Dean and Dian Tjondronegoro, in: Proceedings of Interspeech, pages 741-745, 2015

COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of ICASSP 2015, pages 4834-4837, 2015

CommuniSense: Crowdsourcing Road Hazards in Nairobi, Darshan Santani, Jidraph Njuguna, Tierra Bills, Aisha W. Bryant, Reginald Bryant, Jonathan Ledgard and Daniel Gatica-Perez, in: Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services, Copenhagen, Denmark, pages 445-456, ACM, 2015

[DOI]
[URL]

Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, in: International Conference on Acoustics, Speech and Signal Procecssing, IEEE, South Brisbane, QLD, pages 4295 - 4299, IEEE, 2015

Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies (Extended Abstract), Björn Schuller, Bogdan Vlasenko, Florian Eyben, Martin Wöllmer, André Stuhlsatz, Andreas Wendemuth and Gerhard Rigoll, in: Proceedings of ACII, Xi'an, pages 470-476, IEEE, 2015

[DOI]

Deciphering the Silent Participant. On the Use of Audio-Visual Cues for the Classification of Listener Categories in Group Discussions, Catharine Oertel, Kenneth Alberto Funes Mora, Joakim Gustafson and Jean-Marc Odobez, in: Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, ACM, Seattle, Washington, USA, pages 107--114, ACM, 2015

[DOI]

DexROV: Dexterous Undersea Inspection and Maintenance in Presence of Communication Latencies, J. Gancet, D. Urbina, P. Letier, M. Ilzokvitz, P. Weiss, F. Gauch, G. Antonelli, G. Indiveri, G. Casalino, A. Birk, M. F. Pfingsthorn, Sylvain Calinon, Ajay Kumar Tanwani, A. Turetta, C. Walen and L. Guilpain, in: IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV), pages 218-223, 2015

Dictionary Learning for Sparse Representation of Neural Network Exemplars in Speech Recognition, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015, pages 1093, 2015

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015

[URL]

Estimation of Divergence-Free 3D Cardiac Blood Flow in a Zebrafish Larva Using Multi-View Microscopy, Kevin G. Chan and Michael Liebling, in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, IEEE, Brooklyn, NY, USA, pages 385-388, 2015

[DOI]

EUMSSI team at the MediaEval Person Discovery Challenge, Nam Le, Di Wu, Sylvain Meignier and Jean-Marc Odobez, in: Working Notes Proceedings of the MediaEval 2015 Workshop, Wurzen, Germany, 2015

[URL]

Exploring Dataset Similarities using PCA-based Feature Selection, Ingo Siegert, Ronald Boeck, Bogdan Vlasenko and Andreas Wendemuth, in: Proceedings of ACII, Xi'an, pages 387-393, IEEE, 2015

[DOI]

Finger vein Liveness Detection Using Motion Magnification, Ramachandra Raghavendra, Manasa Avinas, Christoph Busch and Sébastien Marcel, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-7, 2015

[DOI]

From Image-level to Pixel-level Labeling with Convolutional Networks, Pedro H. O. Pinheiro and Ronan Collobert, in: Computer Vision and Patter Recognition (CVPR), Boston, MA, pages 1713-1721, IEEE, 2015

[DOI]
[URL]

Gender Classification by LUT based boosting of Overlapping Block Patterns, Rakesh Metha, Manuel Günther and Sébastien Marcel, in: Scandinavian Conference on Image Analysis, pages 530-542, Springer International Publishing, 2015

[DOI]
[URL]

Head Nod Detection from a Full 3D Model, Yiqiang Chen, Yu Yu and Jean-Marc Odobez, in: Proceedings of the ICCV 2015, pages 528-536, 2015

I would hire you in a minute: Thin slices of nonverbal behavior in job interviews, Laurent Son Nguyen and Daniel Gatica-Perez, in: Proceedings of the ACM International Conference on Multimodal Interaction (ICMI), pages 51-58, 2015

Integrated Pronunciation Learning for Automatic Speech Recognition Using Probabilistic Lexical Modeling, Ramya Rasipuram, Marzieh Razavi and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech and Signal Processing, South Brisbane, QLD, pages 5176-5180, 2015

[DOI]

Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, Srikanth Madikeri, Ivan Himawan, Petr Motlicek and Marc Ferras, in: Proceedings of Interspeech 2015, pages 3105-3109, 2015

Intensity-Based Point-Spread-Function-Aware Registration for Multi-View Applications in Optical Microscopy, Nikhil Chacko, Kevin G. Chan and Michael Liebling, in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, pages 306-309, IEEE, 2015

[DOI]

International Conference on Mobile and Ubiquitous Multimedia, Gilberto Chávez-Martínez, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: Happy and Agreeable? Multi-Label Classification of Impressions in Social Video, Linz, Austria, pages 109-120, ACM, 2015

[DOI]
[URL]

Joint RNN-Based Greedy Parsing and Word Composition, Joël Legrand and Ronan Collobert, in: Proceedings of ICLR 2015, 2015

KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of ICASSP 2015, pages 4435-4439, 2015

Kullback-Leibler Proximal Variational Inference, Emtiyaz Khan, Pierre Baqué, Francois Fleuret and Pascal Fua, in: Proceedings of the international conference on Neural Information Processing Systems, pages 3402-3410, 2015

Learned Minimal Intervention Control Synthesis based on Hidden Semi-Markov Models, M. Zeestraten, Sylvain Calinon and D. G. Caldwell, in: Proc. of the 8th Intl Workshop on Human-Friendly Robotics, pages 17, 2015

Learning bimanual end-effector poses from demonstrations using task-parameterized dynamical systems, J. Silverio, L. Rozo, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 464-470, 2015

Learning Feature Mapping using Deep Neural Network Bottleneck Features for Distant Large Vocabulary Speech Recognition, Ivan Himawan, Petr Motlicek, David Imseng, Blaise Potard, Namhoon Kim and Jaewon Lee, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4540-4544, 2015

[DOI]

Learning Optimal Controllers in Human-robot Cooperative Transportation Tasks with Position and Force Constraints, L. Rozo, D. Bruno, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 1024-1030, 2015

Learning to Segments Objects Candidates, Pedro H. O. Pinheiro, Ronan Collobert and Piotr Dollar, in: Advances in Neural Information Processing Systems, Montreal, Canada, pages 1990-1998, Curran Associates, Inc., 2015

[URL]

Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, Xiao Pu, Laura Mascarell, Andrei Popescu-Belis, Mark Fishel, Ngoc-Quang Luong and Martin Volk, in: Proceedings of the ACL-IJCNLP 2015 Student Research Workshop, Beijing, China, pages 8-15, 2015

Looking at Cities in Mexico with Crowds, Darshan Santani, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: Proceedings of the 2015 Annual Symposium on Computing for Development, London, United Kingdom, pages 127-135, ACM, 2015

[DOI]
[URL]

Loud and Trendy: Crowdsourcing Impressions of Social Ambiance in Popular Indoor Urban Places, Darshan Santani and Daniel Gatica-Perez, in: Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia, pages 211-220, ACM, 2015

[DOI]
[URL]

N-gram-Based Low-Dimensional Representation for Document Classification, Rémi Lebret and Ronan Collobert, in: International Conference on Learning Representations, 2015

[URL]

Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, Alexandre Hyafil and Milos Cernak, in: Proc. of Interspeech, Dresden, Germany, pages 1191-1195, ISCA, 2015

Novel GCC-PHAT Model in Diffuse Sound Field for Microphone Array Pairwise Distance Based Calibration, Jose Velasco, Mohammad J. Taghizadeh, Afsaneh Asaei, Hervé Bourlard, Carlos J. Martín-Arguedas, Javier Macias-Guarasa and Daniel Pizarro, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2669-2673, 2015

Objective Intelligibility Assessment of Text-to-Speech Systems Through Utterance Verification, Raphael Ullmann, Ramya Rasipuram, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Dresden, Germany, pages 3501-3505, 2015

[URL]

Objective Speech Intelligibility Assessment through Comparison of Phoneme Class Conditional Probability Sequences, Raphael Ullmann, Mathew Magimai-Doss and Hervé Bourlard, in: 40th IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4924-4928, 2015

[DOI]

On Application Of Non-Negative Matrix Factorization for Ad Hoc Microphone Array Calibration from Incomplete Noisy Distances, Afsaneh Asaei, Nasser Mohammadiha, Mohammad J. Taghizadeh, Simon Doclo and Hervé Bourlard, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2694-2698, IEEE, 2015

[DOI]

On Compressibility of Neural Network phonological Features for Low Bit Rate Speech Coding, Afsaneh Asaei, Milos Cernak and Hervé Bourlard, in: Proceeding of Interspeech, pages 418-422, ISCA, 2015

On the Vulnerability of Palm Vein Recognition to Spoofing Attacks, Pedro Tome and Sébastien Marcel, in: The 8th IAPR International Conference on Biometrics (ICB), pages 319 - 325, 2015

[DOI]
[URL]

On the Vulnerability of Speaker Verification to Realistic Voice Spoofing, Serife Kucur Ergunay, Elie Khoury, Alexandros Lazaridis and Sébastien Marcel, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-8, IEEE, 2015

[DOI]
[URL]

Overlapping Speech, Utterance Duration and Affective Content in HHI and HCI - an Comparison, Ingo Siegert, Ronald Boeck, Bogdan Vlasenko, Kerstin Ohnemus and Andreas Wendemuth, in: 6th IEEE Conference on Cognitive Infocommunications, Gyor, pages 83-88, 2015

[DOI]

Palm Vein Database and Experimental Framework for Reproducible Research, Pedro Tome and Sébastien Marcel, in: IEEE International Conference of the Biometrics Special Interest Group, pages 1-7, 2015

[DOI]
[URL]

Periocular Biometrics in Mobile Environment, Tiago de Freitas Pereira and Sébastien Marcel, in: IEEE Seventh International Conference on Biometrics: Theory, Applications and Systems, Arlington, USA, pages 1-7, IEEE, 2015

[DOI]

Personality Trait Classification via Co-Occurrent Multiparty Multimodal Event Discovery, Shogo Okada, Oya Aran and Daniel Gatica-Perez, in: Proceedings of the ACM International Conference on Multimodal Interaction, Seattle, Washington, USA, pages 15-22, ACM, 2015

[DOI]

Phonological Vocoding Using Artificial Neural Networks, Milos Cernak, Blaise Potard and Philip N. Garner, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4844-4848, IEEE, 2015

[DOI]

Phrase-based Image Captioning, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, in: International Conference on Machine Learning (ICML), Lille, France, pages 2085–2094, JMLR, 2015

[URL]

Probability Occupancy Maps for Occluded Depth Images, Timur Bagautdinov, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2829-2837, 2015

Pronoun Translation and Prediction with or without Coreference Links, Ngoc-Quang Luong, Lesly Miculicich and Andrei Popescu-Belis, in: Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), Lisbon, Portugal, pages 94–100, 2015

Pronunciation Lexicon Development for Under-Resourced Languages Using Automatically Derived Subword Units: A Case Study on Scottish Gaelic, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, in: 4th Biennial Workshop on Less-Resourced Languages, 2015

Query Refinement Using Conversational Context: a Method and an Evaluation Resource, Maryam Habibi and Andrei Popescu-Belis, in: Proceedings of NLDB 2015 (20th International Conference on Applications of Natural Language to Information Systems), Passau, Germany, pages 89-102, Springer-Verlag Berlin, 2015

[DOI]

Robot Learning with Task-Parameterized Generative Models, Sylvain Calinon, in: Proc. Intl Symp. on Robotics Research, 2015

Robust Microphone Placement for Source Localization from Noisy Distance Measurements, Mohammad J. Taghizadeh, Saeid Haghighatshoar, Afsaneh Asaei, Philip N. Garner and Hervé Bourlard, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2579-2583, IEEE, 2015

[DOI]

Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-Based Speech Recognition, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015

Sparse Modeling of Posterior Exemplars for Keyword Detection, Dhananjay Ram, Afsaneh Asaei, Pranay Dighe and Hervé Bourlard, in: Proceedings of Interspeech, pages 3690-3694, 2015

The 1st Competition on Counter Measures to Finger Vein Spoofing Attacks, Pedro Tome, Ramachandra Raghavendra, Christoph Busch, Santosh Tirunagari, Norman Poh, B. H. Shekar, Diego Gragnaniello, Carlo Sansone, Luisa Verdoliva and Sébastien Marcel, in: The 8th IAPR International Conference on Biometrics (ICB), pages 513 - 518, 2015

[DOI]
[URL]

Towards utterance-based neural network adaptation in acoustic modeling, Ivan Himawan, Petr Motlicek, Marc Ferras and Srikanth Madikeri, in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015

Transfer Learning through Greedy Subset Selection, Ilja Kuzborskij, Francesco Orabona and Barbara Caputo, in: Image Analysis and Processing - ICIAP 2015, Genoa, Italy, pages 3-14, Springer International Publishing, 2015

[DOI]

Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology, Brendan Jou, Tao Chen, Nikolaos Pappas, Miriam Redi, Mercan Topkara and Shih-Fu Chang, in: Proceedings of the ACM International Conference on Multimedia, pages 159--168, 2015

Weighted Correlation based Atom Decomposition Intonation Modelling, Branislav Gerazov, Pierre-Edouard Honnet, Aleksandar Gjoreski and Philip N. Garner, in: Proceedings of Interspeech, Dresden, Germany, pages 1601--1605, 2015

3D Gaze Tracking and Automatic Gaze Coding from RGB-D Cameras, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: IEEE Conference in Computer Vision and Pattern Recognition, Vision Meets Cognition Workshop, Columbus, Ohio, USA, 2014

A Conditional Random field approach for audio-visual people diarization, Paul Gay, Elie Khoury, Sylvain Meignier, Jean-Marc Odobez and Paul Deleglise, in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 116 - 120, IEEE, 2014

[DOI]

A Skill Transfer Approach for Continuum Robots - Imitation of Octopus Reaching Motion with the STIFF-FLOP Robot, M. S. Malekzadeh, Sylvain Calinon, D. Bruno and D. G. Caldwell, in: In Proc. of the AAAI Symp. on Knowledge, Skill, and Behavior Transfer in Autonomous Robots, Arlington, VA, USA, pages 49-52, 2014

[URL]

A task-parameterized probabilistic model with minimal intervention control, Sylvain Calinon, D. Bruno and D. G. Caldwell, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3339 - 3344, IEEE, 2014

[DOI]

Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, Mohammad J. Taghizadeh, Afsaneh Asaei, Philip N. Garner and Hervé Bourlard, in: Proceeding of 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, Mohammad J. Taghizadeh, Afsaneh Asaei, Philip N. Garner and Hervé Bourlard, in: Proceedings of the 4th Joint Workshop on Hands-free speech communication and Microphone Arrays, Villers-les-Nancy, pages 1 - 5, IEEE, 2014

[DOI]

Artificial neural network features for speaker diarization, Sree Harsha Yella, Andreas Stolcke and Malcolm Slaney, in: IEEE Spoken Language Technology workshop, South Lake Tahoe, USA, 2014

Audio-Visual Gender Recognition in Uncontrolled Environment Using Variability Modeling Techniques, Laurent El Shafey, Elie Khoury and Sébastien Marcel, in: International Joint Conference on Biometrics, Clearwater, Florida, USA, pages 1 - 8, IEEE, 2014

[DOI]
[URL]

Automated Bobbing and Phase Analysis to Measure Walking Entrainment, Adolfo Lopez-Mendez, C. E. I Westling, Remi Emonet, M. Easteal, L. Lavia, H. J. Witchel and Jean-Marc Odobez, in: IEEE International Conference on Image Processing (ICIP), Paris, 2014

Automatic Blinking Detection towards Stress Discovery, Alvaro Marcos-Ramiro, Daniel Pizarro-Perez, Marta Marron-Romera and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 307-310, ACM New York, 2014

[DOI]

Automatic Maya Hieroglyph Retrieval Using Shape and Context Information, Rui Hu, Carlos Pallan, Guido Krempel, Jean-Marc Odobez and Daniel Gatica-Perez, in: ACM MM, pages 4, 2014

[URL]

Automatic Speech Recognition and Translation of a Swiss German Dialect: Walliserdeutsch, Philip N. Garner, David Imseng and Thomas Meyer, in: Proceedings of Interspeech, 2014

Capturing Upper Body Motion in Conversation: an Appearance Quasi-Invariant Approach, Alvaro Marcos-Ramiro, Daniel Pizarro-Perez, Marta Marron-Romera and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 327-334, ACM New York, 2014

[DOI]

Comparison of Two Methods for Unsupervised Person Identification in TV Shows, Paul Gay, Gregor Dupuy, Jean-Marc Odobez, Sylvain Meignier and Paul Deleglise, in: 12th International Workshop on Content-Based Multimedia Indexing, 2014

Cross-Database Evaluation With an Open Finger Vein Sensor, Matthias Vanoni, Pedro Tome, Laurent El Shafey and Sébastien Marcel, in: IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BioMS), Rome, Italy, pages 30-35, IEEE, 2014

[DOI]

Cross-linguistic annotation of narrativity for English/French verb tense disambiguation, Cristina Grisot and Thomas Meyer, in: 9th Edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014

Detecting and Labeling Speakers on Overlapping Speech using Vector Taylor Series, Pranay Dighe, Marc Ferras and Hervé Bourlard, in: INTERSPEECH, 2014

Detecting speaker roles and topic changes in multiparty conversations using latent topic models, A. Sapru and Hervé Bourlard, in: Proceedings of Interspeech, 2014

Development of Bilingual ASR System for MediaParl Corpus, Petr Motlicek, David Imseng, Milos Cernak and Namhoon Kim, in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014

Dialect Levelling in Finnish: A Universal Speech Attribute Approach, Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Elie Khoury, Tommi Kurki, Tomi Kinnunen and Chin-Hui Lee, in: The 15th Annual Conference of the International Speech Communication Association, 2014

Diarizing Large Corpora using Multi-modal Speaker Linking, Marc Ferras, Stefano Masneri, Oliver Schreer and Hervé Bourlard, in: INTERSPEECH 2014, 2014

Dynamic Programming Boosting for Discriminative Macro-Action Discovery, Leonidas Lefakis and Francois Fleuret, in: International Conference on Machine Learning, 2014

Effect of nonverbal behavioral patterns on the performance of small groups, Umut Avci and Oya Aran, in: ICMI Workshop on Understanding and Modeling Multiparty Multimodal Interactions, Istanbul, Turkey, 2014

Efficient Sample Mining for Object Detection, Olivier Canévet and Francois Fleuret, in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014

Enforcing Topic Diversity in a Document Recommender for Conversations, Maryam Habibi and Andrei Popescu-Belis, in: Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics), Dublin, Ireland, pages 746-759, IEEE, 2014

English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling, Sharid Loaiciga, Thomas Meyer and Andrei Popescu-Belis, in: The Ninth Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014

Explaining the Stars: Weighted Multiple-Instance Learning for Aspect-Based Sentiment Analysis, Nikolaos Pappas and Andrei Popescu-Belis, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 2014

Exploiting Scene Cues for Dropped Object Detection, Adolfo Lopez-Mendez, Florent Monay and Jean-Marc Odobez, in: 9th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications., 2014

Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, David Imseng, Blaise Potard, Petr Motlicek, Alexandre Nanchen and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322 - 2326, IEEE, 2014

[DOI]

EYEDIAP: A Database for the Development and Evaluation of Gaze Estimation Algorithms from RGB and RGB-D Cameras, Kenneth Alberto Funes Mora, Florent Monay and Jean-Marc Odobez, in: Proceedings of the ACM Symposium on Eye Tracking Research and Applications, Safety Harbor, Florida, United States of America, ACM, 2014

[DOI]

Face identification from overlaid texts using Local Face Recurrent Patterns and CRF models, Paul Gay, Elie Khoury, Sylvain Meignier, Jean-Marc Odobez and Paul Deleglise, in: IEEE International Conference on Image Processing 2014, Paris, IEEE, 2014

Feature Switching in the i-vector Framework for Speaker Verification, Asha T, Saranya M S, Karthik Pandia D S, Srikanth Madikeri and Hema A Murthy, in: Proc. of Interspeech 2014, pages 5, 2014

Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: IEEE Computer Vision and Pattern Recognition Conference, Columbus, Ohio,USA, pages 1773-1780, IEEE, 2014

[DOI]

Hierarchical speaker clustering methods for the NIST i-vector Challenge, Elie Khoury, Laurent El Shafey, Marc Ferras and Sébastien Marcel, in: Odyssey: The Speaker and Language Recognition Workshop, 2014

How Do You Like Your Virtual Agent?: Human-Agent Interaction Experience through Nonverbal Features and Personality Traits, Aleksandra Cerekovic, Oya Aran and Daniel Gatica-Perez, in: Human Behavior Understanding, pages 1-15, Springer, 2014

Importance of Prosody in Swiss French Accent for Speech Synthesis, Pierre-Edouard Honnet and Philip N. Garner, in: Nouveaux cahiers de linguistique francaise, 2014

Improving Head and Body Pose Estimation through Semi-supervised Manifold Alignment, Alexandre Heili, Jagannadan Varadarajan, Bernard Ghanem, Narendra Ahuja and Jean-Marc Odobez, in: International Conference on Image Processing, 2014

Improving Speaker Diarization using social role information, A. Sapru, Sree Harsha Yella and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2014

Inferring social relationships in a phone call from a single party's speech, Sree Harsha Yella, Xavier Anguera and Jordi Luque, in: ICASSP, Florence, IT, pages 4843 - 4847, IEEE, 2014

[DOI]

Information Bottleneck based Speaker Diarization of Meetings using Non-speech as Side Information, Sree Harsha Yella and Hervé Bourlard, in: ICASSP, Florence, IT, pages 96 - 100, IEEE, 2014

[DOI]

Introducing I-Vectors for Joint Anti-spoofing and Speaker Verification, Elie Khoury, Tomi Kinnunen, Aleksandr Sizov, Zhizheng Wu and Sébastien Marcel, in: The 15th Annual Conference of the International Speech Communication Association, 2014

Is That a Jaguar? Segmenting Ancient Maya Glyphs via Crowdsourcing, Gulcan Can, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proc. ACM Int. Workshop on Crowdsourcing for Multimedia, Orlando, pages 37-40, ACM New York, 2014

[DOI]

Joint Phoneme Segmentation Inference and Classification using CRFs, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, in: Global Conference on Signal and Information Processing, Atlanta, GA, pages 587 - 591, IEEE, 2014

[DOI]

Jointly Informative Feature Selection, Leonidas Lefakis and Francois Fleuret, in: International Conference on Artificial Intelligence and Statistics, pages 567–575, 2014

Learning adaptive movements from demonstration and self-guided exploration, D. Bruno, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE Intl Conf. on Development and Learning and on Epigenetic Robotics (ICDL-EpiRob), Genoa, Italy, pages 160-165, 2014

Learning Force and Position Constraints in Human-robot Cooperative Transportation, L. Rozo, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE Intl Symposium on Robot and Human Interactive Communication (Ro-Man), Edinburgh, Scotland, UK, pages 619-624, 2014

Learning from demonstrations with partially observable task parameters, T. Alizadeh, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3309 - 3314, IEEE, 2014

[DOI]

Learning to Learn, from Transfer Learning to Domain Adaptation: A Unifying Perspective, Novi Patricia and Barbara Caputo, in: Proceedings of the Computer Vision and Pattern Recognition, Columbus, OH, pages 1442-1449, IEEE, 2014

[DOI]

LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images, Adrian Penate-Sanchez, Francesc Moreno-Noguer, Juan Andrade-Cetto and Francois Fleuret, in: Proceedings of the International Conference on 3D vision, pages 517–524, 2014

Mode of Teaching Based Segmentation and Annotation of Video Lectures, Yogesh Singh Rawat, Chidansh A. Bhatt and Mohan S. Kankanhalli, in: International Workshop on Content-Based Multimedia Indexing, 2014

Model-based Sparse Component Analysis for Reverberant Speech Localization, Afsaneh Asaei, Hervé Bourlard, Mohammad J. Taghizadeh and Volkan Cevher, in: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1439 - 1443, IEEE, 2014

[DOI]

Modeling Overlapping Speech using Vector Taylor Series, Pranay Dighe, Marc Ferras and Hervé Bourlard, in: Odyssey: The Speaker and Language Recognition Workshop, Joensuu, Finland, 2014

Multi-Source Adaptive Learning for Fast Control of Prosthetics Hand, Novi Patricia, Tatiana Tommasi and Barbara Caputo, in: Proceedings of the International Conference on Pattern Recognition, Stockholm, pages 2769 - 2774, 2014

[DOI]

Multi-source Posteriors for Speech Activity Detection on Public Talks, Marc Ferras and Hervé Bourlard, in: INTERSPEECH, 2014

Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation, Ngoc Thang Vu, David Imseng, Daniel Povey, Petr Motlicek, Tanja Schultz and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, pages 7639-7643, IEEE, 2014

[DOI]

Multimodal Reranking of Content-based Recommendations for Hyperlinking Video Snippets, Chidansh A. Bhatt, Nikolaos Pappas, Maryam Habibi and Andrei Popescu-Belis, in: ACM International Conference on Multimedia Retrieval, 2014

Null space redundancy learning for a flexible surgical robot, D. Bruno, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 2443 - 2448, IEEE, 2014

[DOI]

On Modeling Context-Dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech, and Signal Processing, Florence, IT, pages 7659 - 7663, IEEE, 2014

[DOI]

On Recognition of Non-Native Speech Using Probabilistic Lexical Model, Marzieh Razavi and Mathew Magimai-Doss, in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), 2014

On the Vulnerability of Finger Vein Recognition to Spoofing, Pedro Tome, Matthias Vanoni and Sébastien Marcel, in: IEEE International Conference of the Biometrics Special Interest Group (BIOSIG), Darmstadt, Germay, pages 1 - 10, IEEE, 2014

Overview of the ImageCLEF 2014 Domain Adaptation Task, Barbara Caputo and Novi Patricia, in: ImageCLEF 2014: Overview and analysis of the results, 2014

Phoneme Background Model for Information Bottleneck based Speaker Diarization, Sree Harsha Yella, Petr Motlicek and Hervé Bourlard, in: Interspeech 2014, 2014

Phoneme Background Model for Information Bottleneck based Speaker Diarization, Sree Harsha Yella, Petr Motlicek and Hervé Bourlard, in: Interspeech, Singapore, 2014

Posterior-based Sparse Representation for Automatic Speech Recognition, Sara Bahaadini, Afsaneh Asaei, David Imseng and Hervé Bourlard, in: Proceeding of Interspeech, 2014

Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, Pierre-Edouard Honnet, Alexandros Lazaridis, Jean-Philippe Goldman and Philip N. Garner, in: Speech Prosody, 2014

Recurrent Convolutional Neural Networks for Scene Labeling, Pedro H. O. Pinheiro and Ronan Collobert, in: 31st International Conference on Machine Learning (ICML), Beijing, China, pages 82-90, JMLR, 2014

[URL]

Recurrent Greedy Parsing with Neural Networks, Joël Legrand and Ronan Collobert, in: Proceedings of ECML 2014, pages 130-144, Springer Berlin Heidelberg, 2014

[DOI]

Rewards-driven control of robot arm by decoding EEG signals, Ajay Kumar Tanwani, José del R. Millán and Aude Billard, in: Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE, pages 1658-1661, IEEE, 2014

[DOI]
[URL]

ROCKIT: Roadmap for Conversational Interaction Technologies, Steve Renals, Jean Carletta, K Edwards, Hervé Bourlard, Philip N. Garner, Andrei Popescu-Belis, Dietrich Klakow, A Girenko, Volha Petukhova, P Wacker, A Joscelyne, C Kompis, S Aliwell, W Stevens and Y Sabbah, in: Proceedings of the 2014 Workshop on Roadmapping the Future of Multimodal Interaction Research including Business Opportunities and Challenges (RFMIR '14), pages 39-42, ACM, 2014

[DOI]

Sample Distillation for Object Detection and Image Classification, Olivier Canévet, Leonidas Lefakis and Francois Fleuret, in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014

Scalable Probabilistic Models: Applied to Face Identification in the Wild, Laurent El Shafey and Sébastien Marcel, in: 8th European Biometrics Research and Industry Awards, European Association for Biometrics, Darmstadt, Germany, 2014

[URL]

Scene Recognition with Naive Bayes Non-linear Learning, Marco Fornoni and Barbara Caputo, in: Proceedings of the 22nd International Conference on Pattern Recognition, Stockholm, pages 3404 - 3409, IEEE, 2014

[DOI]

Skills Learning in Robots by Interaction with Users and Environment, Sylvain Calinon, in: In Proc. of the Intl Conf. on Ubiquitous Robots and Ambient Intelligence (URAI), Kuala Lumpur, Malaysia, pages 161-162, 2014

[URL]

SPEAR: An open source toolbox for speaker recognition based on Bob, Elie Khoury, Laurent El Shafey and Sébastien Marcel, in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 1655 - 1659, 2014

[DOI]
[URL]

Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, Milos Cernak, Alexandros Lazaridis, Philip N. Garner and Petr Motlicek, in: Interspeech, 2014

SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, Alexandros Lazaridis, Pierre-Edouard Honnet and Philip N. Garner, in: Speech Prosody, 2014

SWISS FRENCH REGIONAL ACCENT IDENTIFICATION, Alexandros Lazaridis, Elie Khoury, Jean-Philippe Goldman, Mathieu Avanzi, Sébastien Marcel and Philip N. Garner, in: Odyssey: The Speaker and Language Recognition Workshop, 2014

Syllable-based Regional Swiss French Accent Identification using Prosodic Features, Alexandros Lazaridis, Jean-Philippe Goldman, Mathieu Avanzi and Philip N. Garner, in: Nouveaux cahiers de linguistique francaise, 2014

The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues, Volha Petukhova, Martin Gropp, Dietrich Klakow, Anna Schmidt, Gregor Eigner, Mario Topf, Stefan Srb, Petr Motlicek, Blaise Potard, John Dines, O. Deroo, Ronny Egeler, Uwe Meinz and Steffen Liersch, in: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, European Language Resources Association (ELRA), 2014

[URL]

The MAAYA Project: Multimedia Analysis and Access for Documentation and Decipherment of Maya Epigraphy, Daniel Gatica-Perez, Carlos Pallan Gayol, Stephane Marchand-Maillet, Jean-Marc Odobez, Edgar Roman-Rangel, Guido Krempel and Nikolai Grube, in: Proc. Digital Humanities Conference, Lausanne, 2014

The SP2 SCOPES Project on Speech Prosody, Gyorgy Szaszak, Tamas Gabor Csapo, Philip N. Garner, Branislav Gerazov, Zoran Ivanovski, Geza Nemeth, Balint Toth, Milan Secujski and Vlado Delic, in: DOGS2014 - Digital speech and image processing, 2014

The Workshop on Computational Personality Recognition 2014, Fabio Celli, Bruno Lepri, Joan-Isaac Biel, Giuseppe Riccardi, Daniel Gatica-Perez and Fabio Pianesi, in: Proceedings of the ACM International Conference on Multimedia, 2014

The Young and the City: Crowdsourcing Urban Awareness in a Developing Country, Salvador Ruiz-Correa, Darshan Santani and Daniel Gatica-Perez, in: Proceedings of the First International Conference on IoT in Urban Space, pages 74-79, 2014

[DOI]
[URL]

Tracking Interacting Objects Optimally Using Integer Programming, Xinchao Wang, Engin Turetken, Francois Fleuret and Pascal Fua, in: Proceedings of the European Conference on Computer Vision, pages 17-32, 2014

Translation and Prosody in Swiss Languages, Philip N. Garner, Rob Clark, Jean-Philippe Goldman, Pierre-Edouard Honnet, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli and Junichi Yamagishi, in: Nouveaux cahiers de linguistique francaise, 2014

What to Show? Automatic Stream Selection Among Multiple Sensors, Remi Emonet, E. Oberzaucher and Jean-Marc Odobez, in: International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2014

Who Will Get the Grant ? A Multimodal Corpus for the Analysis of Conversational Behaviours in Group Interviews, Catharine Oertel, Kenneth Alberto Funes Mora, Samira Sheikhi, Jean-Marc Odobez and Joakim Gustafson, in: International Conference on Multimodal Interaction, Understanding and Modeling Multiparty, Multimodal Interactions Workshop, Istanbul, Turkey, ACM, 2014

[DOI]

Within- and Cross- Database Evaluations for Gender Classification via BeFIT Protocols, Nesli Erdogmus, Matthias Vanoni and Sébastien Marcel, in: International Workshop on Multimedia Signal Processing, pages 1-6, 2014

[DOI]
[URL]

Word Embeddings through Hellinger PCA, Rémi Lebret and Ronan Collobert, in: 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, Kenneth Alberto Funes Mora, in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013

[DOI]

A Multipath Sparse Beamfroming Method, Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard and Volkan Cevher, in: Signal Processing with Adaptive Sparse Structured Representations SPARS, 2013

A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013

A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, Kenneth Alberto Funes Mora, Laurent Son Nguyen, Daniel Gatica-Perez and Jean-Marc Odobez, in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013

[DOI]

Accelerated Training of Linear Object Detectors, Charles Dubout and Francois Fleuret, in: CVPR 2013 Workshop on Structured Prediction, 2013

[URL]

ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013

[DOI]

Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, Vasil Khalidov, Florence Forbes and Radu Horaud, in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013

An Open-source State-of-the-art Toolbox for Broadcast News Diarization, Mickael Rouvier, Gregor Dupuy, Paul Gay, Elie Khoury, Teva Merlin and Sylvain Meignier, in: INTERSPEECH, 2013

Anti-spoofing in action: joint operation with a verification system, Ivana Chingovska, André Anjos and Sébastien Marcel, in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Workshop on Biometrics, Portland, Oregon, 2013

Are ACT's scores increasing with better translation quality?, Najeh Hajlaoui, in: Are ACT's scores increasing with better translation quality?, pages 6, 2013

Assessing the Accuracy of Discourse Connective Translations: Validation of an Automatic Metric, Najeh Hajlaoui and Andrei Popescu-Belis, in: 14th International Conference on Intelligent Text Processing and Computational Linguistics, University of the Aegean, Samos, Greece, pages 236-247, Springer, 2013

[DOI]

Automatic Social Role Recognition In Professional Meetings Using Conditional Random Fields, A. Sapru and Hervé Bourlard, in: Proceedings of Interspeech, 2013

Automatic Staging of Audio with Emotions, Lakshmi Saheer and Milos Cernak, in: International Conference on Affective Computing and Intelligent Interaction, 2013

Body communicative cue extraction for conversational analysis, Alvaro Marcos-Ramiro, Daniel Pizarro-Perez, Marta Marron-Romera, Laurent Son Nguyen and Daniel Gatica-Perez, in: Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, 2013

Can face anti-spoofing countermeasures work in a real world scenario?, Tiago de Freitas Pereira, André Anjos, José Mario De Martino and Sébastien Marcel, in: International Conference on Biometrics, Madrid, Spain, 2013

[URL]

Combining Content with User Preferences for TED Lecture Recommendation, Nikolaos Pappas and Andrei Popescu-Belis, in: Proceedings of the 11th International Workshop on Content Based Multimedia Indexing, Veszprém, Hungary, IEEE, 2013

Complementary Countermeasures for Detecting Scenic Face Spoofing Attacks, Jukka Komulainen, Abdenour Hadid, Matti Pietikainen, André Anjos and Sébastien Marcel, in: International Conference on Biometrics, Madrid, Spain, 2013

[URL]

Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, Majid Yazdani and Andrei Popescu-Belis, in: International Joint Conference on artificial intelligence, 2013

Context Aware Addressee Estimation for Human Robot Interaction, Samira Sheikhi, Dinesh Babu Jayagopi, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013

Creative Applications of Human Behavior Understanding. HBU 2013: 1-14, Albert Ali Salah, Hayley Hung, Oya Aran and Hatice Gunes, in: Human Behavior Understanding, pages 1-14, 2013

Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013

Deformable Part Models with Individual Part Scaling, Charles Dubout and Francois Fleuret, in: British Machine Vision Conference, 2013

Detecting Narrativity to Improve English to French Translation of Simple Past Verbs, Thomas Meyer, Cristina Grisot and Andrei Popescu-Belis, in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 33-42, 2013

Discovering Temporal Patterns in Water Quality Time Series, Focusing on Floods with the LDA method, Alice Aubert, Romain Tavenard, Simon Malinowski, Thomas Guyet, René Quiniou, Jean-Marc Odobez, Remi Emonet and Chantal Gascuel, in: European Geosciences Union, 2013

Distinguishing the Popularity Between Topics: A System for Up-to-date Opinion Retrieval and Mining in the Web, Nikolaos Pappas, Georgios Katsimpras and Efstathios Stamatatos, in: Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics, LNCS, Samos, Greece, ACM, 2013

[URL]

Diverse Keyword Extraction from Conversations, Maryam Habibi and Andrei Popescu-Belis, in: Proceedings of the ACL 2013 (51th Annual Meeting of the Association for Computational Linguistics ), Short Papers, Sofia, Bulgaria, pages 651-657, ACL, 2013

Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2013

Euclidean Distance Matrix Completion for Ad-hoc Microphone Array Calibration, Mohammad J. Taghizadeh, Reza Parhizkar, Philip N. Garner and Hervé Bourlard, in: Proceedings IEEE International Conference On Digital Signal Processing, 2013

Evaluating Intra- and Crosslingual Adaptation for Non-native Speech Recognition in a Bilingual Environment, Gyorgy Szaszak and Philip N. Garner, in: Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications, IEEE, Budapest, Hungary, pages 357-361, 2013

Evaluating Shape Descriptors for Detection of Maya Hieroglyphs, Edgar Roman-Rangel, Jean-Marc Odobez and Daniel Gatica-Perez, in: in Proc. Mexican Conf. on Pattern Recognition, Queretaro, 2013

Exploiting Accelerometers to Improve Movement Classification for Prosthetics, Arjan Gijsberts and Barbara Caputo, in: International Conference on Rehabilitation Robotics, 2013

Fast Object Detection with Entropy-Driven Evaluation, Raphael Sznitman, Carlos Becker, Francois Fleuret and Pascal Fua, in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013

FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, Petr Motlicek, Daniel Povey and Martin Karafiat, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013

[DOI]

From Foursquare to my Square: Learning Check-in Behavior from Multiple Sources, Eric Malmi, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: The 7th International AAAI Conference on Weblogs and Social Media, 2013

From N to N+1: Multiclass Transfer Incremental Learning, Ilja Kuzborskij, Francesco Orabona and Barbara Caputo, in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013

Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, Elie Khoury, Paul Gay and Jean-Marc Odobez, in: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, Dallas, Texas, USA, pages 97-104, ACM, 2013

Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition, Aniruddha Adiga, Mathew Magimai-Doss and Chandra Sekhar Seelamantula, in: Proceedings of IEEE TENCON, 2013

Given that, Should I Respond? Contextual Addressee Estimation in Multi-Party Human-Robot Interactions, Dinesh Babu Jayagopi and Jean-Marc Odobez, in: Proceedings of Human Robot Interaction (HRI) Conference, 2013

Grapheme and Multilingual Posterior Features for Under-Resourced Speech Recognition: A Study on Scottish Gaelic, Ramya Rasipuram, Peter Bell and Mathew Magimai-Doss, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, Rahim Saedi, Kong Aik Lee, Tomi Kinnunen, Tawfik Hasan, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Jia Min Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilci, Billy Braithwaite, Gonzalez-Hautamäki Rosa, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, Navid Shokouhi, Driss Matrouf, Laurent El Shafey, Pejman Mowlaee, Julien Epps, Tharmarajah Thiruvaran, David Van Leeuwen, Bin Ma, Haizhou Li, John Hansen, Jean-François Bonastre, Sébastien Marcel, John Mason and Eliathamby Ambikairajah, in: INTERSPEECH, Lyon, France, 2013

Idiap at MediaEval 2013: Search and Hyperlinking Task, Chidansh A. Bhatt, Nikolaos Pappas, Maryam Habibi and Andrei Popescu-Belis, in: MediaEval 2013 Workshop, Barcelona, Spain, CEUR-WS.org, 2013

Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition, David Imseng, Petr Motlicek, Philip N. Garner and Hervé Bourlard, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013

Implicitation of Discourse Connectives in (Machine) Translation, Thomas Meyer and Bonnie Webber, in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 19-26, 2013

Improved Overlap Speech Diarization of Meeting Recordings using Long-term Conversational Features, Sree Harsha Yella and Hervé Bourlard, in: ICASSP, 2013

Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2013

Inferring Mood in Ubiquitous Conversational Video, Dairazalia Sanchez-Cortes, Joan-Isaac Biel, Shiro Kumano, Junji Yamato, Kazuhiro Otsuka and Daniel Gatica-Perez, in: 12th International Conference on Mobile and Ubiquitous Multimedia, Luleå, Sweden, ACM Press, 2013

Inferring social activities with mobile sensor networks, Trinh-Minh-Tri Do, Kyriaki Kalimeri, Bruno Lepri, Fabio Pianesi and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013

Investigating the Impact of Language Style and Vocal Expression on Social Roles of Participants in Professional Meetings, A. Sapru and Hervé Bourlard, in: Affective Computing and Intelligent Interaction, Geneva, pages 324-329, IEEE, 2013

[DOI]

Learning to Rank on Network Data, Majid Yazdani, Ronan Collobert and Andrei Popescu-Belis, in: Mining and Learning with Graphs, 2013

Leveraging the robot dialog state for visual focus of attention recognition, Samira Sheikhi, Vasil Khalidov, David Klotz, Britta Wrede and Jean-Marc Odobez, in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013

Machine Translation with Many Manually Labeled Discourse Connectives, Thomas Meyer and Lucie Polakova, in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 43-50, 2013

Manifold Sparse Beamforming, Baran Gözcü, Afsaneh Asaei and Volkan Cevher, in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013

[DOI]

MLP-based Factor Analysis for Tandem Speech Recognition, Marc Ferras and Hervé Bourlard, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2013

Multi-factor Segmentation for Topic Visualization and Recommendation: the MUST-VIS System, Chidansh A. Bhatt, Andrei Popescu-Belis, Maryam Habibi, Sandy Ingram, Stefano Masneri, Fergus McInnes, Nikolaos Pappas and Oliver Schreer, in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 365-368, ACM, 2013

[DOI]
[URL]

Multiclass Latent Locally Linear Support Vector Machines, Marco Fornoni, Barbara Caputo and Francesco Orabona, in: JMLR W&CP, Volume 29: Asian Conference on Machine Learning, Canberra, Australia, pages 229-244, 2013

[URL]

Multimodal Analysis of Body Communication Cues in Employment Interviews, Laurent Son Nguyen, Alvaro Marcos-Ramiro, Marta Marron-Romera and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction Proceedings, 2013

Noise Intrusiveness Factors in Speech Telecommunications, Raphael Ullmann, Hervé Bourlard, Jens Berger and Anna Llagostera Casanovas, in: Proceedings of the AIA-DAGA 2013 International Conference on Acoustics, Merano, Italy, pages 436-439, 2013

On the (Un)importance of the Contextual Factors In HMM-Based Speech Synthesis, Milos Cernak, Petr Motlicek and Philip N. Garner, in: Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, pages 8140 - 8143, 2013

On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, Elie Khoury, Manuel Günther, Laurent El Shafey and Sébastien Marcel, in: Biometric Technologies in Forensic Science, Nijmegen, The Netherlands, 2013

One of a Kind: Inferring Personality Impressions in Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013

Overview of the ImageCLEF 2013 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea, Miguel Cazorla and Barbara Caputo, in: Working Notes, CLEF 2013, 2013

Parameter Estimation and Contextual Adaptation for a Multi-Object Tracking CRF Model, Alexandre Heili and Jean-Marc Odobez, in: IEEE Workshop on Performance Evaluation of Tracking and Surveillance, 2013

Person Independent 3D Gaze Estimation From Remote RGB-D Cameras, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: International Conference on Image Processing, Melbourne, Australia, IEEE, 2013

[DOI]

Probabilistic Lexical Modeling and Unsupervised Training for Zero-Resourced ASR, Ramya Rasipuram, Marzieh Razavi and Mathew Magimai-Doss, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013

Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project, Hervé Bourlard, Marc Ferras, Nikolaos Pappas, Andrei Popescu-Belis, Steve Renals, Fergus McInnes, Peter Bell, Sandy Ingram and Maël Guillemot, in: Workshop on Speech, Language and Audio in Multimedia, 2013

Reservoir Boosting : Between Online and Offline Ensemble Learning, Leonidas Lefakis and Francois Fleuret, in: Proceedings of the international conference on Neural Information Processing Systems, 2013

Revisiting the Generality of the Rank-based Human Mobility Model, Darshan Santani and Daniel Gatica-Perez, in: Proceedings of the 2013 ACM Conference on Pervasive and Ubiquitous Computing Adjunct Publication, Zurich, Switzerland, pages 1209-1218, ACM, 2013

[DOI]
[URL]

Sentiment Analysis of User Comments for One-Class Collaborative Filtering over TED Talks, Nikolaos Pappas and Andrei Popescu-Belis, in: 36th ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, ACM, 2013

Speaker adaptive Kullback-Leibler divergence based hidden Markov models, David Imseng and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

Speaking Swiss: Languages and Venues in Foursquare, Darshan Santani and Daniel Gatica-Perez, in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 501-504, ACM, 2013

[DOI]
[URL]

Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, Nesli Erdogmus and Sébastien Marcel, in: International Conference of the Biometrics Special Interes Group, Darmstadt, Germany, 2013

Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, Nesli Erdogmus and Sébastien Marcel, in: Biometrics: Theory, Applications and Systems, Washington DC, USA, 2013

Stability and Hypothesis Transfer Learning, Ilja Kuzborskij and Francesco Orabona, in: International Conference on Machine Learning, 2013

Structured Sparse Acoustic Modeling for Speech Separation, Afsaneh Asaei, Mohammad Golbabaee, Hervé Bourlard and Volkan Cevher, in: Signal Processing with Adaptive Sparse Structured Representations SPARS, SPARS, 2013

Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, Milos Cernak, Xingyu Na and Philip N. Garner, in: Proc. of Interspeech 2013, Lyon, France, 2013

The 2013 Face Recognition Evaluation in Mobile Environment, Manuel Günther, Artur Costa-Pazo, Changxing Ding, Elhocine Boutellaa, Giovani Chiachia, Honglei Zhang, Marcus de Assis Angeloni, Vitomir Struc, Elie Khoury, Esteban Vazquez-Fernandez, Dacheng Tao, Messaoud Bengherabi, David Cox, Serkan Kiranyaz, Tiago de Freitas Pereira, Jerneja Zganec-Gros, Enrique Argones-Rúa, Nicolas Pinto, Moncef Gabbouj, Flávio Simões, Simon Dobrisek, Daniel González-Jiménez, Anderson Rocha, Mário Uliani Neto, Nikola Pavesic, Alexandre Falcão, Ricardo Violato and Sébastien Marcel, in: The 6th IAPR International Conference on Biometrics, 2013

The 2013 Speaker Recognition Evaluation in Mobile Environment, Elie Khoury, Bostjan Vesnicer, Javier Franco-Pedroso, Ricardo Violato, Zenelabidine Boulkenafet, Luis-Miguel Mazaira Fernandez, Mireia Diez, Justina Kosmala, Houssemeddine Khemiri, Tomas Cipr, Rahim Saedi, Manuel Günther, Jerneja Zganec-Gros, Ruben Zazo Candil, Flávio Simões, Messaoud Bengherabi, Augustin Alvarez Marquina, Mikel Penagarikano, Alberto Abad, Mehdi Boulayemen, Petr Schwarz, David Van Leeuwen, Javier Gonzalez-Domınguez, Mário Uliani Neto, Elhocine Boutellaa, Pedro Gomez Vilda, Amparo Varona, Dijana Petrovska-Delacretaz, Pavel Matejka, Joaquin Gonzalez-Rodrıguez, Tiago de Freitas Pereira, Farid Harizi, Luis Javier Rodriguez-Fuentes, Laurent El Shafey, Marcus Angeloni, German Bordel, Gérard Chollet and Sébastien Marcel, in: The 6th IAPR International Conference on Biometrics, 2013

The 2nd competition on counter measures to 2D face spoofing attacks, Ivana Chingovska, Jinwei Yang, Zhen Lei, Dong Yi, Stan Z.Li, Olga Kähm, Naser Damer, Christian Glaser, Arjan Kuijper, Alexander Nouak, Jukka Komulainen, Tiago de Freitas Pereira, Shubham Gupta, Shubham Bansal, Shubham Khandelwal, Ayush Rai, Tarun Krishna, Dushyant Goyal, Muhammad-Adeel Waris, Honglei Zhang, Iftikhar Ahmad, Serkan Kiranyaz, Moncef Gabbouj, Roberto Tronci, Maurizio Pili, Nicola Sirena, Fabio Roli, Javier Galbally, Julian Fierrez, Allan Pinto, Helio Pedrini, William Robson Schwartz, Anderson Rocha, André Anjos and Sébastien Marcel, in: International Conference of Biometrics 2013, Madrid, Spain, 2013

The vernissage corpus: a conversational human-robot-interaction dataset, Dinesh Babu Jayagopi, Samira Sheikhi, David Klotz, Johannes Wienke, Jean-Marc Odobez, Sebastian Wrede, Vasil Khalidov, Laurent Son Nguyen, Britta Wrede and Daniel Gatica-Perez, in: Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction, 2013

Time-Sensitive Topic Models for Action Recognition in Videos, Romain Tavenard, Remi Emonet and Jean-Marc Odobez, in: IEEE International Conference on Image Processing, 2013

Transfer in Inverse Reinforcement Learning for Multiple Strategies, Ajay Kumar Tanwani and Aude Billard, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan, pages 3244-3250, IEEE, 2013

[DOI]
[URL]

Understanding Factors in Emotion Perception, Lakshmi Saheer and Blaise Potard, in: ISCA Speech Synthesis Workshop, 2013

Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, Gyorgy Szaszak and Andras Beke, in: Proc. of Interspeech 2013, 2013

Who is Persuasive? The Role of Perceived Personality and Communication Modality in Social Multimedia, Gelareh Mohammadi, Sunghyun Park, Kenji Sagae, Alessandro Vinciarelli and Louis-Philippe Morency, in: International Conference on Multimodal Interaction, 2013

A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, Youssef Oualil, Friedrich Faubel and Dietrich Klakow, in: 13th International Workshop on Acoustic Signal Enhancement, pages 233-236, 2012

A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, Youssef Oualil, Friedrich Faubel, Mathew Magimai-Doss and Dietrich Klakow, in: 20th European Signal Processing Conference, 2012

A tree-based distance between distributions: application to classification of neurons, Riwal Lefort and Francois Fleuret, in: ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

An Agent-Based Focused Crawling Framework for Topic- and Genre-Related Web Document Discovery, Nikolaos Pappas, Georgios Katsimpras and Efstathios Stamatatos, in: 24th IEEE International Conference on Tools with Artificial Intelligence, Athens, Greece, IEEE, 2012

[URL]

An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, Manuel Günther, Roy Wallace and Sébastien Marcel, in: Computer Vision - ECCV 2012. Workshops and Demonstrations, Idiap Research Institute, Heidelberg, pages 547-556, Springer Berlin, 2012

[DOI]
[URL]

Annotation and Recognition of Personality Traits in Spoken Conversations from the AMI Meetings Corpus, Fabio Valente, Samuel Kim and Petr Motlicek, in: Proceedings of Interspeech 2012, 2012

Assessing the Impact of Language Style on Emergent Leadership Perception from Ubiquitous Audio, Dairazalia Sanchez-Cortes, Petr Motlicek and Daniel Gatica-Perez, in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012

Automatic detection of conflict escalation in spoken conversations, Samuel Kim, Sree Harsha Yella and Fabio Valente, in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012

Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, Samuel Kim, Fabio Valente and Alessandro Vinciarelli, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012

Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles, A. Sapru and Fabio Valente, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012, 2012

Baseline Multimodal Place Classifier for the 2012 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea and Barbara Caputo, in: Working Notes of the ImageCLEF 2012 Laboratory, 2012

Beyond Dataset Bias: Multi-task Unaligned Shared Knowledge Transfer, Tatiana Tommasi, Novi Quadrianto, Barbara Caputo and Christoph H. Lampert, in: Asian Conference on Computer Vision, 2012

Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, Petr Motlicek, Laurent El Shafey, Roy Wallace, Chris McCool and Sébastien Marcel, in: Proceedings of the 21st International Conference on Pattern Recognition, 2012

Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, Chris McCool, Sébastien Marcel, Abdenour Hadid, Matti Pietikainen, Pavel Matejka, Jan Cernocky, Norman Poh, J. Kittler, Anthony Larcher, Christophe Levy, Driss Matrouf, Jean-François Bonastre, Phil Tresadern and Timothy Cootes, in: IEEE ICME Workshop on Hot Topics in Mobile Multimedia, 2012

Bob: a free signal processing and machine learning toolbox for researchers, André Anjos, Laurent El Shafey, Roy Wallace, Manuel Günther, Chris McCool and Sébastien Marcel, in: Proceedings of the ACM Multimedia Conference, 2012

[URL]

Boosting localized binary features for speech recognition, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: Symposium on Machine Learning in Speech and Language Processing (MLSLP), 2012

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012

Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA, 2012

Building the NinaPro Database: a Resource for the Biorobotics Community, Manfredo Atzori, Arjan Gijsberts, Simone Heynen, Anne-Gabrielle Mittaz Hager, Olivier Deriaz, Patrick van der Smagt, Claudio Castellini, Barbara Caputo and Henning Müller, in: Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2012

Checking In or Checked In: Comparing Large-Scale Manual and Automatic Location Disclosure Patterns, Eric Malmi, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012

Collecting data for socially intelligent surveillance and monitoring approaches: the case of conflict in competitive conversations, Alessandro Vinciarelli, Samuel Kim, Fabio Valente and Hugues Salamin, in: International Symposium on Communications, Control, and Signal Processing, 2012

Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR, Yang Sun, Mathew Magimai-Doss, Jort F. Gemmeke, B. Cranen, Louis ten Bosch and Lou Boves, in: Proceedings of Interspeech, 2012

Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings of Interspeech, Portland, Oregon, 2012

COMBINING CEPSTRAL NORMALIZATION AND COCHLEAR IMPLANT-LIKE SPEECH PROCESSING FOR MICROPHONE ARRAY-BASED SPEECH RECOGNITION, Cong-Thanh Do, Mohammad J. Taghizadeh and Philip N. Garner, in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012

Combining transcription-based and acoustic-based speaker identifications for broadcast news, Elie Khoury, Antoine Laurent, Sylvain Meignier and Simon Petitrenaud, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012

COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH HIERARCHIAL LINEAR TRANSFORMATIONS, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, Afsaneh Asaei, Michael E. Davies, Hervé Bourlard and Volkan Cevher, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Contextual Conditional Models for Smartphone-based Human Mobility Prediction, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: Proceedings of the 14th ACM International Conference on Ubiquitous Computing, 2012

Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Gwénolé Lecorvé and Petr Motlicek, in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012

Crowdsourcing Micro-Level Multimedia Annotations: The Challenges of Evaluation and Interface, Sunghyun Park, Gelareh Mohammadi, Ron Artstein and Louis-Philippe Morency, in: Proceedings of International ACM Workshop on Crowdsourcing for Multimedia, 2012

Detecting and Labeling Folk Literature in Spoken Cultural Heritage Archives using Structural and Prosodic Features, Fabio Valente and Petr Motlicek, in: IEEE Content Based Multimedia Indexing, 2012

DiarTk : An Open Source Toolkit for Research in Multistream Speaker Diarization and its Application to Meetings Recordings, Deepu Vijayasenan and Fabio Valente, in: Proceedings of Interspeech, 2012

Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns, Andrei Popescu-Belis, Thomas Meyer, Jeevanthi Liyanapathirana, Bruno Cartoni and Sandrine Zufferey, in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), pages 5, 2012

Empirical validations of multilingual annotation schemes for discourse relations, Sandrine Zufferey, Liesbeth Degand, Andrei Popescu-Belis and Ted Sanders, in: 8th Joint ACL-ISO Workshop on Interoperable Semantic Annotation, 2012

Exact Acceleration of Linear Object Detectors, Charles Dubout and Francois Fleuret, in: Proceedings of the European Conference on Computer Vision, 2012

Experiences in the Creation of an Electromyography Database to Help Hand Amputated Persons, Manfredo Atzori, Arjan Gijsberts, Simone Heynen, Anne-Gabrielle Mittaz Hager, Claudio Castellini, Barbara Caputo and Henning Müller, in: Proceedings of the 24th European Medical Informatics Conference, 2012

Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies, Bruno Cartoni and Thomas Meyer, in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), Istanbul, TR, pages 6, 2012

Extracting Informative Textual Parts from Web Pages Containing User-Generated Content, Nikolaos Pappas, Georgios Katsimpras and Efstathios Stamatatos, in: 12th International Conference on Knowledge Management and Knowledge Technologies, ACM ICPS, Graz, Austria, pages 4:1--4:8, ACM, 2012

[URL]

Extracting Mobile Behavioral Patterns with the Distant N-Gram Topic Model, Katayoun Farrahi and Daniel Gatica-Perez, in: Proceedings of the IEEE International Symposium on Wearable Computers, Newcastle, 2012

Face Recognition with Disparity Corrected Gabor Phase Differences, Manuel Günther, Dennis Haufe and Rolf P. Würtz, in: Artificial Neural Networks and Machine Learning, Heidelberg, pages 411-418, Springer Berlin, 2012

[DOI]

Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, Laurent El Shafey, Roy Wallace and Sébastien Marcel, in: Proceedings of the 11th International Conference of the Biometrics Special Interest Group, Darmstadt, Germany, pages 397-408, GI-Edition, 2012

FaceTube: predicting personality from facial expressions of emotion in online conversational video, Joan-Isaac Biel, Lucia Teijeiro-Mosquera and Daniel Gatica-Perez, in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2012

From Speech to Personality: Mapping Voice Quality and Intonation into Personality Differences, Gelareh Mohammadi, Antonio Origlia, Maurizio Pili and Alessandro Vinciarelli, in: in Proceedings of ACM Multimedia 2012, 2012

Gaze Estimation From Multimodal Kinect Data, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, Providence, RI, USA, 2012

[DOI]

Generating Exact Lattices in The WFST Framework, Daniel Povey, Mirko Hannemann, Gilles Boulianne, Lukas Burget, Arnab Ghoshal, Milos Janda, Martin Karafiat, Stefan Kombrink, Petr Motlicek, Yanmin Qian, Korbinian Riedhammer, Karel Vesely and Ngoc Thang Vu, in: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing., The 37th International Conference on Acoustics, Speech, and Signal Processing, Kyoto, JP, Kyoto, Japan, pages 4213-4216, IEEE Signal Processing Societ, 2012

[DOI]

Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, in: Actes de la conference conjointe JEP-TALN-RECITAL 2012, Grenoble, France, pages 193-200, ATALA/AFCP, 2012

IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, Petr Motlicek, Fabio Valente and Igor Szoke, in: Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, Japan, pages 4413-4416, 2012

Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, Marco Fornoni and Barbara Caputo, in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012

Investigating the Midline Effect for Visual Focus of Attention Recognition, Samira Sheikhi and Jean-Marc Odobez, in: Int Conf. on Multimodal Interaction (ICMI), Santa Monica, 2012

Iterative Relevance Feedback with Adaptive Exploration/Exploitation Trade-off, Nicolae Suditu and Francois Fleuret, in: Proceedings of the 21st ACM Conference on Information and Knowledge Management, pages 1323-1331, 2012

Joint Detection and Localization of Multiple Speakers using a Probabilistic Interpretation of the Steered Response Power, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, in: Statistical and Perceptual Audition Workshop, 2012

LBP-TOP based countermeasure against face spoofing attacks, Tiago de Freitas Pereira, André Anjos, José Mario De Martino and Sébastien Marcel, in: International Workshop on Computer Vision With Local Binary Pattern Variants - ACCV, pages 12, 2012

Leveraging over prior knowledge for online learning of visual categories, Tatiana Tommasi, Francesco Orabona, Mohsen Kaboli and Barbara Caputo, in: Proceedings of the British Machine Vision Conference, 2012

Linking Speaking and Looking Behavior Patterns with Group Composition, Perception, and Performance, Dinesh Babu Jayagopi, Dairazalia Sanchez-Cortes, Kazuhiro Otsuka, Junji Yamato and Daniel Gatica-Perez, in: Proceedings of the International Conference on Multimodal Interaction (ICMI), Santa Monica, USA, 2012

Machine Translation of Labeled Discourse Connectives, Thomas Meyer, Andrei Popescu-Belis, Najeh Hajlaoui and Andrea Gesmundo, in: Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), pages 10, 2012

Macro-Action Discovery Based on Change Point Detection and Boosting, Leonidas Lefakis and Francois Fleuret, in: International Conference on Machine Learning and Applications, 2012

MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263--268, 2012

Microphone Array Beampattern Characterization for Hands-free Speech Applications, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, in: IEEE 7th Sensor Array and Multichannel Signal Processing Workshop(SAM), Hoboken, NJ, USA, pages 473-476, 2012

Modeling dominance effects on nonverbal behaviors using granger causality, Kyriaki Kalimeri, Bruno Lepri, Oya Aran, Dinesh Babu Jayagopi, Daniel Gatica-Perez and Fabio Pianesi, in: Proceedings of International Conference on Multimodal Interaction, ICMI 2012, Santa Monica, CA, 2012

Multimodal Cue Detection Engine for Orchestrated Entertainment, Danil Korchagin, Stefan Duffner, Petr Motlicek and Carl Scheffler, in: Proceedings International Conference on MultiMedia Modeling, Klagenfurt, Austria, 2012

On Speaker-Independent Personality Perception and Prediction from Speech, Polzehl Tim, Schoenenberg Katrin, Moller Sebastian, Metze Florian, Gelareh Mohammadi and Alessandro Vinciarelli, in: in Proceedings of INTERSPEECH 2012, 2012

On the Challenge of Classifying 52 Hand Movements from Surface Electromyography, Ilja Kuzborskij, Arjan Gijsberts and Barbara Caputo, in: 34th Annual Conference of the IEEE Engineering in Medicine & Biology Society, 2012

On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, Ivana Chingovska, André Anjos and Sébastien Marcel, in: Proceedings of the 11th International Conference of the Biometrics Special Interes Group, 2012

Overview of the ImageCLEF 2012 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea and Barbara Caputo, in: Working Notes of the ImageCLEF 2012 Laboratory, 2012

Predicting the Conflict Level in Television Political Debates: an Approach Based on Crowdsourcing, Nonverbal Communication and Gaussian Processes, Samuel Kim, Maurizio Filippone, Fabio Valente and Alessandro Vinciarelli, in: ACM Multimedia, 2012

Reading Companion: The Technical and Social Design of an Automated Reading Tutor, Arthur Kantor, Milos Cernak, Jiri Havelka, Sean Huber, Jan Kleindienst and Doris B. Gonzalez, in: Workshop on Child, Computer and Interaction, Portland, Oregon, U.S.A., 2012

Recognizing the Visual Focus of Attention for Human Robot Interaction, Samira Sheikhi, Vasil Khalidov and Jean-Marc Odobez, in: IEEE International Conference on Intelligent Robots and Systems (IROS) - Human Behavior Understanding Workshop(IROS-HBU), 2012

Robot-to-group Interaction in a Vernissage: Architecture & Dataset for Multi-party Dialog, David Klotz, Johannes Wienke, Britta Wrede, Sebastian Wrede, Samira Sheikhi, Dinesh Babu Jayagopi, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of 5th International Conference on Cognitive Systems, 2012

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

Socio-Technical Network Analysis from Wearable Interactions, Katayoun Farrahi, Remi Emonet and Alois Ferscha, in: International Symposium on Wearable Computers, 2012

Speaker Diarization and Linking of Large Corpora, Marc Ferras and Hervé Bourlard, in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012

Speaker Diarization of Meetings based on large TDOA feature vectors, Deepu Vijayasenan and Fabio Valente, in: Proceedings of International Conference on Acoustic, Speech and Signal Processing, 2012

Speaker diarization of overlapping speech based on silence distribution in meeting recordings, Sree Harsha Yella and Fabio Valente, in: INTERSPEECH, Portland, Oregon, USA, 2012

StressSense: Detecting Stress in Unconstrained Acoustic Environments using Smartphones, Hong Lu, Mashfiqui Rabbi, Gokul Chittaranjan, Denise Frauendorfer, Marianne Schmid Mast, Andrew T. Campbell, Daniel Gatica-Perez and Tanzeem Choudhury, in: Ubicomp'12, Pittsburgh, 2012

Structured Sparse Coding for Microphone Array Location Calibration, Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard and Volkan Cevher, in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012

Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Weifeng Li and Hervé Bourlard, in: Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech), Portland, Oregon, 2012

Supervised and unsupervised Web-based language model domain adaptation, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012

Synthetic References for Template-based ASR using Posterior Features, Serena Soldo, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, USA, 2012

Template-based ASR using Posterior features and synthetic references: comparing different TTS systems, Serena Soldo, Mathew Magimai-Doss and Hervé Bourlard, in: SAPA-SCALE Conference, International Speech Communication Association, 2012

The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012

The I4U Submission to the 2012 NIST Speaker Recognition Evaluation, Kong Aik Lee, Rahim Saedi, Tawfik Hasan, Tomi Kinnunen, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Tharmarajah Thiruvaran, Changhuai You, Padmanabhan Rajan, David Van Leeuwen, Seyed Omid Sadjadi, Driss Matrouf, Laurent El Shafey, John Mason, Eliathamby Ambikairajah, Hanwu Sun, Anthony Larcher, Bin Ma, Ville Hautamäki, Cemal Hanilci, Billy Braithwaite, Gonzalez-Hautamäki Rosa, Gang Liu, Hynek Boril, Navid Shokouhi, John Hansen, Jean-François Bonastre and Sébastien Marcel, in: NIST Speaker Recognition Conference, 2012

The Idiap Speaker Recognition Evaluation System at NIST SRE 2012, Elie Khoury, Laurent El Shafey and Sébastien Marcel, in: NIST Speaker Recognition Conference, NIST, Orlando, USA, 2012

The INTERSPEECH 2012 Speaker Trait Challenge, Björn Schuller, Stefan Steidl, Anton Batliner, Elmar Nöth, Alessandro Vinciarelli, Felix Burkhardt, Rob Van Son, felix Weninger, Florian Eyben, Tobias Bocklet, Gelareh Mohammadi and Benjamin Weiss, in: in Proceedings of INTERSPEECH, 2012

The Mobile Data Challenge: Big Data for Mobile Computing Research, J. K. Laurila, Daniel Gatica-Perez, I. Aad, Blom J., Olivier Bornet, Trinh-Minh-Tri Do, O. Dousse, J. Eberle and M. Miettinen, in: Pervasive Computing, Newcastle, 2012

Translating English Discourse Connectives into Arabic: a Corpus-based Analysis and an Evaluation Metric, Najeh Hajlaoui and Andrei Popescu-Belis, in: Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), 2012

Unsupervised Activity Analysis and Monitoring algorithms for Effective Surveillance Systems, Jean-Marc Odobez, C. Carincotte, Remi Emonet, E. Jouneau, Sofia Zaidenberg, Bertrand Raverra, Francois Bremond and Andrea Grifoni, in: European Conference on Computer Vision, 2012

Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, Maryam Habibi and Andrei Popescu-Belis, in: RecSys, Recommendation Utility Evaluation (RUE 2012), Dublin, Ireland, pages 15-20, 2012

Using KL-divergence and multilingual information to improve ASR for under-resourced languages, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012

Using Self-Context for Multimodal Detection of Head Nods in Face-to-Face Interactions, Laurent Son Nguyen, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proceedings of the 14th ACM International Conference on Multimodal Interaction, 2012

Using Sense-labeled Discourse Connectives for Statistical Machine Translation, Thomas Meyer and Andrei Popescu-Belis, in: Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra), Avignon, FR, pages 129--138, 2012

Using Sparse Classification Outputs as Feature Observations for Noise Robust ASR, Yang Sun, B. Cranen, Jort F. Gemmeke, Lou Boves, Louis ten Bosch and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2012

We are not Contortionists: Coupled Adaptive Learning for Head and Body Orientation Estimation in Surveillance Video, Cheng Chen and Jean-Marc Odobez, in: IEEE International Conference on Computer Vision and Pattern Recognition, 2012

A Bimodal Sound Source Model for Vehicle Tracking in Traffic Monitoring, Patrick Marmaroli, Jean-Marc Odobez, Xavier Falourd and Hervé Lissek, in: European Signal Processing Conference, 2011

A BSS-based Approach for Localization of Simultaneous Speakers in Reverberant Conditions, Hamid Reza Abutalebi, Hedieh Heli, Danil Korchagin and Hervé Bourlard, in: Proceedings of the 19th European Signal Processing Conference (EUSIPCO), 2011

A Compressive Sensing Based Compressed Neural Network for Sound Source Localization, Mehdi Banitalebi Dehkordi, Hamid Reza Abutalebi and Hossein Ghanei, in: Proceedings of International Symposium on Artificial Intelligence and Signal Processing, 2011

A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation, Thomas Meyer, Andrei Popescu-Belis, Jeevanthi Liyanapathirana and Bruno Cartoni, in: Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?", Hamburg, Germany, pages 5, 2011

A Joint Estimation of Head and Body Orientation Cues in Surveillance Video, Cheng Chen, Alexandre Heili and Jean-Marc Odobez, in: IEEE International Workshop on Socially Intelligent Surveillance and Monitoring, 2011

A Just-in-Time Document Retrieval System for Dialogues or Monologues, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, Portland, OR, pages 350-352, 2011

A Large-Scale Database of Images and Captions for Automatic Face Naming, Mert Ozcan, Jie Luo, Vittorio Ferrari and Barbara Caputo, in: Proceedings of the 22nd British Machine Vision Conference, 2011

A Speech-based Just-in-Time Retrieval System using Semantic Search, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), Portland, OR, pages 80-86, 2011

[URL]

An Audio Visual Corpus for Emergent Leader Analysis, Dairazalia Sanchez-Cortes, Oya Aran and Daniel Gatica-Perez, in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011

An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, Mohammad J. Taghizadeh, Philip N. Garner, Hervé Bourlard, Hamid Reza Abutalebi and Afsaneh Asaei, in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011

Analysis and Comparison of Recent MLP Features for LVCSR Systems, Fabio Valente, Mathew Magimai-Doss and Wen Wang, in: Proceedings of Interspeech 2011, 2011

Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, Danil Korchagin, in: Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, Edinburgh, UK, 2011

Automated Quantification of Morphodynamics for High-Throughput Live Cell Imaging Datasets, German Gonzalez, L. Fusco, Riwal Lefort, F. Benmansour, Pascal Fua and Kevin C. Smith, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Automatic Time Skew Detection and Correction, Danil Korchagin, in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011

Boosting with Maximum Adaptive Sampling, Charles Dubout and Francois Fleuret, in: Proceedings of the Neural Information Processing Systems Conference, 2011

Building 'directional corpora' for unbiased contrastive analysis, Bruno Cartoni and Thomas Meyer, in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 29-30, 2011

Combined Estimation of Location and Body Pose in Surveillance Video, Cheng Chen, Alexandre Heili and Jean-Marc Odobez, in: AVSS, 2011

Competition on Counter Measures to 2-D Facial Spoofing Attacks, Murali Mohan Chakka, André Anjos, Sébastien Marcel, Roberto Tronci, Daniele Muntoni, Gianluca Fadda, Maurizio Pili, Nicola Sirena, Gabriele Murgia, Marco Ristori, Fabio Roli, Junjie Yan, Dong Yi, Zhen Lei, Zhiwei Zhang, Stan Z.Li, William Robson Schwartz, Anderson Rocha, Helio Pedrini, Javier Lorenzo-Navarro, Modesto Castrillón-Santana, Jukka Maatta, Abdenour Hadid and Matti Pietikainen, in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011

Contextual grouping: discovering real-life interaction types from longitudinal Bluetooth data, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: 12th International Conference on Mobile Data Management, 2011

Counter-Measures to Photo Attacks in Face Recognition: a public database and a baseline, André Anjos and Sébastien Marcel, in: International Joint Conference on Biometrics 2011, 2011

[URL]

Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech, Mirjam Wester and Hui Liang, in: Proceedings of Interspeech, Florence, Italy, 2011

Deep Learning for Efficient Discriminative Parsing, Ronan Collobert, in: International Conference on Artificial Intelligence and Statistics, 2011

Detection-Based Multi-Human Tracking Using a CRF Model, Alexandre Heili, Cheng Chen and Jean-Marc Odobez, in: The Eleventh IEEE International Workshop on Visual Surveillance, 2011

Disambiguating discourse connectives using parallel corpora: senses vs. translations, Thomas Meyer, Charlotte Roze, Bruno Cartoni, Laurence Danlos, Sandrine Zufferey and Andrei Popescu-Belis, in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 104-105, 2011

Disambiguating Temporal-Contrastive Discourse Connectives for Machine Translation, Thomas Meyer, in: Proceedings of ACL-HLT 2011 Student Session, Association for Computational Linguistics, Portland, OR, pages 46--51, 2011

Engagement-based Multi-party Dialog with a Humanoid Robot, David Klotz, Johannes Wienke, Julia Peltason, Britta Wrede, Sebastian Wrede, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011

Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, Remi Emonet, in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011

Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, Stefan Duffner and Jean-Marc Odobez, in: IEEE Conference on Automatic Face and Gesture Recognition, pages 525-530, IEEE, 2011

Exploiting observers' judgements for nonverbal group interaction analysis, Gokul Chittaranjan, Oya Aran and Daniel Gatica-Perez, in: IEEE Conference on Automatic Face and Gesture Recognition, pages 6, IEEE, 2011

Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model, Remi Emonet, Jagannadan Varadarajan and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition, 2011

Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, David Imseng, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011

Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011

Finding Audio-Visual Events in Informal Social Gatherings, Xavier Alameda-Pineda, Vasil Khalidov, Radu Horaud and Florence Forbes, in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011

FlowBoost - Appearance Learning from Sparsely Annotated Video, Karim Ali, David Hasler and Francois Fleuret, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2011

Grapheme-based Automatic Speech Recognition using KL-HMM, Mathew Magimai-Doss, Ramya Rasipuram, Guillermo Aradilla and Hervé Bourlard, in: Proceedings of Interspeech, 2011

GroupUs: Smartphone Proximity Data and Human Interaction Type Mining, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: 15th annual International Symposium on Wearable Computers, San Francisco, USA, 2011

HEAT: Iterative Relevance Feedback with One Million Images, Nicolae Suditu and Francois Fleuret, in: Proceedings of the IEEE International Conference on Computer Vision, pages 2118-2125, 2011

Hierarchical Tandem Features for ASR in Mandarin, Joel Pinto, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, 2011

How Comparable are Parallel Corpora? Measuring the Distribution of General Vocabulary and Connectives, Bruno Cartoni, Sandrine Zufferey, Thomas Meyer and Andrei Popescu-Belis, in: Proceedings of 4th Workshop on Building and Using Comparable Corpora, ACL, Portland, OR, pages 78--86, 2011

Humans as Feature Extractors: Combining Prosody and Personality Perception for Better Speaking Style Recognition, Gelareh Mohammadi and Alessandro Vinciarelli, in: Proceeding of IEEE Int Conference on Systems, Man, and Cybernetics - Special Sessions, 2011

Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, Danil Korchagin, in: Proceedings European Signal Processing Conference, Barcelona, Spain, 2011

Improving Articulatory Feature and Phoneme Recognition using Multitask Learning, Ramya Rasipuram and Mathew Magimai-Doss, in: Artificial Neural Networks and Machine Learning - ICANN 2011, pages 299-306, Springer Berlin / Heidelberg, 2011

[DOI]
[URL]

Improving non-native ASR through stochastic multilingual phoneme space transformations, David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai-Doss, in: Proceedings of Interspeech, Florence, Italy, pages 537-540, 2011

Inferring truth from multiple annotators for social interaction analysis, Gokul Chittaranjan, Oya Aran and Daniel Gatica-Perez, in: Neural Information Processing Systems (NIPS) Workshop on Modeling Human Communication Dynamics (HCD), pages 4, 2011

Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings, Sree Harsha Yella and Fabio Valente, in: Interspeech, Florence, Italy, pages 953-956, 2011

Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pages 5192 - 5195, 2011

[DOI]

Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, in: International Joint Conference on Biometrics, 2011

Joint Adaptive Colour Modelling and Skin, Hair and Clothing Segmentation Using Coherent Probabilistic Index Maps, Carl Scheffler and Jean-Marc Odobez, in: British Machine Vision Conference, British Machine Vision Association, Dundee, UK, 2011

Joint Optimization of Hidden Conditional Random Fields and Non Linear Feature Extraction, Antoine Vinel, Trinh-Minh-Tri Do and Thierry Artieres, in: Proceedings of International Conference on Document Analysis and Recognition, 2011

Just-in-Time Multimodal Association and Fusion from Home Entertainment, Danil Korchagin, Petr Motlicek, Stefan Duffner and Hervé Bourlard, in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai-Doss and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prag, CZ, pages 5012-5015, 2011

Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus, Fabio Valente and Alessandro Vinciarelli, in: Proceedings of Interspeech, 2011

Learning Structured Embeddings of Knowledge Bases, Antoine Bordes, Jason Weston, Ronan Collobert and Yoshua Bengio, in: Conference on Artificial Intelligence, 2011

Look at who's talking, M. Cristani, A. Pesarin, Alessandro Vinciarelli, M. Crocco and V. Murino, in: Proceedings of International Conference on Ambient Intelligence, pages 68-76, 2011

LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, in: Interspeech, 2011

Machine learning techniques to analyse complex, computer vision-extracted, dynamic cellular phenotypes, Riwal Lefort, L. Fusco, F. Benmansour, Kevin C. Smith, O. Pertz and Francois Fleuret, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Model-based Compressive Sensing for Multi-party Distant Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Volkan Cevher, in: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

Morphodynamic profiling to explore spatio-temporal signaling networks regulating neurite outgrowth, L. Fusco, Kevin C. Smith, F. Benmansour, Riwal Lefort, Francois Fleuret, Pascal Fua and O. Pertz, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Multi-camera Open Space Human Activity Discovery for Anomaly Detection, Remi Emonet, Jagannadan Varadarajan and Jean-Marc Odobez, in: 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Multi-party Speech Recovery Exploiting Structured Sparsity Models, Afsaneh Asaei, Mohammad J. Taghizadeh, Hervé Bourlard and Volkan Cevher, in: Proceedings of Interspeech, 2011

Multiclass Transfer Learning from Unconstrained Priors, Jie Luo, Tatiana Tommasi and Barbara Caputo, in: Proceedings of the 13th International Conference on Computer Vision, 2011

Multilingual Annotation and Disambiguation of Discourse Connectives for Machine Translation, Thomas Meyer, Andrei Popescu-Belis, Sandrine Zufferey and Bruno Cartoni, in: Proceedings of 12th SIGdial Meeting on Discourse and Dialogue, Association for Computational Linguistics, Portland, OR, pages 194--203, 2011

MULTISTREAM SPEAKER DIARIZATION THROUGH INFORMATION BOTTLENECK SYSTEM OUTPUTS COMBINATION, Deepu Vijayasenan, Fabio Valente and Petr Motlicek, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011

New world, New Worlds: Visual Analysis of Pre-Columbian Pictorial Collections., Daniel Gatica-Perez, Edgar Roman-Rangel, Jean-Marc Odobez and Carlos Pallan, in: Proceedings of the International Workshop on Multimedia for Cultural Heritage, Modena, Italy., Springer CCIS series book, 2011

People-Centric Mobile Sensing with a Pragmatic Twist: from Behavioral Data Points to Active User Involvement, Jan Blom, Daniel Gatica-Perez and N. Kiukkonen, in: International Conference on Human-Computer Interaction with Mobile Devices and Services, 2011

Pervasive Sensing to Model Political Opinions in Face-to-Face Networks, Anmol Madan, Katayoun Farrahi, Daniel Gatica-Perez and Alex Pentland, in: Pervasive, San Francisco, 2011

Phoneme Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011

Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation, Hui Liang and John Dines, in: Proceedings of Interspeech, Florence, Italy, 2011

Posterior Features for Template-based ASR, Serena Soldo, Mathew Magimai-Doss, Joel Praveen Pinto and Hervé Bourlard, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, 2011

Recent Developments in Social Signal Processing, Albert Ali Salah, Maja Pantic and Alessandro Vinciarelli, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 380-385, 2011

Searching the Past: An Improved Shape Descriptor to Retrieve Maya Hieroglyphs., Edgar Roman-Rangel, Carlos Pallan, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proceedings of the ACM International Conference in Multimedia, Scottsdale, USA, ACM, 2011

Smartphone usage in the wild: a large-scale analysis of applications and context, Trinh-Minh-Tri Do, Jan Blom and Daniel Gatica-Perez, in: 13th International Conference on Multimodal Interaction, 2011

Social Focus of Attention as a Time Function Derived from Multimodal Signals, Danil Korchagin and Hamid Reza Abutalebi, in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011

Speaker Diarization of Meetings based on Speaker Role N-gram Models, Fabio Valente, Deepu Vijayasenan and Petr Motlicek, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011

Tasting Families of Features for Image Classification, Charles Dubout and Francois Fleuret, in: International Conference on Computer Vision, 2011

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011

The MASH Project, Francois Fleuret, Philip Abbet, Charles Dubout and Leonidas Lefakis, in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011

The TA2 Database - A Multi-Modal Database from Home Entertainment, Stefan Duffner, Petr Motlicek and Danil Korchagin, in: International Conference on Signal Acquisition and Processing, Singapore, 2011

Torch7: A Matlab-like Environment for Machine Learning, Ronan Collobert, Koray Kavukcuoglu and Clément Farabet, in: BigLearn, NIPS Workshop, 2011

Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, M. Cristani, G. Paggetti, Alessandro Vinciarelli, L. Bazzani, G. Menegaz and V. Murino, in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011

Towards semi-supervised learning of semantic spatial concepts, Jesus Martinez-Gomez and Barbara Caputo, in: IEEE International Conference on Robotics and Automation, 2011

Tracking Multiple Objects under Global Appearance Constraints, Horesh Ben Shitrit, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2011

Transferring Activities: Updating Human Behavior Analysis, Fabian Nater, Tatiana Tommasi, Helmut Grabner, Luc Van Gool and Barbara Caputo, in: Visual Surveillance Workshop at ICCV, 2011

Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, Francesco Orabona and Jie Luo, in: Proceedings of the 28th International Conference on Machine Learning, 2011

Understanding Social Signals in Multi-party Conversations: Automatic Recognition of Socio-Emotional Roles in the AMI Meeting Corpus, Fabio Valente, Alessandro Vinciarelli, Sree Harsha Yella and A. Sapru, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 374-379, 2011

Using a Wikipedia-based Semantic Relatedness Measure for Document Clustering., Majid Yazdani and Andrei Popescu-Belis, in: Graph-based Methods for Natural Language Processing, 2011

Who's Who with Big-Five: Analyzing and Classifying Personality Traits with Smartphones, Gokul Chittaranjan, Jan Blom and Daniel Gatica-Perez, in: International Symposium on Wearable Computing, pages 8, 2011

You Are Known by How You Vlog: Personality Impressions and Nonverbal Behavior in YouTube, Joan-Isaac Biel, Oya Aran and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, Barcelona, 2011

A Comparative Study of MLP Front-ends for Mandarin ASR, Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Ravuri Suman and Wang Wen, in: Proceedings of Interspeech, Japan, 2010

A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Hui Liang, John Dines and Lakshmi Saheer, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010

A Multi Cue Discriminative Approach to Semantic Place Classification, Marco Fornoni, Jesus Martinez-Gomez and Barbara Caputo, in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010

A Multimodal Corpus for Studying Dominance in Small Group Conversations, Oya Aran, Hayley Hung and Daniel Gatica-Perez, in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010

A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, Majid Yazdani and Andrei Popescu-Belis, in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010 ), Carnegie Mellon University, Pittsburgh, PA, USA, 2010

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010

Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of Interspeech, 2010

An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and Gerald Friedland, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010

An Alternative Scanning Strategy to Detect Faces, Venkatesh Bala Subburaman and Sébastien Marcel, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, Hui Liang and John Dines, in: Proceedings of Interspeech, Makuhari, Japan, 2010

Analysis of Phone Posterior Feature Space Exploiting Class Specific Sparsity and MLP-based Similarity Measure, Afsaneh Asaei, Benjamin Picart and Hervé Bourlard, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, Gokul Chittaranjan and Hayley Hung, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

Audioâ€“Visual Synchronisation for Speaker Diarisation, Giulia Garau, Alfred Dielmann and Hervé Bourlard, in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010

Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives, Andrei Popescu-Belis, Jonathan Kilgour, Peter Poller, Alexandre Nanchen, Erik Boertjes and Joost de Wit, in: Proceedings of the 33rd Annual ACM SIGIR Conference, Geneva, Switzerland, pages 703, 2010

[DOI]

Automatic Role Recognition Based on Conversational and Prosodic Behaviour, Hugues Salamin, Khiet Truong, Gelareh Mohammadi and Alessandro Vinciarelli, in: Proceedings of the ACM International Conference on Multimedia, 2010

Automatic Temporal Alignment of AV Data with Confidence Estimation, Danil Korchagin, Philip N. Garner and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010

By their apps you shall understand them: mining large-scale patterns of mobile phone usage, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: The 9th International Conference on Mobile and Ubiquitous Multimedia, 2010

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010

Delineating Trees in Noisy 2D Images and 3D Image Stacks, German Gonzalez, Engin Turetken, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2799–2806, 2010

Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain, Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanianzadeh and Hamid Sheikhzadeh, in: Proceedings of 5th International Symposium on Telecommunications, 2010

Discovering Human Places of Interest from Multimodal Mobile Phone Data, Raul. Montoliu and Daniel Gatica-Perez, in: Proceedings of 9th International Conference on on Mobile and Ubiquitous Multimedia (MUM,',','), Limassol, Cyprus, 2010

English Spoken Term Detection in Multilingual Recordings, Petr Motlicek, Fabio Valente and Philip N. Garner, in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010

Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, in: ICASSP 2010, 2010

Extracting Motifs from Time Series Generated by Concurrent Activities., Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS workshop on Learning and Planning from Batch Time Series Data, 2010

Fast Bounding Box Estimation based Face Detection, Venkatesh Bala Subburaman and Sébastien Marcel, in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010

[URL]

Floor Holder Detection and End of Speaker Turn Prediction in Meetings, Alfred Dielmann, Giulia Garau and Hervé Bourlard, in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010

Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, Oya Aran and Daniel Gatica-Perez, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010, Istanbul, Turkey, 2010

Hands Free Audio Analysis from Home Entertainment, Danil Korchagin, Philip N. Garner and Petr Motlicek, in: Proceedings of Interspeech, Makuhari, Japan, 2010

Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010

Identifying Emergent Leadership in Small Groups using Nonverbal Communicative Cues, Dairazalia Sanchez-Cortes, Oya Aran, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, Beijing, ACM New York, NY, USA ©2010, 2010

[DOI]

Implementation of VTLN for Statistical Speech Synthesis, Lakshmi Saheer, John Dines, Philip N. Garner and Hui Liang, in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010

Improving Speech Processing trough Social Signals: Automatic Speaker Segmentation of Political Debates using Role based Turn-Taking Patterns., Fabio Valente and Alessandro Vinciarelli, in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010

Introducing Crossmodal Biometrics:Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010

Joint Cascade Optimization Using a Product Of Boosted Classifiers, Leonidas Lefakis and Francois Fleuret, in: Proceedings of the Neural Information Processing Systems Conference, pages 1315–1323, 2010

Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, Radu-Andrei Negoescu, Alexander Loui and Daniel Gatica-Perez, in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010

Learning from Candidate Labeling Sets, Jie Luo and Francesco Orabona, in: Advances in Neural Information Processing Systems 23 (NIPS10), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2010

Leveraging speaker diarization for meeting recognition from distant microphones, Andreas Stolcke, Gerald Friedland and David Imseng, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390--4393, 2010

Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, Katayoun Farrahi and Daniel Gatica-Perez, in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010

Mobile Social Signal Processing: vision and research issues, Alessandro Vinciarelli, Roderick Murray-Smith and Hervé Bourlard, in: Proceedings of the International Workshop on Mobile HCI, Lisbon, pages 513-516, 2010

Multistream Speaker Diarization beyond Two Acoustic Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: International Conference on Acoustics, Speech, and Signal Processing, 2010

Neural conditional random fields, Trinh-Minh-Tri Do and Thierry Artieres, in: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna, Sardinia, Italy, JMLR: W&CP, 2010

Object Recognition using Visuo-Affordance Maps, Arjan Gijsberts, Tatiana Tommasi, Giorgio Metta and Barbara Caputo, in: International Conference on Intelligent Robots and Systems, Taipei, pages 1572-1578, IEEE, 2010

[DOI]

OM-2: An Online Multi-class Multi-kernel Learning Algorithm, Jie Luo, Francesco Orabona, Marco Fornoni, Barbara Caputo and Nicolo Cesa-Bianchi, in: In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop, 2010

Online-Batch Strongly Convex Multi Kernel Learning, Francesco Orabona, Jie Luo and Barbara Caputo, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

Personalising speech-to-speech translation in the EMIME project, Mikko Kurimo, William Byrne, John Dines, Philip N. Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu and Junichi Yamagishi, in: Proceedings of the ACL 2010 System Demonstrations, Association for Computational Linguistics, Uppsala, Sweden, 2010

[URL]

Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: BMVC 2010, Aberystwyth University, Aberystwyth, BMVA Press, 2010

Recognizing conversational context in group interaction using privacy-sensitive mobile sensors, Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland and Daniel Gatica-Perez, in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010

Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Proceedings of IEEE Computer Vision and Pattern Recognition Conference, San Francisco, CA, pages 3081-3088, IEEE, 2010

[DOI]

Single Channel Speech Separation with a Frame-based Pitch Range Estimation Method in Modulation Frequency, Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanianzadeh and Hamid Sheikhzadeh, in: Proceedings of 5th International Symposium on Telecommunications, 2010

Social Signal Processing: Understanding Nonverbal Communication in Social Interactions, Alessandro Vinciarelli and Fabio Valente, in: Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands), 2010

Socially Intelligent Surveillance and Monitoring: Analysing Social Dimensions of Physical Space, V. Murino, M. Cristani and Alessandro Vinciarelli, in: Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, San Francisco, pages 51-58, 2010

Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment, Afsaneh Asaei, Hervé Bourlard and Philip N. Garner, in: Proceedings of Interspeech, Makuhari, Japan, 2010

Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project, Mirjam Wester, John Dines, Matthew Gibson, Hui Liang, Yi-Jian Wu, Lakshmi Saheer, Simon King, Keiichiro Oura, Philip N. Garner, William Byrne, Yong Guan, Teemu Hirsimäki, Reima Karhila, Mikko Kurimo, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda and Junichi Yamagishi, in: Proceedings of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010

Speech Enhancement using an Improved MMSE Estimator with Laplacian Prior, Mehdi Rashidinejad, Hamid Reza Abutalebi and Ali Akbar Tadaion, in: Proceedings of 5th International Symposium on Telecommunications, 2010

The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites, Andrei Popescu-Belis, Jonathan Kilgour, Alexandre Nanchen and Peter Poller, in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010

The AMIDA 2009 Meeting Transcription System, Thomas Hain, Lukas Burget, John Dines, Philip N. Garner, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiat, Mike Lincoln and Vincent Wan, in: Proceedings of Interspeech, Makuhari, Japan, 2010

The Robot Vision Track at ImageCLEF 2010, Andrzej Pronobis, Marco Fornoni, Henrik I. Christensen and Barbara Caputo, in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010

[URL]

The Voice of Personality: Mapping Nonverbal Vocal Behavior into Trait Attributions, Gelareh Mohammadi, Alessandro Vinciarelli and Marcello Mortillaro, in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010

The Wolf Corpus: Exploring group behaviour in a competitive role-playing game, Hayley Hung and Gokul Chittaranjan, in: ACM Multimedia, 2010

Towards a quantitative measure of rareness, Tatiana Tommasi and Barbara Caputo, in: DIRAC Workshop at the European Conference on Machine Learning, pages 129-136, Springer Berlin Heidelberg, 2010

[DOI]

Towards a standard for dialogue act annotation, Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Fang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, Claudia Soria and Traum. David, in: 7th International Conference on Language Resources and Evaluation, Malta, 2010

[URL]

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai-Doss, in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010

Towards rich mobile phone datasets: Lausanne data collection campaign, N. Kiukkonen, Blom J., O. Dousse, Daniel Gatica-Perez and J. K. Laurila, in: Proc. ACM Int. Conf. on Pervasive Services (ICPS,',','), Berlin., 2010

Tracter: A Lightweight Dataflow Framework, Philip N. Garner and John Dines, in: Proceedings of Interspeech, Makuhari, Japan, 2010

Using Audio and Visual Cues for Speaker Diarisation Initialisation, Giulia Garau and Hervé Bourlard, in: International Conference on Acoustics, Speech and Signal Processing, 2010

VARIATIONAL BAYESIAN SPEAKER DIARIZATION OF MEETING RECORDINGS, Fabio Valente, Petr Motlicek and Deepu Vijayasenan, in: Proceedings of ICASSP, 2010

View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, Stéphanie Lefèvre and Jean-Marc Odobez, in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010

Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010

Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010

Voices of Vlogging, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010

VTLN Adaptation for Statistical Speech Synthesis, Lakshmi Saheer, Philip N. Garner, John Dines and Hui Liang, in: Proceedings of ICASSP, Dallas, Texas, 2010

A Multimedia Retrieval System Using Speech Input, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Erik Boertjes, Jean Carletta, Sandro Castronovo, Michal Fapso, Alexandre Nanchen, Theresa Wilson, Joost de Wit and Majid Yazdani, in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009

A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems, Francesco Orabona, Barbara Caputo, Antje Fillbrandt and Frank Ohl, in: International Conference on Developmental Learning, 2009

An online framework for learning novel concepts over multiple cues, Jie Luo, Francesco Orabona and Barbara Caputo, in: Proceeding of The 9th Asian Conference on Computer Vision, Xi'an, China, 2009

APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009

[URL]

Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, ISCA 2009, 2009

Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, Petr Motlicek, in: 10thAnnual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009

Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models, Sarah Favre, Alfred Dielmann and Alessandro Vinciarelli, in: ACM International Conference on Multimedia, 2009

Automatic vs. human question answering over multimedia meeting recordings, Quoc Anh Le and Andrei Popescu-Belis, in: 10th Annual Conference of the International Speech Communication Association, 2009

Bayesian Networks to Combine Intensity and Color Information in Face Recognition, Guillaume Heusch and Sébastien Marcel, in: International Conference on Biometrics, Springer, 2009

Canal9: A database of political debates for analysis of social interactions, Alessandro Vinciarelli, Alfred Dielmann, Sarah Favre and Hugues Salamin, in: Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing), Amsterdam, Netherlands, 2009

[DOI]

Characterising Conversationsal Group Dynamics Using Nonverbal Behaviour, Dinesh Babu Jayagopi, Raducanu Bogdan and Daniel Gatica-Perez, in: Proceedings ICME 2009, 2009

Discovering Group Nonverbal Conversational Patterns with Topics, Dinesh Babu Jayagopi and Daniel Gatica-Perez, in: Proceedings ICMI-MLMI, 2009

Dynamic Partitioned Sampling For Tracking With Discriminative Features, Stefan Duffner, Jean-Marc Odobez and Elisa Ricci, in: Proceedings of the British Maschine Vision Conference, London, 2009

Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009

Evaluation of Probabilistic Occupancy Map People Detection for Surveillance Systems, Jerome Berclaz, Ali Shahrokni, Francois Fleuret, James Ferryman and Pascal Fua, in: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009

Flickr Hypergroups, Radu-Andrei Negoescu, Brett Adams, Dinh Phung, Svetha Venkatesh and Daniel Gatica-Perez, in: Proceedings of the 17th ACM International Conference on Multimedia, 2009

Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, Anindya Roy and Sébastien Marcel, in: British Machine Vision Conference 2009, 2009

Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, Fabio Valente, Mathew Magimai-Doss, Christian Plahl and Ravuri Suman, in: Proceedings of the 10thAnnual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009

Hill-Climbing Attack to an Eigenface-Based Face Verification System, Javier Galbally, Chris McCool, Julian Fierrez, Sébastien Marcel and Javier Ortega-Garcia, in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009

Implicit Human Centered Tagging, Alessandro Vinciarelli, Nicolae Suditu and Maja Pantic, in: Proceedings of IEEE Conference on Multimedia and Expo, 2009

Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, in: Proceedings of Interspeech 2009, 2009

Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, Giulia Garau, Silèye O. Ba, Hervé Bourlard and Jean-Marc Odobez, in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009

Joint Pose Estimator and Feature Learning for Object Detection, Karim Ali, Francois Fleuret, David Hasler and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2009

KL Realignment for Speaker Diarization with Multiple Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: 10th Annual Conference of the International Speech Communication Association, 2009

Learning and Predicting Multimodal Daily Life Patterns from Cell Phones, Katayoun Farrahi and Daniel Gatica-Perez, in: ICMI-MLMI, 2009

Learning Large Margin Likelihood for Realtime Head Pose Tracking, Elisa Ricci and Jean-Marc Odobez, in: IEEE Int. Conference on Image Processing, Cairo, Egypt, IEEE, 2009

Learning Rotational Features for Filament Detection, German Gonzalez, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2009

MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: Audio Engineering Society (AES,',','), 127th Convention, Audio Engineering Society (AES), Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA;, 2009

[URL]

Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, in: Proceedings of Interspeech, Brighton, U.K., 2009

Memoirs of Togetherness from Audio Logs, Danil Korchagin, in: Proceedings International ICST Conference on User Centric Media, Venice, Italy, 2009

MLP Based Hierarchical System for Task Adaptation in ASR, Joel Praveen Pinto, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009

Model adaptation with least-square SVM for adaptive hand prosthetics, Francesco Orabona, Claudio Castellini, Barbara Caputo, Angelo Emanuele Fiorilla and Giulio Sandini, in: IEEE International conference on Robotics and Automation, 2009

MULTI-MODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING COMPRESSED-DOMAIN VIDEO FEATURES, Gerald Friedland, Hayley Hung and Chuohao Yeo, in: International Conference on Audio, Speech and Signal Processing, 2009

MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009

Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of International conference on acoustics speech and signal processing, 2009

Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, Weifeng Li, John Dines, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009

Out-of-Scene AV Data Detection, Danil Korchagin, in: Proceedings IADIS International Conference Applied Computing, Rome, Italy, 2009

Overview of the CLEF 2009 medical image annotation track, Tatiana Tommasi, Barbara Caputo, Petra Welter, Mark O. Güld and Thomas M Deserno, in: Workshop of the Cross-Language Evaluation Forum, Corfu, Greece, pages 85-93, Springer Berlin Heidelberg, 2009

[DOI]

Parts-Based Face Verification using Local Frequency Bands, Chris McCool and Sébastien Marcel, in: in Proceedings of IEEE/IAPR International Conference on Biometrics, 2009

Posterior features applied to speech recognition tasks with user-defined vocabulary, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009

Predicting Remote Versus Collocated Group Interactions using Nonverbal Cues, Dairazalia Sanchez-Cortes, Dinesh Babu Jayagopi and Daniel Gatica-Perez, in: Proc. Int. Conf. on Multimodal Interfaces, Workshop on Multimodal Sensor-Based Systems and Mobile Phones for Social Computing,, Cambridge, 2009

[DOI]

Real-Time ASR from Meetings, Philip N. Garner, John Dines, Thomas Hain, Asmaa El Hannani, Martin Karafiat, Danil Korchagin, Mike Lincoln, Vincent Wan and Le Zhang, in: Proceedings of Interspeech, Brighton, UK., 2009

Retrieving Ancient Maya Glyphs with Shape Context, Edgar Roman-Rangel, Carlos Pallan, Jean-Marc Odobez and Daniel Gatica-Perez, in: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, Kyoto, Japan, IEEE, 2009

Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks, Martin Wöllmer, Florian Eyben, Joseph Keshet, Alex Graves, Björn Schuller and Gerhard Rigoll, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, 2009

Robust Speaker Diarization for Short Speech Recordings, David Imseng and Gerald Friedland, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, pages 432-437, 2009

Robustness of Phase based Features for Speaker Recognition, Padmanabhan Rajan, Sree Hari Krishnan Parthasarathi and Hema A Murthy, in: Proceedings of Interspeech, 2009

SNR Features for Automatic Speech Recognition, Philip N. Garner, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009

Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings, Sarah Favre, in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), 2009

Speaker Change Detection with Privacy-Preserving Audio Cues, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez and Hervé Bourlard, in: Proceedings of ICMI-MLMI 2009, 2009

Speech recognition with speech synthesis models by marginalising over decision tree leaves, John Dines, Lakshmi Saheer and Hui Liang, in: Proceedings of Interspeech, Brighton, U.K., 2009

Steerable Features for Statistical 3D Dendrite Detection, German Gonzalez, Francois Aguet, Francois Fleuret, Michael Unser and Pascal Fua, in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention, 2009

Structure and appearance features for robust 3D facial actions tracking, Stéphanie Lefèvre and Jean-Marc Odobez, in: IEEE Proc. Int. Conf. on Multimedia and Expo, IEEE, 2009

The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories, Tatiana Tommasi and Barbara Caputo, in: British Machine Vision Conference, 2009

Topic Models for Scene Analysis and Abnormality Detection, Jagannadan Varadarajan and Jean-Marc Odobez, in: 9th International Workshop in Visual Surveillance, IEEE, Kyoto, Japan, IEEE, 2009

Towards a theoretical framework for learning multi-modal patterns for embodied agents, Nicoletta Noceti, Barbara Caputo, Claudio Castellini, Luca Baldassarre, Annalisa Barla, Lorenzo Rosasco, Francesca Odone and Giulio Sandini, in: International Conference on Image Analysis and Processing, 2009

Visual Activity Context For Focus of Attention Estimation in Dynamic Meetings, Silèye O. Ba, Hayley Hung and Jean-Marc Odobez, in: International Conference on Multimedia & Expo, 2009

Visual Speaker Localization Aided by Acoustic Models, Gerald Friedland, Chuohao Yeo and Hayley Hung, in: ACM Multimedia, 2009

Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Hynek Hermansky and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009

Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of the 17th ACM International Conference on Multimedia, ACM, 2009

Who's Doing What: Joint Modeling of Names and Verbs for Simultaneous Face and Pose Annotation, Jie Luo, Barbara Caputo and Vittorio Ferrari, in: Advances in Neural Information Processing Systems 22 (NIPS09), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2009

YOU ARE FIRED! NONVERBAL ROLE ANALYSIS IN COMPETITIVE MEETINGS, Raducanu Bogdan, Vitria J. and Daniel Gatica-Perez, in: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP,',','), Taiwan., 2009

You live, you learn, you forget: continuous learning of visual places with a forgetting mechanism, Muhammad Muneeb Ullah, Francesco Orabona and Barbara Caputo, in: International Conference on Robotic and Systems, 2009

A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, Xavier Perrin, Ricardo Chavarriaga, Céline Ray, Roland Siegwart and José del R. Millán, in: 3rd ACM/IEEE Conf on Human-Robot Interaction (HRI08), 2008

A Distance Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, in: 25th International Conference on Machine Learning (ICML), 2008

Adaptive Beamforming with a Maximum Negentropy Criterion, Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner and Weifeng Li, in: Proceedings of the Joint Workshop on Hands-free Speech Communication and Microphone Arrays, Italy, 2008

An SVM Confidence-Based Approach to Medical Image Annotation, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Workshop of the Cross-Language Evaluation Forum, 2008

Analyzing Flickr Groups, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: Proc. of the Intl. Conf. on Image and Video Retrieval, ACM, 2008

Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, Laurent Dollé, Mehdi Khamassi, Benoît Girard, Agnès Guillot and Ricardo Chavarriaga, in: Int Conf Spatial Cognition 2008, 2008

Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, Hayley Hung, Yan Huang, Chuohao Yeo and Daniel Gatica-Perez, in: First IEEE Workshop on CVPR for Human Communicative Behavior Analysis, 2008

Asynchronous detection and classification of oscillatory brain activity, Ricardo Chavarriaga, Ferran Galán and José del R. Millán, in: 16 European Signal Processing Conference, 2008

Automated Delineation of Dendritic Networks in Noisy Image Stacks, German Gonzalez, Francois Fleuret and Pascal Fua, in: proceedings of the European Conference on Computer Vision, 2008

Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, in: AES 124th Convention, Audio Engineering Society, 2008

Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree, Daphna Weinshall, Hynek Hermansky, Alon Zweig, Jie Luo, Holly Jimison, Frank Ohl and Misha Pavel, in: Advances in Neural Information Processing Systems 21, 2008

Biologically Motivated Audio-Visual Cue Integration for Object, Joern Anemueller, Joerg-Henrik Back, Barbara Caputo, Jie Luo, Frank Ohl, Francesco Orabona, Rufin Vogels, Daphna Weinshall and Alon Zweig, in: Proceedings of the first Internatinal Conference on Cognitive Systems, 2008

Brain-Computer Interfaces for HCI and Games, A. Nijholt, D. Tan, B. Allison, José del R. Millán, M. Moore and B. Graimann, in: Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, 2008

Calibration from statistical properties of the visual world, Etienne Grossmann, José António Gaspar and Francesco Orabona, in: European Conf. on Computer Vision, 2008

COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008

Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, Joel Praveen Pinto and Hynek Hermansky, in: Proceedings of Interspeech, 2008

Composite Kernel Learning, Marie Szafranski, Yves Grandvalet and Alain Rakotomamonjy, in: Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), Omnipress, 2008

Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, Ferran Galán, Marnix Nuttin, Dirk Vanhooydonck, Eileen Lew, Pierre W. Ferrez, Johan Philips and José del R. Millán, in: In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008

Cue Integration for Medical Image Annotation, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Springer-Verlag, 2008

Daily Routine Classification from Mobile Phone Data, Katayoun Farrahi and Daniel Gatica-Perez, in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), 2008

Detecting queues at vending machines: a statistical layered approach, Xavier Naturel and Jean-Marc Odobez, in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008

Discovering Human Routines from Cell Phone Data with Topic Models, Katayoun Farrahi and Daniel Gatica-Perez, in: IEEE International Symposium on Wearable Computers (ISWC), 2008

Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, G. S. V. S. Sivaram and Hynek Hermansky, in: Proc. 16th European Signal Processing Conference (EUSIPCO), 2008

Exploiting Contextual Information for Improved Phoneme Recognition, Joel Praveen Pinto, Hynek Hermansky, B. Yegnanarayana and Mathew Magimai-Doss, in: "IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)", 2008

Exploiting Contextual Information for Speech/Non-Speech Detection, Sree Hari Krishnan Parthasarathi, Petr Motlicek and Hynek Hermansky, in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008

Fast Approximate Spoken Term Detection from Sequence of Phonemes, Joel Praveen Pinto, Igor Szoke, S. R. Mahadeva Prasanna and Hynek Hermansky, in: Workshop on Searching Spontaneous Conversational Speech at SIGIR, 2008

Fast human detection from videos using covariance features, Jian Yao and Jean-Marc Odobez, in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008

Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip N. Garner and Weifeng Li, in: Proceedings of ICASSP 2008, Las Vegas, USA, 2008

Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, in: Interspeech 2008, 2008

Graphical representation of meetings on mobile devices, Lukas Matena, Alejandro Jaimes and Andrei Popescu-Belis, in: MobileHCI 2008 (10th International Conference on Human-Computer Interaction with Mobile Devices and Services, Demonstrations Session), Amsterdam, 2008

Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, Fabio Valente and Hynek Hermansky, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008

Hierarchical Integration of Phonetic and Lexical Knowledge in Phone Posterior Estimation, Hamed Ketabdar and Hervé Bourlard, in: ICASSP'08, 2008

Hilbert Envelope Based Features for Far-Field Speech Recognition, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, in: MLMI 2008, 2008

Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, in: Interspeech 2008, 2008

Identifying Dominant People in Meetings from Audio-Visual Sensors, Hayley Hung and Daniel Gatica-Perez, in: International Conference on Automatic Face and Gesture Recognition, Amsterdam, The Netherlands, 2008

Improving Contextual Quality Models for MT Evaluation Based on Evaluators' Feedback, Paula Estrella, Andrei Popescu-Belis and Margaret King, in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008

In-Context Phone Posteriors as Complementary Features for Tandem ASR, Hamed Ketabdar and Hervé Bourlard, in: ICSLP'08, 2008

Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Interspeech 2008, 2008

Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, G. S. V. S. Sivaram and Hynek Hermansky, in: Interspeech 2008, 2008

Investigating Automatic Dominance Estimation in Groups From Visual Attention and Speaking Activity, Hayley Hung, Dinesh Babu Jayagopi, Silèye O. Ba, Jean-Marc Odobez and Daniel Gatica-Perez, in: International Conference on Multi-modal Interfaces, 2008

Maximum kurtosis beamforming with the generalized sidelobe canceller, Kenichi Kumatani, John McDonough, Barbara Rauch, Philip N. Garner, Weifeng Li and John Dines, in: Proceedings of INTERSPEECH, September 2008, Brisbane, Australia, 2008

Multi-camera 3d person tracking with particle filter in a surveillance environment, Jian Yao and Jean-Marc Odobez, in: 16th European Signal processing Conference (EUSIPCO), 2008

Multi-camera multi-person 3d space tracking with mcmc in surveillance scenarios, Jian Yao and Jean-Marc Odobez, in: European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2), Marseille, 2008

Multi-Camera Tracking and Atypical Motion Detection with Behavioral Maps, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: proceedings of the European Conference on Computer Vision, 2008

Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008

Object Category Detection using Audio-visual Cues, Jie Luo, Barbara Caputo, Alon Zweig, Joerg-Henrik Back and Joern Anemueller, in: International Conference on Computer Vision Systems (ICVS08), 2008

On the Combination of Auditory and Modulation Frequency Channels for ASR applications, Fabio Valente and Hynek Hermansky, in: Interspeech 2008, 2008

Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky, Harinath Garudadri and Marios Athineos, in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008

Predicting the Dominant Clique in Meetings through Fusion of Nonverbal Cues, Dinesh Babu Jayagopi, Hayley Hung, Chuohao Yeo and Daniel Gatica-Perez, in: ACM MM 2008, 2008

Predicting Two Facets of Social Verticality in Meetings from Five-Minute Time Slices and Nonverbal Cues, Dinesh Babu Jayagopi, Silèye O. Ba, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proceedings - ICMI 2008, 2008

Principled Detection-by-classification from Multiple Views, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: proceedings of the International Conference on Computer Vision Theory and Applications, 2008

Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, Hervé Bourlard and Steve Renals, in: LangTech 2008, 2008

Recognition of Anticipatory Behavior from Human EEG, Gangadhar Garipelli, Ricardo Chavarriaga and José del R. Millán, in: In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008

Reference-based vs. task-based evaluation of human language technology, Andrei Popescu-Belis, in: LREC 2008 ELRA Workshop on Evaluation, ELRA, Marrakech, Morocco, 2008

Reverse Correlation for analyzing MLP Posterior Features in ASR, Joel Praveen Pinto, G. S. V. S. Sivaram and Hynek Hermansky, in: 11th International Conference on Text, Speech, and Dialogue, 2008

Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, Sarah Favre, Hugues Salamin, Alessandro Vinciarelli, Dilek Hakkani Tür and N. P. Garg, in: ACM International Conference on Multimedia, Vancouver, Canada, 2008

Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, Sarah Favre, Hugues Salamin, John Dines and Alessandro Vinciarelli, in: International Conference on Multimodal Interfaces, Chania, Greece, 2008

Silence Models in Weighted Finite-State Transducers, Philip N. Garner, in: Interspeech, 2008

Simultaneous Real-Time Detection of Motor Imagery and Error-Related Potentials for Improved BCI Accuracy, Pierre W. Ferrez and José del R. Millán, in: Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008

Social Signal Processing: State-of-the-Art and Future Perspectives of an Emerging Domain, Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard and Alex Pentland, in: Proceedings of the ACM International Conference on Multimedia, 2008

Social Signals, their Function, and Automatic Analysis: A Survey, Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard and Alex Pentland, in: Proceedings of International Conference on Multimodal Interfaces (to appear), 2008

Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, in: INTERSPEECH 2008, 2008

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, in: EUSIPCO 2008, 2008

Support Vector Machines with a Reject Option, Yves Grandvalet, Alain Rakotomamonjy, Joseph Keshet and Stéphane Canu, in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008

SVM-based Discriminative Accumulation Scheme for Place Recognition, Andrzej Pronobis, Oscar Martinez Monos and Barbara Caputo, in: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA08), 2008

Task-based evaluation of meeting browsers: from BET task elicitation to user behavior analysis, Andrei Popescu-Belis, Mike Flynn, Pierre Wellner and Philippe Baudrion, in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008

Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008

The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings, Andrei Popescu-Belis, Erik Boertjes, Jonathan Kilgour, Peter Poller, Sandro Castronovo, Theresa Wilson, Alejandro Jaimes and Jean Carletta, in: Machine Learning for Multimodal Interaction V, Utrecht, Springer-Verlag, 2008

[DOI]

The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, Joern Anemueller, Joerg-Henrik Back, Barbara Caputo, michal havlena, Jie Luo, Hendrik Kayser, Bastian Leibe, Petr Motlicek, Tomas Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Alon Zweig and Hynek Hermansky, in: Proceedings of the International Conference on Multimodal Interfaces, 2008

The Projectron: a Bounded Kernel-Based Perceptron, Francesco Orabona, Joseph Keshet and Barbara Caputo, in: Int. Conf. on Machine Learning, 2008

Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, Nicolas Scaringella, in: "Int. Conf. on Music Information Retrieval (ISMIR)", 2008

Topickr: Flickr Groups and Users Reloaded, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008

Towards Audio-Visual On-line Diarization Of Participants In Group Meetings, Hayley Hung and Gerald Friedland, in: European Conference on Computer Vision Workshop on Multi-camera and Multi-modal Sensor Fusion, 2008

Towards Robust Place Recognition for Robot Localization, Muhammad Muneeb Ullah, Andrzej Pronobis, Barbara Caputo, Jie Luo, Patric Jensfelt and Henrik I. Christensen, in: IEEE International Conference on Robotics ad Automation, 2008

Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis, C. Carincotte, Xavier Naturel, M. Hick, Jean-Marc Odobez, Jian Yao, A. Bastide and B. Corbucci, in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Bejing, 2008

Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, Silèye O. Ba and Jean-Marc Odobez, in: International Conference on Multi-media & Expo, 2008

What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, Katayoun Farrahi and Daniel Gatica-Perez, in: ACM International Conference on Multimedia (ACMMM), 2008

A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, Jean-Marc Odobez and Silèye O. Ba, in: International Conference on Multi-Media & Expo (ICME07), 2007

A Generative Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, in: NIPS Workshop on Brain, Music and Cognition, 2007

A supervised learning approach based on STDP and polychronization in spiking neuron networks, Hélène Paugam-Moisy, R. Martinez and Samy Bengio, in: European Symposium on Artificial Neural Networks, ESANN, 2007

Adaptive Shared Control of a Brain-Actuated Simulated Wheelchair, Johan Philips, José del R. Millán, G. Vanacker, Eileen Lew, Ferran Galán, Pierre W. Ferrez, H. Van Brussel and Marnix Nuttin, in: Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, 2007

AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Automatic Speech Recognition and Understanding Workshop, 2007

An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007

An Asynchronous and Non-Invasive Brain-Actuated Wheelchair, Ferran Galán, Marnix Nuttin, Eileen Lew, Pierre W. Ferrez, G. Vanacker, Johan Philips, H. Van Brussel and José del R. Millán, in: Proceedings of the 13th International Symposium on Robotics Research, 2007

Augmenting Astronaut's Capabilities through Brain-Machine Interfaces, M. Broschart, Christina de Negueruela, José del R. Millán and C. Menon, in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007

Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, Xavier Perrin, Ricardo Chavarriaga, Roland Siegwart and José del R. Millán, in: 3rd European Conference on Mobile Robots (ECMR 2007), 2007

Biometric Person Authentication IS A Multiple Classifier Problem, Samy Bengio and Johnny Mariéthoz, in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007

Brain-Machine Interfaces through Control of Electroencephalographic Signals and Vibrotactile Feedback, F. Aloise, N. Caporusso, D. Mattia, F. Babiloni, L. Kauhanen, José del R. Millán, Marnix Nuttin, M. G. Marciani and F. Cincotti, in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007

Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, Alessandro Vinciarelli and Sarah Favre, in: ACM International Conference on Multimedia, 2007

CLEF2007 Image Annotation Task: an SVM-based Cue Integration Approach, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Proceedings of ImageCLEF 2007 -LNCS, 2007

Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, Fabio Valente and Hynek Hermansky, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007

Confidence-based Cue Integration for Visual Place Recognition, Andrzej Pronobis and Barbara Caputo, in: IEEE International Conference on Intelligent RObot Systems (IROS), 2007

Detection and Recognition of Number Sequences in Spoken Utterances, Guillermo Aradilla and Jitendra Ajmera, in: 2nd Workshop on Speech in Mobile and Pervasive Environments (SiMPE), 2007

Discriminative Keyword Spotting, Joseph Keshet, David Grangier and Samy Bengio, in: Workshop on Non-Linear Speech Processing, Paris, France, 2007

EEG-Based Brain-Computer Interaction: Improved Accuracy by Automatic Single-Trial Error Detection, Pierre W. Ferrez and José del R. Millán, in: Advances in Neural Information Processing Systems 21, 2007

ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, Hayley Hung, Yan Huang, Gerald Friedland and Daniel Gatica-Perez, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007

Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting, Joel Praveen Pinto, Andrew Lovitt and Hynek Hermansky, 2007

Face Authentication with Salient Local Features and Static Bayesian Network, Guillaume Heusch and Sébastien Marcel, in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007

Feature Extraction for Multi-Class BCI using Canonical Variates Analysis, Ferran Galán, Pierre W. Ferrez, Francesc Oliva, Joan Guàrdia and José del R. Millán, in: Proceedings of the IEEE International Symposium on Intelligent Signal Processing, 2007

Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding, Petr Motlicek, Hynek Hermansky, Sriram Ganapathy and Harinath Garudadri, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007

Hierarchical Neural Networks Feature Extraction for LVCSR system, Fabio Valente, Jithendra Vepa, Christian Plahl, Christian Gollan, Hynek Hermansky and Ralf Schlüter, in: Interspeech 2007, 2007

Hierarchical Penalization, Marie Szafranski, Yves Grandvalet and Pierre Morizet-Mahoudeaux, in: Advances in Neural Information Processing Systems 21, 2007

Incremental Learning for Place Recognition in Dynamic Environments, Jie Luo, Andrzej Pronobis, Barbara Caputo and Patric Jensfelt, in: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS07), 2007

Indoor Place Recognition using Online Independent Support Vector Machines, Francesco Orabona, Claudio Castellini, Barbara Caputo, Jie Luo and Giulio Sandini, in: 18th British Machine Vision Conference (BMVC07), 2007

Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, David Grangier and Samy Bengio, in: International Conference on Speech Communication and Technology (INTERSPEECH), 2007

More Efficiency in Multiple Kernel Learning, Alain Rakotomamonjy, Francis Bach, Stéphane Canu and Yves Grandvalet, in: International Conference on Machine Learning (ICML), 2007

Multi-Layer Background Subtraction Based on Color and Texture, Jian Yao and Jean-Marc Odobez, in: CVPR 2007 Workshop on Visual Surveillance (VS2007), 2007

Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, Fabio Valente, Jithendra Vepa and Hynek Hermansky, in: Interspeech 2007, 2007

Non-Invasive Brain-Actuated Interaction, José del R. Millán, Pierre W. Ferrez, Ferran Galán, Eileen Lew and Ricardo Chavarriaga, in: Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence, 2007

Non-linear Spectral Contrast Stretching for In-car Speech Recognition, Weifeng Li and Hervé Bourlard, in: Interspeech-Eurospeech # to appear in html, 2007

Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes, Petr Motlicek, Hynek Hermansky, Sriram Ganapathy and Harinath Garudadri, in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2007

On Confusions in a Phoneme Recognizer, Andrew Lovitt, Joel Praveen Pinto and Hynek Hermansky, 2007

Posterior-Based Features and Distances in Template Matching for Speech Recognition, Guillermo Aradilla and Hervé Bourlard, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007

Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, Silèye O. Ba and Jean-Marc Odobez, in: Classification of Events, Activities and Relationship Evaluation and Workshop, 2007

Recognition and Understanding of Meetings The AMI and AMIDA Projects, Steve Renals, Thomas Hain and Hervé Bourlard, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'07, 2007

Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, Alessandro Vinciarelli, F. Fernàndez and Sarah Favre, in: IEEE International Conference on Multimedia and Expo (ICME), 2007

Sparse Probabilistic Classifiers, Romain Hérault and Yves Grandvalet, in: International Conference on Machine Learning (ICML), 2007

SVM-based Transfer of Visual Knowledge Across Robotic Platforms, Jie Luo, Andrzej Pronobis and Barbara Caputo, in: International Conference on Computer Vision Systems (ICVS07), 2007

The use of brain-computer interfacing for ambient intelligence, Gangadhar Garipelli, Ferran Galán, Ricardo Chavarriaga, Pierre W. Ferrez, Eileen Lew and José del R. Millán, in: In the book, Constructing Ambient Intelligence: AmI-07 Workshops Proceedings, Max M\:uhlh\:auser, Alois Ferscha, and Erwin Aitenbichler (Eds.,',','), LNCS, Springer Verlag, 2008., 2007

To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, Ricardo Chavarriaga, Pierre W. Ferrez and José del R. Millán, in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, Hari Krishna Maganti, Petr Motlicek and Daniel Gatica-Perez, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007

Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, Hayley Hung, Dinesh Babu Jayagopi, Chuohao Yeo, Gerald Friedland, Silèye O. Ba, Jean-Marc Odobez, Kannan Ramchandran, Nikki Mirghafori and Daniel Gatica-Perez, in: "", 2007

Vibrotactile Feedback in the Context of Mu-Rhythm based BCI, F. Cincotti, L. Kauhanen, F. Aloise, T. Palomäki, N. Caporusso, P. Jylänki, D. Mattia, F. Babiloni, G. Vanacker, Marnix Nuttin, M. G. Marciani and José del R. Millán, in: Proceedings of the 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2007

Visuo-Spatial Attention Frame Recognition for Brain-Computer Interfaces, Ferran Galán, J. Palix, Ricardo Chavarriaga, Pierre W. Ferrez, Eileen Lew, C. -A. Hauert and José del R. Millán, in: Proceedings of the 1st International Conference on Cognitive Neurodynamics, 2007

Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, Petr Motlicek, Vijay Ullal and Hynek Hermansky, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007

2D Multi-Person Tracking: A Comparative Study in AMI Meetings, Kevin C. Smith, Sascha Schreiber, Vítezslav Beran, Igor Potúcek, Gerhard Rigoll and Daniel Gatica-Perez, in: Classification of Events, Activities, and Relationships (CLEAR) 2006, 2006

A Discriminative Approach for the Retrieval of Images from Text Queries, David Grangier, Florent Monay and Samy Bengio, in: European Conference on Machine Learning (ECML), 2006

A Discriminative Approach to Robust Visual Place Recognition, Andrzej Pronobis, Barbara Caputo, Patric Jensfelt and Henrik I. Christensen, in: IEEE International Conference on Intelligent RObot Systems (IROS), 2006

A Max Kernel For Text-Independent Speaker Verification Systems, Johnny Mariéthoz and Samy Bengio, in: Second Workshop on Multimodal User Authentication, MMUA, 2006

A Neural Network to Retrieve Images from Text Queries, David Grangier and Samy Bengio, in: International Conference on Artificial Neural Networks (ICANN), 2006

A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, Silèye O. Ba and Jean-Marc Odobez, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006

Analyzing Group Interactions in Conversations: a Review, Daniel Gatica-Perez, in: IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI), 2006

Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, Sébastien Marcel, Johnny Mariéthoz, Yann Rodriguez and Fabien Cardinaux, in: Workshop on Multimodal User Authentication (MMUA), 2006

Constructing visual models with a latent space approach, Florent Monay, Pedro Quelhas, Daniel Gatica-Perez and Jean-Marc Odobez, in: the Springer series of Lecture Notes in Computer Science, 2006

Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, Francesco Camastra, Marco Spinetti and Alessandro Vinciarelli, in: Proceedings of International Conference on Pattern Recognition (ICPR), 2006

Detecting Abandoned Luggage Items in a Public Space, Kevin C. Smith, Pedro Quelhas and Daniel Gatica-Perez, in: IEEE Performance Evaluation of Tracking and Surveillance Workshop (PETS), 2006

Detection and Application of Influence Rankings in Small Group Meetings, Rutger Rienks, Dong Zhang, Daniel Gatica-Perez and Wilfried Post, in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006

Discriminant linear processing of time-frequency plane, Fabio Valente and Hynek Hermansky, in: International Conference on Spoken Language Processing, 2006

Discriminative Kernel-Based Phoneme Sequence Recognition, Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, Samy Bengio and Dan Chazan, in: The 9th International Conference on Spoken Language Processing (INTERSPEECH), Pittsburgh, PA, 2006

Exploring Contextual Information in a Layered Framework for Group Action Recognition, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006

Face Authentication Using Adapted Local Binary Pattern Histograms, Yann Rodriguez and Sébastien Marcel, in: 9th European Conference on Computer Vision (ECCV), 2006

Finding groups of people in Google news, Dhiraj Joshi and Daniel Gatica-Perez, in: ACM Int. Conf. on Human-Centered Multimedia (HCM), 2006

Hand Posture Classification and Recognition using the Modified Census Transform, Agnès Just, Yann Rodriguez and Sébastien Marcel, in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006

Haptic Feedback Compared with Visual Feedback for BCI, L. Kauhanen, T. Palomäki, P. Jylänki, F. Aloise, Marnix Nuttin and José del R. Millán, in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006

High Frequency Bands and Estimated Local Field Potentials to Improve Single-Trial Classification of Electroencephalographic Signals, Pierre W. Ferrez, Ferran Galán, Anna Buttfield, S. L. González Andino, R. Grave de Peralta and José del R. Millán, in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006

Indexation de Documents Manuscrits, Alessandro Vinciarelli, in: Proceedings du Colloque International Francophone sur l'Ecrit et le Document (CIFED06), 2006

Infinite Models for Speaker Clustering, Fabio Valente, in: International Conference on Spoken Language Processing, 2006

Integrating co-occurrence and spatial contexts on patch-based scene segmentation, Florent Monay, Pedro Quelhas, Jean-Marc Odobez and Daniel Gatica-Perez, in: Beyond Patches Workshop, in conjunction with CVPR, 2006

Investigating Lexical Substitution Scoring for Subtitle Generation, Oren Glickman, Ido Dagan, Mikaela Keller, Samy Bengio and Walter Daelemans, in: Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL)., 2006

Juicer: A Weighted Finite-State Transducer speech decoder, Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng and Thomas Hain, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine LEarning Algorithms MLMI'06, 2006

Kernel Methods for Melanoma Recognition, Elisabetta La Torre, Tatiana Tommasi and Barbara Caputo, in: Medical Informatics in Europe (MIE), 2006

Kernel Methods for Melanoma Recognition, Tatiana Tommasi, Elisabetta La Torre and Barbara Caputo, in: Proceedings of Workshop on Computer Vision Approaches to Medical Image Analysis (CVAMIA) 2006), 2006

Learning to Retrieve Images from Text Queries with a Discriminative Model, David Grangier, Florent Monay and Samy Bengio, in: International Workshop on Adaptive Multimedia Retrieval (AMR), 2006

Local Binary Patterns as an Image Preprocessing for Face Authentication, Guillaume Heusch, Yann Rodriguez and Sébastien Marcel, in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006

Melanoma Recognition Using Representative and Discriminative Kernel Classifiers, Tatiana Tommasi, Elisabetta La Torre and Barbara Caputo, in: International Workshop on Computer Vision Applications for Medical Image Analysis, 2006

Modeling Interactions from Email Communication, Daniel Gatica-Perez, Dong Zhang and Samy Bengio, in: Proc. IEEE International Conference on Multimedia & Expo (ICME,',','), 2006, 2006

Multi-Person Tracking in Meetings: A Comparative Study, Kevin C. Smith, Sascha Schreiber, Vítezslav Beran, Igor Potúcek, Gerhard Rigoll and Daniel Gatica-Perez, in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006

Multi-stream ASR: An Oracle Perspective, Hemant Misra, Jithendra Vepa and Hervé Bourlard, in: Proceedings of ISCA International Conference on Spoken Language Processing (ICSLP), 2006

Natural Scene Image Modeling using Color and Texture Visterms., Pedro Quelhas and Jean-Marc Odobez, in: Conference on Image and Video Retrieval CIVR, 2006

Nearly optimal exploration-exploitation decision thresholds, Christos Dimitrakakis, in: Int. Conf. on Artificial Neural Networks (ICANN), 2006

Non-Invasive Brain Computer Interface for Mental Control of a Simulated Wheelchair, Eileen Lew, Marnix Nuttin, Pierre W. Ferrez, A. Degeest, Anna Buttfield, G. Vanacker and José del R. Millán, in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006

Online Classifier Adaptation in High Frequency EEG, Anna Buttfield, Pierre W. Ferrez and José del R. Millán, in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006

Posterior Based Keyword Spotting with A Priori Thresholds, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, in: International Conference on Spoken Language Processing (ICSLP), 2006

Prospects on Brain-Machine Interfaces for Space System Control, C. Menon, Christina de Negueruela, José del R. Millán, O. Tonet, F. Carpi, M. Broschart, Pierre W. Ferrez, Anna Buttfield, P. Dario, L. Citi, C. Laschi, M. Tombini, F. Sepulveda, R. Poli, R. Palaniappan, F. Tecchio, P. M. Rossini and D. de Rossi, in: Proceedings of the 57th International Astronautical Conference, 2006

Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, Norman Poh, Samy Bengio and Arun Ross, in: Multimodal User Authentication (MMUA), 2006

Sociometry Based Multiparty Audio Recordings Segmentation, Alessandro Vinciarelli, in: Proceedings of the IEEE Conference on Multimedia and Expo (ICME 2006), 2006

Sociometry Based Multiparty Audio Recordings Summarization, Alessandro Vinciarelli, in: Proceedings of International Conference on Pattern Recognition (ICPR 2006), 2006

Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, Hari Krishna Maganti and Daniel Gatica-Perez, in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2006

Speech Coding based on Spectral Dynamics, Petr Motlicek, Hynek Hermansky, Harinath Garudadri and Naveen Srinivasamurthy, in: Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2006

The More you Learn, the Less you Store: Memory-Controlled Incremental SVM, Andrzej Pronobis and Barbara Caputo, in: Proceedings of International Cognitive Vision Workshop (ICVW) 2006), 2006

The segmentation of multi-channel meeting recordings for automatic speech recognition, John Dines, Jithendra Vepa and Thomas Hain, in: Int. Conf. on Spoken Language Processing (Interspeech ICSLP), 2006

Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, Guillaume Lathoud, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of ICASSP 2006, 2006

Tracking the Multi Person Wandering Visual Focus of Attention, Kevin C. Smith, Silèye O. Ba, Daniel Gatica-Perez and Jean-Marc Odobez, in: International Conference on Multimodal Interfaces (ICMI06), 2006

Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, David Barber and Silvia Chiappa, in: NIPS, 2006

Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, Norman Poh and Samy Bengio, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006

Using more informative posterior probabilities for speech recognition, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006

Using Pitch as Prior Knowledge in Template-Based Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: Proceedings of ICASSP, 2006, 2006

Using Posterior-Based Features in Template Matching for Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: International Conference on Spoken Language Processing, 2006

Writer Identification for Smart Meeting Room Systems, Marcus Liwicki, Andreas Schlapbach, Horst Bunke, Samy Bengio, Johnny Mariéthoz and Jonas Richiardi, in: Seventh IAPR Workshop on Document Analysis Systems, DAS, 2006

A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, Jean-François Paiement, Douglas Eck, Samy Bengio and David Barber, in: Proceedings of the 22nd International Conference on Machine Learning, 2005

A Meeting Browser Evaluation Test, Pierre Wellner, Mike Flynn, Simon Tucker and Steve Whittaker, in: CHI '92: Proceedings of the SIGCHI conference on Human factors in computing systems, Portland, OR, USA, ACM Press, 2005

A Neural Network for Text Representation, Mikaela Keller and Samy Bengio, in: International Conference on Artificial Neural Networks, ICANN, 2005

A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, Norman Poh and Samy Bengio, in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, Yves Grandvalet, Johnny Mariéthoz and Samy Bengio, in: Advances in Neural Information Processing Systems, NIPS 15, 2005

A Probabilistic Model for Chord Progressions, Jean-François Paiement, Douglas Eck and Samy Bengio, in: Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR), 2005

A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, Silèye O. Ba and Jean-Marc Odobez, in: ACM ICMI Workshop on Multimodal Multiparty Meeting Processing (MMMP), 2005

A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, Guillaume Lathoud and Mathew Magimai-Doss, in: Proceedings of ICASSP 2005, 2005

A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR, Guillaume Lathoud, Mathew Magimai-Doss and Bertrand Mesot, in: Proceedings of INTERSPEECH 2005, 2005

AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, Guillaume Lathoud, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag, 2005

Benchmarking Non-Parametric Statistical Tests, Mikaela Keller, Samy Bengio and Siew Yeung Wong, in: Advances in Neural Information Processing Systems, NIPS 18. MIT Press, 2005

Boosting word error rates, Christos Dimitrakakis and Samy Bengio, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2005

Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, Norman Poh and Samy Bengio, in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005

Detecting Group Interest-level in Meetings, Daniel Gatica-Perez, Iain A. McCowan, Dong Zhang and Samy Bengio, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005

Developing and Enhancing Posterior Based Speech Recognition Systems, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, in: Proceedings of Interspeech, 2005

EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, Norman Poh and Samy Bengio, in: Sixth International Workshop on Multiple Classifier System (MCS2005), 2005

Effect of Segmentation Method on Video Retrieval Performance, David Grangier and Alessandro Vinciarelli, in: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo (ICME-05), 2005

Evaluation of Multiple Cues Head Pose Tracking Algorithm in Natural Environments, Silèye O. Ba and Jean-Marc Odobez, in: International Conference on Multimedia & Expo ICME 2005, 2005

Exploiting Hyperlinks to Learn a Retrieval Model, David Grangier and Samy Bengio, in: NIPS Workshop on Learning to Rank, 2005

Extracting Information from Multimedia Meeting Collections, Daniel Gatica-Perez, Dong Zhang and Samy Bengio, in: 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005

F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, Norman Poh and Samy Bengio, in: Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-05), 2005

Generative Independent Component Analysis for EEG Classification, Silvia Chiappa and David Barber, in: European Symposium on Artificial Neural Networks ESANN, 2005

Generative Temporal ICA for Classification in Asynchronous BCI Systems, Silvia Chiappa and David Barber, in: The 2nd International IEEE EMBS Conference On Neural Engineering, 2005

Gradient estimates of return distributions, Christos Dimitrakakis and Samy Bengio, in: PASCAL Workshop on Principled Methods of Trading Exploration and Exploitation, 2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System, Hamed Ketabdar, Hervé Bourlard and Samy Bengio, in: Proceedings MLMI workshop, 2005

Implicit Control of Noise Canceller for Speech Enhancement, Julien Bourgeois, Jürgen Freudenberger and Guillaume Lathoud, in: Proceedings of INTERSPEECH 2005, 2005

Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, Norman Poh and Samy Bengio, in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005

Improving Speech Recognition Using a Data-Driven Approach, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: Proceedings of Interspeech, 2005, 2005

Inferring Document Similarity from Hyperlinks, David Grangier and Samy Bengio, in: ACM Conference on Information and Knowledge Management, 2005

Learning influence among interacting Markov chains, Dong Zhang, Daniel Gatica-Perez, Samy Bengio and Deb Roy, in: NIPS, 2005

Machine Learning for Multimodal Interaction: First International Workshop, MLMI'2004, Springer-Verlag Heidelberg, 2005

Modeling Scenes with Local Descriptors and Latent Aspects, Pedro Quelhas, Florent Monay, Jean-Marc Odobez, Daniel Gatica-Perez, Tinne Tuytelaars and Luc Van Gool, in: IEEE Int. Conf. on Computer Vision, 2005

Multi-resolution RASTA filtering for TANDEM-based ASR, Hynek Hermansky and Petr Fousek, in: Proceedings of Interspeech 2005, 2005

Multi-resolution Spectral Entropy Based Feature for Robust ASR, Hemant Misra, Shajith Ikbal, Sunil Sivadas and Hervé Bourlard, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2005

Multichannel Speech Enhancement in Cars: Explicit vs. Implicit Adaptation Control, Guillaume Lathoud, Julien Bourgeois and Jürgen Freudenberger, in: Proceedings of HSCMA 2005, 2005

Multimodal Integration for Meeting Group Action Segmentation and Recognition, Marc Al-Hames, Alfred Dielmann, Daniel Gatica-Perez, Stephan Reiter, Steve Renals and Dong Zhang, in: MLMI, 2005

Multimodal Multispeaker Probabilistic Tracking in Meetings, Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez and Iain A. McCowan, in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2005

On Accuracy/Robustness/Complexity Trade-Offs in Face Verification, Conrad Sanderson, Fabien Cardinaux and Samy Bengio, in: IEEE International Conference on Information Technology and Applications, ICITA, 2005

Semi-supervised Adapted HMMs for Unusual Event Detection, Dong Zhang, Daniel Gatica-Perez, Samy Bengio and Iain A. McCowan, in: Pro. IEEE CVPR, 2005

Semi-supervised Meeting Event Recognition with Adapted HMMs, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, in: Pro. IEEE ICME, 2005

Spectral Entropy Feature in Full-Combination Multi-Stream for Robust ASR, Hemant Misra and Hervé Bourlard, in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2005

Speech Acquisition in Meetings with an Audio-Visual Sensor Array, Iain A. McCowan, Maganti Hari Krishna, Daniel Gatica-Perez, Darren Moore and Silèye O. Ba, in: Pro. IEEE ICME, 2005

The AMI Meeting Corpus: a Pre-Announcement, Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Maël Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain A. McCowan, Wilfried Post, Dennis Reidsma and Pierre Wellner, in: Machine Learning for Multimodal Interaction: Second International Workshop, MLMI'2005, 2005

The Expected Performance Curve, Samy Bengio, Johnny Mariéthoz and Mikaela Keller, in: International Conference on Machine Learning, ICML, Workshop on ROC Analysis in Machine Learning, 2005

The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), Hynek Hermansky, Petr Fousek and Mikko Lehtonen, in: Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005, 2005

Tracking People in Meetings with Particles, Daniel Gatica-Perez, Jean-Marc Odobez, Silèye O. Ba, Kevin C. Smith and Guillaume Lathoud, in: Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS,',','), invited paper, 2005

Unsupervised Spectral Subtraction for Noise-Robust ASR, Guillaume Lathoud, Mathew Magimai-Doss, Bertrand Mesot and Hervé Bourlard, in: Proceedings of the 2005 IEEE ASRU Workshop, 2005

You Are Wrong!---Automatic Detection of Interaction Errors from Brain Waves, Pierre W. Ferrez and José del R. Millán, in: Proceedings of the 19th International Joint Conference on Artificial Intelligence, 2005

A Gentle Hessian for Efficient Gradient Descent, Ronan Collobert and Samy Bengio, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004

A probabilistic framework for joint head tracking and pose estimation, Silèye O. Ba and Jean-Marc Odobez, in: 17th Int. Conf. Pattern Recognition (ICPR), 2004

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, Guillaume Lathoud and Iain A. McCowan, in: Proceedings of the 2004 SAPA Workshop, 2004

A Statistical Significance Test for Person Authentication, Samy Bengio and Johnny Mariéthoz, in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004

A Symmetric Transformation for LDA-based Face Verification, Sébastien Marcel, in: Proceedings of the 6th International Conference on Automatic Face and Gesture Recognition, IEEE Computer Society Press, 2004

An Investigation of Spectral Subband Centroids for Speaker Authentication, Norman Poh, Conrad Sanderson and Samy Bengio, in: Int'l Conf. on Biometric Authentication, 2004

An Online Audio Indexing System, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, 2004

Assessing Scene Structuring in Consumer Videos, Daniel Gatica-Perez, Napat Triroj, Jean-Marc Odobez, Alexander Loui and Ming-Ting Sun, in: Int. Conf. on Image and Video Retrieval (CIVR), 2004

Boosting HMMs with an application to speech recognition, Christos Dimitrakakis and Samy Bengio, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004

Boosting Pixel-based Classifiers for Face Verification, Sébastien Marcel and Yann Rodriguez, in: Biometric Authentication Workshop of the 8th European Conference on Computer Vision, BIOAW2004, Springer-Verlag, 2004

Clustering And Segmenting Speakers And Their Locations In Meetings, Jitendra Ajmera, Guillaume Lathoud and Iain A. McCowan, in: ICASSP, 2004

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004

Cue integration through discriminative accumulation, Maria Elena Nilsback and Barbara Caputo, in: International Conference on Computer Vision and Pattern Recognition, 2004

Effect of Recognition Errors on Information Retrieval Performance, Alessandro Vinciarelli, in: Proceedings of International Workshop on Frontiers in Handwriting Recognition, 2004

Embedding motion in model-based stochastic tracking, Jean-Marc Odobez and Daniel Gatica-Perez, in: 17th Int. Conf. Pattern Recognition (ICPR), 2004

Entropy Based Combination of Tandem Representations for Noise Robust ASR, Shajith Ikbal, Hemant Misra, Sunil Sivadas, Hynek Hermansky and Hervé Bourlard, in: Proceedings of the INTERSPEECH-ICSLP-04, 2004

Estimating the Quality of Face Localization for Face Verification, Yann Rodriguez, Fabien Cardinaux, Samy Bengio and Johnny Mariéthoz, in: IEEE International Conference on Image Processing, ICIP, 2004

Face Verification Using Adapted Generative Models, Fabien Cardinaux, Conrad Sanderson and Samy Bengio, in: The 6th International Conference on Automatic Face and Gesture Recognition, FG2004, IEEE, 2004

Fusion of Structural and Color Local Descriptors for Enhanced Object Recognition, Pedro Quelhas and Jean-Marc Odobez, in: Proceedings IEEE WIAMIS 2004(5th International Workshop on Image Analysis for Multimedia Interactive Services,',','), 21-23 April, 2004, Lisboa, Portugal, 2004

HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, Silvia Chiappa and Samy Bengio, in: European Symposium on Artificial Neural Networks ESANN, 2004

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, Mathew Magimai-Doss, Samy Bengio and Hervé Bourlard, in: Proceedings of ICASSP, 2004

Links Between Perceptrons, MLPs and SVMs, Ronan Collobert and Samy Bengio, in: International Conference on Machine Learning, ICML, 2004

LP-TRAP: Linear predictive temporal patterns, Marios Athineos, Hynek Hermansky and Daniel P. W. Ellis, 2004

Modeling Individual and Group Actions in Meetings With Layered HMMs, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, in: IEEE Transaction on Multimedia, June, 2006, 2004

Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, in: the Second IEEE Workshop on Event Mining: Detection and Recognition of Events in Video, In Association with CVPR, 2004

Modelling Auxiliary Features in Tandem Systems, Mathew Magimai-Doss, Todd Andrew Stephenson, Shajith Ikbal and Hervé Bourlard, in: Proceedings of ICSLP, 2004

Multimodal Group Action Clustering in Meetings, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, in: ACM 2nd International Workshop on Video Surveillance & Sensor Networks in conjunction with 12th ACM International Conference on Multimedia, 2004

New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, Petr Fousek, Petr Svojanovsky, Frantisek Grezl and Hynek Hermansky, in: Proceedings of International Conference on Spoken Language Processing (ICSLP), 2004

Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, Norman Poh and Samy Bengio, in: The Speaker and Recognition Workshop, 2004

Noisy Text Categorization, Alessandro Vinciarelli, in: Proceedings of International Conference on Pattern Recognition (ICPR), 2004

On Performance Evaluation of Face Detection and Localization Algorithms, Vlad Popovici, Jean-Philippe Thiran, Yann Rodriguez and Sébastien Marcel, in: 17th International Conference on Pattern Recognition (ICPR), 2004

On the Need for On-Line Learning in Brain-Computer Interfaces, José del R. Millán, in: Proceedings of the International Joint Conference on Neural Networks, 2004

On Use of Task Independent Training Data in Tandem Feature Extraction, Sunil Sivadas and Hynek Hermansky, in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004

Online Policy Adaptation for Ensemble Classifiers, Christos Dimitrakakis and Samy Bengio, in: 12th European Symposium on Artificial Neural Networks, ESANN 04, 2004

Order Matters: A Distributed Sampling Method for Multi-Object Tracking, Kevin C. Smith, in: British Machine Vision Conference (BMVC), 2004

Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, Shajith Ikbal, Hemant Misra, Hervé Bourlard and Hynek Hermansky, in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004

PLP$^2$: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns, Marios Athineos, Hynek Hermansky and Daniel P. W. Ellis, 2004

PLSA-based Image Auto-Annotation: Constraining the Latent Space, Florent Monay and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2004

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: International Conference on Spoken Language Processing (ICSLP~2004), 2004

Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, Michael McGreevy, in: Proceedings of SST 2004 (10th Australian International Conference on Speech Science & Technology,',','), Sydney, Australia, 2004, 2004

Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, Dong Zhang, S. Z. Li and Daniel Gatica-Perez, in: the International Conference on Pattern Recognition (ICPR), 2004

Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, Agnès Just, O. Bernier and Sébastien Marcel, in: Proc. of the sixth International Conference on Automatic Face and Gesture Recognition, 2004

Restoring Locomotion with a Thought Controlled Mobile Robot, José del R. Millán, in: Proceedings of the 4th Forum of European Neuroscience, 2004

Robust Playfield Segmentation using MAP Adaptation, Mark Barnard and Jean-Marc Odobez, in: Proc. 17th International Conference on Pattern Recognition (ICPR 2004), 2004

Spectral Entropy Based Feature for Robust ASR, Hemant Misra, Shajith Ikbal, Hervé Bourlard and Hynek Hermansky, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2004

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, Shajith Ikbal, Mathew Magimai-Doss, Hemant Misra and Hervé Bourlard, in: Proceedings of the INTERSPEECH-ICSLP-04, 2004

Statistical Transformations of Frontal Models for Non-Frontal Face Verification, Conrad Sanderson and Samy Bengio, in: Proceedings of the IEEE International Conference on Image Processing (ICIP), 2004

Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, Jithendra Vepa and Simon King, in: Proceedings of ICSLP, 2004

Tangent Vector Kernels for Invariant Image Classification with SVMs, Alexei Pozdnoukhov and Samy Bengio, in: 17th Int. Conf. Pattern Recognition (ICPR), 2004

The Expected Performance Curve: a New Assessment Measure for Person Authentication, Samy Bengio and Johnny Mariéthoz, in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004

Theme Topic Mixture Model: A Graphical Model for Document Representation, Mikaela Keller and Samy Bengio, in: Pascal Workshop on Text Mining and Understanding, 2004

Towards Predicting Optimal Subsets of Base-Experts in Biometric Authentication Task, Norman Poh and Samy Bengio, 2004

Unsupervised Location-Based Segmentation of Multi-Party Speech, Guillaume Lathoud, Iain A. McCowan and Jean-Marc Odobez, in: Proceedings of the 2004 ICASSP-NIST Meeting Recognition Workshop, 2004

Using RASTA in task independent TANDEM feature extraction, Guillermo Aradilla, John Dines and Sunil Sivadas, in: Proceedings of ICSLP, 2004, 2004

Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, Norman Poh and Samy Bengio, in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004

A Hierarchical Keyframe User Interface for Browsing Video over the Internet, Maël Guillemot, Pierre Wellner, Daniel Gatica-Perez and Jean-Marc Odobez, in: Proceedings of the 9th International Conference on Human-Computer Interaction (INTERACT-2003), IOS Press, 2003

A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan and Jean-Marc Odobez, in: IEEE Int. Conf. on Computer Vision Workshop on Multimedia Technologies for E-Learning and Collaboration (ICCV-WOMTEC), 2003

A Robust Speaker Clustering Algorithm, Jitendra Ajmera and Charles Wooters, in: IEEE Automatic Speech Recognition Understanding Workshop, 2003

Adaptive Brain Interfaces for Communication and Control, José del R. Millán, in: Proceedings of the 10th International Conference on Human-Computer Interaction, 2003

An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, Samy Bengio, in: Advances in Neural Information Processing Systems, NIPS 15, MIT Press, 2003

An Implicit Motion Likelihood for Tracking with Particle Filters, Jean-Marc Odobez, Silèye O. Ba and Daniel Gatica-Perez, in: British Machine Vision Conference (BMVC), Springer Verlag, 2003

Audio-Visual Speaker Tracking with Importance Particle Filters, Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan, Jean-Marc Odobez and Darren Moore, in: IEEE International Conference on Image Processing (ICIP), 2003

Augmenting Frontal Face Models for Non-Frontal Verification, Conrad Sanderson and Samy Bengio, in: Proceedings of the 2003 Workshop on Multimodal User Authentication (MMUA'03), 2003

Client Dependent GMM-SVM Models for Speaker Verification, Quan Le and Samy Bengio, in: International Conference on Artificial Neural Networks, ICANN/ICONIP 2003, Springer Verlag, 2003

Comparison of different feature classifiers for brain computer interfaces, F. Cincotti, A. Scipione, A. Tiniperi, D. Mattia, M. G. Marciani, José del R. Millán, S. Salinari, L. Bianchi and F. Babiloni, in: Proceedings of the 1st International IEEE EMBS Conference on Neural Engineering, 2003

Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, Fabien Cardinaux, Conrad Sanderson and Sébastien Marcel, in: 4th International Conference on AUDIO- and VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 2003

Confusion Matrix Based Entropy Correction in Multi-stream Combination, Hemant Misra and Andrew Morris, in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2003

Direct Non-Invasive Brain Computer Interfaces, R. Grave de Peralta Menendez, S. L. González Andino, José del R. Millán, T. Pun and C. M. Michel, in: Proceedings of the 9th International Conference on Functional Mapping of the Human Brain, 2003

Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Improving Face Authetication Using Virtual Samples, Norman Poh, Sébastien Marcel and Samy Bengio, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003

Location Based Speaker Segmentation, Guillaume Lathoud and Iain A. McCowan, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, Vivek Tyagi, Iain A. McCowan, Hervé Bourlard and Hemant Misra, in: IEEE ASRU, 2003

Microphone Array Speech Recognition : Experiments on Overlapping Speech in Meetings, Darren Moore and Iain A. McCowan, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Modeling Human Interaction in Meetings, Iain A. McCowan, Samy Bengio, Daniel Gatica-Perez, Guillaume Lathoud, Florent Monay, Darren Moore, Pierre Wellner and Hervé Bourlard, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2003

Modélisation implicite du mouvement en suivi par filtrage de Monte Carlo séquentiel, Jean-Marc Odobez and Silèye O. Ba, in: GRETSI conference, Signal and Image Processing,, 2003

Multi-Modal Audio-Visual Event Recognition for Football Analysis, Mark Barnard, Jean-Marc Odobez and Samy Bengio, in: Proc. IEEE Workshop on Neural Networks for Signal Processing (NNSP), 2003

Multimodal Authentication using Asynchronous HMMs, Samy Bengio, in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003

New Entropy Based Combination Rules in HMM/ANN Multi-stream ASR, Hemant Misra, Hervé Bourlard and Vivek Tyagi, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003

Noise Resistant Audio-Visual Verification via Structural Constraints, Conrad Sanderson and Kuldip K. Paliwal, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Non-Invasive Brain-Actuated Control of a Mobile Robot, José del R. Millán, F. Renkens, J. Mouriño and W. Gerstner, in: Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003

Non-Linear Variance Reduction Techniques in Biometric Authentication, Norman Poh and Samy Bengio, in: Workshop on Multimodal User Authentication, 2003

Nonlinear Spectral Transformations for Robust Speech Recognition, Shajith Ikbal, Hynek Hermansky and Hervé Bourlard, in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop 2003, 2003

Offline Recognition of Large Vocabulary Cursive Handwritten Text, Alessandro Vinciarelli, Samy Bengio and Horst Bunke, in: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), 2003

On automatic annotation of meeting databases, Daniel Gatica-Perez, Iain A. McCowan, Mark Barnard, Samy Bengio and Hervé Bourlard, in: IEEE International Conference on Image Processing (ICIP), 2003

On Factorizing Spectral Dynamics for Robust Speech Recognition, Vivek Tyagi, Iain A. McCowan, Hervé Bourlard and Hemant Misra, in: Eurospeech, 2003

On Image Auto-Annotation with Latent Space Models, Florent Monay and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2003

On the Combination of Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: European Conference On Speech, Communication and Technology (EUROSPEECH'03), 2003

Phase AutoCorrelation (PAC) derived Robust Speech Features, Shajith Ikbal, Hemant Misra and Hervé Bourlard, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Phoneme-Grapheme Based Speech Recognition System, Mathew Magimai-Doss, Todd Andrew Stephenson, Hervé Bourlard and Samy Bengio, in: Proceedings of IEEE ASRU, 2003

Robust Features for Frontal Face Authentication in Difficult Image Conditions, Conrad Sanderson and Samy Bengio, in: Proceedings of 4th International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA-03), 2003

Scalability Analysis of Audio-Visual Person Identity Verification, J. Czyz, Samy Bengio, Christine Marcel and L. Vandendorpe, in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003

Segmenting Multiple Concurrent Speakers Using Microphone Arrays, Guillaume Lathoud, Iain A. McCowan and Darren Moore, in: Proceedings of Eurospeech 2003, 2003

Sequential Monte Carlo Video Text Segmentation, Datong Chen and Jean-Marc Odobez, in: ICIP, 2003

Spectral Structuring of Home Videos, Jean-Marc Odobez, Daniel Gatica-Perez and Maël Guillemot, in: International Conference on Image and Video Retrieval (CIVR'03), Springer Verlag, 2003

Speech & Face Based Biometric Authentication at IDIAP, Conrad Sanderson, Samy Bengio, Hervé Bourlard, Johnny Mariéthoz, Ronan Collobert, Mohamed Faouzi BenZeghiba, Fabien Cardinaux and Sébastien Marcel, in: Proceedings of the 2003 IEEE International Conference on Multimedia & Expo (ICME-03), 2003

Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Studying Phase Synchrony for Classification of Mental Tasks in Brain Machine Interfaces, E. Gysels, José del R. Millán, Silvia Chiappa and P. Celka, in: Proceedings of the Conference of the International Society for Brain Electromagnetic Topography, 2003

The BANCA Database and Evaluation Protocol, E. Bailly-Baillière, Samy Bengio, Frédéric Bimbot, M. Hamouz, J. Kittler, Johnny Mariéthoz, J. Matas, K. Messer, Vlad Popovici, F. Porée, B. Ruiz and Jean-Philippe Thiran, in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003

TRAP-TANDEM: Data-driven extraction of temporal features from speech, Hynek Hermansky, in: large part published in Proceedings of ASRU-2003, 2003

Using pitch frequency information in speech recognition, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, in: Proceedings of Eurospeech, 2003

Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, Pedro Quelhas and James Boyce, in: Pattern Recognition and Image Analysis: First Iberian Conference, IbPRIA 2003, Springer-Verlag LNCS, 2003

Video Shot Clustering using Spectral Methods, Jean-Marc Odobez, Daniel Gatica-Perez and Maël Guillemot, in: 3rd Workshop on Content-Based Multimedia Indexing (CBMI), 2003

A Comparative Study of Adaptation Methods for Speaker Verification, Johnny Mariéthoz and Samy Bengio, in: International Conference on Spoken Language Processing ICSLP, 2002

A Multi-sample Multi-source Model for Biometric Authentication, Norman Poh, Samy Bengio and Jerzy Korczak, in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002

A Parallel Mixture of SVMs for Very Large Scale Problems, Ronan Collobert, Samy Bengio and Yoshua Bengio, in: Advances in Neural Information Processing Systems, NIPS 14, MIT Press, 2002

A State-of-the-art Neural Network for Robust Face Verification, Sébastien Marcel, Christine Marcel and Samy Bengio, in: Proceedings of the COST275 Workshop on The Advent of Biometrics on the Internet, 2002

Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, in: Seventh International Conference on Spoken Language Processing (ICSLP~2002), 2002

Conditional Gaussian Mixture Models for Environmental Risk Mapping, Nicolas Gilardi, Samy Bengio and Mikhail Kanevski, in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002

Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, Todd Andrew Stephenson, Jaume Escofet, Mathew Magimai-Doss and Hervé Bourlard, in: 2002 IEEE International Workshop on Neural Networks for for Signal Processing (NNSP~2002), 2002

Evaluation of Formant-Like Features for ASR, Katrin Weber, F. de Wet, B. Cranen, Louis Boves, Samy Bengio and Hervé Bourlard, in: International Conference on Spoken Language Processing (ICSLP 2002), 2002

Evolution of the Mental States Operating a Brain-Computer Interface, J. Mouriño, Silvia Chiappa, R. Jané and José del R. Millán, in: Proceedings of the International Federation for Medical and Biological Engineering, 2002

Face Verification using MLP and SVM, Fabien Cardinaux and Sébastien Marcel, in: XI Journees NeuroSciences et Sciences pour l'Ingenieur (NSI 2002), 2002

Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, B. Fasel, in: International IEEE Workshop on Neural Networks for Signal Processing (NNSP 02), 2002

Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, B. Fasel, in: International IEEE Conference on Multimodal Interfaces (ICMI 02), 2002

Improving Face Verification using Skin Color Information, Sébastien Marcel and Samy Bengio, in: Proceedings of the 16th International Conference on Pattern Recognition, IEEE Computer Society Press, 2002

Increasing Speech Recognition Noise Robustness with HMM2, Katrin Weber, Samy Bengio and Hervé Bourlard, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 02), 2002

Linking Objects in Videos by Importance Sampling, Daniel Gatica-Perez and Ming-Ting Sun, in: IEEE International Conference on Multimedia and Expo, 2002

Low cost duration modelling for noise robust speech recognition, Andrew Morris, Simon Payne and Hervé Bourlard, in: Proc. ICSLP, 2002

Microphone Array Post-filter for Diffuse Noise Field, Iain A. McCowan and Hervé Bourlard, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2002

Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, in: International Conference on Pattern Recognition (ICPR~2002), 2002

Mutliscale Facial Expression Recognition using Convolutional Neural Networks, B. Fasel, in: Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 02), 2002

Object Localization in Metric Spaces for Video Linking, Daniel Gatica-Perez and Ming-Ting Sun, in: IEEE Workshop on Motion and Video Computing, 2002

Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, Alessandro Vinciarelli and Samy Bengio, in: Proceedings of International Conference on Pattern Recognition, 2002

Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, Daniel Gatica-Perez, Ming-Ting Sun and Alexander Loui, in: IEEE International Conference on Image Processing, 2002

Robust Face Analysis using Convolutional Neural Networks, B. Fasel, in: Proceedings of the International Conference on Pattern Recognition (ICPR 02), 2002

Robust HMM-Based Speech/Music Segmentation, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, in: ICASSP, 2002

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, Iain A. McCowan, Andrew Morris and Hervé Bourlard, in: Proceedings of International Conference on Speech and Language Processing (ICSLP), 2002

Scaling Large Learning Problems with Hard Parallel Mixtures, Ronan Collobert, Yoshua Bengio and Samy Bengio, in: International Workshop on Pattern Recognition with Support Vector Machines, SVM'2002, 2002

Speaker Normalization using HMM2, Shajith Ikbal, Katrin Weber and Hervé Bourlard, in: Proceedings of the 2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP-02), 2002

Text Segmentation and Recognition in Complex Background Based on Markov Random Field, Datong Chen, Jean-Marc Odobez and Hervé Bourlard, in: Int. Conf. Pattern Recognition 2002, 2002

Unknown-Multiple Speaker clustering using HMM, Jitendra Ajmera, Hervé Bourlard, I. Lapidot and Iain A. McCowan, in: ICSLP, 2002

User-Customized Password HMM Based Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: Proceedings of the COST275 Workshop on the Advent of Biometrics on the Internet, 2002

User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: International Conference on Spoken Language Processing (ICSLP~2002), 2002

Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, Jean-Marc Odobez and Datong Chen, in: Int. Conf. Image Processing 2002, 2002

Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, Alessandro Vinciarelli and Samy Bengio, in: Proceedings of 8$^{th}$ International Conference on Frontiers on Handwriting Recognition, 2002

Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, Astrid Hagen, Hervé Bourlard and Andrew Morris, in: ICASSP, 2001

Confidence Evaluation for Risk Prediction, Nicolas Gilardi, Tom Melluish and Michel Maignan, in: 2001 Annual Conference of the IAMG, 2001

Data utility modelling for mismatch reduction, Andrew Morris, in: Proc. CRAC (workshop on Consistent & Reliable Acoustic Cues for sound analysis), 2001

EEG pattern recognition through multi-stream evidence combination, Andrew Morris, Bernhard Obermaier and Gert Pfurtscheller, in: Proc. World Congress on Neuroinformatics, 2001

Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, Astrid Hagen and Hervé Bourlard, in: EUROSPEECH, 2001

From missing data to maybe useful data: soft data modelling for noise robust ASR, Andrew Morris, Jon Barker and Hervé Bourlard, in: Proc. WISP, 2001

HMM2- Extraction of Formant Features and their Use for Robust ASR, Katrin Weber, Samy Bengio and Hervé Bourlard, in: European Conference on Speech Communication and Technology (Eurospeech 2001), 2001

Learning the Decision Function for Speaker Verification, Samy Bengio and Johnny Mariéthoz, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2001

MAP Combination of Multi-Stream HMM or HMM/ANN Experts, Andrew Morris, Astrid Hagen and Hervé Bourlard, in: Proc. Eurospeech, 2001

Modeling Auxiliary Information in Bayesian Network Based ASR, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, in: 7th European Conference on Speech Communication and Technology (Eurospeech~2001), 2001

New Approaches Towards Robust and Adaptive Speech Recognition, Hervé Bourlard, Samy Bengio and Katrin Weber, in: Advances in Neural Information Processing Systems 13, MIT Press, 2001

Signal modeling with Non Uniform Topology lattice filters, Sacha Krstulović and Frédéric Bimbot, in: Proc. ICASSP 2001, 2001

Speech Recognition Using Advanced HMM2 Features, Katrin Weber, Samy Bengio and Hervé Bourlard, in: Automatic Speech Recognition and Understanding Workshop, 2001

Text Enhancement with Asymmetric Filter for Video OCR, Datong Chen, Kim Shearer and Hervé Bourlard, in: Proceedings of the 11th International Conference on Image Analysis and Processing, 2001

Text Identification in Complex Background using SVM, Datong Chen, Hervé Bourlard and Jean-Philippe Thiran, in: Proceedings of the Int. Conf. on computer vision and pattern recognition, 2001

Video OCR for Sport Video Annotation and Retrieval, Datong Chen, Kim Shearer and Hervé Bourlard, in: Proceedings of the 8th IEEE International Conference on Mechatronics and Machine Vision in Practice, 2001

A front-end using the harmonicity cue for speech enhancement in loud noise, Frédéric Berthommier, Hervé Glotin and Emmanuel Tessier, in: Int. Conf. on Spoken Language Processing (ICSLP), 2000

A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, Johnny Mariéthoz, Johan Lindberg and Frédéric Bimbot, in: ICSLP, 2000

A neural network for classification with incomplete data: application to robust ASR, Andrew Morris, Ljubomir Josifovski, Hervé Bourlard, Martin Cooke and Phil Green, in: Proc. ICSLP, 2000

Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, Johnny Mariéthoz and Frédéric Bimbot, in: Journee d'Etudes sur la Parole, Aussois, 2000

Audio visual speech recognition, C. Neti, G. Potamianos, Juergen Luettin, I. Matthews, Hervé Glotin, D. Vergyri, J. Sison and A. Mashari, Johns Hopkins University-CLSP, 2000

Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, Todd Andrew Stephenson, Hervé Bourlard, Samy Bengio and Andrew Morris, in: 6th International Conference on Spoken Language Processing: ICSLP~2000 (Interspeech~2000), 2000

Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, Corinne Fredouille, Johnny Mariéthoz, Cédric Jaboulet, Jean Hennebert, Chafic Mokbel and Frédéric Bimbot, in: ICASSP2000 - IEEE International Conference on Acoustics, Speech, and Signal Processing, 2000

Blind acoustic source separation for cocktail party speech recognition, H. Hong, Seunjin Choi, Hervé Glotin and Frédéric Berthommier, in: ICONIP, 7th IEEE Int. Conf. on Neural Information Processing, 2000

Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, Astrid Hagen and Andrew Morris, in: ICSLP, 2000

Comparison of Unsupervised and Supervised Training of RBF Neural Networks. Case Study: Mapping of Contamination Data, V. Polishchuk and Mikhail Kanevski, in: Neural Computation 2000, 2000

Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, Nicolas Gilardi, Mikhail Kanevski, Michel Maignan and Eddy Mayoraz, in: Geostatistical congress 2000, 2000

Etudes comparatives des robustesses au bruit de l'approche 'Full Combination' et de son approximation, Astrid Hagen and Hervé Glotin, in: Journee d'Etudes sur la Parole, Aussois, 2000

Fast latent semantic indexing of spoken documents by using self-organizing maps, Mikko Kurimo, in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'2000, 2000

From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, Astrid Hagen, Andrew Morris and Hervé Bourlard, in: ISCA ITRW ASR2000, 2000

HMM2- A Novel Approach to HMM Emission Probability Estimation, Katrin Weber, Samy Bengio and Hervé Bourlard, in: International Conference on Spoken Langugae Processing (ICSLP 2000), 2000

Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, Kim Shearer, Chitra Dorai and Svetha Venkatesh, in: Proceedings of the Sixth ACM International Conference on Knowledge Discovery and Data Mining, ACM, Boston, MA, USA, 2000

Indexing spoken audio by LSA and SOMs, Mikko Kurimo, in: Proceedings of the European Signal Processing Conference EUSIPCO'2000, 2000

Indoor Radon Risk Assessment with Geostatistics and Artificial Neural Networks, V. Demyanov, Mikhail Kanevski, Michel Maignan, E. Savelieva, V. Timonin, S. Chernov and G. Piller, in: Geostatistical congress 2000, 2000

Inverse lattice filtering of speech with adapted non-uniform delays, Sacha Krstulović and Frédéric Bimbot, in: Proc. ICSLP 2000, 2000

Iterative Posterior-Based Keyword Spotting Without Filler Models, Marius-Calin Silaghi and Hervé Bourlard, in: Proceedings of the IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 2000

LPC modeling with speech production constraints, Sacha Krstulović, in: Proc. 5th Speech Production Seminar, 2000

Multichannel signal separation for cocktail party speech recognition: a dynamic recurrent network, Seunjin Choi, H. Hong, Hervé Glotin and Frédéric Berthommier, in: Int. Conf. on Spoken Language Processing (ICSLP), no IDIAP RR, see RESPITE www, 2000

Multiple Hypotheses Video OCR, Datong Chen and Juergen Luettin, in: Proceedings of the 4th International Workshop on Document Analysis System, 2000

Multiple Timescale Feature Combination towards Robust Speech Recognition, Katrin Weber, in: KONVENS 2000 / Sprachkommunikation, 2000

Neural Network Residual Stochastic Co-simulation for Environmental Data Analysis, V. Demyanov, Mikhail Kanevski, E. Savelieva, V. Timonin and S. Chernov, in: Neural Computation 2000, 2000

Off-Line Cursive Script Recognition Based on Continuous Density HMM, Alessandro Vinciarelli and Juergen Luettin, in: Proceedings of 7th International Workshop on Frontiers in Handwriting Recognition, 2000

Recognition of Asymmetric Facial Action Unit Activities and Intensities, B. Fasel and Juergen Luettin, in: Proceedings of the International Conference on Pattern Recognition (ICPR 2000), 2000

Reconnaissance de la parole dans le bruit après renforcement fondé sur l'harmonicité, Frédéric Berthommier and Hervé Glotin, in: Proceedings of JEP'2000, no IDIAP RR, see RESPITE www, 2000

Relating LPC modeling to a factor-based articulatory model, Sacha Krstulović, in: Proc. ICSLP 2000, 2000

Some applications of a priori knowledge in multi-stream HMM and HMM/ANN based ASR, Andrew Morris, in: Phonus No.5,Dec.2000, ISSN 0949-1791, Proc. Workshop on Phonetics and Phonology in ASR, 2000

Test of several external posterior weighting functions for multiband Full Combination ASR, Hervé Glotin and Frédéric Berthommier, in: Int. Conf. on Spoken Language Processing (ICSLP), 2000

Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, Astrid Hagen and Hervé Bourlard, in: ICSLP, 2000

A CASA front-end using the localisation cue for segregation and then cocktail-party speech recognition, Emmanuel Tessier, Frédéric Berthommier, Hervé Glotin and Seunjin Choi, in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999

A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition, Hervé Glotin, Frédéric Berthommier and Emmanuel Tessier, in: Proc.\ European Conf.\ on Speech Communication and Technology (EUROSPEECH), 1999

A comparison of mixture models for density estimation, Perry Moerland, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999

A comparison of two strategies for ASR in additive noise : Missing Data and Spectral Subtraction, Christopher Kermorvant and Andrew Morris, in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999

A measure of speech and pitch reliability from voicing, Frédéric Berthommier and Hervé Glotin, in: Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI), Scandinavian AI Society, 1999

A new SNR-feature mapping for robust multistream speech recognition, Frédéric Berthommier and Hervé Glotin, in: Proc. Int. Congress on Phonetic Sciences (ICPhS), 1999

An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, Frédéric Bimbot, Mats Blomberg, Louis Boves, Gérard Chollet, Cédric Jaboulet, Bruno Jacob, Jamal Kharroubi, Johan Koolwaaij, Johan Lindberg, Johnny Mariéthoz, Chafic Mokbel and Houda Mokbel, in: 6th european conference on speech communication and technology --- eurospeech'99, 1999

Audio-Visual Person Verification, Souheil Ben-Yacoub, Juergen Luettin, K. Jonsson, J. Matas and J. Kittler, in: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 1999, Fort Collins, USA, 1999

Blind separation of delayed and superimposed acoustic sources : learning algorithms an experimental study, Seunjin Choi, Youngki Lyu, Frédéric Berthommier, Hervé Glotin and Andrzej Cichocki, in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999

Classification using localized mixtures of experts, Perry Moerland, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999

CLIENT / WORLD MODEL SYNCHRONOUS ALIGNEMENT FOR SPEAKER VERIFICATION, Johnny Mariéthoz, Dominique Genoud, Frédéric Bimbot and Chafic Mokbel, in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999

Combinatorial Approach for Data Binarization, Eddy Mayoraz and Miguel Moreira, in: Principles of Data Mining and Knowledge Discovery: third european conference; proceedings / PKDD'99, Springer, 1999

Data binarization by discriminant elimination, Miguel Moreira, Alain Hertz and Eddy Mayoraz, in: Proceedings of the ICML-99 Workshop: From Machine Learning to Knowledge Discovery in Databases, 1999

Decision-Oriented Environmental Mapping with Radial Basis Function Neural Networks, V. Demyanov, Nicolas Gilardi, Mikhail Kanevski, Michel Maignan and V. Polishchuk, in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999

Deliberate Imposture: a challenge for automatic speaker verification systems, Dominique Genoud and Gérard Chollet, in: Proceedings of the European Conference on Speech Communication and Technology, 1999

Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR, Astrid Hagen, Andrew Morris and Hervé Bourlard, in: Robust Methods for Speech Recognition in Adverse Conditions, 1999

Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, Nicolas Gilardi, Mikhail Kanevski, Michel Maignan and Eddy Mayoraz, in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999

Evaluating the Complexity of Databases for Person Identification and Verification, Georg Thimm, Souheil Ben-Yacoub and Juergen Luettin, in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999

Experimental evaluation of text-dependent speaker verification on laboratory and field test databases in the M2VTS project, Laurent Besacier, Juergen Luettin, Gilbert Maître and E. Meurville, in: Proceedings of the European Conference on Speech Communication and Technology, 1999

Extraction of Articulators in X-Ray Image Sequences, Georg Thimm and Juergen Luettin, in: Proceedings of the European Conference on Speech Communication and Technology, 1999

Fast Face Detection using MLP and FFT, Souheil Ben-Yacoub, B. Fasel and Juergen Luettin, in: Proc. Second International Conference on Audio and Video-based Biometric Person Authentication (AVBPA'99), 1999

Illumination-robust Pattern Matching Using Distorted Color Histograms, Georg Thimm and Juergen Luettin, in: Pattern Recognition and Image Understanding, Infix, 1999

Incremental Enrollment of Speech Recognizers, Chafic Mokbel and Olivier Collin, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'99,',','), Phoenix, Arizona, USA, 1999

Iterative Posterior-Based Keyword Spotting Without Filler Models, Marius-Calin Silaghi and Hervé Bourlard, in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU'99) Workshop, 1999

Latent Semantic Indexing by Self-Organizing Map, Mikko Kurimo and Chafic Mokbel, in: ESCA ETRW workshop on Accessing Information in Spoken Audio, 1999

LPC-based inversion of the DRM articulatory model, Sacha Krstulović, in: Proc. Eurospeech'99, 1999

Multi Modal Verification for Teleservices and Security Applications, G. Richard, Y. Menguy, I. Guis, N. Suaudeau, J. Boudy, P. Lockwood, C. Fernández, F. Fernàndez, D. Garcia-Plaza, C. Kotropoulos, A. Tefas, I. Pitas, R. Heimgartner, P. Ryser, C. Beumier, Patrick Verlinde, S. Pigeon, G. Matas, J. Kittler, Josef Bigün, Yousri Abdeljaoued, E. Meurville, Laurent Besacier, M. Ansorge, Gilbert Maître, Juergen Luettin, Souheil Ben-Yacoub, B. Ruiz, J. Cortés and K. Aldama, in: IEEE International Conference on Multimedia Computing and Systems, 1999

Multi-Modal Data Fusion for Person Authentication using SVM, Souheil Ben-Yacoub, in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999

Non-Stationary Multi-Channel (Multi-Stream) Processing Towards Robust and Adaptive ASR, Hervé Bourlard, in: Proc. of the ESCA Workshop on Robust Methods for Speech Recognition in Adverse Conditions, 1999

Robust Person Verification based on Speech and Facial Images, Juergen Luettin and Souheil Ben-Yacoub, in: Proceedings of the European Conference on Speech Communication and Technology, 1999

The Elisa'99 Speaker Recognition and Tracking Systems, B. Nedic, Guillaume Gravier, Jamal Kharroubi, Gérard Chollet, Dijana Petrovska-Delacretaz, G. Durou, Frédéric Bimbot, Raphaël Blouet, M. Seck, Jean-François Bonastre, Corinne Fredouille, Teva Merlin, I. Magrin-Chagnolleau, S. Pigeon, Patrick Verlinde and Jan Cernocky, in: IEEE Workshop on Automatic Advanced Technologies, 1999

The full combination sub-bands approach to noise robust HMM/ANN based ASR, Andrew Morris, Astrid Hagen and Hervé Bourlard, in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999

Towards introducing long-term statistics in MUSE for robust speech recognition, Christopher Kermorvant and Chafic Mokbel, in: Automatic Speech Recognition and Understanding (ASRU) workshop, 1999

Tracking Articulators in X-ray Movies of the Vocal Tract, Georg Thimm, in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999

XM2VTSDB: The Extended M2VTS Database, K. Messer, J. Matas, J. Kittler, Juergen Luettin and Gilbert Maître, in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999

A comparison of a priori threshold setting procedures for speaker verification in the CAVE project, J. B. Pierrot, Johan Lindberg, Johan Koolwaaij, H. P. Hutter, Dominique Genoud, Mats Blomberg and Frédéric Bimbot, in: ICASSP 98, 1998

An overview of the cave project research activities in speaker verification, Frédéric Bimbot, H. P. Hutter, Cédric Jaboulet, Johan Koolwaaij, Johan Lindberg and J. B. Pierrot, in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998

Confidence Measures in Hybrid HMM/ANN Speech Recognition, Giulia Bernardis and Hervé Bourlard, in: Proceedings of Workshop on Text, Speech and Dialog (TSD'98) Brno, Czech Republic, 1998

Connectionist speech recognition, Hervé Bourlard, in: Proceedings of IK'98, Interdisziplinares Kolleg, Spring Scholl, Gunne am Mohnessee, Germany, March 7--14, 1998

Continuous Audio-Visual Speech Recognition, Juergen Luettin and Stéphane Dupont, in: Proc. 5th European Conference on Computer Vision, Springer Verlag, 1998

Decision fusion using a multi-linear classifier, Patrick Verlinde, Gilbert Maître and Eddy Mayoraz, in: 1st International Conference on Multisource-Multisensor Data Fusion, 1998

Improved Pairwise Coupling Classification With Correcting Classifiers, Miguel Moreira and Eddy Mayoraz, in: Machine Learning: ECML-98, Springer, 1998

Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, Giulia Bernardis and Hervé Bourlard, in: Proceedings of International Conference on Spoken Language Processing (ICSLP'98) Sydney, Australia, 1998

Interfacing of CASA and Multistream recognition, Hervé Glotin, Frédéric Berthommier, Emmanuel Tessier and Hervé Bourlard, in: TSD'98-Text, Speech and Dialog International Workshop, BRNO-Czech Republic, 1998

Interfacing of CASA and partial recognition based on a multistream technique, Frédéric Berthommier, Hervé Glotin, Emmanuel Tessier and Hervé Bourlard, in: ICSLP'98, Sidney, 1998

POLYCOST: a telephone-speech database for speaker recognition, Dijana Petrovska-Delacretaz, Jean Hennebert, H. Melin and Dominique Genoud, in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998

Reconnaissance multi-bandes de la parole bruitée par couplage entre les niveaux primitifs et d'identification, Hervé Glotin, Emmanuel Tessier, Hervé Bourlard and Frédéric Berthommier, in: Journees Etude Parole - Martigny, 1998

Reconnaissance robuste de la parole par segmentation signal/bruit en sous-bandes, Hervé Glotin, Emmanuel Tessier, Hervé Bourlard and Frédéric Berthommier, in: Neurosciences et Sciences de l'Ingenieur'98 - Munster, CNRS, 1998

Speech pre-processing against intentional imposture in speaker recognition, Dominique Genoud and Gérard Chollet, in: Proceedings of ICSLP, Sidney, 1998

Text dependent speaker verification using binary classifiers, Dominique Genoud, Miguel Moreira and Eddy Mayoraz, in: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing --- ICASSP'98, IEEE, IEEE, 1998

Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database, Stéphane Dupont and Juergen Luettin, in: Proc. 5th Int. Conf. on Spoken Language Processing, 1998

Voice transformation, a tool for imposture of speaker verification, Dominique Genoud and Gérard Chollet, in: Proceedings of International Phonetic Science conference IPS98, Washington, 1998

Voice-B System, Gilles Caloz, Cédric Jaboulet, Johnny Mariéthoz, A. Glaeser and Dominique Genoud, in: IEEE 4th Workshop on Intercative Voice Technology for Telecommunications Applications (IVTTA'98) September 29--30, Torino, Italy, 1998

A Connectionist System for Two-Dimensional Representation of Multivariate Location Data, Emile Fiesler and Michel Maignan, in: Proceedings of the Fifth International Workshop on Artificial Intelligence for High Energy Physics, AIHENP, Lausanne, Switzerland, Elsevier Science, 1997

Acoustic-Labial Speaker Verification, Pierre Jourlin, Juergen Luettin, Dominique Genoud and Hubert Wassner, in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997

Adapting the 2-Class Recursive Deterministic Perceptron Neural Network to m Classes, M. Tajine, D. Elizondo, Emile Fiesler and Jerzy Korczak, in: Proceedings of the International Conference on Neural Networks, IEEE, IEEE, 1997

An Optical Thresholding Perceptron, Indu Saxena, Perry Moerland, Emile Fiesler, A. R. Pourzand and N. Collings, in: Proceedings of the Workshop on Optics and Computer Science, Geneva, Switzerland, 1997

Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems, Jean Hennebert, Christophe Ris, Hervé Bourlard and Steve Renals, in: EUROSPEECH'97, 1997

Handwritten Digit Recognition with Binary Optical Perceptron, Indu Saxena, Perry Moerland, Emile Fiesler and A. R. Pourzand, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997

Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, Hervé Bourlard and Nelson Morgan, in: International School on Neural Nets: Adaptive Processing of Temporal Information, Springer Verlag, 1997

Hybrid HMM/ANN Systems for Training Independent Tasks: Experiments on 'Phonebook' and Related Improvements, Stéphane Dupont, Hervé Bourlard, O. Deroo, Vincent Fontaine and J. -M. Boite, in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997

Integrating Acoustic and Labial Information for Speaker Identification and Verification, Pierre Jourlin, Juergen Luettin, Dominique Genoud and H. Wassner, in: Proceedings of the European Conference on Speech Communication and Technology, 1997

Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, Frédéric Bimbot and Dominique Genoud, in: Eurospeech 97, 1997

Mixtures of Experts Estimate A Posteriori Probabilities, Perry Moerland, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997

On the Complexity of Recognizing Iterated Differences of Polyhedra, Eddy Mayoraz, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997

On the Decomposition of Polychotomies into Dichotomies, Eddy Mayoraz and Miguel Moreira, in: Proceedings of The Fourteenth International Conference on Machine Learning, Morgan Kaufmann, 1997

Person Authentication by Fusing Face and Speech Information, Benoît Duc, Gilbert Maître, Stefan Fischer and Josef Bigün, in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997

Robust Speech Recognition based on Multi-Stream Features, Stéphane Dupont, Hervé Bourlard and Christophe Ris, in: Proc. of the ESCA-NATO Workshop on Robust Speech Recognition for Unknown Communication Channels, 1997

Speaker Verification in the Telephone Network : Research Activities in the CAVE Project, Frédéric Bimbot, H. P. Hutter, Cédric Jaboulet, Johan Koolwaaij, Johan Lindberg and J. B. Pierrot, in: Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'97), 1997

Speaker-Dependent Speech Recognition Based on Phone-Like Unit Model -- Application to Voice Dialing, Vincent Fontaine and Hervé Bourlard, in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997

State-of-the-Art and Recent Progress in Hybrid HMM/ANN Speech Recognition, Hervé Bourlard, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997

Subband-Based Speech Recognition, Hervé Bourlard and Stéphane Dupont, in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997

Towards Speaker Independent Continuous Speechreading, Juergen Luettin, in: Proceedings of the European Conference on Speech Communication and Technology, 1997

Using Multiple Time Scales in a Multi-Stream Speech Recognition System, Stéphane Dupont and Hervé Bourlard, in: EUROSPEECH'97, 1997

A Boolean Approach to Construct Neural Networks for Non-Boolean Problems, Georg Thimm and Emile Fiesler, in: Proceedings of the 8th IEEE International Conference on Tools with Artificial Intelligence, IEEE, 1996

A Method for All-Positive Optical Multilayer Perceptrons, Indu Saxena, Emile Fiesler and Perry Moerland, in: Proceedings of the Third IEEE International Conference on Electronics, Circuits, and Systems, University of Patras, Rhodos, Greece, IEEE, 1996

Amelioration des performances de verification du locuteur par combinaison de methodes, Dominique Genoud, Guillaume Gravier, Frédéric Bimbot and Gérard Chollet, in: Journees d'etudes sur la parole, JEP, 1996

Bounds on the Degree of High Order Binary Perceptrons, Eddy Mayoraz, in: Proceedings of ESANN'96, D facto, 1996

Combining methods to improve speaker verification decision, Dominique Genoud, Frédéric Bimbot, Guillaume Gravier and Gérard Chollet, in: Proceedings of The Fourth International Conference on Spoken Language Processing, ICSLP, ICSLP, 1996

Connectionist Quantization Functions, Tomas Lundin, Emile Fiesler and Perry Moerland, in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996

ETC\_vérif : un environnement multi-agents de reconnaissance automatique de la parole en continu, Jean-Luc Cochard and Murielle Vial, in: Proceedings of JEP'96: XXIemes Journees d'etude sur la Parole, 1996

Extended Cauchy Machines, S. Cuche and Emile Fiesler, in: Proceedings of the International Conference on Neural Information Processing, 1996

Hardware-Friendly Learning Algorithms for Neural Networks: An Overview, Perry Moerland and Emile Fiesler, in: Proceedings of the Fifth International Conference on Microelectronics for Neural Networks and Fuzzy Systems: MicroNeuro'96, EPFL and CSEM, Lausanne, Switzerland, IEEE Computer Society Press, 1996

Image Classification by Neural Networks for the Quality Control of Watches, Miguel Moreira, Emile Fiesler and Gianni Pante, in: Proceedings ISAI /IFIS 1996, ITESM, Cancun, Mexico, ITESM, 1996

Learning to recognise talking faces, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Proceedings of the International Conference on Pattern Recognition (ICPR'96), IAPR, 1996

Locating and tracking facial speech features, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Proceedings of the International Conference on Pattern Recognition (ICPR'96), IAPR, 1996

Multi-modal person verification tools using speech and images, M. Acheroy et al., in: European Conference on Multimedia Applications, Services and Techniques, 1996

Neural Network Pruning and Pruning Parameters, Georg Thimm and Emile Fiesler, in: The 1st Workshop on Soft Computing, Dept. of Information Electronics Nagoya University, 1996

New time-frequency derived cepstral coefficients for automatic speech recognition, Hubert Wassner and Gérard Chollet, in: Proceedings of the 8th European Signal Processing Conference (Eusipco'96), 1996

Overcoming Inaccuracies in Optical Multilayer Perceptrons, Perry Moerland, Emile Fiesler and Indu Saxena, in: Proceedings of the First International Symposium on Neuro-Fuzzy Systems (AT'96), Lausanne, Switzerland, AATI, 1996

Polycost Database, Dominique Genoud, Jean Hennebert and H. Melin, 1996

Secured vocal access to telephone servers, Olivier Bornet, Gérard Chollet, Jean-Luc Cochard, Andrei Constantinescu and Dominique Genoud, in: Proceedings of IVTTA 1996 IEEE Third Workshop Interactive Voice Technology for Telecommunications Applications, 1996

Semi-automatic HMM-based annotation of the PolyCOST Database, Dijana Petrovska-Delacretaz, Jean Hennebert, Dominique Genoud and Gérard Chollet, in: Application of speaker recognition techniques in telephony, COST250, 1996

Sparse Initial Topologies for High Order Perceptrons, Andrea De Pol, Georg Thimm and Emile Fiesler, in: Proceedings of the International Conference on Neural Networks, IEEE, 1996

Speachreading using shape and intensity information, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), 1996

Speaker identification by lipreading, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), 1996

Statistical lip modelling for visual speech recognition, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Proceedings of the 8th European Signal Processing Conference (Eusipco'96), 1996

Sun Workstation and SwissNet Platform for Speech Recognition and Speaker Verification over the Telephone, Andrzej Drygajlo, Jean-Luc Cochard, Gérard Chollet, Olivier Bornet and Philippe Renevey, in: Proceedings of Workstations und ihre Anwendungen, SIWORK'96, 1996

Superceptron Construction, R. Visscher, Emile Fiesler and Georg Thimm, in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996

Swiss PolyPhone and PolyVar: Building Databases for Speech Recognition and Speaker Verification, Andrei Constantinescu and Gérard Chollet, in: Proceedings of The 3rd Slovenian-German and 2nd SDRV Workshop, Speech and Image Understanding, 1996

Traitement préliminaire de l'image d'un texte manuscrit en vue de sa reconnaissance: une méthode de sur-segmentation, Gilbert Maître, Stéphane Brunet and Gianni Pante, in: 4eme Colloque National sur l'A?crit et le Document (CNED'96), 1996

Un système prédictif de la structuration syntaxico-rythmique d'un énoncé à l'aide d'informations prosodiques, Philippe Langlais, Henri Méloni and Jean-Luc Cochard, in: Proceedings of JEP'96: XXIemes Journees d'etude sur la Parole, 1996

Validating Different Flexible Vocabulary Approaches on the Swiss French PolyPhone and PolyVar databases, Andrei Constantinescu, Olivier Bornet, Gilles Caloz and Gérard Chollet, in: Proceedings of ICSLP 96, 1996

Visual Speech Recognition using Active Shape Models and Hidden Markov Models, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96), 1996

A graphical tool for monitoring Oz objects activity, Jean-Luc Cochard and Dinh Van Linh Nguyen, in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995

A study of Intra- and Inter-Speaker Variability in the Voices of Twins for Speaker Verification, Gérard Chollet and M. Homayounpour, in: International Congress of Phonetic Sciences, 1995

Boolean Logic Inspired High Order Perceptron Construction, Andrea De Pol, Georg Thimm and Emile Fiesler, in: SIPAR Workshop'95 Parallel and Distributed Systems, SIPAR SI Group for Parallel Systems, Biel School of Engineering, Computer Science Department, 1995

Discrimination of the voices of twins and siblings for speaker verification, Gérard Chollet and M. Homayounpour, in: 4th European Conference on Speech Communication and Technology, 1995

Environnement multi-agents de reconnaissance automatique de la parole en continu, Jean-Luc Cochard and Philippe Froidevaux, in: Actes des 3emes Journees Francophones sur l'Intelligence Artificielle Distribuee et les Systemes Multi-agents, 1995

ETC\_vérif, a Prototype of a Cooperative Automatic Speech Recognition System, Jean-Luc Cochard and Murielle Vial, in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995

Evaluating pruning methods, Georg Thimm and Emile Fiesler, in: 1995 International Symposium on Artificial Neural Networks (ISANN'95), 1995

Gain Elimination form Backpropagation Neural Networks, Georg Thimm, Emile Fiesler and Perry Moerland, in: Proceedings of the International Conference on Neural Networks, IEEE, Perth, IEEE, 1995

Handwriting Recognition, Thomas M. Breuel, in: Second Asian Conference on Computer Vision (ACCV'95,',','), Singapore, 1995

Lexical filtrering by means of prosodic information, Frédéric Béchet, Philippe Langlais and Henri Méloni, in: International Congress of Phonetic Sciences, 1995

Microprosodic study of isolated French word corpora, Philippe Langlais, in: 4th European Conference on Speech Communication and Technology, 1995

Neural nets approaches to Speaker Verification: comparison with Second Order Statistical Measure, Gérard Chollet and M. Homayounpour, in: ICASSP, 1995

Non-Ontogenic Sparse Neural Networks, D. Elizondo, Emile Fiesler and Jerzy Korczak, in: Proceedings of the International Conference on Neural Networks, IEEE, IEEE, 1995

Ontogenic High Order Cauchy Machines, S. Cuche and Emile Fiesler, in: Proceedings of the SIPAR Workshop '95: Parallel and Distributed Systems, Biel School of Engineering, 1995

Optical Multilayer Perceptrons based on Liquid Crystal Devices, Indu Saxena, Emile Fiesler, N. Collings and A. R. Pourzand, in: Optics and Information, Cercle SFO/SEE d'Opto-informatique, Mulhouse, France, European Optical Society (EOS), 1995

Reliability in a Multi-agent Spoken Language Recognition System, Jean-Luc Cochard and Olivier Oppizzi, in: 4th European Conference on Speech Communication and Technology, 1995

Swiss-French Polyphone: a Telephone Speech Database to develop Interactive Voice Servers, Gérard Chollet, Jean-Luc Cochard, Philippe Langlais and R. van Kommer, in: Linguistic Databases, 1995

The Effects of Optical Thresholding in Backpropagation Neural Networks, Perry Moerland, Emile Fiesler and Indu Saxena, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'95 and NeuroNimes'95), ENNS, Paris, France, EC2 & Cie, 1995

The use of prosodic agents in a cooperative automatic speech recognition system, Philippe Langlais and Jean-Luc Cochard, in: International Congress of Phonetic Sciences, 1995

A system for the off-line recognition of handwritten text, Thomas M. Breuel, in: International Conference on Pattern Recognition (ICPR,',','), Jerusalem, 1994

Design and Implementation of a System for the Recognition of Handwritten Responses on US Census Forms, Thomas M. Breuel, in: IAPR Workshop on Document Analysis Systems, 1994

Modular Object-Oriented Neural Network Simulators and Topology Generalizations, Georg Thimm, R. Grau and Emile Fiesler, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN 94), Sorrento, Italy, Springer-Verlag, 1994

Results on the Steepness in Backpropagation Neural Networks, Perry Moerland, Georg Thimm and Emile Fiesler, in: Proceedings of the '94 SIPAR-Workshop on Parallel and Distributed Computing, SI Group for Parallel Systems, 1994

Weight Initialization for High Order and Multilayer Perceptrons, Georg Thimm and Emile Fiesler, in: Proceedings of the '94 SIPAR--Workshop on Parallel and Distributed Computing, SI Group for Parallel Systems, 1994

Do Backpropagation trained neural networks have normal weight distributions?, I Bellido and Emile Fiesler, in: International Conference on Artificial neural Networks, 1993

Higher-Order Statistics in Visual Object Recognition, Thomas M. Breuel, in: Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 1993

Recognition of Handprinted Digits using Optimal Bounded Error Matching, Thomas M. Breuel, in: International Conference on Document Analysis and Retrieval (ICDAR,',','), Tsukuba Science City, Japan, 1993