Idiap - Idiap Publications

Grey-Box RC Building Models for Intelligent Management of Large-Scale Energy Flexibility: From Mass Modeling to Decentralized Digital Twins, Leonardo A. Bisogno Bernardini, Jérôme Kämpf, Umberto Desideri, Francesco Leccese and Giacomo Salvadori, in: Energies, 19(1), 2025

[DOI]
[URL]

Analogical Structure, Minimal Contextual Cues and Contrastive Distractors: Input Design for Sample-Efficient Linguistic Rule Induction, Chunyang Jiang and Paola Merlo, in: arXiv cs.CL.2511.10441, 2025

Leveraging Untranscribed Data for End-to-End Speech and Callsign Recognition in Air-Traffic Communication, Petr Motlicek, Shashi Kumar, Driss Khalil, Amrutha Prasad and Schüpbach Christof, in: SESAR Innovation Days 2025 (https://www.sesarju.eu/SIDS2025), Eurocontrol, Bled, Slovenia, 2025

[URL]

Towards Integrated Processing of Physiological Signals and Speech, Zohreh Mostaani, Ecole polytechnique fédérale de Lausanne (EPFL), 2025

[DOI]
[URL]

Advancing Phonology-Based Sign Language Assessment: From Learner to Machine-Generated Videos, Neha Tarigopula, Ecole polytechnique fédérale de Lausanne (EPFL), 2025

[DOI]
[URL]

Measuring negative emotions and stress through acoustic correlates in speech: A systematic review, Lilien Schewski, Mathew Magimai-Doss, Guido Beldi and Sandra Keller, in: PLoS One, 20(7), 2025

[DOI]

UpSMART: five years of digital innovation in cancer clinical research---achievements, challenges, and recommendations, Paul O'Regan, Fouziah Butt, Louise Carter, Donna M. Graham, Anja Le Blanc, Richard Hoskins, Laura Stephenson, Akshita Patil, Muhammad Shabbir, Dilan Eken, Subir Singh, Andrea Villa, Luca Agnelli, Silvia Damian, Christopher Grave, Giulia Pretelli, Elena Garralda, Hannah Frost, Filippo de Braud, Andre Freitas, Caroline Dive and Harriet Unsworth, in: Frontiers in Digital Health, 7, 2025

[DOI]

SylloBio-NLI: Evaluating Large Language Models on Biomedical Syllogistic Reasoning, Magdalena Wysocka, Danilo Carvalho, Oskar Wysocki, Marco Valentino and Andre Freitas, in: Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025

Inductive Learning of Logical Theories with LLMs: A Complexity-graded Analysis, João Gandarela, Danilo Carvalho and Andre Freitas, in: The 39th Annual AAAI Conference on Artificial Intelligence, 2025

Synergy and diversity in CLIP: Enhancing performance through adaptive backbone ensembling, Cristian Rodriguez-Opazo, Ehsan Abbasnejad, Damien Teney, Hamed Damirchi, Edison Marrese-Taylor and Anton van den Hengel, in: International Conference on Learning Representations, 2025

Bayesian low-rank learning (Bella): A practical approach to bayesian neural networks, Bao Gia Doan, Afshar Shamsi, Xiao-Yu Guo, Arash Mohammadi, Hamid Alinejad-Rokny, Damien Teney, Damith Ranasinghe and Ehsan Abbasnejad, in: Proceedings of the AAAI Conference on Artificial Intelligence, 2025

Mysteries of the Deep: Role of Intermediate Representations in Out of Distribution Detection, Ignacio Meza De la Jara, Cristian Rodriguez-Opazo, Damien Teney, Damith Ranasinghe and Ehsan Abbasnejad, in: Advances in neural information processing systems, 2025

Subtask C: Tools and methods to leverage the thermal demand response potential in buildings connected to thermal networks, Hicham Johra, Markus Schaffer, Daniel Leiria, Roberto Boghetti, Elisa Guelpa, Christopher Graf, Benedetto Nastasi, Ingo Leusbrock, Anders Rhiger Hansen, Stefano Mazzoni, Salam Al-Saegh, Qian Wang, Yangzhe Chen, Zeng Peng, Jad Al Koussa, Steffen Petersen and Jérôme Kämpf, in: EBC Annex 84: Demand Management of Buildings in Thermal Networks, Aalborg University, Denmark, 2025

[DOI]

Subtask D: Description and comparative analysis of case studies, Christopher Graf, Anna Cadenbach, Ruben Otte, Anna Marszal-Pomianowska, Elisa Guelpa, Vittorio Verda, Ingo Leusbrock, Demet Suna, Ralf-Roman Schmidt, Ole Michael Jensen, Laura Lehmann, Clemens Felsmann, Axel Oliva, Toke Haunstrup Bach Christensen, Jad Al Koussa, Tijs van Oevelen, Dirk Vanhoudt, Michele Tunzi, Roberto Boghetti and Jérôme Kämpf, in: EBC Annex 84: Demand Management of Buildings in Thermal Networks, Aalborg University, Denmark, 2025

[DOI]

Assessing the reliability of archetype-based Urban Building Energy Simulations: A case study analysis in Turin (Italy), Matteo Piro, Jérôme Kämpf, Ilaria Ballarini and Vincenzo Corrado, in: Journal of Physics: Conference Series, pages 062028, IOP Publishing, 2025

[DOI]
[URL]

OpenBEERS: A digital platform for urban scale simulation of building energy efficiency, David Geissbuhler, Alejandro Pena-Bello, Jérôme Kämpf and Jakob Rager, in: Journal of Physics: Conference Series, pages 042013, IOP Publishing, 2025

[DOI]
[URL]

Listening to Hypoglycemia: Voice as a Biomarker for Detection of a Medical Emergency Using Machine Learning, Vera Lehmann, Martin Hilpert, Zohreh Mostaani, Sevada Hovsepyan, Esmé Wallace, Colombine Verzat, Stefan Feuerriegel, Mathias Kraus, James Rosenthal, Gürkan Yilmaz, Mathew Magimai-Doss and Christoph Stettler, in: Diabetes Care, 2025

[DOI]

An evidence-based guidance framework for neural network system diagrams, Guy Marshall, Andre Freitas and Caroline Jay, in: PLOS One, 2025

Montague semantics and modifier consistency measurement in neural language models, Danilo Carvalho, Edoardo Manino, Julia Rozanova, Lucas Cordeiro and Andre Freitas, in: 31st International Conference on Computational Linguistics, 2025

Effective Graph and Rank-based Contextual Embeddings for Textual and Multimedia Data, Thiago Almeida, Gustavo Leticio, Lucas Pascotti, Andre Freitas and Daniel Pedronette, in: International Joint Conference on Neural Networks, 2025

TableDC: Deep Clustering for Tabular Data, Hafiz Rauf, Andre Freitas and Norman Paton, in: ACM SIGMOD International Conference on Management of Data, 2025

Gem: Gaussian Mixture Model Embeddings for Numerical Feature Distributions, Hafiz Rauf, Alex Bogatu, Norman Paton and Andre Freitas, in: 8th International Conference on Extending Database Technology, 2025

Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations, Leonardo Ranaldi, Marco Valentino and Andre Freitas, in: Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025

Controlling Equational Reasoning in Large Language Models with Prompt Interventions, Jordan Meadows, Marco Valentino and Andre Freitas, in: The 39th Annual AAAI Conference on Artificial Intelligence, 2025

A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models, Geonhee Kim, Marco Valentino and Andre Freitas, in: Findings of the ACL, 2025

PEIRCE: Unifying Material and Formal Reasoning via LLM-Driven Neuro-Symbolic Refinement, Xin Quan, Marco Valentino, Danilo Carvalho, Dhairya Dalal and Andre Freitas, in: Demonstration at 63rd Annual Meeting of the Association for Computational Linguistics, 2025

Improving chain-of-thought reasoning via quasi-symbolic abstractions, Leonardo Ranaldi, Marco Valentino, Alexander Polonsky and Andre Freitas, in: 63rd Annual Meeting of the Association for Computational Linguistics, 2025

Faithful and Robust LLM-Driven Theorem Proving for NLI Explanations, Xin Quan, Marco Valentino, Louise Dennis and Andre Freitas, in: 63rd Annual Meeting of the Association for Computational Linguistics, 2025

MASA: A Modular Framework for LLM-Driven Multi-Agent Systems for Autoformalization, Lan Zhang, Marco Valentino and Andre Freitas, in: Demonstration at the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

TRACE: Training and Inference-Time Interpretability Analysis for Language Models, Nura Aljaafari, Danilo Carvalho and Andre Freitas, in: Demonstration at the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

CARMA: Enhanced Compositionality in LLMs via Advanced Regularisation and Mutual Information Alignment, Nura Aljaafari, Danilo Carvalho and Andre Freitas, in: The 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Formalizing Complex Mathematical Statements with LLMs: A Study on Mathematical Definitions, Lan Zhang, Marco Valentino and Andre Freitas, in: The 2025 Conference on Empirical Methods in Natural Language Processing (best resource paper award), 2025

Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering, Marco Valentino, Geonhee Kim, Dhairya Dalal, Zhixue Zhao and Andre Freitas, in: The 40th Annual AAAI Conference on Artificial Intelligence, 2026

Nexus: An Omni-Perceptive And-Interactive Model for Language, Audio, And Vision, Che Liu, Yingji Zhang, Dong Zhang, Weijie Zhang, Chenggong Gong, Haohan Li, Yu Lu, Shilin Zhou, Yue Lu, Ziliang Gan, Ziao Wang, Junwei Liao, Haipang Wu, Ji Liu, Andre Freitas, Qifan Wang, Zenglin Xu, Rongjunchen Zhang and Yong Dai, in: ACM Multimedia, 2025

Quasi-symbolic Semantic Geometry over Transformer-based Variational AutoEncoders, Yingji Zhang, Danilo Carvalho and Andre Freitas, in: 29th Conference on Computational Natural Language Learning (nominated for a best paper award), 2025

LangVAE and LangSpace: Building and Probing for Language Model VAEs, Danilo Carvalho, Yingji Zhang, Harriet Unsworth and Andre Freitas, in: Demonstration at the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025

Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study, Yingji Zhang, Marco Valentino, Danilo Carvalho and Andre Freitas, in: The 40th Annual AAAI Conference on Artificial Intelligence, 2026

3D Face Morph Generation Using Geometry-Aware Template Inversion, Hatef Otroshi Shahreza, Laurent Colbois and Sébastien Marcel, in: 2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP), 2025

[DOI]
[URL]

Generating Synthetic Face Recognition Datasets Using Brownian Identity Diffusion and a Foundation Model, Hatef Otroshi Shahreza and Sébastien Marcel, in: 2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP), 2025

[DOI]
[URL]

Validation of two distinct simulation models of district heating networks: application to efficient looping analysis, Dubon Rodrigue, Roberto Boghetti, Jérôme Kämpf, Bastien Pasdeloup, Mohamed T. Mabrouk, Patrick Meyer and Bruno Lacarrière, in: Journal of Physics: Conference Series, pages 042021, IOP Publishing, 2025

[DOI]
[URL]

Learning problem decomposition for efficient sequential multi-object manipulation planning, Yan Zhang, Teng Xue, Amirreza Razmjoo Fard and Sylvain Calinon, in: IEEE Robotics and Automation Letters, 2025

Sampling-Based Constrained Motion Planning with Products of Experts, Amirreza Razmjoo, Teng Xue, Suhan Shetty and Sylvain Calinon, in: International Journal of Robotics Research, 2025

Neural Image Abstraction Using Long Smoothing B-splines, D. Berio, M. Stroh, Sylvain Calinon, F. F. Leymarie, O. Deussen and A. Shamir, in: ACM Transactions on Graphics (ToG), 2025

Study of Full-View Finger Vein Biometrics on Redundancy Analysis and Dynamic Feature Extraction, Junduan Huang, Sushil Bhattacharjee, Sébastien Marcel and Wenxiong Kang, in: IEEE Transactions on Information Forensics and Security, 2025

[DOI]
[URL]

Testing the assumptions about the geometry of sentence embedding spaces: the cosine measure need not apply, Vivi Nastase and Paola Merlo, in: arXiv, 2025

[URL]

Responses to Past-Behavior Questions in Face-To-Face and Asynchronous Video Interviews: Storytelling, Interview Performance and Criterion Validity, Elisabeth Germanier, Adrian Bangerter, Koralie Orji, Laetitia A. Renier, Marianne Schmid Mast, Mutian He and Philip N. Garner, in: Human Performance, 38(5):284-298, 2025

[DOI]

Towards Leveraging Sequential Structure in Animal Vocalizations, Eklavya Sarkar and Mathew Magimai-Doss, in: Neural Information Processing Systems workshop: AI for Non-Human Animal Communication, 2025

EdgeDoc: Hybrid CNN-Transformer Model for Accurate Forgery Detection and Localization in ID Documents, Anjith George and Sébastien Marcel, in: ICCV, 2025

Optimizing Supply Temperature Control in District Heating Networks via Differentiable Dynamic Simulation and Gradient Descent, Roberto Boghetti and Jérôme Kämpf, in: Construction, Energy, Environment and Sustainability. Proceedings of CEES 2025 (Volume 2: Energy), Springer Singapore, 2026

[DOI]
[URL]

CCDP: Model-free Failure Recovery via Guided Diffusion Sampling, Amirreza Razmjoo, Sylvain Calinon, Michael Gienger and Fan Zhang, in: Workshop on The Art of Robustness: Surviving Failures in Robotics, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2025

ERP Signals During Speech Articulation: Does Auditory Feedback Mask Other Ongoing Cognitive-motor Processes?, Michael De Pretto, Marina Laganaro and Ina Kodrasi, in: Brain Topography, 38(5), 2025

Towards interpretable emotion recognition: Identifying key features with machine learning, Yacouba Kaloga and Ina Kodrasi, in: Forum Acusticum/EuroNoise, Malaga, Spain, 2025

Overview of Automatic Speech Analysis and Technologies for Neurodegenerative Disorders: Diagnosis and Assistive Applications, Sheikh Shakeel, Md Sahidullah and Ina Kodrasi, in: IEEE Journal of Selected Topics in Signal Processing, 2025

Multiview Canonical Correlation Analysis for Automatic Pathological Speech Detection, Yacouba Kaloga, Sheikh Shakeel and Ina Kodrasi, in: International Conference on Acoustics, Speech and Signal Processing, Hyderabad, India, IEEE, 2025

Graph Neural Networks for Parkinson's Disease Detection, Sheikh Shakeel, Yacouba Kaloga, Md Sahidullah and Ina Kodrasi, in: International Conference on Acoustics, Speech and Signal Processing, Hyderabad, India, IEEE, 2025

Exploring In-Context Learning Capabilities of ChatGPT for Pathological Speech Detection, Mahdi Amiri, Hatef Otroshi Shahreza and Ina Kodrasi, in: ITG Conference on Speech Communication, IEEE, 2025

RAGferee: Building Contextual Reward Models for Retrieval-Augmented Generation, Andrei Catalin Coman, Ionuț-Teodor Sorodoc, Leonardo F. R. Ribeiro, Bill Byrne, James Henderson and Adrià de Gispert, in: Empirical Methods in Natural Language Processing, 2025

[URL]

Fast-and-Frugal Text-Graph Transformers are Effective Link Predictors, Andrei Catalin Coman, Christos Theodoropoulos, Marie-Francine Moens and James Henderson, in: Findings of the Association for Computational Linguistics, 2025

[URL]

From forest to zoo: great ape behavior recognition with ChimpBehave, Michael Fuchs, Emilie Genty, Adrian Bangerter, Klaus Zuberbühler, Jean-Marc Odobez and Paul Cotofrei, in: International Journal of Computer Vision, 133:6668–6688, 2025

[DOI]

Multilingual vs. monolingual transformer models in encoding linguistic structure and lexical abstraction, Vivi Nastase, Giuseppe Samo, Chunyang Jiang and Paola Merlo, in: CLiC-it 2025: Eleventh Italian Conference on Computational Linguistics, September 24 ? 26, 2025, Cagliari, Italy, 2025

[URL]

Identifying storytelling in job interviews using deep learning, Elisabeth Germanier, Mutian He, Amina Mardiyyah Rufai, Philip N. Garner, Adrian Bangerter, Laetitia A. Renier, Marianne Schmid Mast and Koralie Orji, in: Computers in Human Behavior Reports, 19(100688), 2025

[DOI]

Robot Manipulation with Geometric Algebra: A Unified Geometric Framework for Control and Optimization, Tobias Löw, EDEE, 2025

Distilling Contact Planning for Fast Trajectory Optimization in Robot Air Hockey, Julius Jankowski, Ante Marić, Puze Liu, Davide Tateo, Jan Peters and Sylvain Calinon, in: Proceedings of Robotics: Science and Systems, 2025

[DOI]
[URL]

Geometric Structures for Learning and Optimization in Robotics, Sylvain Calinon, in: Annual Review of Control, Robotics, and Autonomous Systems., 2025

Ergodic exploration of dynamic distribution, L. Lanča, K. Jakac, Sylvain Calinon and S. Ivić, in: IEEE Robotics and Automation Letters (RA-L), 2025

Efficient and Real-Time Motion Planning for Robotics Using Projection-Based Optimization, Xuemin Chi, Hakan Girgin, Tobias Löw, Yangyang Xie, Teng Xue, Jihao Huang, Zhitao Liu and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

Robust Pushing: Exploiting Quasi-static Belief Dynamics and Contact-informed Optimization, Julius Jankowski, Lara Brudermuller, Nick Hawes and Sylvain Calinon, in: International Journal of Robotics Research (IJRR), 2025

Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration, J. Liu, Z. Li, M. Yu, Z. Dong, Sylvain Calinon, D. G. Caldwell and F. Chen, in: IEEE Robotics and Automation Magazine (RAM), 32(1):68-78, 2025

ManiDP: Manipulability-Aware Diffusion Policy for Posture-Dependent Bimanual Manipulation, Z. Li, J. Liu, D. Li, T. Teng, M. Li, Sylvain Calinon, D. G. Caldwell and F. Chen, in: In Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

Whole-Body Impedance Control of a Humanoid Robot Based on Human-Human Demonstration for Human-Robot Collaboration, C. Li, J. Liu, T. Teng, S. Wang, Sylvain Calinon and F. Chen, in: In Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

Image-driven robot drawing with rapid lognormal movements, D. Berio, G. Clivaz, M. Stroh, O. Deussen, R. Plamondon, Sylvain Calinon and F. F. Leymarie, in: In Proc. IEEE Intl Symp. on Robot and Human Interactive Communication (Ro-Man), 2025

Differentiable rasterization of minimum-time sigma-lognormal trajectories, D. Berio, Sylvain Calinon, R. Plamondon and F. F. Leymarie, in: In Proc. 22nd Conference of the International Graphonomics Society (IGS), 2025

Movement Generation and Drawing in Robotics, Sylvain Calinon, in: In Proc. 22nd Conference of the International Graphonomics Society (IGS), 2025

From Movement Primitives to Distance Fields to Dynamical Systems, Yiming Li and Sylvain Calinon, in: IEEE Robotics and Automation Letters (RA-L), 10(9), 2025

A Smooth Analytical Formulation of Collision Detection and Rigid Body Dynamics with Contact, Onur Beker, Nico Gürtler, Ji Shi, Andreas René Geist, Amirreza Razmjoo, George Martius and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

CCDP: Composition of Conditional Diffusion Policies with Guided Sampling, Amirreza Razmjoo, Sylvain Calinon, Michael Gienger and Fan Zhang, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

Towards Accessible and Intuitive Shared Autonomy, Shalutha Rajapakshe, in: Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, 2025

[URL]

Giving Sense to Inputs: Toward an Accessible Control Framework for Shared Autonomy, Shalutha Rajapakshe, Jean-Marc Odobez and Emmanuel Senft, in: Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, Melbourne, Australia, ACM, 2025

[URL]

Transformers Pretrained on Procedural Data Contain Modular Structures for Algorithmic Reasoning, Zachary Shinnick, Liangze Jiang, Hemanth Saratchandran, Anton van den Hengel and Damien Teney, in: ICML 2025 Workshop on Methods and Opportunities at Small Scale, 2025

Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild, Damien Teney, Liangze Jiang, Florin Gogianu and Ehsan Abbasnejad, in: The IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable?, Liangze Jiang and Damien Teney, in: Forty-Second International Conference on Machine Learning, 2025

MM-HSD: Multi-Modal Hate Speech Detection in Videos, Berta Céspedes-Sarrias, Carlos Collado-Capell, Pablo Rodenas-Ruiz, Olena Hrynenko and Andrea Cavallaro, in: Proceedings of the 33rd ACM International Conference on Multimedia (MM'25), October 27-31, 2025, Dublin, Ireland., 2025

[DOI]

Chain-of-Model Learning for Language Model, Kaitao Song, Xiaohua Wang, Xu Tan, Huiqiang Jiang, Chengruidong Zhang, Yongliang Shen, Cen Lu, Zihao Li, Zifan Song, Caihua Shan, Yansen Wang, Kan Ren, Xiaoqing Zheng, Tao Qin, Yuqing Yang, Dongsheng Li and Lili Qiu, in: 39th Conference on Neural Information Processing Systems, 2025

CoRet: Improved Retriever for Code Editing, Fabio Fehr, in: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

[DOI]
[URL]

Fine-Tuning Pretrained Models with NVIB for Improved Generalisation, Fabio Fehr, Alina Elena Baia, Xiaoguang Chang, Andrei Catalin Coman, Karl El Hajal, Dina El Zein, Shashi Kumar, Juan Zuluaga-Gomez, Andrea Cavallaro, Damien Teney and James Henderson, in: Workshop on Spurious Correlation and Shortcut Learning: Foundations and Solutions, 2025

[URL]

Nonparametric Variational Information Bottleneck: Attention-based Architectures as Latent Variable Models, Fabio Fehr, EPFL, 2025

[URL]

Idiap kNN-TTS System for the Blizzard Challenge 2025, Enno Hermann, Karl El Hajal, Ajinkya Kulkarni and Mathew Magimai-Doss, in: Blizzard Challenge Workshop, 2025

Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering, Andrés Carofilis, Pradeep Rangappa, Srikanth Madikeri, Shashi Kumar, Sergio Burdisso, Jeena Prakash, Esaú Villatoro-Tello, Petr Motlicek, Bidisha Sharma, Kadri Hacioğlu, Shankar Venkatesan, Saurabh Vyas and Andreas Stolcke, in: Interspeech 2025, Rotterdam, The Netherlands, pages 3618--3622, 2025

[DOI]
[URL]

The Greatest Challenge For Startups: Computational Text Analysis on Swiss Ventures, Takahiro Inada, Esaú Villatoro-Tello, Jung Park, Jim Pulcrano and Benoit F. Leleux, in: Academy of Management Proceedings 2025., 2025

[URL]

Unifying Global and Near-Context Biasing in a Single Trie Pass., Thorbecke Iuliia, Esaú Villatoro-Tello, Juan Zuluaga-Gomez, Shashi Kumar, Sergio Burdisso, Pradeep Rangappa, Andrés Carofilis, Srikanth Madikeri, Petr Motlicek, Karthik Pandia D S, Kadri Hacioğlu and Andreas Stolcke, in: Text, Speech, and Dialogue. TSD 2025. Lecture Notes in Computer Science, Springer, Springer, 2025

[DOI]
[URL]

DeepID Challenge of Detecting Synthetic Manipulations in ID Documents, Pavel Korshunov, Vidit Vidit, Amir Mohammadi, Christophe Ecabert, Nevena Shamoska, Sébastien Marcel, Zeqin Yu, Ye Tian, Jiangqun Ni, Lazar Lazarevic, Renat Khizbullin, Anastasiia Evteeva, Alexey Tochin, Aleksei Grishin, Anjith George, Daniel DeAlcala, Tamas Endrei, Javier Munoz-Haro, Ruben Tolosana, Ruben Vera-Rodriguez, Aythami Morales, Julian Fierrez, Gyorgy Cserey, Hardik Sharma, Sachin Chaudhary, Akshay Dudhane, Praful Hambarde, Amit Shukla, Prateek Shaily, Jayant Kumar, Ajinkya Hase, Satish Maurya, Mridul Sharma and Pallav Dwivedi, in: International Conference on Computer Vision (ICCV), 2025

Unified and Multimodal Learning for Gaze Prediction in Naturalistic Settings, Anshul Gupta, EPFL, 2025

EdgeDoc: Hybrid CNN-Transformer Model for Accurate Forgery Detection and Localization in ID Documents, Anjith George and Sébastien Marcel, Idiap-RR-08-2025

Tokenwise Contrastive Speech and Text Pre-Training for Speech Emotion Recognition, Eklavya Sarkar and Neha Tarigopula, Idiap-RR-07-2025

Decoding community proximity discourse: A mixed-methods comparative analysis of online local and national newspapers in Romandy, Switzerland, Victor Bros and Daniel Gatica-Perez, in: PLOS One, 2025

[DOI]
[URL]

Investigation of accuracy and bias in face recognition trained with synthetic data, Pavel Korshunov, Ketan Kotwal, Christophe Ecabert, Vidit Vidit, Amir Mohammadi and Sébastien Marcel, in: Proceedings of IEEE International Joint Conference on Biometrics, 2025

Transferability of Learnt Speech Representations for Decoding Non-Human Vocal Communication, Eklavya Sarkar, Ecole Polytechnique Fédérale de Lausanne, 2025

Enhancing Domain Diversity in Synthetic Data Face Recognition with Dataset Fusion, Anjith George and Sébastien Marcel, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2025

Second FRCSyn-onGoing: Winning Solutions and Post-Challenge Analysis to Improve Face Recognition with Synthetic Data, Hatef Otroshi Shahreza, Anjith George, Rahimi Parsa, Alexander Unnervik and Sébastien Marcel, in: Information Fusion, 2025

xEdgeFace: Efficient Cross-Spectral Face Recognition for Edge Devices, Anjith George and Sébastien Marcel, in: International Joint Conference on Biometrics (IJCB 2025), IEEE., 2025

The Invisible Threat: Evaluating the Vulnerability of Cross-Spectral Face Recognition to Presentation Attacks, Anjith George and Sébastien Marcel, in: International Joint Conference on Biometrics (IJCB 2025), IEEE., 2025

Improving ASR and Callsign Detection in Air Traffic Control Speech using Whisper Prompting, Jehan Joachim Daniel Piaget, Amrutha Prasad and Petr Motlicek, Idiap-RR-04-2025

Face morphing attacks in the era of deepfakes: risks, detection & source attribution, Laurent Colbois, Université de Lausanne, 2025

On the Nature of Explanation: An Epistemological-Linguistic Perspective for Explanation-Based Natural Language Inference, Marco Valentino and Andre Freitas, in: Philosophy & Technology, 2025

Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion, Ajinkya Kulkarni, Dowerah Sandipana, Mathew Magimai-Doss and Tanel alumae, in: Proceedings of Interspeech, 2025

Multimodal Prosody Modeling: A Use Case for Multilingual Sentence Mode Prediction, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2025

STM-GNN: Space-Time-and-Memory Graph Neural Networks for Predicting Multi-Drug Resistance Risks in Dynamic Patient Networks, Damien Geissbuhler, Alban Bornet, Catarina Marques, André Anjos, Sónia Pereira and Douglas Teodoro, in: International Conference on Artificial Intelligence in Medicine, Pavia, Italy, 2025

[DOI]

HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims, Michiel van der Meer, Pavel Korshunov, Sébastien Marcel and Lonneke van der Plas, in: The 63rd Annual Meeting of the Association for Computational Linguistics, 2025

Autocrime - open multimodal platform for combating organized crime, Srikanth Madikeri, Petr Motlicek, Dairazalia Sanchez-Cortes, Pradeep Rangappa, Joshua Hughes, Jacob Tkaczuk, Alejandra Sanchez Lara, Driss Khalil, Johan Rohdin, Dawei Zhu, Aravind Krishnan, Dietrich Klakow, Zahra Ahmadi, Marek Kovac, Dominik Boboš, Costas Kalogiros, Andreas Alexopoulos and Denis Marraud, in: Forensic Science International: Digital Investigation, 54, 2025

[DOI]
[URL]

Robust Contact-rich Manipulation through Implicit Motor Adaptation, Teng Xue, Amirreza Razmjoo Fard, Suhan Shetty and Sylvain Calinon, in: International Journal of Robotics Research, 2025

Leveraging Sequential Structure in Animal Vocalizations, Eklavya Sarkar and Mathew Magimai-Doss, Idiap-RR-06-2025

Adaptation of Speech and Bioacoustics Models, Eklavya Sarkar, Amir Mohammadi and Mathew Magimai-Doss, Idiap-RR-05-2025

Soft Skills in the Wild: Challenges in Multilingual Classification, Laura Vásquez-Rodríguez, Bertrand Audrin, Samuel Michel, Samuele Galli, Julneth Rogenhofer, Jacopo Negro Cusa and Lonneke van der Plas, in: Proceedings of the 10th edition of the Swiss Text Analytics Conference, 2025

The Suisse Romande Local News Dataset, Victor Bros and Daniel Gatica-Perez, in: Proceedings of the Nineteenth International AAAI Conference on Web and Social Media, 2025

On feature representations for marmoset vocal communication analysis, Eklavya Sarkar, Kaja Wierucka, Alexandra B. Bosshard, Judith Burkart and Mathew Magimai-Doss, in: Bioacoustics: The International Journal of Animal Sound and its Recording:1-15, 2025

[DOI]
[URL]

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity, Mutian He and Philip N. Garner, in: 13th International Conference on Learning Representations (ICLR), 2025

[URL]

Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels, Pierre Vuillecard and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025

Multidisciplinary characterization of embarrassment through behavioral and acoustic modeling, Dajana Šipka, Bogdan Vlasenko, Maria Stein, Thomas Dierks, Mathew Magimai-Doss and Yosuke Morishima, in: Scientific reports, 2025

Review of Demographic Bias in Face Recognition, Ketan Kotwal and Sébastien Marcel, Idiap-RR-01-2025

Posterior-based analysis of spatio-temporal features for Sign Language Assessment, Neha Tarigopula, Sandrine Tornay, Ozge Mercanoglu Sincan, Richard Bowden and Mathew Magimai-Doss, in: IEEE Open Journal of Signal Processing, 2025

[DOI]

Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, Sandrine Tornay and Mathew Magimai-Doss, in: Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications, Juan Zuluaga-Gomez, Karel Vesely, Igor Szoke, Blatt Alexander, Petr Motlicek, Martin Kocour, Khalid Choukri, Nigmatulina Iuliia, Claudia Cevenini, Allan Tart, Jan Cernocky and Dietrich Klakow, in: Journal of Data-centric Machine Learning Research, 2024

[URL]

Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models, Anjith George and Sébastien Marcel, in: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW), 2025

Exploring ChatGPT for Face Presentation Attack Detection in Zero and Few-Shot in-Context Learning, Alain Komaty, Hatef Otroshi Shahreza, Anjith George and Sébastien Marcel, in: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025

Sparse Optical Sampling in the Close Proximity of a Robotic Arm, Martin Laurenzis, Ante Marić, Emmanuel Bacher, Mateusz Pietrzak, Stéphane Schertzer, Francesco Grella and Sylvain Calinon, in: Springer Proceedings in Advanced Robotics, 2024

Minimum effort adaptation of automatic speech recognition system in air traffic management, Mrinmoy Bhattacharjee, Petr Motlicek, Srikanth Madikeri, Hartmut Helmke, Oliver Ohneiser, Matthias Kleinert and heiko Ehr, in: European Journal of Transport and Infrastructure Research, 24(4 (2024)):133–153, 2025

[DOI]
[URL]

A Bayesian Interpretation of Adaptive Low-Rank Adaptation, Haolin Chen and Philip N. Garner, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

[DOI]

TEAM SWITZERLAND SUBMISSION TO NIST SRE24 SPEAKER RECOGNITION EVALUATION, Amrutha Prasad, Hatef Otroshi Shahreza, Andrés Carofilis, Aref Farhadipour, Shiran Liu, Srikanth Madikeri, Anjith George, Petr Motlicek, Sébastien Marcel, Masoumeh Chapariniya, Valeriia Perepelytsia, Teodora Vukovic and Volker Dellwo, Idiap-RR-10-2025

TESS: Text-to-text selfconditioned simplex diffusion, Rabeeh Karimi Mahabadi, Hamish Ivison, Jaesung Tae, James Henderson, Iz Beltagy, Matthew Peters and Arman Cohan, in: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2347–2361, Association for Computational Linguistics, 2024

Formal Semantic Controls over Language Models, Danilo Carvalho, Yingji Zhang and Andre Freitas, in: LREC-COLING, 2024

Empowering Cross-lingual Abilities of Instruction-tuned Large Language Models by Translation-following demonstrations, Leonardo Ranaldi, Giulia Pucci and Andre Freitas, in: Findings of the ACL, 2024

Deep Clustering for Data Cleaning and Integration, Hafiz Rauf, Andre Freitas and Norman Paton, in: 27th International Conference on Extending Database Technology, 2024

Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks, Yingji Zhang, Danilo Carvalho and Andre Freitas, in: The 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Graph Neural Flows for Unveiling Systemic Interactions Among Irregularly Sampled Time Series, Giangiacomo Mercatali, Andre Freitas and Jie Chen, in: Thirty-Eighth Annual Conference on Neural Information Processing Systems, 2024

Diffusion Twigs with Loop Guidance for Conditional Graph Generation, Giangiacomo Mercatali, Yogesh Verma, Andre Freitas and Vikas Garg, in: Thirty-Eighth Annual Conference on Neural Information Processing Systems, 2024

Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions, Jordan Meadows, Tamsin James and Andre Freitas, in: Findings of EMNLP, 2024

Consistent Autoformalization for Constructing Mathematical Libraries, Lan Zhang, Xin Quan and Andre Freitas, in: The 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Montague semantics and modifier consistency measurement in neural language models, Danilo Carvalho, Edoardo Manino, Julia Rozanova, Lucas Cordeiro and Andre Freitas, in: 31st International Conference on Computational Linguistics, 2025

Tactile Ergodic Coverage on Curved Surfaces, Cem Bilaloglu, Tobias Löw and Sylvain Calinon, in: IEEE Transactions on Robotics (T-RO), 41:1421-1435, 2025

Temporal fine-tuning for early risk detection, Horacio Thompson, Esaú Villatoro-Tello, Manuel Montes-y-Gómez and Marcelo Errecalde, in: Memorias De Las JAIIO, Argentina, pages 137-149, 2024

[URL]

Natural Language Understanding for Navigation of Service Robots in Low-Resource Domains and Languages: Scenarios in Spanish and Nahuatl, Amadeo Hernández, Rosa M. Ortega-Mendoza, Esaú Villatoro-Tello, César Joel Camacho-Bello and Obed Pérez-Cortés, in: Mathematics, 12(8), 2024

[DOI]
[URL]

MTGS: A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction, Anshul Gupta, Samy Tafasca, Arya Farkhondeh, Pierre Vuillecard and Jean-Marc Odobez, in: 38th Conf. on Neural Information Processing System, 2024

Toward Semantic Gaze Target Detection, Samy Tafasca, Anshul Gupta, Victor Bros and Jean-Marc Odobez, in: 38th Conf. on Neural Information Processing System, 2024

Loose Social-Interaction Recognition in Real-world Therapy Scenarios, Abid Ali, Rui Dai, Ashish Marisetty, Guillaume Astruc, Monique Thonnat, Jean-Marc Odobez, Suzanne Thümmler and Francois Bremond, in: IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Weakly-supervised Autism Severity Assessment in Long Videos, Abid Ali, Mahmoud Ali, Camilla Barbini, Séverine Dubuisson, Jean-Marc Odobez, Francois Bremond and Suzanne Thümmler, in: International Conference on Content-based Multimedia Indexing, 2024

Automatic detection of the visual gaze components of joint attention in observational, naturalistic child language acquisition data, Miranda Dickerman, Anshul Gupta, Samy Tafasca, Xiaocheng Zhang, Jean-Marc Odobez and Sabine Stoll, in: Boston University Conference on Language Development, 2025

Reasoning with Natural Language Explanations, Marco Valentino and Andre Freitas, in: In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts, 2024

SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials, Mael Jullien, Marco Valentino and Andre Freitas, in: In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), 2024

Graph-Induced Syntactic-Semantic Spaces in Transformer-Based Variational AutoEncoders, Yingji Zhang, Marco Valentino, Danilo Carvalho, Ian Pratt-Hartmann and Andre Freitas, in: In Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Large Language Models, scientific knowledge and factuality: A framework to streamline human expert evaluation, Magdalena Wysocka, Oskar Wysocki, Maxime Delmas, Vincent Mutel and Andre Freitas, in: Journal of Biomedical Informatics(158), 2024

[DOI]
[URL]

An LLM-based Knowledge Synthesis and Scientific Reasoning Framework for Biomedical Discovery, Oskar Wysocki, Magdalena Wysocka, Danilo Carvalho, Alex Bogatu, Danilo Gusicuma, Maxime Delmas, Harriet Unsworth and Andre Freitas, in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL, Bangkok, Thailand, pages 355-364, 2024

[DOI]
[URL]

What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark, Adham Ibrahim, Shady Shehata, Ajinkya Kulkarni, Mukhtar Mohamed and Muhammad Abdul-Mageed, in: ISCA proceedings, Greece, 2024

[DOI]
[URL]

Exploring generalization to unseen audio data for spoofing: insights from SSL models, Atharva Kulkarni, Hoan My Tran, Ajinkya Kulkarni, Dowerah Sandipana, Damien Lolive and Mathew Magimai-Doss, in: ISCA Proceedings, Greece, 2024

[DOI]
[URL]

Unveiling Biases while Embracing Sustainability: Assessing the Dual Challenges of Automatic Speech Recognition Systems, Ajinkya Kulkarni, Atharva Kulkarni, Miguel Couceiro and Isabel Trancoso, in: ISCA proceedings, Greece, pages 4, 2024

[DOI]
[URL]

Multi-Operational Mathematical Derivations in Latent Space, Marco Valentino, Jordan Meadows, Lan Zhang and Andre Freitas, in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Inference to the Best Explanation in Large Language Models, Dhairya Dalal, Marco Valentino, Andre Freitas and Paul Buitelaar, in: In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

On the Nature of Explanation: An Epistemological-Linguistic Perspective for Explanation-Based Natural Language Inference, Marco Valentino and Andre Freitas, in: Philosophy & Technology, 2024

Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving, Xin Quan, Marco Valentino, Louise A Dennis and Andre Freitas, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models, Andre Freitas and Leonardo Ranaldi, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Aligning Large and Small Language Models via Chain-of-Thought Reasoning, Leonardo Ranaldi and Andre Freitas, in: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

SDFR: Synthetic Data for Face Recognition Competition, Hatef Otroshi Shahreza, Christophe Ecabert, Anjith George, Alexander Unnervik and Sébastien Marcel, in: IEEE FG 2024 : 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

Heterogeneous Face Recognition with Prepended Domain Transformers, Anjith George and Sébastien Marcel, in: Face Recognition Across the Imaging Spectrum, Springer, 2024

Genome scale metabolic network modelling for metabolic profile predictions, Juliette Cooke, Maxime Delmas, Cecilia Wieder, Pablo Rodriguez Mier, Clément Frainay, Florence Vinson, Timothy Ebbels, Nathalie Poupin and Fabien Jourdan, in: PLOS Computational Biology, 20(2):e1011381, 2024

[DOI]
[URL]

Building Structured Synthetic Datasets: The Case of Blackbird Language Matrices (BLMs), Paola Merlo, Giuseppe Samo, Vivi Nastase and Chunyang Jiang, in: Proceedings of the 9th Italian Conference on Computational Linguistics, 2023

Blackbird Language Matrices Tasks for Generalization, Paola Merlo, Chunyang Jiang, Giuseppe Samo and Vivi Nastase, in: Proceedings of the 1st GenBench Workshop on (Benchmarking) Generalisation in NLP, ACL, 2023

BLM-s/lE: A structured dataset of English spray-load verb alternations for testing generalization in LLMs., Giuseppe Samo, Vivi Nastase, Chunyang Jiang and Paola Merlo, in: Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

BLM-AgrF: A New French Benchmark to Investigate Generalization of Agreement in Neural Networks., Aixiu An, Chunyang Jiang, Maria A. Rodriguez, Vivi Nastase and Paola Merlo, in: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Analysing the potential of open hotel review databases for IEQ assessment: A text mining approach, Giulia Lamberti, Roberto Boghetti, Fabio Fantozzi, Francesco Leccese and Giacomo Salvadori, in: Analysing the potential of open hotel review databases for IEQ assessment: A text mining approach:1-19, 2024

[DOI]
[URL]

A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics, Puze Liu, Jonas Günster, Niklas Funk, Simon Gröger, Dong Chen, Haitham Bou-Ammar, Julius Jankowski, Ante Marić, Sylvain Calinon, Andrej Orsula, Miguel Olivares-Mendez, Hongyi Zhou, Rudolf Lioutikov, Gerhard Neumann, Amarildo Likmeta, Amirhossein Zhalehmehrabi, Thomas Bonenfant, Marcello Restelli, Davide Tateo, Ziyuan Liu and Jan Peters, in: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS Datasets and Benchmarks 2024), 2024

Learning Goal-oriented Bimanual Dough Rolling Using Dynamic Heterogeneous Graph Based on Human Demonstration, J. Liu, C. Li, S. Wang, Z. Dong, Z. Tang, Sylvain Calinon, M. Li and F. Chen, in: In Proc. IEEE Intl Conf. on Robotics and Biomimetics (ROBIO), 2024

A Minimum-Jerk Approach to Handle Singularities in Virtual Fixtures, G. Braglia, Sylvain Calinon and L. Biagiotti, in: IEEE Robotics and Automation Letters (RA-L), 9(11):10256-10263, 2024

Intuitive Robot Programming, C. Blanc, Julius Jankowski, A. Sonderegger, Sylvain Calinon and S. Dégallier Rochat, in: Ergonomics in Robotics: Advances and Innovations, Springer, 2025

Impact of Speech Mode in Automatic Pathological Speech Detection, Sheikh Shakeel and Ina Kodrasi, in: EUSIPCO, IEEE, 2024

[URL]

Are there identifiable structural parts in the sentence embedding whole?, Vivi Nastase and Paola Merlo, in: Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2024

Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification, Vivi Nastase and Paola Merlo, in: Proceedings of the 9th Workshop on Representation Learning for NLP, 2024

[URL]

Exploring Italian sentence embeddings properties through multi-tasking, Vivi Nastase, Giuseppe Samo, Chunyang Jiang and Paola Merlo, in: Tenth Italian Conference on Computational Linguistics, 2024

Biologically Inspired Spiking Neural Networks for Speech Recognition, Alexandre Bittar, EPFL/EDEE, 2024

[DOI]

BLM-It - Blackbird Language Matrices for Italian: A CALAMITA Challenge, Chunyang Jiang, Giuseppe Samo, Vivi Nastase and Paola Merlo, in: Proceedings of the 10th Italian Conference on Computational Linguistics, 2024

Can We Learn to Select the Right Algorithm for OOD Generalization?, Liangze Jiang and Damien Teney, in: Out Of Distribution Generalization in Computer Vision, Workshop at ECCV, 2024

Neural Redshift: Random Networks are not Random Functions, Damien Teney, Armand Mihai Nicolicioiu, Valentin Hartmann and Ehsan Abbasnejad, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

CulturePark: Boosting Cross-cultural Understanding in Large Language Models, Cheng Li, Damien Teney, Linyi Yang, Qingsong Wen, Xing Xie and Jindong Wang, in: Advances in Neural Information Processing Systems (NeurIPS), 2024

Robust Manipulation Primitive Learning via Domain Contraction, Teng Xue, Amirreza Razmjoo Fard, Suhan Shetty and Sylvain Calinon, in: Proceedings of Conference on Robot Learning, 2024

Mirror-based Full-View Finger Vein Authentication with Illumination Adaptation, Junduan Huang, Zifeng Li, Sushil Bhattacharjee, Wenxiong Kang and Sébastien Marcel, in: IEEE Transactions on Circuits and Systems for Video Technology, 2024

[DOI]

A Stochastic Approach to Contact-rich Manipulation, Julius Jankowski, Ecole Polytechnique Fédérale de Lausanne, 2024

Robot Learning using Tensor Networks, Suhan Shetty, Ecole Polytechnique Fédérale de Lausanne, 2024

[DOI]

Annotator-centric Active Learning for Subjective NLP Tasks, Michiel van der Meer, Neele Falk, Pradeep K. Murukannaiah and Enrico Liscio, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2024

Identifying Privacy Personas, Olena Hrynenko and Andrea Cavallaro, in: Proceedings on Privacy Enhancing Technologies, 2025

GADePo: Graph-Assisted Declarative Pooling Transformers for Document-Level Relation Extraction, Andrei Catalin Coman, Christos Theodoropoulos, Marie-Francine Moens and James Henderson, in: Proceedings of the 3rd Workshop on Knowledge Augmented Methods for NLP, Association for Computational Linguistics, 2024

[DOI]
[URL]

Strong and Efficient Baselines for Open Domain Conversational Question Answering, Andrei Catalin Coman, Gianni Barlacchi and Adrià de Gispert, in: Findings of EMNLP, Association for Computational Linguistics, 2023

[DOI]
[URL]

Posterior-based analysis of spatio-temporal features for Sign Language Assessment, Neha Tarigopula, Sandrine Tornay, Ozge Mercanoglu Sincan, Richard Bowden and Mathew Magimai-Doss, Idiap-RR-11-2024

Hardware-effective Approaches for Skill Extraction in Job Offers and Resumes, Laura Vásquez-Rodríguez, Bertrand Audrin, Samuel Michel, Samuele Galli, Julneth Rogenhofer, Jacopo Negro Cusa and Lonneke van der Plas, in: The 4th Workshop on Recommender Systems for Human Resources, in conjunction with the 18th ACM Conference on Recommender Systems, 2024

[URL]

DiffuCOMET: Contextual Commonsense Knowledge Diffusion, Silin Gao, Mete Ismayilzada, Mengjie Zhao, Hiromi Wakaki, Yuki Mitsufuji and Antoine Bosselut, in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Bangkok, Thailand, pages 4809–4831, Association for Computational Linguistics, 2024

[DOI]
[URL]

Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, Sandrine Tornay and Mathew Magimai-Doss, Idiap-RR-09-2024

Nonparametric Variational Regularisation of Pretrained Transformers, Fabio Fehr and James Henderson, in: First conference on Language Modelling, 2024

[URL]

ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild, Arya Farkhondeh, Samy Tafasca and Jean-Marc Odobez, in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024

Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting, Haolin Chen and Philip N. Garner, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024

[DOI]

Recursive Forward Dynamics for Serial Kinematic Chains using Conformal Geometric Algebra, Tobias Löw and Sylvain Calinon, in: In Proc. Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2024

Adversarial Robustness Analysis in Automatic Pathological Speech Detection Approaches, Mahdi Amiri and Ina Kodrasi, in: Interspeech, 2024

Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, Thorbecke Iuliia, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Shashi Kumar, Pradeep Rangappa, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-10-2024

Synergizing Natural Language Towards Enhanced Shared Autonomy, Shalutha Rajapakshe, Atharva Dastenavar and Emmanuel Senft, in: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024

[URL]

Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks, Alexandre Bittar and Philip N. Garner, in: Frontiers in Neuroscience, 18(1449181), 2024

[DOI]

Missed Opportunities in Building Energy Performance Assessment, Minu Agarwal, Parag Cameron-Rastogi, Giuseppe Peronato and Georgios Mavromatidis, in: Journal of Sustainable Real Estate, 16(1), 2024

[DOI]

CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition, Pierre Vuillecard, Arya Farkhondeh, Michael Villamizar and Jean-Marc Odobez, in: 18th IEEE Int. Conference on Automatic Face and Gesture Recognition (FG), Istanbul,, 2024

Sharingan: A Transformer Architecture for Multi-Person Gaze Following, Samy Tafasca, Anshul Gupta and Jean-Marc Odobez, in: Int. Conference Computer Vision and Pattern Recognition (CVPR), Seatle, 2024

Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following, Anshul Gupta, Pierre Vuillecard, Arya Farkhondeh and Jean-Marc Odobez, in: Int. Conf. Computer Vision and Pattern Recognition (CVPR), Workshop on Gaze Estimation and Prediction in the Wild, 2024

Segmenting Object Affordances: Reproducibility and Sensitivity to Scale, Tommaso Apicella, Alessio Xompero, Paolo Gastaldo and Andrea Cavallaro, in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024

Image-guided topic modeling for interpretable privacy classification, Alina Elena Baia and Andrea Cavallaro, in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024

Comparing Stability and Discriminatory Power of Hand-crafted Versus Deep Radiomics: A 3D-Printed Anthropomorphic Phantom Study, Oscar Jimenez-del-Toro, Christoph Aberle, Roger Schaer, Michael Bach, Kyriakos Flouris, Ender Konukoglu, Bram Stieltjes, Markus M. Obmann, André Anjos, Henning Müller and Adrien Depeursinge, in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024

GLoFool: global enhancements and local perturbations to craft adversarial images, Mirko Agarla and Andrea Cavallaro, in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024

Explaining models relating objects and privacy, Alessio Xompero, Myriam Bontonou, Jean-Michel Arbona, Emmanouil Benetos and Andrea Cavallaro, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024

[URL]

Sparse multi-view hand-object reconstruction for unseen environments, Yik Lung Pang, Changjae Oh and Andrea Cavallaro, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024

[URL]

Open-Vocabulary Object 6D Pose Estimation, Jaime Corsetti, Davide Boscaini, Changjae Oh, Andrea Cavallaro and Fabio Poiesi, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

[URL]

Test-time adaptation for 6D pose tracking, Long Tian, Changjae Oh and Andrea Cavallaro, in: Pattern Recognition, 152, 2024

[DOI]
[URL]

Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models, Louise Coppieters de Gibson and Philip N. Garner, in: Acoustics, 6:470 - 488, 2024

[DOI]

Low-Resource Speech Recognition and Understanding for Challenging Applications, Juan Zuluaga-Gomez, EPFL-EDEE, 2024

Refining Tuberculosis Detection in CXR Imaging: Addressing Bias in Deep Neural Networks via Interpretability, Özgür Güler, Manuel Günther and André Anjos, in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024

[DOI]
[URL]

Face Liveness Detection Competition (LivDet-Face) - 2024, Lambert Igene, Afzal Hossain, Mohammad Zahir Uddin Chowdhury, Humaira Rezaie, Ayden Rollins, Jesse Dykes, Rahul Vijaykumar, Alain Komaty, Sébastien Marcel, Stephanie Schuckers, Juan E. Tapia, Carlos Aravena, Daniel Schulz, Banafsheh Adami, Nima Karimian, Diogo Nunes, João Marcos, Nuno Gonçalves, Lovro Sikosek, Borut Batagelj, Aleksandr Alenin, Alhasan Alkhaddour, Anton Pimenov, Artem Tregubov, Igor Avdonin, Maxim Kazantsev, Mikhail Pozigun, Vasiliy Pryadchenko, Nima Schei, David Pabon and Manuela Tiedemann, in: IEEE International Joint Conference on Biometrics, 2024

Feature Representations for Automatic Meerkat Vocalization Classification, Imen Ben Mahmoud, Eklavya Sarkar, Marta Manser and Mathew Magimai-Doss, in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Nigmatulina Iuliia, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, Idiap-RR-08-2024

[URL]

Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection, Laurent Colbois and Sébastien Marcel, in: International Joint Conference on Biometrics, 2024

Towards Wine Tasting Activity Recognition for a Digital Sommelier, Mario Parra, Jesus Favela, Luis Castro and Daniel Gatica-Perez, in: Proc. ACM Workshop on Exploring Innovative Technology for Commensality and Human-Food Interaction, 2024

Integrating large language models and ASR systems using confidence measures and prompting, Maryam Naderi, Idiap-Com-02-2024

On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis, Eklavya Sarkar and Mathew Magimai-Doss, in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024

Sentiment Analysis using pretrained LLMs, Alexandre Huou, Petr Motlicek and Esaú Villatoro-Tello, Idiap-RR-05-2024

A Novel and Responsible Dataset for Face Presentation Attack Detection on Mobile Devices, Nathan Ramoly, Alain Komaty, Vedrana Krivokuca, Lara Younes, Ahmad-Montaser Awal and Sébastien Marcel, in: The IEEE International Joint Conference on Biometrics, Buffalo, New York, pages 8, 2024

Vascular Biometrics Experiments on Candy -- A New Contactless Finger-Vein Dataset, Sushil Bhattacharjee, David Geissbuhler, G. Clivaz, Ketan Kotwal and Sébastien Marcel, in: Proceedings of the International Conference on Pattern Recognition (ICPR), Calcutta (India), 2024

SWEET - An Open Source Modular Platform for Contactless Hand Vascular Biometric Experiments, David Geissbuhler, Sushil Bhattacharjee, Ketan Kotwal, G. Clivaz and Sébastien Marcel, in: arXiv, 2024

[DOI]
[URL]

Assessing the Reliability of Biometric Authentication on Virtual Reality Devices, Ketan Kotwal, Gökhan Özbulak and Sébastien Marcel, in: Proceedings of IEEE International Joint Conference on Biometrics, 2024

Towards interfacing large language models with ASR systems using confidence measures and prompting, Maryam Naderi, Enno Hermann, Alexandre Nanchen, Sevada Hovsepyan and Mathew Magimai-Doss, in: Proceedings of Interspeech, pages 2980-2984, 2024

[DOI]

Factors that Affect Personalization of Robots for Older Adults, Laura Stegner, Emmanuel Senft and Bilge Mutlu, in: CONCATENATE Workshop at HRI 2023 in Stockholm, Sweden, 2023

[URL]

A System for Human-Robot Teaming through End-User Programming and Shared Autonomy, Michael Hagenow, Emmanuel Senft, Robert Radwin, Michael Gleicher, Michael Zinn and Bilge Mutlu, in: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pages 231-239, 2024

[DOI]
[URL]

Generative AI Literacy: Twelve Defining Competencies, Ravinithesh Annapureddy, Alessandro Fornaroli and Daniel Gatica-Perez, in: ACM Digital Government: Research and Practice, 2024

[DOI]
[URL]

Feature Representations for Automatic Meerkat Vocalization Classification, Imen Ben Mahmoud, Eklavya Sarkar, Marta Manser and Mathew Magimai-Doss, Idiap-RR-06-2024

GAFRO: Geometric Algebra for Robotics [Tutorial], Tobias Löw, Philip Abbet and Sylvain Calinon, in: IEEE Robotics and Automation Magazine, 32(3):184-194, 2025

[DOI]

M3BAT: Unsupervised Domain Adaptation for Multimodal Mobile Sensing with Multi-Branch Adversarial Training, Lakmal Buddika Meegahapola, Hamza Hassoune and Daniel Gatica-Perez, in: PACM on Interactive, Mobile, Wearable, and Ubiquitous Technologies (IMWUT), 8(2):46, 2024

[DOI]

Group Membership Verification via Nonlinear Sparsifying Transform Learning, Behrooz Razeghi, Marzieh Gheisari, Amir Atashin, Dimche Kostadinov, Sébastien Marcel, Deniz Gunduz and Slava Voloshynovskiy, in: IEEE Access, 12:86739-86751, 2024

[DOI]
[URL]

Performing And Detecting Backdoor Attacks on Face Recognition Algorithms, Alexander Unnervik, Ecole Polytechnique Fédérale de Lausanne, 2024

Logic Learning from Demonstrations for Multi-step Manipulation Tasks in Dynamic Environments, Yan Zhang, Teng Xue, Amirreza Razmjoo and Sylvain Calinon, in: IEEE Robotics and Automation Letters, 2024

Modality Agnostic Heterogeneous Face Recognition with Switch Style Modulators, Anjith George and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics, 2024

Speech and Language Recognition with Low-rank Adaptation of Pretrained Models, Amrutha Prasad, Srikanth Madikeri, Driss Khalil, Petr Motlicek and Schüpbach Christof, in: Interspeech 2024, pages 2825--2829, 2024

[DOI]
[URL]

Logic-Skill Programming: An Optimization-based Approach to Sequential Skill Planning, Teng Xue, Amirreza Razmjoo, Suhan Shetty and Sylvain Calinon, in: Proc. Robotics: Science and Systems (RSS), 2024

Configuration Space Distance Fields for Manipulation Planning, Yiming Li, Xuemin Chi, Amirreza Razmjoo and Sylvain Calinon, in: Robotics: Science and Systems (RSS), 2024, 2024

A Unified Model for Gaze Following and Social Gaze Prediction, Anshul Gupta, Samy Tafasca, Naravich Chutisilp and Jean-Marc Odobez, in: The 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

Advancing Self-Supervised Deep Learning for 3D Scene Understanding, Mohammad Mahdi Johari, EPFL, 2024

Representing Robot Geometry as Distance Fields: Applications to Whole-body Manipulation, Yiming Li, Yan Zhang, Amirreza Razmjoo and Sylvain Calinon, in: IEEE International Conference on Robotics and Automation, 2024

Online Multi-Contact Receding Horizon Planning via Value Function Approximation, J. Wang, S. Kim, Teguh Santoso Lembono, W. Du, J. Shim, S. Samadi, K. Wang, V. Ivan, Sylvain Calinon, S. Vijayakumar and S. Tonneau, in: IEEE Transactions on Robotics (T-RO), 2024

An Optimal Control Formulation of Tool Affordance Applied to Impact Tasks, Boyang Ti, Y. Gao, Jie Zhao and Sylvain Calinon, in: IEEE Transactions on Robotics (T-RO), 2024

A Probabilistic Approach to Multi-Modal Adaptive Virtual Fixtures, M. Mühlbauer, T. Hulin, B. Weber, Sylvain Calinon, F. Stulp, A. Albu-Schäffer and J. Silverio, in: IEEE Robotics and Automation Letters (RA-L), 2024

Towards Robo-Coach: Robot Interactive Stiffness/Position Adaptation for Human Strength and Conditioning Training, C. Li, X. Wu, T. Teng, Sylvain Calinon and F. Chen, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2024

D-LGP: Dynamic Logic-Geometric Program for Reactive Task and Motion Planning, Teng Xue, Razmjoo Amirreza and Sylvain Calinon, in: IEEE International Conference on Robotics and Automation, 2024

Extending the Cooperative Dual-Task Space in Conformal Geometric Algebra, Tobias Löw and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Robotics and Automation, 2024

Generalized Policy Iteration using Tensor Approximation for Hybrid Control, Suhan Shetty, Teng Xue and Sylvain Calinon, in: International Conference on Learning Representations (ICLR), 2024

Normalizing Flows for Speaker and Language Recognition Backend, Aleix Espuña, Amrutha Prasad, Petr Motlicek, Srikanth Madikeri and Schüpbach Christof, in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews, Sergio Burdisso, Ernesto A. Reyes-Ramírez, Esaú Villatoro-Tello, Fernando Sánchez-Vega, A. Pastor López-Monroy and Petr Motlicek, in: Proceedings of the 6th Clinical Natural Language Processing Workshop, Mexico City, Mexico, pages 82–90, Association for Computational Linguistics, 2024

[DOI]
[URL]

Reliability Estimation of News Media Sources: Birds of a Feather Flock Together, Sergio Burdisso, Dairazalia Sanchez-Cortes, Esaú Villatoro-Tello and Petr Motlicek, in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Mexico City, Mexico, pages 6900–6918, Association for Computational Linguistics, 2024

[DOI]
[URL]

A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models, Julia Rozanova, Marco Valentino and Andre Freitas, in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

Why daylight should be a priority for urban planning, Carlo Volf, Bruno Bueno, Peter Edwards, Richard Hobday, Stephan Mäder, Barbara Matusiak, Katharina Wulff, Werner Osterhaus, Gabriele Manoli, Christina Della Giustina, Jasmin Joshi, Jérôme Kämpf, Kevin Vega and Christoph Kueffer, in: Journal of Urban Management, 2024

[DOI]
[URL]

From Modalities to Styles: Rethinking the Domain Gap in Heterogeneous Face Recognition, Anjith George and Sébastien Marcel, in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2024

On Learning to Classify Meerkat Calls, Imen Ben Mahmoud, Idiap-Com-01-2024

Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement, Xin Quan, Marco Valentino, Louise A Dennis and Andre Freitas, in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024

Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders, Yingji Zhang, Danilo Carvalho, Marco Valentino, Ian Pratt-Hartmann and Andre Freitas, in: Findings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024

Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions, Marco Valentino, Danilo Carvalho and Andre Freitas, in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024

COMPARING DATA-DRIVEN AND HANDCRAFTED FEATURES FOR DIMENSIONAL EMOTION RECOGNITION, Bogdan Vlasenko, Sargam Vyas and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech and Signal Processing, 2024

Developing 3D-Printed Wrist Splints for Distal Radius and Scaphoid Fractures, Bernadette Tobler-Ammann, Frederic Schuind, Loïc Voillat, Théophile Gentilhomme, Esther Vögelin, Noé Murith and Bernard Masserey, in: Journal of Wrist Surgery, 2024

[DOI]
[URL]

Understanding the effects of language-specific class imbalance in multilingual fine-tuning, Vincent Jung and Lonneke van der Plas, in: Findings of the European chapter of Association for Computational Linguistics, 2024, 2024

Generalization and Personalization of Machine Learning for Multimodal Mobile Sensing in Everyday Life, Lakmal Buddika Meegahapola, EPFL, 2023

Vulnerability of Face Age Verification to Replay Attacks, Pavel Korshunov, Anjith George, Gökhan Özbulak and Sébastien Marcel, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024

EdgeFace : Efficient Face Recognition Model for Edge Devices, Anjith George, Christophe Ecabert, Hatef Otroshi Shahreza, Ketan Kotwal and Sébastien Marcel, in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2024

PRIMIS: Privacy-Preserving Medical Image Sharing via Deep Sparsifying Transform Learning with Obfuscation, Isaac Shiri, Behrooz Razeghi, Sohrab Ferdowsi, Yazdan Salimi, Deniz Gunduz, Douglas Teodoro, Slava Voloshynovskiy and Habib Zaidi, in: Journal of Biomedical Informatics, Elsevier, 150, 2024

[DOI]
[URL]

Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition, Behrooz Razeghi, Parsa Rahimi and Sébastien Marcel, in: 49th IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2024

[DOI]
[URL]

Heterogeneous Face Recognition Using Domain Invariant Units, Anjith George and Sébastien Marcel, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

Transformers as Graph-to-Graph Models, James Henderson, Alireza Mohammadshahi, Andrei Catalin Coman and Lesly Miculicich, in: Big Picture Workshop at EMNLP 2023, 2023

Nonparametric Variational Regularisation of Pretrained Transformers, Fabio Fehr and James Henderson, in: ArXiv, 2023

[DOI]
[URL]

Safe Deep Neural Networks, Kyle Matoba, EPFL, 2024

Verification of an open-source Python library for the simulation of district heating networks with complex topologies, Roberto Boghetti and Jérôme Kämpf, in: Energy, 290, 2024

[DOI]
[URL]

Loose and Tight: Creative Formation but Rigid Use of Nominal Compounds in Conspiracist Texts, Alessandro miani, Lonneke van der Plas and Adrian Bangerter, in: The Journal of Creative Behavior, 2023

Absolute retinal blood flow in healthy eyes and in eyes with retinal vein occlusion, Thibaud Mautuit, Pierre Cunnac, Frédéric Truffer, André Anjos, Rebecca Dufrane, Gilbert Maître, Martial Geiser and Christophe Chiquet, in: Microvascular Research, 152, 2024

[DOI]

Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, Amrutha Prasad, Andrés Carofilis, Geoffroy Vanderreydt, Driss Khalil, Srikanth Madikeri, Petr Motlicek and Schüpbach Christof, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), pages 11921-11925, 2024

[DOI]

Online Learning of Continuous Signed Distance Fields Using Piecewise Polynomials, Ante Marić, Yiming Li and Sylvain Calinon, in: IEEE Robotics and Automation Letters (RA-L), 9(6):6020-6026, 2024

[DOI]
[URL]

CONTEXTUAL BIASING METHODS FOR IMPROVING RARE WORD DETECTION IN AUTOMATIC SPEECH RECOGNITION, Mrinmoy Bhattacharjee, Nigmatulina Iuliia, Amrutha Prasad, Pradeep Rangappa, Srikanth Madikeri, Petr Motlicek, Hartmut Helmke and Matthias Kleinert, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024

From Zero Energy to Zero Power Buildings: a new paradigm for a sustainable transition of the building stock, Matteo Bilardo, Jérôme Kämpf and Enrico Fabrizio, in: Sustainable Cities and Society, 2023

[DOI]
[URL]

Automatic Speech Analysis Framework for ATC Communication in HAAWAII, Petr Motlicek, Amrutha Prasad, Nigmatulina Iuliia, Hartmut Helmke, Oliver Ohneiser and Matthias Kleinert, in: 13th SESAR Innovation Days, 2023

CONTENT-BASED OBJECTIVE EVALUATION OF ARTIFICIALLY GENERATED SIGN LANGUAGE VIDEOS, Neha Tarigopula, Preyas Garg, Skanda Muralidhar, Sandrine Tornay, Dinesh Babu Jayagopi and Mathew Magimai-Doss, in: ICASSP, 2024

Demystifying the Scribes behind the Voynich Manuscript using Computational Linguistic Techniques, Kevin Farrugia, Colin Layfield and Lonneke van der Plas, in: Proceedings of the 1st International Conference on the Voynich Manuscript, 2022

International Conference on the Voynich Manuscript 2022, Colin Layfield, René Zandbergen, Lisa Fagin Davis, John Abela, Claire Bowern, Michael Rosner and Lonneke van der Plas, in: Proceedings of the International Conference on Historical Cryptology, 2023

UM-DFKI Maltese Speech Translation, Aiden Williams, Kurt Abela, Rishu Kumar, Martin Bär, Hannah Billinghurst, Kurt Micallef, Ahnaf Mozib Samin, Andrea DeMarco, Lonneke van der Plas and Claudia Borg, in: Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), 2022

Findings of the IWSLT 2023 evaluation campaign, Milind Agarwal, Sweta Agarwal, Antonios Anastasopoulos, Luisa Bentivogli, Ondrej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Esteve, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, David Javorsky, John Judge, Yasumasa Kano, Tom Ko, Rishu Kumar, Pengwei Li, Xutai Ma, Prashant Mathur, Evgeny Matusov, Paul McNamee, John P. McCrae, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Ha Nguyen, Jan Niehues, Xing Niu, Atul Kr. Ojha, John E. Ortega, Proyag Pal, Juan Pino, Lonneke van der Plas, Peter Polak, Elijah Rippeth, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stuker, Katsuhito Sudoh, Yun Tang, Brian Thompson, Kevin Tran, Marco Turchi, Alex Waibe, Mingxuan Wang, Shinji Watanabe and Rodolfo Zevallos, in: Proceedings of the IWSLT conference, 2023

A Machine Learning Model for the Prediction of Building Hourly Heating Demand from CityGML Files: Training Workflow and Deployment as an API, Marco Tognoli, Giuseppe Peronato and Jérôme Kämpf, in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, pages 2932 - 2939, 2023

[DOI]
[URL]

Data-driven urban building energy modeling in Satom (CH): The energy savings potential and use of available renewable energy sources., Ahad Montazeri, Guglielmina Mutani and Jérôme Kämpf, Politecnico di Torino, 2023

[URL]

Meta-analysis informed machine learning: Supporting cytokine storm detection during CAR-T cell Therapy, Alex Bogatu, Magdalena Wysocka, Oskar Wysocki, Holly Butterworth, Manon Pillai, Jennifer Allison, Donal Landers, Elaine Kilgour, Fiona Thistlethwaite and Andre Freitas, in: Journal of Biomedical Informatics, 142, 2023

[DOI]

Epidemiological and clinical analysis of polish short-term and long-term travelers returning from tropical countries, Oskar Wysocki, Martyna Bykowska-Tumasz and Katarzyna Sikorska, in: Travel Medicine and Infectious Disease, 55, 2023

[DOI]

Defining the role of real-world data in cancer clinical research: the position of the European Organisation for Research and Treatment of Cancer, Robbe Saese, Andre Freitas and et al., in: European Journal of Cancer, 2023

A systematic review of biologically-informed deep learning models for cancer: fundamental trends for encoding and interpreting oncology data, Magdalena Wysocka, Oskar Wysocki, Marie Zufferey, Donal Landers and Andre Freitas, in: BMC Bioinformatics, 24(198), 2023

[DOI]

Learning Lessons from the COVID-19 pandemic for Real World Evidence research in Oncology–shared perspectives from an international consortia, Luis Castelo-Branco, Andre Freitas and et al., in: ESMO Open, 2023

What do individuals with visual impairment need and want from a dialogue-based digital assistant?, John Taylor, Ahalya Subramanian, Andre Freitas, Deborah Mendes and Chris Dickinson, in: Clinical and Experimental Optometry, 2023

NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports, Mael Jullien, Marco Valentino, Hannah Frost, Paul O'Reagan, Donal Landers and Andre Freitas, in: Proceedings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023

Transformers, Tables and Frame Semantics, Mario Ramirez, Alex Bogatu, Norman Paton and Andre Freitas, in: International Conference on Semantic Computing, 2023

SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data, Mael Jullien, Marco Valentino, Hannah Frost, Paul O'Reagan, Donal Landers and Andre Freitas, in: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Association for Computational Linguistics, 2023

[DOI]
[URL]

Introduction to Mathematical Language Processing: Informal Proofs, Word Problems, and Supporting Tasks, Jordan Meadows and Andre Freitas, in: Transactions of the ACL, 2023

A Canonical Context-preserving Representation for Open IE: Extracting Semantically Typed Relational Tuples from Complex Sentences, Christina Niklaus, Mathias Cetto, Andre Freitas and Siegfried Handschuh, in: Knowledge-based Systems, 2023

Learning Disentangled Representations for Natural Language Definitions, Danilo Carvalho, Giangiacomo Mercatali, Yingji Zhang and Andre Freitas, in: In Findings of the European chapter of Association for Computational Linguistics, 2023

Assessment of Subsidization Strategies for Multi-Objective Optimization of Energy Efficiency Measures for Building Renovation at District Scale, Federico Battini, Giovanni Pernigotto, Federica Morandi, Andrea Gasparella and Jérôme Kämpf, in: Energies, 16(15), 2023

[DOI]

Bi-directional Training for Composed Image Retrieval via Text Prompt Learning, Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney and Stephen Gould, in: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024

[URL]

A Symbolic Framework for Systematic Evaluation of Mathematical Reasoning with Transformers, Jordan Meadows, Marco Valentino, Damien Teney and Andre Freitas, in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024

[URL]

Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder, Zheyuan Liu, Weixuan Sun, Damien Teney and Stephen Gould, in: Transactions on Machine Learning Research (TMLR), 2024

[URL]

Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup, Damien Teney, Jindong Wang and Ehsan Abbasnejad, in: International Conference on Machine Learning (ICML), 2024

[URL]

Learning diverse features in vision transformers for improved generalization, Armand Mihai Nicolicioiu, Andrei Liviu Nicolicioiu, Bogdan Alexe and Damien Teney, in: ICML 2023: The Second Workshop on Spurious Correlations, Invariance and Stability, 2023

[URL]

Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks, Luca Scimeca, Alexander Rubinstein, Armand Mihai Nicolicioiu, Damien Teney and Yoshua Bengio, in: NeurIPS Workshop on Diffusion Models, 2023

[URL]

Energy assessment of a district by integrating solar thermal in district heating network: a dynamic analysis approach, Matteo Bilardo, Jérôme Kämpf and Enrico Fabrizio, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023

[DOI]
[URL]

ZooPFL: Exploring Black-box Foundation Models for Personalized Federated Learning, Wang Lu, Hao Yu, Jindong Wang, Damien Teney, Haohan Wang, Yiqiang Chen, Qiang Yang, Xing Xie and Xiangyang Ji, in: IEEE Transactions on Neural Networks and Learning Systems, 2025

[URL]

Shortcut Bias Mitigation via Ensemble Diversity Using Diffusion Probabilistic Models, Luca Scimeca, Alexander Rubinstein, Damien Teney, Seong Joon Oh, Armand Mihai Nicolicioiu and Yoshua Bengio, in: Under review, 2023

[URL]

Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines, Hamed Damirchi, Cristian Rodriguez-Opazo, Ehsan Abbasnejad, Damien Teney, Javen Qinfeng Shi, Stephen Gould and Anton van den Hengel, in: Under review, 2023

[URL]

Potential for district heating networks from waste heat: an assessment tool and its application to sewage treatment plants in the Canton of Zurich, Giuseppe Peronato and Jérôme Kämpf, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023

[DOI]
[URL]

Suggesting disease associations for overlooked metabolites using literature from metabolic neighbors, Maxime Delmas, Olivier Filangi, Duperier Christophe, Paulhe Nils, Florence Vinson, Pablo Rodriguez Mier, Giacomoni Franck, Fabien Jourdan and Clément Frainay, in: GigaScience, 12:13, 2023

[DOI]

End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation, Juan Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Sundararajan Srinavasan, Prashant Mathur, Brian Thompson and Marcello Federico, in: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, 2023

[URL]

Enhancing Multi-modal Classification of Violent Events using Image Captioning, Daniel Vallejo-Aldana, A. Pastor López-Monroy and Esaú Villatoro-Tello, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), CEUR Workshop Proceedings, Jaén, Spain, 2023

[URL]

Robust Face Presentation Attack Detection with Multi-channel Neural Networks, Anjith George and Sébastien Marcel, in: Handbook of Biometric Anti-Spoofing, Springer, 2023

Attacking Face Recognition with T-shirts: Database, Vulnerability Assessment and Detection, Anjith George and Sébastien Marcel, in: IEEE Access, 2023

Learning to Abstract with Nonparametric Variational Information Bottleneck, Melika Behjati, Fabio Fehr and James Henderson, in: The 2023 Conference on Empirical Methods in Natural Language Processing, 2023

[URL]

Human-Robot Collaboration in a Sanding Task, Anna Konstant, Nitzan Orr, Michael Hagenow, Emmanuel Senft, Isabelle Gundrum, Bilge Mutlu, Michael Zinn, Michael Gleicher and Robert Radwin, in: Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 2023

Towards Improved Replicability of Human Studies in Human-Robot Interaction: Recommendations for Formalized Reporting, Shelly Bagshy, Patrick Holthaus, Gloria Beraldo, Emmanuel Senft, Daniel Hernandez Garcia, Zhao Han, Suresh Kumaar Jayaraman, Alessandra Rossi, Connor Esterwood, Antonio Andriella and Paul Pridham, in: Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, pages 629-633, 2023

Combating COVID-19 with charisma: Evidence on governor speeches in the United States, Ulrich Jensen, Dominic Rohner, Olivier Bornet, Daniel Carron, Philip N. Garner, Dimitra Loupi and John Antonakis, in: The Leadership Quarterly, 2023

[DOI]
[URL]

Diversity and neocolonialism in Big Data research: Avoiding extractivism while struggling with paternalism, Paula Helm, Amalia de Götzen, Luca Cernuzzi, Alethia Hume, Shyam Diwakar, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: Big Data & Society, 2023

[DOI]

Integrated transcriptome landscape of ALS identifies genome instability linked to TDP-43 pathology, Oliver Ziff, Jacob Neeves, Jamie Mitchell, Giulia E. Tyzack, Carlos Martinez-Ruiz, Raphaelle Luisier, Anob Chakrabarti, Nicholas McGranahan, Kevin Litchfield, Simon Boulton, Amar Al-Chalabi, Gavin Kelly, Jack Humphrey and Rickie Patani, in: Nature Communications, 2023

RNA at a breaking point? Cytoplasmic cleavage and other post-transcriptional RNA processing in neurodevelopment and disease, Monika Piwecka, Raphaelle Luisier and Catia Andreassi, in: Frontiers in Molecular Neuroscience, 2023

The predicted RNA-binding protein regulome of axonal mRNAs, Raphaelle Luisier, Catia Andreassi, Lisa Fournier and Antonella Riccio, in: Genome Research, 2023

Robust Execution of Assembly Policies Using a Pose Invariant Task Representation, Bojan Nemec, Matevz Hrovat, Mihael Simonič, Suhan Shetty, Sylvain Calinon and Ales Ude, in: 20th International Conference on Ubiquitous Robots (UR), Honolulu, HI, USA, IEEE, 2023

Tensor Train for Global Optimization Problems in Robotics, Suhan Shetty, Teguh Santoso Lembono, Tobias Löw and Sylvain Calinon, in: International Journal of Robotics Research, 43(6):811-839, 2024

[DOI]

EdgeFace: Efficient Face Recognition Model for Edge Devices, Anjith George, Christophe Ecabert, Hatef Otroshi Shahreza, Ketan Kotwal and Sébastien Marcel, Idiap-RR-01-2024

Whole-Body Ergodic Exploration with a Manipulator Using Diffusion, Cem Bilaloglu, Tobias Löw and Sylvain Calinon, in: IEEE Robotics and Automation Letters, 8(12):8581-8587, 2023

[DOI]
[URL]

The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation, Mutian He and Philip N. Garner, in: Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 4408-4423, Association for Computational Linguistics, 2023

[DOI]

Enhancing user acceptance in automated systems with human-centric lighting: the role of visual comfort, personality, and preference, Michael Papinutto, Moreno Colombo, Roberto Boghetti, Chantal Basurto, Kornelius Reutter, Denis Lalanne, Jérôme Kämpf and Julien Nembrini, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023

[DOI]
[URL]

SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph Transformer, J. Liu, Z. Li, Sylvain Calinon and F. Chen, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023

Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training, Mrinmoy Bhattacharjee, Petr Motlicek, Nigmatulina Iuliia, Hartmut Helmke, Oliver Ohneiser, Matthias Kleinert and heiko Ehr, in: Proc. 13th SESAR Innovation Days, Seville, Spain, 2023

[DOI]
[URL]

A Multitask and Kernel approach for Learning to Push Objects with a Task-Parameterized Deep Q-Network, Marco Ewerton, Michael Villamizar, Julius Jankowski, Sylvain Calinon and Jean-Marc Odobez, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023

Reactive Anticipatory Robot Skills with Memory, Hakan Girgin, Julius Jankowski and Sylvain Calinon, in: Robotic Research, pages 436-451, Springer, 2023

Programming industrial robots from few demonstrations., Sylvain Calinon, in: Human-Robot Collaboration: Unlocking the potential for industrial applications, pages 9-37, Institution of Engineering and Technology (IET), 2023

Efficient Grapevine Structure Estimation in Vineyards Conditions, Théophile Gentilhomme, Michael Villamizar, Jérome Corre and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, pages 712--720, 2023

[URL]

Mitigating Demographic Bias in Face Recognition via Regularized Score Calibration, Ketan Kotwal and Sébastien Marcel, in: IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, IEEE/CVF, 2024

Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, Geoffroy Vanderreydt, Amrutha Prasad, Driss Khalil, Srikanth Madikeri, Kris Demuynck and Petr Motlicek, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023

[DOI]

Coordinated Multi-Robot Shared Autonomy Based on Scheduling and Demonstrations, Michael Hagenow, Emmanuel Senft, Nitzan Orr, Robert Radwin, Michael Gleicher, Bilge Mutlu, Dylan P. Losey and Michael Zinn, in: IEEE Robotics and Automation Letters, 8(12):8335 - 8342, 2023

[DOI]
[URL]

The Suisse Romande Local News Dataset, Victor Bros and Daniel Gatica-Perez, Idiap-Com-03-2023

Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Driss Khalil, Srikanth Madikeri, Allan Tart, Igor Szoke, Vincent Lenders, Mickael Rigault and Khalid Choukri, in: Aerospace, 10(10):898, 2023

[DOI]
[URL]

An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain, Driss Khalil, Amrutha Prasad, Petr Motlicek, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Srikanth Madikeri and Schüpbach Christof, in: Aerospace, 10(10):876, 2023

[DOI]
[URL]

The Idiap Speech Synthesis System for the Blizzard Challenge 2023, Haolin Chen, Mutian He, Louise Coppieters de Gibson and Philip N. Garner, in: Proc. 18th Blizzard Challenge Workshop, 2023

[DOI]

Modeling Structured Data in Attention-based Models, Alireza Mohammadshahi, EPFL, 2023

[URL]

ProGAP: Progressive Graph Neural Networks with Differential Privacy Guarantees, Sina Sajadmanesh and Daniel Gatica-Perez, in: The 17th ACM International Conference on Web Search and Data Mining, 2024

BLESS: Benchmarking Large Language Models on Sentence Simplification, Tannon Kew, Alison Chi, Laura Vásquez-Rodríguez, Sweta Agrawal, Dennis Aumiller, Fernando Alva-Manchego and Matthew Shardlow, in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023

Can Language Models Learn Analogical Reasoning? Investigating Training Objectives and Comparisons to Human Performance, Molly R. Petersen and Lonneke van der Plas, in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, Association for Computational Linguistics, 2023

Development and comparison of adaptive data-driven models for thermal comfort assessment and control, Giulia Lamberti, Roberto Boghetti, Jérôme Kämpf, Fabio Fantozzi, Francesco Leccese and Giacomo Salvadori, in: Total Environment Research Themes, 8, 2023

[DOI]
[URL]

Benefits of Max Pooling in Neural Networks: Theoretical and Experimental Evidence, Kyle Matoba, Nikolaos Dimitriadis and Francois Fleuret, in: Transactions on Machine Learning Research, 2023

Practical computational imaging by use of spatiotemporal light modulation: from simulations to applications in biological microscopy, François Marelli, EPFL, 2023

[DOI]

Affordance segmentation of hand-occluded containers from exocentric images, Tommaso Apicella, Alessio Xompero, Edoardo Ragusa, Riccardo Berta, Andrea Cavallaro and Paolo Gastaldo, in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023

[DOI]
[URL]

Black-box Attacks on Image Activity Prediction and its Natural Language Explanations, Alina Elena Baia, Valentina Poggioni and Andrea Cavallaro, in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023

[DOI]
[URL]

Privacy-Preserving Machine Learning on Graphs, Sina Sajadmanesh, EPFL, 2023

[DOI]

Document-level Text Simplification with Coherence Evaluation, Laura Vásquez-Rodríguez, Matthew Shardlow, Piotr Przybyla and Sophia Ananiadou, in: Proceedings of the Second Workshop on Text Simplification, Accessibility and Readability, 2023

Generalizable Automatic Classification of Sleep Stages, Samuel Michel, Idiap-Com-02-2023

From Nano to Macro: An overview of the IEEE Bio Image and Signal Processing Technical Committee, Selin Aviyente, Alejandro F. Frangi, Erik Meijering, Arrate Muñoz-Barrutia, Michael Liebling, Dimitri Van De Ville, Jean-Christophe Olivo-Marin, Jelena Kovačević and Michael Unser, in: IEEE Signal Processing Magazine, 40(4):61-71, 2023

[DOI]
[URL]

SynthDistill: Face Recognition with Knowledge Distillation from Synthetic Data, Hatef Otroshi Shahreza, Anjith George and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023

[DOI]

Multi-image deconvolution of thermal images with a boundary condition weighting scheme, Florian Piras, François Marelli, Edouard De Moura Presa, Peter Wellig and Michael Liebling, in: Target and Background Signatures IX, International Society for Optics and Photonics, Amsterdam, pages 149-158, SPIE, 2023

[DOI]
[URL]

The Unconstrained Ear Recognition Challenge 2023: Maximizing Performance and Minimizing Bias, Anjith George and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023

EFaR 2023: Efficient Face Recognition Competition, Anjith George, Christophe Ecabert, Hatef Otroshi Shahreza, Ketan Kotwal and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023

The DCU-EPFL Enhanced Dependency Parser at the IWPT 2021 Shared Task, James Barry, Alireza Mohammadshahi, Joachim Wagner, Jennifer Foster and James Henderson, in: Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies, Online, pages 204-212, Association for Computational Linguistics, 2021

Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers, Luis Espinosa Anke, Alexander Shvets, Alireza Mohammadshahi, James Henderson and Leo Wanner, in: Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, Seattle, USA, pages 89–100, 2022

Characterizing Swiss Alpine Lakes: from Wikipedia to Citizen Science, Yuanhui Lin and Daniel Gatica-Perez, in: ACM Journal on Computing and Sustainable Societies, 2023

Urban Crowdsourcing Platforms across the World: A Systematic Review, Alessandro Fornaroli and Daniel Gatica-Perez, in: ACM Digital Government: Research and Practice, 2023

[DOI]
[URL]

Understanding the Social Context of Eating with Multimodal Smartphone Sensing: The Role of Country Diversity, Nathan Kammoun, Lakmal Buddika Meegahapola and Daniel Gatica-Perez, in: 25th ACM International Conference on Multimodal Interaction, 2023

[DOI]
[URL]

Health Talk: Understanding Practices of Popular Professional YouTubers, Thanh-Trung Phan, Chloé Michoud, Lucia Volpato, María del Río Carral and Daniel Gatica-Perez, in: Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia, 2022

Novel Methods For Detection And Analysis Of Atypical Aspects In Speech, Julian Fritsch, École Polytechnique Fédérale de Lausanne, 2023

[DOI]

Claim-Dissector: An Interpretable Fact-Checking System with Joint Re-ranking and Veracity Prediction, Martin Fajcik, Petr Motlicek and Pavel Smrz, in: Association for Computational Linguistics, Findings of the Association for Computational Linguistics: ACL 2023:10184–10205, 2023

[URL]

ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour, Samy Tafasca, Anshul Gupta and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023

Toward responsible face datasets: modeling the distribution of a disentangled latent space for sampling face images from demographic groups, Parsa Rahimi, Christophe Ecabert and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics, 2023

A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers, Juan Zuluaga-Gomez, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek and Matthias Kleinert, in: Aerospace, 10(5), 2023

[DOI]
[URL]

Approximating Optimal Morphing Attacks using Template Inversion, Laurent Colbois, Hatef Otroshi Shahreza and Sébastien Marcel, in: IEEE International Joint Conference on Biometric, 2023

[DOI]

Validating Automatic Speech Recognition and Understanding for Pre-Filling Radar Labels-Increasing Safety While Reducing Air Traffic Controllers' Workload, Nils Ahrenhold, Hartmut Helmke, Thorsten Mühlhausen, Oliver Ohneiser, Matthias Kleinert, heiko Ehr, Lucas Klamert and Juan Zuluaga-Gomez, in: Aerospace, 10(6):538, 2023

[DOI]

Learning Joint Space Reference Manifold for Reliable Physical Assistance, Amirreza Razmjoo, Tilen Brecelj, Kristina Savevska, Ales Ude, Tadej Petric and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 10412-10417, 2023

[DOI]

A Geometric Optimal Control Approach for Imitation and Generalization of Manipulation Skills, Boyang Ti, Amirreza Razmjoo, Yongsheng Gao, Jie Zhao and Sylvain Calinon, in: Robotics and Autonomous Systems, 2023

VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, Julius Jankowski, Lara Brudermuller, Nick Hawes and Sylvain Calinon, in: Proc. IEEE International Conference on Robotics and Automation (ICRA), 2023

PAAQ: Paired Alternating AcQuisitions for Virtual High Frame Rate Multichannel Cardiac Fluorescence Microscopy, François Marelli, Alexander Ernst, Nadia Mercader and Michael Liebling, in: Biological Imaging, 3:e20, 2023

[DOI]

Efficient compressed sensing reconstruction for 3D fluorescence microscopy using OptoMechanical Modulation Tomography (OMMT) with a 1+2D regularization, François Marelli and Michael Liebling, in: Optics Express, 31(20):31718-31733, 2023

[DOI]

Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes, Pavel Korshunov, Haolin Chen, Philip N. Garner and Sébastien Marcel, in: IEEE International Joint Conference on Biometrics, 2023

A VAE for Transformers with Nonparametric Variational Information Bottleneck, James Henderson and Fabio Fehr, in: The Eleventh International Conference on Learning Representations, 2023

[URL]

Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, Anjith George and Sébastien Marcel, in: IJCB, 2023

Idiap Scientific Report 2022, Hervé Bourlard, Daniel Gatica-Perez, Jean-Marc Odobez, Philip N. Garner, Petr Motlicek, Mathew Magimai-Doss, Sylvain Calinon, Sébastien Marcel, Jérôme Kämpf, Raphaelle Luisier, Michael Liebling, Lonneke van der Plas, Damien Teney, Ina Kodrasi, Emmanuel Senft, James Henderson, Andre Freitas and André Anjos, Idiap-RR-05-2023

Predicting is not understanding: Recognizing and addressing underspecification in machine learning, Damien Teney, Ehsan Abbasnejad and Maxime Peyrard, in: European Conference on Computer Vision, pages 458-476, Springer, 2022

On matching data and model in LF-MMI-based dysarthric speech recognition, Enno Hermann, École polytechnique fédérale de Lausanne, 2023

[DOI]
[URL]

Text Representation Learning for Low Cost Natural Language Understanding, Florian Mai, École polytechnique fédérale de Lausanne, 2023

[DOI]
[URL]

HyperMixer: An MLP-based Low Cost Alternative to Transformers, Florian Mai, Arnaud Pannatier, Fabio Fehr, Haolin Chen, François Marelli, Francois Fleuret and James Henderson, in: Proc. of the 61st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Toronto, Canada, pages 15632-15654, 2023

[DOI]

Demonstration-guided Optimal Control for Long-term Non-prehensile Planar Manipulation, Teng Xue, Hakan Girgin, Teguh Santoso Lembono and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 4999-5005, 2023

[DOI]

Verification of PyDHN - a Python library for the thermo-hydraulic simulation of district heating networks - through the DESTEST, Roberto Boghetti, Giuseppe Peronato and Jérôme Kämpf, in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, IBPSA, IBPSA, 2023

[DOI]
[URL]

Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers’ Workload, Hartmut Helmke, Matthias Kleinert, Nils Ahrenhold, heiko Ehr, Thorsten Mühlhausen, Oliver Ohneiser, Petr Motlicek, Amrutha Prasad, Juan Zuluaga-Gomez, Lucas Klamert, Jelena Dokic and Ella Pinska Chauvin, in: Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023), Eurocontrol (Europe), FAA (U.S.), Savannah, Georgia, USA, 2023

[URL]

Diffusion Transformer for Adaptive Text-to-Speech, Haolin Chen and Philip N. Garner, in: Proc. 12th ISCA Speech Synthesis Workshop (SSW 12), 2023

[DOI]

Intelligent Technologies: Concepts, Applications, and Future Directions, Volume 2, Satya Ranjan Dash and Esaú Villatoro-Tello, Springer, volume 1098, 2023

[DOI]

Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, Sergio Burdisso, Esaú Villatoro-Tello, Srikanth Madikeri and Petr Motlicek, in: Proceedings of Interspeech, 2023

Periscope: A Robotic Camera System to Support Remote Physical Collaboration, Pragathi Praveena, Yeping Wang, Emmanuel Senft, Michael Gleicher and Bilge Mutlu, in: Proceedings of the ACM on Human Computer Interaction, 2023

Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report, Timothy Piton, Enno Hermann, Angela Pasqualotto, Marjolaine Cohen, Mathew Magimai-Doss and Daphné Bavelier, in: Proceedings of Interspeech, pages 4573-4577, 2023

[DOI]
[URL]

Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, Enno Hermann and Mathew Magimai-Doss, in: Proceedings of Interspeech, pages 156-160, 2023

[DOI]
[URL]

HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition, Florian Mai, Juan Zuluaga-Gomez, Titouan Parcollet and Petr Motlicek, in: Proc. Interspeech 2023, Ireland, 2023

Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding, Mutian He and Philip N. Garner, in: Proc. INTERSPEECH 2023, pages 1109-1113, 2023

[DOI]

Attacking Face Recognition with T-shirts: Database, Vulnerability Assessment and Detection, Anjith George and Sébastien Marcel, Idiap-RR-08-2023

Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, Anjith George and Sébastien Marcel, Idiap-RR-09-2023

Implementing contextual biasing in GPU decoder for online ASR, Nigmatulina Iuliia, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motlicek, Juan Zuluaga-Gomez, Karthik Pandia D S and Aravind Ganapathiraju, in: Proc. Interspeech 2023, pages 4494--4498, 2023

[DOI]
[URL]

CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice, Juan Zuluaga-Gomez, Ahmed Sara, Visockas Danielius and Subakan Cem, in: Proc. Interspeech 2023, 2023

[URL]

Development of 3D-printed Patient-Specific Anatomical Braces (PSAB) for Distal Radius and Scaphoid Fractures, Bernadette Tobler-Ammann, Frederic Schuind, Loïc Voillat, Théophile Gentilhomme, Esther Vögelin, Noé Murith and Bernard Masserey, in: Journal of wrist Surgery, 2023

Keep Sensors in Check: Disentangling Country-Level Generalization Issues in Mobile Sensor-Based Models with Diversity Scores, Alexandre Nanchen, Lakmal Buddika Meegahapola, William Droz and Daniel Gatica-Perez, in: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023

Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?, Eklavya Sarkar and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2023

Implementing contextual biasing in GPU decoder for online ASR, Nigmatulina Iuliia, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motlicek, Juan Zuluaga-Gomez, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-02-2023

Geometric Algebra for Optimal Control with Applications in Manipulation Tasks, Tobias Löw and Sylvain Calinon, in: IEEE Transactions on Robotics, 2023

Approximating Optimal Morphing Attacks using Template Inversion, Laurent Colbois, Hatef Otroshi Shahreza and Sébastien Marcel, Idiap-RR-07-2023

The rise of artificial intelligence reading of chest X-rays for enhanced TB diagnosis and elimination, Coralie Geric, Zhi Zhen Qin, Claudia M. Denkinger, Sandra V. Kik, Ben Marais, André Anjos, Pierre-Marie David, Faiz A. Khan and Anete Trajman, in: The International Journal of Tuberculosis and Lung Disease, 27(5):367--372, 2023

[DOI]
[URL]

Referencing in YouTube Knowledge Communication Videos, Haeeun Kim and Daniel Gatica-Perez, in: ACM International Conference on Interactive Media Experiences (IMX '23), June 2023, Nantes, France, 2023

Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework, David Alonso del Barrio and Daniel Gatica-Perez, in: 2nd ACM International Workshop on Multimedia AI against Disinformation (MAD '23), June 12, 2023, Thessaloniki, Greece, 2023

Framing the News: From Human Perception to Large Language Model Inferences, David Alonso del Barrio and Daniel Gatica-Perez, in: International Conference on Multimedia Retrieval (ICMR '23), June 12--15, 2023, Thessaloniki, Greece, 2023

Learning and Optimization of Anticipatory Feedback Controllers for Robot Manipulation, Hakan Girgin, École Polytechnique Fédérale de Lausanne, 2023

[DOI]

Automatic identification of storytelling responses to past-behavior interview questions via machine learning, Adrian Bangerter, Eric Mayor, Skanda Muralidhar, Emmanuelle Patricia Kleinlogel, Daniel Gatica-Perez and Marianne Schmid Mast, in: International Journal of Selection and Assessment, 2023

ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields, Mohammad Mahdi Johari, Camilla Carta and Francois Fleuret, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), pages 17408-17419, 2023

[DOI]

A lexical-availability-based framework from short communications for automatic personality identification, Gabriela Ramírez-de-la-Rosa, Héctor Jiménez-Salazar, Esaú Villatoro-Tello, Verónica Reyes-Meza and Jaime Rojas-Avila, in: Cognitive Systems Research, 79:126-137, 2023

[DOI]
[URL]

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, Esaú Villatoro-Tello, Srikanth Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia, Petr Motlicek, Alexei V. Ivanov and Aravind Ganapathiraju, in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Sparse Autoencoders for Speech Modeling and Recognition, Selen Hande Kabil, École polytechnique fédérale de Lausanne, 2023

[DOI]

Stop Wasting my FLOPS: Improving the Efficiency of Deep Learning Models, Angelos Katharopoulos, École Polytechnique Fédérale de Lausanne, 2022

[DOI]

Automatic pathological speech assessment, Parvaneh Janbakhshi, École polytechnique fédérale de Lausanne, 2022

[DOI]

Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, Sergio Burdisso, Esaú Villatoro-Tello, Srikanth Madikeri and Petr Motlicek, Idiap-RR-03-2023

Quantified Canine: Inferring Dog Personality From Wearables, Lakmal Buddika Meegahapola, Marios Constantinides, Zoran Radivojevic, Hongwei Li, Daniele Quercia and Michael S. Eggleston, in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI) 2023, Association for Computing Machinery, 2023

[DOI]

Complex Daily Activities, Country-Level Diversity, and Smartphone Sensing: A Study in Denmark, Italy, Mongolia, Paraguay, and UK, Karim Assi, Lakmal Buddika Meegahapola, William Droz, PETER KUN, Amalia de Götzen, Miriam Bidoglia, Sally Stares, George Gaskell, Altangerel Chagnaa, Amarsanaa Ganbold, Tsolmon Zundui, Carlo Caprini, Daniele Miorandi, José Luis Zarza, Alethia Hume, Luca Cernuzzi, Ivano Bison, Marcelo Rodas Britez, Matteo Busso, Ronald Chenu-Abente, Fausto Giunchiglia and Daniel Gatica-Perez, in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI), Association for Computing Machinery, 2023

[DOI]

Ranking parameters in urban energy models for various building forms and climates using sensitivity analysis, Aysegul Demir Dilsiz, Kaitlynn Ng, Jérôme Kämpf and Zoltán Nagy, in: Building Simulation, 2022

[DOI]

Situated Participatory Design: A Method for In Situ Design of Robotic Interaction with Older Adults, Laura Stegner, Emmanuel Senft and Bilge Mutlu, in: CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

[DOI]
[URL]

A Bayesian approach to machine learning model comparison, Antonio Morais, Idiap-Com-01-2023

On Interventional Probing in High Dimensions: An NLI Case Study, Julia Rozanova, Marco Valentino, Lucas Cordeiro and Andre Freitas, in: Findings of the 17th European Chapter of the Association for Computational Linguistics, 2023

Graph Refinement for Coreference Resolution, Lesly Miculicich and James Henderson, in: Findings of Association for >Computational Linguistics: ACL 2022, 2022

IDIAP SUBMISSION TO NIST LRE22 LANGUAGE RECOGNITION EVALUATION, Amrutha Prasad, Driss Khalil, Srikanth Madikeri and Petr Motlicek, Idiap-RR-11-2025

Why Scholars Are Diagramming Neural Network Models, Guy Marshall, Caroline Jay and Andre Freitas, in: 13th International Conference on the Theory and Application of Diagrams, 2022

Shallow Discourse Parsing for Open Information Extraction and Text Simplification, Christina Niklaus, Andre Freitas and Siegfried Handschuh, in: 3rd Workshop on Computational Approaches to Discourse (CODI) @ COLING, 2022

digital ECMT cancer trial matching tool, an open source research application to support oncologists in the identification of precision medicine clinical trials,, Paul O'Reagan, Andre Freitas and et al., in: JCO Clinical Cancer Informatics, 2022

Assessing the communication gap between AI models and healthcare professionals: explainability, utility and trust in AI-driven clinical decision-making, Oskar Wysocki, Jessica Katharine Davies, Markel Vigo, Anne Caroline Armstrong, Donal Landers, Rebecca Lee and Andre Freitas, in: Artificial Intelligence, 2022

Patient Attrition in Molecular Tumour Boards: A Systematic Review, Hannah Frost, Donna Graham, Louise Carter, Paul O'Reagan, Donal Landers and Andre Freitas, in: British Journal of Cancer, 2022

Symmetry-induced Disentanglement on Graphs, Giangiacomo Mercatali, Vikas Garg and Andre Freitas, in: Advances in Neural Information Processing Systems 35, 2022

Transformers and the representation of biomedical background knowledge, Oskar Wysocki, Zili Zhou, Paul O'Reagan, Deborah Mendes, Donal Landers and Andre Freitas, in: Computational Linguistics, 2022

Towards energy hubs: an innovative Geographic Information System based approach for cluster definition, Giacomo Cillari, Fabio Fantozzi, Alessandro Franco and Jérôme Kämpf, in: ICREC 2022 Conference Proceedings, 2022

An exploratory interplay between daylight, general and task lighting for visual comfort and electricity savings in a personal office space, Chantal Basurto, Michael Papinutto, Moreno Colombo, Roberto Boghetti, Kornelius Reutter, Julien Nembrini and Jérôme Kämpf, in: Proceedings of ISES and IEA SHC International Conference on Solar Energy for Buildings and Industry, Kassel, Germany, 2022

Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia and Karel Vesely, in: 12th SESAR Innovation Days, 2022

Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia, Oliver Ohneiser and Hartmut Helmke, in: 12th SESAR Innovation Days, 2022

Integrating daylight with general and task lighting: A longitudinal in-the-wild study in individual and open space working areas, Chantal Basurto, Michael Papinutto, Moreno Colombo, Roberto Boghetti, Kornelius Reutter, Julien Nembrini and Jérôme Kämpf, in: Solar Energy Advances, 2, 2022

[DOI]
[URL]

Identification of existing tools and workflows for solar neighborhood planning, Nicholas Baker, Raffaella Belmonte Monteiro, Alessia Boccalatte, Karine Bouty, Johannes Brozovsky, Cyril Caliot, Rafael Campamà Pizarro, Raphaël Compagnon, Agnieszka Czachura, Gilles Desthieux, Matteo Formolli, Stéphanie Giroux-Julien, Victor Guillot, Govehovitch Benjamin, Caroline Hachem-Vermette, Ellis Herman, Olivia Alarcon Herrera, Jérôme Kämpf, Gabriele Lobaccaro, Christophe Ménézo, Marjorie Musy, Giuseppe Peronato, Arnkell Jonas Petersen, Auline Rodler, Kuljeet Singh, Viktor Sjöberg, Mark Snow, Joar Tjetland and Yupeng Wang, in: SHC Task 63: Solar Neighborhood Planning, Subtask C: Solar Planning Tools, IEA, 2022

[DOI]

Natural Language Processing in Healthcare, Satya Ranjan Dash, Shantipriya Parida, Esaú Villatoro-Tello, Biswaranjan Acharya and Ondrej Bojar, Taylor & Francis Groups, 2022

[DOI]
[URL]

Response Burden and Dropout in a Probability-Based Online Panel Study – A Comparison between an App and Browser-Based Design, Caroline Roberts, Jessica M. E. Herzing, Marc Asensio Manjon, Philip Abbet and Daniel Gatica-Perez, in: Journal of Official Statistics, 2022

[DOI]
[URL]

Differentiation of motor speech disorders through the seven deviance scores from MonPaGe-2.0.s, Cecile Fougeron, Ina Kodrasi and Marina Laganaro, in: Brain Sciences, 12(11):1471-1487, 2022

IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, Martin Fajcik, Muskaan Singh, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek and Pavel Smrz, in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022

[URL]

Meta-analysis of the amyotrophic lateral sclerosis spectrum uncovers genome instability, Oliver Ziff, Jacob Neeves, Jamie Mitchell, Giulia E. Tyzack, Carlos Martinez Ruiz, Nicholas McGranahan, Raphaelle Luisier, Anob Chakrabarti, Simon Boulton, Gavin Kelly, Jack Humphrey and Rickie Patani, in: BioRxiv, 2022

IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, Sergio Burdisso, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Martin Fajcik, Muskaan Singh, Pavel Smrz and Petr Motlicek, in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022

[URL]

The RNA Binding proteome of axonal mRNAs in sympathetic neurons, Raphaelle Luisier, Catia Andreassi and Antonella Riccio, in: BioRxiv, 2022

Physiological intron retaining transcripts in the cytoplasm abound during human motor neurogenesis, Marija Petric-Howe, Hamish Crerar, Jacob Neeves, Jasmine Harley, Giulia E. Tyzack, Pierre Klein, Andres Ramos, Rickie Patani and Raphaelle Luisier, in: Genome Research, 2022

Efficient Training of Low-Curvature Neural Networks, Suraj Srinivas, Kyle Matoba, Himabindu Lakkaraju and Francois Fleuret, in: NeurIPS 2022, 2022

[URL]

VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, Julius Jankowski, Lara Brudermuller, Nick Hawes and Sylvain Calinon, Idiap-RR-04-2023

Passive Bimanual Skills Learning from Demonstration with Motion Graph Attention Networks, Z. Dong, Z. Li, Y. Yan, Sylvain Calinon and F. Chen, in: IEEE Robotics and Automation Letters (RA-L), 7(2):4917-4923, 2022

Robot Cooking with Stir-fry: Bimanual Non-prehensile Manipulation of Semi-fluid Objects, J. Liu, Y. Chen, Z. Dong, S. Wang, Sylvain Calinon, M. Li and F. Chen, in: IEEE Robotics and Automation Letters (RA-L), 7(2):5159-5166, 2022

Vision-Language Pretraining: Current Trends and the Future, Aishwarya Agrawal, Damien Teney and Aida Nematzadeh, in: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2022

[URL]

SelecMix: Debiased Learning by Mixing up Contradicting Pairs, Inwoo Hwang, Sangjun Lee, Yunhyeok Kwak, Seong Joon Oh, Damien Teney, Jin-Hwa Kim and Byoung-Tak Zhang, in: ICML Workshop on Spurious Correlations, Invariance and Stability, 2022

EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering, Violetta Shevchenko, Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel and Damien Teney, in: arXiv, 2022

ID and OOD performance are sometimes inversely correlated on real-world datasets, Damien Teney, Yong Lin, Seong Joon Oh and Ehsan Abbasnejad, in: Advances in Neural Information Processing Systems (NeurIPS), 2023

SelecMix: Debiased Learning by Contradicting-pair Sampling, Inwoo Hwang, Sangjun Lee, Yunhyeok Kwak, Seong Joon Oh, Damien Teney, Jin-Hwa Kim and Byoung-Tak Zhang, in: Advances in Neural Information Processing Systems, 2022

Reasoning over vision and language: Exploring the benefits of supplemental knowledge, Violetta Shevchenko, Damien Teney, Anthony Dick and Anton van den Hengel, in: arXiv, 2022

Bayesian Recurrent Units and the Forward Backward Algorithm, Alexandre Bittar and Philip N. Garner, in: Proc. Interspeech 2022, pages 4137-4141, 2022

[DOI]

Readback Error Detection by Automatic Speech Recognition and Understanding -- Results of HAAWAII Project for Isavia’s Enroute Airspace, Hartmut Helmke, Karel Ondřej, Shruthi Shetty, Hörður Arilíusson, Teodor S. Simiganoschi, Matthias Kleinert, Oliver Ohneiser, heiko Ehr, Juan Zuluaga-Gomez and Pavel Smrz, in: 11th SESAR Innovation Days, SESAR, pages 9, 2022

How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, Juan Zuluaga-Gomez, Amrutha Prasad, Nigmatulina Iuliia, Seyyed Saeed Sarfjoo, Petr Motlicek, Matthias Kleinert, Hartmut Helmke, Oliver Ohneiser and Qingran Zhan, in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023

[URL]

BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek, Karel Ondřej and Oliver Ohneiser, in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023

[URL]

Towards Smart Pruning: ViNet, a Deep-Learning Approach for Grapevine Structure Estimation, Théophile Gentilhomme, Michael Villamizar, Jérome Corre and Jean-Marc Odobez, in: Computers and Electronics in Agriculture, 207:107736, 2023

[DOI]
[URL]

Your Day in Your Pocket: Complex Activity Recognition from Smartphone Accelerometers, Emma Bouton--Bessac, Daniel Gatica-Perez and Lakmal Buddika Meegahapola, in: EAI Pervasive Health, 2022

Health Talk: Understanding Practices of Popular Professional YouTubers, Thanh-Trung Phan, Chloé Michoud, Lucia Volpato, María del Río Carral and Daniel Gatica-Perez, in: Int. Conf. on Mobile and Ubiquitous Multimedia, 2022

GAP: Differentially Private Graph Neural Networks with Aggregation Perturbation, Sina Sajadmanesh, Ali Shahin Shamsabadi, Aurélien Bellet and Daniel Gatica-Perez, in: 32nd USENIX Security Symposium (USENIX Security 23), 2023

Mechanical Artifacts in Optical Projection Tomography: Classification and Automatic Calibration, Yan Liu, Jonathan Dong, Thanh-An Pham, François Marelli and Michael Unser, in: Opt. Continuum, 1(12):2577--2589, 2022

[DOI]

TextGraphs 2022 Shared Task on Natural Language Premise Selection, Marco Valentino, Deborah Mendes, Mokanarangan Thayaparan, Andre Freitas and Dmitry Ustalov, in: Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing, 2022

[URL]

Decomposing Natural Logic Inferences for Neural NLI, Julia Rozanova, Deborah Mendes, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: BlackBoxNLP: Workshop on analyzing and interpreting neural networks for NLP, 2022

Diff-Explainer: Differentiable Convex Optimization for Explainable Multi-Hop Inference, Mokanarangan Thayaparan, Marco Valentino, Deborah Mendes, Julia Rozanova and Andre Freitas, in: Transactions of the Association for Computational Linguistics, 2022

[DOI]

Case-Based Abductive Natural Language Inference, Marco Valentino, Mokanarangan Thayaparan and Andre Freitas, in: Proceedings of the 29th International Conference on Computational Linguistics, 2022

[URL]

Generalization and Personalization of Mobile Sensing-Based Mood Inference Models: An Analysis of College Students in Eight Countries, Lakmal Buddika Meegahapola, William Droz, Amalia de Götzen, PETER KUN, Chaitanya Nuttakki, Shyam Diwakar, Salvador Ruiz-Correa, Donglei Song, Hao Xu, Miriam Bidoglia, George Gaskell, Altangerel Chagnaa, Amarsanaa Ganbold, Tsolmon Zundui, Carlo Caprini, Daniele Miorandi, Alethia Hume, José Luis Zarza, Luca Cernuzzi, Ivano Bison, Marcelo Rodas Britez, Matteo Busso, Ronald Chenu-Abente, Can Gunel, Fausto Giunchiglia, Laura Schelenz and Daniel Gatica-Perez, in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 6(4), 2022

[DOI]

What Do Compressed Multilingual Machine Translation Models Forget?, Alireza Mohammadshahi, Vassilina Nikoulina, Alexandre Berard, Caroline Brun, James Henderson and Laurent Besacier, in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages, Alireza Mohammadshahi, Vassilina Nikoulina, Alexandre Berard, Caroline Brun, James Henderson and Laurent Besacier, in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Imitation of Manipulation Skills Using Multiple Geometries, Boyang Ti, Yongsheng Gao, Jie Zhao and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022

Prepended Domain Transformer: Heterogeneous Face Recognition without Bells and Whistles, Anjith George, Amir Mohammadi and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 2022

Innovators@SMM4H'22: An Ensembles Approach for self-reporting of COVID-19 Vaccination Status Tweets, Mohammad Zohair, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: International Conference on Computational Linguistics (COLING 2022), 2022

Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers, Seema Wazarkar, muskan garg, Muskaan Singh and Ondrej Bojar, in: International Conference on Language Resources and Evaluation (LREC 2022), 2022

Innovators@SMM4H'22: An Ensembles Approach for Stance and Premise Classification of COVID-19 Health Mandates Tweets, Vatsal Savaliya, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: International Conference on Computational Linguistics (COLING 2022), 2022

Automatic Summarization for Creative Writing: Denoising Auto-Encoder based Pipeline Method for Generating Summary of Movie Scripts, Aditya Upadhyay, Nidhir Bhavsar, Aakash Bhatnagar, Muskaan Singh and Petr Motlicek, in: Automatic Summarization for Creative Writing, International Conference on Computational Linguistics (COLING 2022), 2022

Automatic Minuting: A Pipeline Method for Generating Minutes, Kartik Shinde, Tirthankar Ghosal, Muskaan Singh and Ondrej Bojar, in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology, 2022

An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36) , In proceedings of ACL Anthology, 2022

An Empirical Comparison of Semantic Similarity Methods for Analyzing down-streaming Automatic Minuting task, Aditya Upadhyay, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In ACL Anthology Proceedings, 2022

Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, Deepanshu Khanna, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022

HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing, Nidhir Bhavsar, Aakash Bhatnagar, Muskaan Singh and Petr Motlicek, in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022

Two Simple and Domain-independent Approaches for Early Detection of Anorexia, Sergio Burdisso, Leticia Cagnina, Marcelo Errecalde and Manuel Montes-y-Gómez, in: Early Detection of Mental Health Disorders by Social Media Monitoring: The First Five Years of the eRisk Project, pages 159-182, Springer International Publishing, 2022

[DOI]
[URL]

Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, S. Pavankumar Dubagunta, Edoardo Moneta, Eleni Theocharopoulos and Mathew Magimai-Doss, in: ACM International Conference on Multimodal Interaction (ICMI Companion), 2022

[DOI]

Towards Accessible Sign Language Learning and Assessment, Neha Tarigopula, Sandrine Tornay, Skanda Muralidhar and Mathew Magimai-Doss, in: ACM International Conference on Multimodal Interaction, Bangalore, INDIA, pages 626-631, 2022

[DOI]

Claim-Dissector: An Interpretable Fact-Checking System with Joint Re-ranking and Veracity Prediction, Martin Fajcik, Petr Motlicek and Pavel Smrz, Idiap-Com-03-2022

[URL]

Leveraging Events Sub-Categories for Violent-Events Detection in Social Media, Daniel Vallejo-Aldana, A. Pastor López-Monroy and Esaú Villatoro-Tello, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022

[URL]

UNSL at eRisk 2022: Decision policies with history for early classification, Juan Martín Loyola, Horacio Thompson, Sergio Burdisso and Marcelo Errecalde, in: CEUR Workshop Proceedings, 2022

[URL]

IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, Martin Fajcik, Muskaan Singh, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek and Pavel Smrz, Idiap-RR-12-2022

IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, Sergio Burdisso, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Martin Fajcik, Muskaan Singh, Pavel Smrz and Petr Motlicek, Idiap-RR-13-2022

A surrogate gradient spiking baseline for speech command recognition, Alexandre Bittar and Philip N. Garner, in: Frontiers in Neuroscience, 2022

[DOI]
[URL]

Local estimation of parametric point spread functions in thermal images via convolutional neural networks, Florian Piras, Edouard De Moura Presa, Peter Wellig and Michael Liebling, in: SPIE sensors + imaging, Target and Background Signatures VIII, Berlin, Germany, pages 1227009 1--8, SPIE, 2022

[DOI]
[URL]

Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese, Kurt Micallef, Albert Gatt, Marc Tanti, Lonneke van der Plas and Claudia Borg, in: Proceedings of the workshop on Deep Learning for Low-Resource NLP, 2022

[URL]

SPEECH MODELING USING SPARSE AUTOENCODERS, Selen Hande Kabil and Hervé Bourlard, Idiap-RR-11-2022

A Systems Approach Towards Remote Health-Monitoring in Older Adults: Introducing a Zero-Interaction Digital Exhaust, Narayan Schütz, Samuel E. J. Knobel, Michael Single, Bruno Pais, Valérie Santschi, Daniel Gatica-Perez, Philipp Buluschek, Prabitha Urwyler, Stephan M. Gerber, René M. Müri, Urs P. Mosimann, Hugo Saner and Tobias Nef, in: npj Digital Medicine, 5(Article 116), 2022

An anomaly detection approach for backdoored neural networks: face recognition as a case study, Alexander Unnervik and Sébastien Marcel, in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), Darmstadt, Germany, 2022

On the detection of morphing attacks generated by GANs, Laurent Colbois and Sébastien Marcel, in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), 2022

HyperMixer: An MLP-based Green AI Alternative to Transformers, Florian Mai, Arnaud Pannatier, Fabio Fehr, Haolin Chen, François Marelli, Francois Fleuret and James Henderson, in: arxiv, 2022

A Variational AutoEncoder for Transformers with Nonparametric Variational Information Bottleneck, James Henderson and Fabio Fehr, in: arxiv, 2022

[DOI]
[URL]

DHgeN: Automated Generation of District Heating Network Layouts for Feasibility Studies, Giuseppe Peronato and Jérôme Kämpf, in: -, 2022

Fairness Index Measures to Evaluate Bias in Biometric Recognition, Ketan Kotwal and Sébastien Marcel, in: International Conference on Pattern Recognition Workshops, 2022

Towards Lifelong Human Assisted Speaker Diarization, Meysam Shamsi, Anthony Larcher, Loïc Barrault, Sylvain Meignier, Yevhenii Prokopalo, Marie Tahon, Ambuj Mehrish, Simon Petitrenaud, Olivier Galibert, Samuel Gaist, André Anjos, Sébastien Marcel and Marta Costa-Jussà, in: Computer Speech & Language, 2022

[DOI]
[URL]

Reactive Anticipatory Robot Skills with Memory, Hakan Girgin, Julius Jankowski and Sylvain Calinon, in: The International Symposium on Robotics Research, 2022

Modeling and Optimal Control of the Open Torque-Controlled Quadruped Robot Solo-12, Niederberger Adi, Idiap-Com-02-2022

On the detection of morphing attacks generated by GANs, Laurent Colbois and Sébastien Marcel, Idiap-RR-07-2022

An anomaly detection approach for backdoored neural networks: face recognition as a case study, Alexander Unnervik and Sébastien Marcel, Idiap-RR-08-2022

[URL]

Face Anthropometry Aware Audio-visual Age Verification, Pavel Korshunov and Sébastien Marcel, in: ACM Multimedia, 2022

Learning to Guide Online Multi-Contact Receding Horizon Planning, Jiayi Wang, Teguh Santoso Lembono, Sanghyun Kim, Sylvain Calinon, Sethu Vijayakumar and Steve Tonneau, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022

Pulmonary Tuberculosis Screening from Radiological Signs on Chest X-Ray Images Using Deep Models, Geoffrey Raposo, Anete Trajman and André Anjos, in: Union World Conference on Lung Health, The Union, 2022

Classifying the Social Media Author Profile Through a Multimodal Representation, Miguel Á. Álvarez-Carmona, Esaú Villatoro-Tello, Luis Villaseñor Pineda and Manuel Montes-y-Gómez, in: Intelligent Technologies: Concepts, Applications, and Future Directions. Studies in Computational Intelligence, Springer, 2022

[DOI]
[URL]

drozBot: Using Ergodic Control to Draw Portraits, Tobias Löw, Jérémy Maceiras and Sylvain Calinon, in: IEEE Robotics and Automation Letters:7, 2022

[DOI]
[URL]

Memory of Motion for Initializing Optimization in Robotics, Teguh Santoso Lembono, École Polytechnique Fédérale de Lausanne, 2022

Data Privacy Concerns as a Source of Resistance to Complete Mobile Data Collection Tasks via a Smartphone App, Caroline Roberts, Jessica M. E. Herzing, Jimena Sobrino Piazza, Philip Abbet and Daniel Gatica-Perez, in: Journal of Survey Statistics and Methodology, 2022

Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, Esaú Villatoro-Tello, Srikanth Madikeri, Petr Motlicek, Aravind Ganapathiraju and Alexei V. Ivanov, in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022

[DOI]

A Comparative Study Of Simulation Tools To Model The Solar Irradiation On Building Façades, Martin Thebault, Benjamin Govehovitch, Karine Bouty, Cyril Caliot, Raphaël Compagnon, Gilles Desthieux, Matteo Formolli, Stéphanie Giroux-Julien, Victor Guillot, Ellis Herman, Jérôme Kämpf, Jouri Kanters, Gabriele Lobaccaro, Christophe Ménézo, Giuseppe Peronato and Arnkell Jonas Petersen, in: Proceedings of SWC 2021: ISES Solar World Congress, ISES, 2021

[DOI]
[URL]

Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering, Eklavya Sarkar, RaviShankar Prasad and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2022

Conversational Speech Recognition Needs Data? Experiments with Austrian German, Julian Linke, Philip N. Garner, Gernot Kubin and Barbara Schuppler, in: Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, pages 4684--4691, 2022

[URL]

Using synthetic fingerprint images to test the performance of an AFIS system, Alessandro Costa, Université de Lausanne, 2022

Autoencoders Reloaded, Hervé Bourlard and Selen Hande Kabil, in: Springer Biological Cybernetics, 2022

[DOI]
[URL]

Adversarial-free speaker identity-invariant representation learning for automatic dysarthric speech classification, Parvaneh Janbakhshi and Ina Kodrasi, in: Annual Conference of the International Speech Communication Association, 2022

Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech, Cecile Fougeron, Nicolas Audibert, Ina Kodrasi, Parvaneh Janbakhshi, Michaela Pernon, Nathalie Leveque, Stephanie Borel, Marina Laganaro, Hervé Bourlard and Frederic Assal, in: Annual Conference of the International Speech Communication Association, pages 2188-2192, 2022

[DOI]

Perceptual classification of motor speech disorders: the role of severity, speech task, and listener's expertise, Michaela Pernon, Frederic Assal, Ina Kodrasi and Marina Laganaro, in: Journal of Speech, Language, and Hearing Research, 2022

Sensing Eating Events in Context: A Smartphone-Only Approach, Lakmal Buddika Meegahapola, Wageesha Bangamuarachchi, Anju Chamantha, Salvador Ruiz-Correa, Indika Perera and Daniel Gatica-Perez, in: IEEE Access, 10, 2022

[DOI]
[URL]

How Did Europe’s Press Cover Covid-19 Vaccination News? A Five-Country Analysis, David Alonso del Barrio and Daniel Gatica-Perez, in: MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, 2022

[DOI]
[URL]

Borrowing from yourself: Faster future video segmentation with partial channel update, Evann Courdier and Francois Fleuret, in: International Conference on Pattern Recognition, 2022

Saving energy by maximising daylight and minimising the impact on occupants: an automatic lighting system approach, Michael Papinutto, Roberto Boghetti, Moreno Colombo, Chantal Basurto, Kornelius Reutter, Denis Lalanne, Jérôme Kämpf and Julien Nembrini, in: Energy and Buildings, 2022

[DOI]

Visually Grounded Interpretation of Noun-Noun Compounds in English, Inga Lang, Lonneke van der Plas, Malvina Nissim and Albert Gatt, in: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, Association for Computational Linguistics, 2022

State-of-the-art retinal vessel segmentation with minimalistic models, Adrian Galdran, André Anjos, José Dolz, Hadi Chakor, Hervé Lombaert and Ismail Ben Ayed, in: Nature Scientific Reports, 12(6174), 2022

[DOI]

A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings, Anshul Gupta, Samy Tafasca and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Hierarchical Multi-task learning framework for Isometric-Speech Language Translation, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: ACL, 2022

IDIAP Submission@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text, Muskaan Singh and Petr Motlicek, in: ACL, 2022

IDIAP Submission@LT-EDI-ACL2022: Homophobia/Transphobia Detection in social media comments, Muskaan Singh and Petr Motlicek, in: ACL Proceedings, 2022

IDIAP Submission@LT-EDI-ACL2022 : Hope Speech Detection for Equality, Diversity and Inclusion, Muskaan Singh and Petr Motlicek, in: ACL, 2022

IDIAP_TIET@LT-EDI-ACL2022 : Hope Speech Detection in Social Media using Contextualized BERT with Attention Mechanism, Deepanshu Khanna, Muskaan Singh and Petr Motlicek, in: ACL, 2022

PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models, Rabeeh Karimi Mahabadi, Luke Zettlemoyer, James Henderson, saeidi marzieh, lambert mathias, Veselin Stoyanov and Majid Yazdani, in: ACL, 2022

The societal and ethical relevance of computational Creativity, Michele Loi, Eleonora Viganò and Lonneke van der Plas, in: Proceedings of the International Conference on Computational Creativity, 2020

Compositionality in English deverbal compounds:The role of the head, Gianina Iordachioaia, Lonneke van der Plas and Glorianna Jagfeld, in: The role of constituents in multiword expressions. Phraseology and Multiword Expressions, Language Science Press, Berlin, 2020

Compound or phrase or in between? Testing Linguistic Criteria for Compoundhood in English, Patrick Ziering and Lonneke van der Plas, in: Word Structure, 13(2):250-281, 2020

Biomarker identification using dynamic time warping analysis: a longitudinal cohort study of COVID-19 patients in a UK tertiary hospital, Hannah Burke, Anna Freeman, Paul O'Reagan, Oskar Wysocki, Andre Freitas and et al., in: BMJ Open, 2022

Voyager: Data Discovery for Onboarding in Data Science, Alex Bogatu, Norman Paton, Mark Douthwaite and Andre Freitas, in: 37th IEEE International Conference on Data Engineering (ICDE), 2022

Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective, Edoardo Manino, Julia Rozanova, Danilo Carvalho, Andre Freitas and Lucas Cordeiro, in: Findings of the ACL, 2022

To be or not to be an Integer? Encoding Variables for Mathematical Text, Deborah Mendes, Mokanarangan Thayaparan, Marco Valentino, Julia Rozanova and Andre Freitas, in: Findings of the ACL, 2022

Establishment of CORONET, COVID-19 Risk in Oncology Evaluation Tool, to Identify Cancer Patients at Low Versus High Risk of Severe Complications of COVID-19 Infection Upon Presentation to Hospital, Rebecca Lee, Oskar Wysocki, Andre Freitas and et al., in: Clinical Cancer Informatics, 2022

Active Learning by Feature Mixing, Amin Parvaneh, Ehsan Abbasnejad, Damien Teney, Reza Haffari, Anton van den Hengel and Javen Qinfeng Shi, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

GeoNeRF: Generalizing NeRF with Geometry Priors, Mohammad Mahdi Johari, Yann Lepoittevin and Francois Fleuret, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2022

[URL]

End-to-End Accented Speech Recognition, Thibault Viglino, Petr Motlicek and Milos Cernak, in: International Conference on Speech and Language Processing, Interspeech, ISCA, Graz, Austria, pages 2140-2144, 2019

[DOI]

Generating Exact Lattices in The WFST Framework, Daniel Povey, Mirko Hannemann, Gilles Boulianne, Lukas Burget, Arnab Ghoshal, Milos Janda, Martin Karafiat, Stefan Kombrink, Petr Motlicek, Yanmin Qian, Korbinian Riedhammer, Karel Vesely and Ngoc Thang Vu, in: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing., The 37th International Conference on Acoustics, Speech, and Signal Processing, Kyoto, JP, Kyoto, Japan, pages 4213-4216, IEEE Signal Processing Societ, 2012

[DOI]

Efficient Depth-based Deep Learning Methods for Multi-Party Pose Estimation, Angel Martínez-González, École polytechnique fédérale de Lausanne, 2021

[DOI]

Gradient-based Methods for Deep Model Interpretability, Suraj Srinivas, École polytechnique fédérale de Lausanne, 2021

[DOI]

Learning strategies and representations for intuitive robot learning from demonstration, Thibaut Kulak, EPFL, 2021

Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, Esaú Villatoro-Tello, Srikanth Madikeri, Petr Motlicek, Aravind Ganapathiraju and Alexei V. Ivanov, Idiap-RR-06-2022

A two-step approach to leverage contextual data: speech recognition in air-traffic communications, Nigmatulina Iuliia, Juan Zuluaga-Gomez, Amrutha Prasad, Seyyed Saeed Sarfjoo and Petr Motlicek, in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6282-6286, IEEE, 2022

[DOI]
[URL]

Are GAN-based Morphs Threatening Face Recognition?, Eklavya Sarkar, Pavel Korshunov, Laurent Colbois and Sébastien Marcel, in: International Conference on Acoustics, Speech and Signal Processing, 2022

Custom attribution loss for improving generalization and interpretability of deepfake detection, Pavel Korshunov, Anubhav Jain and Sébastien Marcel, in: International Conference on Acoustics, Speech, and Signal Processing, 2022

Domain-Adversarial Based Model with Phonological Knowledge for Cross-Lingual Speech Recognition, Qingran Zhan, Xiang Xie, Hu Chenguang, Juan Zuluaga-Gomez, Jing Wang and Haobo Cheng, in: Electronics, 10(24):1-15, 2021

[DOI]
[URL]

Experimental investigation on STFT phase Representations for deep learning-based dysarthric speech detection, Parvaneh Janbakhshi and Ina Kodrasi, in: International Conference on Acoustics, Speech, and Signal Processing, 2022

Domain-Specific Adaptation of CNN for Detecting Face Presentation Attacks in NIR, Ketan Kotwal, Sushil Bhattacharjee, Philip Abbet, Zohreh Mostaani, Huang Wei, Xu Wenkang, Zhao Yaxi and Sébastien Marcel, in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2022

From Key Positions to Optimal Basis Functions for Probabilistic Adaptive Control, Julius Jankowski, Mattia Racca and Sylvain Calinon, in: IEEE Robotics and Automation Letters, 2022

An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, Marco Ewerton, Sylvain Calinon and Jean-Marc Odobez, in: Proc. of Workshop on Emerging paradigms for robotic manipulation: from the lab to the productive world, ICRA, 2021

Modeling Source and System characteristics using Zero Frequency Filtering for Voice Activity Detection, Eklavya Sarkar, RaviShankar Prasad and Mathew Magimai-Doss, Idiap-Internal-RR-80-2021

Analysis of Vector Representations in Maintenance Logs in the Industry: Towards an Information Retrieval System, Jesús Roberto Enrique León Carmona, Samuel González-López, Esaú Villatoro-Tello and Jesús Miguel García-Gorrostieta, in: Journal of Research in Computing Science, 2021

Topic analysis and tracking from Mexico's President daily press briefing, Luis Armando Arias Romero, Gabriela Ramírez-de-la-Rosa and Esaú Villatoro-Tello, in: Journal of Research in Computing Science, 2021

Improving Generalization of Deepfake Detection with Data Farming and Few-Shot Learning, Pavel Korshunov and Sébastien Marcel, in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021

Automatic processing pipeline for collecting and annotating air-traffic voice communication data, Martin Kocour, Karel Vesely, Igor Szoke, Santosh Kesiraju, Juan Zuluaga-Gomez, Blatt Alexander, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek and et al., in: Proceedings of 9th OpenSky Symposium 2020, OpenSky Network, Brussels, Belgium, pages 1-9, MDPI, 2021

Multi-Adversarial Learning for Cross-Lingual Word Embeddings, Haozhou Wang, James Henderson and Paola Merlo, in: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Online, pages 463-472, 2021

Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning, Christos Theodoropoulos, James Henderson, Andrei Catalin Coman and Marie-Francine Moens, in: Proceedings of the 25th Conference on Computational Natural Language Learning, Online, pages 337-348, Association for Computational Linguistics, 2021

DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation, Mohammad Mahdi Johari, Camilla Carta and Francois Fleuret, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6039-6048, 2021

[URL]

Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement, Alireza Mohammadshahi and James Henderson, in: Transactions of the Association for Computational Linguistics (2021), 9:18, 2021

[DOI]
[URL]

Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation, Florian Mai and James Henderson, Idiap-RR-21-2021

ParsiNLU: A Suite of Language Understanding Challenges for Persian, Daniel Khashabi, Arman Cohan, Siamak Shakeri, Pedram Hosseini, Pouya Pezeshkpour, Marzieh Bitaab, Faeze Brahman, Sarik Ghazarian, Arman Kabiri, Rabeeh Karimi Mahabadi, Omid Memarrast, Ahmadreza Mosallanezhad, Erfan Noury, Shahab Raji, Mohammad Sadegh Rasooli, Sepideh Sadeghi, Erfan Sadeqi Azer, Niloofar Safi Samghabadi, Mahsa Shafaei, Saber Sheybani, Ali Tazarv and Yadollah Yaghoobzadeh, in: TACL, 2021

Compacter: Efficient Low-Rank Hypercomplex Adapter Layers, Rabeeh Karimi Mahabadi, James Henderson and Sebastian Ruder, in: NeurIPS, 2021

Fairness in Biometrics: a figure of merit to assess biometric verification systems, Tiago de Freitas Pereira and Sébastien Marcel, in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021

[DOI]

What is SemEval evaluating? A Systematic Analysis of Evaluation Campaigns in NLP, Oskar Wysocki, Malina Florea, Donal Landers and Andre Freitas, in: EMNLP, 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

Longitudinal characterisation of haematological and biochemical parameters in cancer patients prior to and during COVID-19 reveals features associated with outcome, Rebecca Lee, Oskar Wysocki, Andre Freitas and et al., in: ESMO Open, 2021

Wave comparisons of clinical characteristics and outcomes of COVID-19 admissions - Exploring the impact of treatment and strain dynamics, Anna Freeman, Alastair Watson, Paul O'Reagan, Oskar Wysocki, Hannah Burke, Andre Freitas and et al., in: Journal of Clinical Virology, 2022

Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning, Matthias Kleinert, Hartmut Helmke, Shruthi Shetty, Oliver Ohneiser, heiko Ehr, Amrutha Prasad, Petr Motlicek and Julia Harfmann, in: 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, pages 1-9, IEEE, 2021

[DOI]

Number and quality of diagrams in scholarly publications is associated with number of citations, Guy Marshall, Caroline Jay and Andre Freitas, in: Diagrams, 2021

Structuralist analysis for neural network system diagrams, Guy Marshall, Caroline Jay and Andre Freitas, in: Diagrams, 2021

Scholarly AI system diagrams as an access point to mental models, Guy Marshall, Caroline Jay and Andre Freitas, in: Diagrams, 2021

Similarity-Based Equational Inference in Physics, Jordan Meadows and Andre Freitas, in: Physics Review Research, 2021

Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders, Giangiacomo Mercatali and Andre Freitas, in: The 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Hybrid Autoregressive Inference for Scalable Multi-hop Explanation Regeneration, Marco Valentino, Mokanarangan Thayaparan, Deborah Mendes and Andre Freitas, in: Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety, Hartmut Helmke, Matthias Kleinert, Shruthi Shetty, Oliver Ohneiser, heiko Ehr, Hörður Arilíusson, Teodor S. Simiganoschi, Amrutha Prasad, Petr Motlicek, Karel Vesely, Karel Ondřej, Pavel Smrz, Julia Harfmann and Christian Windisch, in: Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021), The United States Federal Aviation Administration (FAA), EUROCONTROL, pages 10, 2021

[URL]

On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning, Marc Tanti, Lonneke van der Plas, Claudia Borg and Albert Gatt, in: Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

Automated and unbiased discrimination of ALS from control tissue at single cell resolution, Cathleen Hagemann, Giulia E. Tyzack, Doaa M. Taha, Helen Devine, Linda Greensmith, Jia Newcombe, Rickie Patani, Andrea Serio and Raphaelle Luisier, in: Brain Pathology, 2021

Cytoplasmic cleavage of IMPA1 3' UTR is necessary for maintaining axon integrity, Catia Andreassi, Raphaelle Luisier, Hamish Crerar, Marousa Darsinou, Sasja Blokzijl-Franke, Lenn Tchern, Nicholas M. Luscombe, Giovanni Cuda, Marco Gaspari, Adolfo Saiardi and Antonella Riccio, in: Cell Reports, 2021

Aberrant cytoplasmic intron retention is a blueprint for RNA binding protein mislocalization in VCP-related amyotrophic lateral sclerosis, Giulia E. Tyzack, Jacob Neeves, Hamish Crerar, Pierre Klein, Oliver Ziff, Doaa M. Taha, Raphaelle Luisier, Nicholas M. Luscombe and Rickie Patani, in: Brain, 2021

Image-based deep learning reveals the responses of human motor neurons to stress and VCP-related ALS, Colombine Verzat, Jasmine Harley, Rickie Patani and Raphaelle Luisier, in: Neuropathology and Applied Neurobiology, 2021

A Comprehensive Evaluation on Multi-channel Biometric Face Presentation Attack Detection, Anjith George, David Geissbuhler and Sébastien Marcel, Idiap-RR-02-2022

Robust Face Presentation Attack Detection with Multi-channel Neural Networks, Anjith George and Sébastien Marcel, Idiap-RR-03-2022

Bilateral Teleoperation with Object-Adaptive Mapping, Xiao Gao, J. Silverio, Sylvain Calinon, Miao Li and Xiaohui Xiao, in: Complex & Intelligent Systems, 2021

Learning from Demonstration using Products of Experts: Applications to Manipulation and Task Prioritization, E. Pignat, J. Silverio and Sylvain Calinon, in: International Journal of Robotics Research, 41(2):163-188, 2022

Motion Mappings for Continuous Bilateral Teleoperation, Xiao Gao, J. Silverio, E. Pignat, Sylvain Calinon, Miao Li and Xiaohui Xiao, in: IEEE Robotics and Automation Letters, 6(3):5048-5055, 2021

Sequential Robot Imitation Learning from Observations, A. K. Tanwani, A. Yan, J. Lee, Sylvain Calinon and K. Goldberg, in: International Journal of Robotics Research (IJRR), 2021

Tensor-variate mixture of experts for proportional myographic control of a robotic hand, N. Jaquier, R. Haschke and Sylvain Calinon, in: Robotics and Autonomous Systems, 142:103812, 2021

Editorial: Artificial Intelligence and Human Movement in Industries and Creation, K. Dimitropoulos, P. Daras, S. Manitsaris, F. F. Leymarie and Sylvain Calinon, in: Frontiers in Robotics and AI, 8:712521, 2021

Towards an Engagement-Aware Attentive Artificial Listener for Multi-Party Interactions, Catharine Oertel, Patrik Jonell, Dimosthenis Kontogiorgos, Kenneth Alberto Funes Mora, Jean-Marc Odobez and Joakim Gustafson, in: Frontiers in Robotics and AI, 8:189, 2021

[DOI]
[URL]

Multimodal Neural Machine Translation System for English to Bengali, Shantipriya Parida, Subhadarshi Panda, Satya Prakash Biswal, Ketan Kotwal, Arghyadeep Sen, Satya Ranjan Dash and Petr Motlicek, in: Proceedings of the First Workshop on Multimodal Machine Translation for Low Resource Languages (MMTLRL 2021), Online (Virtual Mode), pages 31--39, INCOMA Ltd., 2021

[URL]

Automatic Dialect Detection for Low Resource Santali Language, Sunil Kumar Sahoo, Brojo Kishore Mishra, Shantipriya Parida, Satya Ranjan Dash, Jatindra Nath Besra and Esaú Villatoro-Tello, in: Proceeding of International Conference on Information Technology (OCIT), 2021

Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), Shantipriya Parida, Subhadarshi Panda, Amulya Ratna Dash, Esaú Villatoro-Tello, A. Seza Dogruöz, Rosa M. Ortega-Mendoza, Amadeo Hernández, Yashvardhan Sharma and Petr Motlicek, in: Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, pages 218–223, Association for Computational Linguistics, 2021

[DOI]
[URL]

Unshuffling data for improved generalization in visual question answering, Damien Teney, Ehsan Abbasnejad and Anton van den Hengel, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021

Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models, Zheyuan Liu, Cristian Rodriguez-Opazo, Damien Teney and Stephen Gould, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021

Beyond question-based biases: Assessing multimodal shortcut learning in visual question answering, Corentin Dancette, Remi Cadene, Damien Teney and Matthieu Cord, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021

Perspectives and limitations of visible-thermal image pair synthesis via generative adversarial networks, Danick Panchard, François Marelli, Edouard De Moura Presa, Peter Wellig and Michael Liebling, in: Security + Defence, Target and Background Signatures VII, Proc. of SPIE, online only, pages 1186509-1--1186509-8, SPIE, 2021

[DOI]
[URL]

Zurich Like New: Analyzing Open Urban Multimodal Data, Marcel Granero-Moya, Thanh-Trung Phan and Daniel Gatica-Perez, in: Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data, 2021

Professional YouTubers’ health videos as research material: Formulating a multi-method design in health psychology, María del Río Carral, Lucia Volpato, Chloé Michoud, Thanh-Trung Phan and Daniel Gatica-Perez, in: Methods in Psychology, Special Issue on Innovations in Qualitative Research, 5, 2021

A Sensor-Driven Visit Detection System in Older Adults’ Homes: Towards Digital Late-Life Depression Marker Extraction, Narayan Schütz, Angela Botros, Sami Ben Hassen, Hugo Saner, Philipp Buluschek, Prabitha Urwyler, Bruno Pais, Valérie Santschi, Daniel Gatica-Perez, René M. Müri and Tobias Nef, in: IEEE Journal of Biomedical And Health Informatics, 26(4):1560-1569, 2021

[DOI]
[URL]

Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, Mael Fabien, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10--13, 2021

[DOI]

Open-Set Speaker Identification pipeline in live criminal investigations, Mael Fabien and Petr Motlicek, in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021

ROXSD: a Simulated Dataset of Communication in Organized Crime, Hoang H. Nguyen, Mael Fabien, Petr Motlicek, Shantipriya Parida and Kvetoslav Maly, in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021

Extreme Learning Machines with feature selection using GA for effective prediction of fetal heart disease: A Novel Approach, Debjani Panda, Divyajyoti Panda, Satya Ranjan Dash and Shantipriya Parida, in: Informatica, 45(3), 2021

[DOI]
[URL]

Optimization of robot configurations for motion planning in industrial riveting, Hakan Girgin, Teguh Santoso Lembono, Radu Cirligeanu and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021

Optimal Control Combining Emulation and Imitation to Acquire Physical Assistance Skills, Amirreza Razmjoo, Teguh Santoso Lembono and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021

Implementation of machine learning techniques for the quasi real-time blind and electric lighting optimization in a controlled experimental facility, Chantal Basurto, Roberto Boghetti, Moreno Colombo, Michael Papinutto, Julien Nembrini and Jérôme Kämpf, in: Journal of Physics: Conference Series, IOP Publishing, 2021

[DOI]
[URL]

Machine learning techniques for the daylight and electric lighting performance predictions, Chantal Basurto, Oliver Paul and Jérôme Kämpf, in: Proceedings of Building Simulation 2021, 2021

Trajectory Prediction with Compressed 3D Environment Representation using Tensor Train Decomposition, Lara Brudermuller, Teguh Santoso Lembono, Suhan Shetty and Sylvain Calinon, in: International Conference on Advanced Robotics, 2021

Social Robot Co-Design Canvases: A Participatory Design Framework, Minja Axelsson, Raquel Oliveira, Mattia Racca and V. Kyrki, in: ACM Transactions on Human-Robot Interaction, 11(1), 2022

[DOI]
[URL]

Application of Urban Scale Energy Modelling and Multi-Objective Optimization Techniques for Building Energy Renovation at District Scale, Fahad Haneef, Giovanni Pernigotto, Andrea Gasparella and Jérôme Kämpf, in: Sustainability, 13(20), 2021

[DOI]
[URL]

BERTraffic: A Robust BERT-Based Approach for Speaker Change Detection and Role Identification of Air-Traffic Communications, Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek, Oliver Ohneiser and Hartmut Helmke, Idiap-RR-15-2021

District heating network modelling for future integration of solar thermal energy, Clément Dromart, Loïc Puthod, Jérôme Kämpf and Diane von Gunten, in: Journal of Physics: Conference Series, pages 012089, IOP Publishing, 2021

[DOI]

Adjustable Deterministic Pseudonymization of Speech, S. Pavankumar Dubagunta, Rob J. J. H. van Son and Mathew Magimai-Doss, in: Computer, Speech & Language, 72, 2022

[DOI]

An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: IEEE Automatic Speech Recognition and Understanding Workshop, Colombia, Cartagena, IEEE, 2021

Novel Methods for Incorporating Prior Knowledge for Automatic Speech Assessment, S. Pavankumar Dubagunta, École polytechnique fédérale de Lausanne (EPFL), 2021

Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers, Angel Martínez-González, Michael Villamizar and Jean-Marc Odobez, in: International Conference in Computer Vision - Workshops, 2021

Classifier Implementation for Spontaneous EEG Activity during Schizophrenic Psychosis, Rekha Sahu, Satya Ranjan Dash, Lleuvelyn A Cacha, Roman R Poznanski and Shantipriya Parida, in: Computacion y Sistemas (CyS), 25(3), 2021

[URL]

Fusion of Acoustic and Linguistic Information Using Supervised Autoencoder for Improved Emotion Recognition, Bogdan Vlasenko, RaviShankar Prasad and Mathew Magimai-Doss, in: 2nd Multimodal Sentiment Analysis Challenge (MuSe '21), October 24, 2021, Virtual Event, China, 2021

[DOI]

Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, Esaú Villatoro-Tello, Gabriela Ramírez-de-la-Rosa, Daniel Gatica-Perez, Mathew Magimai-Doss and Héctor Jiménez-Salazar, in: Proceedings of the 2021 International Conference on Multimodal Interaction, ACM, 2021

[DOI]

Development of a lung segmentation algorithm for analog imaged chest X-Ray: preliminary results, Matheus A. Renzo, Natália Fernandez, André A. Baceti, Natanael Nunes de Moura Junior and André Anjos, in: XV Brazilian Congress on Computational Intelligence, Joinville, Brazil, 2021

[URL]

Vein Enhancement with Deep Auto-Encoders to improve Finger Vein Recognition, Victor Bros, Ketan Kotwal and Sébastien Marcel, in: Biometrics Special Interest Group (BIOSIG 2021), 2021

Probabilistic Iterative LQR for Short Time Horizon MPC, Teguh Santoso Lembono and Sylvain Calinon, in: International Conference on Intelligent Robots and Systems, pages 579-585, 2021

[DOI]

Multimodal Neural Machine Translation System for English to Bengali, Shantipriya Parida, Subhadarshi Panda, Satya Prakash Biswal, Ketan Kotwal, Arghyadeep Sen, Satya Ranjan Dash and Petr Motlicek, Idiap-RR-13-2021

Multi-channel Face Presentation Attack Detection Using Deep Learning, Anjith George and Sébastien Marcel, in: Deep Learning-Based Face Analytics, Springer International Publishing, 2021

Improving Generalization of Deepfake Detection by Training for Attribution, Anubhav Jain, Pavel Korshunov and Sébastien Marcel, in: International Workshop on Multimedia Signal Processing, 2021

Deep Learning Approaches for Auditory Perception in Robotics, Weipeng He, École polytechnique fédérale de Lausanne, 2021

Adjustable Deterministic Pseudonymization of Speech, S. Pavankumar Dubagunta, Rob Van Son and Mathew Magimai-Doss, Idiap-RR-12-2021

Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, S. Pavankumar Dubagunta, Edoardo Moneta, Eleni Theocharopoulos and Mathew Magimai-Doss, Idiap-RR-11-2021

Modeling and Inferring Attention between Humans or for Human-Robot Interactions, Remy Siegfried, Ecole Polytechnique Federale de Lausanne, 2021

[DOI]
[URL]

Supervised Speech Representation Learning for Parkinson's Disease Classification, Parvaneh Janbakhshi and Ina Kodrasi, in: ITG Conference on Speech Communication, 2021

Overview of the 8th Workshop on Asian Translation, Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda and Sadao Kurohashi, in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 1--45, Association for Computational Linguistics, 2021

[URL]

NLPHut's Participation at WAT2021, Shantipriya Parida, Subhadarshi Panda, Ketan Kotwal, Amulya Ratna Dash, Satya Ranjan Dash, Yashvardhan Sharma, Petr Motlicek and Ondrej Bojar, in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 146--154, Association for Computational Linguistics, 2021

[URL]

Applying Attention-Based Models for Detecting Cognitive Processes and Mental Health Conditions, Esaú Villatoro-Tello, Shantipriya Parida, Sajit Kumar and Petr Motlicek, in: Cognitive Computation:18, 2021

[DOI]
[URL]

Active tuberculosis detection from frontal chest X-ray images, Geoffrey Raposo, Idiap-Com-01-2021

[URL]

Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: Proceedings of Interspeech, 2021

[DOI]

Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Oliver Ohneiser, Hartmut Helmke, Seyyed Saeed Sarfjoo and Nigmatulina Iuliia, Idiap-RR-22-2021

LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021

[URL]

Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of Interspeech, 2021

[URL]

Examining the Social Context of Alcohol Drinking in Young Adults with Smartphone Sensing, Lakmal Buddika Meegahapola, Florian Labhart, Thanh-Trung Phan and Daniel Gatica-Perez, in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 5(3):26, 2021

[DOI]

Rethinking the Role of Gradient-based Attribution Methods for Model Interpretability, Suraj Srinivas and Francois Fleuret, in: International Conference on Learning Representations, 2021

Improving callsign recognition with air-surveillance data in air-traffic communication, Nigmatulina Iuliia, Rudolf Braun, Juan Zuluaga-Gomez and Petr Motlicek, Idiap-RR-20-2021

[URL]

An Efficient Image-to-Image Translation HourGlass-based Architecture for Object Pushing Policy Learning, Marco Ewerton, Angel Martínez-González and Jean-Marc Odobez, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Face Liveness Detection Competition (LivDet-Face) - 2021, Sandip Purnapatra, Nic Smalt, Keivan Bahmani, Priyanka Das, David Yambay, Amir Mohammadi, Anjith George, Thirimachos Bourlai, Sébastien Marcel and Stephanie Schuckers, in: International Joint Conference on Biometrics, 2021

Robust Unsupervised Gaze Calibration using Conversation and Manipulation Attention Priors, Remy Siegfried and Jean-Marc Odobez, in: ACM Transactions on Multimedia Computing, Communications, and Applications, 18(1):26, 2022

[DOI]
[URL]

PROMPT: Probabilistic Motion Primitives based Trajectory Planning, Tobias Löw, Tirthankar Bandyopadhyay, Jason Williams and Paulo Borges, in: Proceedings of Robotics: Science and Systems, 2021

[DOI]
[URL]

AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: 45th International Conference on Acoustics, Speech, and Signal Processing, Toronto, Canada, pages 7328–7332, 2021

Multi-task Single Channel Speech Enhancement Using Speech Presence Probability As A Secondary Task Training Target, Lei Wang, Jie Zhu and Ina Kodrasi, in: European Signal Processing Conference, EUSIPCO 2021, 2021

An Objective Evaluation Framework for Pathological Speech Synthesis, Bence Halpern, Julian Fritsch, Enno Hermann, Rob Van Son, Odette Scharenborg and Mathew Magimai-Doss, in: Proceedings of ITG Conference on Speech Communication, 2021

Improving Emotional TTS with an Emotion Intensity Input from Unsupervised Extraction, Bastian Schnell and Philip N. Garner, in: 11th ISCA Speech Synthesis Workshop, 2021

[URL]

Boosting of contextual information in ASR for air-traffic call-sign recognition, Martin Kocour, Karel Vesely, Blatt Alexander, Juan Zuluaga-Gomez, Igor Szoke, Jan Cernocky, Dietrich Klakow and Petr Motlicek, in: Interspeech 2021, 2021

Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Karel Vesely, Martin Kocour and Igor Szoke, in: Interspeech 2021, 2021

[URL]

Applying Attention Based Models for Detecting Cognitive Processes and Mental Health Conditions, Esaú Villatoro-Tello, Shantipriya Parida, Sajit Kumar and Petr Motlicek, Idiap-RR-01-2022

Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, Esaú Villatoro-Tello, S. Pavankumar Dubagunta, Julian Fritsch, Gabriela Ramírez-de-la-Rosa, Petr Motlicek and Mathew Magimai-Doss, in: Proceedings of Interspeech 2021, ISCA-International Speech Communication Association 2021, 2021

Handling acoustic variation in dysarthric speech recognition systems through model combination, Enno Hermann and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2021

Trust indicators and explainable AI: A study on user perceptions, Delphine Ribes Lemay, Nicolas Henchoz, Hélène Portier, Lara Defayes, Thanh-Trung Phan, Daniel Gatica-Perez and Andreas Sondereger, in: Proc. Int. Conf. on Human-Computer Interaction, Bari, Italy, 2021

Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks, Rabeeh Karimi Mahabadi, Sebastian Ruder, Dehghani Mostafa and James Henderson, in: ACL, 2021

Variational Information Bottleneck for Effective Low-Resource Fine-Tuning, Rabeeh Karimi Mahabadi, yonatan belinkov and James Henderson, in: ICLR, 2021

Unequivocal cardiac phase sorting from alternating ramp- and pulse-illuminated microscopy image sequences, Olivia Mariani, François Marelli, Christian Jaques, Alexander Ernst and Michael Liebling, in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), pages 868-872, 2021

[DOI]
[URL]

Contactless Sleep Monitoring for Early Detection of Health Deteriorations in Community-Dwelling Older Adults: Exploratory Study, Narayan Schütz, Hugo Saner, Angela Botros, Bruno Pais, Valérie Santschi, Philipp Buluschek, Daniel Gatica-Perez, Prabitha Urwyler, René M. Müri and Tobias Nef, in: JMIR Mhealth Uhealth, 9(6), 2021

Declarative Variables in Online Dating: A Mixed-Method Analysis of a Mimetic-Distinctive Mechanism, Jessica Pidoux, Pascale Kuntz and Daniel Gatica-Perez, in: Proceedings of the ACM on Human-Computer Interaction, 5(CSCW1), 2021

Identification of F1 and F2 in speech using modified zero frequency filtering, RaviShankar Prasad and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2021

On Modeling Glottal Source Information for Phonation Assessment in Parkinson’s Disease, Juan Camilo Vasquez-Correa, Julian Fritsch, Juan Rafael Orozco-Arroyave, Elmar Nöth and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2021

ROXANNE Research Platform: Automate criminal investigations, Mael Fabien, Shantipriya Parida, Dawei Zhu, Petr Motlicek, Aravind Krishnan and Hoang H. Nguyen, in: Interspeech Show and Tell 2021, 2021

Phoneme based Respiratory Analysis of Read Speech, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Proceedings of European Signal Processing Conference (EUSIPCO), 2021

Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: Proceedings of Interspeech 2021, 2021

Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, Esaú Villatoro-Tello, Gabriela Ramírez-de-la-Rosa, Héctor Jiménez-Salazar, Daniel Gatica-Perez and Mathew Magimai-Doss, Idiap-RR-19-2021

Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, Esaú Villatoro-Tello, S. Pavankumar Dubagunta, Julian Fritsch, Gabriela Ramírez-de-la-Rosa, Petr Motlicek and Mathew Magimai-Doss, Idiap-RR-09-2021

Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances, Oliver Ohneiser, Seyyed Saeed Sarfjoo, Hartmut Helmke, Shruthi Shetty, Petr Motlicek, Matthias Kleinert, heiko Ehr and Šarūnas Murauskas, in: Interspeech, 2021

Speech Activity Detection Based on Multilingual Speech Recognition System, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, in: Interspeech, 2021

Supporting Context Monotonicity Abstractions in Neural NLI Models, Julia Rozanova, Deborah Mendes, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: Natural Logic Meets Machine Learning Workshop, 2021

[URL]

On the use of automatically generated synthetic image datasets for benchmarking face recognition, Laurent Colbois, Tiago de Freitas Pereira and Sébastien Marcel, in: International Joint Conference on Biometrics (IJCB 2021), 2021

Ergodic Exploration using Tensor Train: Applications in Insertion Tasks, Suhan Shetty, J. Silverio and Sylvain Calinon, in: IEEE Transactions on Robotics, 38(2):906--921, 2022

[DOI]
[URL]

Does My Representation Capture X? Probe-Ably, Deborah Mendes, Julia Rozanova, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: 59th Annual Meeting of the Association for Computational Linguistics (Demonstration track), 2021

[URL]

Do Natural Language Explanations Represent Valid Logical Arguments? Verifying Entailment in Explainable NLI Gold Standards, Marco Valentino, Ian Pratt-Hartmann and Andre Freitas, in: 14th International Conference on Computational Semantics, 2021

[URL]

Switching Contexts: Transportability Measures for NLP, Guy Marshall, Mokanarangan Thayaparan, Philip Osborne and Andre Freitas, in: 14th International Conference on Computational Semantics, 2021

[URL]

Encoding Explanatory Knowledge for Zero-shot Science Question Answering, Zili Zhou, Marco Valentino, Donal Landers and Andre Freitas, in: 14th International Conference on Computational Semantics, 2021

[URL]

Unification-based Reconstruction of Multi-hop Explanations for Science Questions, Marco Valentino, Mokanarangan Thayaparan and Andre Freitas, in: 16th conference of the European Chapter of the Association for Computational Linguistics, 2021

[URL]

Explainable Inference Over Grounding-Abstract Chains for Science Questions, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: 59th Annual Meeting of the Association for Computational Linguistics (ACL Findings), 2021

On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, Anjith George and Sébastien Marcel, in: International Joint Conference on Biometrics (IJCB 2021), 2021

Supervised Speech Representation Learning for Parkinson's Disease Classification, Parvaneh Janbakhshi and Ina Kodrasi, Idiap-RR-08-2021

BertOdia: BERT pre-training for low resource Odia language, Shantipriya Parida, Satya Prakash Biswal, Biranchi Narayan Nayak, Mael Fabien, Esaú Villatoro-Tello and Petr Motlicek, Idiap-RR-16-2021

NLPHut’s Participation at WAT2021, Shantipriya Parida, Subhadarshi Panda, Ketan Kotwal, Amulya Ratna Dash, Satya Ranjan Dash, Yashvardhan Sharma, Petr Motlicek and Ondrej Bojar, Idiap-RR-10-2021

Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization, Damien Teney, Ehsan Abbasnejad, Simon Lucey and Anton van den Hengel, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

The Theory, Practice, and Ethical Challenges of Designing a Diversity-Aware Platform for Social Relations, Laura Schelenz, Ivano Bison, Matteo Busso, Amalia de Götzen, Daniel Gatica-Perez, Fausto Giunchiglia, Lakmal Buddika Meegahapola and Salvador Ruiz-Correa, in: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, pages 11, ACM, 2021

[DOI]

Ten seconds of my nights: exploring methods to measure brightness, loudness and attendance and their associations with alcohol use from video clips, Florian Labhart, Skanda Muralidhar, Benoit Massé, Lakmal Buddika Meegahapola, Emmanuel Kuntsche and Daniel Gatica-Perez, in: PLOS ONE, 2021

[DOI]

Subjective and objective evaluation of deepfake videos, Pavel Korshunov and Sébastien Marcel, in: The international Conference on Acoustics, Speech, and Signal Processing, 2021

Visual Focus of Attention Estimation in 3D Scene with an Arbitrary Number of Targets, Remy Siegfried and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 9, IEEE, 2021

Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings, Venkata Srikanth Nallanthighal, Zohreh Mostaani, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Neural Networks, 141:211--224, 2021

[DOI]

Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), Shantipriya Parida, Subhadarshi Panda, Amulya Ratna Dash, Esaú Villatoro-Tello, A. Seza Dogruöz, Rosa M. Ortega-Mendoza, Amadeo Hernández, Yashvardhan Sharma and Petr Motlicek, Idiap-RR-07-2021

Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling, Alireza Mohammadshahi and James Henderson, in: Arxiv, 2021

Optics Versus Computation: Influence of Illumination and Reconstruction Model Accuracy in Focal-Plane-Scanning Optical Projection Tomography, François Marelli and Michael Liebling, in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), Nice, France, pages 567-570, IEEE, 2021

[DOI]

Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Karel Vesely, Martin Kocour and Igor Szoke, Idiap-RR-14-2021

[URL]

Explainable Phonology-based Approach for Sign Language Recognition and Assessment, Sandrine Tornay, Ecole Polytechnique Fédérale de Lausanne, 2021

Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:1303-1317, 2021

[DOI]
[URL]

Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-04-2021

Semantic Behavior Analysis of COVID-19 Patients: A Collaborative Framework, Amlan Mohanty, Debasish Kumar Mallick, Shantipriya Parida and Satya Ranjan Dash, in: Machine Learning for Healthcare Applications, John Wiley & Sons, Inc. USA and Scrivener Publishing LLC, USA, 2021

[URL]

A Laser-based Dual-arm System for Precise Control of Collaborative Robots, J. Silverio, G. Clivaz and Sylvain Calinon, in: IEEE International Conference on Robotics and Automation, 2021

Estimating Nonplanar Flow from 2D Motion-blurred Widefield Microscopy Images via Deep Learning, Adrian Shajkofci and Michael Liebling, in: International Symposium on Biomedical Imaging, 2021, 2021

Natural Language Inference over Tables: Enabling Explainable Data Exploration on Data Lakes, Mario Ramirez, Alex Bogatu, Norman Paton and Andre Freitas, in: 18th Extended Semantic Web Conference (ESWC), 2021

[URL]

Explainable Natural Language Reasoning via Conceptual Unification, Marco Valentino, Mokanarangan Thayaparan and Andre Freitas, in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

[URL]

STAR: Cross-modal Statement Representation for Selecting Relevant Mathematical Premises, Deborah Mendes and Andre Freitas, in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

Cost–effective Variational Active Entity Resolution, Alex Bogatu, Norman Paton, Mark Douthwaite, Stuart Davie and Andre Freitas, in: 37th IEEE International Conference on Data Engineering (ICDE), 2021

[URL]

Signal-to-signal neural networks for improved spike estimation from calcium imaging data, Jilt Sebastian, Mriganka Sur, Hema A Murthy and Mathew Magimai-Doss, in: PLoS Computational Biology, 17(3):1--19, 2021

[DOI]

A Bayesian Interpretation of the Light Gated Recurrent Unit, Alexandre Bittar and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

[DOI]

Accurate Nod and 3D Gaze Estimation for Social Interaction Analysis, Yu Yu, EDEE, EPFL, 2020

Whole Body Model Predictive Control with a Memory of Motion:Experiments on a Torque-Controlled Talos, Ewen Dantec, Rohan Budhiraja, Adria Roig, Teguh Santoso Lembono, Guilhem Saurel, Olivier Stasse, Pierre Fernbach, Steve Tonneau, Sethu Vijayakumar, Sylvain Calinon, Michel Taix and Nicolas Mansard, in: IEEE International Conference on Robotics and Automation, 2021

Learning Constrained Distributions of Robot Configurations with Generative Adversarial Network, Teguh Santoso Lembono, Emmanuel Pignat, Julius Jankowski and Sylvain Calinon, in: IEEE Robotics and Automation Letters, 2021

A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET, Rudolf Braun, Srikanth Madikeri and Petr Motlicek, in: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Toronto, Ontario, Canada, 2021

Learning Optimal Impedance Control During Complex 3D Arm Movements, A. Naceri, T. Schumacher, Q. Li, Sylvain Calinon and H. Ritter, in: IEEE Robotics and Automation Letters (RA-L), 6(2):1248-1255, 2021

[DOI]
[URL]

Cross Modal Focal Loss for RGBD Face Anti-Spoofing, Anjith George and Sébastien Marcel, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

Computational methods for live heart imaging with speed-constrained microscopes, Olivia Mariani, EPFL, 2021

Probabilistic Adaptive Control for Robust Behavior Imitation, Julius Jankowski, Hakan Girgin and Sylvain Calinon, in: IEEE Robotics and Automation Letters, 2021

Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech, Ina Kodrasi, Michaela Pernon, Marina Laganaro and Hervé Bourlard, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention, Melika Behjati and James Henderson, in: Transactions on Machine Learning Research, 2023

[URL]

Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, Mael Fabien, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, Idiap-RR-01-2023

[URL]

Evaluation of Urban Scale Building Energy-Use Models and Tools – Application for the City of Fribourg, Switzerland, Valeria Todeschi, Roberto Boghetti, Jérôme Kämpf and Guglielmina Mutani, in: Sustainability, 13(7), 2021

[DOI]
[URL]

Discourse Phenomena in Machine Translation, Lesly Miculicich, École polytechnique fédérale de Lausanne, 2020

One More Bite? Inferring Food Consumption Level of College Students Using Smartphone Sensing and Self-Reports, Lakmal Buddika Meegahapola, Salvador Ruiz-Correa, Viridiana del Carmen Robledo-Valero, Emilio Ernesto Hernandez-Huerfano, Leonardo Alvarez-Rivera, Ronald Chenu-Abente and Daniel Gatica-Perez, in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 5(1), 2021

Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, Juan Zuluaga-Gomez, Karel Vesely, Blatt Alexander, Petr Motlicek, Dietrich Klakow, Allan Tart, Igor Szoke, Amrutha Prasad, Seyyed Saeed Sarfjoo, Pavel Kolcarek, Martin Kocour, Honza Cernocky, Claudia Cevenini, Khalid Choukri, Mickael Rigault and Fabian Landis, in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020

[DOI]
[URL]

Challenges for Using Impact Regularizers to Avoid Negative Side Effects, David Lindner, Kyle Matoba and Alexander Meulemans, in: SafeAI 2021 - AAAI's Workshop on Artificial Intelligence Safety, 2021

Exact Preimages of Neural Network Aircraft Collision Avoidance Systems, Kyle Matoba and Francois Fleuret, in: Machine Learning for Engineering Modeling, Simulation, and Design Workshop at Neural Information Processing Systems 2020, 2020

Predicting the Causal Effect Relationship Between COPD and Cardio Vascular Diseases, Debjani Panda, Satya Ranjan Dash, Ratula Ray and Shantipriya Parida, in: Informatica, 44(4), 2020

[DOI]
[URL]

Unsupervised Representation Learning for Gaze Estimation, Yu Yu and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020

Paraspeckle components NONO and PSPC1 are not mislocalized from motor neuron nuclei in sporadic ALS, Giulia E. Tyzack, Giulia Manferrari, Jia Newcombe, Nicholas M. Luscombe, Raphaelle Luisier and Rickie Patani, in: Brain, 2020

[URL]

Mammary epithelial morphogenesis in 3D combinatorial microenvironments, Raphaelle Luisier, Mehmet Girgin, Matthias P. Lutolf and Adrian Ranga, in: Scientific Reports, 10(1), 2020

[URL]

Author Profiling in Social Media with Multimodal Information., Miguel Á. Álvarez-Carmona, Esaú Villatoro-Tello, Manuel Montes-y-Gómez and Luis Villaseñor Pineda, in: In Journal of Computacion y Sistemas (CyS), 24(3), 2020

[URL]

SPARSE AUTOENCODERS TO ENHANCE SPEECH RECOGNITION, Selen Hande Kabil and Hervé Bourlard, Idiap-RR-10-2022

Real-Time Segmentation Networks should be Latency Aware, Evann Courdier and Francois Fleuret, in: Asian Conference on Computer Vision, 2020

Fairness in Biometrics: a figure of merit to assess biometric verification systems, Tiago de Freitas Pereira and Sébastien Marcel, in: arXiv, 2020

Subspace-based Learning for Automatic Dysarthric Speech Detection, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: IEEE Signal Processing Letters, 2020

COMPARISON OF SUBWORD SEGMENTATION METHODS FOR OPEN-VOCABULARYEND-TO-END SPEECH RECOGNITION, Abbas Khosravani, Claudiu Musat, Philip N. Garner and Alexandros Lazaridis, Idiap-RR-34-2020

Smartphone Sensing for the Well-being of Young Adults: A Review, Lakmal Buddika Meegahapola and Daniel Gatica-Perez, in: IEEE Access, 2021

[DOI]
[URL]

LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-40-2020

[URL]

Fast Transformers with Clustered Attention, Apoorv Vyas, Angelos Katharopoulos and Francois Fleuret, in: Proceedings of the International Conference on Neural Information Processing Systems, 2020

Partially-supervised Mention Detection, Lesly Miculicich and James Henderson, in: Proceedings of the Third Workshop on Computational Models of Reference, Anaphora and Coreference, 2020

The Unstoppable Rise of Computational Linguistics in Deep Learning, James Henderson, in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Online, pages 6294-6306, Association for Computational Linguistics, 2020

[DOI]
[URL]

Evaluation of 1-Year in-Home Monitoring Technology by Home-Dwelling Older Adults, Family Caregivers, and Nurses, Bruno Pais, Philipp Buluschek, Guillaume DuPasquier, Tobias Nef, Narayan Schütz, Hugo Saner, Daniel Gatica-Perez and Valérie Santschi, in: Frontiers in Public Health, 8:9, 2020

[DOI]
[URL]

BertAA: BERT fine-tuning for Authorship Attribution, Mael Fabien, Esaú Villatoro-Tello, Petr Motlicek and Shantipriya Parida, in: Proceedings of the 17th International Conference on Natural Language Processing, 2020

Free annotated data for deep learning in microscopy? A hitchhiker's guide, Adrian Shajkofci and Michael Liebling, in: Photoniques(104):30-33, 2020

[DOI]
[URL]

Aliasing mitigation in optical microscopy of dynamic biological samples by use of temporally modulated color illumination and a standard RGB camera, Christian Jaques and Michael Liebling, in: Journal of Biomedical Optics, 25(10):106505, 2020

[DOI]
[URL]

An Integrated and strategic evaluation of automatic blind controls to achieve energy and occupant's comfort objectives, Chantal Basurto and Jérôme Kämpf, in: Proceedings of the 5th IBPSA-England Conference on Building Simulation and Optimization (Virtual), Loughborough, UK, 2020

[URL]

Detection of Similar Languages and Dialects Using Deep Supervised Autoencoders, Shantipriya Parida, Esaú Villatoro-Tello, Sajit Kumar, Mael Fabien and Petr Motlicek, in: Proceedings of the 17th International Conference on Natural Language Processing, 2020

Overview of the 7th Workshop on Asian Translation, Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar and Sadao Kurohashi, in: Proceedings of the 7th Workshop on Asian Translation, Association for Computational Linguistics, 2020

[URL]

Generative adversarial training of product of policies for robust and adaptive movement primitives, Emmanuel Pignat, Hakan Girgin and Sylvain Calinon, in: In Proc. Conference on Robot Learning (CoRL), 2020

Learning, Generating and Adapting Wave Gestures for Expressive Human-Robot Interaction, M. Panteris, S. Manschitz and Sylvain Calinon, in: Proc. ACM/IEEE Intl Conf. on Human-Robot Interaction (HRI), pages 386-388, 2020

[DOI]
[URL]

Context is Everything: Using a Smartphone App to Capture Young People's Drinking Behaviours, Cognitions, Environments, and Consequences, Florian Labhart, La Trobe University, Melbourne, Australia, 2020

[DOI]

Do different drinks make you feel different emotions? Examination of young adolescents' beverage-specific alcohol expectancies using the Alcohol Expectancy Task, Megan Cook, Sandra Kuntsche, Florian Labhart and Emmanuel Kuntsche, in: Addictive Behaviors, 2020

[DOI]
[URL]

Fun/intoxication pre-drinking motives lead indirectly to more alcohol-related consequences via increased alcohol consumption on a given night, Koen Smit, Emmanuel Kuntsche, Dan Anderson-Luxford and Florian Labhart, in: Addictive Behaviors, 2020

[DOI]
[URL]

Alone or With Others? Understanding Eating Episodes of College Students with Mobile Sensing, Lakmal Buddika Meegahapola, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: 19th International Conference on Mobile and Ubiquitous Multimedia, ACM, Essen, Germany, pages 162–166, Association for Computing Machinery, 2020

[DOI]
[URL]

Protecting Mobile Food Diaries from Getting too Personal, Lakmal Buddika Meegahapola, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: 19th International Conference on Mobile and Ubiquitous Multimedia, Essen, Germany, pages 212–222, Association for Computing Machinery, 2020

[DOI]
[URL]

A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, Bastian Schnell and Philip N. Garner, in: MLSLP-18 Proceedings, Hyderabad, 2018

[URL]

On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, Anjith George and Sébastien Marcel, Idiap-RR-30-2020

Spectro-temporal sparsity characterization for dysarthric speech detection, Ina Kodrasi and Hervé Bourlard, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28:1210-1222, 2020

Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, in: IEEE/ACM Transactions on Audio Speech and Language Processing, 2020

Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, Idiap-RR-26-2020

An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, Marco Ewerton, Sylvain Calinon and Jean-Marc Odobez, Idiap-RR-03-2021

Assisted teleoperation in changing environments with a mixture of virtual guides, Marco Ewerton, Oleg Arenz and Jan Peters, in: Advanced Robotics, 34(18):1157-1170, 2020

[DOI]
[URL]

A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition, Nicholas Cummins, Yilin Pan, Zhao Ren, Julian Fritsch, Venkata Srikanth Nallanthighal, Heidi Christensen, Daniel Blackburn, Björn Schuller, Mathew Magimai-Doss, Helmer Strik and Aki Härmä, in: Proceedings of Interspeech, pages 2182-2186, 2020

Graph-to-Graph Transformer for Transition-based Dependency Parsing, Alireza Mohammadshahi and James Henderson, in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, ACL, Online, pages 3278–3289, Association for Computational Linguistics, 2020

[URL]

ODIANLP's Participation in WAT2020, Shantipriya Parida, Petr Motlicek, Amulya Ratna Dash, Satya Ranjan Dash, Debasish Kumar Mallick, Satya Prakash Biswal, Priyanka Pattnaik, Biranchi Narayan Nayak and Ondrej Bojar, in: Proceedings of the 7th Workshop on Asian Translation, ACL Anthology, 2020

On Joint Optimization of Automatic Speaker Verification and Anti-spoofing in the Embedding Space, Alejandro Gomez-Alanis, Jose A. Gonzalez-Lopez, S. Pavankumar Dubagunta, Antonio M. Peinado and Mathew Magimai-Doss, in: IEEE Transactions on Information Forensics and Security, 16:1579--1593, 2021

[DOI]

Vulnerability Analysis of Face Morphing Attacks from Landmarks and Generative Adversarial Networks, Eklavya Sarkar, Pavel Korshunov, Laurent Colbois and Sébastien Marcel, Idiap-RR-38-2020

Inferring Highly-dense Representations for Clustering Broadcast Media Content, Esaú Villatoro-Tello, Shantipriya Parida, Petr Motlicek and Ondrej Bojar, in: The Prague Bulletin of Mathematical Linguistics, 2020

[URL]

AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, Idiap-RR-32-2020

Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement, Alireza Mohammadshahi and James Henderson, in: Transactions of the Association for Computational Linguistics, 2020

[URL]

Graph-to-Graph Transformer for Transition-based Dependency Parsing, Alireza Mohammadshahi and James Henderson, in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2020

[URL]

Shooting shots: Estimating alcoholic drink sizes in real life using event-level reports and annotations of close-up pictures, Florian Labhart, Thanh-Trung Phan, Daniel Gatica-Perez and Emmanuel Kuntsche, in: Drug and Alcohol Review, 2020

[DOI]
[URL]

A t-distribution based operator for enhancing out of distribution robustness of neural network classifiers, Niccolò Antonello and Philip N. Garner, in: IEEE Signal Processing Letters, 27:1070-1074, 2020

[DOI]

Plug and Play Autoencoders for Conditional Text Generation, Florian Mai, Nikolaos Pappas, Ivan Montero, Noah A. Smith and James Henderson, in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Online, 2020

Automatic Discrimination of Apraxia of Speech and Dysarthria using a Minimalistic Set of Handcrafted Features, Ina Kodrasi, Michaela Pernon, Marina Laganaro and Hervé Bourlard, in: Interspeech, 2020

Adaptive Ensemble-based Optimisation for Petrophysical Inversion, Rémi Moyen and Théophile Gentilhomme, in: Mathematical Geosciences, 2020

[DOI]
[URL]

A Phonology-based Approach for Isolated Sign Production Assessment in Sign Language, Sandrine Tornay, Necati Cihan Camgoz, Richard Bowden and Mathew Magimai-Doss, in: Companion Publication of the 2020 International Conference on Multimodal Interaction (ICMI '20 Companion), 2020

Idiap and UAM Participation at MEX-A3T Evaluation Campaign, Esaú Villatoro-Tello, Gabriela Ramírez-de-la-Rosa, Shantipriya Parida, Sajit Kumar and Petr Motlicek, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020) co-located with 36th Conference of the Spanish Society for Natural Language Processing (SEPLN 2020), pages 6, CEUR Workshop Proceedings, 2020

[URL]

The High-Quality Wide Multi-Channel Attack (HQ-WMCA) database, Zohreh Mostaani, Anjith George, Guillaume Heusch, David Geissbuhler and Sébastien Marcel, Idiap-RR-22-2020

The Little W-Net That Could: State-of-the-Art Retinal Vessel Segmentation with Minimalistic Models, Adrian Galdran, André Anjos, José Dolz, Hadi Chakor, Hervé Lombaert and Ismail Ben Ayed, in: Cornell University Pre-print Server, 2020

[URL]

Taming GANs with Lookahead, Tatjana Chavdarova, Matteo Pagliardini, Martin Jaggi and Francois Fleuret, Idiap-RR-20-2020

[URL]

Deep Generative Models and Applications, Tatjana Chavdarova and Francois Fleuret, EPFL, 2020

[DOI]
[URL]

Active Illumination and Computational Methods for Temporal and Spectral Super-Resolution Microscopy, Christian Jaques, EPFL, 2020

[DOI]

Deepfake detection: humans vs. machines, Pavel Korshunov and Sébastien Marcel, Idiap-RR-36-2020

Iris Liveness Detection Competition (LivDet-Iris) – The 2020 Edition, Priyanka Das, Joseph McGrath, Zhaoyuan Fang, Aidan Boyd, Ganghee Jang, Amir Mohammadi, Sandip Purnapatra, David Yambay, Sébastien Marcel, Mateusz Trokielewicz, Piotr Maciejewicz, Kevin Bowyer, Adam Czajka and Stephanie Schuckers, in: INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2020), 2020

[URL]

Robot skills learning with Riemannian manifolds : Leveraging geometry-awareness in robot learning, optimization and control, N. Jaquier, Ecole Polytechnique Fédérale de Lausanne, 2020

Understanding Applicants' Reactions to Asynchronous Video Interviews through Self-Reports and Nonverbal Cues, Skanda Muralidhar, Emmanuelle Patricia Kleinlogel, Eric Mayor, Adrian Bangerter, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimodal Interaction (ICMI), Utrecht, 2020

Product of experts for robot learning from demonstration, Emmanuel Pignat, EPFL, 2020

Face Recognition systems: performance evaluation and bias analysis, Yannick Dayer, Idiap-Com-04-2020

Deep Learning of Charisma, Daniel Carron, Idiap-Com-03-2020

Planning and control of robot manipulation tasks, Jérémy Maceiras, Idiap-Com-01-2022

Machine Learning for Adverse Event Detection in Latent Tuberculosis Infection Treatment, Colombine Verzat, Idiap-Com-02-2020

Automatic Speech Recognition Engines Adapted for Embedded Platforms, Amrutha Prasad, Idiap-Com-01-2020

Detection of disguised speech in forensic science by humans and automatic systems, Michela Pettinato, Université de Lausanne Ecole des Sciences Criminelles, 2020

Automatic Speech Recognition Benchmark for Air-Traffic Communications, Juan Zuluaga-Gomez, Petr Motlicek, Qingran Zhan, Rudolf Braun and Karel Vesely, in: Proc. Interspeech 2020, pages 2297-2301, 2020

[DOI]

An HMM Approach with Inherent Model Selection for Sign Language and Gesture Recognition, Sandrine Tornay, Oya Aran and Mathew Magimai-Doss, in: Proceedings of the International Conference on Language Resources and Evaluation LREC 2020, 2020

Temporal resolution doubling in fluorescence light-sheet microscopy via a hue-encoded shutter and regularization, Christian Jaques, Alexander Ernst, Nadia Mercader and Michael Liebling, in: OSA Continuum, 3(8), 2020

Smartphone Multi-modal Biometric Authentication: Database and Evaluation, Ramachandra Raghavendra, Martin Stokkenes, Amir Mohammadi, Sushma Venkatesh, Kiran B. Raja, Pankaj Wasnik, Eric Poiret, Sébastien Marcel and Christoph Busch, Idiap-RR-17-2020

[URL]

Learning One Class Representations for Face Presentation Attack Detection using Multi-channel Convolutional Neural Networks, Anjith George and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 2020

The MuMMER data set for Robot Perception in multi-party HRI Scenarios, Olivier Canévet, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, 2020

Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention, Angelos Katharopoulos, Apoorv Vyas, Nikolaos Pappas and Francois Fleuret, in: Proceedings of International Conference on Machine Learning, 2020

A Bayesian Approach to Recurrence in Neural Networks, Philip N. Garner and Sibo Tong, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(8):2527--2537, 2021

[DOI]

Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks, Weipeng He, Lu Lu, Biqiao Zhang, Jay Mahadeokar, Kaustubh Kalgaonkar and Christian Fuegen, in: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, pages 7499-7503, 2020

[DOI]

WatchNet++: Efficient and accurate depth-based network for detecting people attacks and intrusion, Michael Villamizar, Angel Martínez-González, Olivier Canévet and Jean-Marc Odobez, in: Machine Vision and Applications, 2020

Plug and Play Autoencoders for Conditional Text Generation, Florian Mai, Nikolaos Pappas, Ivan Montero, Noah A. Smith and James Henderson, Idiap-RR-24-2020

Optimizer Benchmarking Needs to Account for Hyperparameter Tuning, Prabhu Teja Sivaprasad, Florian Mai, Thijs Vogels, Martin Jaggi and Francois Fleuret, in: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, 2020

[URL]

Gradient Alignment in Deep Neural Networks, Suraj Srinivas and Francois Fleuret, Idiap-RR-14-2020

Deep Models and Shortwave Infrared Information to Detect Face Presentation Attacks, Guillaume Heusch, Anjith George, David Geissbuhler, Zohreh Mostaani and Sébastien Marcel, in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2020

Active Improvement of Control Policies with Bayesian Gaussian Mixture Model, Hakan Girgin, E. Pignat, N. Jaquier and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020

Idiap & UAM participation at GermEval 2020: Classification and Regression of Cognitive and Motivational Style from Text, Esaú Villatoro-Tello, Shantipriya Parida, Sajit Kumar, Petr Motlicek and Qingran Zhan, in: Proceedings of the GermEval 2020 Shared Task on the Classification and Regression of Cognitive and Motivational style from Text, 2020

[URL]

Idiap Submission to Swiss-German Language Detection Shared Task, Shantipriya Parida, Esaú Villatoro-Tello, Sajit Kumar, Petr Motlicek and Qingran Zhan, in: Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), CEUR Workshop Proceedings, 2020

[URL]

Generating Master Faces for Use in PerformingWolf Attacks on Face Recognition Systems, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen and Sébastien Marcel, in: International Join Conference on Biometrics, 2020

Automatic pathological speech intelligibility assessment exploiting subspace-based analyses, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28:1717 - 1728, 2020

[DOI]

End-to-End Bias Mitigation by Modelling Biases in Corpora, Rabeeh Karimi Mahabadi, yonatan belinkov and James Henderson, in: ACL, 2020

Comparison of Subword Segmentation Methods for Open-vocabulary ASR using a Difficulty Metric, Abbas Khosravani, Claudiu Musat, Philip N. Garner and Alexandros Lazaridis

OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation, Shantipriya Parida, Satya Ranjan Dash, Ondrej Bojar, Petr Motlicek, Priyanka Pattnaik and Debasish Kumar Mallick, European Language Resources Association (ELRA), 2020

[URL]

On quantifying the quality of acoustic models in hybrid DNN-HMM ASR, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, in: Speech Communication, 119:24-35, 2020

[DOI]

Parametric study of URBAN morphology on building solar energy potential in Singapore context, Kin Ho Poon, Jérôme Kämpf, S. E. R. Tay, N. H. Wong and T. G. Reindl, in: Urban Climate, 33(100624), 2020

[DOI]
[URL]

CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR, Ketan Kotwal and Sébastien Marcel, in: IEEE International Conference on Image Processing, 2020

Neural Network based End-to-End Query by Example Spoken Term Detection, Dhananjay Ram, Lesly Miculicich and Hervé Bourlard, in: IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2020

pyannote.audio: neural building blocks for speaker diarization, Herve Bredin, Ruiqing Yin, Juan Manuel Coria, Pavel Korshunov, Marvin Lavechin, Diego Fustes, Hadrien Titeux, Wassim Bouaziz and Marie-Philippe Gill, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020

[URL]

Idiap Submission to Swiss-German Language Detection Shared Task, Shantipriya Parida, Esaú Villatoro-Tello, Sajit Kumar, Petr Motlicek and Qingran Zhan, Idiap-RR-11-2020

Spatially-Variant CNN-Based Point Spread Function Estimation for Blind Deconvolution and Depth Estimation in Optical Microscopy, Adrian Shajkofci and Michael Liebling, in: IEEE Transactions on Image Processing, 29:5848 - 5861, 2020

[DOI]

Understanding the performance gap: a machine learning approach on residential buildings in Turin, Italy, Roberto Boghetti, Fabio Fantozzi, Jérôme Kämpf and Giacomo Salvadori, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages, Enno Hermann, Herman Kamper and Sharon Goldwater, in: Computer Speech and Language, 65, 2021

[DOI]
[URL]

Competitive Neural Layer-based Method to Identify People with High Risk for Diabetic Foot, Ana Cláudia Barbosa Honório Ferreira, Danton Diego Ferreira, Henrique Ceretta Oliveira, Igor Carvalho de Resende, André Anjos and Maria Helena Baena de Moraes Lopes, in: Computers in Biology and Medicine, 120, 2020

[DOI]
[URL]

ManiGaze: a Dataset for Evaluating Remote Gaze Estimator in Object Manipulation Situations, Remy Siegfried, Bozorgmehr Aminian and Jean-Marc Odobez, in: Symposium on Eye Tracking Research and Applications, Stuttgart, Germany, pages 5, ACM, 2020

[DOI]

Epileptic seizure detection: a comparative study between deep and traditional machine learning techniques, Rekha Sahu, Satya Ranjan Dash, Lleuvelyn A Cacha, Roman R Poznanski and Shantipriya Parida, in: Journal of Integrative Neuroscience, 19(1):1-9, 2020

[URL]

Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement, Alireza Mohammadshahi and James Henderson, in: Transactions of the Association for Computational Linguistics(under submission), 2020

Tractable Approaches to Learning and Planning in High Dimensions, Leonidas Lefakis, EPFL, 2014

[DOI]

Theory and Algorithms for Hypothesis Transfer Learning, Ilja Kuzborskij, EPFL, 2018

[DOI]

Variational Inference with Mixture Model Approximation for Applications in Robotics, Emmanuel Pignat, Teguh Santoso Lembono and Sylvain Calinon, in: International Conference on Robotics and Automation, 2020

Gaussians on Riemannian Manifolds for Robot Learning and Adaptive Control, Sylvain Calinon, in: IEEE Robotics and Automation Magazine (RAM), 2020

Memory of Motion for Warm-starting Trajectory Optimization, Teguh Santoso Lembono, Antonio Paolillo, Emmanuel Pignat and Sylvain Calinon, in: IEEE Robotics and Automation Letters, 5(2):2594-2601, 2020

[DOI]

A memory of motion for visual predictive control tasks, Antonio Paolillo, Teguh Santoso Lembono and Sylvain Calinon, in: International Conference on Robotics and Automation, 2020

Learning How to Walk: Warm-starting Optimal Control Solver with Memory of Motion, Teguh Santoso Lembono, Carlos Mastalli, Pierre Fernbach, Nicolas Mansard and Sylvain Calinon, in: International Conference on Robotics and Automation, 2020

Sparse and Low-rank Modeling for Automatic Speech Recognition, Pranay Dighe, EPFL, 2019

[DOI]

Trustworthy speaker recognition with minimal prior knowledge using neural networks, Hannah Muckenhirn, Ecole polytechnique fédérale de Lausanne (EPFL), 2019

[DOI]
[URL]

Towards Multilingual Sign Language Recognition, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Detection of S1 and S2 locations in phonocardiogram signals using zero frequency filter, RaviShankar Prasad, Gürkan Yilmaz, Olivier Chetelat and Mathew Magimai-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

SYNTHETIC SPEECH REFERENCES FOR AUTOMATIC PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020

Dysarthric Speech Recognition with Lattice-Free MMI, Enno Hermann and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020

[DOI]
[URL]

Youth nightlife at home: towards a feminist conceptualisation of home, Katharina Pelzelmayer, Sara Landolt, Jasmine Truong, Florian Labhart, Darshan Santani, Emmanuel Kuntsche and Daniel Gatica-Perez, in: Children's Geographies, 2020

[DOI]
[URL]

Learning Trajectory Distributions for Assisted Teleoperation and Path Planning, Marco Ewerton, Oleg Arenz, Guilherme Maeda, Dorothea Koert, Zlatko Kolev, Masaki Takahashi and Jan Peters, in: Frontiers in Robotics and AI, 6:89, 2019

[DOI]
[URL]

Plucking Motions for Tea Harvesting Robots Using Probabilistic Movement Primitives, Kurena Motokura, Masaki Takahashi, Marco Ewerton and Jan Peters, in: IEEE International Conference on Robotics and Automation, 2020

Low-latency speaker spotting with online diarization and detection, Jose Patino, Ruiqing Yin, Hector Delgado, Herve Bredin, Alain Komaty, Guillaume Wisniewski, Claude Barras, Nicholas Evans and Sébastien Marcel, in: The Speaker and Language Recognition Workshop (Odyssey), 2018

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019

[URL]

INCREMENTAL TRANSFER LEARNING IN TWO-PASS INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, Nauman Dawalatabad, Srikanth Madikeri, Hema A Murthy and C Chandra Sekhar, in: Proceedings of ICASSP 2019, pages 6291-6295, 2019

Implicit discourse relation classification with syntax-aware contextualized word representations, D. N. Popa, J. Perez, James Henderson and E. Gaussier, in: Proceedings of the 32nd International Florida Artificial Intelligence Research Society Conference, 2019

SATokE: How can Syntax-Aware Contextualized Word Representations Benefit Implicit Discourse Relation Classification?, D. N. Popa, J. Perez, James Henderson and E. Gaussier, in: Ptroc. 2019 Conference sur l'Apprentissage automatique, 2019

Weakly-Supervised Concept-based Adversarial Learning for Cross-lingual Word Embeddings, Haozhou Wang, James Henderson and Paola Merlo, in: Proc. 2019 Conference on Empirical Methods in Natural Language Processing, 2019

Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares, Marvin Tammen, Ina Kodrasi and Simon Doclo, in: IEEE International Conference on Acoustics, Speech and Signal Processing, pages 795--799, 2019

Learning Entailment-Based Sentence Embeddings from Natural Language Inference, Rabeeh Karimi Mahabadi, Florian Mai and James Henderson, Idiap-RR-20-2019

[URL]

Learning an event sequence embedding for event-based deep stereo, Stepan Tulyakov, Francois Fleuret, Martin Kiefel, Peter Gehler and Michael Hirsch, in: Proceedings of the IEEE International Conference on Computer Vision, 2019

Reducing Noise in GAN Training with Variance Reduced Extragradient, Tatjana Chavdarova, Gauthier Gidel, Francois Fleuret and Simon Lacoste-Julien, in: Proceedings of the international conference on Neural Information Processing Systems, 2019

Uncertainty-aware imitation learning using kernelized movement primitives, J. Silverio, Y. Huang, F. J. Abu-Dakka, L. Rozo and D. G. Caldwell, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

A Non-Euclidean Gradient Descent Framework for Non-Convex Matrix Factorization, Ya-Ping Hsieh, Yu-Chun Kao, Rabeeh Karimi Mahabadi, Alp Yurtsever, Anastasios Kyrillidis and Volkan Cevher, in: IEEE Transactions on Signal Processing, 2018

Real-Time DCT Learning-based Reconstruction of Neural Signals, Rabeeh Karimi Mahabadi, Cosimo Aprile and Volkan Cevher, in: EUSIPCO, 2018

Learning-Based Compressive MRI, Baran Gözcü, Rabeeh Karimi Mahabadi, Yen-Huan Li, Efe Ilıcak, Tolga Çukur, Jonathan Scarlett and Volkan Cevher, in: IEEE Transactions on Medical Imaging, 2018

On the Tunability of Optimizers in Deep Learning, Prabhu Teja Sivaprasad, Florian Mai, Thijs Vogels, Martin Jaggi and Francois Fleuret, Idiap-RR-19-2019

[URL]

Extractive Odia Text Summarization System: An OCR based Approach, Shantipriya Parida, Idiap-RR-02-2020

Reinforcement learning of trajectory distributions: Applications in assisted teleoperation and motion planning, Marco Ewerton, Guilherme Maeda, Dorothea Koert, Zlatko Kolev, Masaki Takahashi and Jan Peters, in: IEEE International Conference on Intelligent Robots and Systems, 2019

SCALAR: Simultaneous Calibration of 2-D Laser and Robot Kinematic Parameters Using Planarity and Distance Constraints, Teguh Santoso Lembono, Francisco Suarez-Ruiz and Quang-Cuong Pham, in: IEEE Transactions on Automation Science and Engineering, 16(4):1971-1979, 2019

[DOI]

Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task, Alireza Mohammadshahi, Karl Aberer and Rémi Lebret, in: Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), Hong Kong, pages 27-33, Association for Computational Linguistics, 2019

[DOI]
[URL]

Adaptive Design of Experiments for Conservative Estimation of Excursion Sets, Dario Azzimonti, David Ginsbourger, Clément Chevalier, Julien Bect and Yann Richet, in: Technometrics, 2019

[DOI]
[URL]

Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, Qingran Zhan, Petr Motlicek, Shixuan Du, Yahui Shan, Xiang Xie and Sifan Ma, in: Proceedings of APSIPA ASC 2019, 2019

A Differential Approach for Gaze Estimation, Gang Liu, Yu Yu and Jean-Marc Odobez, in: IEEE Transaction on Pattern Analysis and Machine Intelligence, 43(3):1092--1098, 2021

[DOI]
[URL]

Self-attention for Speech Emotion Recognition, Lorenzo Tarantino, Philip N. Garner and Alexandros Lazaridis, in: Proc. Interspeech 2019, 2019

[DOI]

Broadcast Media Content Categorization Using Low-Resolution Concepts, Esaú Villatoro-Tello, Shantipriya Parida, Petr Motlicek, Subhadeep Dey and Qingran Zhan, Idiap-RR-06-2021

Building energy models with Morphological urban-scale parameters: a case study in Turin, Roberto Boghetti, Fabio Fantozzi, Jérôme Kämpf, Guglielmina Mutani, Giacomo Salvadori and Valeria Todeschi, in: Proceedings of 4th Building Simulation Applications Conference - BSA 2019, 2019

[URL]

The Simulation of Mean Radiant Temperature in Outdoor Conditions: A Review of Architectural Tools Calculation Assumptions, Emanuele Naboni, Marco Meloni, Chris Makey and Jérôme Kämpf, in: Proceedings of Building Simulation 2019: 16th Conference of IBPSA, 2019

OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation, Shantipriya Parida, Satya Ranjan Dash, Ondrej Bojar, Petr Motlicek, Priyanka Pattnaik and Debasish Kumar Mallick, Idiap-RR-08-2020

Mixture Models for the Analysis, Edition, and Synthesis of Continuous Time Series, Sylvain Calinon, in: Mixture Models and Applications, pages 39-57, Springer, 2019

[DOI]

Interactive Generation of Calligraphic Trajectories from Gaussian Mixtures, D. Berio, F. F. Leymarie and Sylvain Calinon, in: Mixture Models and Applications, pages 23-38, Springer, 2019

[DOI]

A Survey on Policy Search Algorithms for Learning Robot Controllers in a Handful of Trials, K. Chatzilygeroudis, A. Vassiliades, F. Stulp, Sylvain Calinon and J. -B. Mouret, in: IEEE Trans. on Robotics, 32(2):328-347, 2020

[DOI]
[URL]

Improving dual-arm assembly by master-slave compliance, M. Suomalainen, Sylvain Calinon, E. Pignat and V. Kyrki, in: Proc. IEEE Intl Conf. on Robotics and Automation, pages 8676-8682, 2019

Bayesian Gaussian mixture model for robotic policy imitation, Emmanuel Pignat and Sylvain Calinon, in: IEEE Robotics and Automation Letters, 4(4):4452 - 4458, 2019

[DOI]
[URL]

Daylighting simulation for external Venetian blinds based on HDR sky luminance monitoring with matrix algebraic approach, Yujie Wu, Jérôme Kämpf and J. -L. Scartezzini, in: Energy Procedia, 158:2677-2682, 2019

[DOI]

Performance assessment of the BTDF data compression based on wavelet transforms in daylighting simulation, Yujie Wu, Jérôme Kämpf and J. -L. Scartezzini, in: Solar Energy, 2019

[DOI]

CityLearn v1.0: An OpenAI Gym Environment for Demand Response with Deep Reinforcement Learning, José Vázquez-Canteli, Jérôme Kämpf, Gregor Henze and Zoltán Nagy, in: Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, New-York, USA, pages 356-357, ACM, 2019

[DOI]

TOWARDS MULTILINGUAL SIGN LANGUAGE RECOGNITION, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss, Idiap-RR-16-2019

Retrofitting, district heating and energy storage: neighborhood energy planning, Diane von Gunten, Jakob Rager, Jérôme Kämpf, Fabien Kuchler and Fabien Poumadère, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

A smart luminaire in an office environment: impact on light distribution, user interactions and comfort, Julien Nembrini, Jérôme Kämpf, Michael Papinutto and Denis Lalanne, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

Multi-agent reinforcement learning for adaptive demand response in smart cities, José Vázquez-Canteli, Thomas Detjeen, Gregor Henze, Jérôme Kämpf and Zoltán Nagy, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

Daylight regulated by automated external Venetian blinds based on HDR sky luminance mapping in winter, Yujie Wu, Jérôme Kämpf and J. -L. Scartezzini, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

A morphological based PV generation and energy consumption predictive model for Singapore neighbourhood, Kin Ho Poon and Jérôme Kämpf, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

CO2 experimental measurements towards the development of a predictive framework using user actions in smart buildings, Rui Oliveira, Jérôme Kämpf, Romeu Vicente, Ricardo Almeida and António Figueiredo, in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019

[DOI]

Multi-scale sequential network for semantic text segmentation and localization, Michael Villamizar, Olivier Canévet and Jean-Marc Odobez, in: Pattern Recognition Letters, 129:63-69, 2020

[DOI]
[URL]

Can Your Face Detector Do Anti-spoofing? Face Presentation Attack Detection with a Multi-Channel Face Detector, Anjith George and Sébastien Marcel, Idiap-RR-12-2020

Language Independent Query by Example Spoken Term Detection, Dhananjay Ram, École Polytechnique Fédérale de Lausanne, 2019

Idiap submission to the NIST SRE 2019 Speaker Recognition Evaluation, Seyyed Saeed Sarfjoo, Srikanth Madikeri, Mahdi Hajibabaei, Petr Motlicek and Sébastien Marcel, Idiap-RR-15-2019

Overview of the 6th Workshop on Asian Translation, Shantipriya Parida, in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 1–35, Association for Computational Linguistics, 2019

[DOI]
[URL]

Idiap NMT System for WAT 2019 Multimodal Translation Task, Shantipriya Parida and Petr Motlicek, in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 175–180, Association for Computational Linguistics, 2019

[DOI]
[URL]

Improving the conditioning of the optimization criterion in acoustic multi-channel equalization using shorter reshaping filters, Ina Kodrasi and Simon Doclo, in: EURASIP Journal on Advances in Signal Processing(11), 2018

Selecting, Planning, and Rewriting: A Modular Approach for Data-to-Document Generation and Translation, Lesly Miculicich, Marc Marone and Hany Hassan, in: WNGT EMNLP, 2019

CHALLENGES IN BROADCAST MEDIA CONTENT CATEGORIZATION, Shantipriya Parida, Esaú Villatoro-Tello and Petr Motlicek, Idiap-RR-02-2021

Vulnerability of Face Recognition to Deep Morphing, Pavel Korshunov and Sébastien Marcel, in: International Conference on Biometrics for Borders, 2019

Idiap Abstract Text Summarization System for German Text Summarization Task, Shantipriya Parida and Petr Motlicek, in: Proceedings of the 4th edition of the Swiss Text Analytics Conference, 2019

[URL]

Multilingual Bottleneck Features for Query by Example Spoken Term Detection, Dhananjay Ram, Lesly Miculicich and Hervé Bourlard, in: IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Understanding Raw Waveform based CNN through Low-rank Spectro-Temporal Decoupling, Vinayak Abrol, S. Pavankumar Dubagunta and Mathew Magimai-Doss, Idiap-RR-11-2019

Subunits Inference and Lexicon Development Based on Pairwise Comparison of Utterances and Signs, Sandrine Tornay and Mathew Magimai-Doss, in: Information, 10:298, 2019

[DOI]
[URL]

Super-gaussianity of Speech Spectral Coefficients as a Potential Biomarker for Dysarthric Speech Detection, Ina Kodrasi and Hervé Bourlard, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

Joint acoustic localization and dereverberation through plane wave decomposition and sparse regularization, Niccolò Antonello, Enzo De Sena, Marc Moonen, A. Patrick Naylor and Toon van Waterschoot, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(12):1893-1905, 2019

[DOI]
[URL]

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, Idiap-RR-01-2020

Full-Gradient Representation for Neural Network Visualization, Suraj Srinivas and Francois Fleuret, in: Advances in Neural Information Processing Systems, 2019

[URL]

Idiap NMT System for WAT 2019 Multimodal Translation Task, Shantipriya Parida and Petr Motlicek, Idiap-RR-04-2020

Multispectral Deep Embeddings As a Countermeasure To Custom Silicone Mask Presentation Attacks, Ketan Kotwal, Sushil Bhattacharjee and Sébastien Marcel, in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2019

Abstract Text Summarization: A Low Resource Challenge, Shantipriya Parida and Petr Motlicek, in: In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), HongKong, China, pages 5, Association for Computational Linguistics (ACL), 2019

Learning One Class Representations for Presentation Attack Detection using Multi-channel Convolutional Neural Networks, Anjith George and Sébastien Marcel, Idiap-RR-15-2020

A Comprehensive Experimental and Reproducible Study on Selfie Biometrics in Multistream and Heterogeneous Settings, Guillaume Heusch, Tiago de Freitas Pereira and Sébastien Marcel, in: IEEE Transactions on Biometrics, Behavior and Identity Science, 2019

[DOI]
[URL]

Discovering Eating Routines in Context with a Smartphone App, Daniel Gatica-Perez, Joan-Isaac Biel, David Labbe and Nathalie Martin, in: Ubicomp/Iswc'19 Adjunct: Proceedings Of The 2019 Acm International Joint Conference On Pervasive And Ubiquitous Computing And Proceedings Of The 2019 Acm International Symposium On Wearable Computers, London, pages 422-429, 2019

[DOI]

Validity of pervasive computing based continuous physical activity assessment in community-dwelling old and oldest-old, Narayan Schütz, Hugo Saner, Beatrice Rudin, Angela Botros, Bruno Pais, Valérie Santschi, Philipp Buluschek, Daniel Gatica-Perez, Prabitha Urwyler, Laura Marchal-Crespo, René M. Müri and Tobias Nef, in: Scientific Reports, 9(9662), 2019

German News Article Classification : A Multichannel CNN Approach, Shantipriya Parida, Petr Motlicek and Satya Ranjan Dash, Idiap-RR-09-2020

Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, Qingran Zhan, Shixuan Du, Petr Motlicek, Yahui Shan and Xiang Xie, Idiap-RR-05-2021

[URL]

Temporal Super-Resolution Microscopy Using a Hue-Encoded Shutter, Christian Jaques, Emmanuel Pignat, Sylvain Calinon and Michael Liebling, in: Optical Society of America Biomedical Optics Express, 10(09):4727-4741, 2019

[DOI]
[URL]

Generalized temporal sampling with active illumination in optical microscopy, Christian Jaques and Michael Liebling, in: Proceeding of the SPIE Conference Optics and Photonics, Wavelets and Sparsity XVIII, SPIE, San Diego, California, United States, SPIE, 2019

Processing Megapixel Images with Deep Attention-Sampling Models, Angelos Katharopoulos and Francois Fleuret, in: Proceedings of International Conference on Machine Learning, 2019

[URL]

The Speed Submission to DIHARD II: Contributions & Lessons Learned, Md Sahidullah, Jose Patino, Samuele Cornell, Ruiqing Yin, Sunit Sivasankaran, Herve Bredin, Pavel Korshunov, Alessio Brutti, Romain Serizel, Emmanuel Vincent, Nicholas Evans, Sébastien Marcel, Stefano Squartini and Claude Barras, Idiap-RR-14-2019

[UNK]: https://arxiv.org/abs/1911.02388

Tampered Speaker Inconsistency Detection with Phonetically Aware Audio-visual Features, Pavel Korshunov, Michael Halstead, Diego Castan, Martin Graciarena, Mitchell McLaren, Brian Burns, Aaron Lawson and Sébastien Marcel, in: International Conference on Machine Learning, 2019

Vulnerability assessment and detection of Deepfake videos, Pavel Korshunov and Sébastien Marcel, in: IAPR International Conference on Biometrics, 2019

Understanding and Visualizing Raw Waveform-based CNNs, Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai-Doss and Sébastien Marcel, in: Proceedings of Interspeech, 2019

CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, Florian Mai, Lukas Galke and Ansgar Scherp, in: International Conference on Learning Representations, New Orleans, Louisiana, USA, 2019

[URL]

Automated Daylighting Control System based on Sky Luminance Monitoring and Lighting Computing, Yujie Wu, Jérôme Kämpf and J. -L. Scartezzini, EPFL, 2019

[DOI]

Spoken language identification using language bottleneck features, Malo Grisard, Petr Motlicek, Wissem Allouchi, Michael Baeriswyl, Alexandros Lazaridis and Qingran Zhan, in: Proceedings of TSD, 2019

The contexts of heavy drinking: A systematic review of the combinations of context-related factors associated with heavy drinking occasions, Oliver Stanesby, Florian Labhart, Paul Dietze, Cassandra Wright and Emmanuel Kuntsche, in: PLOS ONE, 14(7):29, 2019

[DOI]
[URL]

An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, François Marelli, Bastian Schnell, Hervé Bourlard, T. Dutoit and Philip N. Garner, in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pages 7040-7044, IEEE, 2019

[DOI]
[URL]

Spectral Subspace Analysis for Automatic Assessment of Pathological Speech Intelligibility, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: Proceedings of Interspeech, Graz, Austria, pages 3038--3042, 2019

End-to-end Accented Speech Recognition, Thibault Viglino, Petr Motlicek and Milos Cernak, Idiap-RR-04-2022

Split-pane electrochromic window control based on an embedded photometric device with real-time daylighting computing, Yujie Wu, Taoning Wang, Eleanor S. Lee, Jérôme Kämpf and J. -L. Scartezzini, in: Building and Environment, 2019

[DOI]

Using Speech Production Knowledge for Raw Waveform Modelling based Styrian Dialect Identification, S. Pavankumar Dubagunta and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2019

Idiap Abstract Text Summarization System for German Text Summarization Task, Shantipriya Parida and Petr Motlicek, Idiap-RR-03-2020

BookTubing Across Regions: Examining Differences based on Nonverbal and Verbal Cues, Chinchu Thomas, Dinesh Jayagopi and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Interactive Experiences for Television and Online Video (TVX), Salford, ENGLAND, 2019

[DOI]

Automated Eye-sight Venetian blinds based on an embedded photometric device with real-time daylighting computing, Yujie Wu, Jérôme Kämpf and J. -L. Scartezzini, in: Applied Energy, 252, 2019

[DOI]
[URL]

Deep Residual Output Layers for Neural Language Generation, Nikolaos Pappas and James Henderson, in: Proceedings of the 36th International Conference on Machine Learning (ICML), 2019

The Role of Sex and Age on Pre-drinking: An Exploratory International Comparison of 27 Countries, Jason Ferris, Cheneal Puljević, Florian Labhart, Adam Winstock and Emmanuel Kuntsche, in: Alcohol and Alcoholism, 54(4):378–385, 2019

[DOI]

Domain Adaptation in Multi-Channel Autoencoder based Features for Robust Face Anti-Spoofing, Olegs Nikisins, Anjith George and Sébastien Marcel, in: International Conference on Biometrics 2019, IEEE, 2019

Biometric Face Presentation Attack Detection with Multi-Channel Convolutional Neural Network, Anjith George, Zohreh Mostaani, David Geissbuhler, Olegs Nikisins, André Anjos and Sébastien Marcel, in: IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019

Processing Megapixel Images with Deep Attention-Sampling Models, Angelos Katharopoulos and Francois Fleuret, Idiap-RR-07-2019

[URL]

A solar-based sustainable urban design: The effects of city-scale street-canyon geometry on solar access in Geneva, Switzerland, N. Mohajeri, A. Gudmundsson, G. Knuckler, D. Assouline, Jérôme Kämpf and J. -L. Scartezzini, in: Applied Energy, 240:173-190, 2019

[DOI]

SOCIAL SENSING METHODS FOR ANALYSIS OF DYADIC HOSPITALITY ENCOUNTERS, Skanda Muralidhar, EPFL, 2019

Deep Pixel-wise Binary Supervision for Face Presentation Attack Detection, Anjith George and Sébastien Marcel, in: International Conference on Biometrics, 2019

The emotional entanglements of smartphones in the field: On emotional discomfort, power relations, and research ethics, Jasmine Truong, Florian Labhart, Darshan Santani, Daniel Gatica-Perez, Emmanuel Kuntsche and Sara Landolt, in: Area, 52(1), 2020

[DOI]
[URL]

Multimodal Person Recognition in Audio-Visual Streams, Nam Le, EPFL, 2019

[DOI]

Custom Silicone Face Masks - Vulnerability of Commercial Face Recognition Systems & Presentation Attack Detection, Ramachandra Raghavendra, Sushma Venkatesh, Kiran B. Raja, Sushil Bhattacharjee, Pankaj Wasnik, Sébastien Marcel and Christoph Busch, in: Proceedings of 7th IAPR/IEEE International Workshop on Biometrics and Forensics, 2019

A Deep Learning Approach for Robust Head Pose Independent Eye Movements Recognition from Videos, Remy Siegfried, Yu Yu and Jean-Marc Odobez, in: 2019 ACM Symposium on Eye Tracking Research and Applications, pages 5, ACM, 2019

[DOI]

Conditions for the finiteness of the moments of the volume of level sets, Diego Armentano, Jean-Marc Azaïs, David Ginsbourger and Jose R. León, in: Electronic Communications in Probability, 24(17), 2019

[DOI]
[URL]

Contaminant source localization via Bayesian global optimization, Guillaume Pirot, Tipaluck Krityakierne, David Ginsbourger and Philippe Renard, in: Hydrology and Earth System Sciences, 23:351-369, 2019

[DOI]
[URL]

On the choice of the low-dimensional domain for global optimization via random embeddings, Mickaël Binois, David Ginsbourger and Olivier Roustant, in: Journal of Global Optimization, 2019

[DOI]
[URL]

PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT BASED ON THE SHORT-TIME OBJECTIVE INTELLIGIBILITY MEASURE, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, UK, pages 6405--6409, 2019

Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019

[DOI]

SPOKEN LANGUAGE IDENTIFICATION USING LANGUAGE BOTTLENECK FEATURES, Grisard Malo, Petr Motlicek, Wissem Allouchi, Michael Baeriswyl, Alexandros Lazaridis and Qingran Zhan, Idiap-RR-08-2019

EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, Alexandre Nanchen and Philip N. Garner, in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019

A Learning-Based Framework for Quantized Compressed Sensing, Rabeeh Karimi Mahabadi, Junhong lin and Volkan Cevher, in: A Learning-Based Framework for Quantized Compressed Sensing, 2019

End-to-End Acoustic Modeling using Convolutional Neural Networks for HMM-based Automatic Speech Recognition, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, in: Speech Communication, 108:15--32, 2019

[DOI]

Improving Children Speech Recognition through Feature Learning from Raw Speech Signal, S. Pavankumar Dubagunta, Selen Hande Kabil and Mathew Magimai-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Learning voice source related information for depression detection, S. Pavankumar Dubagunta, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Segment-level training of ANNs based on acoustic confidence measures for hybrid HMM/ANN Speech Recognition, S. Pavankumar Dubagunta and Mathew Magimai-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Automatic Diagnosis of Alzheimer's Disease Using Neural Network Language Models, Julian Fritsch, Sebastian Wankerl and Elmar Nöth, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

HMM-based Approaches to Model Multichannel Information in Sign Language inspired from Articulatory Features-based Speech Processing, Sandrine Tornay, Marzieh Razavi, Necati Cihan Camgoz, Richard Bowden and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

GILE: A Generalized Input-Label Embedding for Text Classification, Nikolaos Pappas and James Henderson, in: Transactions of the Association for Computational Linguistics (TACL), 2019

Capturing drinking and nightlife behaviours and their social and physical context with a smartphone application - investigation of users' experience and reactivity, Florian Labhart, Flavio Tarsetti, Olivier Bornet, Darshan Santani, Jasmine Truong, Sara Landolt, Daniel Gatica-Perez and Emmanuel Kuntsche, in: Addiction Research and Theory, 28(1):62-75, 2020

[DOI]
[URL]

Adaptation of Assistant Based Speech Recognition to New Domains and Its Acceptance by Air Traffic Controllers, Matthias Kleinert, Hartmut Helmke, Gerald Siol, heiko Ehr, Dietrich Klakow, Mittul Singh, Petr Motlicek, Kern Christian, Cerna Aneta and Hlousek Petr, in: Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems, San Diego, California, USA, pages 820 - 826, 2019

[DOI]

Building Blocks of Assistant Based Speech Recognition for Air Traffic Management Applications, Matthias Kleinert, Hartmut Helmke, heiko Ehr, Kern Christian, Dietrich Klakow, Petr Motlicek, Mittul Singh and Gerald Siol, in: Conference: SESAR Innovation Days 2018, European Union, Eurocontrol, Salzburg, Austria, SESARJU, 2018

[URL]

Iterative Learning of Speech Recognition Models for Air Traffic Control, Ajay Srinivasamurthy, Petr Motlicek, Mittul Singh, Youssef Oualil, Matthias Kleinert, heiko Ehr and Hartmut Helmke, in: Proceedings of Interspeech 2018, ISCA, Hyderabad, India, pages 3519-3523, 2018

[DOI]

CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, Florian Mai, Lukas Galke and Ansgar Scherp, Idiap-RR-06-2019

[URL]

Multi-Spectral Widefield Microscopy of the Beating Heart through Post-Acquisition Synchronization and Unmixing, Christian Jaques, Linda Bapst-Wicht, Daniel F. Schorderet and Michael Liebling, in: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), Venice, Italy, pages 1382-1385, 2019

[DOI]

Leveraging Convolutional Pose Machines for Fast and Accurate Head Pose Estimation, Yuanzhouhan Cao, Olivier Canévet and Jean-Marc Odobez, in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, SPAIN, pages 1089-1094, IEEE, 2018

Improving speech embedding using crossmodal transfer learning with audio-visual data, Nam Le and Jean-Marc Odobez, in: Multimedia Tools and Applications, 78(11):15681-15704, 2019

[DOI]

Voice Presentation Attack Detection Using Convolutional Neural Networks, Ivan Himawan, Srikanth Madikeri, Petr Motlicek, Milos Cernak, Sridha Sridharan and Clinton Fookes, in: Handbook of Biometric Anti-Spoofing, pages 391--415, Springer, 2019

[URL]

Multilingual bottleneck features for subword modeling in zero-resource languages, Enno Hermann and Sharon Goldwater, in: Proc. Interspeech, pages 2668-2672, 2018

[DOI]

Language model domain adaptation for automatic speech recognition, Amrutha Prasad, Petr Motlicek and Alexandre Nanchen, Idiap-RR-05-2020

Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching, Stepan Tulyakov, Anton Ivanov and Francois Fleuret, in: Proceedings of the international conference on Neural Information Processing Systems, 2018

Geodesic Convolutional Shape Optimization, Pierre Baqué, Edoardo Remelli, Francois Fleuret and Pascal Fua, in: Proceedings of the International Conference on Machine Learning, 2018

Kronecker Recurrent Units, Cijo Jose, Moustapha Cisse and Francois Fleuret, in: Proceedings of the International Conference on Machine Learning, 2018

Enhancing Trust in eAssessment - the TeSLA System Solution, Malinka Ivanova, Sushil Bhattacharjee, Sébastien Marcel, Anna Rozeva and Mariana Durcheva, in: Technology Enhanced Assessment Conference., 2018

A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, Bastian Schnell and Philip N. Garner, in: Proc. Interspeech 2018, pages 3147-3151, 2018

[DOI]

SCALAR - Simultaneous Calibration of 2D Laser And Robot's Kinematic Parameters Using Three Planar Constraints, Teguh Santoso Lembono, Francisco Suarez-Ruiz and Quang-Cuong Pham, in: International Conference on Intelligent Robots, 2018

Recent Advances in Face Presentation Attack Detection, Sushil Bhattacharjee, Amir Mohammadi, André Anjos and Sébastien Marcel, in: Handbook of Biometric Anti-Spoofing, Springer, 2019

[URL]

Data-Driven Movement Subunit Extraction from Skeleton Information for Modeling Signs and Gestures, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss, Idiap-RR-02-2019

An Introduction to Vein Presentation Attacks and Detection, André Anjos, Pedro Tome and Sébastien Marcel, in: Handbook of Biometric Anti-Spoofing, Springer International Publishing, 2019

[DOI]
[URL]

DeepFakes: a New Threat to Face Recognition? Assessment and Detection, Pavel Korshunov and Sébastien Marcel, Idiap-RR-18-2018

A Cross-database Study of Voice Presentation Attack Detection, Pavel Korshunov and Sébastien Marcel, in: Handbook of Biometric Anti-Spoofing: Presentation Attack Detection, 2nd Edition, Springer, 2018

Profile extrema for visualizing and quantifying uncertainties on excursion regions. Application to coastal flooding, Dario Azzimonti, David Ginsbourger, Jérémy Rohmer and Déborah Idier, in: Technometrics, 61(4):474-493, 2019

[DOI]
[URL]

A supermartingale approach to Gaussian process based sequential design of experiments, Julien Bect, François Bachoc and David Ginsbourger, in: Bernoulli, 25(4A):2883-2919, 2019

Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, Weipeng He, Petr Motlicek and Jean-Marc Odobez, Idiap-Com-01-2019

Dexterous Underwater Manipulation from Distant Onshore Locations, A. Birk, T. Fromm, C. A. Mueller, T. Luczynski, A. Gomez Chavez, D. Koehntopp, A. Kupcsik, Sylvain Calinon, A. K. Tanwani, G. Antonelli, P. Di Lillo, E. Simetti, G. Casalino, G. Indiveri, L. Ostuni, A. Turetta, A. Caffaz, P. Weiss, T. Gobert, B. Chemisky, J. Gancet, T. Siedel, S. Govindaraj, X. Martinez and P. Letier, in: IEEE Robotics and Automation Magazine, 2018

Learning from Demonstration (Programming by Demonstration), Sylvain Calinon, in: Encyclopedia of Robotics, Springer, 2019

[DOI]
[URL]

Small Variance Asymptotics for Non-Parametric Online Robot Learning, A. K. Tanwani and Sylvain Calinon, in: International Journal of Robotics Research (IJRR), 38(1):3-22, 2019

Learning Task Priorities from Demonstrations, J. Silverio, Sylvain Calinon, L. Rozo and D. G. Caldwell, in: IEEE Transactions on Robotics, 35(1):78-94, 2019

[DOI]
[URL]

Learning from demonstration for semi-autonomous teleoperation, I. Havoutis and Sylvain Calinon, in: Autonomous Robots, 43(3):713-726, 2019

[DOI]
[URL]

Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models, A. K. Tanwani, J. Lee, B. Thananjeyan, M. Laskey, S. Krishnan, R. Fox, K. Goldberg and Sylvain Calinon, in: 13th Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018

Bimanual Skill Learning with Pose and Joint Space Constraints, J. Silverio, Sylvain Calinon, L. Rozo and D. G. Caldwell, in: Proc. of the IEEE-RAS Intl Conf. on Humanoid Robots (Humanoids), 2018

Probabilistic Learning of Torque Controllers from Kinematic and Force Constraints, J. Silverio, Y. Huang, L. Rozo, Sylvain Calinon and D. G. Caldwell, in: Proc. of the IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 6552-6559, 2018

Heterogeneous Face Recognition Using Domain Specific Units, Tiago de Freitas Pereira, André Anjos and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security:13, 2019

[DOI]

INVESTIGATING TIME DELAY NEURAL NETWORK (TDNN) FOR LANGUAGE MODELING IN LOW RESOURCE AUTOMATIC SPEECH RECOGNITION, Banriskhem Khonglah, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, Idiap-RR-13-2019

STACKED NEURAL NETWORKS WITH PARAMETER SHARING FOR MULTILINGUAL LANGUAGE MODELING, Banriskhem Khonglah, Srikanth Madikeri, Navid Rekabsaz, Nikolaos Pappas, Petr Motlicek and Hervé Bourlard, Idiap-RR-12-2019

Designing second order recurrent neural networks for prosody modelling, François Marelli, Idiap-RR-16-2018

Fusing TensorFlow with building energy simulation for intelligent energy management in smart cities, José Vázquez-Canteli, Stepan Ulyanin, Jérôme Kämpf and Zoltán Nagy, in: Sustainable Cities and Society, 2018

[DOI]

Semi-supervised Adaptation of Assistant Based Speech Recognition Models for different Approach Areas, Matthias Kleinert, Hartmut Helmke, Gerald Siol, heiko Ehr, Cerna Aneta, Kern Christian, Dietrich Klakow, Petr Motlicek, Youssef Oualil, Mittul Singh and Ajay Srinivasamurthy, in: 37th AIAA/IEEE Digital Avionics Systems Conference, AIAA/IEEE, London, 2018

[URL]

EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, Alexandre Nanchen and Philip N. Garner, Idiap-RR-01-2019

AN END-TO-END NETWORK TO SYNTHESIZE INTONATION USING A GENERALIZED COMMAND RESPONSE MODEL, François Marelli, Bastian Schnell, Hervé Bourlard, T. Dutoit and Philip N. Garner, Idiap-RR-05-2019

Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes, Skanda Muralidhar, Laurent Son Nguyen and Daniel Gatica-Perez, in: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2018

Facing Employers and Customers: What Do Gaze and Expressions Tell About Soft Skills?, Skanda Muralidhar, Remy Siegfried, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 121-126, ASSOC COMPUTING MACHINERY, 2018

[DOI]

Vlogging Over Time: Longitudinal Impressions and Behavior in YouTube, Daniel Gatica-Perez, Dairazalia Sanchez-Cortes, Trinh-Minh-Tri Do, Dinesh Babu Jayagopi and Kazuhiro Otsuka, in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, EGYPT, pages 37-46, 2018

[DOI]

UNICITY: A depth maps database for people detection in security airlocks, Joël Dumoulin, Olivier Canévet, Michael Villamizar, Hugo Nunes, Omar Abou Khaled, Elena Mugellini, Fabrice Moscheni and Jean-Marc Odobez, in: IEEE International Conference on Advanced Video and Signal-based Surveillance Workshop, 2018

WatchNet: Efficient and Depth-based Network for People Detection in Video Surveillance Systems, Michael Villamizar, Angel Martínez-González, Olivier Canévet and Jean-Marc Odobez, in: IEEE International Conference on Advanced Video and Signal-based Surveillance, Auckland, NEW ZEALAND, pages 109-114, 2018

Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation, Xiao Pu, Nikolaos Pappas, James Henderson and Andrei Popescu-Belis, in: Transactions of the Association for Computational Linguistics (TACL), 2018

Phonetic Subspace Features for Improved Query by Example Spoken Term Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, in: Speech Communication, 103:27-36, 2018

[DOI]

Investigating Depth Domain Adaptation for Efficient Human Pose Estimation, Angel Martínez-González, Michael Villamizar, Olivier Canévet and Jean-Marc Odobez, in: European Conference on Computer Vision - Workshops, 2018

Cross-lingual Adaptation of a CTC-based multilingual Acoustic Model, Sibo Tong, Philip N. Garner and Hervé Bourlard, in: Speech Communication, 104:39-46, 2018

[DOI]

Modeling Dyadic and Group Impressions with Inter-Modal and Inter-Person Features, Shogo Okada, Laurent Son Nguyen, Oya Aran and Daniel Gatica-Perez, in: ACM Transactions on Multimedia Computing, Communications, and Applications, 15(1), 2019

Mi Casa es su Casa? Examining Airbnb Hospitality Exchange Practices in a Developing Economy, Salvador Ruiz-Correa, Itzia Ruiz-Correa, Carlo Olmos Carrillo, Fátima Alba Rendón-Huerta, Beatriz Ramirez-Salazar, Laurent Son Nguyen and Daniel Gatica-Perez, in: ACM Transactions on Social Computing, 2(1), 2019

Beyond Weight Tying: Learning Joint Input-Output Embeddings for Neural Machine Translation, Nikolaos Pappas, Lesly Miculicich and James Henderson, in: Proceedings of the Third Conference on Machine Translation (WMT), 2018

Looking South: Learning Urban Perception in Developing Cities, Darshan Santani, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: ACM Transactions on Social Computing, 2018

Document-Level Neural Machine Translation with Hierarchical Attention Networks, Lesly Miculicich, Dhananjay Ram, Nikolaos Pappas and James Henderson, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018

Stochastic Variance Reduced Gradient Optimization of Generative Adversarial Networks, Tatjana Chavdarova, Sebastian Stich, Martin Jaggi and Francois Fleuret, in: International Conference on Machine Learning (ICML) workshop on Theoretical Foundations and Applications of Deep Generative Models, 2018

Single-channel late reverberation power spectral density estimation using denoising autoencoders, Ina Kodrasi and Hervé Bourlard, in: Proc. Annual Conference of the International Speech Communication Association, Hyderabad, India, 2018

Iterative alternating least-aquares approach to jointly estimate the RETFs and the diffuse PSD, Marvin Tammen, Ina Kodrasi and Simon Doclo, in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018

Statistical modeling of speech spectral coefficients in patients with Parkinson's disease, Ina Kodrasi and Hervé Bourlard, in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018

Modelling glottal source information for depression detection, D S Pavan Kumar, Bogdan Vlasenko and Mathew Magimai-Doss, Idiap-RR-13-2018

Word Sense Consistency in Statistical and Neural Machine Translation, Xiao Pu, École Polytechnique Fédérale de Lausanne, 2018

Supervised Gaze Bias Correction for Gaze Coding in Interactions, Remy Siegfried and Jean-Marc Odobez, in: ECEM COGAIN Symposium, pages 3, 2017

A Differential Approach for Gaze Estimation with Calibration, Gang Liu, Yu Yu, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: 29TH BRITISH MACHINE VISION CONFERENCE, 2018

Knowledge Transfer with Jacobian Matching, Suraj Srinivas and Francois Fleuret, in: Proceedings of the International Conference on Machine Learning, 2018

[URL]

Not All Samples Are Created Equal: Deep Learning with Importance Sampling, Angelos Katharopoulos and Francois Fleuret, in: Proceedings of International Conference on Machine Learning, 2018

Gradient-based spectral visualization of CNNs using raw waveforms, Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-11-2018

A Tale of Two Interactions: Inferring Performance in Hospitality Encounters from Cross-Situation Social Sensing, Skanda Muralidhar, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2(129), 2018

Spoofing Deep Face Recognition With Custom Silicone Masks, Sushil Bhattacharjee, Amir Mohammadi and Sébastien Marcel, in: Proceedings of BTAS2018, 2018

A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, Bastian Schnell and Philip N. Garner, Idiap-RR-10-2018

Special issue on robot learning for human-robot collaboration, L. Rozo, H. Ben Amor, Sylvain Calinon, A. Dragan and D. Lee, in: Autonomous Robots, 42(5):953-956, 2018

[DOI]
[URL]

Programming by Demonstration for Shared Control with an Application in Teleoperation, M. Zeestraten, I. Havoutis and Sylvain Calinon, in: IEEE Robotics and Automation Letters (RA-L), 3(3):1848-1855, 2018

[DOI]
[URL]

Flexible Automation Driven by Demonstration: Leveraging Strategies that Simplify Robotics, A. Giusti, M. Zeestraten, E. Icer, A. Pereira, D. G. Caldwell, Sylvain Calinon and M. Althoff, in: IEEE Robotics and Automation Magazine (RAM), 25(2):18-27, 2018

[DOI]
[URL]

A Brief Survey on the Role of Dimensionality Reduction in Manipulation Learning and Control, F. Ficuciello, P. Falco and Sylvain Calinon, in: IEEE Robotics and Automation Letters (RA-L), 3(3):2608-2615, 2018

[DOI]
[URL]

Learning Control, Sylvain Calinon and D. Lee, in: Humanoid Robotics: a Reference, pages 1261-1312, Springer, 2019

[DOI]
[URL]

SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, Khaled Khelif, yann Mombrun, Gideon Hazzani, Petr Motlicek, Srikanth Madikeri, Farhan Sahito, Damien Kelly, Luca Scarpatto, Emmanouil Chatzigavriil and Gerhard Backfried, in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018

[DOI]
[URL]

Phonological Posterior Hashing for Query by Example Spoken Term Detection, Afsaneh Asaei, Dhananjay Ram and Hervé Bourlard, in: Proceedings of Interspeech, 2018

CNN based Query by Example Spoken Term Detection, Dhananjay Ram, Lesly Miculicich and Hervé Bourlard, in: Proceedings of Interspeech, 2018

Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation by Use of Convolutional Neural Networks, Adrian Shajkofci and Michael Liebling, in: 2018 25th IEEE International Conference on Image Processing (ICIP), pages 3818-3822, IEEE, 2018

[DOI]

Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization, Nam Le and Jean-Marc Odobez, in: Proceedings of Interspeech, Hyderabad, INDIA, pages 2257-2261, 2018

[DOI]

On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: Proceedings of Interspeech, Hyderabad, INDIA, pages 1116-1120, 2018

On Learning to Identify Genders from Raw Speech Signal Using CNNs, Selen Hande Kabil, Hannah Muckenhirn and Mathew Magimai-Doss, in: Proceedings of Interspeech, Hyderabad, INDIA, pages 287-291, 2018

[DOI]

Speaker Inconsistency Detection in Tampered Video, Pavel Korshunov and Sébastien Marcel, in: European Signal Processing Conference, 2018

Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech, Jilt Sebastian, Manoj Kumar, D S Pavan Kumar, Mathew Magimai-Doss, Hema A Murthy and Shrikanth Narayanan, in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 292-296, 2018

[DOI]

Implementing Fusion Techniques for the Classification of Paralinguistic Information, Bogdan Vlasenko, Jilt Sebastian, D S Pavan Kumar and Mathew Magimai-Doss, in: Proceedings of Interspeech 2018, pages 526-530, 2018

Analysis of Language Dependent Front-End for Speaker Recognition, Srikanth Madikeri, Subhadeep Dey and Petr Motlicek, in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 1101-1105, 2018

[DOI]

Experimental evaluation of speech enhancement methods in remote microphone systems for hearing aids, Gilles Curtois, Vincent Grimaldi, Hervé Lissek, Ina Kodrasi and Eleftheria Georganti, in: Proc. EuroNoise 2018, Crete, Greece, pages 351-358, 2018

How to Tell Ancient Signs Apart? Recognizing and Visualizing Maya Glyphs with CNNs, Gulcan Can, Jean-Marc Odobez and Daniel Gatica-Perez, in: ACM Journal on Computing and Cultural Heritage (JOCCH), 11(4):20, 2018

[DOI]

Warped Gaussian processes and derivative-based sequential design for functions with heterogeneous variations, Sébastien Marmin, David Ginsbourger, Jean Baccou and Jacques Liandrat, in: SIAM/ASA Journal on Uncertainty Quantification, 6(3):991-1018, 2018

Implementing gender-dependent vowel-level analysis for boosting speech-based depression recognition, Bogdan Vlasenko, Hesam Sagha, Nicholas Cummins and Björn Schuller, in: Proceedings of Interspeech 2017, 2017

Not All Samples Are Created Equal: Deep Learning with Importance Sampling, Angelos Katharopoulos and Francois Fleuret, Idiap-RR-12-2018

Local Affine Approximations for Improving Knowledge Transfer, Suraj Srinivas and Francois Fleuret, Idiap-Com-01-2018

[URL]

WILDTRACK: A Multi-camera HD Dataset for Dense Unscripted Pedestrian Detection, Tatjana Chavdarova, Pierre Baqué, Andrii Maksai, Stéphane Bouquet, Cijo Jose, Louis Lettry, Francois Fleuret, Pascal Fua and Luc Van Gool, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 5030-5039, 2018

[DOI]

Knowledge Transfer with Jacobian Matching, Suraj Srinivas and Francois Fleuret, Idiap-RR-04-2018

[URL]

SGAN: An Alternative Training of Generative Adversarial Networks, Tatjana Chavdarova and Francois Fleuret, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 9407-9415, IEEE, 2018

[DOI]

Implémentation d'un algorithme de réduction de taille des réseaux de neurones, François Marelli, Idiap-RR-03-2018

Sequential Design of Computer Experiments, David Ginsbourger, in: Wiley StatsRef: Statistics Reference Online, Wiley, 2018

Analysis of eigenvalue decomposition-based late reverberation power spectral density estimation, Ina Kodrasi and Simon Doclo, in: IEEE Transaction on Acoustics, Speech and Language Processing, 26(6):1106-1118, 2018

Complexity reduction of eigenvalue decomposition-based diffuse power spectral density estimators using the power method, Marvin Tammen, Ina Kodrasi and Simon Doclo, in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 451-455, 2018

Joint late reverberation and noise power spectral density estimation in a spatially homogeneous noise field, Ina Kodrasi and Simon Doclo, in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 441-445, 2018

Self-Attentive Residual Decoder for Neural Machine Translation, Lesly Miculicich, Nikolaos Pappas, Dhananjay Ram and Andrei Popescu-Belis, in: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018

Learning embeddings: efficient algorithms and applications, Cijo Jose, École Polytechnique Fédérale de Lausanne, 2018

[DOI]

Novel Algorithms for Clustering, James Newling, École polytechnique fédérale de Lausanne, 2018

[DOI]

Semi-blind spatially-variant deconvolution in optical microscopy with local Point Spread Function estimation by use of Convolutional Neural Networks, Adrian Shajkofci and Michael Liebling, Idiap-RR-07-2018

Deep Neural Networks for Multiple Speaker Detection and Localization, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, AUSTRALIA, pages 74-79, 2018

[DOI]

Towards directly modeling raw speech signal for speaker verification using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, CANADA, pages 4884-4888, 2018

NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL, Milos Cernak and Sibo Tong, in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

Generative Models for Learning Robot Manipulation Skills from Humans, Ajay Kumar Tanwani, Ecole Polytechnique Federale de Lausanne, 2018

[DOI]

DrinkSense: Characterizing Youth Drinking Behavior using Smartphones, Darshan Santani, Trinh-Minh-Tri Do, Florian Labhart, Sara Landolt, Emmanuel Kuntsche and Daniel Gatica-Perez, in: IEEE Transactions on Mobile Computing, 2018

Cross-Eyed 2017: Cross-Spectral Iris/Periocular Recognition Competition., Ana Sequeira, Lulu Chen, James Ferryman, Peter Wild, Fernando Alonso-Fernandez, Josef Bigün, Kiran B. Raja, R. Raghavendra, Christoph Busch, Tiago de Freitas Pereira, Sébastien Marcel, Sushree Sangeeta Behera, Mahesh Gour and Vivek Kanhangad, in: IEEE/IAPR International Joint Conference on Biometrics, Denver, Colorado, USA, IEEE, 2017

A Poisson regression approach to model monthly hail occurrence in Northern Switzerland using large-scale environmental variables, Erica Madonna, David Ginsbourger and Olivia Martius, in: Atmospheric Research, 203:261-274, 2018

[DOI]

Theories and Models of Teams and Group, R. Reiter-Palmon, T. Sinha, J. Gevers, Jean-Marc Odobez and G. Volpe, in: Small Group Research, 48(5):544--567, 2017

[DOI]

SMILE Swiss German Sign Language Dataset, Sarah Ebling, Necati Cihan Camgoz, Penny Boyes Braem, Katja Tissi, Sandra Sidler-Miserez, Stephanie Stoll, Simon Hadfield, Tobias Haug, Richard Bowden, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss, in: Language Resources and Evaluation Conference, 2018

Active Online Anomaly Detection using Dirichlet Process Mixture Model and Gaussian Process Classification, Jagannadan Varadarajan, R Subramanian, Narendra Ahuja, Pierre Moulin and Jean-Marc Odobez, in: IEEE Winter Conference on Applications of Computer Vision (WACV), Washington, 2017

Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, Khaled Khelif, yann Mombrun, Gerhard Backfried, Farhan Sahito, Luca Scarpatto, Petr Motlicek, Damien Kelly, Gideon Hazzani, Emmanouil Chatzigavriil and Srikanth Madikeri, in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athenes, Greece, pages 32-39, IEEE Computer Society, 2017

[DOI]
[URL]

On the Use of Convolutional Neural Networks for Speech Presentation Attack Detection, Pavel Korshunov, Andreé R. Goncalves, Ricardo P. V. Violato, Flávio O. Simões and Sébastien Marcel, in: International Conference on Identity, Security and Behavior Analysis, 2018

Deep Multi-Camera People Detection, Tatjana Chavdarova and Francois Fleuret, in: Proceedings of the IEEE International Conference on Machine Learning and Applications, 2017

K-Medoids For K-Means Seeding, James Newling and Francois Fleuret, in: Proceedings of the international conference on Neural Information Processing Systems, 2017

Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition, Timur Bagautdinov, Alexandre Alahi, Francois Fleuret, Pascal Fua and Sylvio Savarese, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017

Multi-Modal Mean-Fields via Cardinality-Based Clamping, Pierre Baqué, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017

Boosted Exudate Segmentation in Retinal Images using Residual Nets, Samaneh Abbasi-Sureshjani, Behdad Dasht Bozorg, Bart ter Haar Romeny and Francois Fleuret, in: Proceedings of the MICCAI Workshop on Ophthalmic Medical Image Analysis, 2017

Geometric calibration of Colour and Stereo Surface Imaging System of ESA's Trace Gas Orbiter, Stepan Tulyakov, Anton Ivanov, Nicolas Thomas, Victoria Roloff, Antoine Pommerol, Grabriele Cremonese, Thomas Weigel and Francois Fleuret, in: Advances in Space Research, 2018

Non-Markovian Globally Consistent Multi-Object Tracking, Andrii Maksai, Xinchao Wang, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2017

Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection, Pierre Baqué, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2017

Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction, Stepan Tulyakov, Anton Ivanov and Francois Fleuret, in: Proceedings of the IEEE International Conference on Computer Vision, 2017

Exploratory Study on Direct Prediction of Diabetes using Deep Residual Networks, Samaneh Abbasi-Sureshjani, Behdad Dasht Bozorg, Bart ter Haar Romeny and Francois Fleuret, in: Proceedings of the thematic conference on computational vision and medical image processing, 2017

Towards Weakly Supervised Acoustic Subword Unit Discovery and Lexicon Development Using Hidden Markov Models, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, in: Speech Communication, 96:168-183, 2018

[DOI]

DNN based speaker embedding using content information for text-dependent speaker verification, Subhadeep Dey, Takafumi Koshinaka, Petr Motlicek and Srikanth Madikeri, Idiap-RR-06-2018

Check Out This Place: Inferring Ambiance from Airbnb Photos, Laurent Son Nguyen, Salvador Ruiz-Correa, Marianne Schmid Mast and Daniel Gatica-Perez, in: IEEE transactions on Multimedia, 20(6):1499-1511, 2018

[DOI]
[URL]

Development of the Geographical Proportional-to-size Street-Intercept Sampling (GPSIS) method for recruiting urban nightlife-goers in an entire city, Florian Labhart, Flavio Tarsetti, Olivier Bornet, Darshan Santani, Jasmine Truong, Sara Landolt, Daniel Gatica-Perez and Emmanuel Kuntsche, in: International Journal of Social Research Methodology, 20(6):721-736, 2017

[DOI]

Examining Linguistic Content and Skill Impression Structure for Job Interview Analytics in Hospitality, Skanda Muralidhar and Daniel Gatica-Perez, in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017

Learning Autonomous Behaviours for the Body of a Flexible Surgical Robot, D. Bruno, Sylvain Calinon and D. G. Caldwell, in: Autonomous Robots, 41(2):333-347, 2017

[DOI]
[URL]

Robot Learning with Task-Parameterized Generative Models, Sylvain Calinon, in: Robotics Research, pages 111-126, Springer, 2018

[DOI]
[URL]

Supervisory teleoperation with online learning and optimal control, I. Havoutis and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1534-1540, IEEE, 2017

[URL]

Trajectory and Foothold Optimization using Low-Dimensional Models for Rough Terrain Locomotion, C. Mastalli, M. Focchi, I. Havoutis, A. Radulescu, Sylvain Calinon, J. Buchli, D. G. Caldwell and C. Semini, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1096-1103, IEEE, 2017

[URL]

Learning Task-Space Synergies using Riemannian Geometry, M. Zeestraten, I. Havoutis, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Vancouver, Canada, pages 73-78, IEEE, 2017

[URL]

Generating Calligraphic Trajectories with Model Predictive Control, D. Berio, Sylvain Calinon and F. F. Leymarie, in: Proc. 43rd Conf. on Graphics Interface, Edmonton, AL, Canada, pages 132-139, 2017

[DOI]

Dynamic Graffiti Stylisation with Stochastic Optimal Control, D. Berio, Sylvain Calinon and F. F. Leymarie, in: Intl Workshop on movement and computing (MOCO), London, UK, pages 1-8, ACM, 2017

[DOI]
[URL]

Visual Analysis of Maya Glyphs via Crowdsourcing and Deep Learning, Gulcan Can, École Polytechnique Fédérale de Lausanne, 2017

[DOI]

Deeply Vulnerable -- a study of the robustness of face recognition to presentation attacks, Amir Mohammadi, Sushil Bhattacharjee and Sébastien Marcel, in: IET (The Institution of Engineering and Technology) -- Biometrics:1--13, 2017

[DOI]

Planification adaptative d'expériences numériques par paquets en contexte non stationnaire pour une étude de fissuration mécanique, Sébastien Marmin, Jean Baccou, Frédéric Perales, David Ginsbourger and Jacques Liandrat, in: 23eme Congres Francais de Mecanique, 28 aout - 1er septembre 2017, Lille, France (FR), AFM, 2017

[URL]

Non-parametric warping via local scale estimation for non-stationary Gaussian process modelling, Sébastien Marmin, Jean Baccou, Jacques Liandrat and David Ginsbourger, in: Wavelets and Sparsity XVII, pages 1039421, International Society for Optics and Photonics, 2017

[DOI]
[URL]

Towards directly modeling raw speech signal for speaker verification using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-30-2017

Deep Neural Networks for Multiple Speaker Detection and Localization, Weipeng He, Petr Motlicek and Jean-Marc Odobez, Idiap-RR-02-2018

Estimating orthant probabilities of high dimensional Gaussian vectors with an application to set estimation, Dario Azzimonti and David Ginsbourger, in: Journal of Computational and Graphical Statistics, 27(2):255-267, 2018

[DOI]
[URL]

On uncertainty quantification in hydrogeology and hydrogeophysics, Niklas Linde, David Ginsbourger, James Iriving, Fabio Nobile and Arnaud Doucet, in: Advances in Water Resources, 110:166–181, 2017

[DOI]
[URL]

Towards Quantifying the Entropy of Fingervein Patterns across Different Feature Extractors, Vedrana Krivokuca and Sébastien Marcel, in: 2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis (ISBA), 2018

NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL CLASSIFICATION, Milos Cernak and Sibo Tong, Idiap-RR-28-2017

Combining Electromyography and Tactile Myography to Improve Hand and Wrist Activity Detection in Prostheses, N. Jaquier, M. Connan, C. Castellini and Sylvain Calinon, in: Technologies, 5(4), 2017

On Job Training: Automated Interpersonal Behavior Assessment & Real-Time Feedback, Skanda Muralidhar, in: Proceedings of the 25th ACM International Conference on Multimedia, 2017, 2017

Bites'n'Bits: Inferring Eating Behavior from Contextual Mobile Data, Joan-Isaac Biel, Nathalie Martin, David Labbe and Daniel Gatica-Perez, in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (PACM IMWUT), 1(4):125-157, 2017

Cognitive Speech Coding: Examining the Impact of Cognitive Speech Processing on Speech Compression, Milos Cernak, Afsaneh Asaei and Alexandre Hyafil, in: IEEE Signal Processing Magazine, 35(3):97-109, 2018

[DOI]

Direct inversion algorithm for focal plane scanning optical projection tomography, Kevin G. Chan and Michael Liebling, in: Biomedical Optics Express, 2017

What you can't see can help you -- extended-range imaging for 3D-mask presentation attack detection, Sushil Bhattacharjee and Sébastien Marcel, in: Proceedings of the 16th International Conference on Biometrics Special Interest Group., Darmstadt (Germany), Gesellschaft fuer Informatik e.V. (GI), 2017

Evaluating Attention Networks for Anaphora Resolution, Jonathan Pilault, Nikolaos Pappas, Lesly Miculicich and Andrei Popescu-Belis, Idiap-RR-27-2017

Towards Document-Level Neural Machine Translation, Lesly Miculicich, Idiap-RR-25-2017

A Domain Adaptation Approach to Improve Speaker Turn Embedding Using Face Representation, Nam Le and Jean-Marc Odobez, in: ACM International Conference on Multimodal Interaction, Glasgow, Scotland, ACM, 2017

Long-Term Spectral Statistics for Voice Presentation Attack Detection, Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 25(11):2098-2111, 2017

How May I Help You? Behavior and Impressions in Hospitality Service Encounters, Skanda Muralidhar, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proceddings of 19th ACM International Conference on Multimodal Interaction, 2017

Elderly People Living Alone: Detecting Home Visits with Ambient and Wearable Sensing, Rui Hu, Hieu Pham, Philipp Buluschek and Daniel Gatica-Perez, in: In Proceedings of MMHealth, 2017

Venues in Social Media: Examining Ambiance Perception Through Scene Semantics, Yassir Benkhedda, Darshan Santani and Daniel Gatica-Perez, in: Proceedings of the 25th ACM International Conference on Multimedia, ACM, 2017, 2017

Multilingual Hierarchical Attention Networks for Document Classification, Nikolaos Pappas and Andrei Popescu-Belis, in: Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP), pages 1015-1025, 2017

A reproducible study on remote heart rate measurement, Guillaume Heusch, André Anjos and Sébastien Marcel, in: arXiv, 2017

[URL]

Biometrics: In Search of Identity and Security (Q & A), Zahid Akhtar, Abdenour Hadid, Mark Nixon, Massimo Tistarelli, Jean-Luc Dugelay and Sébastien Marcel, in: IEEE MultiMedia, PP, 2017

[DOI]

Maya Codical Glyph Segmentation: A Crowdsourcing Approach, Gulcan Can, Jean-Marc Odobez and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 20(3):711-725, 2018

[DOI]
[URL]

On the Generalization of Fused Systems in Voice Presentation Attack Detection, Andreé R. Goncalves, Pavel Korshunov, Ricardo P. V. Violato, Flávio O. Simões and Sébastien Marcel, in: 16th International Conference of the Biometrics Special Interest Group, 2017

Presentation attack detection in voice biometrics, Pavel Korshunov and Sébastien Marcel, in: User-Centric Privacy and Security in Biometrics, The Institution of Engineering and Technology, 2017

Impact of score fusion on voice biometrics and presentation attack detection in cross-database evaluations, Pavel Korshunov and Sébastien Marcel, in: IEEE Journal of Selected Topics in Signal Processing, 11(4):695 - 705, 2017

[DOI]

Improving speaker turn embedding by crossmodal transfer learning from face embedding, Nam Le and Jean-Marc Odobez, in: ICCV Workshop on Computer Vision for Audio-Visual Media, 2017

Perceptual Information Loss due to Impaired Speech Production, Afsaneh Asaei, Milos Cernak and Hervé Bourlard, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017

Perceptual Information Loss due to Impaired Speech Production, Afsaneh Asaei, Milos Cernak and Hervé Bourlard, Idiap-RR-20-2017

On Modeling the Synergy Between Acoustic and Lexical Information for Pronunciation Lexicon Development, Marzieh Razavi, École polytechnique fédérale de Lausanne (EPFL), 2017

[DOI]

A Generative Model for Intention Recognition and Manipulation Assistance in Teleoperation, Ajay Kumar Tanwani and Sylvain Calinon, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

NeuroSpeech: An open-source software for Parkinson's speech analysis, Juan Rafael Orozco-Arroyave, Juan Camilo Vasquez-Correa, Jesús Francisco Vargas-Bonilla, Raman Arora, Najim Dehak, Phani Sankar Nidadavolu, Heidi Christensen, Frank Rudzicz, Maria Yancheva, Alyssa Vann, Nikolai Vogler, Tobias Bocklet, Milos Cernak, Julius Hannink and Elmar Nöth, in: Digital Signal Processing, 2017

[DOI]

A Sub-Quadratic Exact Medoid Algorithm, James Newling and Francois Fleuret, in: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

Template-matching for Text-dependent Speaker Verification, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, in: Speech Communication, 2017

Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), Lesly Miculicich and Andrei Popescu-Belis, in: Proceedings of the Third Workshop on Discourse in Machine Translation (DiscoMT), Denmark, Copenhagen, Association for Computational Linguistics (ACL), 2017

End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017

Sense-Aware Statistical Machine Translation using Adaptive Context-Dependent Clustering, Xiao Pu, Nikolaos Pappas and Andrei Popescu-Belis, in: Proceedings of Second Conference on Machine Translation (WMT17), 2017

Learning Manipulability Ellipsoids for Task Compatibility in Robot Manipulation, L. Rozo, N. Jaquier, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3183-3189, 2017

[URL]

Gaussian Mixture Regression on Symmetric Positive Definite Matrices Manifolds: Application to Wrist Motion Estimation with sEMG, N. Jaquier and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 59-64, 2017

[URL]

Improving hand and wrist activity detection using tactile sensors and tensor regression methods on Riemannian manifolds, N. Jaquier, C. Castellini and Sylvain Calinon, in: Proc. of the Myoelectric Control Symposium, 2017

[URL]

Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models, Khalil Mrini, Nikolaos Pappas and Andrei Popescu-Belis, Idiap-RR-26-2017

An Investigation of Deep Neural Networks for Multilingual Speech Recognition Training and Adaptation, Sibo Tong, Philip N. Garner and Hervé Bourlard, in: Proc. of Interspeech, 2017

Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, Milos Cernak, Juan Rafael Orozco-Arroyave, Frank Rudzicz, Heidi Christensen, Juan Camilo Vasquez-Correa and Elmar Nöth, in: Computer Speech and Language, 2017

Supervised Gaze Bias Correction for Gaze Coding in Interactions, Remy Siegfried and Jean-Marc Odobez, Idiap-RR-23-2017

MAAYA: Multimedia Methods to Support Maya Epigraphic Analysis, Daniel Gatica-Perez, Gulcan Can, Rui Hu, Stephane Marchand-Maillet, Jean-Marc Odobez, Carlos Pallan Gayol and Edgar Roman-Rangel, in: Arqueologia computacional: Nuevos enfoques para el analisis y la difusion del patrimonio cultural, INAH-RedTDPC, 2017

Towards large scale multimedia indexing: A case study on person discovery in broadcast news, Nam Le, Jean-Marc Odobez and et al., in: 15th International Workshop on Content-Based Multimedia Indexing, 2017

Shape Representations for Maya Codical Glyphs: Knowledge-driven or Deep?, Gulcan Can, Jean-Marc Odobez and Daniel Gatica-Perez, in: 15th International Workshop on Content-Based Multimedia Indexing, 2017

Bob Speaks Kaldi, Milos Cernak, Alain Komaty, Amir Mohammadi, André Anjos and Sébastien Marcel, in: Proc. of Interspeech, 2017

Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, Milos Cernak, Juan Rafael Orozco-Arroyave, Frank Rudzicz, Heidi Christensen, Juan Camilo Vasquez-Correa and Elmar Nöth, Idiap-RR-16-2017

An Approach for Imitation Learning on Riemannian Manifolds, M. Zeestraten, I. Havoutis, J. Silverio, Sylvain Calinon and D. G. Caldwell, in: IEEE Robotics and Automation Letters (RA-L), 2(3):1240-1247, 2017

[DOI]
[URL]

Learning adaptive dressing assistance from human demonstration, E. Pignat and Sylvain Calinon, in: Robotics and Autonomous Systems, 93:61-75, 2017

[DOI]
[URL]

Insiders and Outsiders: Comparing Urban Impressions between Population Groups, Darshan Santani, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: International Conference on Multimedia Retrieval, ACM, 2017

[DOI]

Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment, Afsaneh Asaei, Milos Cernak, Hervé Bourlard and Dhananjay Ram, in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017

Topic and Sentiment in Phrase-Based Statistical Machine Translation, Maryam Habibi, Nikolaos Pappas and Andrei Popescu-Belis, Idiap-RR-10-2017

A Posterior-Based Multi-Stream Formulation for G2P Conversion, Marzieh Razavi and Mathew Magimai-Doss, in: IEEE Signal Processing Letters, 2017

Object Detection with Active Sample Harvesting, Olivier Canévet, École Polytechnique Fédérale de Lausanne, 2017

Large-Scale Image Segmentation with Convolutional Networks, Pedro H. O. Pinheiro, Sciences et Techniques de l’Ingénieur (STI), 2017

Using Coreference Links to Improve Spanish-to-English Machine Translation, Lesly Miculicich and Andrei Popescu-Belis, in: Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), Valencia, Spain, pages 30-40, Association for Computational Linguistics (ACL), 2017

Using Coreference Links to Improve Spanish-to-English Machine Translation, Lesly Miculicich and Andrei Popescu-Belis, Idiap-RR-07-2017

Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, Ngoc-Quang Luong and Andrei Popescu-Belis, in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017

Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, Ngoc-Quang Luong and Andrei Popescu-Belis, Idiap-RR-06-2017

Multilingual Hierarchical Attention Networks for Document Classification, Nikolaos Pappas and Andrei Popescu-Belis, Idiap-RR-17-2017

[URL]

Explicit Document Modeling through Weighted Multiple-Instance Learning, Nikolaos Pappas and Andrei Popescu-Belis, in: Journal of Artificial Intelligence Research (JAIR), 58:591--626, 2017

Multilingual Visual Sentiment Concept Clustering and Analysis, Nikolaos Pappas, Miriam Redi, Mercan Topkara, Hongyi Liu, Brendan Jou, Tao Chen and Shih-Fu Chang, in: International Journal of Multimedia Information Retrieval, 2017

Real-time Multiple Head Tracking Using Texture and Colour Cues, Vasil Khalidov and Jean-Marc Odobez, Idiap-RR-02-2017

Intonation Modelling for Speech Synthesis and Emphasis Preservation, Pierre-Edouard Honnet, École Polytechnique Fédérale de Lausanne, 2017

[DOI]

The SIWIS French Speech Synthesis Database – Design and recording of a high quality French database for speech synthesis, Pierre-Edouard Honnet, Alexandros Lazaridis, Philip N. Garner and Junichi Yamagishi, Idiap-RR-03-2017

On the Impact of Non-modal Phonation On Phonological Features, Milos Cernak, Elmar Nöth, Frank Rudzicz, Heidi Christensen, Juan Rafael Orozco-Arroyave, Raman Arora, Tobias Bocklet, Hamidreza Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Juan Camilo Vasquez, Maria Yancheva, Alyssa Vann and Nikolai Vogler, in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017

Multi-view Representation Learning Via GCCA for Multimodal Analysis of Parkinson's Disease, Juan Camilo Vasquez-Correa, Juan Rafael Orozco-Arroyave, Raman Arora, Elmar Nöth, Najim Dehak, Heidi Christensen, Frank Rudzicz, Tobias Bocklet, Milos Cernak, Hamidreza Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Maria Yancheva, Alyssa Vann and Nikolai Vogler, in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017

Towards Weakly Supervised Acoustic Subword Unit Discovery and Lexicon Development Using Hidden Markov Models, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-15-2017

A MultiPath Network for Object Detection, Sergey Zagoruyko, Adam Lerer, Tsung-Yi Lin, Pedro H. O. Pinheiro, Sam Gross, Soumith Chintala and Piotr Dollar, in: Proceedings of the British Machine Vision Conference, BMVA Press, 2016

[URL]

Learning to Refine Object Segments, Pedro H. O. Pinheiro, Tsung-Yi Lin, Ronan Collobert and Piotr Dollar, in: Computer Vision - ECCV 2016, Amsterdam, pages 75-91, Springer, 2016

[DOI]
[URL]

Unsupervised Interpretable Pattern Discovery in Time Series Using Autoencoders, Kevin Bascol, Remi Emonet, Elisa Fromont and Jean-Marc Odobez, in: IAPR Int. Workshops on Structural and Syntactic Pattern Recognition (SSPR), 2016

CRF-Based Context Modeling for Person Identification in Broadcast Videos, Paul Gay, Sylvain Meignier, Paul Deleglise and Jean-Marc Odobez, in: Frontiers in ICT: Computer Image Analysis, 3, 2016

Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, Xiao Pu, Laura Mascarell and Andrei Popescu-Belis, in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017

Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, Xiao Pu, Laura Mascarell and Andrei Popescu-Belis, Idiap-RR-08-2017

Manual and automatic labeling of discourse connectives for machine translation (Keynote paper), Andrei Popescu-Belis, in: TextLink: Structuring Discourse in Multilingual Europe (Handbook of the Second Action Conference), Budapest, Hungary, pages 16-20, 2016

[URL]

Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction, Liane Guillou, Christian Hardmeier, Preslav Nakov, Sara Stymne, Jorg Tiedemann, Yannick Versley, Mauro Cettolo, Bonnie Webber and Andrei Popescu-Belis, in: Proceedings of WMT 2016 (First Conference on Machine Translation), Association for Computational Linguistics, Berlin, Germany, pages 525–542, 2016

[URL]

Comparing Two Strategies for Query Expansion in a News Monitoring System, Parvaz Mahdabi and Andrei Popescu-Belis, in: Natural Language Processing and Information Systems: 21st International Conference on Applications of Natural Language to Information Systems, NLDB 2016, pages 267-275, Springer-Verlag, 2016

[DOI]

Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens, Catharine Oertel, José David Lopes, Yu Yu, Kenneth Alberto Funes Mora, Joakim Gustafson, Alan Black and Jean-Marc Odobez, in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, Tokyo, Japan, pages 21-28, ACM, 2016

[DOI]

EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, pages 5370-5374, 2017

CONTENT NORMALIZATION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Marc Ferras, Petr Motlicek and Srikanth Madikeri, Idiap-RR-31-2017

EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, Idiap-RR-04-2017

Template-matching for Text-dependent Speaker Verification, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, Idiap-RR-32-2017

Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, in: Proceedings of Interspeech 2016, pages 2199-2203, 2016

Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, Idiap-RR-09-2018

Computational Analysis of Urban Places Using Mobile Crowdsensing, Darshan Santani, Ecole Polytechnique Federale de Lausanne, 2016

[DOI]

Long Term Spectral Statistics for Voice Presentation Attack Detection, Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-11-2017

Online Inference in Bayesian Non-Parametric Mixture Models under Small Variance Asymptotics, Ajay Kumar Tanwani and Sylvain Calinon, in: NIPS workshop on Advances in Approximate Bayesian Inference, Barcelona, Spain, pages 1-5, 2016

[URL]

Learning From Humans, A. G. Billard, Sylvain Calinon and R. Dillmann, in: Handbook of Robotics, pages 1995-2014, Springer, 2016

[DOI]
[URL]

Online motion synthesis with minimal intervention control and formal safety guarantees, M. Zeestraten, A. Pereira, M. Althoff and Sylvain Calinon, in: Proc. IEEE Intl Conf. on Systems, Man, and Cybernetics, Budapest, Hungary, 2016

Dexterous Undersea Interventions with Far Distance Onshore Supervision: the DexROV Project, J. Gancet, P. Weiss, G. Antonelli, M. F. Pfingsthorn, Sylvain Calinon, A. Turetta, C. Walen, D. Urbina, S. Govindaraj, P. Letier, X. Martinez, J. Salini, B. Chemisky, G. Indiveri, G. Casalino, P. Di Lillo, E. Simetti, D. De Palma, A. Birk, A. K. Tanwani, I. Havoutis, A. Caffaz and L. Guilpain, in: IFAC Conference on Control Applications in Marine Systems (CAMS), Trondheim, Norway, pages 414-419, 2016

[DOI]
[URL]

Stochastic learning and control in multiple coordinate systems, Sylvain Calinon, in: Intl Workshop on Human-Friendly Robotics, Genoa, Italy, pages 1-5, 2016

Scalable greedy algorithms for transfer learning, Ilja Kuzborskij, Francesco Orabona and Barbara Caputo, in: Computer Vision and Image Understanding, 2016

Fast Rates by Transferring from Auxiliary Hypotheses, Ilja Kuzborskij and Francesco Orabona, in: Machine Learning, 2016

Maya Codical Glyph Segmentation: A Crowdsourcing Approach, Gulcan Can, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-01-2017

Redundant Hash Addressing for Large-Scale Query by Example Spoken Query Detection, Afsaneh Asaei, Dhananjay Ram and Hervé Bourlard, Idiap-RR-31-2016

Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), Lesly Miculicich and Andrei Popescu-Belis, Idiap-RR-29-2016

Information Theoretic Analysis of Production-Perception Efficiency: Case Study of Speech Pathology, Afsaneh Asaei, Milos Cernak and Hervé Bourlard, Idiap-RR-30-2016

What TripAdvisor Can't Tell: Crowdsourcing Urban Impressions for Whole Cities, Daniel Gatica-Perez, Salvador Ruiz-Correa and Darshan Santani, in: Digital Polis, L'Oeil d'Or (translated to French.), 2018

SenseCityVity: Mobile Crowdsourcing, Urban Awareness, and Collective Action in Mexico, Salvador Ruiz-Correa, Darshan Santani, Beatriz Ramirez-Salazar, Itzia Ruiz-Correa, Fátima Alba Rendón-Huerta, Carlo Olmos Carrillo, Brisa Sandoval Mexicano, Angel Arcos Garcia, Rogelio Hasimoto-Beltran and Daniel Gatica-Perez, in: IEEE Pervasive Computingg, Special Issue on Smart Cities, 16(2):44-53, 2017

Cognitive speech coding, Milos Cernak and Afsaneh Asaei, Idiap-RR-27-2016

Learning assistive teleoperation behaviors from demonstration, I. Havoutis and Sylvain Calinon, in: Proc. IEEE International Symposium on Safety, Security and Rescue Robotics, pages 258-263, 2016

Learning dynamic graffiti strokes with a compliant robot, D. Berio, Sylvain Calinon and F. F. Leymarie, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3981-3986, 2016

[URL]

Implementation of the Standard I-vector System for the Kaldi Speech Recognition Toolkit, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Marc Ferras, Idiap-RR-26-2016

Nested Mini-Batch K-Means, James Newling and Francois Fleuret, in: Proceedings of NIPS, 2016

Speech vocoding for laboratory phonology, Milos Cernak, Štefan Beňuš and Alexandros Lazaridis, in: Computer Speech and Language, 2016

Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet and Philip N. Garner, in: 9th ISCA Speech Synthesis Workshop, 2016

Human versus Machine Attention in Document Classification: A Dataset with Crowdsourced Annotations, Nikolaos Pappas and Andrei Popescu-Belis, in: Proceedings of the EMNLP 2016 Workshop on Natural Language Processing for Social Media, Austin, USA, 2016

On the impact of non-modal phonation on phonological features, Milos Cernak, Elmar Nöth, Frank Rudzicz, Heidi Christensen, Juan Rafael Orozco-Arroyave, Raman Arora, Tobias Bocklet, Hamidreza Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Juan Camilo Vasquez, Maria Yancheva, Alyssa Vann and Nikolai Vogler, Idiap-RR-28-2016

Anomaly detection in elderly daily behavior in ambient sensing environments, Oya Aran, Dairazalia Sanchez-Cortes, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: Proceedings of the 7th Int. Workshop on Human Behavior Understanding, ACM Multimedia, 2016, Amsterdam, Netherlands, 2016

The REPLAY-MOBILE Face Presentation-Attack Database, Artur Costa-Pazo, Sushil Bhattacharjee, Esteban Vazquez-Fernandez and Sébastien Marcel, in: Proceedings of the International Conference on Biometrics Special Interests Group, 2016

Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei and Philip N. Garner, in: IEEE/ACM Trans. on Audio, Speech and Language Processing, 2016

Feature mapping using far-field microphones for distant speech recognition, Ivan Himawan, Petr Motlicek, David Imseng and Sridha Sridharan, Idiap-RR-20-2016

InnerView: Learning Place Ambiance from Social Media Images, Darshan Santani, Rui Hu and Daniel Gatica-Perez, in: Proceedings of the 24th ACM International Conference on Multimedia, ACM, 2016

[DOI]

The Night is Young: Urban Crowdsourcing of Nightlife Patterns, Darshan Santani, Joan-Isaac Biel, Florian Labhart, Jasmine Truong, Sara Landolt, Emmanuel Kuntsche and Daniel Gatica-Perez, in: Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, ACM, 2016

[DOI]

Sparse Hidden Markov Models for Exemplar-based Speech Recognition Using Deep Neural Network Posterior Features, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-19-2016

Explicit Suggestion of Query Terms for News Search using Topic Models and Word Embeddings, Parvaz Mahdabi and Andrei Popescu-Belis, Idiap-RR-21-2016

Feature mapping using far-field microphones for distant speech recognition, Ivan Himawan, Petr Motlicek, David Imseng and Sridha Sridharan, in: Speech Communication, 83:1-9, 2016

[DOI]
[URL]

Efficient Posterior Exemplar Search Space Hashing Exploiting Class-Specific Sparsity Structures, Afsaneh Asaei, Gil Luyet, Milos Cernak and Hervé Bourlard, in: Interspeech, San Francisco, CA, 2016

On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, Milos Cernak, Afsaneh Asaei and Hervé Bourlard, in: Speech Communication, 84:36-45, 2016

[DOI]
[URL]

PAoS Markers: Trajectory Analysis of Selective Phonological Posteriors for Assessment of Progressive Apraxia of Speech, Afsaneh Asaei, Milos Cernak and Marina Laganaro, in: Proceeding on the 7th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), 2016

Word Sequence Modeling using Deep Learning: and End-to-end Approach and its Applications, Joël Legrand, EPFL, 2016

[DOI]

Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: International Conference of the Biometrics Special Interest Group (BIOSIG), 2016

Transferring Neural Representations for Low-dimensional Indexing of Maya Hieroglyphic Art, Edgar Roman-Rangel, Gulcan Can, Stephane Marchand-Maillet, Rui Hu, Carlos Pallan Gayol, Guido Krempel, Jakub Spotak, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proc. ECCV Workshop on Computer Vision for Art Analysis, Amsterdam, pages 842-855, Springer, 2016

[DOI]
[URL]

Emphasis Recreation for TTS using Intonation Atoms, Pierre-Edouard Honnet and Philip N. Garner, in: 9th ISCA Speech Synthesis Workshop, pages 14--20, 2016

[DOI]

Learning Controllers for Reactive and Proactive Behaviors in Human-Robot Collaboration, L. Rozo, J. Silverio, Sylvain Calinon and D. G. Caldwell, in: Frontiers in Robotics and AI, 3(30):1-11, 2016

[DOI]

Learning Physical Collaborative Robot Behaviors from Human Demonstrations, L. Rozo, Sylvain Calinon, D. G. Caldwell, P. Jimenez and C. Torras, in: IEEE Trans. on Robotics, 32(3):513-527, 2016

[DOI]
[URL]

Variable Duration Movement Encoding with Minimal Intervention Control, M. Zeestraten, Sylvain Calinon and D. G. Caldwell, in: Proc. of the IEEE Intl Conf. on Robotics and Automation (ICRA), pages 497-503, 2016

Joint Operation of Voice Biometrics and Presentation Attack Detection, Pavel Korshunov and Sébastien Marcel, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016

[URL]

Overview of BTAS 2016 Speaker Anti-spoofing Competition, Pavel Korshunov, Sébastien Marcel, Hannah Muckenhirn, A. R. Gonçalves, A. G. Souza Mello, R. P. Velloso Violato, F. O. Simões, M. U. Neto, M. de Assis Angeloni, J. A. Stuchi, H. Dinkel, N. Chen, Y. Qian, D. Paul, G. Saha and Md Sahidullah, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016

[URL]

Cross-database evaluation of audio-based spoofing detection systems, Pavel Korshunov and Sébastien Marcel, in: Interspeech, San Francisco, USA, 2016

[URL]

Inter-task System Fusion for Speaker Recognition, Marc Ferras, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Hervé Bourlard, in: Proceeedings of the INTERSPEECH, 2016

Scalable Metric Learning via Weighted Approximate Rank Component Analysis, Cijo Jose and Francois Fleuret, in: ECCV 2016, 2016

Fast K-Means with Accurate Bounds, James Newling and Francois Fleuret, in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016

Phrase Representations for Multiword Expressions, Joël Legrand and Ronan Collobert, in: Proceedings of the 12th Workshop on Multiword Expressions, 2016

Neural Network-based Word Alignment through Score Aggregation, Joël Legrand, Michael Auli and Ronan Collobert, in: Proceedings of the ACL 1st Conference on Machine Translation, 2016

Deep Neural Networks for Syntactic Parsing of Morphologically Rich Languages, Joël Legrand and Ronan Collobert, in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition, Marc Ferras, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Hervé Bourlard, in: IEEE Signal Processing Letters, 23(4):527 - 531, 2016

Building Word Embeddings for Solving Natural Language Processing, Rémi Lebret, École Polytechnique Fédérale de Lausanne, 2016

[DOI]

On ANOVA Decompositions of Kernels and Gaussian Random Field Paths, David Ginsbourger, Olivier Roustant, Dominic Schuhmacher, Nicolas Durrande and Nicolas Lenz, in: Monte Carlo and Quasi-Monte Carlo Methods, pages 315-330, Springer International Publishing, 2016

[DOI]

Design of Computer Experiments Using Competing Distances Between Set-Valued Inputs, David Ginsbourger, Jean Baccou, Clément Chevalier and Frédéric Perales, in: mODa 11 - Advances in Model-Oriented Design and Analysis, pages 123-131, Springer International Publishing, 2016

[DOI]

End-to-End Acoustic Modeling using Convolutional Neural Networks for Automatic Speech Recognition, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-18-2016

Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, in: Interspeech, 2016

HMM-based Non-native Accent Assessment using Posterior Features, Ramya Rasipuram, Milos Cernak and Mathew Magimai-Doss, in: Proceedings of Interspeech, San Francisco, USA, 2016

When Naïve Bayes Nearest Neighbors Meet Convolutional Neural Networks, Ilja Kuzborskij, Fabio M. Carlucci and Barbara Caputo, in: Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, 2016

Pronoun Language Model and Grammatical Heuristics for Aiding Pronoun Prediction, Ngoc-Quang Luong and Andrei Popescu-Belis, in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, ACL, 2016

Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet and Philip N. Garner, Idiap-RR-22-2016

Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery, Marzieh Razavi and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2016

Modeling Unvoiced Sounds In Statistical Parametric Speech Synthesis with a Continuous Vocoder, Tamas Gabor Csapo, Geza Nemeth, Milos Cernak and Philip N. Garner, in: Proc. of EUSIPCO, Budapest, Hungary, 2016

PhonVoc: A Phonetic and Phonological Vocoding Toolkit, Milos Cernak and Philip N. Garner, in: Interspeech, San Francisco, USA, 2016

Sound Pattern Matching for Automatic Prosodic Event Detection, Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner and Hervé Bourlard, in: Interspeech, San Francisco, USA, 2016

Improving Pronoun Translation by Modeling Coreference Uncertainty, Ngoc-Quang Luong and Andrei Popescu-Belis, in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, 2016

The SIWIS database: a multilingual speech database with acted emphasis, Jean-Philippe Goldman, Pierre-Edouard Honnet, Rob Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli and Junichi Yamagishi, in: Proceedings of Interspeech, San Francisco, USA, pages 1532--1535, 2016

[DOI]

Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, Alexandros Lazaridis, Milos Cernak and Philip N. Garner, in: Proceedings of Interspeech, San Francisco, USA, 2016

Simultaneous temporal superresolution and denoising for cardiac fluorescence microscopy, Kevin G. Chan, Sebastian J. Streichan, Le A. Trinh and Michael Liebling, in: IEEE Transactions on Computational Imaging, 2016

[DOI]
[URL]

Proceedings of the 16th International Conference on Multimodal Interaction, ICMI 2014, Istanbul, Turkey, November 12-16, 2014., Albert Ali Salah, Jeffrey F. Cohn, Björn Schuller, Oya Aran, Louis-Philippe Morency and Philip R. Cohen, ACM, 2014

Brief Introduction to the Special Issue on Behavior Understanding for Arts and Entertainment, Albert Ali Salah, Hayley Hung, Oya Aran, Hatice Gunes and Matthew Turk, in: ACM Transactions on Interactive Intelligent Systems, 5(2):6, 2015

[DOI]

Rapport with Virtual Agents: What do Human Social Cues and Personality Explain?, Aleksandra Cerekovic, Oya Aran and Daniel Gatica-Perez, in: IEEE Transactions on Affective Computing, 8(3):382-395, 2017

[DOI]

Predicting the Performance in Decision-Making Tasks: From Individual Cues to Group Interaction, Umut Avci and Oya Aran, in: IEEE Transactions on Multimedia, 18(4):643--658, 2016

[DOI]
[URL]

High-slope terrain locomotion for torque-controlled quadruped robots, Michele Focchi, Andrea del Prete, I. Havoutis, Roy Featherstone, D. G. Caldwell and Claudio Semini, in: Autonomous Robots, 2016

[DOI]
[URL]

Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences, Carlos Mastalli, I. Havoutis, Michele Focchi, Claudio Semini and D. G. Caldwell, in: Proceedings of the IEEE International Conference of Robotics and Automation, 2016

Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, Maryam Habibi, Parvaz Mahdabi and Andrei Popescu-Belis, in: Data & Knowledge Engineering Journal, 2016

Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, Maryam Habibi, Parvaz Mahdabi and Andrei Popescu-Belis, Idiap-RR-16-2016

Towards End-to-End Speech Recognition, Dimitri Palaz, Ecole polytechnique Fédérale de Lausanne, 2016

[DOI]

Importance Sampling Tree for Large-scale Empirical Expectation, Olivier Canévet, Cijo Jose and Francois Fleuret, in: Proceedings of the International Conference on Machine Learning (ICML), New-York, 2016

A Sub-Quadratic Exact Medoid Algorithm, James Newling and Francois Fleuret, Idiap-RR-19-2017

Heterogeneous Face Recognition using Inter-Session Variability Modelling, Tiago de Freitas Pereira and Sébastien Marcel, in: IEEE Computer Society Workshop on Biometrics, Las Vegas - USA, IEEE, 2016

A Contextual Language Model to Improve Machine Translation of Pronouns by Re-ranking Translation Hypotheses, Ngoc-Quang Luong and Andrei Popescu-Belis, in: European Association for Machine Translation, 2016

ISWC 2013--Wearables are Here to Stay, Daniel Roggen, Daniel Gatica-Perez, Masaaki Fukumoto and Kristof van Laerhoven, in: IEEE Pervasive Computing, 13(1):14-18, 2014

[DOI]

SYSTEM FUSION AND SPEAKER LINKING FOR LONGITUDINAL DIARIZATION OF TV SHOWS, Marc Ferras, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5495-5499, IEEE, 2016

Quantifying uncertainties on excursion sets under a Gaussian random field prior, Dario Azzimonti, Julien Bect, Clément Chevalier and David Ginsbourger, in: SIAM/ASA J. Uncertainty Quantification, 4(1):850-874, 2016

[DOI]
[URL]

Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, in: Speech Communication, 80, 2016

[DOI]

Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition, Pranay Dighe, Gil Luyet, Afsaneh Asaei and Hervé Bourlard, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5690-5694, IEEE, 2016

INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, Subhadeep Dey, Srikanth Madikeri and Petr Motlicek, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5580-5584, IEEE, 2016

DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Srikanth Madikeri, Marc Ferras and Petr Motlicek, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5050-5054, IEEE, 2016

Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures, Afsaneh Asaei, Gil Luyet, Milos Cernak and Hervé Bourlard, Idiap-RR-10-2016

Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei and Philip N. Garner, Idiap-RR-11-2016

"Can you hear me now?" --- Automatic assessment of background noise intrusiveness and speech intelligibility in telecommunications, Raphael Ullmann, Sciences et Techniques de l’Ingénieur (STI), 2016

[DOI]

Overview of BTAS 2016 Speaker Anti-spoofing Competition, Pavel Korshunov, Sébastien Marcel, Hannah Muckenhirn, A. R. Gonçalves, A. G. Souza Mello, R. P. Velloso Violato, Flávio Simões, Mário Uliani Neto, Marcus de Assis Angeloni, J. A. Stuchi, H. Dinkel, N. Chen, Yanmin Qian, D. Paul, G. Saha and Md Sahidullah, Idiap-RR-24-2016

[URL]

Joint Operation of Voice Biometrics and Presentation Attack Detection, Pavel Korshunov and Sébastien Marcel, Idiap-RR-25-2016

[URL]

Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph, Chidansh A. Bhatt, Andrei Popescu-Belis and Matthew Cooper, in: Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), ACM, New York, NY, ACM Press, 2016

Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation, Jeevanthi Liyanapathirana and Andrei Popescu-Belis, in: Proceedings of the 10th Language Resources and Evaluation Conference (LREC), Portoroz, Slovenia, 2016

Tracking Interacting Objects Using Intertwined Flows, Xinchao Wang, Engin Turetken, Francois Fleuret and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016

Principled Parallel Mean-Field Inference for Discrete Random Fields, Pierre Baqué, Timur Bagautdinov, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016

The SIWIS database: a multilingual speech database with acted emphasis, Jean-Philippe Goldman, Pierre-Edouard Honnet, Rob Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli and Junichi Yamagishi, Idiap-RR-13-2016

Probabilistic Amplitude Demodulation features in Speech Synthesis for Improving Prosody, Alexandros Lazaridis, Milos Cernak and Philip N. Garner, Idiap-RR-12-2016

On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, Milos Cernak, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-07-2016

[URL]

Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-06-2016

Low-Rank Representation For Enhanced Deep Neural Network Acoustic Models, Gil Luyet, Idiap-RR-05-2016

Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, Gil Luyet, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-04-2016

Sound Pattern Matching for Automatic Prosodic Event Detection, Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner and Hervé Bourlard, Idiap-RR-03-2016

Multilingual Visual Sentiment Concept Matching, Nikolaos Pappas, Mercan Topkara, Miriam Redi, Brendan Jou, Tao Chen, Hongyi Liu and Shih-Fu Chang, in: Proceedings of the International Conference on Multimedia Retrieval (ICMR), 2016

Large Scale Hard Sample Mining with Monte Carlo Tree Search, Olivier Canévet and Francois Fleuret, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016

Deep Dynamic Neural Networks for Multimodal Gesture Segmentation and Recognition, Di Wu, Lionel Pigou, Pieter-Jan Kindermans, Nam Le, Ling Shao, Joni Dambre and Jean-Marc Odobez, in: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016

Wiki-LDA: A Mixed-Method Approach for Effective Interest Mining on Twitter Data, Xiao Pu, Mohamed Amine Chatti, Hendrik Thues and Ulrik Schroeder, in: Proceedings of CSEDU 2016, 2016

Learning Robot Manipulation Tasks with Task-Parameterized Semi-Tied Hidden Semi-Markov Model, Ajay Kumar Tanwani and Sylvain Calinon, in: IEEE Robotics and Automation Letters, 1(1):235-242, 2016

[DOI]
[URL]

Learning Explainable User Sentiment and Preferences for Information Filtering, Nikolaos Pappas, École Polytechnique Fédérale de Lausanne, 2016

[DOI]

A Point-Spread-Function-Aware Filtered Backprojection Algorithm for Focal-Plane-Scanning Optical Projection Tomography, Kevin G. Chan and Michael Liebling, in: 2016 IEEE International Symposium on Biomedical Imaging, 2016

An Analysis of Rhythmic Staccato-Vocalization Based on Frequency Demodulation for Laughter Detection in Conversational Meetings, Sucheta Ghosh, Milos Cernak, Sarbani Palit and B. B. Chaudhuri, Idiap-RR-02-2016

On Learning Grapheme-to-Phoneme Relationships through the Acoustic Speech Signal, Mathew Magimai-Doss and Ramya Rasipuram, in: The Phonetician, 109–110:6-23, 2014

Global Optimization with Sparse and Local Gaussian Process Models, Tipaluck Krityakierne and David Ginsbourger, in: Machine Learning, Optimization, and Big Data, pages 185-196, Springer International Publishing, 2015

[DOI]

Differentiating the Multipoint Expected Improvement for Optimal Batch Design, Sébastien Marmin, Clément Chevalier and David Ginsbourger, in: Machine Learning, Optimization, and Big Data, pages 37-48, Springer International Publishing, 2015

[DOI]

Sparse Subspace Modeling for Query by Example Spoken Term Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-01-2016

Trustworthy Biometric Verification under Spoofing Attacks: Application to the Face Mode, Ivana Chingovska, École Polytechnique Fédérale de Lausanne, 2015

[URL]

Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT 2015), Bonnie Webber, Marine Carpuat, Andrei Popescu-Belis and Christian Hardmeier, Association for Computational Linguistics, 2015

[URL]

Klewel Webcast: from Research to Growing Company, Maël Guillemot, Jean-Marc Odobez, Alessandro Vinciarelli and Sandy Ingram, in: IEEE Multimedia, 22(4):94-99, 2015

Computer vision profiling of neurite outgrowth dynamics reveals spatio-temporal modularity of Rho GTPase signaling, L. Fusco, Riwal Lefort, Kevin C. Smith, F. Benmansour, German Gonzalez, Caterina Barilari, Bernd Rinn, Francois Fleuret, Pascal Fua and O. Pertz, in: Journal of Cell Biology, 212(1):91-111, 2016

[DOI]

Combining dynamic head pose-gaze mapping with the robot conversational state for attention recognition in human-robot interactions, Samira Sheikhi and Jean-Marc Odobez, in: Pattern Recognition Letters, 66:81-90, 2015

Integration of Real-Time Speech Processing Technologies for Online Gaming, Dhananjay Ram, Petr Motlicek and Blaise Potard, Idiap-Com-01-2016

Transfer Learning through Greedy Subset Selection, Ilja Kuzborskij, Francesco Orabona and Barbara Caputo, in: Image Analysis and Processing - ICIAP 2015, Genoa, Italy, pages 3-14, Springer International Publishing, 2015

[DOI]

Robot Learning with Task-Parameterized Generative Models, Sylvain Calinon, in: Proc. Intl Symp. on Robotics Research, 2015

Learned Minimal Intervention Control Synthesis based on Hidden Semi-Markov Models, M. Zeestraten, Sylvain Calinon and D. G. Caldwell, in: Proc. of the 8th Intl Workshop on Human-Friendly Robotics, pages 17, 2015

Jointly Informative Feature Selection, Leonidas Lefakis and Francois Fleuret, in: Journal of Machine Learning Research, 2016

Kullback-Leibler Proximal Variational Inference, Emtiyaz Khan, Pierre Baqué, Francois Fleuret and Pascal Fua, in: Proceedings of the international conference on Neural Information Processing Systems, pages 3402-3410, 2015

Loud and Trendy: Crowdsourcing Impressions of Social Ambiance in Popular Indoor Urban Places, Darshan Santani and Daniel Gatica-Perez, in: Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia, pages 211-220, ACM, 2015

[DOI]
[URL]

CommuniSense: Crowdsourcing Road Hazards in Nairobi, Darshan Santani, Jidraph Njuguna, Tierra Bills, Aisha W. Bryant, Reginald Bryant, Jonathan Ledgard and Daniel Gatica-Perez, in: Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services, Copenhagen, Denmark, pages 445-456, ACM, 2015

[DOI]
[URL]

Looking at Cities in Mexico with Crowds, Darshan Santani, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: Proceedings of the 2015 Annual Symposium on Computing for Development, London, United Kingdom, pages 127-135, ACM, 2015

[DOI]
[URL]

Learning to Segments Objects Candidates, Pedro H. O. Pinheiro, Ronan Collobert and Piotr Dollar, in: Advances in Neural Information Processing Systems, Montreal, Canada, pages 1990-1998, Curran Associates, Inc., 2015

[URL]

Modeling Users’ Information Needs in a Document Recommender for Meetings, Maryam Habibi, EPFL, 2015

On degeneracy and invariances of random fields paths with applications in Gaussian process modelling, David Ginsbourger, Olivier Roustant and Nicolas Durrande, in: Journal of Statistical Planning and Inference, 170:117-128, 2016

[DOI]

Evaluating Shape Representations for Maya Glyph Classification, Gulcan Can, Jean-Marc Odobez and Daniel Gatica-Perez, in: ACM Journal on Computing and Cultural Heritage (JOCCH), 9(3), 2016

Ancient Maya Writings as High-Dimensional Data: a Visualization Approach, Gulcan Can, Jean-Marc Odobez, Carlos Pallan Gayol and Daniel Gatica-Perez, in: Digital Humanities (DH), Krakow, 2016

On Compressibility of Neural Network phonological Features for Low Bit Rate Speech Coding, Afsaneh Asaei, Milos Cernak and Hervé Bourlard, in: Proceeding of Interspeech, pages 418-422, ISCA, 2015

Intonation atom based emphasis transfer, Pierre-Edouard Honnet and Philip N. Garner, Idiap-RR-14-2016

TDOA Matrices: Algebraic Properties and their Application to Robust Denoising with Missing Data, Jose Velasco, Daniel Pizarro, Javier Macias-Guarasa and Afsaneh Asaei, in: IEEE Transactions on Signal Processing, 64(20):5242-5254, 2016

[DOI]
[URL]

INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, Subhadeep Dey, Srikanth Madikeri and Petr Motlicek, Idiap-RR-09-2016

DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Srikanth Madikeri, Marc Ferras and Petr Motlicek, Idiap-RR-08-2016

Pronunciation Lexicon Development for Under-Resourced Languages Using Automatically Derived Subword Units: A Case Study on Scottish Gaelic, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, in: 4th Biennial Workshop on Less-Resourced Languages, 2015

International Conference on Mobile and Ubiquitous Multimedia, Gilberto Chávez-Martínez, Salvador Ruiz-Correa and Daniel Gatica-Perez, in: Happy and Agreeable? Multi-Label Classification of Impressions in Social Video, Linz, Austria, pages 109-120, ACM, 2015

[DOI]
[URL]

Towards Multiple Pronunciation Generation in Acoustic G2P Conversion Framework, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-34-2015

3D Gaze Estimation from Remote RGB-D Sensors, Kenneth Alberto Funes Mora, École Polytechnique Fédérale de Lausanne, 2015

[DOI]

Head Nod Detection from a Full 3D Model, Yiqiang Chen, Yu Yu and Jean-Marc Odobez, in: Proceedings of the ICCV 2015, pages 528-536, 2015

Posterior-Based Multi-Stream Formulation To Combine Multiple Grapheme-to-Phoneme Conversion Techniques, Marzieh Razavi and Mathew Magimai-Doss, Idiap-RR-33-2015

HMM-based Non-native Accent Assessment using Posterior Features, Ramya Rasipuram, Milos Cernak and Mathew Magimai-Doss, Idiap-RR-32-2015

Predicting the intrusiveness of noise through sparse coding with auditory kernels, Raphael Ullmann and Hervé Bourlard, in: Speech Communication, 76:186-200, 2016

[DOI]
[URL]

Fast K-Means with Accurate Bounds, James Newling and Francois Fleuret, Idiap-RR-17-2016

A Tutorial on Task-Parameterized Movement Learning and Retrieval, Sylvain Calinon, in: Intelligent Service Robotics, 9(1):1-29, 2016

[DOI]
[URL]

Towards utterance-based neural network adaptation in acoustic modeling, Ivan Himawan, Petr Motlicek, Marc Ferras and Srikanth Madikeri, in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015

Gaze Estimation in the 3D Space Using RGB-D sensors. Towards Head-Pose And User Invariance., Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: International Journal of Computer Vision, 118(2):194-216, 2016

[DOI]
[URL]

Deciphering the Silent Participant. On the Use of Audio-Visual Cues for the Classification of Listener Categories in Group Discussions, Catharine Oertel, Kenneth Alberto Funes Mora, Joakim Gustafson and Jean-Marc Odobez, in: Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, ACM, Seattle, Washington, USA, pages 107--114, ACM, 2015

[DOI]

Within- and Cross- Database Evaluations for Gender Classification via BeFIT Protocols, Nesli Erdogmus, Matthias Vanoni and Sébastien Marcel, in: International Workshop on Multimedia Signal Processing, pages 1-6, 2014

[DOI]
[URL]

Palm Vein Database and Experimental Framework for Reproducible Research, Pedro Tome and Sébastien Marcel, in: IEEE International Conference of the Biometrics Special Interest Group, pages 1-7, 2015

[DOI]
[URL]

Finger vein Liveness Detection Using Motion Magnification, Ramachandra Raghavendra, Manasa Avinas, Christoph Busch and Sébastien Marcel, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-7, 2015

[DOI]

Personality Trait Classification via Co-Occurrent Multiparty Multimodal Event Discovery, Shogo Okada, Oya Aran and Daniel Gatica-Perez, in: Proceedings of the ACM International Conference on Multimodal Interaction, Seattle, Washington, USA, pages 15-22, ACM, 2015

[DOI]

Learning bimanual end-effector poses from demonstrations using task-parameterized dynamical systems, J. Silverio, L. Rozo, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 464-470, 2015

Learning Optimal Controllers in Human-robot Cooperative Transportation Tasks with Position and Force Constraints, L. Rozo, D. Bruno, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 1024-1030, 2015

Learning the Stiffness of a Continuous Soft Manipulator from Multiple Demonstrations, D. Bruno, Sylvain Calinon, M. S. Malekzadeh and D. G. Caldwell, in: Intelligent Robotics and Applications, pages 185-195, Springer, 2015

[DOI]
[URL]

Overlapping Speech, Utterance Duration and Affective Content in HHI and HCI - an Comparison, Ingo Siegert, Ronald Boeck, Bogdan Vlasenko, Kerstin Ohnemus and Andreas Wendemuth, in: 6th IEEE Conference on Cognitive Infocommunications, Gyor, pages 83-88, 2015

[DOI]

Enabling speech applications using Ad-Hoc Microphone Arrays, Mohammad J. Taghizadeh, École Polytechnique Fédérale de Lausanne, 2015

Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization, Afsaneh Asaei, Mohammad J. Taghizadeh, Saeid Haghighatshoar, Bhiksha Raj, Hervé Bourlard and Volkan Cevher, in: IEEE Transactions on Signal Processing, 64(3):567-579, 2016

[DOI]

Adaptive Sentiment-Aware One-Class Collaborative Filtering, Nikolaos Pappas and Andrei Popescu-Belis, in: Expert Systems with Applications, 43:23-41, 2016

[DOI]
[URL]

Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology, Brendan Jou, Tao Chen, Nikolaos Pappas, Miriam Redi, Mercan Topkara and Shih-Fu Chang, in: Proceedings of the ACM International Conference on Multimedia, pages 159--168, 2015

Statistical Models in Automatic Speech Recognition, Sandrine Revaz, University of Fribourg, Department of Mathematics, 2015

Exploring Dataset Similarities using PCA-based Feature Selection, Ingo Siegert, Ronald Boeck, Bogdan Vlasenko and Andreas Wendemuth, in: Proceedings of ACII, Xi'an, pages 387-393, IEEE, 2015

[DOI]

Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies (Extended Abstract), Björn Schuller, Bogdan Vlasenko, Florian Eyben, Martin Wöllmer, André Stuhlsatz, Andreas Wendemuth and Gerhard Rigoll, in: Proceedings of ACII, Xi'an, pages 470-476, IEEE, 2015

[DOI]

Annotators' agreement and spontaneous emotion classification performance, Bogdan Vlasenko and Andreas Wendemuth, in: Proceedings of Interspeech, pages 1546-1550, 2015

Pronoun Translation and Prediction with or without Coreference Links, Ngoc-Quang Luong, Lesly Miculicich and Andrei Popescu-Belis, in: Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), Lisbon, Portugal, pages 94–100, 2015

Spatial Sound Localization via Multipath Euclidean Distance Matrix Recovery, Mohammad J. Taghizadeh, Afsaneh Asaei, Saeid Haghighatshoar, Philip N. Garner and Hervé Bourlard, in: IEEE Journal of Selected Topics in Signal Processing, 9(5):802-814, 2015

Computational Methods for Underdetermined Convolutive Speech Localization and Separation via Model-based Sparse Component Analysis, Afsaneh Asaei, Hervé Bourlard, Mohammad J. Taghizadeh and Volkan Cevher, in: Speech Communication, 76:201-217, 2016

Automatic social role recognition and its application in structuring multiparty interactions, A. Sapru, EPFL, 2015

Articulatory feature based continuous speech recognition using probabilistic lexical modeling, Ramya Rasipuram and Mathew Magimai-Doss, in: Computer Speech and Language, 36:233-259, 2016

[DOI]

HAVC-II - Idiap Private Cloud (Technical Inside-Out), Cédric Dufour, Idiap-Com-01-2015

On the Vulnerability of Speaker Verification to Realistic Voice Spoofing, Serife Kucur Ergunay, Elie Khoury, Alexandros Lazaridis and Sébastien Marcel, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-8, IEEE, 2015

[DOI]
[URL]

Exploiting foreign resources for DNN-based ASR, Petr Motlicek, David Imseng, Blaise Potard, Philip N. Garner and Ivan Himawan, in: EURASIP Journal on Audio, Speech, and Music Processing(2015:17), 2015

[DOI]

Syntactic Parsing of Morphologically Rich Languages Using Deep Neural Networks, Joël Legrand and Ronan Collobert, Idiap-RR-25-2015

Joint RNN-Based Greedy Parsing and Word Composition, Joël Legrand and Ronan Collobert, in: Proceedings of ICLR 2015, 2015

Analysis of CNN-based Speech Recognition System using Raw Speech as Input, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, in: Proceedings of Interspeech, ISCA, Dresden, pages 11-15, ISCA, 2015

Face Recognition Systems Under Spoofing Attacks, Ivana Chingovska, Nesli Erdogmus, André Anjos and Sébastien Marcel, Idiap-RR-18-2020

A Deeper Look at Dataset Bias, Tatiana Tommasi, Novi Patricia, Barbara Caputo and Tinne Tuytelaars, in: German Conference on Pattern Recognition, Aachen, Germany, pages 504–516, Springer International Publishing, 2015

[DOI]

Rewards-driven control of robot arm by decoding EEG signals, Ajay Kumar Tanwani, José del R. Millán and Aude Billard, in: Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE, pages 1658-1661, IEEE, 2014

[DOI]
[URL]

Simple Image Description Generator via a Linear Phrase-based Model, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-22-2015

Transfer in Inverse Reinforcement Learning for Multiple Strategies, Ajay Kumar Tanwani and Aude Billard, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan, pages 3244-3250, IEEE, 2013

[DOI]
[URL]

Autonomous reinforcement learning with experience replay, P. Wawrzyński and Ajay Kumar Tanwani, in: Neural Networks, 41:156 - 167, 2013

[DOI]
[URL]

"The Sum of Its Parts": Joint Learning of Word and Phrase Representations with Autoencoders, Rémi Lebret and Ronan Collobert, Idiap-RR-21-2015

Rehabilitation of Count-based Models for Word Vector Representations, Rémi Lebret and Ronan Collobert, in: Computational Linguistics and Intelligent Text Processing, pages 417-429, Springer International Publishing, 2015

From Image-level to Pixel-level Labeling with Convolutional Networks, Pedro H. O. Pinheiro and Ronan Collobert, in: Computer Vision and Patter Recognition (CVPR), Boston, MA, pages 1713-1721, IEEE, 2015

[DOI]
[URL]

Phrase-based Image Captioning, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, in: International Conference on Machine Learning (ICML), Lille, France, pages 2085–2094, JMLR, 2015

[URL]

DexROV: Dexterous Undersea Inspection and Maintenance in Presence of Communication Latencies, J. Gancet, D. Urbina, P. Letier, M. Ilzokvitz, P. Weiss, F. Gauch, G. Antonelli, G. Indiveri, G. Casalino, A. Birk, M. F. Pfingsthorn, Sylvain Calinon, Ajay Kumar Tanwani, A. Turetta, C. Walen and L. Guilpain, in: IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV), pages 218-223, 2015

Dictionary Learning for Sparse Representation of Neural Network Exemplars in Speech Recognition, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015, pages 1093, 2015

Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-Based Speech Recognition, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015

DNN-based Speech Synthesis: Importance of input features and training data, Alexandros Lazaridis, Blaise Potard and Philip N. Garner, in: International Conference on Speech and Computer , SPECOM, pages 193-200, Springer Berlin Heidelberg, 2015

[DOI]

Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, Srikanth Madikeri, Ivan Himawan, Petr Motlicek and Marc Ferras, Idiap-RR-20-2015

KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of ICASSP 2015, pages 4435-4439, 2015

COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of ICASSP 2015, pages 4834-4837, 2015

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015

[URL]

Exploiting foreign resources for DNN-based ASR, Petr Motlicek, David Imseng, Blaise Potard, Philip N. Garner and Ivan Himawan, Idiap-RR-27-2015

Sparse Modeling of Posterior Exemplars for Keyword Detection, Dhananjay Ram, Afsaneh Asaei, Pranay Dighe and Hervé Bourlard, in: Proceedings of Interspeech, pages 3690-3694, 2015

Weighted Correlation based Atom Decomposition Intonation Modelling, Branislav Gerazov, Pierre-Edouard Honnet, Aleksandar Gjoreski and Philip N. Garner, in: Proceedings of Interspeech, Dresden, Germany, pages 1601--1605, 2015

Automatic Recognition of Emergent Social Roles in Small Group Interactions, A. Sapru and Hervé Bourlard, in: Multimedia, IEEE Transactions, 17(5):746 - 760, 2015

[DOI]

Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition, Ivan Himawan, Petr Motlicek, Sridha Sridharan, David Dean and Dian Tjondronegoro, in: Proceedings of Interspeech, pages 741-745, 2015

Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, Alexandre Hyafil and Milos Cernak, in: Proc. of Interspeech, Dresden, Germany, pages 1191-1195, ISCA, 2015

Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, Alexandre Hyafil and Milos Cernak, Idiap-RR-14-2015

Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, Ramya Rasipuram, Milos Cernak, Alexandre Nanchen and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2015

Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, Ramya Rasipuram, Milos Cernak, Alexandre Nanchen and Mathew Magimai-Doss, Idiap-RR-12-2015

Computational Analysis Of Behavior In Employment Interviews And Video Resumes, Laurent Son Nguyen, École Polytechnique Fédérale de Lausanne, 2015

Probability Occupancy Maps for Occluded Depth Images, Timur Bagautdinov, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2829-2837, 2015

Machine learning-based tools to model and to remove the off-target effect, Riwal Lefort, L. Fusco, O. Pertz and Francois Fleuret, in: Pattern Analysis and Applications, 20(1):87-100, 2017

[DOI]

Adaptive relevance feedback for large-scale image retrieval, Nicolae Suditu and Francois Fleuret, in: Multimedia Tools and Applications, 75(12):6777-6807, 2016

[DOI]

Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-based Speech Recognition, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, in: Speech Communication: Special Issue on Advances in Sparse Modeling and Low-rank Modeling for Speech Processing, 76:230–244, 2016

[DOI]

Dynamic structure and protein expression of the live embryonic heart captured by 2-photon light sheet microscopy and retrospective registration, Vikas Trivedi, Thai V. Truong, Le A. Trinh, Daniel B. Holland, Michael Liebling and Scott E. Fraser, in: Biomedical Optics Express, 6(6):2056-2066, 2015

[DOI]
[URL]

An Empirical Model of Emphatic Word Detection, Milos Cernak and Pierre-Edouard Honnet, in: Proc. of Interspeech, Dresden, Germany, pages 573-577, ISCA, 2015

Speaker diarization of spontaneous meeting room conversations, Sree Harsha Yella, EPFL, 2015

Acoustic Data-Driven Grapheme-to-Phoneme Conversion in the Probabilistic Lexical Modeling Framework, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-10-2015

Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, Xiao Pu, Laura Mascarell, Andrei Popescu-Belis, Mark Fishel, Ngoc-Quang Luong and Martin Volk, Idiap-RR-09-2015

Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, Xiao Pu, Laura Mascarell, Andrei Popescu-Belis, Mark Fishel, Ngoc-Quang Luong and Martin Volk, in: Proceedings of the ACL-IJCNLP 2015 Student Research Workshop, Beijing, China, pages 8-15, 2015

Disambiguating Discourse Connectives for Statistical Machine Translation, Thomas Meyer, Najeh Hajlaoui and Andrei Popescu-Belis, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(7):1184-1197, 2015

[DOI]

In the Mood for Vlog: Multimodal Inference in Conversational Social Video, Dairazalia Sanchez-Cortes, Shiro Kumano, Kazuhiro Otsuka and Daniel Gatica-Perez, in: ACM Transactions on Interactive Intelligent Systems, 5(2), 2015

[DOI]

Speech vocoding for laboratory phonology, Milos Cernak, Štefan Beňuš and Alexandros Lazaridis, Idiap-RR-07-2015

Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition; Comparison with the Envelope-Variance Measure, Ivan Himawan, Petr Motlicek, Sridha Sridharan, David Dean and Dian Tjondronegoro, Idiap-RR-30-2015

Learning Feature Mapping using Deep Neural Network Bottleneck Features for Distant Large Vocabulary Speech Recognition, Ivan Himawan, Petr Motlicek, David Imseng, Blaise Potard, Namhoon Kim and Jaewon Lee, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4540-4544, 2015

[DOI]

Joint Speaker Verification and Anti-Spoofing in the i-Vector Space, Aleksandr Sizov, Elie Khoury, Tomi Kinnunen, Zhizheng Wu and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 10(4):821-832, 2015

[DOI]

Gender Classification by LUT based boosting of Overlapping Block Patterns, Rakesh Metha, Manuel Günther and Sébastien Marcel, in: Scandinavian Conference on Image Analysis, pages 530-542, Springer International Publishing, 2015

[DOI]
[URL]

An Empirical Model of Emphatic Word Detection, Milos Cernak and Pierre-Edouard Honnet, Idiap-RR-11-2015

Reconstruction of Images from Gabor Graphs with Applications in Facial Image Processing, Manuel Günther, Stefan Böhringer, Dagmar Wieczorek and Rolf P. Würtz, in: Journal of Wavelets, Multiresolution and Information Processing, 13(4):25, 2015

[DOI]

Incremental Syllable-Context Phonetic Vocoding, Milos Cernak, Philip N. Garner, Alexandros Lazaridis, Petr Motlicek and Xingyu Na, in: IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 23(6), 2015

[URL]

Learning linearly separable features for speech recognition using convolutional neural networks, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-24-2015

[URL]

Analysis of CNN-based Speech Recognition System using Raw Speech as Input, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-23-2015

On the Application of Automatic Subword Unit Derivation and Pronunciation Generation for Under-Resourced Language ASR: A Study on Scottish Gaelic, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-13-2015

Query Refinement Using Conversational Context: a Method and an Evaluation Resource, Maryam Habibi and Andrei Popescu-Belis, in: Proceedings of NLDB 2015 (20th International Conference on Applications of Natural Language to Information Systems), Passau, Germany, pages 89-102, Springer-Verlag Berlin, 2015

[DOI]

Integrated Pronunciation Learning for Automatic Speech Recognition Using Probabilistic Lexical Modeling, Ramya Rasipuram, Marzieh Razavi and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech and Signal Processing, South Brisbane, QLD, pages 5176-5180, 2015

[DOI]

An HMM-Based Formalism for Automatic Subword Unit Derivation and Pronunciation Generation, Marzieh Razavi and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech and Signal Processing, pages 4639-4643, IEEE, 2015

[DOI]

Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, in: International Conference on Acoustics, Speech and Signal Procecssing, IEEE, South Brisbane, QLD, pages 4295 - 4299, IEEE, 2015

On the Vulnerability of Palm Vein Recognition to Spoofing Attacks, Pedro Tome and Sébastien Marcel, in: The 8th IAPR International Conference on Biometrics (ICB), pages 319 - 325, 2015

[DOI]
[URL]

The 1st Competition on Counter Measures to Finger Vein Spoofing Attacks, Pedro Tome, Ramachandra Raghavendra, Christoph Busch, Santosh Tirunagari, Norman Poh, B. H. Shekar, Diego Gragnaniello, Carlo Sansone, Luisa Verdoliva and Sébastien Marcel, in: The 8th IAPR International Conference on Biometrics (ICB), pages 513 - 518, 2015

[DOI]
[URL]

A New Identity for the Least-square Solution of Overdetermined Set of Linear Equations, Saeid Haghighatshoar, Mohammad J. Taghizadeh and Afsaneh Asaei, Idiap-RR-35-2015

Analysis of Small Groups, Daniel Gatica-Perez, Oya Aran and Dinesh Babu Jayagopi, in: Social Signal Processing, pages 349-367, Cambridge University Press. Editors J. Burgoon, N. Magnenat-Thalmann, M. Pantic, and A. Vinciarelli, 2017

[DOI]

Twitter Sentiment Analysis (Almost) from Scratch, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-15-2016

N-gram-Based Low-Dimensional Representation for Document Classification, Rémi Lebret and Ronan Collobert, in: International Conference on Learning Representations, 2015

[URL]

Phrase-based Image Captioning, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-08-2015

Estimation of Divergence-Free 3D Cardiac Blood Flow in a Zebrafish Larva Using Multi-View Microscopy, Kevin G. Chan and Michael Liebling, in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, IEEE, Brooklyn, NY, USA, pages 385-388, 2015

[DOI]

Atom Decomposition-based Intonation Modelling, Pierre-Edouard Honnet, Branislav Gerazov and Philip N. Garner, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4744--4748, IEEE, 2015

[DOI]

Intensity-Based Point-Spread-Function-Aware Registration for Multi-View Applications in Optical Microscopy, Nikhil Chacko, Kevin G. Chan and Michael Liebling, in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, pages 306-309, IEEE, 2015

[DOI]

Phonological Vocoding Using Artificial Neural Networks, Milos Cernak, Blaise Potard and Philip N. Garner, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4844-4848, IEEE, 2015

[DOI]

On the use of client identity information for face anti-spoofing, Ivana Chingovska and André Anjos, in: IEEE Transactions on Information Forensics and Security, Special Issue on Biometric Anti-spoofing, 10(4):787-796, 2015

On Application Of Non-Negative Matrix Factorization for Ad Hoc Microphone Array Calibration from Incomplete Noisy Distances, Afsaneh Asaei, Nasser Mohammadiha, Mohammad J. Taghizadeh, Simon Doclo and Hervé Bourlard, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2694-2698, IEEE, 2015

[DOI]

Novel GCC-PHAT Model in Diffuse Sound Field for Microphone Array Pairwise Distance Based Calibration, Jose Velasco, Mohammad J. Taghizadeh, Afsaneh Asaei, Hervé Bourlard, Carlos J. Martín-Arguedas, Javier Macias-Guarasa and Daniel Pizarro, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2669-2673, 2015

Robust Microphone Placement for Source Localization from Noisy Distance Measurements, Mohammad J. Taghizadeh, Saeid Haghighatshoar, Afsaneh Asaei, Philip N. Garner and Hervé Bourlard, in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2579-2583, IEEE, 2015

[DOI]

Building context-dependent DNN acoustic models using Kullback-Leibler divergence-based state tying, Gábor Gosztolya, Tamás Grósz, László Tóth and David Imseng, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2015

What Your Face Vlogs About: Expressions of Emotion and Big-Five Traits Impressions in YouTube, Lucia Teijeiro-Mosquera, Joan-Isaac Biel, Jose Luis Alba-Castro and Daniel Gatica-Perez, in: IEEE Transactions Affective Computing, 2014

The Workshop on Computational Personality Recognition 2014, Fabio Celli, Bruno Lepri, Joan-Isaac Biel, Giuseppe Riccardi, Daniel Gatica-Perez and Fabio Pianesi, in: Proceedings of the ACM International Conference on Multimedia, 2014

Mining Crowdsourced First Impressions in Online Social Video, Joan-Isaac Biel and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 16(7), 2014

Preliminary Work on Speaker Adaptation for DNN-Based Speech Synthesis, Blaise Potard, Petr Motlicek and David Imseng, Idiap-RR-02-2015

Automatic Blinking Detection towards Stress Discovery, Alvaro Marcos-Ramiro, Daniel Pizarro-Perez, Marta Marron-Romera and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 307-310, ACM New York, 2014

[DOI]

Capturing Upper Body Motion in Conversation: an Appearance Quasi-Invariant Approach, Alvaro Marcos-Ramiro, Daniel Pizarro-Perez, Marta Marron-Romera and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 327-334, ACM New York, 2014

[DOI]

Signal Processing in the Workplace, Daniel Gatica-Perez, in: IEEE Signal Processing Magazine, 32(1):121-125, 2015

Leveraging Colour Segmentation for Upper-Body Detection, Stefan Duffner and Jean-Marc Odobez, in: Pattern Recognition, 47(6):2222-2230, 2014

Detecting and Labeling Speakers on Overlapping Speech using Vector Taylor Series, Pranay Dighe, Marc Ferras and Hervé Bourlard, in: INTERSPEECH, 2014

Multi-source Posteriors for Speech Activity Detection on Public Talks, Marc Ferras and Hervé Bourlard, in: INTERSPEECH, 2014

Diarizing Large Corpora using Multi-modal Speaker Linking, Marc Ferras, Stefano Masneri, Oliver Schreer and Hervé Bourlard, in: INTERSPEECH 2014, 2014

LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images, Adrian Penate-Sanchez, Francesc Moreno-Noguer, Juan Andrade-Cetto and Francois Fleuret, in: Proceedings of the International Conference on 3D vision, pages 517–524, 2014

Tracking Interacting Objects Optimally Using Integer Programming, Xinchao Wang, Engin Turetken, Francois Fleuret and Pascal Fua, in: Proceedings of the European Conference on Computer Vision, pages 17-32, 2014

Phoneme Background Model for Information Bottleneck based Speaker Diarization, Sree Harsha Yella, Petr Motlicek and Hervé Bourlard, in: Interspeech, Singapore, 2014

Artificial neural network features for speaker diarization, Sree Harsha Yella, Andreas Stolcke and Malcolm Slaney, in: IEEE Spoken Language Technology workshop, South Lake Tahoe, USA, 2014

Overlapping speech detection using long-term conversational features for speaker diarization in meeting room conversations., Sree Harsha Yella and Hervé Bourlard, in: Audio, Speech and Language processing, IEEE/ACM Transaction on, 22(12):1688-1700, 2014

Evaluation Databases, Stan Z.Li, Javier Galbally, André Anjos and Sébastien Marcel, in: Handbook of Biometric Anti-Spoofing, pages 247-278, Springer-Verlag, 2014

[DOI]

LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images., Adrian Penate-Sanchez, Francesc Moreno-Noguer, Juan Andrade-Cetto and Francois Fleuret, Idiap-RR-22-2014

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, Idiap-RR-16-2015

Development of Bilingual ASR System for MediaParl Corpus, Petr Motlicek, David Imseng, Milos Cernak and Namhoon Kim, in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014

Development of Bilingual ASR System for MediaParl Corpus, Petr Motlicek, David Imseng, Milos Cernak and Namhoon Kim, Idiap-RR-21-2014

Sample Distillation for Object Detection and Image Classification, Olivier Canévet, Leonidas Lefakis and Francois Fleuret, in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014

Efficient Sample Mining for Object Detection, Olivier Canévet and Francois Fleuret, in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014

Keyword Extraction and Clustering for Document Recommendation in Conversations, Maryam Habibi and Andrei Popescu-Belis, in: IEEE/ACM Transactions on Audio Speech and Language Processing, 23(4):746 - 759, 2015

[DOI]

Otomatik İşaret Dili Tanıma ve Türk İşaret Dili için Bilgisayar Uygulamaları, Oya Aran, Ismail Ari, Alp Kindiroglu, Pinar Santemiz and Lale Akarun, in: Ellerle Konusmak: Turk Isaret Dili Arastirmalari / Research on Turkish Sign Language, pages 471-498, Koc University Press, 2016

Modeling Annotator Behaviors for Crowd Labeling, Yunus Emre Kara, Gaye Genc, Oya Aran and Lale Akarun, in: Neurocomputing, 160:141–156, 2015

[DOI]

Discourse connectives: theoretical models and empirical validations in humans and computers, Sandrine Zufferey and Andrei Popescu-Belis, in: Papers dedicated to Jacques Moeschler, University of Geneva, 2014

[URL]

ROCKIT: Roadmap for Conversational Interaction Technologies, Steve Renals, Jean Carletta, K Edwards, Hervé Bourlard, Philip N. Garner, Andrei Popescu-Belis, Dietrich Klakow, A Girenko, Volha Petukhova, P Wacker, A Joscelyne, C Kompis, S Aliwell, W Stevens and Y Sabbah, in: Proceedings of the 2014 Workshop on Roadmapping the Future of Multimodal Interaction Research including Business Opportunities and Challenges (RFMIR '14), pages 39-42, ACM, 2014

[DOI]

Syllabic Pitch Tuning for Neutral-to-Emotional Voice Conversion, Lakshmi Saheer, Xingyu Na and Milos Cernak, Idiap-RR-31-2015

Transfer Learning through Greedy Subset Selection, Ilja Kuzborskij, Francesco Orabona and Barbara Caputo, Idiap-RR-26-2015

Incremental Syllable-Context Phonetic Vocoding, Milos Cernak, Philip N. Garner, Alexandros Lazaridis, Petr Motlicek and Xingyu Na, Idiap-RR-05-2015

Phonological vocoding using artificial neural networks, Milos Cernak, Blaise Potard and Philip N. Garner, Idiap-RR-04-2015

Acoustic and Lexical Resource Constrained ASR using Language-Independent Acoustic Model and Language-Dependent Probabilistic Lexical Model, Ramya Rasipuram and Mathew Magimai-Doss, in: Speech Communication, 68:23–40, 2015

[DOI]
[URL]

Impact of Eye Detection Error on Face Recognition Performance, Abhishek Dutta, Manuel Günther, Laurent El Shafey, Sébastien Marcel, Raymond Veldhuis and Luuk Spreeuwers, in: IET Biometrics, 2015

[URL]

A Skill Transfer Approach for Continuum Robots - Imitation of Octopus Reaching Motion with the STIFF-FLOP Robot, M. S. Malekzadeh, Sylvain Calinon, D. Bruno and D. G. Caldwell, in: In Proc. of the AAAI Symp. on Knowledge, Skill, and Behavior Transfer in Autonomous Robots, Arlington, VA, USA, pages 49-52, 2014

[URL]

Skills Learning in Robots by Interaction with Users and Environment, Sylvain Calinon, in: In Proc. of the Intl Conf. on Ubiquitous Robots and Ambient Intelligence (URAI), Kuala Lumpur, Malaysia, pages 161-162, 2014

[URL]

COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, Idiap-RR-17-2015

KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-19-2015

Articulatory Feature based Continuous Speech Recognition using Probabilistic Lexical Modeling, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-19-2014

Who Will Get the Grant ? A Multimodal Corpus for the Analysis of Conversational Behaviours in Group Interviews, Catharine Oertel, Kenneth Alberto Funes Mora, Samira Sheikhi, Jean-Marc Odobez and Joakim Gustafson, in: International Conference on Multimodal Interaction, Understanding and Modeling Multiparty, Multimodal Interactions Workshop, Istanbul, Turkey, ACM, 2014

[DOI]

Joint Phoneme Segmentation Inference and Classification using CRFs, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, in: Global Conference on Signal and Information Processing, Atlanta, GA, pages 587 - 591, IEEE, 2014

[DOI]

Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-18-2014

Learning by Imitation with the STIFF-FLOP Surgical Robot: A Biomimetic Approach Inspired by Octopus Movements, M. S. Malekzadeh, Sylvain Calinon, D. Bruno and D. G. Caldwell, in: Robotics and Biomimetics, 1(13):1-15, 2014

[URL]

Learning adaptive movements from demonstration and self-guided exploration, D. Bruno, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE Intl Conf. on Development and Learning and on Epigenetic Robotics (ICDL-EpiRob), Genoa, Italy, pages 160-165, 2014

Learning Force and Position Constraints in Human-robot Cooperative Transportation, L. Rozo, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE Intl Symposium on Robot and Human Interactive Communication (Ro-Man), Edinburgh, Scotland, UK, pages 619-624, 2014

Saliency-based Representations and Multi-component Classifiers for Visual Scene Recognition, Marco Fornoni, École Polytechnique Fédérale de Lausanne (EPFL), 2014

Emergent Power Hierarchies and Group Performance, Denise Frauendorfer, Marianne Schmid Mast, Dairazalia Sanchez-Cortes and Daniel Gatica-Perez, in: International Journal of Psychology, 50(5):392–396, 2015

[DOI]
[URL]

The MAAYA Project: Multimedia Analysis and Access for Documentation and Decipherment of Maya Epigraphy, Daniel Gatica-Perez, Carlos Pallan Gayol, Stephane Marchand-Maillet, Jean-Marc Odobez, Edgar Roman-Rangel, Guido Krempel and Nikolai Grube, in: Proc. Digital Humanities Conference, Lausanne, 2014

Is That a Jaguar? Segmenting Ancient Maya Glyphs via Crowdsourcing, Gulcan Can, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proc. ACM Int. Workshop on Crowdsourcing for Multimedia, Orlando, pages 37-40, ACM New York, 2014

[DOI]

Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array, Weifeng Li, Longbiao Wang, Yicong Zhou, John Dines, Mathew Magimai-Doss, Hervé Bourlard and Qingmin Liao, Idiap-RR-17-2014

Grapheme-based Automatic Speech Recognition using Probabilistic Lexical Modeling, Ramya Rasipuram, École polytechnique fédérale de Lausanne, 2014

[DOI]

A Probabilistic Kernel Method for Human Mobility Prediction with Smartphones, Trinh-Minh-Tri Do, O. Dousse, Markus Miettinen and Daniel Gatica-Perez, in: Pervasive and Mobile Computing, 2014

The SP2 SCOPES Project on Speech Prosody, Gyorgy Szaszak, Tamas Gabor Csapo, Philip N. Garner, Branislav Gerazov, Zoran Ivanovski, Geza Nemeth, Balint Toth, Milan Secujski and Vlado Delic, in: DOGS2014 - Digital speech and image processing, 2014

Automatic Speech Recognition and Translation of a Swiss German Dialect: Walliserdeutsch, Philip N. Garner, David Imseng and Thomas Meyer, in: Proceedings of Interspeech, 2014

Overview of the ImageCLEF 2014 Domain Adaptation Task, Barbara Caputo and Novi Patricia, in: ImageCLEF 2014: Overview and analysis of the results, 2014

The Young and the City: Crowdsourcing Urban Awareness in a Developing Country, Salvador Ruiz-Correa, Darshan Santani and Daniel Gatica-Perez, in: Proceedings of the First International Conference on IoT in Urban Space, pages 74-79, 2014

[DOI]
[URL]

Effect of nonverbal behavioral patterns on the performance of small groups, Umut Avci and Oya Aran, in: ICMI Workshop on Understanding and Modeling Multiparty Multimodal Interactions, Istanbul, Turkey, 2014

MODIFIED GROUP DELAY FEATURE BASED TOTAL VARIABILITY SPACE MODELLING FOR SPEAKER RECOGNITION, Srikanth Madikeri, Asha T and Hema A Murthy, in: International Journal of Speech Techonology, 18(1):17-23, 2014

[DOI]

Feature Switching in the i-vector Framework for Speaker Verification, Asha T, Saranya M S, Karthik Pandia D S, Srikanth Madikeri and Hema A Murthy, in: Proc. of Interspeech 2014, pages 5, 2014

Cross-Database Evaluation With an Open Finger Vein Sensor, Matthias Vanoni, Pedro Tome, Laurent El Shafey and Sébastien Marcel, in: IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BioMS), Rome, Italy, pages 30-35, IEEE, 2014

[DOI]

The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues, Volha Petukhova, Martin Gropp, Dietrich Klakow, Anna Schmidt, Gregor Eigner, Mario Topf, Stefan Srb, Petr Motlicek, Blaise Potard, John Dines, O. Deroo, Ronny Egeler, Uwe Meinz and Steffen Liersch, in: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, European Language Resources Association (ELRA), 2014

[URL]

How Do You Like Your Virtual Agent?: Human-Agent Interaction Experience through Nonverbal Features and Personality Traits, Aleksandra Cerekovic, Oya Aran and Daniel Gatica-Perez, in: Human Behavior Understanding, pages 1-15, Springer, 2014

Inferring Visual Attention and Addressee in Human Robot Interaction, Samira Sheikhi, École Polytechnique Fédérale de Lausanne (EPFL), 2014

Biometrics Evaluation Under Spoofing Attacks, Ivana Chingovska, André Anjos and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 9(12):2264-2276, 2014

[DOI]

Automatic Maya Hieroglyph Retrieval Using Shape and Context Information, Rui Hu, Carlos Pallan, Guido Krempel, Jean-Marc Odobez and Daniel Gatica-Perez, in: ACM MM, pages 4, 2014

[URL]

Evaluation Methodologies, Ivana Chingovska, André Anjos and Sébastien Marcel, in: Handbook of Biometric Antispoofing, Springer, 2014

Exemplar-based Sparse Representation for Posterior Features, Sara Bahaadini, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-11-2014

Weakly Supervised Object Segmentation with Convolutional Neural Networks, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-13-2014

Ad Hoc Microphone Array Calibration: Euclidean Distance Matrix Completion Algorithm and Theoretical Guarantees, Mohammad J. Taghizadeh, Reza Parhizkar, Philip N. Garner, Hervé Bourlard and Afsaneh Asaei, in: Signal Processing, 107:123–140, 2015

[DOI]

Explaining the Stars: Weighted Multiple-Instance Learning for Aspect-Based Sentiment Analysis, Nikolaos Pappas and Andrei Popescu-Belis, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 2014

Human Tracking and Pose Estimation in Open Spaces, Alexandre Heili, École Polytechnique Fédérale de Lausanne (EPFL), 2014

On the Vulnerability of Finger Vein Recognition to Spoofing, Pedro Tome, Matthias Vanoni and Sébastien Marcel, in: IEEE International Conference of the Biometrics Special Interest Group (BIOSIG), Darmstadt, Germay, pages 1 - 10, IEEE, 2014

Phoneme Background Model for Information Bottleneck based Speaker Diarization, Sree Harsha Yella, Petr Motlicek and Hervé Bourlard, in: Interspeech 2014, 2014

Information Bottleneck based Speaker Diarization of Meetings using Non-speech as Side Information, Sree Harsha Yella and Hervé Bourlard, in: ICASSP, Florence, IT, pages 96 - 100, IEEE, 2014

[DOI]

Inferring social relationships in a phone call from a single party's speech, Sree Harsha Yella, Xavier Anguera and Jordi Luque, in: ICASSP, Florence, IT, pages 4843 - 4847, IEEE, 2014

[DOI]

Detecting speaker roles and topic changes in multiparty conversations using latent topic models, A. Sapru and Hervé Bourlard, in: Proceedings of Interspeech, 2014

Dynamic Programming Boosting for Discriminative Macro-Action Discovery, Leonidas Lefakis and Francois Fleuret, in: International Conference on Machine Learning, 2014

Jointly Informative Feature Selection, Leonidas Lefakis and Francois Fleuret, in: International Conference on Artificial Intelligence and Statistics, pages 567–575, 2014

Audio-Visual Gender Recognition in Uncontrolled Environment Using Variability Modeling Techniques, Laurent El Shafey, Elie Khoury and Sébastien Marcel, in: International Joint Conference on Biometrics, Clearwater, Florida, USA, pages 1 - 8, IEEE, 2014

[DOI]
[URL]

Improving Head and Body Pose Estimation through Semi-supervised Manifold Alignment, Alexandre Heili, Jagannadan Varadarajan, Bernard Ghanem, Narendra Ahuja and Jean-Marc Odobez, in: International Conference on Image Processing, 2014

Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, Alexandre Heili, Adolfo Lopez-Mendez and Jean-Marc Odobez, in: Transactions on Image Processing, 2014

On Recognition of Non-Native Speech Using Probabilistic Lexical Model, Marzieh Razavi and Mathew Magimai-Doss, in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), 2014

Posterior-based Sparse Representation for Automatic Speech Recognition, Sara Bahaadini, Afsaneh Asaei, David Imseng and Hervé Bourlard, in: Proceeding of Interspeech, 2014

Raw Speech Signal-based Continuous Speech Recognition using Convolutional Neural Networks, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-15-2014

Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, Milos Cernak, Alexandros Lazaridis, Philip N. Garner and Petr Motlicek, Idiap-RR-10-2014

Recurrent Greedy Parsing with Neural Networks, Joël Legrand and Ronan Collobert, in: Proceedings of ECML 2014, pages 130-144, Springer Berlin Heidelberg, 2014

[DOI]

Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: IEEE Computer Vision and Pattern Recognition Conference, Columbus, Ohio,USA, pages 1773-1780, IEEE, 2014

[DOI]

Topic-Level Extractive Summarization of Lectures and Meetings Using a Snippet Similarity Graph, Chidansh A. Bhatt and Andrei Popescu-Belis, Idiap-RR-09-2014

Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, Milos Cernak, Alexandros Lazaridis, Philip N. Garner and Petr Motlicek, in: Interspeech, 2014

Syllable-based Regional Swiss French Accent Identification using Prosodic Features, Alexandros Lazaridis, Jean-Philippe Goldman, Mathieu Avanzi and Philip N. Garner, in: Nouveaux cahiers de linguistique francaise, 2014

Dialect Levelling in Finnish: A Universal Speech Attribute Approach, Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Elie Khoury, Tommi Kurki, Tomi Kinnunen and Chin-Hui Lee, in: The 15th Annual Conference of the International Speech Communication Association, 2014

Introducing I-Vectors for Joint Anti-spoofing and Speaker Verification, Elie Khoury, Tomi Kinnunen, Aleksandr Sizov, Zhizheng Wu and Sébastien Marcel, in: The 15th Annual Conference of the International Speech Communication Association, 2014

Comparison of Two Methods for Unsupervised Person Identification in TV Shows, Paul Gay, Gregor Dupuy, Jean-Marc Odobez, Sylvain Meignier and Paul Deleglise, in: 12th International Workshop on Content-Based Multimedia Indexing, 2014

Enhanced Diffuse Field Model for Ad Hoc Microphone Array Calibration, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, in: Signal Processing, 101:242-255, 2014

Modeling Overlapping Speech using Vector Taylor Series, Pranay Dighe, Marc Ferras and Hervé Bourlard, in: Odyssey: The Speaker and Language Recognition Workshop, Joensuu, Finland, 2014

MLP-based Factor Analysis for Tandem Speech Recognition, Marc Ferras and Hervé Bourlard, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2013

Enforcing Topic Diversity in a Document Recommender for Conversations, Maryam Habibi and Andrei Popescu-Belis, in: Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics), Dublin, Ireland, pages 746-759, IEEE, 2014

Recurrent Convolutional Neural Networks for Scene Labeling, Pedro H. O. Pinheiro and Ronan Collobert, in: 31st International Conference on Machine Learning (ICML), Beijing, China, pages 82-90, JMLR, 2014

[URL]

Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, Mohammad J. Taghizadeh, Afsaneh Asaei, Philip N. Garner and Hervé Bourlard, in: Proceedings of the 4th Joint Workshop on Hands-free speech communication and Microphone Arrays, Villers-les-Nancy, pages 1 - 5, IEEE, 2014

[DOI]

Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, Mohammad J. Taghizadeh, Afsaneh Asaei, Philip N. Garner and Hervé Bourlard, in: Proceeding of 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Face identification from overlaid texts using Local Face Recurrent Patterns and CRF models, Paul Gay, Elie Khoury, Sylvain Meignier, Jean-Marc Odobez and Paul Deleglise, in: IEEE International Conference on Image Processing 2014, Paris, IEEE, 2014

Scene Recognition with Naive Bayes Non-linear Learning, Marco Fornoni and Barbara Caputo, in: Proceedings of the 22nd International Conference on Pattern Recognition, Stockholm, pages 3404 - 3409, IEEE, 2014

[DOI]

Spoofing Face Recognition with 3D Masks, Nesli Erdogmus and Sébastien Marcel, in: IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY:1084-1097, 2014

[DOI]

A task-parameterized probabilistic model with minimal intervention control, Sylvain Calinon, D. Bruno and D. G. Caldwell, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3339 - 3344, IEEE, 2014

[DOI]

Null space redundancy learning for a flexible surgical robot, D. Bruno, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 2443 - 2448, IEEE, 2014

[DOI]

Learning from demonstrations with partially observable task parameters, T. Alizadeh, Sylvain Calinon and D. G. Caldwell, in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3309 - 3314, IEEE, 2014

[DOI]

Mode of Teaching Based Segmentation and Annotation of Video Lectures, Yogesh Singh Rawat, Chidansh A. Bhatt and Mohan S. Kankanhalli, in: International Workshop on Content-Based Multimedia Indexing, 2014

Improving Speaker Diarization using social role information, A. Sapru, Sree Harsha Yella and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2014

Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, Pierre-Edouard Honnet, Alexandros Lazaridis, Jean-Philippe Goldman and Philip N. Garner, in: Speech Prosody, 2014

Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, Alexandre Heili, Adolfo Lopez-Mendez and Jean-Marc Odobez, Idiap-RR-05-2014

Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, Srikanth Madikeri, David Imseng and Hervé Bourlard, Idiap-RR-18-2015

Hierarchical speaker clustering methods for the NIST i-vector Challenge, Elie Khoury, Laurent El Shafey, Marc Ferras and Sébastien Marcel, in: Odyssey: The Speaker and Language Recognition Workshop, 2014

Multi-Source Adaptive Learning for Fast Control of Prosthetics Hand, Novi Patricia, Tatiana Tommasi and Barbara Caputo, in: Proceedings of the International Conference on Pattern Recognition, Stockholm, pages 2769 - 2774, 2014

[DOI]

Learning to Learn, from Transfer Learning to Domain Adaptation: A Unifying Perspective, Novi Patricia and Barbara Caputo, in: Proceedings of the Computer Vision and Pattern Recognition, Columbus, OH, pages 1442-1449, IEEE, 2014

[DOI]

Sparse Gammatone Signal Model Predicts Perceived Noise Intrusiveness, Raphael Ullmann and Hervé Bourlard, Idiap-RR-07-2014

On Modeling Context-Dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech, and Signal Processing, Florence, IT, pages 7659 - 7663, IEEE, 2014

[DOI]

SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, Alexandros Lazaridis, Pierre-Edouard Honnet and Philip N. Garner, in: Speech Prosody, 2014

SWISS FRENCH REGIONAL ACCENT IDENTIFICATION, Alexandros Lazaridis, Elie Khoury, Jean-Philippe Goldman, Mathieu Avanzi, Sébastien Marcel and Philip N. Garner, in: Odyssey: The Speaker and Language Recognition Workshop, 2014

Scalable Probabilistic Models for Face and Speaker Recognition, Laurent El Shafey, École Polytechnique Fédérale de Lausanne (EPFL), 2014

[URL]

Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation, Ngoc Thang Vu, David Imseng, Daniel Povey, Petr Motlicek, Tanja Schultz and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, pages 7639-7643, IEEE, 2014

[DOI]

Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, David Imseng, Blaise Potard, Petr Motlicek, Alexandre Nanchen and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322 - 2326, IEEE, 2014

[DOI]

A Conditional Random field approach for audio-visual people diarization, Paul Gay, Elie Khoury, Sylvain Meignier, Jean-Marc Odobez and Paul Deleglise, in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 116 - 120, IEEE, 2014

[DOI]

EYEDIAP: A Database for the Development and Evaluation of Gaze Estimation Algorithms from RGB and RGB-D Cameras, Kenneth Alberto Funes Mora, Florent Monay and Jean-Marc Odobez, in: Proceedings of the ACM Symposium on Eye Tracking Research and Applications, Safety Harbor, Florida, United States of America, ACM, 2014

[DOI]

Cross-linguistic annotation of narrativity for English/French verb tense disambiguation, Cristina Grisot and Thomas Meyer, in: 9th Edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014

English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling, Sharid Loaiciga, Thomas Meyer and Andrei Popescu-Belis, in: The Ninth Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014

Multimodal Reranking of Content-based Recommendations for Hyperlinking Video Snippets, Chidansh A. Bhatt, Nikolaos Pappas, Maryam Habibi and Andrei Popescu-Belis, in: ACM International Conference on Multimedia Retrieval, 2014

Acoustic and Lexical Resource Constrained ASR using Language-Independent Acoustic Model and Language-Dependent Probabilistic Lexical Model, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-02-2014

Word Embeddings through Hellinger PCA, Rémi Lebret and Ronan Collobert, in: 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Hi YouTube! Personality Impressions and Verbal Content in Social Video, Joan-Isaac Biel, Daniel Gatica-Perez, John Dines and Vagia Tsminiaki, in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013, 2013

Theoretical Analysis of Euclidean Distance Matrix Completion for Ad hoc Microphone Array Calibration, Mohammad J. Taghizadeh, Idiap-RR-20-2014

EYEDIAP Database: Data Description and Gaze Tracking Evaluation Benchmarks, Kenneth Alberto Funes Mora, Florent Monay and Jean-Marc Odobez, Idiap-RR-08-2014

The Robot Vision Track at ImageCLEF 2010, Andrzej Pronobis, Marco Fornoni, Henrik I. Christensen and Barbara Caputo, in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010

[URL]

Combining Content with User Preferences for Non-Fiction Multimedia Recommendation: A Study on TED Lectures, Nikolaos Pappas and Andrei Popescu-Belis, in: Multimedia Tools and Applications, Special Issue on Content Based Multimedia Indexing, 74(4):1175-1197, 2015

[DOI]

SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, Alexandros Lazaridis, Pierre-Edouard Honnet and Philip N. Garner, Idiap-RR-03-2014

Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, Pierre-Edouard Honnet, Alexandros Lazaridis, Jean-Philippe Goldman and Philip N. Garner, Idiap-RR-04-2014

What to Show? Automatic Stream Selection Among Multiple Sensors, Remi Emonet, E. Oberzaucher and Jean-Marc Odobez, in: International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2014

Object Classification and Detection in High Dimensional Feature Space, Charles Dubout, Programme doctoral en Informatique, Communications et Information, 2013

Clustering flood events from water quality time-series using Latent Dirichlet Allocation model, Alice Aubert, Romain Tavenard, Remi Emonet, A. de Lavenne, Simon Malinowski, Thomas Guyet, René Quiniou, Jean-Marc Odobez, Philippe Merot and Chantal Gascuel, in: Water Resources Research, 2013

[DOI]

Speech Processing, Mathew Magimai-Doss, in: Interactive Multimodal Information Management, pages 221--245, EPFL Press, 2013

Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition, Aniruddha Adiga, Mathew Magimai-Doss and Chandra Sekhar Seelamantula, in: Proceedings of IEEE TENCON, 2013

A Savitzky-Golay Filtering Perspective of Dynamic Feature Computation, S. R. Krishnan, Mathew Magimai-Doss and C. S. Seelamantula, in: IEEE Signal Processing Letters, 20(3):281 -- 284, 2013

[DOI]

A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013

Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, Vasil Khalidov, Florence Forbes and Radu Horaud, in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013

Proceedings of the ACL Workshop on Discourse in Machine Translation (DiscoMT 2013), Bonnie Webber, Andrei Popescu-Belis, Katja Markert and Jorg Tiedemann, Association for Computational Linguistics, 2013

[URL]

On Modeling Context-dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-43-2013

3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, Kenneth Alberto Funes Mora, in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013

[DOI]

Context Aware Addressee Estimation for Human Robot Interaction, Samira Sheikhi, Dinesh Babu Jayagopi, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013

Leveraging the robot dialog state for visual focus of attention recognition, Samira Sheikhi, Vasil Khalidov, David Klotz, Britta Wrede and Jean-Marc Odobez, in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013

Multimodal Analysis of Body Communication Cues in Employment Interviews, Laurent Son Nguyen, Alvaro Marcos-Ramiro, Marta Marron-Romera and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction Proceedings, 2013

Evaluating Intra- and Crosslingual Adaptation for Non-native Speech Recognition in a Bilingual Environment, Gyorgy Szaszak and Philip N. Garner, in: Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications, IEEE, Budapest, Hungary, pages 357-361, 2013

Adaptive Sampling for Large Scale Boosting, Charles Dubout and Francois Fleuret, in: Journal of Machine Learning Research, 15:1431-1453, 2014

Is Deep Learning Really Necessary for Word Embeddings?, Rémi Lebret, Joël Legrand and Ronan Collobert, Idiap-RR-44-2013

Introduction to the Special Issue on Learning Semantics, Antoine Bordes, Léon Bottou, Ronan Collobert, Dan Roth, Jason Weston and Luke Zettlemoyer, in: Machine Learning, 2013

[DOI]

Recurrent Convolutional Neural Networks for Scene Labeling, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-41-2013

End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, Idiap-RR-40-2013

Re-Identification for Improved People Tracking, Francois Fleuret, Horesh Ben Shitrit and Pascal Fua, in: Person Re-Identification, pages 311-336, Springer, 2014

Using the Europarl corpus for cross-linguistic research, Bruno Cartoni, Sandrine Zufferey and Thomas Meyer, in: Belgian Journal of Linguistics(27):23 – 42, 2013

[URL]

Stable Myoelectric Control of a Hand Prosthesis using Non-Linear Incremental Learning, Arjan Gijsberts, Rashida Bohra, David Sierra González, Alexander Werner, Markus Nowak, Barbara Caputo, Maximo A. Roa and Claudio Castellini, in: Frontiers in Neurorobotics, 8, 2014

[DOI]

The Movement Error Rate for Evaluation of Machine Learning Methods for sEMG-based Hand Movement Classification, Arjan Gijsberts, Manfredo Atzori, Claudio Castellini, Henning Müller and Barbara Caputo, in: Transactions on Neural Systems and Rehabilitation Engineering:735 - 744, 2014

[DOI]

Characterization of a Benchmark Database for Myoelectric Movement Classification, Manfredo Atzori, Arjan Gijsberts, Ilja Kuzborskij, Simone Heynen, Anne-Gabrielle Mittaz Hager, Olivier Deriaz, Claudio Castellini, Henning Müller and Barbara Caputo, in: Transactions on Neural Systems and Rehabilitation Engineering, 23:73-83, 2014

[DOI]

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, Idiap-RR-39-2013

ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013

[DOI]

ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, Idiap-RR-38-2013

FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, Petr Motlicek, Daniel Povey and Martin Karafiat, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013

[DOI]

FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, Petr Motlicek, Daniel Povey and Martin Karafiat, Idiap-RR-37-2013

Manifold Sparse Beamforming, Baran Gözcü, Afsaneh Asaei and Volkan Cevher, in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013

[DOI]

Convexity in source separation: Models, geometry, and algorithms, Michael McCoy, Volkan Cevher, Quoc Tran Dinh, Afsaneh Asaei and Luca Baldassarre, in: IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications, 2013

Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, Elie Khoury, Paul Gay and Jean-Marc Odobez, Idiap-RR-31-2013

An Open-source State-of-the-art Toolbox for Broadcast News Diarization, Mickael Rouvier, Gregor Dupuy, Paul Gay, Elie Khoury, Teva Merlin and Sylvain Meignier, Idiap-RR-33-2013

Gesture control interface for immersive panoramic displays, Marcel Alcoverro, Xavier Suau, Adolfo Lopez-Mendez, Josep R. Morros, Javier Ruiz-Hidalgo, Albert Gil and Josep R. Casas, in: Multimedia Tools and Applications, 1380-7501:1-27, 2013

[DOI]

Exploiting Scene Cues for Dropped Object Detection, Adolfo Lopez-Mendez, Florent Monay and Jean-Marc Odobez, in: 9th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications., 2014

Creative Applications of Human Behavior Understanding. HBU 2013: 1-14, Albert Ali Salah, Hayley Hung, Oya Aran and Hatice Gunes, in: Human Behavior Understanding, pages 1-14, 2013

Inferring Mood in Ubiquitous Conversational Video, Dairazalia Sanchez-Cortes, Joan-Isaac Biel, Shiro Kumano, Junji Yamato, Kazuhiro Otsuka and Daniel Gatica-Perez, in: 12th International Conference on Mobile and Ubiquitous Multimedia, Luleå, Sweden, ACM Press, 2013

Model-based Sparse Component Analysis for Reverberant Speech Localization, Afsaneh Asaei, Hervé Bourlard, Mohammad J. Taghizadeh and Volkan Cevher, in: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1439 - 1443, IEEE, 2014

[DOI]

Combining Vocal Tract Length Normalization with Hierarchical Linear Transformations, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, in: IEEE Journal of Selected Topics in Signal Processing - Special Issue on Statistical Parametric Speech Synthesis, 8(2):262 - 272, 2014

[DOI]

Broadcasting oneself: Visual Discovery of Vlogging Styles, Oya Aran, Joan-Isaac Biel and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 16(1):201-215, 2014

[DOI]

One of a Kind: Inferring Personality Impressions in Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013

Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013

Multiclass Latent Locally Linear Support Vector Machines, Marco Fornoni, Barbara Caputo and Francesco Orabona, in: JMLR W&CP, Volume 29: Asian Conference on Machine Learning, Canberra, Australia, pages 229-244, 2013

[URL]

A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, Kenneth Alberto Funes Mora, Laurent Son Nguyen, Daniel Gatica-Perez and Jean-Marc Odobez, in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013

[DOI]

Unsupervised methods for activity analysis and detection of abnormal events, Remi Emonet and Jean-Marc Odobez, in: Intelligent Video Surveillance Systems (ISTE), Wiley, 2013

[DOI]

Temporal Analysis of Motif Mixtures using Dirichlet Processes, Remi Emonet, Jagannadan Varadarajan and Jean-Marc Odobez, in: IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), 36(1), 2014

Observation of Vehicle Axles Through Pass-by Noise: A Strategy of Microphone Array Design, Patrick Marmaroli, M. Carmona, Xavier Falourd, Hervé Lissek and Jean-Marc Odobez, in: IEEE Trans. on Intelligent Transportation Systems, 2013

Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, Alexandre Heili, Adolfo Lopez Mendez and Jean-Marc Odobez, Idiap-RR-06-2014

Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, Elie Khoury, Laurent El Shafey, Chris McCool, Manuel Günther and Sébastien Marcel, in: Image and Vision Computing:1147-1160, 2014

[DOI]
[URL]

On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, Elie Khoury, Manuel Günther, Laurent El Shafey and Sébastien Marcel, in: Biometric Technologies in Forensic Science, Nijmegen, The Netherlands, 2013

Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project, Hervé Bourlard, Marc Ferras, Nikolaos Pappas, Andrei Popescu-Belis, Steve Renals, Fergus McInnes, Peter Bell, Sandy Ingram and Maël Guillemot, in: Workshop on Speech, Language and Audio in Multimedia, 2013

Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2013

Idiap at MediaEval 2013: Search and Hyperlinking Task, Chidansh A. Bhatt, Nikolaos Pappas, Maryam Habibi and Andrei Popescu-Belis, in: MediaEval 2013 Workshop, Barcelona, Spain, CEUR-WS.org, 2013

Probabilistic Lexical Modeling and Unsupervised Training for Zero-Resourced ASR, Ramya Rasipuram, Marzieh Razavi and Mathew Magimai-Doss, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013

Reservoir Boosting : Between Online and Offline Ensemble Learning, Leonidas Lefakis and Francois Fleuret, in: Proceedings of the international conference on Neural Information Processing Systems, 2013

Multi-Commodity Network Flow for Tracking Multiple People, Horesh Ben Shitrit, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013

Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition, David Imseng, Petr Motlicek, Philip N. Garner and Hervé Bourlard, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013

Biometrics Evaluation under Spoofing Attacks, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-12-2014

A Survey of Personality Computing, Alessandro Vinciarelli and Gelareh Mohammadi, in: IEEE Transaction on Affective Computing, 5(3):273-291, 2014

Interactive Multimodal Information Management, Hervé Bourlard and Andrei Popescu-Belis, EPFL Press, 2013

Interactive Multimodal Information Management: Shaping the Vision, Andrei Popescu-Belis and Hervé Bourlard, in: Interactive Multimodal Information Management, pages 1-17, EPFL Press, 2013

Real-Time Audio-Visual Analysis for Multiperson Videoconferencing, Petr Motlicek, Stefan Duffner, Danil Korchagin, Hervé Bourlard, Carl Scheffler, Jean-Marc Odobez, Giovanni Del Galdo, Markus Kallinger and Oliver Thiergart, in: Advances in Multimedia, 2013:21, 2013

[DOI]
[URL]

Automatic Staging of Audio with Emotions, Lakshmi Saheer and Milos Cernak, in: International Conference on Affective Computing and Intelligent Interaction, 2013

Understanding Factors in Emotion Perception, Lakshmi Saheer and Blaise Potard, Idiap-RR-28-2013

Understanding Factors in Emotion Perception, Lakshmi Saheer and Blaise Potard, in: ISCA Speech Synthesis Workshop, 2013

Inferring social activities with mobile sensor networks, Trinh-Minh-Tri Do, Kyriaki Kalimeri, Bruno Lepri, Fabio Pianesi and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013

From Big Smartphone Data to Worldwide Research: The Mobile Data Challenge, J. K. Laurila, Daniel Gatica-Perez, Jan Blom, Olivier Bornet, Trinh-Minh-Tri Do, O. Dousse, Julien Eberle and Markus Miettinen, in: Pervasive and Mobile Computing, 9(6):752–771, 2013

Multi-factor Segmentation for Topic Visualization and Recommendation: the MUST-VIS System, Chidansh A. Bhatt, Andrei Popescu-Belis, Maryam Habibi, Sandy Ingram, Stefano Masneri, Fergus McInnes, Nikolaos Pappas and Oliver Schreer, in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 365-368, ACM, 2013

[DOI]
[URL]

Revisiting the Generality of the Rank-based Human Mobility Model, Darshan Santani and Daniel Gatica-Perez, in: Proceedings of the 2013 ACM Conference on Pervasive and Ubiquitous Computing Adjunct Publication, Zurich, Switzerland, pages 1209-1218, ACM, 2013

[DOI]
[URL]

Speaking Swiss: Languages and Venues in Foursquare, Darshan Santani and Daniel Gatica-Perez, in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 501-504, ACM, 2013

[DOI]
[URL]

On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, Elie Khoury, Manuel Günther, Laurent El Shafey and Sébastien Marcel, Idiap-RR-35-2013

Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, Nesli Erdogmus and Sébastien Marcel, in: International Conference of the Biometrics Special Interes Group, Darmstadt, Germany, 2013

Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, Nesli Erdogmus and Sébastien Marcel, in: Biometrics: Theory, Applications and Systems, Washington DC, USA, 2013

Investigating time-sensitive topic model approaches for action recognition, Romain Tavenard, Remi Emonet and Jean-Marc Odobez, Idiap-RR-26-2013

Euclidean Distance Matrix Completion for Ad-hoc Microphone Array Calibration, Mohammad J. Taghizadeh, Reza Parhizkar, Philip N. Garner and Hervé Bourlard, in: Proceedings IEEE International Conference On Digital Signal Processing, 2013

The vernissage corpus: a conversational human-robot-interaction dataset, Dinesh Babu Jayagopi, Samira Sheikhi, David Klotz, Johannes Wienke, Jean-Marc Odobez, Sebastian Wrede, Vasil Khalidov, Laurent Son Nguyen, Britta Wrede and Daniel Gatica-Perez, in: Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction, 2013

Word Embeddings through Hellinger PCA, Rémi Lebret and Ronan Collobert, Idiap-RR-29-2013

Given that, Should I Respond? Contextual Addressee Estimation in Multi-Party Human-Robot Interactions, Dinesh Babu Jayagopi and Jean-Marc Odobez, in: Proceedings of Human Robot Interaction (HRI) Conference, 2013

Discovering Temporal Patterns in Water Quality Time Series, Focusing on Floods with the LDA method, Alice Aubert, Romain Tavenard, Simon Malinowski, Thomas Guyet, René Quiniou, Jean-Marc Odobez, Remi Emonet and Chantal Gascuel, in: European Geosciences Union, 2013

Time-Sensitive Topic Models for Action Recognition in Videos, Romain Tavenard, Remi Emonet and Jean-Marc Odobez, in: IEEE International Conference on Image Processing, 2013

Learning to Rank on Network Data, Majid Yazdani, Ronan Collobert and Andrei Popescu-Belis, in: Mining and Learning with Graphs, 2013

Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, Majid Yazdani and Andrei Popescu-Belis, in: International Joint Conference on artificial intelligence, 2013

Automatic Speech Indexing System of Bilingual Video Parliament Interventions, Gyorgy Szaszak, Milos Cernak, Philip N. Garner, Petr Motlicek, Alexandre Nanchen and Flavio Tarsetti, Idiap-RR-25-2013

Deformable Part Models with Individual Part Scaling, Charles Dubout and Francois Fleuret, in: British Machine Vision Conference, 2013

Are ACT's scores increasing with better translation quality?, Najeh Hajlaoui, in: Are ACT's scores increasing with better translation quality?, pages 6, 2013

Accelerated Training of Linear Object Detectors, Charles Dubout and Francois Fleuret, in: CVPR 2013 Workshop on Structured Prediction, 2013

[URL]

Medical image annotation, Barbara Caputo, in: Interactive Multimodal Information Management, EPFL Press, 2013

Learning to learn new models of human activities in indoor settings1, Fabian Nater, Tatiana Tommasi, Luc Van Gool and Barbara Caputo, in: Interactive Multimodal Information Management, EPFL Press, 2013

Overview of the ImageCLEF 2013 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea, Miguel Cazorla and Barbara Caputo, in: Working Notes, CLEF 2013, 2013

Investigating the Impact of Language Style and Vocal Expression on Social Roles of Participants in Professional Meetings, A. Sapru and Hervé Bourlard, in: Affective Computing and Intelligent Interaction, Geneva, pages 324-329, IEEE, 2013

[DOI]

Noise Intrusiveness Factors in Speech Telecommunications, Raphael Ullmann, Hervé Bourlard, Jens Berger and Anna Llagostera Casanovas, in: Proceedings of the AIA-DAGA 2013 International Conference on Acoustics, Merano, Italy, pages 436-439, 2013

Multilingual speech recognition A posterior based approach, David Imseng, École Polytechnique Fédérale de Lausanne (EPFL), 2013

Mining Conversational Social Video, Joan-Isaac Biel, EPFL, 2013

Detecting Narrativity to Improve English to French Translation of Simple Past Verbs, Thomas Meyer, Cristina Grisot and Andrei Popescu-Belis, in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 33-42, 2013

Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, Milos Cernak, Xingyu Na and Philip N. Garner, Idiap-RR-24-2013

Where and What: Using Smartphones to Predict Next Locations and Applications in Daily Life, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: Pervasive and Mobile Computing, 2013

Automatic Social Role Recognition In Professional Meetings Using Conditional Random Fields, A. Sapru and Hervé Bourlard, in: Proceedings of Interspeech, 2013

Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, Gyorgy Szaszak and Andras Beke, Idiap-RR-23-2013

Recurrent Convolutional Neural Networks for Scene Parsing, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-22-2013

Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2013

Machine Translation with Many Manually Labeled Discourse Connectives, Thomas Meyer and Lucie Polakova, in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 43-50, 2013

Implicitation of Discourse Connectives in (Machine) Translation, Thomas Meyer and Bonnie Webber, in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 19-26, 2013

Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, Milos Cernak, Xingyu Na and Philip N. Garner, in: Proc. of Interspeech 2013, Lyon, France, 2013

Person Independent 3D Gaze Estimation From Remote RGB-D Cameras, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: International Conference on Image Processing, Melbourne, Australia, IEEE, 2013

[DOI]

Automatic Personality Perception: Inferring Personality Traits from Nonverbal Vocal Behavior, Gelareh Mohammadi, Electrical Engineering Department, EPFL, 2013

Who is Persuasive? The Role of Perceived Personality and Communication Modality in Social Multimedia, Gelareh Mohammadi, Sunghyun Park, Kenji Sagae, Alessandro Vinciarelli and Louis-Philippe Morency, in: International Conference on Multimodal Interaction, 2013

A Survey on Perceived Speaker Traits: Personality, Likability, Pathology and the First Challenge, Björn Schuller, Stefan Steidl, Anton Batliner, Elmar Nöth, Alessandro Vinciarelli, Felix Burkhardt, felix Weninger, Florian Eyben, Tobias Bocklet, Gelareh Mohammadi and Benjamin Weiss, in: Computer Speech and Language, 19(1):100-131, 2015

[DOI]

Diverse Keyword Extraction from Conversations, Maryam Habibi and Andrei Popescu-Belis, in: Proceedings of the ACL 2013 (51th Annual Meeting of the Association for Computational Linguistics ), Short Papers, Sofia, Bulgaria, pages 651-657, ACL, 2013

Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, Gyorgy Szaszak and Andras Beke, in: Proc. of Interspeech 2013, 2013

An Open-source State-of-the-art Toolbox for Broadcast News Diarization, Mickael Rouvier, Gregor Dupuy, Paul Gay, Elie Khoury, Teva Merlin and Sylvain Meignier, in: INTERSPEECH, 2013

Unsupervised Methods for Activity Analysis and Detection of Abnormal Events, Remi Emonet and Jean-Marc Odobez, Idiap-RR-21-2013

Analyse non supervisée d'activités en vidéo surveillance pour l'analyse de scène et la détection d'événements anormaux, Remi Emonet and Jean-Marc Odobez, Idiap-RR-20-2013

[URL]

Extracting Motifs from Time Series Generated by Concurrent Activities., Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS workshop on Learning and Planning from Batch Time Series Data, 2010

I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, Rahim Saedi, Kong Aik Lee, Tomi Kinnunen, Tawfik Hasan, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Jia Min Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilci, Billy Braithwaite, Gonzalez-Hautamäki Rosa, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, Navid Shokouhi, Driss Matrouf, Laurent El Shafey, Pejman Mowlaee, Julien Epps, Tharmarajah Thiruvaran, David Van Leeuwen, Bin Ma, Haizhou Li, John Hansen, Jean-François Bonastre, Sébastien Marcel, John Mason and Eliathamby Ambikairajah, in: INTERSPEECH, Lyon, France, 2013

Stability and Hypothesis Transfer Learning, Ilja Kuzborskij and Francesco Orabona, in: International Conference on Machine Learning, 2013

Annotating the meaning of discourse connectives by looking at their translation: The translation-spotting technique, Bruno Cartoni, Sandrine Zufferey and Thomas Meyer, in: Dialogue & Discourse, 4(2):65-86, 2013

[DOI]

Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, Elie Khoury, Laurent El Shafey, Chris McCool, Manuel Günther and Sébastien Marcel, Idiap-RR-30-2013

The 2013 Face Recognition Evaluation in Mobile Environment, Manuel Günther, Artur Costa-Pazo, Changxing Ding, Elhocine Boutellaa, Giovani Chiachia, Honglei Zhang, Marcus de Assis Angeloni, Vitomir Struc, Elie Khoury, Esteban Vazquez-Fernandez, Dacheng Tao, Messaoud Bengherabi, David Cox, Serkan Kiranyaz, Tiago de Freitas Pereira, Jerneja Zganec-Gros, Enrique Argones-Rúa, Nicolas Pinto, Moncef Gabbouj, Flávio Simões, Simon Dobrisek, Daniel González-Jiménez, Anderson Rocha, Mário Uliani Neto, Nikola Pavesic, Alexandre Falcão, Ricardo Violato and Sébastien Marcel, in: The 6th IAPR International Conference on Biometrics, 2013

The 2013 Speaker Recognition Evaluation in Mobile Environment, Elie Khoury, Bostjan Vesnicer, Javier Franco-Pedroso, Ricardo Violato, Zenelabidine Boulkenafet, Luis-Miguel Mazaira Fernandez, Mireia Diez, Justina Kosmala, Houssemeddine Khemiri, Tomas Cipr, Rahim Saedi, Manuel Günther, Jerneja Zganec-Gros, Ruben Zazo Candil, Flávio Simões, Messaoud Bengherabi, Augustin Alvarez Marquina, Mikel Penagarikano, Alberto Abad, Mehdi Boulayemen, Petr Schwarz, David Van Leeuwen, Javier Gonzalez-Domınguez, Mário Uliani Neto, Elhocine Boutellaa, Pedro Gomez Vilda, Amparo Varona, Dijana Petrovska-Delacretaz, Pavel Matejka, Joaquin Gonzalez-Rodrıguez, Tiago de Freitas Pereira, Farid Harizi, Luis Javier Rodriguez-Fuentes, Laurent El Shafey, Marcus Angeloni, German Bordel, Gérard Chollet and Sébastien Marcel, in: The 6th IAPR International Conference on Biometrics, 2013

Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, Elie Khoury, Paul Gay and Jean-Marc Odobez, in: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, Dallas, Texas, USA, pages 97-104, ACM, 2013

Exploiting Accelerometers to Improve Movement Classification for Prosthetics, Arjan Gijsberts and Barbara Caputo, in: International Conference on Rehabilitation Robotics, 2013

Anti-spoofing in action: joint operation with a verification system, Ivana Chingovska, André Anjos and Sébastien Marcel, in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Workshop on Biometrics, Portland, Oregon, 2013

The 2nd competition on counter measures to 2D face spoofing attacks, Ivana Chingovska, Jinwei Yang, Zhen Lei, Dong Yi, Stan Z.Li, Olga Kähm, Naser Damer, Christian Glaser, Arjan Kuijper, Alexander Nouak, Jukka Komulainen, Tiago de Freitas Pereira, Shubham Gupta, Shubham Bansal, Shubham Khandelwal, Ayush Rai, Tarun Krishna, Dushyant Goyal, Muhammad-Adeel Waris, Honglei Zhang, Iftikhar Ahmad, Serkan Kiranyaz, Moncef Gabbouj, Roberto Tronci, Maurizio Pili, Nicola Sirena, Fabio Roli, Javier Galbally, Julian Fierrez, Allan Pinto, Helio Pedrini, William Robson Schwartz, Anderson Rocha, André Anjos and Sébastien Marcel, in: International Conference of Biometrics 2013, Madrid, Spain, 2013

Sentiment Analysis of User Comments for One-Class Collaborative Filtering over TED Talks, Nikolaos Pappas and Andrei Popescu-Belis, in: 36th ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, ACM, 2013

Probabilistic Lexical Modeling and Grapheme-based Automatic Speech Recognition, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-15-2013

From N to N+1: Multiclass Transfer Incremental Learning, Ilja Kuzborskij, Francesco Orabona and Barbara Caputo, in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013

Applying multi- and cross-lingual stochastic phone space transformations to non-native speech recognition, David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai-Doss, in: IEEE Transactions on Audio, Speech, and Language Processing, 2013

[DOI]

Combining Content with User Preferences for TED Lecture Recommendation, Nikolaos Pappas and Andrei Popescu-Belis, in: Proceedings of the 11th International Workshop on Content Based Multimedia Indexing, Veszprém, Hungary, IEEE, 2013

The 2nd Competition on Counter Measures to 2D Face Spoofing Attacks, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-18-2013

Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-14-2013

The 2013 Speaker Recognition Evaluation in Mobile Environment, Elie Khoury, Bostjan Vesnicer, Javier Franco-Pedroso, Ricardo Violato, Zenelabidine Boulkenafet, Luis-Miguel Mazaira Fernandez, Mireia Diez, Justina Kosmala, Houssemeddine Khemiri, Tomas Cipr, Rahim Saedi, Manuel Günther, Jerneja Zganec-Gros, Ruben Zazo Candil, Flávio Simões, Messaoud Bengherabi, Augustin Alvarez Marquina, Mikel Penagarikano, Alberto Abad, Mehdi Boulayemen, Petr Schwarz, David Van Leeuwen, Javier Gonzalez-Domınguez, Mário Uliani Neto, Elhocine Boutellaa, Pedro Gomez Vilda, Amparo Varona, Dijana Petrovska-Delacretaz, Pavel Matejka, Joaquin Gonzalez-Rodrıguez, Tiago de Freitas Pereira, Farid Harizi, Luis Javier Rodriguez-Fuentes, Laurent El Shafey, Marcus de Assis Angeloni, German Bordel, Gérard Chollet and Sébastien Marcel, Idiap-RR-32-2013

The 2013 Face Recognition Evaluation in Mobile Environment, Manuel Günther, Artur Costa-Pazo, Changxing Ding, Elhocine Boutellaa, Giovani Chiachia, Honglei Zhang, Marcus de Assis Angeloni, Vitomir Struc, Elie Khoury, Esteban Vazquez-Fernandez, Dacheng Tao, Messaoud Bengherabi, David Cox, Serkan Kiranyaz, Tiago de Freitas Pereira, Jerneja Zganec-Gros, Enrique Argones-Rúa, Nicolas Pinto, Moncef Gabbouj, Flávio Simões, Simon Dobrisek, Daniel González-Jiménez, Anderson Rocha, Mário Uliani Neto, Nikola Pavesic, Alexandre Falcão, Ricardo Violato and Sébastien Marcel, Idiap-RR-36-2013

Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, Idiap-RR-13-2013

Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, Nesli Erdogmus and Sébastien Marcel, Idiap-RR-27-2013

Anti-spoofing in action: joint operation with a verification system, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-19-2013

Evaluating Shape Descriptors for Detection of Maya Hieroglyphs, Edgar Roman-Rangel, Jean-Marc Odobez and Daniel Gatica-Perez, in: in Proc. Mexican Conf. on Pattern Recognition, Queretaro, 2013

Statistical models for HMM/ANN hybrids, Philip N. Garner and David Imseng, Idiap-RR-11-2013

From Foursquare to my Square: Learning Check-in Behavior from Multiple Sources, Eric Malmi, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: The 7th International AAAI Conference on Weblogs and Social Media, 2013

Bias Adaptation for Vocal Tract Length Normalization, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, Idiap-RR-12-2013

CONNECTIONIST SPEECH RECOGNITION - A Hybrid Approach, Hervé Bourlard and Nelson Morgan, KLUWER ACADEMIC PUBLISHERS, 1994

Adaptation Experiments on French MediaParl ASR, Gyorgy Szaszak, Idiap-RR-10-2013

Grapheme and Multilingual Posterior Features for Under-Resourced Speech Recognition: A Study on Scottish Gaelic, Ramya Rasipuram, Peter Bell and Mathew Magimai-Doss, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

Improved Overlap Speech Diarization of Meeting Recordings using Long-term Conversational Features, Sree Harsha Yella and Hervé Bourlard, in: ICASSP, 2013

Enhancing State Mapping-Based Cross-Lingual Speaker Adaptation using Phonological Knowledge in a Data-Driven Manner, Hui Liang and John Dines, Idiap-RR-08-2013

Speaker adaptive Kullback-Leibler divergence based hidden Markov models, David Imseng and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

A Scalable Formulation of Probabilistic Linear Discriminant Analysis: Applied to Face Recognition, Laurent El Shafey, Chris McCool, Roy Wallace and Sébastien Marcel, Idiap-RR-07-2013

[URL]

On the (Un)importance of the Contextual Factors In HMM-Based Speech Synthesis, Milos Cernak, Petr Motlicek and Philip N. Garner, in: Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, pages 8140 - 8143, 2013

Convolutional Pitch Target Approximation Model for Speech Synthesis, Xingyu Na and Philip N. Garner, Idiap-RR-05-2013

Fast Object Detection with Entropy-Driven Evaluation, Raphael Sznitman, Carlos Becker, Francois Fleuret and Pascal Fua, in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013

KL-HMM and Probabilistic Lexical Modeling, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-04-2013

Who Wants To Be A Millionaire? (II), Huseyn Gasimov, Petr Motlicek and Hervé Bourlard, Idiap-Com-02-2013

Learning Categories from Few Examples with Multi Model Knowledge Transfer, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, Idiap-RR-16-2013

Learning to Learn by Exploiting Prior Knowledge, Tatiana Tommasi, EDIC, 2013

The Places of Our Lives: Visiting Patterns and Automatic Labeling from Longitudinal Smartphone Data, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: IEEE Transactions on Mobile Computing, 2013

Using out-of-language data to improve an under-resourced speech recognizer, David Imseng, Petr Motlicek, Hervé Bourlard and Philip N. Garner, Idiap-RR-09-2013

Using out-of-language data to improve an under-resourced speech recognizer, David Imseng, Petr Motlicek, Hervé Bourlard and Philip N. Garner, in: Speech Communication, 2013

[DOI]
[URL]

Adaptive Relevance Feedback for Large-scale Image Retrieval, Nicolae Suditu, EPFL, 2013

Statistical Shape Descriptors for Ancient Maya Hieroglyphs Analysis, Edgar Roman-Rangel, École Polytechnique Fédérale de Lausanne, 2012

Distinguishing the Popularity Between Topics: A System for Up-to-date Opinion Retrieval and Mining in the Web, Nikolaos Pappas, Georgios Katsimpras and Efstathios Stamatatos, in: Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics, LNCS, Samos, Greece, ACM, 2013

[URL]

Assessing the Accuracy of Discourse Connective Translations: Validation of an Automatic Metric, Najeh Hajlaoui and Andrei Popescu-Belis, in: 14th International Conference on Intelligent Text Processing and Computational Linguistics, University of the Aegean, Samos, Greece, pages 236-247, Springer, 2013

[DOI]

Regularized Bundle Methods for Convex and Non-Convex Risks, Trinh-Minh-Tri Do and Thierry Artieres, in: Journal of Machine Learning Research, 13:3539-3583, 2012

Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, Nesli Erdogmus and Sébastien Marcel, Idiap-RR-42-2013

Body communicative cue extraction for conversational analysis, Alvaro Marcos-Ramiro, Daniel Pizarro-Perez, Marta Marron-Romera, Laurent Son Nguyen and Daniel Gatica-Perez, in: Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, 2013

Unified Framework Of Feature Based Adaptation For Statistical Speech Synthesis And Recognition, Lakshmi Saheer, Ecole Polytechnique Federale de Lausanne (EPFL), 2012

Data-Driven Enhancement of State Mapping-Based Cross-Lingual Speaker Adaptation, Hui Liang, Idiap-RR-38-2012

FaceTube: predicting personality from facial expressions of emotion in online conversational video, Joan-Isaac Biel, Lucia Teijeiro-Mosquera and Daniel Gatica-Perez, in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2012

Speaker Diarization and Linking of Large Corpora, Marc Ferras and Hervé Bourlard, in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012

Robot-to-group Interaction in a Vernissage: Architecture & Dataset for Multi-party Dialog, David Klotz, Johannes Wienke, Britta Wrede, Sebastian Wrede, Samira Sheikhi, Dinesh Babu Jayagopi, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of 5th International Conference on Cognitive Systems, 2012

Implementing Neural Networks Efficiently, Ronan Collobert, Koray Kavukcuoglu and Clément Farabet, in: Neural Networks: Tricks of the Trade, Springer, 2012

Deep Learning via Semi-Supervised Embedding, Jason Weston, Frédéric Ratle, Hossein Mobahi and Ronan Collobert, in: In Neural Networks: Tricks of the Trade, Springer, 2012

A Method, Apparatus and Computer Program for Determining the Location of a Plurality of Speech Source, Afsaneh Asaei, Hervé Bourlard and Volkan Cevher, in: 2012US-13/654055, 2012

[URL]

Structured Sparse Acoustic Modeling for Speech Separation, Afsaneh Asaei, Mohammad Golbabaee, Hervé Bourlard and Volkan Cevher, in: Signal Processing with Adaptive Sparse Structured Representations SPARS, SPARS, 2013

Model-based Sparse Component Analysis for Multiparty Distant Speech Recognition, Afsaneh Asaei, École Polytechnique Fédérale de Lausanne, 2013

A Multipath Sparse Beamfroming Method, Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard and Volkan Cevher, in: Signal Processing with Adaptive Sparse Structured Representations SPARS, 2013

Unsupervised Activity Analysis and Monitoring algorithms for Effective Surveillance Systems, Jean-Marc Odobez, C. Carincotte, Remi Emonet, E. Jouneau, Sofia Zaidenberg, Bertrand Raverra, Francois Bremond and Andrea Grifoni, in: European Conference on Computer Vision, 2012

A Track Creation and Deletion Framework for Long-Term Online Multi-Face Tracking, Stefan Duffner and Jean-Marc Odobez, in: IEEE Transactions on Image Processing, 2013

Sampling techniques for audio-visual tracking and head pose estimation, Jean-Marc Odobez and Oswald Lanz, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 84-102, Cambridge University Press, 2012

Parameter Estimation and Contextual Adaptation for a Multi-Object Tracking CRF Model, Alexandre Heili and Jean-Marc Odobez, in: IEEE Workshop on Performance Evaluation of Tracking and Surveillance, 2013

Recognizing the Visual Focus of Attention for Human Robot Interaction, Samira Sheikhi, Vasil Khalidov and Jean-Marc Odobez, in: IEEE International Conference on Intelligent Robots and Systems (IROS) - Human Behavior Understanding Workshop(IROS-HBU), 2012

Investigating the Midline Effect for Visual Focus of Attention Recognition, Samira Sheikhi and Jean-Marc Odobez, in: Int Conf. on Multimodal Interaction (ICMI), Santa Monica, 2012

The I4U Submission to the 2012 NIST Speaker Recognition Evaluation, Kong Aik Lee, Rahim Saedi, Tawfik Hasan, Tomi Kinnunen, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Tharmarajah Thiruvaran, Changhuai You, Padmanabhan Rajan, David Van Leeuwen, Seyed Omid Sadjadi, Driss Matrouf, Laurent El Shafey, John Mason, Eliathamby Ambikairajah, Hanwu Sun, Anthony Larcher, Bin Ma, Ville Hautamäki, Cemal Hanilci, Billy Braithwaite, Gonzalez-Hautamäki Rosa, Gang Liu, Hynek Boril, Navid Shokouhi, John Hansen, Jean-François Bonastre and Sébastien Marcel, in: NIST Speaker Recognition Conference, 2012

Together Anywhere, Together Anytime, Technologies for Intimate Interactions, Dick C. A. Bulterman, Petr Motlicek, Stefan Duffner and Danil Korchagin, Centrum Wiskunde & Informatica, 2012

IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, Petr Motlicek, Fabio Valente and Igor Szoke, in: Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, Japan, pages 4413-4416, 2012

IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, Petr Motlicek, Fabio Valente and Igor Szoke, Idiap-RR-36-2012

ICB 2013 - Competition on speaker recognition in mobile environment using the MOBIO database: The Evaluation Plan, Elie Khoury, Sébastien Marcel and Manuel Günther, Idiap-Com-04-2012

I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, Rahim Saedi, Kong Aik Lee, Tomi Kinnunen, Tawfik Hasan, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Jia Min Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilci, Billy Braithwaite, Gonzalez-Hautamäki Rosa, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, Navid Shokouhi, Driss Matrouf, Laurent El Shafey, Pejman Mowlaee, Julien Epps, Tharmarajah Thiruvaran, David Van Leeuwen, Bin Ma, Haizhou Li, John Hansen, Jean-François Bonastre, Sébastien Marcel, John Mason and Eliathamby Ambikairajah, Idiap-RR-34-2013

The Idiap Speaker Recognition Evaluation System at NIST SRE 2012, Elie Khoury, Laurent El Shafey and Sébastien Marcel, in: NIST Speaker Recognition Conference, NIST, Orlando, USA, 2012

Automatic Social Role Recognition In Professional Meetings, A. Sapru and Hervé Bourlard, Idiap-RR-35-2012

Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, Petr Motlicek, Laurent El Shafey, Roy Wallace, Chris McCool and Sébastien Marcel, in: Proceedings of the 21st International Conference on Pattern Recognition, 2012

Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, Laurent El Shafey, Roy Wallace and Sébastien Marcel, in: Proceedings of the 11th International Conference of the Biometrics Special Interest Group, Darmstadt, Germany, pages 397-408, GI-Edition, 2012

Grapheme and Multilingual Posterior Features For Under-Resource Speech Recognition: A Study on Scottish Gaelic, Ramya Rasipuram, Peter Bell and Mathew Magimai-Doss, Idiap-RR-34-2012

Modeling dominance effects on nonverbal behaviors using granger causality, Kyriaki Kalimeri, Bruno Lepri, Oya Aran, Dinesh Babu Jayagopi, Daniel Gatica-Perez and Fabio Pianesi, in: Proceedings of International Conference on Multimodal Interaction, ICMI 2012, Santa Monica, CA, 2012

The TA2 Database – A Multi-Modal Database From Home Entertainment, Stefan Duffner, Petr Motlicek and Danil Korchagin, in: International Journal of Computer and Electrical Engineering, 4(5):670-673, 2012

[URL]

Real-time model learning using Incremental Sparse Spectrum Gaussian Process Regression, Arjan Gijsberts and Giorgio Metta, in: Neural Networks, 2012

A Simple Continuous Pitch Estimation Algorithm, Philip N. Garner, Milos Cernak and Petr Motlicek, in: IEEE Signal Processing Letters, 20(1):102--105, 2013

[URL]

treeKL: A distance between high dimension empirical distributions, Riwal Lefort and Francois Fleuret, in: Pattern Recognition Letters, 34(2):140-145, 2013

On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, Ivana Chingovska, André Anjos and Sébastien Marcel, in: Proceedings of the 11th International Conference of the Biometrics Special Interes Group, 2012

ON THE (UN)IMPORTANCE OF THE CONTEXTUAL FACTORS IN HMM-BASED SPEECH SYNTHESIS AND CODING, Milos Cernak, Petr Motlicek and Philip N. Garner, Idiap-RR-06-2013

Data-Driven Enhancement of State Mapping-Based Cross-Lingual Speaker Adaptation, Hui Liang, École Polytechnique Fédérale de Lausanne, 2012

Improving Object Classification using Pose Information, Hugo Penedones, Ronan Collobert, Francois Fleuret and David Grangier, Idiap-RR-30-2012

Leveraging speaker diarization for meeting recognition from distant microphones, Andreas Stolcke, Gerald Friedland and David Imseng, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390--4393, 2010

Checking In or Checked In: Comparing Large-Scale Manual and Automatic Location Disclosure Patterns, Eric Malmi, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012

Socio-Technical Network Analysis from Wearable Interactions, Katayoun Farrahi, Remi Emonet and Alois Ferscha, in: International Symposium on Wearable Computers, 2012

A Sequential Topic Model for Mining Recurrent Activities from Long Term Video Logs, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: International Journal of Computer Vision, 103(1):100-126, 2013

Macro-Action Discovery Based on Change Point Detection and Boosting, Leonidas Lefakis and Francois Fleuret, in: International Conference on Machine Learning and Applications, 2012

An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, Manuel Günther, Roy Wallace and Sébastien Marcel, in: Computer Vision - ECCV 2012. Workshops and Demonstrations, Idiap Research Institute, Heidelberg, pages 547-556, Springer Berlin, 2012

[DOI]
[URL]

Overview of the ImageCLEF 2012 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea and Barbara Caputo, in: Working Notes of the ImageCLEF 2012 Laboratory, 2012

Baseline Multimodal Place Classifier for the 2012 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea and Barbara Caputo, in: Working Notes of the ImageCLEF 2012 Laboratory, 2012

MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, Idiap-RR-03-2013

MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263--268, 2012

Beyond Dataset Bias: Multi-task Unaligned Shared Knowledge Transfer, Tatiana Tommasi, Novi Quadrianto, Barbara Caputo and Christoph H. Lampert, in: Asian Conference on Computer Vision, 2012

Face Recognition with Disparity Corrected Gabor Phase Differences, Manuel Günther, Dennis Haufe and Rolf P. Würtz, in: Artificial Neural Networks and Machine Learning, Heidelberg, pages 411-418, Springer Berlin, 2012

[DOI]

An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, Manuel Günther, Roy Wallace and Sébastien Marcel, Idiap-RR-29-2012

Exact Acceleration of Linear Object Detectors, Charles Dubout and Francois Fleuret, in: Proceedings of the European Conference on Computer Vision, 2012

Empirical validations of multilingual annotation schemes for discourse relations, Sandrine Zufferey, Liesbeth Degand, Andrei Popescu-Belis and Ted Sanders, in: 8th Joint ACL-ISO Workshop on Interoperable Semantic Annotation, 2012

Predicting the Conflict Level in Television Political Debates: an Approach Based on Crowdsourcing, Nonverbal Communication and Gaussian Processes, Samuel Kim, Maurizio Filippone, Fabio Valente and Alessandro Vinciarelli, in: ACM Multimedia, 2012

Collecting data for socially intelligent surveillance and monitoring approaches: the case of conflict in competitive conversations, Alessandro Vinciarelli, Samuel Kim, Fabio Valente and Hugues Salamin, in: International Symposium on Communications, Control, and Signal Processing, 2012

Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, in: IEEE Transactions on Audio, Speech, and Language Processing, 2012

Crowdsourcing Micro-Level Multimedia Annotations: The Challenges of Evaluation and Interface, Sunghyun Park, Gelareh Mohammadi, Ron Artstein and Louis-Philippe Morency, in: Proceedings of International ACM Workshop on Crowdsourcing for Multimedia, 2012

Annotation and Recognition of Personality Traits in Spoken Conversations from the AMI Meetings Corpus, Fabio Valente, Samuel Kim and Petr Motlicek, in: Proceedings of Interspeech 2012, 2012

DiarTk : An Open Source Toolkit for Research in Multistream Speaker Diarization and its Application to Meetings Recordings, Deepu Vijayasenan and Fabio Valente, in: Proceedings of Interspeech, 2012

Detecting and Labeling Folk Literature in Spoken Cultural Heritage Archives using Structural and Prosodic Features, Fabio Valente and Petr Motlicek, in: IEEE Content Based Multimedia Indexing, 2012

Speaker Diarization of Meetings based on large TDOA feature vectors, Deepu Vijayasenan and Fabio Valente, in: Proceedings of International Conference on Acoustic, Speech and Signal Processing, 2012

Translating English Discourse Connectives into Arabic: a Corpus-based Analysis and an Evaluation Metric, Najeh Hajlaoui and Andrei Popescu-Belis, in: Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), 2012

An Agent-Based Focused Crawling Framework for Topic- and Genre-Related Web Document Discovery, Nikolaos Pappas, Georgios Katsimpras and Efstathios Stamatatos, in: 24th IEEE International Conference on Tools with Artificial Intelligence, Athens, Greece, IEEE, 2012

[URL]

Linking Speaking and Looking Behavior Patterns with Group Composition, Perception, and Performance, Dinesh Babu Jayagopi, Dairazalia Sanchez-Cortes, Kazuhiro Otsuka, Junji Yamato and Daniel Gatica-Perez, in: Proceedings of the International Conference on Multimodal Interaction (ICMI), Santa Monica, USA, 2012

Sequential Topic Models for Mining Recurrent Activities and their Relationships : Application to long term video recordings, Jagannadan Varadarajan, École Polytechnique Fédérale de Lausanne, 2012

Iterative Relevance Feedback with Adaptive Exploration/Exploitation Trade-off, Nicolae Suditu and Francois Fleuret, in: Proceedings of the 21st ACM Conference on Information and Knowledge Management, pages 1323-1331, 2012

Machine Translation of Labeled Discourse Connectives, Thomas Meyer, Andrei Popescu-Belis, Najeh Hajlaoui and Andrea Gesmundo, in: Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), pages 10, 2012

Using Sparse Classification Outputs as Feature Observations for Noise Robust ASR, Yang Sun, B. Cranen, Jort F. Gemmeke, Lou Boves, Louis ten Bosch and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2012

Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR, Yang Sun, Mathew Magimai-Doss, Jort F. Gemmeke, B. Cranen, Louis ten Bosch and Lou Boves, in: Proceedings of Interspeech, 2012

Baseline System for Automatic Speech Recognition with French GlobalPhone Database, Sandrine Revaz and Milos Cernak, Idiap-RR-26-2012

Reading Companion: The Technical and Social Design of an Automated Reading Tutor, Arthur Kantor, Milos Cernak, Jiri Havelka, Sean Huber, Jan Kleindienst and Doris B. Gonzalez, in: Workshop on Child, Computer and Interaction, Portland, Oregon, U.S.A., 2012

The Vernissage Corpus: A Multimodal Human-Robot-Interaction Dataset, Dinesh Babu Jayagopi, Samira Sheikhi, David Klotz, Johannes Wienke, Jean-Marc Odobez, Sebastian Wrede, Vasil Khalidov, Laurent Son Nguyen, Britta Wrede and Daniel Gatica-Perez, Idiap-RR-33-2012

From Research to Reality: Evaluation of a Single-Computer Real-Time LVCSR System for Speech-Based Retrieval, Andrei Popescu-Belis, Maryam Habibi, Philip N. Garner and Nan Li, Idiap-RR-12-2017

Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, Marco Fornoni and Barbara Caputo, in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012

Leveraging over prior knowledge for online learning of visual categories, Tatiana Tommasi, Francesco Orabona, Mohsen Kaboli and Barbara Caputo, in: Proceedings of the British Machine Vision Conference, 2012

Contextual Conditional Models for Smartphone-based Human Mobility Prediction, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: Proceedings of the 14th ACM International Conference on Ubiquitous Computing, 2012

Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, Maryam Habibi and Andrei Popescu-Belis, in: RecSys, Recommendation Utility Evaluation (RUE 2012), Dublin, Ireland, pages 15-20, 2012

Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, Idiap-RR-23-2012

Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, Petr Motlicek, Philip N. Garner, David Imseng and Fabio Valente, Idiap-RR-20-2012

Building the NinaPro Database: a Resource for the Biorobotics Community, Manfredo Atzori, Arjan Gijsberts, Simone Heynen, Anne-Gabrielle Mittaz Hager, Olivier Deriaz, Patrick van der Smagt, Claudio Castellini, Barbara Caputo and Henning Müller, in: Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2012

Who Wants To Be A Millionaire?, Huseyn Gasimov, Aleksei Triastcyn, Petr Motlicek and Hervé Bourlard, Idiap-Com-03-2012

Integrating Posterior Features and Self-Organizing Maps for Isolated Word Recognition without Dynamic Programming, Serena Soldo and Mathew Magimai-Doss, Idiap-RR-17-2012

The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012

From Speech to Personality: Mapping Voice Quality and Intonation into Personality Differences, Gelareh Mohammadi, Antonio Origlia, Maurizio Pili and Alessandro Vinciarelli, in: in Proceedings of ACM Multimedia 2012, 2012

Microphone Array Beampattern Characterization for Hands-free Speech Applications, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, in: IEEE 7th Sensor Array and Multichannel Signal Processing Workshop(SAM), Hoboken, NJ, USA, pages 473-476, 2012

Sparsity in Topic Models, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: Practical Applications of Sparse Modeling: Biology, Signal Processing and Beyond, MIT Press, 2012

Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, Petr Motlicek, Laurent El Shafey, Roy Wallace, Chris McCool and Sébastien Marcel, Idiap-RR-18-2012

Template-based ASR using Posterior features and synthetic references: comparing different TTS systems, Serena Soldo, Mathew Magimai-Doss and Hervé Bourlard, in: SAPA-SCALE Conference, International Speech Communication Association, 2012

Gaze Estimation From Multimodal Kinect Data, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, Providence, RI, USA, 2012

[DOI]

On Speaker-Independent Personality Perception and Prediction from Speech, Polzehl Tim, Schoenenberg Katrin, Moller Sebastian, Metze Florian, Gelareh Mohammadi and Alessandro Vinciarelli, in: in Proceedings of INTERSPEECH 2012, 2012

Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Gwénolé Lecorvé and Petr Motlicek, in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012

Supervised and unsupervised Web-based language model domain adaptation, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012

Structured Sparse Coding for Microphone Array Location Calibration, Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard and Volkan Cevher, in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012

Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, Majid Yazdani and Andrei Popescu-Belis, in: Artificial Intelligence Journal, 194:176–202, 2013

[DOI]

Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Weifeng Li and Hervé Bourlard, in: Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech), Portland, Oregon, 2012

Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Weifeng Li and Hervé Bourlard, Idiap-RR-16-2012

Integrating Language Identification to improve Multilingual Speech Recognition, Holger Caesar, Idiap-RR-24-2012

Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings of Interspeech, Portland, Oregon, 2012

Emergent leaders through looking and speaking: from audio-visual data to multimodal recognition, Dairazalia Sanchez-Cortes, Oya Aran, Dinesh Babu Jayagopi, Marianne Schmid Mast and Daniel Gatica-Perez, in: Journal on Multimodal User Interfaces, 2012

Automatic detection of conflict escalation in spoken conversations, Samuel Kim, Sree Harsha Yella and Fabio Valente, in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012

Speaker diarization of overlapping speech based on silence distribution in meeting recordings, Sree Harsha Yella and Fabio Valente, in: INTERSPEECH, Portland, Oregon, USA, 2012

Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, Maryam Habibi and Andrei Popescu-Belis, Idiap-RR-14-2012

Extracting Informative Textual Parts from Web Pages Containing User-Generated Content, Nikolaos Pappas, Georgios Katsimpras and Efstathios Stamatatos, in: 12th International Conference on Knowledge Management and Knowledge Technologies, ACM ICPS, Graz, Austria, pages 4:1--4:8, ACM, 2012

[URL]

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, Idiap-RR-02-2013

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, Idiap-RR-01-2013

Synthetic References for Template-based ASR using Posterior Features, Serena Soldo, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, USA, 2012

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

Phase AutoCorrelation (PAC) features for noise robust speech recognition, Shajith Ikbal, Hemant Misra, Hynek Hermansky and Mathew Magimai-Doss, in: Speech Communication, 54(7):867–880, 2012

[DOI]

A Survey on Language Modeling using Neural Networks, Nikolaos Pappas and Thomas Meyer, Idiap-RR-32-2012

Notes on Probabilistic Linear Discriminant Analysis, Chris McCool and Laurent El Shafey, Idiap-Com-03-2013

Bob: a free signal processing and machine learning toolbox for researchers, André Anjos, Laurent El Shafey, Roy Wallace, Manuel Günther, Chris McCool and Sébastien Marcel, Idiap-RR-25-2012

Assessing Sparse Coding Methods for Contextual Shape Indexing of Maya Hieroglyphs, Edgar Roman-Rangel, Jean-Marc Odobez and Daniel Gatica-Perez, in: Journal of Multimedia, 7(2):179--192, 2012

Multivariate Boosting with Look-up Tables for Face Processing, Cosmin Atanasoaei, EPFL, 2012

Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA, 2012

Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Gwénolé Lecorvé and Petr Motlicek, Idiap-RR-21-2012

Supervised and unsupervised Web-based language model domain adaptation, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, Idiap-RR-22-2012

Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, in: Actes de la conference conjointe JEP-TALN-RECITAL 2012, Grenoble, France, pages 193-200, ATALA/AFCP, 2012

Audiovisual Diarization Of People In Video Content, Elie Khoury, Christine Sénac and Philippe Joly, in: Multimedia Tools and Applications, 2012

Combining transcription-based and acoustic-based speaker identifications for broadcast news, Elie Khoury, Antoine Laurent, Sylvain Meignier and Simon Petitrenaud, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012

Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, Chris McCool, Sébastien Marcel, Abdenour Hadid, Matti Pietikainen, Pavel Matejka, Jan Cernocky, Norman Poh, J. Kittler, Anthony Larcher, Christophe Levy, Driss Matrouf, Jean-François Bonastre, Phil Tresadern and Timothy Cootes, in: IEEE ICME Workshop on Hot Topics in Mobile Multimedia, 2012

Session Variability Modelling for Face Authentication, Chris McCool, Roy Wallace, Mitchell McLaren, Laurent El Shafey and Sébastien Marcel, Idiap-RR-17-2013

Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, Chris McCool, Sébastien Marcel, Abdenour Hadid, Matti Pietikainen, Pavel Matejka, Jan Cernocky, Norman Poh, J. Kittler, Anthony Larcher, Christophe Levy, Driss Matrouf, Jean-François Bonastre, Phil Tresadern and Timothy Cootes, Idiap-RR-13-2012

Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns, Andrei Popescu-Belis, Thomas Meyer, Jeevanthi Liyanapathirana, Bruno Cartoni and Sandrine Zufferey, in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), pages 5, 2012

Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis, Lakshmi Saheer, John Dines and Philip N. Garner, in: IEEE Transactions on Audio, Speech and Language Processing, 2012

COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH HIERARCHIAL LINEAR TRANSFORMATIONS, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012

On the Challenge of Classifying 52 Hand Movements from Surface Electromyography, Ilja Kuzborskij, Arjan Gijsberts and Barbara Caputo, in: 34th Annual Conference of the IEEE Engineering in Medicine & Biology Society, 2012

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, Idiap-RR-15-2012

Alternative search techniques for face detection using location estimation and binary features, Venkatesh Bala Subburaman, ECOLE POLYTECHNIQUE FEDERALE DE LAUSANNE, 2012

Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles, A. Sapru and Fabio Valente, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012, 2012

Bayesian Approaches to Uncertainty in Speech Processing, Philip N. Garner, School of Computing Sciences, University of East Anglia, 2011

Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies, Bruno Cartoni and Thomas Meyer, in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), Istanbul, TR, pages 6, 2012

Using Sense-labeled Discourse Connectives for Statistical Machine Translation, Thomas Meyer and Andrei Popescu-Belis, in: Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra), Avignon, FR, pages 129--138, 2012

Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, Samuel Kim, Fabio Valente and Alessandro Vinciarelli, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012

Progress report of a project in very low bit-rate speech coding, Milos Cernak, Philip N. Garner and Petr Motlicek, Idiap-RR-08-2012

From Nonverbal Cues to Perception: Personality and Social Attractiveness, Alessandro Vinciarelli, Hugues Salamin, Anna Polychroniou, Gelareh Mohammadi and Antonio Origlia, in: LNCS Proceedings on COGNITIVE BEHAVIOURAL SYSTEMS, Springer, 2012

Automatic Attribution of Personality Traits Based on Prosodic Features, Gelareh Mohammadi and Alessandro Vinciarelli, in: IEEE Transactions on Affective Computing, 2012

Translation Error Spotting from a User's Point of View, Thomas Meyer, Idiap-RR-31-2012

Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Decision tree clustering for KL-HMM, David Imseng and John Dines, Idiap-Com-01-2012

Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, Tatiana Tommasi, Francesco Orabona, Claudio Castellini and Barbara Caputo, Idiap-RR-07-2012

Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, Tatiana Tommasi, Francesco Orabona, Claudio Castellini and Barbara Caputo, in: IEEE TRANSACTIONS ON ROBOTICS, 2012

[DOI]

Transfer Learning of Visual Concepts across Robots: a Discriminative Approach, Sriram Prasath Elango, Tatiana Tommasi and Barbara Caputo, Idiap-RR-06-2012

The INTERSPEECH 2012 Speaker Trait Challenge, Björn Schuller, Stefan Steidl, Anton Batliner, Elmar Nöth, Alessandro Vinciarelli, Felix Burkhardt, Rob Van Son, felix Weninger, Florian Eyben, Tobias Bocklet, Gelareh Mohammadi and Benjamin Weiss, in: in Proceedings of INTERSPEECH, 2012

The ICSI RT-09 Speaker Diarization System, Gerald Friedland, Adam Janin, David Imseng, Xavier Anguera, Luke Gottlieb, Marijn Huijbregts, Mary Tai Knox and Oriol Vinyals, in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371--381, 2012

[DOI]

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011

A tree-based distance between distributions: application to classification of neurons, Riwal Lefort and Francois Fleuret, in: ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Morphodynamic profiling to explore spatio-temporal signaling networks regulating neurite outgrowth, L. Fusco, Kevin C. Smith, F. Benmansour, Riwal Lefort, Francois Fleuret, Pascal Fua and O. Pertz, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Automated Quantification of Morphodynamics for High-Throughput Live Cell Imaging Datasets, German Gonzalez, L. Fusco, Riwal Lefort, F. Benmansour, Pascal Fua and Kevin C. Smith, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Machine learning techniques to analyse complex, computer vision-extracted, dynamic cellular phenotypes, Riwal Lefort, L. Fusco, F. Benmansour, Kevin C. Smith, O. Pertz and Francois Fleuret, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, Afsaneh Asaei, Michael E. Davies, Hervé Bourlard and Volkan Cevher, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 7(2):553 -- 562, 2012

Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, Idiap-RR-03-2012

Using KL-divergence and multilingual information to improve ASR for under-resourced languages, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012

Hierarchical Tandem Features for ASR in Mandarin, Joel Pinto, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, 2011

Look at who's talking, M. Cristani, A. Pesarin, Alessandro Vinciarelli, M. Crocco and V. Murino, in: Proceedings of International Conference on Ambient Intelligence, pages 68-76, 2011

Recent Developments in Social Signal Processing, Albert Ali Salah, Maja Pantic and Alessandro Vinciarelli, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 380-385, 2011

Understanding Social Signals in Multi-party Conversations: Automatic Recognition of Socio-Emotional Roles in the AMI Meeting Corpus, Fabio Valente, Alessandro Vinciarelli, Sree Harsha Yella and A. Sapru, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 374-379, 2011

Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, M. Cristani, G. Paggetti, Alessandro Vinciarelli, L. Bazzani, G. Menegaz and V. Murino, in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011

Conversation Analysis at Work: Detection of Conflict in Competitive Discussions through Automatic Turn-Organization Analysis, A. Pesarin, M. Cristani, V. Murino and Alessandro Vinciarelli, in: Cognitive Processing, 2012

Bridging the Gap Between Social Animal and Unsocial Machine: A Survey of Social Signal Processing, Alessandro Vinciarelli, Maja Pantic, Dirk Heylen, C. Pelachaud, I. Poggi, F. D'Errico and M. Schroeder, in: IEEE Transactions on Affective Computing, 2012

Automatic Role Recognition in Multiparty Conversations: an Approach Based on Turn Organization, Prosody and Conditional Random Fields, Hugues Salamin and Alessandro Vinciarelli, in: IEEE Transactions on Multimedia, 2012

Introduction to Sequence Analysis for Human Behavior Understanding, Hugues Salamin and Alessandro Vinciarelli, in: "Computer Analysis of Human Behavior" by A.Salah and T.Gevers (eds.), pages 21-40, Springer Verlag, 2011

Social Signal Processing: The Research Agenda, Maja Pantic, R. Cowie, F. D'Errico, Dirk Heylen, M. Mehu, C. Pelachaud, I. Poggi, M. Schroeder and Alessandro Vinciarelli, in: "Visual Analysis of Humans" by T.B.Moeslund, A.Hilton, V.Krueger and L.Sigal (eds.), pages 511-538, Springer Verlag, 2011

Analysis of Verbal and Nonverbal Communication and Enactment: The Processing Issues, A. Esposito, Alessandro Vinciarelli, K. Vicsi, C. Pelachaud and A. Nijholt, Springer Verlag, 2011

Open-ended Learning of Visual and Multi-modal Patterns, Jie Luo, Ecole polytechnique fédérale de Lausanne, 2011

A Bimodal Sound Source Model for Vehicle Tracking in Traffic Monitoring, Patrick Marmaroli, Jean-Marc Odobez, Xavier Falourd and Hervé Lissek, in: European Signal Processing Conference, 2011

Torch7: A Matlab-like Environment for Machine Learning, Ronan Collobert, Koray Kavukcuoglu and Clément Farabet, in: BigLearn, NIPS Workshop, 2011

Learning Structured Embeddings of Knowledge Bases, Antoine Bordes, Jason Weston, Ronan Collobert and Yoshua Bengio, in: Conference on Artificial Intelligence, 2011

Deep Learning for Efficient Discriminative Parsing, Ronan Collobert, in: International Conference on Artificial Intelligence and Statistics, 2011

Natural Language Processing (Almost) from Scratch, Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu and Pavel Kuksa, in: Journal of Machine Learning Research, 12:2493-2537, 2011

Fast Human Detection from Joint Appearance and Foreground Feature Subset Covariances, Jian Yao and Jean-Marc Odobez, in: Computer Vision and Image Understanding, 115(10):1414-1426, 2011

Evaluation of Meeting Support Technology, Simon Tucker and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 237-252, Cambridge University Press, 2012

User Requirements for Meeting Support Technology, Denis Lalanne and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 210-221, Cambridge University Press, 2012

Multimodal Signal Processing for Meetings: an Introduction, Andrei Popescu-Belis and Jean Carletta, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012

BROADBAND BEAMPATTERN FOR MULTI-CHANNEL SPEECH ACQUISITION AND DISTANT SPEECH RECOGNITION, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, Idiap-RR-39-2011

Finding Audio-Visual Events in Informal Social Gatherings, Xavier Alameda-Pineda, Vasil Khalidov, Radu Horaud and Florence Forbes, in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011

Engagement-based Multi-party Dialog with a Humanoid Robot, David Klotz, Johannes Wienke, Julia Peltason, Britta Wrede, Sebastian Wrede, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011

Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-38-2011

OM-2: An Online Multi-class Multi-kernel Learning Algorithm, Jie Luo, Francesco Orabona, Marco Fornoni, Barbara Caputo and Nicolo Cesa-Bianchi, in: In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop, 2010

Analysis of Group Conversations: Modeling Social Verticality, Oya Aran and Daniel Gatica-Perez, in: Computer Analysis of Human Behavior, pages 293-322, Springer London, 2011

A Nonverbal Behavior Approach to Identify Emergent Leaders in Small Groups, Dairazalia Sanchez-Cortes, Oya Aran, Marianne Schmid Mast and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 14(3-2):816-832, 2012

[DOI]

Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis, John Dines, Hui Liang, Lakshmi Saheer, Matthew Gibson, William Byrne, Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Simon King, Mirjam Wester, Teemu Hirsimäki, Reima Karhila and Mikko Kurimo, in: Computer Speech and Language, 2011

[DOI]
[URL]

Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, Remi Emonet, in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011

Speaker Diarization, Fabio Valente and Gerald Friedland, in: Multimodal Signal Processing: Human Interactions in Meetings, Cambridge University Press, 2012

[URL]

Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus, Fabio Valente and Alessandro Vinciarelli, in: Proceedings of Interspeech, 2011

Analysis and Comparison of Recent MLP Features for LVCSR Systems, Fabio Valente, Mathew Magimai-Doss and Wen Wang, in: Proceedings of Interspeech 2011, 2011

Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Speech Communication, 54(1), 2012

[DOI]

Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features, Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Suman Ravuri and Wen Wang, in: IEEE Transactions on Audio, Speech, and Language Processing, 19(8), 2011

[DOI]

Data-driven extraction of spectral-dynamics based posteriors, Fabio Valente, in: Handbook of Natural Language Processing and Machine Translation Handbook of Natural Language Processing and Machine Translation, Springer, 2011

[URL]

Speaker Diarization of Meetings based on Speaker Role N-gram Models, Fabio Valente, Deepu Vijayasenan and Petr Motlicek, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011

MULTISTREAM SPEAKER DIARIZATION THROUGH INFORMATION BOTTLENECK SYSTEM OUTPUTS COMBINATION, Deepu Vijayasenan, Fabio Valente and Petr Motlicek, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011

Overview of the CLEF 2009 medical image annotation track, Tatiana Tommasi, Barbara Caputo, Petra Welter, Mark O. Güld and Thomas M Deserno, in: Workshop of the Cross-Language Evaluation Forum, Corfu, Greece, pages 85-93, Springer Berlin Heidelberg, 2009

[DOI]

Object Recognition using Visuo-Affordance Maps, Arjan Gijsberts, Tatiana Tommasi, Giorgio Metta and Barbara Caputo, in: International Conference on Intelligent Robots and Systems, Taipei, pages 1572-1578, IEEE, 2010

[DOI]

Towards a quantitative measure of rareness, Tatiana Tommasi and Barbara Caputo, in: DIRAC Workshop at the European Conference on Machine Learning, pages 129-136, Springer Berlin Heidelberg, 2010

[DOI]

Transferring Activities: Updating Human Behavior Analysis, Fabian Nater, Tatiana Tommasi, Helmut Grabner, Luc Van Gool and Barbara Caputo, in: Visual Surveillance Workshop at ICCV, 2011

Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Proceedings of IEEE Computer Vision and Pattern Recognition Conference, San Francisco, CA, pages 3081-3088, IEEE, 2010

[DOI]

Using Modality Replacement to Facilitate Communication between Visually and Hearing-Impaired People, K. Moustakas, D. Tzovaras, L. Dybkjaer, N. Bernsen and Oya Aran, in: IEEE Multimedia, 18(2):26-37, 2011

[DOI]

Domain-specific language model adaptation: a case study, Gwénolé Lecorvé, Petr Motlicek and John Dines, Idiap-Com-01-2013

VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, Lakshmi Saheer, Hui Liang, John Dines and Philip N. Garner, Idiap-RR-12-2012

Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, Idiap-RR-11-2012

Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, in: International Joint Conference on Biometrics, 2011

An Audio Visual Corpus for Emergent Leader Analysis, Dairazalia Sanchez-Cortes, Oya Aran and Daniel Gatica-Perez, in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011

Robustness of Group Delay Representations for Noisy Speech Signals, Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan and Hema A Murthy, Idiap-RR-36-2011

Robustness of Group Delay Representations for Noisy Speech Signals, Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan and Hema A Murthy, in: IJST (Springer), 14(4), 2011

Privacy-Sensitive Audio Features for Conversational Speech Processing, Sree Hari Krishnan Parthasarathi, Ecole Polytechnique Fédérale de Lausanne, 2011

Human Interaction Discovery in Smartphone Proximity Networks, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: Personal and Ubiquitous Computing, 2012

Joint Optimization of Hidden Conditional Random Fields and Non Linear Feature Extraction, Antoine Vinel, Trinh-Minh-Tri Do and Thierry Artieres, in: Proceedings of International Conference on Document Analysis and Recognition, 2011

Mining Large-Scale Smartphone Data for Personality Studies, Gokul Chittaranjan, Jan Blom and Daniel Gatica-Perez, in: Personal and Ubiquitous Computing, 2012

Boosting Localized Features for Speaker and Speech Recognition, Anindya Roy, Ecole Polytechnique Federale de Lausanne (EPFL), 2011

Multi-camera Open Space Human Activity Discovery for Anomaly Detection, Remi Emonet, Jagannadan Varadarajan and Jean-Marc Odobez, in: 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings, Sree Harsha Yella and Fabio Valente, in: Interspeech, Florence, Italy, pages 953-956, 2011

Continuous Speech Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-35-2011

Multimodal Cue Detection Engine for Orchestrated Entertainment, Danil Korchagin, Stefan Duffner, Petr Motlicek and Carl Scheffler, in: Proceedings International Conference on MultiMedia Modeling, Klagenfurt, Austria, 2012

Multimodal Cue Detection Engine for Orchestrated Entertainment, Danil Korchagin, Stefan Duffner, Petr Motlicek and Carl Scheffler, Idiap-RR-34-2011

Estimating Cohesion in Small Groups Using Audio-Visual Nonverbal Behavior, Hayley Hung and Daniel Gatica-Perez, in: IEEE Trans. on Multimedia, Special Issue on Multimodal Affective Interaction, 12(6):563 - 575, 2010

Competition on Counter Measures to 2-D Facial Spoofing Attacks, Murali Mohan Chakka, André Anjos, Sébastien Marcel, Roberto Tronci, Daniele Muntoni, Gianluca Fadda, Maurizio Pili, Nicola Sirena, Gabriele Murgia, Marco Ristori, Fabio Roli, Junjie Yan, Dong Yi, Zhen Lei, Zhiwei Zhang, Stan Z.Li, William Robson Schwartz, Anderson Rocha, Helio Pedrini, Javier Lorenzo-Navarro, Modesto Castrillón-Santana, Jukka Maatta, Abdenour Hadid and Matti Pietikainen, in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011

Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, David Imseng, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011

Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, David Imseng, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-01-2012

Comparing machines and humans on a visual categorization test, Francois Fleuret, Ting Li, Charles Dubout, Emma K. Wampler, Steven Yantis and Donald Geman, in: Proceedings of the National Academy of Sciences, 2011

Boosting with Maximum Adaptive Sampling, Charles Dubout and Francois Fleuret, in: Proceedings of the Neural Information Processing Systems Conference, 2011

Detection-Based Multi-Human Tracking Using a CRF Model, Alexandre Heili, Cheng Chen and Jean-Marc Odobez, in: The Eleventh IEEE International Workshop on Visual Surveillance, 2011

A Joint Estimation of Head and Body Orientation Cues in Surveillance Video, Cheng Chen, Alexandre Heili and Jean-Marc Odobez, in: IEEE International Workshop on Socially Intelligent Surveillance and Monitoring, 2011

Smartphone usage in the wild: a large-scale analysis of applications and context, Trinh-Minh-Tri Do, Jan Blom and Daniel Gatica-Perez, in: 13th International Conference on Multimodal Interaction, 2011

Building 'directional corpora' for unbiased contrastive analysis, Bruno Cartoni and Thomas Meyer, in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 29-30, 2011

Disambiguating discourse connectives using parallel corpora: senses vs. translations, Thomas Meyer, Charlotte Roze, Bruno Cartoni, Laurence Danlos, Sandrine Zufferey and Andrei Popescu-Belis, in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 104-105, 2011

A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation, Thomas Meyer, Andrei Popescu-Belis, Jeevanthi Liyanapathirana and Bruno Cartoni, in: Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?", Hamburg, Germany, pages 5, 2011

A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 7(1):241-254, 2012

Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011

VlogSense: Conversational Behavior and Social Attention in YouTube, Joan-Isaac Biel and Daniel Gatica-Perez, in: Transactions on Multimedia Computing, Communications and Applications, 2011

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, Idiap-RR-04-2012

HEAT: Iterative Relevance Feedback with One Million Images, Nicolae Suditu and Francois Fleuret, in: Proceedings of the IEEE International Conference on Computer Vision, pages 2118-2125, 2011

Multimodal Signal Processing: Human Interactions in Meetings, Steve Renals, Hervé Bourlard, Jean Carletta and Andrei Popescu-Belis, Cambridge University Press, 2012

[URL]

A Just-in-Time Document Retrieval System for Dialogues or Monologues, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, Portland, OR, pages 350-352, 2011

Finding Information in Multimedia Records of Meetings, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, in: IEEE Multimedia, 19(2):48-57, 2012

[DOI]
[URL]

A Speech-based Just-in-Time Retrieval System using Semantic Search, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), Portland, OR, pages 80-86, 2011

[URL]

Learning from Images with Captions Using the Maximum Margin Set Algorithm, Jie Luo, Francesco Orabona, Barbara Caputo and Vittorio Ferrari, Idiap-RR-30-2011

A Large-Scale Database of Images and Captions for Automatic Face Naming, Mert Ozcan, Jie Luo, Vittorio Ferrari and Barbara Caputo, in: Proceedings of the 22nd British Machine Vision Conference, 2011

A Large-Scale Database of Images and Captions for Automatic Face Naming, Mert Ozcan, Jie Luo, Vittorio Ferrari and Barbara Caputo, Idiap-RR-26-2011

Multiclass Transfer Learning from Unconstrained Priors, Jie Luo, Tatiana Tommasi and Barbara Caputo, in: Proceedings of the 13th International Conference on Computer Vision, 2011

Multiclass Transfer Learning from Unconstrained Priors, Jie Luo, Tatiana Tommasi and Barbara Caputo, Idiap-RR-25-2011

Joint Adaptive Colour Modelling and Skin, Hair and Clothing Segmentation Using Coherent Probabilistic Index Maps, Carl Scheffler and Jean-Marc Odobez, in: British Machine Vision Conference, British Machine Vision Association, Dundee, UK, 2011

Speech Enhancement using Beta-order MMSE Spectral Amplitude Estimator with Laplacian Prior, Hamid Reza Abutalebi, Mehdi Rashidinejad, Hervé Bourlard and Ali Akbar Tadaion, Idiap-RR-24-2011

Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pages 5192 - 5195, 2011

[DOI]

Improving Articulatory Feature and Phoneme Recognition using Multitask Learning, Ramya Rasipuram and Mathew Magimai-Doss, in: Artificial Neural Networks and Machine Learning - ICANN 2011, pages 299-306, Springer Berlin / Heidelberg, 2011

[DOI]
[URL]

Inferring truth from multiple annotators for social interaction analysis, Gokul Chittaranjan, Oya Aran and Daniel Gatica-Perez, in: Neural Information Processing Systems (NIPS) Workshop on Modeling Human Communication Dynamics (HCD), pages 4, 2011

Who's Who with Big-Five: Analyzing and Classifying Personality Traits with Smartphones, Gokul Chittaranjan, Jan Blom and Daniel Gatica-Perez, in: International Symposium on Wearable Computing, pages 8, 2011

Exploiting observers' judgements for nonverbal group interaction analysis, Gokul Chittaranjan, Oya Aran and Daniel Gatica-Perez, in: IEEE Conference on Automatic Face and Gesture Recognition, pages 6, IEEE, 2011

An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, Mohammad J. Taghizadeh, Philip N. Garner, Hervé Bourlard, Hamid Reza Abutalebi and Afsaneh Asaei, in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011

Intuitive Recipes for Uncertainty Decoding with SNR Features for Noise Robust ASR, Georgios Skoumas and Philip N. Garner, Idiap-RR-23-2011

Privacy-sensitive recognition of group conversational context with sociometers, Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland and Daniel Gatica-Perez, in: Springer Multimedia Systems Journal, 2011

Multi-party Speech Recovery Exploiting Structured Sparsity Models, Afsaneh Asaei, Mohammad J. Taghizadeh, Hervé Bourlard and Volkan Cevher, in: Proceedings of Interspeech, 2011

Model-based Compressive Sensing for Multi-party Distant Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Volkan Cevher, in: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech, Cong-Thanh Do, Dominique Pastor and André Goalic, in: Speech Communication, 2011

[DOI]

Grapheme-based Automatic Speech Recognition using KL-HMM, Mathew Magimai-Doss, Ramya Rasipuram, Guillermo Aradilla and Hervé Bourlard, in: Proceedings of Interspeech, 2011

The MASH Project, Francois Fleuret, Philip Abbet, Charles Dubout and Leonidas Lefakis, in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011

Using a Wikipedia-based Semantic Relatedness Measure for Document Clustering., Majid Yazdani and Andrei Popescu-Belis, in: Graph-based Methods for Natural Language Processing, 2011

Humans as Feature Extractors: Combining Prosody and Personality Perception for Better Speaking Style Recognition, Gelareh Mohammadi and Alessandro Vinciarelli, in: Proceeding of IEEE Int Conference on Systems, Man, and Cybernetics - Special Sessions, 2011

Tracking Multiple Objects under Global Appearance Constraints, Horesh Ben Shitrit, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2011

A real-time deformable detector., Karim Ali, Francois Fleuret, David Hasler and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012

Multitask Learning to Improve Articulatory Feature Estimation and Phoneme Recognition, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-21-2011

Sensing the `Health State` of our Society, Anmol Madan, Manuel Cebrian, Sai Moturu, Katayoun Farrahi and Alex Pentland, in: IEEE Pervasive Computing, Special Issue on Large-Scale Opportunistic Sensing, 2011

Pervasive Sensing to Model Political Opinions in Face-to-Face Networks, Anmol Madan, Katayoun Farrahi, Daniel Gatica-Perez and Alex Pentland, in: Pervasive, San Francisco, 2011

A Probabilistic Approach to Socio-Geographic Reality Mining, Katayoun Farrahi, Ecole Polytechnique Fédérale de Lausanne, 2011

GroupUs: Smartphone Proximity Data and Human Interaction Type Mining, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: 15th annual International Symposium on Wearable Computers, San Francisco, USA, 2011

Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, Danil Korchagin, in: Proceedings European Signal Processing Conference, Barcelona, Spain, 2011

Improving non-native ASR through stochastic multilingual phoneme space transformations, David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai-Doss, in: Proceedings of Interspeech, Florence, Italy, pages 537-540, 2011

Improving non-native ASR through stochastic multilingual phoneme space transformations, David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai-Doss, Idiap-RR-19-2011

AN INTEGRATED FRAMEWORK FOR MULTI-CHANNEL MULTI-SOURCE LOCALIZATION AND VOICE ACTIVITY DETECTION, Mohammad J. Taghizadeh, Philip N. Garner, Hervé Bourlard, Hamid Reza Abutalebi and Afsaneh Asaei, Idiap-RR-16-2011

Modeling and understanding communities in online social media using probabilistic methods, Radu-Andrei Negoescu, Ecole polytechnique fédérale de Lausanne, 2011

[DOI]
[URL]

How Comparable are Parallel Corpora? Measuring the Distribution of General Vocabulary and Connectives, Bruno Cartoni, Sandrine Zufferey, Thomas Meyer and Andrei Popescu-Belis, in: Proceedings of 4th Workshop on Building and Using Comparable Corpora, ACL, Portland, OR, pages 78--86, 2011

A BSS-based Approach for Localization of Simultaneous Speakers in Reverberant Conditions, Hamid Reza Abutalebi, Hedieh Heli, Danil Korchagin and Hervé Bourlard, in: Proceedings of the 19th European Signal Processing Conference (EUSIPCO), 2011

LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, in: Interspeech, 2011

LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-14-2011

Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-28-2012

Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard and Mathew Magimai-Doss, in: IEEE Transactions on Audio, Speech, and Language Processing, 19(8), 2011

A Compressive Sensing Based Compressed Neural Network for Sound Source Localization, Mehdi Banitalebi Dehkordi, Hamid Reza Abutalebi and Hossein Ghanei, in: Proceedings of International Symposium on Artificial Intelligence and Signal Processing, 2011

Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, Idiap-RR-28-2011

Multilingual Annotation and Disambiguation of Discourse Connectives for Machine Translation, Thomas Meyer, Andrei Popescu-Belis, Sandrine Zufferey and Bruno Cartoni, in: Proceedings of 12th SIGdial Meeting on Discourse and Dialogue, Association for Computational Linguistics, Portland, OR, pages 194--203, 2011

Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, Francesco Orabona and Jie Luo, in: Proceedings of the 28th International Conference on Machine Learning, 2011

Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, Francesco Orabona and Jie Luo, Idiap-RR-11-2011

You Are Known by How You Vlog: Personality Impressions and Nonverbal Behavior in YouTube, Joan-Isaac Biel, Oya Aran and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, Barcelona, 2011

Just-in-Time Multimodal Association and Fusion from Home Entertainment, Danil Korchagin, Petr Motlicek, Stefan Duffner and Hervé Bourlard, in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011

Social Focus of Attention as a Time Function Derived from Multimodal Signals, Danil Korchagin and Hamid Reza Abutalebi, in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011

Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, Danil Korchagin, in: Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, Edinburgh, UK, 2011

Disambiguating Temporal-Contrastive Discourse Connectives for Machine Translation, Thomas Meyer, in: Proceedings of ACL-HLT 2011 Student Session, Association for Computational Linguistics, Portland, OR, pages 46--51, 2011

Multi-party Speech Recovery Exploiting Structured Sparsity Models, Afsaneh Asaei, Mohammad J. Taghizadeh, Hervé Bourlard and Volkan Cevher, Idiap-RR-22-2011

Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model, Remi Emonet, Jagannadan Varadarajan and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition, 2011

Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions, Hamid Reza Abutalebi and Hossein Momenzadeh, in: EURASIP Journal on Advances in Signal Processing, 2011

[DOI]

Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, Danil Korchagin, Idiap-RR-20-2011

Social Focus of Attention as a Time Function Derived from Multimodal Signals, Danil Korchagin and Hamid Reza Abutalebi, Idiap-RR-09-2011

HEAT: Iterative Relevance Feedback with One Million Images, Nicolae Suditu and Francois Fleuret, Idiap-RR-33-2011

A Speech-based Just-in-Time Retrieval System using Semantic Search, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, Idiap-RR-31-2011

When Users Meet Technology: The Meeting Browser Development Helix, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, Idiap-RR-05-2011

Verified Speaker Localization Utilizing Voicing Level in Split-bands, Afsaneh Asaei, Mohammad J. Taghizadeh, Marjan Bahrololum and Mohammed Ghanbari, in: Signal Processing, 89(6):1038-1049, 2009

Multiple Object Tracking using K-Shortest Paths Optimization, Jerome Berclaz, Engin Turetken, Francois Fleuret and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011

FlowBoost - Appearance Learning from Sparsely Annotated Video, Karim Ali, David Hasler and Francois Fleuret, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2011

Delineating Trees in Noisy 2D Images and 3D Image Stacks, German Gonzalez, Engin Turetken, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2799–2806, 2010

Joint Cascade Optimization Using a Product Of Boosted Classifiers, Leonidas Lefakis and Francois Fleuret, in: Proceedings of the Neural Information Processing Systems Conference, pages 1315–1323, 2010

Using object affordances to improve object recognition, Claudio Castellini, Tatiana Tommasi, Nicoletta Noceti, Francesca Odone and Barbara Caputo, in: IEEE Transaction on Autonomous Mental Development, 2011

Towards semi-supervised learning of semantic spatial concepts for mobile robots, Jesus Martinez-Gomez and Barbara Caputo, in: Journal of Physical Agents, 2011

Phoneme Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011

Posterior Features for Template-based ASR, Serena Soldo, Mathew Magimai-Doss, Joel Praveen Pinto and Hervé Bourlard, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, 2011

Cue integration through discriminative accumulation, Maria Elena Nilsback and Barbara Caputo, in: International Conference on Computer Vision and Pattern Recognition, 2004

Towards semi-supervised learning of semantic spatial concepts, Jesus Martinez-Gomez and Barbara Caputo, in: IEEE International Conference on Robotics and Automation, 2011

Towards semi-supervised learning of semantic spatial concepts, Jesus Martinez-Gomez and Barbara Caputo, Idiap-RR-03-2011

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai-Doss and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prag, CZ, pages 5012-5015, 2011

Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, Danil Korchagin, Idiap-RR-08-2011

Call me Guru: user categories and large-scale behavior in YouTube, Joan-Isaac Biel and Daniel Gatica-Perez, in: Social Media Computing, Springer, 2011

Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010

Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives, Andrei Popescu-Belis, Jonathan Kilgour, Peter Poller, Alexandre Nanchen, Erik Boertjes and Joost de Wit, in: Proceedings of the 33rd Annual ACM SIGIR Conference, Geneva, Switzerland, pages 703, 2010

[DOI]

Socially Intelligent Surveillance and Monitoring: Analysing Social Dimensions of Physical Space, V. Murino, M. Cristani and Alessandro Vinciarelli, in: Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, San Francisco, pages 51-58, 2010

Mobile Social Signal Processing: vision and research issues, Alessandro Vinciarelli, Roderick Murray-Smith and Hervé Bourlard, in: Proceedings of the International Workshop on Mobile HCI, Lisbon, pages 513-516, 2010

Human Behavior Understanding, Alessandro Vinciarelli, Springer Verlag, 2010

Computational modeling of face-to-face social interaction using nonverbal behavioral cues, Dinesh Babu Jayagopi, Ecole Polytechnique Fédérale de Lausanne, 2011

Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, Stefan Duffner and Jean-Marc Odobez, in: IEEE Conference on Automatic Face and Gesture Recognition, pages 525-530, IEEE, 2011

Finding Information in Multimedia Records of Meetings, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, Idiap-RR-32-2011

Automatic Identification of Discourse Markers in Multiparty Dialogues: An In-Depth Study of Like and Well, Andrei Popescu-Belis and Sandrine Zufferey, in: Computer Speech and Language, 25(3):499-518, 2011

[DOI]

Multi-Person Bayesian Tracking with Multiple Cameras, Jian Yao and Jean-Marc Odobez, in: Multi-camera networks: principles and applications, pages 363-388, Academic Press, 2009

View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, Stéphanie Lefèvre and Jean-Marc Odobez, in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010

Identifying Emergent Leadership in Small Groups using Nonverbal Communicative Cues, Dairazalia Sanchez-Cortes, Oya Aran, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, Beijing, ACM New York, NY, USA ©2010, 2010

[DOI]

Speech Enhancement using an Improved MMSE Estimator with Laplacian Prior, Mehdi Rashidinejad, Hamid Reza Abutalebi and Ali Akbar Tadaion, in: Proceedings of 5th International Symposium on Telecommunications, 2010

Single Channel Speech Separation with a Frame-based Pitch Range Estimation Method in Modulation Frequency, Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanianzadeh and Hamid Sheikhzadeh, in: Proceedings of 5th International Symposium on Telecommunications, 2010

Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain, Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanianzadeh and Hamid Sheikhzadeh, in: Proceedings of 5th International Symposium on Telecommunications, 2010

Social Network Analysis for Automatic Role Recognition, Sarah Favre, Ecole Polytechnique Fédérale de Lausanne, 2010

Learning a 3D Human Pose Distance Metric from Geometric Pose Descriptor, Cheng Chen, in: IEEE Transactions on Visualization and Computer Graphics, 17(11):1676-1689, 2011

3D human pose recovery from image by efficient visual feature selection, Cheng Chen, Yi Yang, Feiping Nie and Jean-Marc Odobez, in: Computer Vision and Image Understanding, 115(3), 2011

Discovering Human Places of Interest from Multimodal Mobile Phone Data, Raul. Montoliu and Daniel Gatica-Perez, in: Proceedings of 9th International Conference on on Mobile and Ubiquitous Multimedia (MUM,',','), Limassol, Cyprus, 2010

Parts-Based Face Verification using Local Frequency Bands, Chris McCool and Sébastien Marcel, Idiap-RR-06-2011

Feature distribution modelling techniques for 3D face recognition, Chris McCool, Jordi Sanchez-Riera and Sébastien Marcel, in: Pattern Recognition Letters, 31:1324-1330, 2010

An Information Theoretic Approach to Speaker Diarization of Meeting Recordings, Deepu Vijayasenan, Ecole polytechnique fédérale de Lausanne, 2010

Hierarchical and Parallel Processing of Auditory and Modulation Frequencies for Automatic Speech Recognition, Fabio Valente, in: Speech Communication, 52(10):790-800, 2010

[DOI]

Multi-Stream Speech Recognition based on Dempster-Shafer Combination Rule, Fabio Valente, in: Speech Communication, 52(3):213-222, 2010

[DOI]

VARIATIONAL BAYESIAN SPEAKER DIARIZATION OF MEETING RECORDINGS, Fabio Valente, Petr Motlicek and Deepu Vijayasenan, in: Proceedings of ICASSP, 2010

A Comparative Study of MLP Front-ends for Mandarin ASR, Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Ravuri Suman and Wang Wen, in: Proceedings of Interspeech, Japan, 2010

Improving Speech Processing trough Social Signals: Automatic Speaker Segmentation of Political Debates using Role based Turn-Taking Patterns., Fabio Valente and Alessandro Vinciarelli, in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010

Social Signal Processing: Understanding Nonverbal Communication in Social Interactions, Alessandro Vinciarelli and Fabio Valente, in: Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands), 2010

Automatic Time Skew Detection and Correction, Danil Korchagin, in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011

Integrating Articulatory Features using Kullback-Leibler Divergence based Acoustic Model for Phoneme Recognition, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-02-2011

Hierarchical Tandem Features for ASR in Mandarin, Joel Praveen Pinto, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-39-2010

Automatic Time Skew Detection and Correction, Danil Korchagin, Idiap-RR-42-2010

Face detection using boosted Jaccard distance-based regression, Cosmin Atanasoaei, Chris McCool and Sébastien Marcel, Idiap-RR-02-2012

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010

Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, Radu-Andrei Negoescu, Alexander Loui and Daniel Gatica-Perez, in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010

Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of Interspeech, 2010

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Transactions on Audio Speech and Language Processing, 19(2), 2011

[DOI]

Fast Bounding Box Estimation based Face Detection, Venkatesh Bala Subburaman and Sébastien Marcel, in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010

[URL]

Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, Stefan Duffner and Jean-Marc Odobez, Idiap-RR-01-2011

The TA2 Database - A Multi-Modal Database from Home Entertainment, Stefan Duffner, Petr Motlicek and Danil Korchagin, in: International Conference on Signal Acquisition and Processing, Singapore, 2011

The TA2 Database - A Multi-Modal Database from Home Entertainment, Stefan Duffner, Petr Motlicek and Danil Korchagin, Idiap-RR-37-2010

Recognizing conversational context in group interaction using privacy-sensitive mobile sensors, Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland and Daniel Gatica-Perez, in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010

Learning from Candidate Labeling Sets, Jie Luo and Francesco Orabona, in: Advances in Neural Information Processing Systems 23 (NIPS10), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2010

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai-Doss and John Dines, Idiap-RR-13-2011

Discovering Routines from Large-Scale Human Locations using Probabilistic Topic Models, Katayoun Farrahi and Daniel Gatica-Perez, in: ACM Transactions on Intelligent Systems and Technology, 2(1), 2011

Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, Katayoun Farrahi and Daniel Gatica-Perez, in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010

Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Volkan Cevher, Idiap-RR-04-2011

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, Idiap-RR-36-2010

Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, Alessandro Vinciarelli and Gelareh Mohammadi, in: "Affective Computing and Interaction: Psychological, Cognitive and Neuroscientific Perspectives" by D. Gokcay & G. Yildirim (eds.), igi-global, 2010

The Voice of Personality: Mapping Nonverbal Vocal Behavior into Trait Attributions, Gelareh Mohammadi, Alessandro Vinciarelli and Marcello Mortillaro, in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010

More than Words: Inference of Socially Relevant Information from Nonverbal Vocal Cues in Speech, Hugues Salamin, Gelareh Mohammadi, Khiet Truong and Alessandro Vinciarelli, in: Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues, A.Esposito (ed.), LNCS,Springer, 2010

Automatic Role Recognition Based on Conversational and Prosodic Behaviour, Hugues Salamin, Khiet Truong, Gelareh Mohammadi and Alessandro Vinciarelli, in: Proceedings of the ACM International Conference on Multimedia, 2010

On-line unsupervised adaptation for face verification using Gaussian Mixture Models with multiple user models, Niklas Johansson, Chris McCool and Sébastien Marcel, Idiap-RR-07-2011

Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, Laurent El Shafey, Roy Wallace and Sébastien Marcel, Idiap-RR-37-2011

Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, Idiap-RR-34-2010

Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, in: IEEE Journal of Selected Topics in Signal Processing, in print, 2010

Tuning-Robust Initialization Methods for Speaker Diarization, David Imseng and Gerald Friedland, in: IEEE Transactions on Audio, Speech, and Language Processing, 18(8):2028-2037, 2010

[DOI]

A Multi Cue Discriminative Approach to Semantic Place Classification, Marco Fornoni, Jesus Martinez-Gomez and Barbara Caputo, in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010

The Wolf Corpus: Exploring group behaviour in a competitive role-playing game, Hayley Hung and Gokul Chittaranjan, in: ACM Multimedia, 2010

Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, Gokul Chittaranjan and Hayley Hung, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

Automatic nonverbal analysis of social interaction in small groups: A review, Daniel Gatica-Perez, in: Image and Vision Computing, Special Issue on Human Behavior, 27(12), 2009

YOU ARE FIRED! NONVERBAL ROLE ANALYSIS IN COMPETITIVE MEETINGS, Raducanu Bogdan, Vitria J. and Daniel Gatica-Perez, in: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP,',','), Taiwan., 2009

Modeling interest in face-to-face conversations from multimodal nonverbal behavior, Daniel Gatica-Perez, in: In J.-P. Thiran, H. Bourlard, and F. Marques, (Eds.,',','), Multimodal Signal Processing, Academic Press, Academic Press, 2009

Voices of Vlogging, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010

Towards rich mobile phone datasets: Lausanne data collection campaign, N. Kiukkonen, Blom J., O. Dousse, Daniel Gatica-Perez and J. K. Laurila, in: Proc. ACM Int. Conf. on Pervasive Services (ICPS,',','), Berlin., 2010

Visual Attention, Speaking Activity, and Group Conversational Analysis in Multi-Sensor Environments, Daniel Gatica-Perez and Jean-Marc Odobez, in: In H. Nakashima, J. Augusto, H. Aghajan (Eds.,',','), Handbook of Ambient Intelligence and Smart Environments, Springer, 2010

Inferring competitive role patterns in reality TV show through nonverbal analysis, Raducanu Bogdan and Daniel Gatica-Perez, in: Multimedia Tools and Applications, Special issue on Social Media, 2010

Estimating Dominance in Multi-Party Meetings Using Speaker Diarization, Hayley Hung, Yan Huang, Gerald Friedland and Daniel Gatica-Perez, in: IEEE Transactions on Audio, Speech, and Language Processing, 19(4):847-860, 2011

Mining group nonverbal conversational patterns using probabilistic topic models, Dinesh Babu Jayagopi and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 2010

Modeling and Understanding Flickr Communities through Topic-based Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 12(5), 2010

[DOI]

Predicting Remote Versus Collocated Group Interactions using Nonverbal Cues, Dairazalia Sanchez-Cortes, Dinesh Babu Jayagopi and Daniel Gatica-Perez, in: Proc. Int. Conf. on Multimodal Interfaces, Workshop on Multimodal Sensor-Based Systems and Mobile Phones for Social Computing,, Cambridge, 2009

[DOI]

Introducing Crossmodal Biometrics:Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010

Hands Free Audio Analysis from Home Entertainment, Danil Korchagin, Philip N. Garner and Petr Motlicek, in: Proceedings of Interspeech, Makuhari, Japan, 2010

A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, Majid Yazdani and Andrei Popescu-Belis, in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010 ), Carnegie Mellon University, Pittsburgh, PA, USA, 2010

Towards a standard for dialogue act annotation, Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Fang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, Claudia Soria and Traum. David, in: 7th International Conference on Language Resources and Evaluation, Malta, 2010

[URL]

The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites, Andrei Popescu-Belis, Jonathan Kilgour, Alexandre Nanchen and Peter Poller, in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010

The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, Andrei Popescu-Belis, Jonathan Kilgour, Alexandre Nanchen and Peter Poller, Idiap-RR-26-2010

English Spoken Term Detection in Multilingual Recordings, Petr Motlicek, Fabio Valente and Philip N. Garner, in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010

Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment, Afsaneh Asaei, Hervé Bourlard and Philip N. Garner, in: Proceedings of Interspeech, Makuhari, Japan, 2010

Study of Jacobian Normalization for VTLN, Lakshmi Saheer, Philip N. Garner and John Dines, Idiap-RR-25-2010

Implementation of VTLN for Statistical Speech Synthesis, Lakshmi Saheer, John Dines, Philip N. Garner and Hui Liang, in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010

Floor Holder Detection and End of Speaker Turn Prediction in Meetings, Alfred Dielmann, Giulia Garau and Hervé Bourlard, in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010

Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, Radu-Andrei Negoescu, Alexander Loui and Daniel Gatica-Perez, Idiap-RR-20-2010

A Multi-class Classification Strategy for Fisher Scores: Application to Signer Independent Sign Language Recognition, Oya Aran and Lale Akarun, in: Pattern Recognition, 43(5), 2010

[DOI]

Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, Oya Aran and Daniel Gatica-Perez, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010, Istanbul, Turkey, 2010

Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, Oya Aran and Daniel Gatica-Perez, Idiap-RR-17-2010

An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, Hui Liang and John Dines, in: Proceedings of Interspeech, Makuhari, Japan, 2010

An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, Hui Liang and John Dines, Idiap-RR-16-2010

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai-Doss, in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010

Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010

Audioâ€“Visual Synchronisation for Speaker Diarisation, Giulia Garau, Alfred Dielmann and Hervé Bourlard, in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-22-2010

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010

Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-23-2010

Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior, Hayley Hung and Daniel Gatica-Perez, Idiap-RR-12-2010

Mining Human Location-Routines using a Multi-Level Topic Model, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-28-2010

Probabilistic Mining of Socio-Geographic Routines from Mobile Phone Data, Katayoun Farrahi and Daniel Gatica-Perez, in: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 4(4), 2010

Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-29-2010

Just-in-Time Multimodal Association and Fusion from Home Entertainment, Danil Korchagin, Petr Motlicek, Stefan Duffner and Hervé Bourlard, Idiap-RR-10-2011

Learning from Candidate Labeling Sets, Jie Luo and Francesco Orabona, Idiap-RR-27-2011

Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, Idiap-RR-33-2010

Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-14-2010

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-15-2010

Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-12-2011

Hands Free Audio Analysis from Home Entertainment, Danil Korchagin, Philip N. Garner and Petr Motlicek, Idiap-RR-27-2010

Fast Bounding Box Estimation based Face Detection, Venkatesh Bala Subburaman and Sébastien Marcel, Idiap-RR-38-2010

The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, Andrzej Pronobis, Jie Luo and Barbara Caputo, Idiap-RR-08-2010

Online-Batch Strongly Convex Multi Kernel Learning, Francesco Orabona, Jie Luo and Barbara Caputo, Idiap-RR-07-2010

OM-2: An Online Multi-class Multi-kernel Learning Algorithm, Jie Luo, Francesco Orabona, Marco Fornoni, Barbara Caputo and Nicolo Cesa-Bianchi, Idiap-RR-06-2010

Implementation of VTLN for Statistical Speech Synthesis, Lakshmi Saheer, John Dines, Philip N. Garner and Hui Liang, Idiap-RR-32-2010

Neural conditional random fields, Trinh-Minh-Tri Do and Thierry Artieres, in: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna, Sardinia, Italy, JMLR: W&CP, 2010

Online-Batch Strongly Convex Multi Kernel Learning, Francesco Orabona, Jie Luo and Barbara Caputo, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, Andrzej Pronobis, Jie Luo and Barbara Caputo, in: Image and Vision Computing, 2010

[DOI]

A Multimodal Corpus for Studying Dominance in Small Group Conversations, Oya Aran, Hayley Hung and Daniel Gatica-Perez, in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010

Multilayer Perceptron Based Hierarchical Acoustic Modeling for Automatic Speech Recognition, Joel Praveen Pinto, Ecole polytechnique fédérale de Lausanne, 2010

Joint Pose Estimator and Feature Learning for Object Detection, Karim Ali, Francois Fleuret, David Hasler and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2009

Evaluation of Probabilistic Occupancy Map People Detection for Surveillance Systems, Jerome Berclaz, Ali Shahrokni, Francois Fleuret, James Ferryman and Pascal Fua, in: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009

Analysis of Phone Posterior Feature Space Exploiting Class Specific Sparsity and MLP-based Similarity Measure, Afsaneh Asaei, Benjamin Picart and Hervé Bourlard, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 33(1):101-116, 2011

Learning Large Margin Likelihood for Realtime Head Pose Tracking, Elisa Ricci and Jean-Marc Odobez, in: IEEE Int. Conference on Image Processing, Cairo, Egypt, IEEE, 2009

Structure and appearance features for robust 3D facial actions tracking, Stéphanie Lefèvre and Jean-Marc Odobez, in: IEEE Proc. Int. Conf. on Multimedia and Expo, IEEE, 2009

Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Mathew Magimai-Doss, Hynek Hermansky and Hervé Bourlard, in: IEEE Transcations on Audio, Speech, and Language Processing, 19(2):225-241, 2011

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-13-2010

Finding without searching, Andrei Popescu-Belis, Idiap-Com-01-2010

Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010

Multistream Speaker Diarization beyond Two Acoustic Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: International Conference on Acoustics, Speech, and Signal Processing, 2010

AMIDA/Klewel Mini-Project, Petr Motlicek, Philip N. Garner, Maël Guillemot and Vincent Bozzo, Idiap-RR-03-2010

An Alternative Scanning Strategy to Detect Faces, Venkatesh Bala Subburaman and Sébastien Marcel, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

Canal9: A database of political debates for analysis of social interactions, Alessandro Vinciarelli, Alfred Dielmann, Sarah Favre and Hugues Salamin, in: Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing), Amsterdam, Netherlands, 2009

[DOI]

Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, in: ICASSP 2010, 2010

Using Audio and Visual Cues for Speaker Diarisation Initialisation, Giulia Garau and Hervé Bourlard, in: International Conference on Acoustics, Speech and Signal Processing, 2010

An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and Gerald Friedland, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010

On Improving Face Detection Performance by Modelling Contextual Information, Cosmin Atanasoaei, Chris McCool and Sébastien Marcel, Idiap-RR-43-2010

Automatic Temporal Alignment of AV Data with Confidence Estimation, Danil Korchagin, Philip N. Garner and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

MLP Based Hierarchical System for Task Adaptation in ASR, Joel Praveen Pinto, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009

VTLN Adaptation for Statistical Speech Synthesis, Lakshmi Saheer, Philip N. Garner, John Dines and Hui Liang, Idiap-RR-41-2009

VTLN Adaptation for Statistical Speech Synthesis, Lakshmi Saheer, Philip N. Garner, John Dines and Hui Liang, in: Proceedings of ICASSP, Dallas, Texas, 2010

Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, Gelareh Mohammadi and Alessandro Vinciarelli, Idiap-RR-05-2012

A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Hui Liang, John Dines and Lakshmi Saheer, Idiap-RR-05-2010

A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Hui Liang, John Dines and Lakshmi Saheer, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010

Bayesian Networks as Generative Models for Face Recognition, Guillaume Heusch, EPFL, 2009

A Multimedia Retrieval System Using Speech Input, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Erik Boertjes, Jean Carletta, Sandro Castronovo, Michal Fapso, Alexandre Nanchen, Theresa Wilson, Joost de Wit and Majid Yazdani, in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009

The FEMTI guidelines for contextual MT evaluation: principles and tools, Paula Estrella, Andrei Popescu-Belis and Margaret King, in: Linguistica Antverpiensia New Series, 8, 2009

Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions, Andrei Popescu-Belis, in: Multimodal Signal Processing for Human-Computer Interaction, Elsevier / Academic Press, 2009

Accessing a Large Multimodal Corpus using an Automatic Content Linking Device, Andrei Popescu-Belis, Jean Carletta, Jonathan Kilgour and Peter Poller, in: Multimodal Corpora: From Models of Natural Interaction to Systems and Applications, Springer-Verlag, 2009

[DOI]

User Interface Design in a Just-in-time Retrieval System for Meetings, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Mike Flynn, Sebastian Germesin, Alexandre Nanchen and Majid Yazdani, Idiap-RR-38-2009

On MLP-based Posterior Features for Template-based ASR, Serena Soldo, Mathew Magimai-Doss, Joel Praveen Pinto and Hervé Bourlard, Idiap-RR-37-2009

Memoirs of Togetherness from Audio Logs, Danil Korchagin, in: Proceedings International ICST Conference on User Centric Media, Venice, Italy, 2009

Autoregressive Models of Amplitude Modulations in Audio Compression, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2010

[URL]

Wide-Band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, in: EURASIP Journal on Audio Speech and Music Processing, 2010(856280), 2010

[DOI]
[URL]

MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: Audio Engineering Society (AES,',','), 127th Convention, Audio Engineering Society (AES), Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA;, 2009

[URL]

APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009

[URL]

APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, Idiap-RR-35-2009

MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-34-2009

Autoregressive Models of Amplitude Modulations in Audio Compression, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-33-2009

Wide-Band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-32-2009

On the vulnerability of face verification systems to hill-climbing attacks, Javier Galbally, Chris McCool, Julian Fierrez, Sébastien Marcel and Javier Ortega-Garcia, in: Pattern Recognition, 2009

Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of the 17th ACM International Conference on Multimedia, ACM, 2009

MOBIO Database for the ICPR 2010 Face and Speech Competition, Chris McCool and Sébastien Marcel, Idiap-Com-02-2009

Out-of-Scene AV Data Detection, Danil Korchagin, in: Proceedings IADIS International Conference Applied Computing, Rome, Italy, 2009

Analysis of F0 and Cepstral Features for Robust Automatic Gender Recognition, Marianna Pronobis and Mathew Magimai-Doss, Idiap-RR-30-2009

Bayesian Networks to Combine Intensity and Color Information in Face Recognition, Guillaume Heusch and Sébastien Marcel, in: International Conference on Biometrics, Springer, 2009

A novel statistical generative model dedicated to face recognition, Guillaume Heusch and Sébastien Marcel, in: Image & Vision Computing, 2009

Who's Doing What: Joint Modeling of Names and Verbs for Simultaneous Face and Pose Annotation, Jie Luo, Barbara Caputo and Vittorio Ferrari, in: Advances in Neural Information Processing Systems 22 (NIPS09), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2009

Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010

Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, Anindya Roy and Sébastien Marcel, in: British Machine Vision Conference 2009, 2009

Topic Models for Scene Analysis and Abnormality Detection, Jagannadan Varadarajan and Jean-Marc Odobez, in: 9th International Workshop in Visual Surveillance, IEEE, Kyoto, Japan, IEEE, 2009

Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings, Sarah Favre, in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), 2009

Flickr Hypergroups, Radu-Andrei Negoescu, Brett Adams, Dinh Phung, Svetha Venkatesh and Daniel Gatica-Perez, in: Proceedings of the 17th ACM International Conference on Multimedia, 2009

Multimodal Signal Processing: Methods and Techniques to Build Multimodal Interactive Systems, Jean-Philippe Thiran, Hervé Bourlard and Ferran Marques, Academic Press, 2009

Memoirs of Togetherness from Audio Logs, Danil Korchagin, Idiap-RR-36-2009

Learning and Predicting Multimodal Daily Life Patterns from Cell Phones, Katayoun Farrahi and Daniel Gatica-Perez, in: ICMI-MLMI, 2009

An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and Gerald Friedland, Idiap-RR-02-2010

Multimodal Data Flow Controller, Danil Korchagin, Idiap-Com-01-2009

Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, Anindya Roy and Sébastien Marcel, Idiap-RR-28-2009

Automatic Temporal Alignment of AV Data with Confidence Estimation, Danil Korchagin, Philip N. Garner and John Dines, Idiap-RR-40-2009

Dynamic Partitioned Sampling For Tracking With Discriminative Features, Stefan Duffner, Jean-Marc Odobez and Elisa Ricci, in: Proceedings of the British Maschine Vision Conference, London, 2009

Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, Idiap-RR-04-2010

Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, ISCA 2009, 2009

Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, Petr Motlicek, in: 10thAnnual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009

Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009

Robust Speaker Diarization for Short Speech Recordings, David Imseng and Gerald Friedland, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, pages 432-437, 2009

Robust Speaker Diarization for Short Speech Recordings, David Imseng and Gerald Friedland, Idiap-RR-26-2009

On the design of audio features robust to the album-effect for music information retrieval., Nicolas Scaringella, Ecole Polytechnique Fédérale de Lausanne, 2009

An online framework for learning novel concepts over multiple cues, Jie Luo, Francesco Orabona and Barbara Caputo, in: Proceeding of The 9th Asian Conference on Computer Vision, Xi'an, China, 2009

Discovering Group Nonverbal Conversational Patterns with Topics, Dinesh Babu Jayagopi and Daniel Gatica-Perez, in: Proceedings ICMI-MLMI, 2009

Speaker Change Detection with Privacy-Preserving Audio Cues, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez and Hervé Bourlard, in: Proceedings of ICMI-MLMI 2009, 2009

The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories, Tatiana Tommasi and Barbara Caputo, in: British Machine Vision Conference, 2009

Hill-Climbing Attack to an Eigenface-Based Face Verification System, Javier Galbally, Chris McCool, Julian Fierrez, Sébastien Marcel and Javier Ortega-Garcia, in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009

Co-occurrence Models for Image Annotation and Retrieval, Nikhil Garg, Idiap-RR-22-2009

Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr, Nikhil Garg and Daniel Gatica-Perez, Idiap-RR-21-2009

Automatic Role Recognition in Multiparty Recordings: Using Social Affiliation Networks for Feature Extraction, Hugues Salamin, Sarah Favre and Alessandro Vinciarelli, in: IEEE Transactions on Multimedia, 11(7), 2009

Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models, Sarah Favre, Alfred Dielmann and Alessandro Vinciarelli, in: ACM International Conference on Multimedia, 2009

Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, Giulia Garau, Silèye O. Ba, Hervé Bourlard and Jean-Marc Odobez, in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009

Visual Speaker Localization Aided by Acoustic Models, Gerald Friedland, Chuohao Yeo and Hayley Hung, in: ACM Multimedia, 2009

Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets, Jian Yao and Jean-Marc Odobez, Idiap-RR-19-2009

Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity, Benjamin Picart, Idiap-RR-18-2009

You live, you learn, you forget: continuous learning of visual places with a forgetting mechanism, Muhammad Muneeb Ullah, Francesco Orabona and Barbara Caputo, in: International Conference on Robotic and Systems, 2009

Towards a theoretical framework for learning multi-modal patterns for embodied agents, Nicoletta Noceti, Barbara Caputo, Claudio Castellini, Luca Baldassarre, Annalisa Barla, Lorenzo Rosasco, Francesca Odone and Giulio Sandini, in: International Conference on Image Analysis and Processing, 2009

A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems, Francesco Orabona, Barbara Caputo, Antje Fillbrandt and Frank Ohl, in: International Conference on Developmental Learning, 2009

Model adaptation with least-square SVM for adaptive hand prosthetics, Francesco Orabona, Claudio Castellini, Barbara Caputo, Angelo Emanuele Fiorilla and Giulio Sandini, in: IEEE International conference on Robotics and Automation, 2009

Bounded kernel-based perceptrons, Francesco Orabona, Joseph Keshet and Barbara Caputo, in: Journal of Machine Learning Research, Accepted for pub, 2009

Cue Integration for Medical Image Annotation, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Springer-Verlag, 2008

Investigation of kNN Classifier on Posterior Features Towards Application in Automatic Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Benjamin Picart, Idiap-RR-11-2010

Towards Life-long Learning for Cognitive Systems: Online Independent Support Vector Machine, Francesco Orabona, Claudio Castellini, Barbara Caputo, Jie Luo and Giulio Sandini, in: Pattern Recognition, Accepted for Pub, 2009

Classifying Material in the Real World, Barbara Caputo, Eric Hayman, Mario Fritz and J-O Ekluhnd, in: Image and vision Computing, accepted for pub, 2009

COLD: The COsy Localization Database, Andrzej Pronobis and Barbara Caputo, in: International Journal of Robotics Research, 28(5), 2009

KL Realignment for Speaker Diarization with Multiple Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: 10th Annual Conference of the International Speech Communication Association, 2009

Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of International conference on acoustics speech and signal processing, 2009

Robustness of Phase based Features for Speaker Recognition, Padmanabhan Rajan, Sree Hari Krishnan Parthasarathi and Hema A Murthy, in: Proceedings of Interspeech, 2009

Automatic vs. human question answering over multimedia meeting recordings, Quoc Anh Le and Andrei Popescu-Belis, in: 10th Annual Conference of the International Speech Communication Association, 2009

Robustness of Phase based Features for Speaker Recognition, Padmanabhan Rajan, Sree Hari Krishnan Parthasarathi and Hema A Murthy, Idiap-RR-14-2009

Automatic vs. human question answering over multimedia meeting recordings, Quoc Anh Le and Andrei Popescu-Belis, Idiap-RR-13-2009

Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, in: Proceedings of Interspeech 2009, 2009

Comparing meeting browsers using a task-based evaluation method, Andrei Popescu-Belis, Idiap-RR-11-2009

Tuning-Robust Initialization Methods for Speaker Diarization, David Imseng and Gerald Friedland, Idiap-RR-35-2010

A Novel Criterion for Classifiers Combination in Multistream Speech Recognition, Fabio Valente, in: IEEE Signal Processing Letters, 16(7), 2009

[DOI]

Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, Fabio Valente, Mathew Magimai-Doss, Christian Plahl and Ravuri Suman, in: Proceedings of the 10thAnnual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009

KL Realignment for Speaker Diarization with Multiple Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-24-2010

Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features, Hayley Hung and Silèye O. Ba, Idiap-RR-20-2009

Speaker Change Detection with Privacy-Preserving Audio Cues, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez and Hervé Bourlard, Idiap-RR-23-2009

Out-of-Scene AV Data Detection, Danil Korchagin, Idiap-RR-31-2009

Novel initialization methods for Speaker Diarization, David Imseng, Idiap-RR-07-2009

Steerable Features for Statistical 3D Dendrite Detection, German Gonzalez, Francois Aguet, Francois Fleuret, Michael Unser and Pascal Fua, in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention, 2009

Automatic Temporal Alignment of AV Data, Danil Korchagin, Philip N. Garner and John Dines, Idiap-RR-39-2009

Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-12-2009

Characterising Conversationsal Group Dynamics Using Nonverbal Behaviour, Dinesh Babu Jayagopi, Raducanu Bogdan and Daniel Gatica-Perez, in: Proceedings ICME 2009, 2009

Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, Idiap-RR-29-2009

Visual Activity Context For Focus of Attention Estimation in Dynamic Meetings, Silèye O. Ba, Hayley Hung and Jean-Marc Odobez, in: International Conference on Multimedia & Expo, 2009

An SVM Confidence-Based Approach to Medical Image Annotation, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Workshop of the Cross-Language Evaluation Forum, 2008

Learning Rotational Features for Filament Detection, German Gonzalez, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2009

Discriminative Keyword Spotting, Joseph Keshet, David Grangier and Samy Bengio, in: Speech Communication, 51(4), 2009

Parts-Based Face Verification using Local Frequency Bands, Chris McCool and Sébastien Marcel, in: in Proceedings of IEEE/IAPR International Conference on Biometrics, 2009

Modeling and Understanding Flickr Communities through Topic-based Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-19-2010

Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, Weifeng Li, John Dines, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009

Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Hynek Hermansky and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009

Visual activity context for focus of attention estimation in dynamic meetings, Silèye O. Ba, Hayley Hung and Jean-Marc Odobez, Idiap-RR-02-2009

MULTI-MODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING COMPRESSED-DOMAIN VIDEO FEATURES, Gerald Friedland, Hayley Hung and Chuohao Yeo, in: International Conference on Audio, Speech and Signal Processing, 2009

Topickr: Flickr Groups and Users Reloaded, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008

Analyzing Flickr Groups, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: Proc. of the Intl. Conf. on Image and Video Retrieval, ACM, 2008

Discriminative Keyword Spotting, David Grangier, Joseph Keshet and Samy Bengio, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition, Joseph Keshet, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

A Kernel Wrapper for Phoneme Sequence Recognition, Joseph Keshet and Dan Chazan, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

A Large Margin Algorithm for Forced Alignment, Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks, Martin Wöllmer, Florian Eyben, Joseph Keshet, Alex Graves, Björn Schuller and Gerhard Rigoll, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, 2009

Support Vector Machines with a Reject Option, Yves Grandvalet, Alain Rakotomamonjy, Joseph Keshet and Stéphane Canu, in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008

MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009

An Information Theoretic Approach to Speaker Diarization of Meeting Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Transactions on Audio Speech and Language Processing, 17(7), 2009

[DOI]

Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, Sarah Favre, Hugues Salamin, John Dines and Alessandro Vinciarelli, in: International Conference on Multimodal Interfaces, Chania, Greece, 2008

Tracking the visual focus of attention for a varying number of wandering people, Kevin C. Smith, Silèye O. Ba, Jean-Marc Odobez and Daniel Gatica-Perez, in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 30(7), 2008

Multi-camera 3d person tracking with particle filter in a surveillance environment, Jian Yao and Jean-Marc Odobez, in: 16th European Signal processing Conference (EUSIPCO), 2008

Detecting queues at vending machines: a statistical layered approach, Xavier Naturel and Jean-Marc Odobez, in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008

Multi-camera multi-person 3d space tracking with mcmc in surveillance scenarios, Jian Yao and Jean-Marc Odobez, in: European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2), Marseille, 2008

Fast human detection from videos using covariance features, Jian Yao and Jean-Marc Odobez, in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008

Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis, C. Carincotte, Xavier Naturel, M. Hick, Jean-Marc Odobez, Jian Yao, A. Bastide and B. Corbucci, in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Bejing, 2008

Exploiting Contextual Information for Speech/Non-Speech Detection, Sree Hari Krishnan Parthasarathi, Petr Motlicek and Hynek Hermansky, in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008

Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky, Harinath Garudadri and Marios Athineos, in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008

Recognizing Human Visual Focus of Attention from Head Pose in Meetings, Silèye O. Ba and Jean-Marc Odobez, in: IEEE Transactions on Systems, Man, Cybernetics, Part-B, Vol. 39(No. 1), 2009

Contextual classification of image patches with latent aspect models, Florent Monay, Pedro Quelhas, Jean-Marc Odobez and Daniel Gatica-Perez, in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009

Bayesian Networks to Combine Intensity and Color Information in Face Recognition, Guillaume Heusch and Sébastien Marcel, Idiap-RR-27-2009

Model Adaptation with Least-Squares SVM for Adaptive Hand Prosthetics, Francesco Orabona, Claudio Castellini, Barbara Caputo, Angelo Emanuele Fiorilla and Giulio Sandini, Idiap-RR-05-2009

CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, Idiap-RR-77-2008

Face Detection using Ferns, Venkatesh Bala Subburaman and Sébastien Marcel, Idiap-Com-01-2011

Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-75-2008

MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-74-2008

Enhancing posterior based speech recognition systems, Hamed Ketabdar, Ecole Polytechnique Fédérale de Lausanne, 2008

Automatic Out-of-Language Detection based on Confidence Measures derived from LVCSR Word and Phone Lattices, Petr Motlicek, Idiap-RR-06-2009

Predicting Two Facets of Social Verticality in Meetings from Five-Minute Time Slices and Nonverbal Cues, Dinesh Babu Jayagopi, Silèye O. Ba, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proceedings - ICMI 2008, 2008

Modeling Dominance in Group Conversations using NonVerbal Activity Cues, Dinesh Babu Jayagopi, Hayley Hung, Chuohao Yeo and Daniel Gatica-Perez, in: IEEE Transactions on Audio, Speech and Language Processing, 2008

Methods for Asynchronous and Non-Invasive EEG-Based Brain-Computer Interfaces. Towards Intelligent Brain-Actuated Wheelchairs, Ferran Galán, University of Barcelona, 2008

Principled Detection-by-classification from Multiple Views, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: proceedings of the International Conference on Computer Vision Theory and Applications, 2008

Classification-based Probabilistic Modeling of Texture Transition for Fast Line Search Tracking and Delineation, Ali Shahrokni, Tom Drummond, Francois Fleuret and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008

Multi-layer Boosting for Pattern Recognition, Francois Fleuret, in: Pattern Recognition Letter, 30, 2009

Multi-Camera People Tracking with a Probabilistic Occupancy Map, Francois Fleuret, Jerome Berclaz, Richard Lengagne and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(2), 2008

Multiple Object Tracking using Flow Linear Programming, Jerome Berclaz, Francois Fleuret and Pascal Fua, Idiap-RR-10-2009

Integrating audio and vision for robust automatic gender recognition, Marianna Pronobis and Mathew Magimai-Doss, Idiap-RR-73-2008

Parts-Based Face Verification using Local Frequency Bands, Chris McCool and Sébastien Marcel, Idiap-RR-03-2009

How does a dictation machine recognize speech ?, T. Dutoit, L. Couvreur and Hervé Bourlard, in: Applied Signal Processing--A MATLAB approach, Springer MA, 2008

How does a dictation machine recognize speech?, T. Dutoit, L. Couvreur and Hervé Bourlard, Idiap-RR-72-2008

Entropy coding of Quantized Spectral Components in FDLP audio codec, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-71-2008

Modulation Frequency Features For Phoneme Recognition In Noisy Speech, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, in: Journal of Acoustical Society of America - Express Letters, 2008

Modulation Frequency Features For Phoneme Recognition In Noisy Speech, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, Idiap-RR-70-2008

CLEF2007 Image Annotation Task: an SVM-based Cue Integration Approach, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Proceedings of ImageCLEF 2007 -LNCS, 2007

The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, Joern Anemueller, Joerg-Henrik Back, Barbara Caputo, michal havlena, Jie Luo, Hendrik Kayser, Bastian Leibe, Petr Motlicek, Tomas Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Alon Zweig and Hynek Hermansky, in: Proceedings of the International Conference on Multimodal Interfaces, 2008

The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, Joern Anemueller, Joerg-Henrik Back, Barbara Caputo, michal havlena, Jie Luo, Hendrik Kayser, Bastian Leibe, Petr Motlicek, Tomas Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Hynek Hermansky and Alon Zweig, Idiap-RR-41-2010

Biologically Motivated Audio-Visual Cue Integration for Object, Joern Anemueller, Joerg-Henrik Back, Barbara Caputo, Jie Luo, Frank Ohl, Francesco Orabona, Rufin Vogels, Daphna Weinshall and Alon Zweig, in: Proceedings of the first Internatinal Conference on Cognitive Systems, 2008

SVM-based Discriminative Accumulation Scheme for Place Recognition, Andrzej Pronobis, Oscar Martinez Monos and Barbara Caputo, in: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA08), 2008

Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits, Bertrand Mesot, Ecole Polytechnique Fédérale de Lausanne, 2008

Probabilistic models for music, Jean-François Paiement, Ecole Polytechnique Fédérale de Lausanne, 2008

[URL]

Machine Learning for Information Retrieval, David Grangier, Ecole Polytechnique Fédérale de Lausanne, 2008

ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, Hayley Hung, Yan Huang, Gerald Friedland and Daniel Gatica-Perez, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007

Investigating Automatic Dominance Estimation in Groups From Visual Attention and Speaking Activity, Hayley Hung, Dinesh Babu Jayagopi, Silèye O. Ba, Jean-Marc Odobez and Daniel Gatica-Perez, in: International Conference on Multi-modal Interfaces, 2008

Towards Audio-Visual On-line Diarization Of Participants In Group Meetings, Hayley Hung and Gerald Friedland, in: European Conference on Computer Vision Workshop on Multi-camera and Multi-modal Sensor Fusion, 2008

Kernel Based Text-Independnent Speaker Verification, Johnny Mariéthoz, Samy Bengio and Yves Grandvalet, Idiap-RR-68-2008

Towards Robust Place Recognition for Robot Localization, Muhammad Muneeb Ullah, Andrzej Pronobis, Barbara Caputo, Jie Luo, Patric Jensfelt and Henrik I. Christensen, in: IEEE International Conference on Robotics ad Automation, 2008

Towards Robust Place Recognition for Robot Localization, Muhammad Muneeb Ullah, Andrzej Pronobis, Barbara Caputo, Jie Luo, Patric Jensfelt and Henrik I. Christensen, Idiap-RR-40-2010

Class specific object recognition using kernel Gibbs distributions, Barbara Caputo, in: ELectronic Letters on Computer vision and Image Analysis, 7(2), 2008

Discriminative cue integration for medical image annotation, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Pattern Recognition Letters, 2008

Acoustic Models for Posterior Features in Speech Recognition, Guillermo Aradilla, Ecole Polytechnique Fédérale de Lausanne, 2008

Acoustic Models for Posterior Features in Speech Recognition, Guillermo Aradilla, Idiap-RR-67-2008

Fast Recognition of Anticipation Related Potentials, Gangadhar Garipelli, Ricardo Chavarriaga and José del R. Millán, in: IEEE Transactions on Biomedical Engineering, 2008

SimpleMKL, Alain Rakotomamonjy, Francis Bach, Stéphane Canu and Yves Grandvalet, in: Journal of Machine Learning Research, 9, 2008

Multi-layer Boosting for Pattern Recognition, Francois Fleuret, Idiap-RR-76-2008

Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-47-2008

Graphical representation of meetings on mobile devices, Lukas Matena, Alejandro Jaimes and Andrei Popescu-Belis, in: MobileHCI 2008 (10th International Conference on Human-Computer Interaction with Mobile Devices and Services, Demonstrations Session), Amsterdam, 2008

Improving Contextual Quality Models for MT Evaluation Based on Evaluators' Feedback, Paula Estrella, Andrei Popescu-Belis and Margaret King, in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008

Task-based evaluation of meeting browsers: from BET task elicitation to user behavior analysis, Andrei Popescu-Belis, Mike Flynn, Pierre Wellner and Philippe Baudrion, in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008

Reference-based vs. task-based evaluation of human language technology, Andrei Popescu-Belis, in: LREC 2008 ELRA Workshop on Evaluation, ELRA, Marrakech, Morocco, 2008

The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings, Andrei Popescu-Belis, Erik Boertjes, Jonathan Kilgour, Peter Poller, Sandro Castronovo, Theresa Wilson, Alejandro Jaimes and Jean Carletta, in: Machine Learning for Multimodal Interaction V, Utrecht, Springer-Verlag, 2008

[DOI]

Dimensionality of Dialogue Act Tagsets: An Empirical Analysis of Large Corpora, Andrei Popescu-Belis, in: Language Resources and Evaluation, 42(1), 2008

[DOI]

Towards an Objective Test for Meeting Browsers: the BET4TQB Pilot Experiment, Andrei Popescu-Belis, Philippe Baudrion, Mike Flynn and Pierre Wellner, in: Machine Learning for Multimodal Interaction IV, Springer-Verlag, 2008

[DOI]

Machine Learning for Multimodal Interaction V, Andrei Popescu-Belis and Rainer Stiefelhagen, Springer-Verlag, LNCS, volume 5237, 2008

[DOI]

Machine Learning for Multimodal Interaction IV, Andrei Popescu-Belis, Hervé Bourlard and Steve Renals, Springer-Verlag, LNCS, volume 4892, 2008

[DOI]

Social Signal Processing: State-of-the-Art and Future Perspectives of an Emerging Domain, Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard and Alex Pentland, in: Proceedings of the ACM International Conference on Multimedia, 2008

Social Signals, their Function, and Automatic Analysis: A Survey, Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard and Alex Pentland, in: Proceedings of International Conference on Multimodal Interfaces (to appear), 2008

Fast Human Detection from Videos Using Covariance Features, Jian Yao and Jean-Marc Odobez, Idiap-RR-68-2007

Multi-Layer Background Subtraction Based on Color and Texture, Jian Yao and Jean-Marc Odobez, Idiap-RR-67-2007

Multi-Layer Background Subtraction Based on Color and Texture, Jian Yao and Jean-Marc Odobez, in: CVPR 2007 Workshop on Visual Surveillance (VS2007), 2007

Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, Joseph Keshet and Samy Bengio, John Wiley & Sons, 2008

Support Vector Machines with a Reject Option, Yves Grandvalet, Joseph Keshet, Alain Rakotomamonjy and Stéphane Canu, Idiap-RR-01-2009

Discriminative Keyword Spotting, Joseph Keshet, David Grangier and Samy Bengio, in: Workshop on Non-Linear Speech Processing, Paris, France, 2007

Discriminative Kernel-Based Phoneme Sequence Recognition, Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, Samy Bengio and Dan Chazan, in: The 9th International Conference on Spoken Language Processing (INTERSPEECH), Pittsburgh, PA, 2006

Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, Sarah Favre, Hugues Salamin, Alessandro Vinciarelli, Dilek Hakkani Tür and N. P. Garg, in: ACM International Conference on Multimedia, Vancouver, Canada, 2008

Identifying Dominant People in Meetings from Audio-Visual Sensors, Hayley Hung and Daniel Gatica-Perez, in: International Conference on Automatic Face and Gesture Recognition, Amsterdam, The Netherlands, 2008

Identifying Dominant People in Meetings from Audio-Visual Sensors, Hayley Hung and Daniel Gatica-Perez, Idiap-RR-65-2008

Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, Hayley Hung, Yan Huang, Chuohao Yeo and Daniel Gatica-Perez, Idiap-RR-66-2008

Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, Hayley Hung, Yan Huang, Chuohao Yeo and Daniel Gatica-Perez, in: First IEEE Workshop on CVPR for Human Communicative Behavior Analysis, 2008

Predicting the Dominant Clique in Meetings through Fusion of Nonverbal Cues, Dinesh Babu Jayagopi, Hayley Hung, Chuohao Yeo and Daniel Gatica-Perez, in: ACM MM 2008, 2008

Topickr: Flickr Groups and Users Reloaded, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-61-2008

Stationary Features and Cat Detection, Francois Fleuret and Donald Geman, in: Journal of Machine Learning Research, 9, 2008

Automated Delineation of Dendritic Networks in Noisy Image Stacks, German Gonzalez, Francois Fleuret and Pascal Fua, in: proceedings of the European Conference on Computer Vision, 2008

Multi-Camera Tracking and Atypical Motion Detection with Behavioral Maps, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: proceedings of the European Conference on Computer Vision, 2008

What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-49-2008

What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, Katayoun Farrahi and Daniel Gatica-Perez, in: ACM International Conference on Multimedia (ACMMM), 2008

Discovering Human Routines from Cell Phone Data with Topic Models, Katayoun Farrahi and Daniel Gatica-Perez, in: IEEE International Symposium on Wearable Computers (ISWC), 2008

Discovering Human Routines from Cell Phone Data with Topic Models, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-32-2008

Daily Routine Classification from Mobile Phone Data, Katayoun Farrahi and Daniel Gatica-Perez, in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), 2008

Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, Sarah Favre, Hugues Salamin and Alessandro Vinciarelli, Idiap-RR-64-2008

Calibration from statistical properties of the visual world, Etienne Grossmann, José António Gaspar and Francesco Orabona, Idiap-RR-63-2008

Calibration from statistical properties of the visual world, Etienne Grossmann, José António Gaspar and Francesco Orabona, in: European Conf. on Computer Vision, 2008

Predicting the dominant clique in meetings through fusion of nonverbal cues, Dinesh Babu Jayagopi, Hayley Hung, Chuohao Yeo and Daniel Gatica-Perez, Idiap-RR-08-2008

Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps, Norman Poh, Alvin Martin and Samy Bengio, Idiap-RR-60-2005

Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, Norman Poh and Samy Bengio, Idiap-RR-59-2005

Optimisation de réseaux de neurones, Jean-Luc Beuchat, {EPFL}, Lausanne, Switzerland, 1995

Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, N. P. Garg, Sarah Favre, Hugues Salamin, D. Hakkani Tür and Alessandro Vinciarelli, Idiap-RR-57-2008

understanding metro station usage using closed circuit television cameras analysis, C. Carincotte, M. Hick, Xavier Naturel, Jean-Marc Odobez, Jian Yao, A. Bastide and B. Corbucci, Idiap-RR-38-2008

The COLD Database, Muhammad Muneeb Ullah, Andrzej Pronobis, Barbara Caputo, Jie Luo and Patric Jensfelt, Idiap-RR-49-2007

Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-75-2007

Analysis of Confusion Matrix to Combine Evidence for Phoneme Recognition, S. R. Mahadeva Prasanna, B. Yegnanarayana, Joel Praveen Pinto and Hynek Hermansky, Idiap-RR-27-2007

Classifying Materials in the Real World, Barbara Caputo, Eric Hayman, Mario Fritz and Jan-Olof Eklhund, Idiap-RR-69-2007

Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, Norman Poh and Samy Bengio, Idiap-RR-59-2005

A System for the Off-Line Recognition of Handwritten Text, Thomas M. Breuel, Idiap-RR-02-1994

View-Based Recognition, Thomas M. Breuel, Idiap-RR-09-1993

On the Combination of Auditory and Modulation Frequency Channels for ASR applications, Fabio Valente and Hynek Hermansky, in: Interspeech 2008, 2008

Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Interspeech 2008, 2008

Melanoma Recognition Using Representative and Discriminative Kernel Classifiers, Tatiana Tommasi, Elisabetta La Torre and Barbara Caputo, in: International Workshop on Computer Vision Applications for Medical Image Analysis, 2006

A Discriminative Approach to Robust Visual Place Recognition, Andrzej Pronobis, Barbara Caputo, Patric Jensfelt and Henrik I. Christensen, in: IEEE International Conference on Intelligent RObot Systems (IROS), 2006

Biometric Person Authentication IS A Multiple Classifier Problem, Samy Bengio and Johnny Mariéthoz, in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007

Spin Glass Models of Markov Random Fields, Barbara Caputo, in: International Journal on Image, Systems and Technology, 16(5), 2006

Neural Network Initialization, Georg Thimm and Emile Fiesler, in: From Natural to Artificial Neural Computation, Springer Verlag, 1995

Les domaines d'application des technologies vocales, Gérard Chollet, in: Fondements et perspectives en traitement automatique de la parole, GDR-PRC Communication Homme-Machine, 1995

A Hybrid Approach to Continuous Speech Recognition, Kari Torkkola and Teuvo Kohonen, in: The handbook of brain theory and neural networks, The MIT Press, 1995

Assessment of speaker verification systems, Gérard Chollet and Frédéric Bimbot, in: Spoken Language Ressources and Assessment, EAGLES Handbook, 1995

Handwriting Recognition, Thomas M. Breuel, in: Recent Developments in Computer Vision, Springer, 1995

Applying Handwriting Recognition to US Census Forms, Thomas M. Breuel, in: Recent Developments in Computer Vision, Springer, 1995

An All-Optical Forward Propagation Multilayer Neural Network, Indu Saxena and Emile Fiesler, in: From Natural to Artificial Neural Computation, Springer Verlag, 1995

Composite Kernel Learning, Marie Szafranski, Yves Grandvalet and Alain Rakotomamonjy, Idiap-RR-59-2008

Composite Kernel Learning, Marie Szafranski, Yves Grandvalet and Alain Rakotomamonjy, in: Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), Omnipress, 2008

Joint Head Tracking and Pose Estimation for Visual Focus of Attention Recognition, Silèye O. Ba, École Polytechnique Fédérale de Lausanne, 2007

Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, Idiap-RR-40-2008

Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, in: AES 124th Convention, Audio Engineering Society, 2008

Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, Ferran Galán, Marnix Nuttin, Dirk Vanhooydonck, Eileen Lew, Pierre W. Ferrez, Johan Philips and José del R. Millán, Idiap-RR-53-2008

Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, Ferran Galán, Marnix Nuttin, Dirk Vanhooydonck, Eileen Lew, Pierre W. Ferrez, Johan Philips and José del R. Millán, in: In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008

A Brain-Actuated Wheelchair: Asynchronous and Non-Invasive Brain-Computer Interfaces for Continuous Control of Robots, Ferran Galán, Marnix Nuttin, Eileen Lew, Pierre W. Ferrez, G. Vanacker, Johan Philips and José del R. Millán, in: Clinical Neurophysiology, 2008

Error-related EEG potentials in brain-computer interfaces, Pierre W. Ferrez, École Polytechnique Fédérale de Lausanne, 2007

EEG-Based Brain-Computer Interaction: Improved Accuracy by Automatic Single-Trial Error Detection, Pierre W. Ferrez and José del R. Millán, in: Advances in Neural Information Processing Systems 21, 2007

Simultaneous Real-Time Detection of Motor Imagery and Error-Related Potentials for Improved BCI Accuracy, Pierre W. Ferrez and José del R. Millán, in: Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008

Daily Routine Classification from Mobile Phone Data, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-62-2007

Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, Laurent Dollé, Mehdi Khamassi, Benoît Girard, Agnès Guillot and Ricardo Chavarriaga, in: Int Conf Spatial Cognition 2008, 2008

Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, Laurent Dollé, Mehdi Khamassi, Benoît Girard, Agnès Guillot and Ricardo Chavarriaga, Idiap-RR-48-2008

Asynchronous detection and classification of oscillatory brain activity, Ricardo Chavarriaga, Ferran Galán and José del R. Millán, Idiap-RR-36-2008

Asynchronous detection and classification of oscillatory brain activity, Ricardo Chavarriaga, Ferran Galán and José del R. Millán, in: 16 European Signal Processing Conference, 2008

Fast Approximate Spoken Term Detection from Sequence of Phonemes, Joel Praveen Pinto, Igor Szoke, S. R. Mahadeva Prasanna and Hynek Hermansky, in: Workshop on Searching Spontaneous Conversational Speech at SIGIR, 2008

Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, Joel Praveen Pinto and Hynek Hermansky, in: Proceedings of Interspeech, 2008

Fast Approximate Spoken Term Detection from Sequence of Phonemes, Joel Praveen Pinto, Igor Szoke, S. R. Mahadeva Prasanna and Hynek Hermansky, Idiap-RR-45-2008

Silence Models in Weighted Finite-State Transducers, Philip N. Garner, in: Interspeech, 2008

Predictive Models for Music, Jean-François Paiement, Yves Grandvalet and Samy Bengio, Idiap-RR-51-2008

Probabilistic Models for Melodic Prediction, Jean-François Paiement, Samy Bengio and Douglas Eck, Idiap-RR-50-2008

In-Context Phone Posteriors as Complementary Features for Tandem ASR, Hamed Ketabdar and Hervé Bourlard, in: ICSLP'08, 2008

Hierarchical Integration of Phonetic and Lexical Knowledge in Phone Posterior Estimation, Hamed Ketabdar and Hervé Bourlard, in: ICASSP'08, 2008

Enhanced Phone Posteriors for Improving Speech Recognition Systems, Hamed Ketabdar and Hervé Bourlard, Idiap-RR-39-2008

Recognition of Anticipatory Behavior from Human EEG, Gangadhar Garipelli, Ricardo Chavarriaga and José del R. Millán, Idiap-RR-52-2008

Recognition of Anticipatory Behavior from Human EEG, Gangadhar Garipelli, Ricardo Chavarriaga and José del R. Millán, in: In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008

Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, in: INTERSPEECH 2008, 2008

Hilbert Envelope Based Features for Far-Field Speech Recognition, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, in: MLMI 2008, 2008

Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, in: Interspeech 2008, 2008

Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, in: Interspeech 2008, 2008

Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, in: IEEE Signal Processing Letters, 2008

Hilbert Envelope Based Features for Far-Field Speech Recognition, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-42-2008

Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-41-2008

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, in: EUSIPCO 2008, 2008

Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, G. S. V. S. Sivaram and Hynek Hermansky, in: Interspeech 2008, 2008

Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, G. S. V. S. Sivaram and Hynek Hermansky, in: Proc. 16th European Signal Processing Conference (EUSIPCO), 2008

Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, Nicolas Scaringella, in: "Int. Conf. on Music Information Retrieval (ISMIR)", 2008

Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, Nicolas Scaringella, Idiap-RR-46-2008

Reverse Correlation for analyzing MLP Posterior Features in ASR, Joel Praveen Pinto, G. S. V. S. Sivaram and Hynek Hermansky, in: 11th International Conference on Text, Speech, and Dialogue, 2008

An Information Theoretic Approach to Speaker Diarization of Meeting Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-58-2008

Scene image classification and segmentation with quantized local descriptors and latent aspect modeling, Pedro Quelhas, École Polytechnique Fédérale de Lausanne, 2007

Bayesian methods for visual multi-object tracking with applications to human activity recognition, Kevin C. Smith, École Polytechnique Fédérale de Lausanne, 2007

CLIENT / WORLD MODEL SYNCHRONOUS ALIGNEMENT FOR SPEAKER VERIFICATION, Johnny Mariéthoz, Dominique Genoud, Frédéric Bimbot and Chafic Mokbel, Idiap-RR-23-1999

Benchmarking Non-Parametric Statistical Tests, Mikaela Keller, Samy Bengio and Siew Yeung Wong, Idiap-RR-38-2005

Multi-resolution Spectral Entropy Based Feature for Robust ASR, Hemant Misra, Shajith Ikbal, Sunil Sivadas and Hervé Bourlard, Idiap-RR-37-2004

Off-Line Cursive Script Recognition Based on Continuous Density HMM, Alessandro Vinciarelli and Juergen Luettin, Idiap-RR-25-1999

Discriminant linear processing of time-frequency plane, Fabio Valente and Hynek Hermansky, Idiap-RR-20-2006

Text Segmentation and Recognition in Complex Background Based on Markov Random Field, Datong Chen, Jean-Marc Odobez and Hervé Bourlard, Idiap-RR-17-2002

Nonlinear Spectral Transformations for Robust Speech Recognition, Shajith Ikbal, Hynek Hermansky and Hervé Bourlard, Idiap-RR-36-2003

Hand Posture Classification and Recognition using the Modified Census Transform, Agnès Just, Yann Rodriguez and Sébastien Marcel, Idiap-RR-02-2006

Test of several external posterior weighting functions for multiband Full Combination ASR, Hervé Glotin and Frédéric Berthommier, Idiap-RR-27-2000

Extracting Information from Multimedia Meeting Collections, Daniel Gatica-Perez, Dong Zhang and Samy Bengio, Idiap-RR-50-2005

Machine Learning Approaches to Text Representation using Unlabeled Data, Mikaela Keller, Idiap-RR-76-2006

Motion likelihood and proposal modeling in Model-Based Stochastic Tracking, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-61-2004

Low cost duration modelling for noise robust speech recognition, Andrew Morris, Simon Payne and Hervé Bourlard, Idiap-RR-08-2002

Indexing spoken audio by LSA and SOMs, Mikko Kurimo, Idiap-RR-06-2000

On the Complexity of Recognizing Regions Computable by Two-Layered Perceptrons, Eddy Mayoraz, Idiap-RR-03-1998

New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, Petr Fousek, Petr Svojanovsky, Frantisek Grezl and Hynek Hermansky, Idiap-RR-29-2004

Combining multiple tracking algorithms for improved general performance, Kim Shearer, Kirrily D Wong and Svetha Venkatesh, Idiap-RR-13-2000

A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-35-2005

Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, Idiap-RR-09-2004

Intrinsic dimension estimation of data: an approach based on Grassberger-Procaccia's algorithm, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-33-2000

Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framewor, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-26-2001

Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis, Jithendra Vepa and Simon King, Idiap-RR-34-2005

Automatic Speech Recognition: an Auditory Perspective, Nelson Morgan, Hervé Bourlard and Hynek Hermansky, Idiap-RR-17-1998

A State-of-the-art Neural Network for Robust Face Verification, Sébastien Marcel, Christine Marcel and Samy Bengio, Idiap-RR-36-2002

Robust Speech Recognition and Feature Extraction Using HMM2, Katrin Weber, Shajith Ikbal, Samy Bengio and Hervé Bourlard, Idiap-RR-42-2001

Face Verification Using Synthesized Non-Frontal Models, Conrad Sanderson and Samy Bengio, Idiap-RR-60-2003

Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, Astrid Hagen, Hervé Bourlard and Andrew Morris, Idiap-RR-05-2001

Investigating Lexical Substitution Scoring for Subtitle Generation, Oren Glickman, Ido Dagan, Mikaela Keller, Samy Bengio and Walter Daelemans, Idiap-RR-36-2006

Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, B. Fasel, Idiap-RR-51-2002

Robust Speaker Change Detection, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-39-2002

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, Iain A. McCowan, Andrew Morris and Hervé Bourlard, Idiap-RR-09-2002

Speechreading using Probabilistic Models, Juergen Luettin and Neil A. Thacker, Idiap-RR-12-1997

Handwritten Digit Recognition with Binary Optical Perceptron, Indu Saxena, Perry Moerland, Emile Fiesler and A. R. Pourzand, Idiap-RR-15-1997

Video OCR for Sport Video Annotation and Retrieval, Datong Chen and Hervé Bourlard, Idiap-RR-28-2001

Online Policy Adaptation for Ensemble Classifiers, Christos Dimitrakakis and Samy Bengio, Idiap-RR-69-2003

Exploring Contextual Information in a Layered Framework for Group Action Recognition, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, Idiap-RR-41-2006

Taking on the Curse of Dimensionality in Joint Distributions Using Neural Networks, Samy Bengio and Yoshua Bengio, Idiap-RR-01-2000

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, Shajith Ikbal, Mathew Magimai-Doss, Hemant Misra and Hervé Bourlard, Idiap-RR-20-2004

LP-TRAP: Linear predictive temporal patterns, Marios Athineos, Hynek Hermansky and Daniel P. W. Ellis, Idiap-RR-59-2004

Nearly optimal exploration-exploitation decision thresholds, Christos Dimitrakakis, Idiap-RR-12-2006

From missing data to maybe useful data: soft data modelling for noise robust ASR, Andrew Morris, Jon Barker and Hervé Bourlard, Idiap-RR-06-2001

Estimating the Intrinsic Dimension of Data with a Fractal-Based Method, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-02-2002

Multiple Hypotheses Video OCR, Datong Chen and Juergen Luettin, Idiap-RR-28-2000

On Use of Task Independent Training Data in Tandem Feature Extraction, Sunil Sivadas and Hynek Hermansky, Idiap-RR-57-2003

On the Use of Speech and Face Information for Identity Verification, Conrad Sanderson and Kuldip K. Paliwal, Idiap-RR-10-2004

A Multi-sample Multi-source Model for Biometric Authentication, Norman Poh, Samy Bengio and Jerzy Korczak, Idiap-RR-14-2002

A Statistical Significance Test for Person Authentication, Samy Bengio and Johnny Mariéthoz, Idiap-RR-83-2003

Audio-Visual Person Verification, Souheil Ben-Yacoub, Juergen Luettin, K. Jonsson, J. Matas and J. Kittler, Idiap-RR-18-1998

Comparison and Combination of Features in a Hybrid HMM/MLP and a HMM/GMM Speech Recognition System, Pere Pujol, Susagna Pol, Climent Nadeu, Astrid Hagen and Hervé Bourlard, Idiap-RR-48-2003

Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, Norman Poh, Samy Bengio and Arun Ross, Idiap-RR-04-2006

Truncation Confusion Patterns in Onset Consonants, Andrew Lovitt, Idiap-RR-05-2007

Constructing visual models with a latent space approach, Florent Monay, Pedro Quelhas, Daniel Gatica-Perez and Jean-Marc Odobez, Idiap-RR-14-2005

Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps, Norman Poh, Alvin Martin and Samy Bengio, Idiap-RR-60-2005

Adapted Generative Models For Face Verification, Fabien Cardinaux, Conrad Sanderson and Samy Bengio, Idiap-RR-76-2003

A New Margin-Based Criterion for Efficient Gradient Descent, Ronan Collobert and Samy Bengio, Idiap-RR-16-2003

Tracking the Multi Person Wandering Visual Focus of Attention, Kevin C. Smith, Silèye O. Ba, Daniel Gatica-Perez and Jean-Marc Odobez, Idiap-RR-80-2005

On the Complexity of Recognizing Iterated Differences of Polyhedra, Eddy Mayoraz, Idiap-RR-10-1997

An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, Frédéric Bimbot, Mats Blomberg, Louis Boves, Gérard Chollet, Cédric Jaboulet, Bruno Jacob, Jamal Kharroubi, Johan Koolwaaij, Johan Lindberg, Johnny Mariéthoz, Chafic Mokbel and Houda Mokbel, Idiap-RR-24-1999

Using pitch frequency information in speech recognition, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, Idiap-RR-23-2003

Juicer: A Weighted Finite-State Transducer speech decoder, Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng and Thomas Hain, Idiap-RR-21-2006

On Confusions in a Phoneme Recognizer, Andrew Lovitt, Joel Praveen Pinto and Hynek Hermansky, Idiap-RR-10-2007

Hierarchical Multi-Stream Posterior Based Speech Recognition System, Hamed Ketabdar, Hervé Bourlard and Samy Bengio, Idiap-RR-25-2005

A Hierarchical Keyframe User Interface for Browsing Video over the Internet, Maël Guillemot, Pierre Wellner, Daniel Gatica-Perez and Jean-Marc Odobez, Idiap-Com-02-2003

Using Auxiliary Sources of Knowledge for Automatic Speech Recognition, Mathew Magimai-Doss, Idiap-RR-90-2005

Using more informative posterior probabilities for speech recognition, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-91-2005

Towards Predicting Optimal Subsets of Base-Experts in Biometric Authentication Task, Norman Poh and Samy Bengio, Idiap-RR-17-2004

Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, Todd Andrew Stephenson, Hervé Bourlard, Samy Bengio and Andrew Morris, Idiap-RR-19-2000

Entropy-based Multi-stream Combination, Hemant Misra, Hervé Bourlard and Vivek Tyagi, Idiap-RR-31-2002

Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, Fabio Valente and Hynek Hermansky, Idiap-RR-61-2006

Multi-Modal Data Fusion for Person Authentication using SVM, Souheil Ben-Yacoub, Idiap-RR-07-1998

HMM2- A Novel Approach to HMM Emission Probability Estimation, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-30-2000

Multi-Person Tracking in Meetings: A Comparative Study, Kevin C. Smith, Sascha Schreiber, Vítezslav Beran, Igor Potúcek, Gerhard Rigoll and Daniel Gatica-Perez, Idiap-RR-38-2006

Finding groups of people in Google news, Dhiraj Joshi and Daniel Gatica-Perez, Idiap-RR-68-2005

Mutliscale Facial Expression Recognition using Convolutional Neural Networks, B. Fasel, Idiap-RR-52-2002

Automatic Facial Expression Analysis: A Survey, B. Fasel and Juergen Luettin, Idiap-RR-19-1999

Combinatorial Approach for Data Binarization, Eddy Mayoraz and Miguel Moreira, Idiap-RR-08-1999

Sociometry Based Multiparty Audio Recordings Segmentation, Alessandro Vinciarelli, Idiap-RR-78-2005

Experimental Protocol on the BANCA Database, Samy Bengio, Frédéric Bimbot, Johnny Mariéthoz, Vlad Popovici, F. Porée, E. Bailly-Baillière, G. Matas and B. Ruiz, Idiap-RR-05-2002

Robust Speech Recognition based on Multi-Stream Features, Stéphane Dupont, Hervé Bourlard and Christophe Ris, Idiap-RR-01-1997

Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, Jithendra Vepa and Simon King, Idiap-RR-26-2004

Modeling Individual and Group Actions in Meetings With Layered HMMs, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, Idiap-RR-33-2004

Measuring the Performance of Face Localization Systems, Yann Rodriguez, Fabien Cardinaux, Samy Bengio and Johnny Mariéthoz, Idiap-RR-53-2005

Tracking People in Meetings with Particles, Daniel Gatica-Perez, Jean-Marc Odobez, Silèye O. Ba, Kevin C. Smith and Guillaume Lathoud, Idiap-RR-71-2004

Spectral Entropy Based Feature for Robust ASR, Hemant Misra, Shajith Ikbal, Hervé Bourlard and Hynek Hermansky, Idiap-RR-56-2003

Text Identification in Complex Background using SVM, Datong Chen, Hervé Bourlard and Jean-Philippe Thiran, Idiap-RR-20-2001

Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, Agnès Just, O. Bernier and Sébastien Marcel, Idiap-RR-63-2003

A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems, Johnny Mariéthoz and Samy Bengio, Idiap-RR-77-2005

Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model, S. Moeller and Hervé Bourlard, Idiap-RR-17-2001

The Expected Performance Curve, Samy Bengio, Mikaela Keller and Johnny Mariéthoz, Idiap-RR-85-2003

A new normalization technique for cursive handwritten words, Alessandro Vinciarelli and Juergen Luettin, Idiap-RR-32-2000

Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, David Barber and Silvia Chiappa, Idiap-RR-50-2006

Text detection and recognition in images and video sequences, Datong Chen, Idiap-RR-44-2003

Face Authentication Using Adapted Local Binary Pattern Histograms, Yann Rodriguez and Sébastien Marcel, Idiap-RR-06-2006

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, Yves Grandvalet, Johnny Mariéthoz and Samy Bengio, Idiap-RR-26-2005

Boosting Pixel-based Classifiers for Face Verification, Yann Rodriguez and Sébastien Marcel, Idiap-RR-65-2003

Audio visual speech recognition, C. Neti, G. Potamianos, Juergen Luettin, I. Matthews, Hervé Glotin, D. Vergyri, J. Sison and A. Mashari, Idiap-RR-35-2000

Modeling Human Interaction in Meetings, Iain A. McCowan, Samy Bengio, Daniel Gatica-Perez, Guillaume Lathoud, Florent Monay, Darren Moore, Pierre Wellner and Hervé Bourlard, Idiap-RR-59-2002

Segmenting Multiple Concurrent Speakers Using Microphone Arrays, Guillaume Lathoud, Iain A. McCowan and Darren Moore, Idiap-RR-21-2003

Localized mixtures of experts, Perry Moerland, Idiap-RR-14-1998

Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-44-2004

Non-Linear Variance Reduction Techniques in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-26-2003

Multimodal Integration for Meeting Group Action Segmentation and Recognition, Marc Al-Hames, Alfred Dielmann, Daniel Gatica-Perez, Stephan Reiter, Steve Renals and Dong Zhang, Idiap-RR-31-2005

Microphone Array Speech Recognition : Experiments on Overlapping Speech in Meetings, Darren Moore and Iain A. McCowan, Idiap-RR-41-2002

Improving Face Verification using Skin Color Information, Sébastien Marcel and Samy Bengio, Idiap-RR-44-2001

Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, Giulia Bernardis and Hervé Bourlard, Idiap-RR-11-1998

Automatic Analysis of Multimodal Group Actions in Meetings, Iain A. McCowan, Daniel Gatica-Perez, Samy Bengio, Guillaume Lathoud, Mark Barnard and Dong Zhang, Idiap-RR-27-2003

Detecting Abandoned Luggage Items in a Public Space, Kevin C. Smith, Pedro Quelhas and Daniel Gatica-Perez, Idiap-RR-39-2006

Microphone Array Post-filter based on Noise Field Coherence, Iain A. McCowan and Hervé Bourlard, Idiap-RR-40-2001

Speech Recognition Using Advanced HMM2 Features, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-24-2001

Analyzing Group Interactions in Conversations: a Review, Daniel Gatica-Perez, Idiap-RR-63-2006

Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, Daniel Gatica-Perez, Ming-Ting Sun and Alexander Loui, Idiap-RR-11-2002

Scalability Analysis of Audio-Visual Person Identity Verification, J. Czyz, Samy Bengio, Christine Marcel and L. Vandendorpe, Idiap-RR-04-2003

2D Multi-Person Tracking: A Comparative Study in AMI Meetings, Kevin C. Smith, Sascha Schreiber, Vítezslav Beran, Igor Potúcek, Gerhard Rigoll and Daniel Gatica-Perez, Idiap-RR-37-2006

Effect of Segmentation Method on Video Retrieval Performance, David Grangier and Alessandro Vinciarelli, Idiap-RR-83-2004

Text Enhancement with Asymmetric Filter for Video OCR, Datong Chen, Kim Shearer and Hervé Bourlard, Idiap-RR-19-2001

Clustering And Segmenting Speakers And Their Locations In Meetings, Jitendra Ajmera, Guillaume Lathoud and Iain A. McCowan, Idiap-RR-55-2003

MAP Combination of Multi-Stream HMM or HMM/ANN Experts, Andrew Morris, Astrid Hagen and Hervé Bourlard, Idiap-RR-14-2001

Linking Objects in Videos by Importance Sampling, Daniel Gatica-Perez and Ming-Ting Sun, Idiap-RR-20-2002

Improving Face Authetication Using Virtual Samples, Norman Poh, Sébastien Marcel and Samy Bengio, Idiap-RR-40-2002

A Symmetric Transformation for LDA-based Face Verification, Sébastien Marcel, Idiap-RR-67-2003

Detecting Group Interest-level in Meetings, Daniel Gatica-Perez, Iain A. McCowan, Dong Zhang and Samy Bengio, Idiap-RR-51-2004

An Implicit Motion Likelihood for Tracking with Particle Filters, Jean-Marc Odobez, Silèye O. Ba and Daniel Gatica-Perez, Idiap-RR-15-2003

A Parallel Mixture of SVMs for Very Large Scale Problems, Ronan Collobert, Samy Bengio and Yoshua Bengio, Idiap-RR-12-2001

On Performance Evaluation of Face Detection and Localization Algorithms, Vlad Popovici, Yann Rodriguez, Jean-Philippe Thiran and Sébastien Marcel, Idiap-RR-80-2003

Improved Pairwise Coupling Classification With Correcting Classifiers, Miguel Moreira and Eddy Mayoraz, Idiap-RR-09-1997

Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, Norman Poh and Samy Bengio, Idiap-RR-01-2004

User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-10-2002

Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, Frédéric Bimbot and Dominique Genoud, Idiap-RR-05-1997

Improving Speech Recognition Using a Data-Driven Approach, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-66-2005

Robust speech recognition based on multi-stream processing, Astrid Hagen, Idiap-RR-41-2001

Multi-stream ASR: Oracle Test and Embedded Training, Hemant Misra, Jithendra Vepa and Hervé Bourlard, Idiap-RR-62-2005

Object Localization in Metric Spaces for Video Linking, Daniel Gatica-Perez and Ming-Ting Sun, Idiap-RR-09-2003

Modeling Interactions from Email Communication, Dong Zhang, Daniel Gatica-Perez, Deb Roy and Samy Bengio, Idiap-RR-51-2005

On Automatic Annotation of Images with Latent Space Models, Florent Monay and Daniel Gatica-Perez, Idiap-RR-31-2003

An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-60-2006

Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, Alessandro Vinciarelli and Samy Bengio, Idiap-RR-46-2001

Phase AutoCorrelation (PAC) derived Robust Speech Features, Shajith Ikbal, Hemant Misra and Hervé Bourlard, Idiap-RR-38-2002

Face Authentication Based on Local Features and Generative Models, Fabien Cardinaux, Idiap-RR-85-2005

Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models, Alessandro Vinciarelli, Samy Bengio and Horst Bunke, Idiap-RR-22-2003

Information Fusion and Person Verification Using Speech & Face Information, Conrad Sanderson and Kuldip K. Paliwal, Idiap-RR-33-2002

A Probabilistic Model for Chord Progressions, Jean-François Paiement, Douglas Eck and Samy Bengio, Idiap-RR-57-2005

Assessing Scene Structuring in Consumer Videos, Daniel Gatica-Perez, Napat Triroj, Jean-Marc Odobez, Alexander Loui and Ming-Ting Sun, Idiap-RR-11-2004

Microphone Array Post-filter for Diffuse Noise Field, Iain A. McCowan and Hervé Bourlard, Idiap-RR-39-2001

Nonlinear Feature Transformations for Noise Robust Speech Recognition, Shajith Ikbal, Idiap-RR-70-2004

Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-25-2002

An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, Samy Bengio, Idiap-RR-26-2002

Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting, Joel Praveen Pinto, Andrew Lovitt and Hynek Hermansky, Idiap-RR-11-2007

Phoneme-Grapheme Based Speech Recognition System, Mathew Magimai-Doss, Todd Andrew Stephenson, Hervé Bourlard and Samy Bengio, Idiap-RR-37-2003

Multi-stream Processing for Noise Robust Speech Recognition, Hemant Misra, Idiap-RR-28-2006

PhD Thesis: Speech Analysis with Production Constraints, Sacha Krstulović, Idiap-RR-35-2001

The ami meeting corpus: a pre-announcement, Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Maël Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain A. McCowan, Wilfried Post, Dennis Reidsma and Pierre Wellner, Idiap-RR-82-2005

An Investigation of Spectral Subband Centroids for Speaker Authentication, Norman Poh, Conrad Sanderson and Samy Bengio, Idiap-RR-62-2003

Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, Shajith Ikbal, Hemant Misra, Hervé Bourlard and Hynek Hermansky, Idiap-RR-54-2003

Boosting word error rates, Christos Dimitrakakis and Samy Bengio, Idiap-RR-49-2004

Offline Recognition of Large Vocabulary Cursive Handwritten Text, Alessandro Vinciarelli, Samy Bengio and Horst Bunke, Idiap-RR-01-2003

Gradient estimates of return, Christos Dimitrakakis and Samy Bengio, Idiap-RR-29-2005

Entropy Based Combination of Tandem Representations for Noise Robust ASR, Shajith Ikbal, Hemant Misra, Sunil Sivadas, Hynek Hermansky and Hervé Bourlard, Idiap-RR-19-2004

Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, Francesco Camastra, Marco Spinetti and Alessandro Vinciarelli, Idiap-RR-79-2005

Robust Face Analysis using Convolutional Neural Networks, B. Fasel, Idiap-RR-48-2001

A Comparative Study of Adaptation Methods for Speaker Verification, Johnny Mariéthoz and Samy Bengio, Idiap-RR-34-2001

PLSA-based Image Auto-Annotation: Constraining the Latent Space, Florent Monay and Daniel Gatica-Perez, Idiap-RR-30-2004

Local Binary Patterns as an Image Preprocessing for Face Authentication, Guillaume Heusch, Yann Rodriguez and Sébastien Marcel, Idiap-RR-76-2005

A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, Guillaume Lathoud and Mathew Magimai-Doss, Idiap-RR-54-2004

Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, Astrid Hagen and Andrew Morris, Idiap-RR-21-2000

Increasing Speech Recognition Noise Robustness with HMM2, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-36-2001

Multimodal Multispeaker Probabilistic Tracking in Meetings, Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez and Iain A. McCowan, Idiap-RR-66-2004

Speaker Normalization using HMM2, Shajith Ikbal, Katrin Weber and Hervé Bourlard, Idiap-RR-15-2002

Bayesian Factorial Linear Gaussian State-Space Models for Biosignal Decomposition, Silvia Chiappa and David Barber, Idiap-RR-84-2005

Developing and Enhancing Posterior Based Speech Recognition Systems, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-23-2005

A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, Jean-François Paiement, Douglas Eck, Samy Bengio and David Barber, Idiap-RR-33-2005

Modelling Auxiliary Features in Tandem Systems, Mathew Magimai-Doss, Todd Andrew Stephenson, Shajith Ikbal and Hervé Bourlard, Idiap-RR-21-2004

Modeling Scenes with Local Descriptors and Latent Aspects, Pedro Quelhas, Florent Monay, Jean-Marc Odobez, Daniel Gatica-Perez, Tinne Tuytelaars and Luc Van Gool, Idiap-RR-79-2004

Confidence Evaluation for Risk Prediction, Nicolas Gilardi, Tom Melluish and Michel Maignan, Idiap-RR-22-2001

Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, B. Fasel, Idiap-RR-49-2001

Infinite Models for Speaker Clustering, Fabio Valente, Idiap-RR-19-2006

Continuous Audio-Visual Speech Recognition, Juergen Luettin and Stéphane Dupont, Idiap-RR-02-1998

Indexing Audio Documents by using Latent Semantic Analysis and SOM, Mikko Kurimo, Idiap-RR-13-1999

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, Mathew Magimai-Doss, Samy Bengio and Hervé Bourlard, Idiap-RR-52-2003

Using Pitch as Prior Knowledge in Template-Based Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-65-2005

EEG pattern recognition through multi-stream evidence combination, Andrew Morris, Bernhard Obermaier and Gert Pfurtscheller, Idiap-RR-31-2001

Writer Identification for Smart Meeting Room Systems, Marcus Liwicki, Andreas Schlapbach, Horst Bunke, Samy Bengio, Johnny Mariéthoz and Jonas Richiardi, Idiap-RR-70-2005

Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, Astrid Hagen and Hervé Bourlard, Idiap-RR-10-2001

EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-01-2005

Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, Corinne Fredouille, Johnny Mariéthoz, Cédric Jaboulet, Jean Hennebert, Chafic Mokbel and Frédéric Bimbot, Idiap-RR-02-2000

On Performance / Robustness / Complexity Trade-Offs in Face Verification, Conrad Sanderson, Fabien Cardinaux and Samy Bengio, Idiap-RR-74-2004

A Neural Network for Text Representation, Mikaela Keller and Samy Bengio, Idiap-RR-12-2005

Video Indexing and Similarity Retrieval by Largest Common Subgraph Detection using Decision Trees, Kim Shearer, Horst Bunke and Svetha Venkatesh, Idiap-RR-15-2000

Learning the Decision Function for Speaker Verification, Samy Bengio and Johnny Mariéthoz, Idiap-RR-40-2000

Spectral Entropy Feature in Full-Combination Multi-stream for Robust ASR, Hemant Misra and Hervé Bourlard, Idiap-RR-10-2005

Evaluation of Formant-Like Features for ASR, Katrin Weber, F. de Wet, B. Cranen, Louis Boves, Samy Bengio and Hervé Bourlard, Idiap-RR-04-2002

Multi-Modal Audio-Visual Event Recognition for Football Analysis, Mark Barnard, Jean-Marc Odobez and Samy Bengio, Idiap-RR-12-2003

On the Decomposition of Polychotomies into Dichotomies, Eddy Mayoraz and Miguel Moreira, Idiap-RR-08-1996

On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, Vivek Tyagi, Hervé Bourlard and Christian Wellekens, Idiap-RR-09-2005

A Color and Gradient Local Descriptor Fusion Scheme For Object Recognition, Pedro Quelhas and Jean-Marc Odobez, Idiap-RR-71-2003

Tangent Vector Kernels for Invariant Image Classification with SVMs, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-75-2003

Fast latent semantic indexing of spoken documents by using self-organizing maps, Mikko Kurimo, Idiap-RR-20-1999

A Discriminative Approach for the Retrieval of Images from Text Queries, David Grangier, Florent Monay and Samy Bengio, Idiap-RR-15-2006

Audio-visual probabilistic tracking of multiple speakers in meetings, Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez and Iain A. McCowan, Idiap-RR-27-2005

Semi-supervised Meeting Event Recognition with Adapted HMMs, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, Idiap-RR-15-2005

Joint Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba, Idiap-RR-28-2005

Short-Term Spatio-Temporal Clustering of Sporadic and Concurrent Events, Guillaume Lathoud, Iain A. McCowan and Jean-Marc Odobez, Idiap-RR-14-2004

HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, Silvia Chiappa and Samy Bengio, Idiap-RR-49-2003

Person Authentication using Brainwaves (EEG) and Maximum A Posteriori Model Adaptation, Sébastien Marcel and José del R. Millán, Idiap-RR-81-2005

User Authentication via Adapted Statistical Models of Face Images, Fabien Cardinaux, Conrad Sanderson and Samy Bengio, Idiap-RR-38-2004

New Approaches Towards Robust and Adaptive Speech Recognition, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-01-2001

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-23-2004

An Optical Thresholding Perceptron, Indu Saxena, Perry Moerland, Emile Fiesler, A. R. Pourzand and N. Collings, Idiap-RR-16-1997

A Neural Network to Retrieve Images from Text Queries, David Grangier and Samy Bengio, Idiap-RR-33-2006

Client Dependent GMM-SVM Models for Speaker Verification, Quan Le and Samy Bengio, Idiap-RR-03-2003

Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-44-2002

A Robust Speaker Clustering Algorithm, Jitendra Ajmera and Charles Wooters, Idiap-RR-38-2003

Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, Guillaume Lathoud, Mathew Magimai-Doss, Jean-Marc Odobez and Hervé Bourlard, Idiap-RR-52-2005

Data binarization by discriminant elimination, Miguel Moreira, Alain Hertz and Eddy Mayoraz, Idiap-RR-04-1999

A survey on Off-Line Cursive Word Recognition, Alessandro Vinciarelli, Idiap-RR-43-2000

Sector-Based Detection for Hands-Free Speech Enhancement in Cars, Guillaume Lathoud, Julien Bourgeois and Jürgen Freudenberger, Idiap-RR-67-2004

Noisy Text Categorization, Alessandro Vinciarelli, Idiap-RR-61-2003

Links between Perceptrons, MLPs and SVMs, Ronan Collobert and Samy Bengio, Idiap-RR-06-2004

Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, Kim Shearer, Chitra Dorai and Svetha Venkatesh, Idiap-RR-14-2000

Estimating the Quality of Face Localization for Face Verification, Yann Rodriguez, Fabien Cardinaux, Samy Bengio and Johnny Mariéthoz, Idiap-RR-07-2004

A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan and Jean-Marc Odobez, Idiap-RR-25-2003

Theme Topic Mixture Model: A Graphical Model for Document Representation, Mikaela Keller and Samy Bengio, Idiap-RR-05-2004

Application of Information Retrieval Techniques to Single Writer Documents, Alessandro Vinciarelli, Idiap-RR-12-2004

A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Contrast Independent Features and Machine Learning Methods, Datong Chen and Jean-Marc Odobez, Idiap-RR-42-2003

A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-10-2006

Learning influence among interacting Markov chains, Dong Zhang, Daniel Gatica-Perez, Samy Bengio and Deb Roy, Idiap-RR-48-2005

Video Text Segmentation Using Particle Filters, Datong Chen and Jean-Marc Odobez, Idiap-RR-43-2003

Statistical Transformation Techniques for Face Verification Using Faces Rotated in Depth, Conrad Sanderson and Samy Bengio, Idiap-RR-04-2004

Modeling Auxiliary Information in Bayesian Network Based ASR, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-11-2001

Boosting HMMs with an application to speech recognition, Christos Dimitrakakis and Samy Bengio, Idiap-RR-41-2003

User-Customized Password Speaker Verification Using Multiple Reference and Background Models, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-41-2004

EEG Classification using Generative Independent Component Analysis, Silvia Chiappa and David Barber, Idiap-RR-77-2004

Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering, I. Lapidot and H. Guterman, Idiap-RR-48-2002

More Efficiency in Multiple Kernel Learning, Alain Rakotomamonjy, Francis Bach, Stéphane Canu and Yves Grandvalet, Idiap-RR-18-2007

Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, Norman Poh and Samy Bengio, Idiap-RR-59-2003

Analysis and Classification of EEG Signals using Probabilistic Models for Brain Computer Interfaces, Silvia Chiappa, Idiap-RR-48-2006

[URL]

Natural Scene Image Modeling using Color and Texture Visterms., Pedro Quelhas and Jean-Marc Odobez, Idiap-RR-17-2006

Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, Alessandro Vinciarelli and Samy Bengio, Idiap-RR-15-2001

The segmentation of multi-channel meeting recordings for automatic speech recognition, John Dines, Jithendra Vepa and Thomas Hain, Idiap-RR-22-2006

Location Based Speaker Segmentation, Guillaume Lathoud and Iain A. McCowan, Idiap-RR-43-2002

Online Policy Adaptation for Ensemble Classifiers, Christos Dimitrakakis and Samy Bengio, Idiap-RR-69-2003

A Frequency-Domain Silence Noise Model, Guillaume Lathoud, Mathew Magimai-Doss and Bertrand Mesot, Idiap-RR-13-2005

Sociometry Based Multiparty Audio Recordings Summarization, Alessandro Vinciarelli, Idiap-RR-27-2006

Learning the structure of image collections with latent aspect models, Florent Monay, Idiap-RR-06-2007

HMM Mixtures (HMM2) for Robust Speech Recognition, Katrin Weber, Idiap-RR-34-2003

A Probabilistic Framework for Joint Head Tracking and Pose Estimation, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-78-2003

Finding Structure in Consumer Videos by Probabilistic Hierarchical Clustering, Daniel Gatica-Perez, Alexander Loui and Ming-Ting Sun, Idiap-RR-22-2002

Illumination-robust Pattern Matching Using Distorted Color Histograms, Georg Thimm and Juergen Luettin, Idiap-RR-09-1998

Multi-resolution RASTA filtering for TANDEM-based ASR, Hynek Hermansky and Petr Fousek, Idiap-RR-18-2005

Hidden Markov Models and other Finite State Automata for Sequence Processing, Hervé Bourlard and Samy Bengio, Idiap-RR-37-2001

A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, Norman Poh and Samy Bengio, Idiap-RR-68-2004

Local Machine Learning Models for Spatial Data Analysis, Nicolas Gilardi and Samy Bengio, Idiap-RR-34-2000

Multimodal Authentication using Asynchronous HMMs, Samy Bengio, Idiap-RR-02-2003

Fusion of Face and Speech Data for Person Identity Verification, Souheil Ben-Yacoub, Yousri Abdeljaoued and Eddy Mayoraz, Idiap-RR-03-1999

Cursive Character Recognition by Learning Vector Quantization, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-47-2000

Role Recognition in Broadcast News Using Social Network Analysis and Duration Distribution Modeling, Alessandro Vinciarelli, Idiap-RR-35-2006

How Do Correlation and Variance of Base-Experts Affect Fusion in Biometric Authentication Tasks?, Norman Poh and Samy Bengio, Idiap-RR-18-2004

Detection and Application of Influence Rankings in Small Group Meetings, Rutger Rienks, Dong Zhang, Daniel Gatica-Perez and Wilfried Post, Idiap-RR-49-2006

Confidence Measures for Multimodal Identity Verification, Samy Bengio, Christine Marcel, Sébastien Marcel and Johnny Mariéthoz, Idiap-RR-38-2001

Unsupervised Spectral Substraction for Noise-Robust ASR, Guillaume Lathoud, Mathew Magimai-Doss, Bertrand Mesot and Hervé Bourlard, Idiap-RR-42-2005

Neural Networks in Automatic Speech Recognition, F. Beaufays, Hervé Bourlard, H. Franco and Nelson Morgan, Idiap-RR-09-2001

Application of Information Retrieval Technologies to Presentation Slides, Alessandro Vinciarelli and Jean-Marc Odobez, Idiap-RR-36-2005

Speech Coding based on Spectral Dynamics, Petr Motlicek, Hynek Hermansky, Harinath Garudadri and Naveen Srinivasamurthy, Idiap-RR-05-2006

Evaluation of Multiple Cues Head Pose Tracking Algorithm in Indoor Environments, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-05-2005

Mixtures of Experts Estimate A Posteriori Probabilities, Perry Moerland, Idiap-RR-07-1997

Effect of Recognition Errors on Information Retrieval Performance, Alessandro Vinciarelli, Idiap-RR-08-2004

Posterior Based Keyword Spotting with A Priori Thresholds, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-67-2006

Image Classification by Neural Networks for the Quality Control of Watches, Miguel Moreira, Emile Fiesler and Gianni Pante, Idiap-RR-10-1996

Latent Semantic Indexing by Self-Organizing Map, Mikko Kurimo and Chafic Mokbel, Idiap-RR-12-1999

Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, Jean-Marc Odobez and Datong Chen, Idiap-RR-18-2002

PLP$^2$: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns, Marios Athineos, Hynek Hermansky and Daniel P. W. Ellis, Idiap-RR-60-2004

Confusion matrix based posterior probabilities correction, Andrew Morris and Hemant Misra, Idiap-RR-53-2002

Recognition of Asymmetric Facial Action Unit Activities and Intensities, B. Fasel and Juergen Luettin, Idiap-RR-22-1999

Using Posterior-Based Features in Template Matching for Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-23-2006

Combining Neural Gas and Learning Vector Quantization for Cursive Character Recognition, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-18-2001

An Investigation of F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-46-2004

Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, Pedro Quelhas and James Boyce, Idiap-RR-58-2003

A supervised learning approach based on STDP and polychronization in spiking neuron networks, Hélène Paugam-Moisy, R. Martinez and Samy Bengio, Idiap-RR-54-2006

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-53-2003

Sparse Probabilistic Classifiers, Romain Hérault and Yves Grandvalet, Idiap-RR-19-2007

Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, Astrid Hagen and Hervé Bourlard, Idiap-RR-22-2000

AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, Guillaume Lathoud, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-28-2004

Semi-supervised Adapted HMMs for Unusual Event Detection, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, Idiap-RR-80-2004

Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, Vivek Tyagi, Iain A. McCowan, Hervé Bourlard and Hemant Misra, Idiap-RR-47-2003

Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, Dong Zhang, S. Z. Li and Daniel Gatica-Perez, Idiap-RR-70-2003

Speech Acquisition in Meetings with an Audio-Visual Sensor Array, Iain A. McCowan, Maganti Hari Krishna, Daniel Gatica-Perez, Darren Moore and Silèye O. Ba, Idiap-RR-03-2005

Embedding Motion in Model-Based Stochastic Tracking, Jean-Marc Odobez, Daniel Gatica-Perez and Silèye O. Ba, Idiap-RR-72-2003

Noisy Text Categorization, Alessandro Vinciarelli, Idiap-RR-03-2004

A Meeting Browser Evaluation Test, Pierre Wellner, Mike Flynn, Simon Tucker and Steve Whittaker, Idiap-RR-02-2005

On the Combination of Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-19-2003

On automatic annotation of meeting databases, Daniel Gatica-Perez, Iain A. McCowan, Mark Barnard, Samy Bengio and Hervé Bourlard, Idiap-RR-06-2003

An Online Audio Indexing System, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-39-2003

Speech recognition with auxiliary information, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-58-2002

Inferring Document Similarity from Hyper-links, David Grangier and Samy Bengio, Idiap-RR-21-2005

Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-45-2001

HMM2- Extraction of Formant Features and their Use for Robust ASR, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-42-2000

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, Guillaume Lathoud and Iain A. McCowan, Idiap-RR-15-2004

Audio-Visual Speaker Tracking with Importance Particle Filters, Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan, Jean-Marc Odobez and Darren Moore, Idiap-RR-37-2002

On Factorizing Spectral Dynamics for Robust Speech Recognition, Vivek Tyagi, Iain A. McCowan, Hervé Bourlard and Hemant Misra, Idiap-RR-32-2003

Large Scale Machine Learning, Ronan Collobert, Idiap-RR-42-2004

Acoustic-Labial Speaker Verification, Pierre Jourlin, Juergen Luettin, Dominique Genoud and H. Wassner, Idiap-RR-13-1997

Text dependent speaker verification using binary classifiers, Dominique Genoud, Miguel Moreira and Eddy Mayoraz, Idiap-RR-08-1997

Towards Robust and Adaptive Speech Recognition Models, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-01-2002

Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, Michael McGreevy, Idiap-RR-55-2004

Face Verification using MLP and SVM, Fabien Cardinaux and Sébastien Marcel, Idiap-RR-21-2002

Conditional Gaussian Mixture Models for Environmental Risk Mapping, Nicolas Gilardi, Samy Bengio and Mikhail Kanevski, Idiap-RR-12-2002

User-Customized Password HMM Based Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-35-2002

Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, Johnny Mariéthoz and Frédéric Bimbot, Idiap-RR-08-2000

Unknown-Multiple Speaker clustering using HMM, Jitendra Ajmera, Hervé Bourlard, I. Lapidot and Iain A. McCowan, Idiap-RR-07-2002

Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-63-2004

Multiple Timescale Feature Combination towards Robust Speech Recognition, Katrin Weber, Idiap-RR-29-2000

A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, Johnny Mariéthoz, Johan Lindberg and Frédéric Bimbot, Idiap-RR-48-2000

Multimodal Group Action Clustering in Meetings, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, Idiap-RR-24-2004

A Unified Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification, Johnny Mariéthoz and Samy Bengio, Idiap-RR-62-2004

The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), Hynek Hermansky, Petr Fousek and Mikko Lehtonen, Idiap-RR-63-2005

Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-45-2002

Indexation de Documents Manuscrits, Alessandro Vinciarelli, Idiap-RR-31-2006

From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, Astrid Hagen, Andrew Morris and Hervé Bourlard, Idiap-RR-20-2000

Using RASTA in task independent TANDEM feature extraction, Guillermo Aradilla, John Dines and Sunil Sivadas, Idiap-RR-22-2004

On Spectral Methods and the Structuring of Home Videos, Jean-Marc Odobez, Daniel Gatica-Perez and Maël Guillemot, Idiap-RR-55-2002

Data utility modelling for mismatch reduction, Andrew Morris, Idiap-RR-30-2001

Order Matters: A Distributed Sampling Method for Multi-Object Tracking, Kevin C. Smith, Idiap-RR-25-2004

Integrating co-occurrence and spatial contexts on patch-based scene segmentation, Florent Monay, Pedro Quelhas, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-30-2005

Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, Todd Andrew Stephenson, Jaume Escofet, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-24-2002

The Expected Performance Curve: a New Assessment Measure for Person Authentication, Samy Bengio and Johnny Mariéthoz, Idiap-RR-84-2003

Robust HMM-Based Speech/Music Segmentation, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-33-2001

A neural network for classification with incomplete data, Andrew Morris, Idiap-RR-23-2000

TRAP-TANDEM: Data-driven extraction of temporal features from speech, Hynek Hermansky, Idiap-RR-50-2003

Learning to Retrieve Images from Text Queries with a Discriminative Model, David Grangier, Florent Monay and Samy Bengio, Idiap-RR-32-2006

Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, Sébastien Marcel, Johnny Mariéthoz, Yann Rodriguez and Fabien Cardinaux, Idiap-RR-18-2006

Swiss French PolyPhone and PolyVar: telephone speech databases to model inter- and intra-speaker variability, Gérard Chollet, Jean-Luc Cochard, Andrei Constantinescu, Cédric Jaboulet and Philippe Langlais, Idiap-RR-01-1996

Supervised Ontogenic Networks, Emile Fiesler and K. Cios, in: Handbook of Neural Computation, 1996

Superceptron Construction, R. Visscher, Emile Fiesler and Georg Thimm, in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996

Sun Workstation and SwissNet Platform for Speech Recognition and Speaker Verification over the Telephone, Andrzej Drygajlo, Jean-Luc Cochard, Gérard Chollet, Olivier Bornet and Philippe Renevey, in: Proceedings of Workstations und ihre Anwendungen, SIWORK'96, 1996

Statistical lip modelling for visual speech recognition, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Proceedings of the 8th European Signal Processing Conference (Eusipco'96), 1996

Speaker-Dependent Speech Recognition Based on Phone-Like Units Models --- Application to Voice Dialing, Vincent Fontaine and Hervé Bourlard, Idiap-RR-09-1996

Speaker identification by lipreading, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), 1996

Speachreading using shape and intensity information, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), 1996

Sparse Initial Topologies for High Order Perceptrons, Andrea De Pol, Georg Thimm and Emile Fiesler, in: Proceedings of the International Conference on Neural Networks, IEEE, 1996

Semi-automatic HMM-based annotation of the PolyCOST Database, Dijana Petrovska-Delacretaz, Jean Hennebert, Dominique Genoud and Gérard Chollet, in: Application of speaker recognition techniques in telephony, COST250, 1996

Secured vocal access to telephone servers, Olivier Bornet, Gérard Chollet, Jean-Luc Cochard, Andrei Constantinescu and Dominique Genoud, in: Proceedings of IVTTA 1996 IEEE Third Workshop Interactive Voice Technology for Telecommunications Applications, 1996

Secured vocal access to telephone servers, Olivier Bornet, Gérard Chollet, Jean-Luc Cochard, Andrei Constantinescu and Dominique Genoud, Idiap-RR-04-1996

Reconnaissance et compréhension de la parole: évaluation et applications, F. Néel, Gérard Chollet, F. Lamel, W. Minker and Andrei Constantinescu, in: Fondements et perspectives en traitement automatique de la parole, AUPELF -- UREF, 1996

Présentation du Modèle DRM, Sacha Krstulović, Idiap-Com-03-1996

Polycost Database, Dominique Genoud, Jean Hennebert and H. Melin, 1996

Overcoming Inaccuracies in Optical Multilayer Perceptrons, Perry Moerland, Emile Fiesler and Indu Saxena, in: Proceedings of the First International Symposium on Neuro-Fuzzy Systems (AT'96), Lausanne, Switzerland, AATI, 1996

On Variations of the Convex Hull Operator, Eddy Mayoraz, Idiap-RR-06-1996

On the Power of Democratic Networks, Eddy Mayoraz, in: SIAM Journal of Discr. Math, 9(02), 1996

On the Complexity of the Class of Regions Computable by a Two-Layered Perceptron, Eddy Mayoraz, Idiap-RR-03-1996

New time-frequency derived cepstral coefficients for automatic speech recognition, Hubert Wassner and Gérard Chollet, in: Proceedings of the 8th European Signal Processing Conference (Eusipco'96), 1996

Neural Network Topologies, Emile Fiesler, in: Handbook of Neural Computation, 1996

Neural Network Pruning and Pruning Parameters, Georg Thimm and Emile Fiesler, in: The 1st Workshop on Soft Computing, Dept. of Information Electronics Nagoya University, 1996

Multi-Stream Speech Recognition, Hervé Bourlard, Stéphane Dupont and Christophe Ris, Idiap-RR-07-1996

Multi-modal person verification tools using speech and images, M. Acheroy et al., in: European Conference on Multimedia Applications, Services and Techniques, 1996

Machine Recognition and Applications, Juergen Luettin, Michael Vogt and Christoph Bregler, in: Speechreading by Humans and Machines, Springer Verlag, 1996

Locating and tracking facial speech features, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Proceedings of the International Conference on Pattern Recognition (ICPR'96), IAPR, 1996

Learning to recognise talking faces, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Proceedings of the International Conference on Pattern Recognition (ICPR'96), IAPR, 1996

Incorporation of Liquid-Crystal Light Valve Non-Linearities in Optical Multilayer Neural Networks, Perry Moerland, Emile Fiesler and Indu Saxena, in: Applied Optics, 35(26), 1996

Image Classification by Neural Networks for the Quality Control of Watches, Miguel Moreira, Emile Fiesler and Gianni Pante, in: Proceedings ISAI /IFIS 1996, ITESM, Cancun, Mexico, ITESM, 1996

Hardware-Friendly Learning Algorithms for Neural Networks: An Overview, Perry Moerland and Emile Fiesler, in: Proceedings of the Fifth International Conference on Microelectronics for Neural Networks and Fuzzy Systems: MicroNeuro'96, EPFL and CSEM, Lausanne, Switzerland, IEEE Computer Society Press, 1996

Handbook of Neural Computation, Institute of Physics and Oxford University Press, The Computational Intelligence Library, 1996

Generalized Cauchy Machines, S. Cuche and Emile Fiesler, in: Neurocomputing, 1996

Finding Lines Under Bounded Error, Thomas M. Breuel, in: Pattern Recognition, 29(01), 1996

Extended Cauchy Machines, S. Cuche and Emile Fiesler, in: Proceedings of the International Conference on Neural Information Processing, 1996

ETC\_vérif : un environnement multi-agents de reconnaissance automatique de la parole en continu, Jean-Luc Cochard and Murielle Vial, in: Proceedings of JEP'96: XXIemes Journees d'etude sur la Parole, 1996

Datapump Full-Duplex, Florian Salamin, François Corthay, Olivier Bornet and Jean-Luc Cochard, Idiap-Com-02-1996

Constructive Training Methods for Feedforward Neural Networks with Binary Weights, Eddy Mayoraz and Frédéric Aviolat, in: International Journal of Neural Systems, 7(2), 1996

Connectionist Quantization Functions, Tomas Lundin, Emile Fiesler and Perry Moerland, in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996

Combining methods to improve speaker verification decision, Dominique Genoud, Guillaume Gravier, Frédéric Bimbot and Gérard Chollet, Idiap-RR-02-1996

Combining methods to improve speaker verification decision, Dominique Genoud, Frédéric Bimbot, Guillaume Gravier and Gérard Chollet, in: Proceedings of The Fourth International Conference on Spoken Language Processing, ICSLP, ICSLP, 1996

Bounds on the Degree of High Order Binary Perceptrons, Eddy Mayoraz, in: Proceedings of ESANN'96, D facto, 1996

Annulation d'écho sur une ligne téléphonique, Florian Salamin, François Corthay, Olivier Bornet and Jean-Luc Cochard, Idiap-Com-06-1996

An Implementation of Logical Analysis of Data, Endre Boros, Peter L. Hammer, Toshihide Ibaraki, Alexander Kogan, Eddy Mayoraz and Ilya Muchnik, Idiap-RR-05-1996

Amelioration des performances de verification du locuteur par combinaison de methodes, Dominique Genoud, Guillaume Gravier, Frédéric Bimbot and Gérard Chollet, in: Journees d'etudes sur la parole, JEP, 1996

Active Shape Models for Visual Speech Feature Extraction, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Speechreading by Humans and Machines, Springer Verlag, 1996

A Review of MicroNeuro'96, February 12-14, 1996, Lausanne, Switzerland, Perry Moerland, in: Neurocomputing, 12(04), 1996

A Method for All-Positive Optical Multilayer Perceptrons, Indu Saxena, Emile Fiesler and Perry Moerland, in: Proceedings of the Third IEEE International Conference on Electronics, Circuits, and Systems, University of Patras, Rhodos, Greece, IEEE, 1996

A Boolean Approach to Construct Neural Networks for Non-Boolean Problems, Georg Thimm and Emile Fiesler, in: Proceedings of the 8th IEEE International Conference on Tools with Artificial Intelligence, IEEE, 1996

Zeolite cycle sequences, Georg Thimm and W. E. Klee, in: Zeolites, 19, 1997

Visual Speech and Speaker Recognition, Juergen Luettin, University of Sheffield, 1997

Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition, Stéphane Dupont and Juergen Luettin, Idiap-RR-14-1997

Using Multiple Time Scales in a Multi-Stream Speech Recognition System, Stéphane Dupont and Hervé Bourlard, in: EUROSPEECH'97, 1997

Two neural network construction methods, Georg Thimm and Emile Fiesler, in: Neural Processing Letters, 6(01), 1997

Towards Speaker Independent Continuous Speechreading, Juergen Luettin, in: Proceedings of the European Conference on Speech Communication and Technology, 1997

The 3-regular nets with 4 and 6 vertices per unit cell, M. Bader, W. E. Klee and Georg Thimm, in: Zeitschrift fur Kristallographie, 212, 1997

SWISSCOM ``AVIS'' PROJECT (No. 392) Advanced Vocal Interfaces Services, Johan M. Andersen, Gilles Caloz and Hervé Bourlard, Idiap-Com-06-1997

Subband-Based Speech Recognition, Hervé Bourlard and Stéphane Dupont, in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997

State-of-the-Art and Recent Progress in Hybrid HMM/ANN Speech Recognition, Hervé Bourlard, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997

Speechreading using Probabilistic Models, Juergen Luettin and Neil A. Thacker, in: Computer Vision and Image Understanding, 65(02), 1997

Speaker-Dependent Speech Recognition Based on Phone-Like Unit Model -- Application to Voice Dialing, Vincent Fontaine and Hervé Bourlard, in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997

Speaker Verification in the Telephone Network : Research Activities in the CAVE Project, Frédéric Bimbot, H. P. Hutter, Cédric Jaboulet, Johan Koolwaaij, Johan Lindberg and J. B. Pierrot, in: Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'97), 1997

Speaker Verification by Pairwise Coupling, Michael Schmal, Idiap-Com-07-1997

Some Methods for Training Mixtures of Experts, Perry Moerland, Idiap-Com-05-1997

Robust Speech Recognition based on Multi-Stream Features, Stéphane Dupont, Hervé Bourlard and Christophe Ris, in: Proc. of the ESCA-NATO Workshop on Robust Speech Recognition for Unknown Communication Channels, 1997

Reconnaissance de caractères manuscrits à l'aide de réseaux neuromimétiques, Jean-Luc Beuchat, Idiap-RR-18-1997

Réalisation d'un Majordome vocal, Samuel Vannay, Idiap-Com-04-1997

Quantization and Pruning of Multilayer Perceptrons: Towards Compact Neural Networks, Tomas Lundin and Perry Moerland, Idiap-Com-02-1997

Pruning of Neural Networks, Georg Thimm and Emile Fiesler, Idiap-RR-03-1997

Person Authentication by Fusing Face and Speech Information, Benoît Duc, Gilbert Maître, Stefan Fischer and Josef Bigün, in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997

Optimization of high order perceptrons, Georg Thimm, École Polytechnique Fédérale de Lausanne, 1997

Optimal Setting of Weights, Learning Rate, and Gain, Georg Thimm and Emile Fiesler, Idiap-RR-04-1997

On the Decomposition of Polychotomies into Dichotomies, Eddy Mayoraz and Miguel Moreira, in: Proceedings of The Fourteenth International Conference on Machine Learning, Morgan Kaufmann, 1997

On the Complexity of Recognizing Iterated Differences of Polyhedra, Eddy Mayoraz, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997

Neural Network Adaptations to Hardware Implementations, Perry Moerland and Emile Fiesler, Idiap-RR-17-1997

Neural Network Adaptations to Hardware Implementations, Perry Moerland and Emile Fiesler, in: Handbook of Neural Computation, Institute of Physics Publishing and Oxford University Publishing, 1997

Mixtures of Experts Estimate A Posteriori Probabilities, Perry Moerland, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997

Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, Frédéric Bimbot and Dominique Genoud, in: Eurospeech 97, 1997

Investigation of a possible process identity between DRM and Linear Filtering, Sacha Krstulović, Idiap-RR-19-1997

Integrating Acoustic and Labial Information for Speaker Identification and Verification, Pierre Jourlin, Juergen Luettin, Dominique Genoud and H. Wassner, in: Proceedings of the European Conference on Speech Communication and Technology, 1997

Improved Pairwise Coupling Classification With Correcting Classifiers, Miguel Moreira and Eddy Mayoraz, in: Machine Learning: ECML-98, Springer, 1998

Hybrid HMM/ANN Systems for Training Independent Tasks: Experiments on 'Phonebook' and Related Improvements, Stéphane Dupont, Hervé Bourlard, O. Deroo, Vincent Fontaine and J. -M. Boite, in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997

Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, Hervé Bourlard and Nelson Morgan, in: International School on Neural Nets: Adaptive Processing of Temporal Information, Springer Verlag, 1997

High Order and Multilayer Perceptron Initialization, Georg Thimm and Emile Fiesler, in: IEEE Transactions on Neural Networks, 8(02), 1997

Handwritten Digit Recognition with Binary Optical Perceptron, Indu Saxena, Perry Moerland, Emile Fiesler and A. R. Pourzand, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997

Fusion of audio and video information for multi modal person authentication, Benoît Duc, Elizabeth Saers Bigün, Josef Bigün, Gilbert Maître and Stefan Fischer, in: Pattern Recognition Letters, 18(9), 1997

Fast Object Detection using MLP and FFT, Souheil Ben-Yacoub, Idiap-RR-11-1997

Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems, Jean Hennebert, Christophe Ris, Hervé Bourlard and Steve Renals, in: EUROSPEECH'97, 1997

Ellipsometry, Indu Saxena, in: Optical Metrology, Artech House, 1997

Discrete All-Positive Multilayer Perceptrons for Optical Implementation, Perry Moerland, Emile Fiesler and Indu Saxena, Idiap-RR-02-1997

Decision fusion in a multi-modal identity verification system using a multi-linear classifier, Patrick Verlinde, Gilbert Maître and Eddy Mayoraz, Idiap-RR-06-1997

CRC Comprehensive Dictionary of Electrical Engineering, Emile Fiesler, CRC Press, 1997

Calendar of meetings (several issues), Georg Thimm, in: Neurocomputing, 1997

An Optical Thresholding Perceptron, Indu Saxena, Perry Moerland, Emile Fiesler, A. R. Pourzand and N. Collings, in: Proceedings of the Workshop on Optics and Computer Science, Geneva, Switzerland, 1997

Adapting the 2-Class Recursive Deterministic Perceptron Neural Network to m Classes, M. Tajine, D. Elizondo, Emile Fiesler and Jerzy Korczak, in: Proceedings of the International Conference on Neural Networks, IEEE, IEEE, 1997

Activity Report 1996, Hervé Bourlard, Jean-Luc Cochard, Emile Fiesler, Gilbert Maître and Eddy Mayoraz, Idiap-Com-01-1997

Acoustic-Labial Speaker Verification, Pierre Jourlin, Juergen Luettin, Dominique Genoud and H. Wassner, in: Pattern Recognition Letters, 18(09), 1997

Acoustic-Labial Speaker Verification, Pierre Jourlin, Juergen Luettin, Dominique Genoud and Hubert Wassner, in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997

A Connectionist System for Two-Dimensional Representation of Multivariate Location Data, Emile Fiesler and Michel Maignan, in: Proceedings of the Fifth International Workshop on Artificial Intelligence for High Energy Physics, AIHENP, Lausanne, Switzerland, Elsevier Science, 1997

1997 NIST Evaluation: Text independent speaker detection (verification), Dominique Genoud and Gilles Caloz, Idiap-Com-03-1997

Voice-B System, Gilles Caloz, Cédric Jaboulet, Johnny Mariéthoz, A. Glaeser and Dominique Genoud, in: IEEE 4th Workshop on Intercative Voice Technology for Telecommunications Applications (IVTTA'98) September 29--30, Torino, Italy, 1998

Voice transformation, a tool for imposture of speaker verification, Dominique Genoud and Gérard Chollet, in: Proceedings of International Phonetic Science conference IPS98, Washington, 1998

Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database, Stéphane Dupont and Juergen Luettin, in: Proc. 5th Int. Conf. on Spoken Language Processing, 1998

Text dependent speaker verification using binary classifiers, Dominique Genoud, Miguel Moreira and Eddy Mayoraz, in: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing --- ICASSP'98, IEEE, IEEE, 1998

Support Vector Machine for Multiclass Classification, Eddy Mayoraz and Ethem Alpaydin, Idiap-RR-06-1998

Subband-Based Speech Recognition in Noisy Conditions: The Full Combination Approach, Astrid Hagen, Andrew Morris and Hervé Bourlard, Idiap-RR-15-1998

Speech pre-processing against intentional imposture in speaker recognition, Dominique Genoud and Gérard Chollet, in: Proceedings of ICSLP, Sidney, 1998

Speaker Verification: A Quick Overview, Hervé Bourlard and Nelson Morgan, Idiap-RR-12-1998

Reconnaissance robuste de la parole par segmentation signal/bruit en sous-bandes, Hervé Glotin, Emmanuel Tessier, Hervé Bourlard and Frédéric Berthommier, in: Neurosciences et Sciences de l'Ingenieur'98 - Munster, CNRS, 1998

Reconnaissance multi-bandes de la parole bruitée par couplage entre les niveaux primitifs et d'identification, Hervé Glotin, Emmanuel Tessier, Hervé Bourlard and Frédéric Berthommier, in: Journees Etude Parole - Martigny, 1998

POLYCOST: a telephone-speech database for speaker recognition, Dijana Petrovska-Delacretaz, Jean Hennebert, H. Melin and Dominique Genoud, in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998

Optimal Parameterization of Point Distribution Models, Georg Thimm and Juergen Luettin, Idiap-RR-01-1998

On the Complexity of Recognizing Regions Computable by Two-Layered Perceptrons, Eddy Mayoraz, in: Annals Mathematics and Artificial Intelligence, 1999

Multi-Modal Data Fusion for Person Authentication using SVM, Souheil Ben-Yacoub, in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999

Introduction à la reconnaissance de la parole et du locuteur, Hervé Bourlard, Idiap-RR-13-1998

Interfacing of CASA and partial recognition based on a multistream technique, Frédéric Berthommier, Hervé Glotin, Emmanuel Tessier and Hervé Bourlard, in: ICSLP'98, Sidney, 1998

Interfacing of CASA and Multistream recognition, Hervé Glotin, Frédéric Berthommier, Emmanuel Tessier and Hervé Bourlard, in: TSD'98-Text, Speech and Dialog International Workshop, BRNO-Czech Republic, 1998

Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, Giulia Bernardis and Hervé Bourlard, in: Proceedings of International Conference on Spoken Language Processing (ICSLP'98) Sydney, Australia, 1998

Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, Hervé Bourlard and Nelson Morgan, in: Adaptive Processing of Sequences and Data Structures, Springer Verlag, 1998

Fast Multi-Scale Face Detection, B. Fasel, Idiap-Com-04-1998

Evaluation Protocol for the extended M2VTS Database (XM2VTSDB), Juergen Luettin and Gilbert Maître, Idiap-Com-05-1998

Evaluating the Complexity of Databases for Person Identification and Verification, Georg Thimm, Souheil Ben-Yacoub and Juergen Luettin, Idiap-RR-10-1998

Discrete All-Positive Multilayer Perceptrons for Optical Implementation, Perry Moerland, Emile Fiesler and Indu Saxena, in: Optical Engineering, 37(4), 1998

Decision fusion using a multi-linear classifier, Patrick Verlinde, Gilbert Maître and Eddy Mayoraz, in: 1st International Conference on Multisource-Multisensor Data Fusion, 1998

Continuous Audio-Visual Speech Recognition, Juergen Luettin and Stéphane Dupont, in: Proc. 5th European Conference on Computer Vision, Springer Verlag, 1998

Connectionist Techniques, Hervé Bourlard and Nelson Morgan, in: Survey of the State of the Art in Human Language Technology, Cambridge University Press, 1998

Connectionist speech recognition, Hervé Bourlard, in: Proceedings of IK'98, Interdisziplinares Kolleg, Spring Scholl, Gunne am Mohnessee, Germany, March 7--14, 1998

Confidence Measures in Hybrid HMM/ANN Speech Recognition, Giulia Bernardis and Hervé Bourlard, in: Proceedings of Workshop on Text, Speech and Dialog (TSD'98) Brno, Czech Republic, 1998

Combining Linear Dichomotizers to Construct Nonlinear Polychotomizers, Ethem Alpaydin and Eddy Mayoraz, Idiap-RR-05-1998

Combined 5x2cv $F$-Test for Comparing Supervised Classification Learning Algorithms, Ethem Alpaydin, Idiap-RR-04-1998

Classification using localized mixtures of experts, Perry Moerland, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999

Baseline System for Hybrid Speech Recognition on French (Experiments on BREF), Johan M. Andersen, Idiap-Com-07-1998

Automatic Speech Recognition: an Auditory Perspective, Nelson Morgan, Hervé Bourlard and Hynek Hermansky, in: Speech Processing in the Auditory System, Springer Verlag, New York, 2000

Audio-Visual Person Verification, Souheil Ben-Yacoub, Juergen Luettin, K. Jonsson, J. Matas and J. Kittler, in: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 1999, Fort Collins, USA, 1999

An overview of the cave project research activities in speaker verification, Frédéric Bimbot, H. P. Hutter, Cédric Jaboulet, Johan Koolwaaij, Johan Lindberg and J. B. Pierrot, in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998

Acoustico-articulatory inversion of unequal-length tube models through lattice inverse filtering, Sacha Krstulović, Idiap-RR-16-1998

A comparison of mixture models for density estimation, Perry Moerland, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999

A comparison of a priori threshold setting procedures for speaker verification in the CAVE project, J. B. Pierrot, Johan Lindberg, Johan Koolwaaij, H. P. Hutter, Dominique Genoud, Mats Blomberg and Frédéric Bimbot, in: ICASSP 98, 1998

XM2VTSDB: The Extended M2VTS Database, K. Messer, J. Matas, J. Kittler, Juergen Luettin and Gilbert Maître, in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999

Tracking Articulators in X-ray Movies of the Vocal Tract, Georg Thimm, in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999

Towards introducing long-term statistics in MUSE for robust speech recognition, Christopher Kermorvant and Chafic Mokbel, Idiap-RR-18-1999

Towards introducing long-term statistics in MUSE for robust speech recognition, Christopher Kermorvant and Chafic Mokbel, in: Automatic Speech Recognition and Understanding (ASRU) workshop, 1999

The full combination sub-bands approach to noise robust HMM/ANN based ASR, Andrew Morris, Astrid Hagen and Hervé Bourlard, in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999

The Elisa'99 Speaker Recognition and Tracking Systems, B. Nedic, Guillaume Gravier, Jamal Kharroubi, Gérard Chollet, Dijana Petrovska-Delacretaz, G. Durou, Frédéric Bimbot, Raphaël Blouet, M. Seck, Jean-François Bonastre, Corinne Fredouille, Teva Merlin, I. Magrin-Chagnolleau, S. Pigeon, Patrick Verlinde and Jan Cernocky, in: IEEE Workshop on Automatic Advanced Technologies, 1999

The ELISA Systems for the NIST'99 Evaluation in Speaker Detection and Tracking, B. Nedic, Frédéric Bimbot, Raphaël Blouet, Jean-François Bonastre, Gilles Caloz, Jan Cernocky, Gérard Chollet, G. Durou, Corinne Fredouille, Dominique Genoud, Guillaume Gravier, Jean Hennebert, Jamal Kharroubi, I. Magrin-Chagnolleau, Teva Merlin, Chafic Mokbel, Dijana Petrovska-Delacretaz, S. Pigeon, M. Seck, Patrick Verlinde and M. Zouhal, in: DSP Journal (Special Issue on the Nist Speaker Recognition Workshop), 1999

Synchronous Alignment, Johnny Mariéthoz and Chafic Mokbel, Idiap-RR-06-1999

Speech Reading, Juergen Luettin, in: Modern Interface Technology: The Leading Edge, Research Studies Press Ltd., 1999

Speaker verification experiments on the XM2VTS database, Juergen Luettin, Idiap-RR-02-1999

Segmentation of X-ray Image Sequences Showing the Vocal Tract (with tool documentation), Georg Thimm, Idiap-RR-01-1999

Segmentation of X-ray Image Sequences Showing the Vocal Tract, Georg Thimm, Idiap-RR-01-1999

Robust Person Verification based on Speech and Facial Images, Juergen Luettin and Souheil Ben-Yacoub, in: Proceedings of the European Conference on Speech Communication and Technology, 1999

Reconnaissance et Transformation de Locuteurs, Dominique Genoud, École Polytechnique Fédérale de Lausanne, 1999

Off-Line Cursive Script Recognition Based on Continuous Density HMM, Alessandro Vinciarelli and Juergen Luettin, in: Proceedings of 7th International Workshop on Frontiers in Handwriting Recognition, 2000

Numerical Experiments with Support Vector Machines, Mikhail Kanevski and Nicolas Gilardi, Idiap-RR-15-1999

Non-Stationary Multi-Channel (Multi-Stream) Processing Towards Robust and Adaptive ASR, Hervé Bourlard, in: Proc. of the ESCA Workshop on Robust Methods for Speech Recognition in Adverse Conditions, 1999

Multi-stream adaptive evidence combination for noise robust ASR, Andrew Morris, Astrid Hagen, Hervé Glotin and Hervé Bourlard, Idiap-RR-26-1999

Multi Modal Verification for Teleservices and Security Applications, G. Richard, Y. Menguy, I. Guis, N. Suaudeau, J. Boudy, P. Lockwood, C. Fernández, F. Fernàndez, D. Garcia-Plaza, C. Kotropoulos, A. Tefas, I. Pitas, R. Heimgartner, P. Ryser, C. Beumier, Patrick Verlinde, S. Pigeon, G. Matas, J. Kittler, Josef Bigün, Yousri Abdeljaoued, E. Meurville, Laurent Besacier, M. Ansorge, Gilbert Maître, Juergen Luettin, Souheil Ben-Yacoub, B. Ruiz, J. Cortés and K. Aldama, in: IEEE International Conference on Multimedia Computing and Systems, 1999

LPC-based inversion of the DRM articulatory model, Sacha Krstulović, in: Proc. Eurospeech'99, 1999

Latent variable decomposition for posteriors or likelihood based subband ASR, Andrew Morris, Idiap-Com-04-1999

Latent Semantic Indexing by Self-Organizing Map, Mikko Kurimo and Chafic Mokbel, in: ESCA ETRW workshop on Accessing Information in Spoken Audio, 1999

Iterative Posterior-Based Keyword Spotting Without Filler Models: Iterative Viterbi Decoding and One-Pass Approach, Marius-Calin Silaghi and Hervé Bourlard, Idiap-RR-27-1999

Iterative Posterior-Based Keyword Spotting Without Filler Models, Marius-Calin Silaghi and Hervé Bourlard, in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU'99) Workshop, 1999

Iterative Posterior-Based Keyword Spotting Without Filler Models, Marius-Calin Silaghi and Hervé Bourlard, Idiap-RR-16-1999

INtegrating SPEech acoustic and linguistic Constraints: Baseline System Development, Giulia Bernardis, Hervé Bourlard, Martin Rajman and Jean-Cédric Chappelier, Idiap-RR-21-1999

Indexing Audio Documents by using Latent Semantic Analysis and SOM, Mikko Kurimo, in: Kohonen Maps, Elsevier, 1999

Incremental Enrollment of Speech Recognizers, Chafic Mokbel and Olivier Collin, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'99,',','), Phoenix, Arizona, USA, 1999

Illumination-robust Pattern Matching Using Distorted Color Histograms, Georg Thimm and Juergen Luettin, in: Pattern Recognition and Image Understanding, Infix, 1999

Fusion of Face and Speech Data for Person Identity Verification, Souheil Ben-Yacoub, Yousri Abdeljaoued and Eddy Mayoraz, in: IEEE Transactions on Neural Networks, 10(05), 1999

Fast Face Detection using MLP and FFT, Souheil Ben-Yacoub, B. Fasel and Juergen Luettin, in: Proc. Second International Conference on Audio and Video-based Biometric Person Authentication (AVBPA'99), 1999

Extraction of Articulators in X-Ray Image Sequences, Georg Thimm and Juergen Luettin, in: Proceedings of the European Conference on Speech Communication and Technology, 1999

Experimental evaluation of text-dependent speaker verification on laboratory and field test databases in the M2VTS project, Laurent Besacier, Juergen Luettin, Gilbert Maître and E. Meurville, in: Proceedings of the European Conference on Speech Communication and Technology, 1999

Evaluating the Complexity of Databases for Person Identification and Verification, Georg Thimm, Souheil Ben-Yacoub and Juergen Luettin, in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999

Environmental spatial data classification with Support Vector Machines, Mikhail Kanevski, Nicolas Gilardi, Eddy Mayoraz and Michel Maignan, Idiap-RR-07-1999

Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, Nicolas Gilardi, Mikhail Kanevski, Michel Maignan and Eddy Mayoraz, in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999

DynaBoost: Combining Boosted Hypotheses in a Dynamic Way, Perry Moerland and Eddy Mayoraz, Idiap-RR-09-1999

Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR, Astrid Hagen, Andrew Morris and Hervé Bourlard, in: Robust Methods for Speech Recognition in Adverse Conditions, 1999

Deliberate Imposture: a challenge for automatic speaker verification systems, Dominique Genoud and Gérard Chollet, in: Proceedings of the European Conference on Speech Communication and Technology, 1999

Decision-Oriented Environmental Mapping with Radial Basis Function Neural Networks, V. Demyanov, Nicolas Gilardi, Mikhail Kanevski, Michel Maignan and V. Polishchuk, in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999

Data binarization by discriminant elimination, Miguel Moreira, Alain Hertz and Eddy Mayoraz, in: Proceedings of the ICML-99 Workshop: From Machine Learning to Knowledge Discovery in Databases, 1999

Combining Wavelet-domain Hidden Markov Trees with Hidden Markov Models, Katrin Keller, Souheil Ben-Yacoub and Chafic Mokbel, Idiap-RR-14-1999

Combinatorial Approach for Data Binarization, Eddy Mayoraz and Miguel Moreira, in: Principles of Data Mining and Knowledge Discovery: third european conference; proceedings / PKDD'99, Springer, 1999

CLIENT / WORLD MODEL SYNCHRONOUS ALIGNEMENT FOR SPEAKER VERIFICATION, Johnny Mariéthoz, Dominique Genoud, Frédéric Bimbot and Chafic Mokbel, in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999

Blind separation of delayed and superimposed acoustic sources : learning algorithms an experimental study, Seunjin Choi, Youngki Lyu, Frédéric Berthommier, Hervé Glotin and Andrzej Cichocki, in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999

An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, Frédéric Bimbot, Mats Blomberg, Louis Boves, Gérard Chollet, Cédric Jaboulet, Bruno Jacob, Jamal Kharroubi, Johan Koolwaaij, Johan Lindberg, Johnny Mariéthoz, Chafic Mokbel and Houda Mokbel, in: 6th european conference on speech communication and technology --- eurospeech'99, 1999

A new SNR-feature mapping for robust multistream speech recognition, Frédéric Berthommier and Hervé Glotin, in: Proc. Int. Congress on Phonetic Sciences (ICPhS), 1999

A measure of speech and pitch reliability from voicing, Frédéric Berthommier and Hervé Glotin, in: Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI), Scandinavian AI Society, 1999

A comparison of two strategies for ASR in additive noise : Missing Data and Spectral Subtraction, Christopher Kermorvant and Andrew Morris, in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999

A comparison of two strategies for ASR in additive noise : Missing Data and Spectral Subtraction, Christopher Kermorvant and Andrew Morris, Idiap-RR-17-1999

A comparison of noise reduction techniques for robust speech recognition, Christopher Kermorvant, Idiap-RR-10-1999

A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition, Hervé Glotin, Frédéric Berthommier and Emmanuel Tessier, in: Proc.\ European Conf.\ on Speech Communication and Technology (EUROSPEECH), 1999

A CASA front-end using the localisation cue for segregation and then cocktail-party speech recognition, Emmanuel Tessier, Frédéric Berthommier, Hervé Glotin and Seunjin Choi, in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999

Weighting schemes for audio-visual fusion in speech recognition, Hervé Glotin, D. Vergyri, C. Neti, G. Potamianos and Juergen Luettin, Idiap-RR-44-2000

Video sequence matching via decision tree path following, Kim Shearer, Svetha Venkatesh and Horst Bunke, Idiap-RR-12-2000

Video Indexing and Similarity Retrieval by Largest Common Subgraph Detection using Decision Trees, Kim Shearer, Horst Bunke and Svetha Venkatesh, in: Pattern Recognition, 34(05), 2000

Various adaptive weighting schemes for large vocabulary robust audio-visual ASR, with particular reference to the cocktail party effect, Hervé Glotin, Idiap-Com-04-2000

Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, Astrid Hagen and Hervé Bourlard, in: ICSLP, 2000

Traitement de la Parole, R. Boite, Hervé Bourlard, T. Dutoit, J. Hancq and H. Leich, Presses Polytechniques Universitaires Romandes, 2000

Thematic Indexing of Spoken Documents by Using Self-Organizing Maps, Mikko Kurimo, Idiap-RR-05-2000

The use of Boolean concepts in general classification contexts, Miguel Moreira, Idiap-RR-46-2000

The use of Boolean concepts in general classification contexts, Miguel Moreira, Ecole Polytechnique Federale de Lausanne, 2000

Test of several external posterior weighting functions for multiband Full Combination ASR, Hervé Glotin and Frédéric Berthommier, in: Int. Conf. on Spoken Language Processing (ICSLP), 2000

Taking on the Curse of Dimensionality in Joint Distributions Using Neural Networks, Samy Bengio and Yoshua Bengio, in: IEEE Transaction on Neural Networks special issue on data mining and knowledge discovery, 2000

Support Vector Machines, Théorie et Application, Ronan Collobert, Idiap-Com-03-2000

Support Vector Machines for Large-Scale Regression Problems, Ronan Collobert and Samy Bengio, Idiap-RR-17-2000

Spatial Data Mapping with Support Vector Regression, Mikhail Kanevski and Stéphane Canu, Idiap-RR-09-2000

Some applications of a priori knowledge in multi-stream HMM and HMM/ANN based ASR, Andrew Morris, in: Phonus No.5,Dec.2000, ISSN 0949-1791, Proc. Workshop on Phonetics and Phonology in ASR, 2000

Robust multi-stream speech recognition based on the combined reliabilities of the speech signal and phonemes estimates, Hervé Glotin, Idiap-RR-36-2000

Relating LPC modeling to a factor-based articulatory model, Sacha Krstulović, in: Proc. ICSLP 2000, 2000

Reconnaissance de la parole dans le bruit après renforcement fondé sur l'harmonicité, Frédéric Berthommier and Hervé Glotin, in: Proceedings of JEP'2000, no IDIAP RR, see RESPITE www, 2000

Recognition of Asymmetric Facial Action Unit Activities and Intensities, B. Fasel and Juergen Luettin, in: Proceedings of the International Conference on Pattern Recognition (ICPR 2000), 2000

Recent Developments in Speaker Verification at IDIAP, B. Nedic and Hervé Bourlard, Idiap-RR-26-2000

Personal Voice Dialing over PC, Frédéric Bressoud and Haiyan Wang, Idiap-Com-05-2000

On the Convergence of SVMTorch, an Algorithm for Large-Scale Regression Problems, Ronan Collobert and Samy Bengio, Idiap-RR-24-2000

Neural Networks in Automatic Speech Recognition, F. Beaufays, Hervé Bourlard, H. Franco and Nelson Morgan, in: to be published in The Handbook of Brain Theory and Neural Networks, Bradford Books, The MIT Press, 2000

Neural Network Residual Stochastic Co-simulation for Environmental Data Analysis, V. Demyanov, Mikhail Kanevski, E. Savelieva, V. Timonin and S. Chernov, in: Neural Computation 2000, 2000

Multiple Timescale Feature Combination towards Robust Speech Recognition, Katrin Weber, in: KONVENS 2000 / Sprachkommunikation, 2000

Multiple Hypotheses Video OCR, Datong Chen and Juergen Luettin, in: Proceedings of the 4th International Workshop on Document Analysis System, 2000

Multichannel signal separation for cocktail party speech recognition: a dynamic recurrent network, Seunjin Choi, H. Hong, Hervé Glotin and Frédéric Berthommier, in: Int. Conf. on Spoken Language Processing (ICSLP), no IDIAP RR, see RESPITE www, 2000

Mixtures of latent variable models for density estimation and classification, Perry Moerland, Idiap-RR-25-2000

Mixture Models for Unsupervised and Supervised Learning, Perry Moerland, Idiap-RR-18-2000

Mixture Models for Unsupervised and Supervised Learning, Perry Moerland, École Polytechnique Fédérale de Lausanne, Computer Science Department, 2000

LPC modeling with speech production constraints, Sacha Krstulović, in: Proc. 5th Speech Production Seminar, 2000

Local Machine Learning Models for Spatial Data Analysis, Nicolas Gilardi and Samy Bengio, in: Journal of Geographic Information and Decision Analysis, 4(01), 2000

Language modeling based on neural clustering of words, Vesa Siivola, Idiap-Com-02-2000

Iterative Posterior-Based Keyword Spotting Without Filler Models, Marius-Calin Silaghi and Hervé Bourlard, in: Proceedings of the IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 2000

Inverse lattice filtering of speech with adapted non-uniform delays, Sacha Krstulović and Frédéric Bimbot, in: Proc. ICSLP 2000, 2000

Indoor Radon Risk Assessment with Geostatistics and Artificial Neural Networks, V. Demyanov, Mikhail Kanevski, Michel Maignan, E. Savelieva, V. Timonin, S. Chernov and G. Piller, in: Geostatistical congress 2000, 2000

Indexing spoken audio by LSA and SOMs, Mikko Kurimo, in: Proceedings of the European Signal Processing Conference EUSIPCO'2000, 2000

Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, Kim Shearer, Chitra Dorai and Svetha Venkatesh, in: Proceedings of the Sixth ACM International Conference on Knowledge Discovery and Data Mining, ACM, Boston, MA, USA, 2000

HMM2- A Novel Approach to HMM Emission Probability Estimation, Katrin Weber, Samy Bengio and Hervé Bourlard, in: International Conference on Spoken Langugae Processing (ICSLP 2000), 2000

Handwritten Digits Recognition, Eric Grand, Idiap-RR-07-2000

From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, Astrid Hagen, Andrew Morris and Hervé Bourlard, in: ISCA ITRW ASR2000, 2000

Fast latent semantic indexing of spoken documents by using self-organizing maps, Mikko Kurimo, in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'2000, 2000

Etudes comparatives des robustesses au bruit de l'approche 'Full Combination' et de son approximation, Astrid Hagen and Hervé Glotin, in: Journee d'Etudes sur la Parole, Aussois, 2000

Environmental Data Mapping with Support Vector Regression and Geostatistics, Mikhail Kanevski, Patrick Wong and Stéphane Canu, Idiap-RR-10-2000

Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, Nicolas Gilardi, Mikhail Kanevski, Michel Maignan and Eddy Mayoraz, in: Geostatistical congress 2000, 2000

Cursive Character Recognition by Learning Vector Quantization, Francesco Camastra and Alessandro Vinciarelli, in: Pattern Recognition Letters, 22(6), 2001

Comparison of Unsupervised and Supervised Training of RBF Neural Networks. Case Study: Mapping of Contamination Data, V. Polishchuk and Mikhail Kanevski, in: Neural Computation 2000, 2000

Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, Astrid Hagen and Andrew Morris, in: ICSLP, 2000

Combining multiple tracking algorithms for improved general performance, Kim Shearer, Kirrily D Wong and Svetha Venkatesh, in: Pattern Recognition, 34(06), 2000

Blind acoustic source separation for cocktail party speech recognition, H. Hong, Seunjin Choi, Hervé Glotin and Frédéric Berthommier, in: ICONIP, 7th IEEE Int. Conf. on Neural Information Processing, 2000

Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, Corinne Fredouille, Johnny Mariéthoz, Cédric Jaboulet, Jean Hennebert, Chafic Mokbel and Frédéric Bimbot, in: ICASSP2000 - IEEE International Conference on Acoustics, Speech, and Signal Processing, 2000

Automatic Speech Recognition using Pitch Information in Dynamic Bayesian Networks, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-41-2000

Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, Todd Andrew Stephenson, Hervé Bourlard, Samy Bengio and Andrew Morris, in: 6th International Conference on Spoken Language Processing: ICSLP~2000 (Interspeech~2000), 2000

Auto-Association by Multilayer Perceptrons and Singular Value Decomposition, Hervé Bourlard, Idiap-RR-16-2000

Audio-Visual Speech Modelling for Continuous Speech Recognition, Stéphane Dupont and Juergen Luettin, in: IEEE Transactions on Multimedia, 2000

Audio visual speech recognition, C. Neti, G. Potamianos, Juergen Luettin, I. Matthews, Hervé Glotin, D. Vergyri, J. Sison and A. Mashari, Johns Hopkins University-CLSP, 2000

ASYMMETRIC FILTER FOR TEXT RECOGNITION IN VIDEO, Datong Chen and Kim Shearer, Idiap-RR-37-2000

Approches génératives pour le traitement de séquences d'images: application à la reconnaissance dynamique des gestes de la main, Sébastien Marcel, Idiap-RR-45-2000

An Introduction to Bayesian Network Theory and Usage, Todd Andrew Stephenson, Idiap-RR-03-2000

An EM Algorithm for HMMs with Emission Distributions Represented by HMMs, Samy Bengio, Hervé Bourlard and Katrin Weber, Idiap-RR-11-2000

Advanced Spatial Data Analysis and Modelling with Support Vector Machines, Mikhail Kanevski, Alexei Pozdnoukhov, Stéphane Canu and Michel Maignan, Idiap-RR-31-2000

Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, Johnny Mariéthoz and Frédéric Bimbot, in: Journee d'Etudes sur la Parole, Aussois, 2000

Activity Report 1999, IDIAP, Idiap-Com-01-2000

A survey on Off-Line Cursive Word Recognition, Alessandro Vinciarelli, in: Pattern Recognition, 35(07), 2002

A Survey of Text Detection and Recognition in Images and Videos, Datong Chen and Juergen Luettin, Idiap-RR-38-2000

A new normalization technique for cursive handwritten words, Alessandro Vinciarelli and Juergen Luettin, in: Pattern Recognition Letters, 22(09), 2001

A neural network for classification with incomplete data: application to robust ASR, Andrew Morris, Ljubomir Josifovski, Hervé Bourlard, Martin Cooke and Phil Green, in: Proc. ICSLP, 2000

A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, Johnny Mariéthoz, Johan Lindberg and Frédéric Bimbot, in: ICSLP, 2000

A front-end using the harmonicity cue for speech enhancement in loud noise, Frédéric Berthommier, Hervé Glotin and Emmanuel Tessier, in: Int. Conf. on Spoken Language Processing (ICSLP), 2000

Video OCR for Sport Video Annotation and Retrieval, Datong Chen, Kim Shearer and Hervé Bourlard, in: Proceedings of the 8th IEEE International Conference on Mechatronics and Machine Vision in Practice, 2001

Using posterior probabilities for speech/music discrimination, Maja Popović, Idiap-RR-08-2001

User Customized HMM/ANN Based Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-32-2001

Text Identification in Complex Background using SVM, Datong Chen, Hervé Bourlard and Jean-Philippe Thiran, in: Proceedings of the Int. Conf. on computer vision and pattern recognition, 2001

Text Enhancement with Asymmetric Filter for Video OCR, Datong Chen, Kim Shearer and Hervé Bourlard, in: Proceedings of the 11th International Conference on Image Analysis and Processing, 2001

SVMTorch: Support Vector Machines for Large-Scale Regression Problems, Ronan Collobert and Samy Bengio, in: Journal of Machine Learning Research, 1, 2001

Support Vector Machines for Classification and Mapping of Reservoir Data, Mikhail Kanevski, Alexei Pozdnoukhov, Stéphane Canu, Michel Maignan, Patrick Wong and S. Shibli, Idiap-RR-04-2001

Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, in: Speech Communication, 40, 2003

Speech Recognition Using Advanced HMM2 Features, Katrin Weber, Samy Bengio and Hervé Bourlard, in: Automatic Speech Recognition and Understanding Workshop, 2001

Speech Recognition Engine for Interactive Voice Response application on Windows, Haiyan Wang, Idiap-Com-10-2001

Speaker Verification Based On User-Customized Password, Mohamed Faouzi BenZeghiba, Hervé Bourlard and Johnny Mariéthoz, Idiap-RR-13-2001

Signal modeling with Non Uniform Topology lattice filters, Sacha Krstulović and Frédéric Bimbot, in: Proc. ICASSP 2001, 2001

Robust speech recognition based on multi-stream processing, Astrid Hagen, École Polytechnique Fédérale de Lausanne, 2001

Robust Speech Recognition and Feature Extraction Using HMM2, Katrin Weber, Shajith Ikbal, Samy Bengio and Hervé Bourlard, in: Computer Speech & Language, 17(2-3), 2003

Rebuilding Speech Recognition on Windows, Haiyan Wang, Idiap-Com-09-2001

Pronunciation models and their evaluation using confidence measures, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-29-2001

PhD Thesis: Speech Analysis with Production Constraints, Sacha Krstulović, École Polytechnique Fédérale de Lausanne, 2001

Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, Alessandro Vinciarelli and Samy Bengio, in: Proceedings of International Conference on Pattern Recognition, 2002

New Approaches Towards Robust and Adaptive Speech Recognition, Hervé Bourlard, Samy Bengio and Katrin Weber, in: Advances in Neural Information Processing Systems 13, MIT Press, 2001

Multi-stream adaptive evidence combination for noise robust ASR, Andrew Morris, Astrid Hagen, Hervé Glotin and Hervé Bourlard, in: Speech Communication, 2001

Modeling Auxiliary Information in Bayesian Network Based ASR, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, in: 7th European Conference on Speech Communication and Technology (Eurospeech~2001), 2001

Microphone Array Post-filter for Diffuse Noise Field, Iain A. McCowan and Hervé Bourlard, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2002

Microphone Array Post-filter based on Noise Field Coherence, Iain A. McCowan and Hervé Bourlard, in: IEEE Transactions on Speech and Audio Processing, 11(6), 2003

MAP Combination of Multi-Stream HMM or HMM/ANN Experts, Andrew Morris, Astrid Hagen and Hervé Bourlard, in: Proc. Eurospeech, 2001

Learning the Decision Function for Speaker Verification, Samy Bengio and Johnny Mariéthoz, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2001

Intrinsic dimension estimation of data: an approach based on Grassberger-Procaccia's algorithm, Francesco Camastra and Alessandro Vinciarelli, in: Neural Processing Letters, 14(01), 2001

Improving Face Verification using Skin Color Information, Sébastien Marcel and Samy Bengio, in: Proceedings of the 16th International Conference on Pattern Recognition, IEEE Computer Society Press, 2002

IDIAP HMM/HMM2 System: Theoretical Basis and Software Specifications, Shajith Ikbal, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-27-2001

HMM2- Extraction of Formant Features and their Use for Robust ASR, Katrin Weber, Samy Bengio and Hervé Bourlard, in: European Conference on Speech Communication and Technology (Eurospeech 2001), 2001

From missing data to maybe useful data: soft data modelling for noise robust ASR, Andrew Morris, Jon Barker and Hervé Bourlard, in: Proc. WISP, 2001

Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, B. Fasel, in: International IEEE Workshop on Neural Networks for Signal Processing (NNSP 02), 2002

Evaluation of SVM Binary Classification with Nonparametric Stochastic Simulations, Mikhail Kanevski, Idiap-RR-07-2001

Evaluation of Biometric Technology on XM2VTS, Samy Bengio, Johnny Mariéthoz and Sébastien Marcel, Idiap-RR-21-2001

Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, Astrid Hagen and Hervé Bourlard, in: EUROSPEECH, 2001

EPFL lab session 2/2: Introduction to Hidden Markov Models, Sacha Krstulović, Idiap-Com-07-2001

EPFL lab session 1/2: Introduction to Gaussian statistics and pattern recognition, Sacha Krstulović, Idiap-Com-06-2001

EEG pattern recognition through multi-stream evidence combination, Andrew Morris, Bernhard Obermaier and Gert Pfurtscheller, in: Proc. World Congress on Neuroinformatics, 2001

Development of a DTW based Speech Recognition System over the telephone line, Frank Formaz, Manish Goyal and Olivier Bornet, Idiap-Com-05-2001

Developement d'un systeme de demande interactif via le telephone (INFOVOX), Thierry Collado, Idiap-Com-08-2001

Detection of Narrative Structure for Annotation of News Broadcasts, Kim Shearer, Chitra Dorai and Svetha Venkatesh, Idiap-RR-03-2001

Data utility modelling for mismatch reduction, Andrew Morris, in: Proc. CRAC (workshop on Consistent & Reliable Acoustic Cues for sound analysis), 2001

Confidence Evaluation for Risk Prediction, Nicolas Gilardi, Tom Melluish and Michel Maignan, in: 2001 Annual Conference of the IAMG, 2001

Comparison of Client Model Adaptation Schemes, Samy Bengio and Johnny Mariéthoz, Idiap-RR-25-2001

Combining Neural Gas and Learning Vector Quantization for Cursive Character Recognition, Francesco Camastra and Alessandro Vinciarelli, in: Neurocomputing, 51, 2003

Artifacts of the colour coherence vector and an alternative similarity measure, Kim Shearer and Svetha Venkatesh, Idiap-RR-02-2001

Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model, S. Moeller and Hervé Bourlard, in: Speech Communication, 2002

Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, Astrid Hagen, Hervé Bourlard and Andrew Morris, in: ICASSP, 2001

Activity Report 2000, IDIAP, Idiap-Com-01-2001

A Pragmatic View of the Application of HMM2 for ASR, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-23-2001

A Parallel Mixture of SVMs for Very Large Scale Problems, Ronan Collobert, Samy Bengio and Yoshua Bengio, in: Advances in Neural Information Processing Systems, NIPS 14, MIT Press, 2002

A Comparative Study of Adaptation Methods for Speaker Verification, Johnny Mariéthoz and Samy Bengio, in: International Conference on Spoken Language Processing ICSLP, 2002

Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, Alessandro Vinciarelli and Samy Bengio, in: Pattern Recognition Letters, 23(8), 2002

Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, Alessandro Vinciarelli and Samy Bengio, in: Proceedings of 8$^{th}$ International Conference on Frontiers on Handwriting Recognition, 2002

What is Better: GMM of Two Gaussians or Two Clusters With One Gaussian?, I. Lapidot, Idiap-RR-56-2002

Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, Jean-Marc Odobez and Datong Chen, in: Int. Conf. Image Processing 2002, 2002

User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: International Conference on Spoken Language Processing (ICSLP~2002), 2002

User-Customized Password HMM Based Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: Proceedings of the COST275 Workshop on the Advent of Biometrics on the Internet, 2002

Unknown-Multiple Speaker clustering using HMM, Jitendra Ajmera, Hervé Bourlard, I. Lapidot and Iain A. McCowan, in: ICSLP, 2002

Transforming the feature vectors to improve HMM based cursive word recognition systems, Alessandro Vinciarelli and Samy Bengio, Idiap-RR-32-2002

Towards Robust and Adaptive Speech Recognition Models, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-47-2002

Towards Robust and Adaptive Speech Recognition Models, Hervé Bourlard, Samy Bengio and Katrin Weber, in: Mathematical Foundations of Speech Processing and Recognition, Springer-Verlag, 2002

Torch: a modular machine learning software library, Ronan Collobert, Samy Bengio and Johnny Mariéthoz, Idiap-RR-46-2002

TODE: A Decoder for Continuous Speech Recognition, Darren Moore, Idiap-Com-09-2002

The VidTIMIT Database, Conrad Sanderson, Idiap-Com-06-2002

The MNIST Database of Handwritten upper-case letters, Haiyan Wang and Samy Bengio, Idiap-Com-04-2002

The IDIAP Smart Meeting Room, Darren Moore, Idiap-Com-07-2002

The BANCA Database and Experimental Protocol for Speaker Verification, F. Porée, Johnny Mariéthoz, Samy Bengio and Frédéric Bimbot, Idiap-RR-13-2002

The analysis of kernel ridge regression learning algorithm., Alexei Pozdnoukhov, Idiap-RR-54-2002

Text Segmentation and Recognition in Complex Background Based on Markov Random Field, Datong Chen, Jean-Marc Odobez and Hervé Bourlard, in: Int. Conf. Pattern Recognition 2002, 2002

Text Detection and Recognition in Images and Videos, Datong Chen, Jean-Marc Odobez and Hervé Bourlard, Idiap-RR-61-2002

Structurally noise resistant classifier for multi-modal person verification, Conrad Sanderson and Kuldip K. Paliwal, in: Pattern Recognition Letters, 24(16), 2003

Speech Processing & Text-Independent Automatic Person Verification, Conrad Sanderson, Idiap-Com-08-2002

Speaker Normalization using HMM2, Shajith Ikbal, Katrin Weber and Hervé Bourlard, in: Proceedings of the 2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP-02), 2002

SOM-Based Clustering for On-Line Fraud Behavior Classification: a Case Study, V. Lemaire and F. Clérot, Idiap-RR-30-2002

Self-Organizing-Maps With BIC For Speaker Clustering, I. Lapidot, Idiap-RR-60-2002

Scaling Large Learning Problems with Hard Parallel Mixtures, Ronan Collobert, Yoshua Bengio and Samy Bengio, in: International Workshop on Pattern Recognition with Support Vector Machines, SVM'2002, 2002

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, Iain A. McCowan, Andrew Morris and Hervé Bourlard, in: Proceedings of International Conference on Speech and Language Processing (ICSLP), 2002

Robust Speaker Change Detection, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, in: IEEE Signal Processing Letters (to appear), 2003

Robust HMM-Based Speech/Music Segmentation, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, in: ICASSP, 2002

Robust Face Verification using Skin Color and Neural Networks, Sébastien Marcel, Idiap-RR-49-2002

Robust Face Analysis using Convolutional Neural Networks, B. Fasel, in: Proceedings of the International Conference on Pattern Recognition (ICPR 02), 2002

Robot Navigation, José del R. Millán, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002

Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR, Astrid Hagen and Andrew Morris, Idiap-RR-57-2002

Proceedings of the Twelfth IEEE Workshop on Neural Networks for Signal Processing (NNSP), IEEE Press, 2002

Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, Daniel Gatica-Perez, Ming-Ting Sun and Alexander Loui, in: IEEE International Conference on Image Processing, 2002

Phase AutoCorrelation (PAC) derived Robust Speech Features, Shajith Ikbal, Hemant Misra and Hervé Bourlard, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Online Policy Adaptation for Ensemble Algorithms, Christos Dimitrakakis and Samy Bengio, Idiap-RR-28-2002

Object Localization in Metric Spaces for Video Linking, Daniel Gatica-Perez and Ming-Ting Sun, in: IEEE Workshop on Motion and Video Computing, 2002

Noise Resistant Audio-Visual Verification via Structural Constraints, Conrad Sanderson and Kuldip K. Paliwal, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Noise PDF transformation in secondary feature processing, Andrew Morris, Idiap-RR-29-2002

New Entropy Based Combination Rules in HMM/ANN Multi-stream ASR, Hemant Misra, Hervé Bourlard and Vivek Tyagi, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003

Mutliscale Facial Expression Recognition using Convolutional Neural Networks, B. Fasel, in: Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 02), 2002

Modelling auxiliary information (pitch frequency) in hybrid HMM/ANN based ASR systems, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, Idiap-RR-62-2002

Modeling Human Interaction in Meetings, Iain A. McCowan, Samy Bengio, Daniel Gatica-Perez, Guillaume Lathoud, Florent Monay, Darren Moore, Pierre Wellner and Hervé Bourlard, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2003

Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, in: International Conference on Pattern Recognition (ICPR~2002), 2002

Low cost duration modelling for noise robust speech recognition, Andrew Morris, Simon Payne and Hervé Bourlard, in: Proc. ICSLP, 2002

Linking Objects in Videos by Importance Sampling, Daniel Gatica-Perez and Ming-Ting Sun, in: IEEE International Conference on Multimedia and Expo, 2002

Increasing Speech Recognition Noise Robustness with HMM2, Katrin Weber, Samy Bengio and Hervé Bourlard, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 02), 2002

Improved Unknown-Multiple Speaker clustering using HMM, Jitendra Ajmera, Hervé Bourlard and I. Lapidot, Idiap-RR-23-2002

Hybrid generative-discriminative models for speech and speaker recognition, Quan Le and Samy Bengio, Idiap-RR-06-2002

Hidden Markov Models and other Finite State Automata for Sequence Processing, Hervé Bourlard and Samy Bengio, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002

Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, B. Fasel, in: International IEEE Conference on Multimodal Interfaces (ICMI 02), 2002

Handwriting Recognition Demo, Haiyan Wang, Alessandro Vinciarelli and Frank Formaz, Idiap-Com-02-2002

Gestures for Multi-Modal Interfaces: A Review, Sébastien Marcel, Idiap-RR-34-2002

Face Verification using MLP and SVM, Fabien Cardinaux and Sébastien Marcel, in: XI Journees NeuroSciences et Sciences pour l'Ingenieur (NSI 2002), 2002

Extended BIC Criterion for Model Selection, I. Lapidot and Andrew Morris, Idiap-RR-42-2002

Evolution of the Mental States Operating a Brain-Computer Interface, J. Mouriño, Silvia Chiappa, R. Jané and José del R. Millán, in: Proceedings of the International Federation for Medical and Biological Engineering, 2002

Evaluation Protocols and Comparative Results for the Triesch Hand Posture Database, Sébastien Marcel, Idiap-RR-50-2002

Evaluation of Formant-Like Features for ASR, Katrin Weber, F. de Wet, B. Cranen, Louis Boves, Samy Bengio and Hervé Bourlard, in: International Conference on Spoken Language Processing (ICSLP 2002), 2002

Estimation of Conditional Distributions using Gaussian Mixture Models, Nicolas Gilardi, Samy Bengio and Mikhail Kanevski, Idiap-RR-03-2002

Estimating the Intrinsic Dimension of Data with a Fractal-Based Method, Francesco Camastra and Alessandro Vinciarelli, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(10), 2002

Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, Todd Andrew Stephenson, Jaume Escofet, Mathew Magimai-Doss and Hervé Bourlard, in: 2002 IEEE International Workshop on Neural Networks for for Signal Processing (NNSP~2002), 2002

Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering, I. Lapidot and H. Guterman, in: to be published in IEEE Signal Processing Letters, 2003

Confidence Measures for Multimodal Identity Verification, Samy Bengio, Christine Marcel, Sébastien Marcel and Johnny Mariéthoz, in: Information Fusion, 3(04), 2002

Conditional Gaussian Mixture Models for Environmental Risk Mapping, Nicolas Gilardi, Samy Bengio and Mikhail Kanevski, in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002

Comparison of Support Vector Machine and Neural Network for Text Texture Verification, Datong Chen and Jean-Marc Odobez, Idiap-RR-19-2002

Brain-Computer Interfaces, José del R. Millán, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002

Bagging Using the VMSE Cost Function, V. Lemaire, Idiap-RR-27-2002

Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, in: Seventh International Conference on Spoken Language Processing (ICSLP~2002), 2002

An information theoretic measure of sequence recognition performance, Andrew Morris, Idiap-Com-03-2002

An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, Samy Bengio, in: Advances in Neural Information Processing Systems, NIPS 15, MIT Press, 2003

Algorithms for Video Structuring, Maël Guillemot, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-Com-05-2002

Activity Report 2001, IDIAP, Idiap-Com-01-2002

A State-of-the-art Neural Network for Robust Face Verification, Sébastien Marcel, Christine Marcel and Samy Bengio, in: Proceedings of the COST275 Workshop on The Advent of Biometrics on the Internet, 2002

A Parallel Mixture of SVMs for Very Large Scale Problems, Ronan Collobert, Samy Bengio and Yoshua Bengio, in: Neural Computation, 14(05), 2002

A New Method of Contrast Normalization for Verification of Extracted Video Text Having Complex Backgrounds, Datong Chen and Jean-Marc Odobez, Idiap-RR-16-2002

A Multi-sample Multi-source Model for Biometric Authentication, Norman Poh, Samy Bengio and Jerzy Korczak, in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002

Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, Norman Poh and Samy Bengio, in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004

Video Shot Clustering using Spectral Methods, Jean-Marc Odobez, Daniel Gatica-Perez and Maël Guillemot, in: 3rd Workshop on Content-Based Multimedia Indexing (CBMI), 2003

Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, Pedro Quelhas and James Boyce, in: Pattern Recognition and Image Analysis: First Iberian Conference, IbPRIA 2003, Springer-Verlag LNCS, 2003

Variance Reduction Techniques in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-17-2003

Using pitch frequency information in speech recognition, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, in: Proceedings of Eurospeech, 2003

TRAP-TANDEM: Data-driven extraction of temporal features from speech, Hynek Hermansky, in: large part published in Proceedings of ASRU-2003, 2003

Towards Computer Understanding of Human Interactions, Iain A. McCowan, Daniel Gatica-Perez, Samy Bengio and Hervé Bourlard, Idiap-RR-45-2003

The Expected Performance Curve, Samy Bengio, Johnny Mariéthoz and Mikaela Keller, in: International Conference on Machine Learning, ICML, Workshop on ROC Analysis in Machine Learning, 2005

The BANCA Database and Evaluation Protocol, E. Bailly-Baillière, Samy Bengio, Frédéric Bimbot, M. Hamouz, J. Kittler, Johnny Mariéthoz, J. Matas, K. Messer, Vlad Popovici, F. Porée, B. Ruiz and Jean-Philippe Thiran, in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003

Textual Data Representation, Mikaela Keller and Samy Bengio, Idiap-RR-74-2003

Text detection and recognition in images and video sequences, Datong Chen, École Polytechnique Fédérale de Lausanne, 2003

Studying Phase Synchrony for Classification of Mental Tasks in Brain Machine Interfaces, E. Gysels, José del R. Millán, Silvia Chiappa and P. Celka, in: Proceedings of the Conference of the International Society for Brain Electromagnetic Topography, 2003

Speech & Face Based Biometric Authentication at IDIAP, Conrad Sanderson, Samy Bengio, Hervé Bourlard, Johnny Mariéthoz, Ronan Collobert, Mohamed Faouzi BenZeghiba, Fabien Cardinaux and Sébastien Marcel, Idiap-RR-13-2003

Speech & Face Based Biometric Authentication at IDIAP, Conrad Sanderson, Samy Bengio, Hervé Bourlard, Johnny Mariéthoz, Ronan Collobert, Mohamed Faouzi BenZeghiba, Fabien Cardinaux and Sébastien Marcel, in: Proceedings of the 2003 IEEE International Conference on Multimedia & Expo (ICME-03), 2003

Speech Recognition with Auxiliary Information, Todd Andrew Stephenson, École Polytechnique Fédérale de Lausanne, Computer Science Department, 2003

Speech Recognition with Auxiliary Information, Todd Andrew Stephenson, Idiap-RR-28-2003

Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Spectral Structuring of Home Videos, Jean-Marc Odobez, Daniel Gatica-Perez and Maël Guillemot, in: International Conference on Image and Video Retrieval (CIVR'03), Springer Verlag, 2003

Spectral Entropy Based Feature for Robust ASR, Hemant Misra, Shajith Ikbal, Hervé Bourlard and Hynek Hermansky, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2004

Some Emerging Concepts in Speech Recognition., Hynek Hermansky and Hervé Bourlard, Idiap-RR-82-2003

Small Microphone Array: Algorithms and Hardware, Iain A. McCowan and Darren Moore, Idiap-Com-07-2003

Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research, Hynek Hermansky and Nelson Morgan, Idiap-RR-81-2003

Sequential Monte Carlo Video Text Segmentation, Datong Chen and Jean-Marc Odobez, in: ICIP, 2003

Segmenting Multiple Concurrent Speakers Using Microphone Arrays, Guillaume Lathoud, Iain A. McCowan and Darren Moore, in: Proceedings of Eurospeech 2003, 2003

Scaling Large Learning Problems with Hard Parallel Mixtures, Ronan Collobert, Yoshua Bengio and Samy Bengio, in: International Journal on Pattern Recognition and Artificial Intelligence (IJPRAI), 17(3), 2003

Scalability Analysis of Audio-Visual Person Identity Verification, J. Czyz, Samy Bengio, Christine Marcel and L. Vandendorpe, in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003

Robust Features for Frontal Face Authentication in Difficult Image Conditions, Conrad Sanderson and Samy Bengio, Idiap-RR-05-2003

Robust Features for Frontal Face Authentication in Difficult Image Conditions, Conrad Sanderson and Samy Bengio, in: Proceedings of 4th International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA-03), 2003

Reconnaissance de gestes 3D bi-manuels, Agnès Just, Sébastien Marcel, O. Bernier and J. E. Viallet, Idiap-RR-79-2003

Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, Agnès Just, O. Bernier and Sébastien Marcel, in: Proc. of the sixth International Conference on Automatic Face and Gesture Recognition, 2004

Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, Dong Zhang, S. Z. Li and Daniel Gatica-Perez, in: the International Conference on Pattern Recognition (ICPR), 2004

Phoneme-Grapheme Based Speech Recognition System, Mathew Magimai-Doss, Todd Andrew Stephenson, Hervé Bourlard and Samy Bengio, in: Proceedings of IEEE ASRU, 2003

Online Policy Adaptation for Ensemble Classifiers, Christos Dimitrakakis and Samy Bengio, in: 12th European Symposium on Artificial Neural Networks, ESANN 04, 2004

Online Policy Adaptation for Ensemble Classifiers, Christos Dimitrakakis and Samy Bengio, in: Neurocomputing, 2005

On Use of Task Independent Training Data in Tandem Feature Extraction, Sunil Sivadas and Hynek Hermansky, in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004

On the Need for On-Line Learning in Brain-Computer Interfaces, José del R. Millán, Idiap-RR-30-2003

On the Combination of Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: European Conference On Speech, Communication and Technology (EUROSPEECH'03), 2003

On Performance Evaluation of Face Detection and Localization Algorithms, Vlad Popovici, Jean-Philippe Thiran, Yann Rodriguez and Sébastien Marcel, in: 17th International Conference on Pattern Recognition (ICPR), 2004

On Multi-scale Fourier Transform Analysis of Speech Signals, Vivek Tyagi and Hervé Bourlard, Idiap-RR-33-2003

On Image Auto-Annotation with Latent Space Models, Florent Monay and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2003

On Factorizing Spectral Dynamics for Robust Speech Recognition, Vivek Tyagi, Iain A. McCowan, Hervé Bourlard and Hemant Misra, in: Eurospeech, 2003

On automatic annotation of meeting databases, Daniel Gatica-Perez, Iain A. McCowan, Mark Barnard, Samy Bengio and Hervé Bourlard, in: IEEE International Conference on Image Processing (ICIP), 2003

Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models, Alessandro Vinciarelli, Samy Bengio and Horst Bunke, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(6), 2004

Offline Recognition of Large Vocabulary Cursive Handwritten Text, Alessandro Vinciarelli, Samy Bengio and Horst Bunke, in: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), 2003

Offline Cursive Handwriting: From Word To Text Recognition, Alessandro Vinciarelli, Idiap-RR-24-2003

Non-Linear Variance Reduction Techniques in Biometric Authentication, Norman Poh and Samy Bengio, in: Workshop on Multimodal User Authentication, 2003

Nonlinear Spectral Transformations for Robust Speech Recognition, Shajith Ikbal, Hynek Hermansky and Hervé Bourlard, in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop 2003, 2003

Nonlinear Analysis of Cognitive and Motor-related EEG Signals, Silvia Chiappa and Samy Bengio, Idiap-RR-14-2003

Non-Invasive Brain-Actuated Control of a Mobile Robot, José del R. Millán, F. Renkens, J. Mouriño and W. Gerstner, in: Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003

Noisy Text Categorization, Alessandro Vinciarelli, in: Proceedings of International Conference on Pattern Recognition (ICPR), 2004

Noise Robust Discriminative Models, Quan Le and Samy Bengio, Idiap-RR-40-2003

Multimodal Identity Verification at IDIAP, Christine Marcel, Idiap-Com-04-2003

Multimodal Authentication using Asynchronous HMMs, Samy Bengio, in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003

Multi-Modal Audio-Visual Event Recognition for Football Analysis, Mark Barnard, Jean-Marc Odobez and Samy Bengio, in: Proc. IEEE Workshop on Neural Networks for Signal Processing (NNSP), 2003

Monte Carlo Video Text Segmentation, Datong Chen and Jean-Marc Odobez, Idiap-RR-07-2003

Modélisation implicite du mouvement en suivi par filtrage de Monte Carlo séquentiel, Jean-Marc Odobez and Silèye O. Ba, in: GRETSI conference, Signal and Image Processing,, 2003

Microphone Array Speech Recognition : Experiments on Overlapping Speech in Meetings, Darren Moore and Iain A. McCowan, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, Vivek Tyagi, Iain A. McCowan, Hervé Bourlard and Hemant Misra, in: IEEE ASRU, 2003

Meeting Data Collection Specifications, Iain A. McCowan, Daniel Gatica-Perez and Samy Bengio, Idiap-Com-10-2003

Location Based Speaker Segmentation, Guillaume Lathoud and Iain A. McCowan, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, Mathew Magimai-Doss, Samy Bengio and Hervé Bourlard, in: Proceedings of ICASSP, 2004

Internship Report : Summer 2003, Jean-Sébastien Senécal, Idiap-Com-09-2003

Information Retrieval on Noisy Text, David Grangier, Alessandro Vinciarelli and Hervé Bourlard, Idiap-Com-08-2003

In Search of a Good BET, Mike Flynn and Pierre Wellner, Idiap-Com-11-2003

Improving Face Verification using Symmetric Transformation, Sébastien Marcel, Idiap-RR-68-2003

Improving Face Authetication Using Virtual Samples, Norman Poh, Sébastien Marcel and Samy Bengio, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003

IDIAP Demonstration Management, Haiyan Wang and Frank Formaz, Idiap-Com-06-2003

Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003

HMM Mixtures (HMM2) for Robust Speech Recognition, Katrin Weber, Ecole Polytechnique Federale de Lausanne, 2003

HMM inference towards flexible speech recognition, Ait-Hassou Aissa, Idiap-Com-03-2003

From Samples to Objects in Kernel Methods, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-29-2003

Finding Structure in Home Videos by Probabilistic Hierarchical Clustering, Daniel Gatica-Perez, Alexander Loui and Ming-Ting Sun, in: IEEE Transactions on Circuits and Systems for Video Technology, 13(6), 2003

Fast features for face authentication under illumination direction changes, Conrad Sanderson and Kuldip K. Paliwal, in: Pattern Recognition Letters, 24(14), 2003

[DOI]

Face Verification using LDA and MLP on the BANCA database, Sébastien Marcel, Idiap-RR-66-2003

Face Verification Using Adapted Generative Models, Fabien Cardinaux, Conrad Sanderson and Samy Bengio, in: The 6th International Conference on Automatic Face and Gesture Recognition, FG2004, IEEE, 2004

Face Processing & Frontal Face Verification, Conrad Sanderson, Idiap-RR-20-2003

Evaluation of formant-like features for automatic speech recognition, F. de Wet, Katrin Weber, Louis Boves, B. Cranen, Samy Bengio and Hervé Bourlard, Idiap-RR-08-2003

Enhanced Performance of Multimodal Biometric Systems by Confidence Estimation, Sutapa Sarangi, Idiap-Com-05-2003

EEG-based BCI Systems and IDIAP EEG Database, Silvia Chiappa and José del R. Millán, Idiap-RR-64-2003

Direct Non-Invasive Brain Computer Interfaces, R. Grave de Peralta Menendez, S. L. González Andino, José del R. Millán, T. Pun and C. M. Michel, in: Proceedings of the 9th International Conference on Functional Mapping of the Human Brain, 2003

Confusion Matrix Based Entropy Correction in Multi-stream Combination, Hemant Misra and Andrew Morris, in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2003

Conditional Gaussian Mixtures, Todd Andrew Stephenson, Idiap-RR-11-2003

Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, Fabien Cardinaux, Conrad Sanderson and Sébastien Marcel, Idiap-RR-10-2003

Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, Fabien Cardinaux, Conrad Sanderson and Sébastien Marcel, in: 4th International Conference on AUDIO- and VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 2003

Comparison of different feature classifiers for brain computer interfaces, F. Cincotti, A. Scipione, A. Tiniperi, D. Mattia, M. G. Marciani, José del R. Millán, S. Salinari, L. Bianchi and F. Babiloni, in: Proceedings of the 1st International IEEE EMBS Conference on Neural Engineering, 2003

Comparison and Combination of Features in a Hybrid HMM/MLP and a HMM/GMM Speech Recognition System, Pere Pujol, Susagna Pol, Climent Nadeu, Astrid Hagen and Hervé Bourlard, in: to be published in IEEE Transactions on Speech and Audio Processing(48), 2003

Clustering And Segmenting Speakers And Their Locations In Meetings, Jitendra Ajmera, Guillaume Lathoud and Iain A. McCowan, in: ICASSP, 2004

Client Dependent GMM-SVM Models for Speaker Verification, Quan Le and Samy Bengio, in: International Conference on Artificial Neural Networks, ICANN/ICONIP 2003, Springer Verlag, 2003

Boosting Pixel-based Classifiers for Face Verification, Sébastien Marcel and Yann Rodriguez, in: Biometric Authentication Workshop of the 8th European Conference on Computer Vision, BIOAW2004, Springer-Verlag, 2004

Boosting HMMs with an application to speech recognition, Christos Dimitrakakis and Samy Bengio, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004

Automatic Speech Recognition using Dynamic Bayesian Networks with the Energy as an Auxiliary Variable, Jaume Escofet and Todd Andrew Stephenson, Idiap-RR-18-2003

Automatic Facial Expression Analysis: A Survey, B. Fasel and Juergen Luettin, in: Pattern Recognition, 36(1), 2003

Augmenting Frontal Face Models for Non-Frontal Verification, Conrad Sanderson and Samy Bengio, in: Proceedings of the 2003 Workshop on Multimodal User Authentication (MMUA'03), 2003

Audio-Visual Speaker Tracking with Importance Particle Filters, Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan, Jean-Marc Odobez and Darren Moore, in: IEEE International Conference on Image Processing (ICIP), 2003

Audio-Video Person Clustering in Video Databases, F. Kottelat and Jean-Marc Odobez, Idiap-RR-46-2003

Asynchronous BCI and Local Neural Classifiers: An Overview of the Adaptive Brain Interface Project, José del R. Millán and J. Mouriño, in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interface Technology, 11(2), 2003

An Investigation of Spectral Subband Centroids for Speaker Authentication, Norman Poh, Conrad Sanderson and Samy Bengio, in: Int'l Conf. on Biometric Authentication, 2004

An Implicit Motion Likelihood for Tracking with Particle Filters, Jean-Marc Odobez, Silèye O. Ba and Daniel Gatica-Perez, in: British Machine Vision Conference (BMVC), Springer Verlag, 2003

An Alternative To Silence Removal For Text-Independent Speaker Verification, Johnny Mariéthoz and Samy Bengio, Idiap-RR-51-2003

Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language Model, Yoshua Bengio and Jean-Sébastien Senécal, Idiap-RR-35-2003

Adaptive Brain Interfaces for Communication and Control, José del R. Millán, in: Proceedings of the 10th International Conference on Human-Computer Interaction, 2003

Adaptive Brain Interfaces, José del R. Millán, in: Communications of the ACM, 46(3), 2003

Activity Report 2002, IDIAP, Idiap-Com-01-2003

A Symmetric Transformation for LDA-based Face Verification, Sébastien Marcel, in: Proceedings of the 6th International Conference on Automatic Face and Gesture Recognition, IEEE Computer Society Press, 2004

A Statistical Significance Test for Person Authentication, Samy Bengio and Johnny Mariéthoz, in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004

A Robust Speaker Clustering Algorithm, Jitendra Ajmera and Charles Wooters, in: IEEE Automatic Speech Recognition Understanding Workshop, 2003

A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan and Jean-Marc Odobez, in: IEEE Int. Conf. on Computer Vision Workshop on Multimedia Technologies for E-Learning and Collaboration (ICCV-WOMTEC), 2003

A Hierarchical Keyframe User Interface for Browsing Video over the Internet, Maël Guillemot, Pierre Wellner, Daniel Gatica-Perez and Jean-Marc Odobez, in: Proceedings of the 9th International Conference on Human-Computer Interaction (INTERACT-2003), IOS Press, 2003

Variational Information Maximization in Gaussian Channels, Felix Agakov and David Barber, Idiap-RR-88-2004

Variational Information Maximization for Population Coding, David Barber, Idiap-RR-85-2004

Using RASTA in task independent TANDEM feature extraction, Guillermo Aradilla, John Dines and Sunil Sivadas, in: Proceedings of ICSLP, 2004, 2004

User Authentication via Adapted Statistical Models of Face Images, Fabien Cardinaux, Conrad Sanderson and Samy Bengio, in: IEEE Transaction on Signal Processing, 2005

Unsupervised Location-Based Segmentation of Multi-Party Speech, Guillaume Lathoud, Iain A. McCowan and Jean-Marc Odobez, in: Proceedings of the 2004 ICASSP-NIST Meeting Recognition Workshop, 2004

Une application de reconnaissance du locuteur : \\ le User-Customized Password Speaker Verification, Jérôme Kowalczyk, Idiap-Com-04-2004

Tracking People in Meetings with Particles, Daniel Gatica-Perez, Jean-Marc Odobez, Silèye O. Ba, Kevin C. Smith and Guillaume Lathoud, in: Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS,',','), invited paper, 2005

Towards using hierarchical posteriors for flexible automatic speech recognition systems, Hervé Bourlard, Samy Bengio, Mathew Magimai-Doss, Qifeng Zhu, Bertrand Mesot and Nelson Morgan, Idiap-RR-58-2004

Towards Predicting Optimal Subsets of Base-Experts in Biometric Authentication Task, Norman Poh and Samy Bengio, 2004

Theme Topic Mixture Model: A Graphical Model for Document Representation, Mikaela Keller and Samy Bengio, in: Pascal Workshop on Text Mining and Understanding, 2004

The IDIAP Multimedia File Server, Frank Formaz and Norbert Crettol, Idiap-Com-05-2004

The Expected Performance Curve: a New Assessment Measure for Person Authentication, Samy Bengio and Johnny Mariéthoz, in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004

The Auxiliary Variable Trick for deriving Kalman Smoothers, David Barber, Idiap-RR-87-2004

Text Detection and Recognition in Images and Videos, Datong Chen, Jean-Marc Odobez and Hervé Bourlard, in: Pattern Recognition, 37(3), 2004

Tangent Vector Kernels for Invariant Image Classification with SVMs, Alexei Pozdnoukhov and Samy Bengio, in: 17th Int. Conf. Pattern Recognition (ICPR), 2004

Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, Jithendra Vepa and Simon King, in: Proceedings of ICSLP, 2004

Stochastic techniques in deriving perceptual knowledge, Hynek Hermansky, Idiap-RR-84-2004

Statistical Transformations of Frontal Models for Non-Frontal Face Verification, Conrad Sanderson and Samy Bengio, in: Proceedings of the IEEE International Conference on Image Processing (ICIP), 2004

Speech recognition with auxiliary information, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, in: IEEE Trans. on Speech and Audio Processing, 4, 2004

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, Shajith Ikbal, Mathew Magimai-Doss, Hemant Misra and Hervé Bourlard, in: Proceedings of the INTERSPEECH-ICSLP-04, 2004

{S}ignificance {T}ests for {\em Bizarre} {M}easures in 2-{C}lass {C}lassification {T}asks, Mikaela Keller, Johnny Mariéthoz and Samy Bengio, Idiap-RR-34-2004

Sequence Classification with Input-Output Hidden Markov Models, Silvia Chiappa and Samy Bengio, Idiap-RR-13-2004

Sector-Based Detection for Hands-Free Speech Enhancement in Cars, Guillaume Lathoud, Julien Bourgeois and Jürgen Freudenberger, in: EURASIP Journal on Applied Signal Processing, Special Issue on Advances in Multimicrophone Speech Processing, 2006

Robust Playfield Segmentation using MAP Adaptation, Mark Barnard and Jean-Marc Odobez, in: Proc. 17th International Conference on Pattern Recognition (ICPR 2004), 2004

Robust Audio Segmentation, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, École Polytechnique Fédérale de Lausanne, 2004

Robust Audio Segmentation, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-35-2004

Restoring Locomotion with a Thought Controlled Mobile Robot, José del R. Millán, in: Proceedings of the 4th Forum of European Neuroscience, 2004

Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, Michael McGreevy, in: Proceedings of SST 2004 (10th Australian International Conference on Speech Science & Technology,',','), Sydney, Australia, 2004, 2004

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: International Conference on Spoken Language Processing (ICSLP~2004), 2004

PLSA-based Image Auto-Annotation: Constraining the Latent Space, Florent Monay and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2004

PLP$^2$: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns, Marios Athineos, Hynek Hermansky and Daniel P. W. Ellis, 2004

Phoneme vs Grapheme Based Automatic Speech Recognition, Mathew Magimai-Doss, John Dines, Hervé Bourlard and Hynek Hermansky, Idiap-RR-48-2004

Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, Shajith Ikbal, Hemant Misra, Hervé Bourlard and Hynek Hermansky, in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004

Phase AutoCorrelation (PAC) Features for Noise Robust ASR, Shajith Ikbal, Hemant Misra, Hervé Bourlard and Hynek Hermansky, Idiap-RR-40-2004

Order Matters: A Distributed Sampling Method for Multi-Object Tracking, Kevin C. Smith, in: British Machine Vision Conference (BMVC), 2004

On the Use of Information Retrieval Measures for Speech Recognition Evaluation, Iain A. McCowan, Darren Moore, John Dines, Daniel Gatica-Perez, Mike Flynn, Pierre Wellner and Hervé Bourlard, Idiap-RR-73-2004

On the Need for On-Line Learning in Brain-Computer Interfaces, José del R. Millán, in: Proceedings of the International Joint Conference on Neural Networks, 2004

On the Adequacy of Baseform Pronunciations and Pronunciation Variants, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-27-2004

On Local Features for Face Verification, Marc Saban and Conrad Sanderson, Idiap-RR-36-2004

Nonlinear Feature Transformations for Noise Robust Speech Recognition, Shajith Ikbal, Ecole Polytechnique Fédérale de Lausanne, 2004

Non-Invasive Brain-Actuated Control of a Mobile Robot by Human EEG, José del R. Millán, F. Renkens, J. Mouriño and W. Gerstner, in: IEEE Trans. on Biomedical Engineering, Special Issue on Brain-Machine Interfaces, 51(6), 2004

Noisy Text Clustering, David Grangier and Alessandro Vinciarelli, Idiap-RR-31-2004

Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, Norman Poh and Samy Bengio, in: The Speaker and Recognition Workshop, 2004

New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, Petr Fousek, Petr Svojanovsky, Frantisek Grezl and Hynek Hermansky, in: Proceedings of International Conference on Spoken Language Processing (ICSLP), 2004

Multi-resolution Spectral Entropy Based Feature for Robust ASR, Hemant Misra, Shajith Ikbal, Sunil Sivadas and Hervé Bourlard, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2005

Multimodal Speech Processing Using Asynchronous Hidden Markov Models, Samy Bengio, in: Information Fusion, 5(2), 2004

Multimodal Group Action Clustering in Meetings, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, in: ACM 2nd International Workshop on Video Surveillance & Sensor Networks in conjunction with 12th ACM International Conference on Multimedia, 2004

Modelling Auxiliary Features in Tandem Systems, Mathew Magimai-Doss, Todd Andrew Stephenson, Shajith Ikbal and Hervé Bourlard, in: Proceedings of ICSLP, 2004

Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, in: the Second IEEE Workshop on Event Mining: Detection and Recognition of Events in Video, In Association with CVPR, 2004

Modeling Individual and Group Actions in Meetings With Layered HMMs, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, in: IEEE Transaction on Multimedia, June, 2006, 2004

Making Retrieval Faster Through Document Clustering, David Grangier and Alessandro Vinciarelli, Idiap-RR-02-2004

LP-TRAP: Linear predictive temporal patterns, Marios Athineos, Hynek Hermansky and Daniel P. W. Ellis, 2004

Links Between Perceptrons, MLPs and SVMs, Ronan Collobert and Samy Bengio, in: International Conference on Machine Learning, ICML, 2004

Large Scale Machine Learning, Ronan Collobert, Université de Paris VI, 2004

Invariances in Kernel Methods: From Samples to Objects, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-56-2004

Improving Single Modal and Multimodal Biometric Authentication Using F-ratio Client-Dependent Normalisation, Norman Poh and Samy Bengio, Idiap-RR-52-2004

Identity verification using speech and face information, Conrad Sanderson and Kuldip K. Paliwal, in: Digital Signal Processing, 14(5), 2004

[DOI]

HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition, Shajith Ikbal, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-50-2004

HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, Silvia Chiappa and Samy Bengio, in: European Symposium on Artificial Neural Networks ESANN, 2004

HMM and IOHMM for the Recognition of Mono- and Bi-Manual 3D Hand Gestures, Agnès Just, O. Bernier and Sébastien Marcel, Idiap-RR-39-2004

Fusion of Structural and Color Local Descriptors for Enhanced Object Recognition, Pedro Quelhas and Jean-Marc Odobez, in: Proceedings IEEE WIAMIS 2004(5th International Workshop on Image Analysis for Multimedia Interactive Services,',','), 21-23 April, 2004, Lisboa, Portugal, 2004

Face Authentication using Client-specific Matching Pursuit, Sébastien Marcel, P. Jost, P. Vandergheynst and Jean-Philippe Thiran, Idiap-RR-78-2004

Evidences of Equal Error Rate Reduction in Biometric Authentication Fusion, Norman Poh and Samy Bengio, Idiap-RR-43-2004

Evaluation of Formant-Like Features for Automatic Speech Recognition, F. de Wet, Katrin Weber, Louis Boves, B. Cranen, Samy Bengio and Hervé Bourlard, in: Journal of the Acoustical Society of America (JASA), 116(3), 2004

Estimating the Quality of Face Localization for Face Verification, Yann Rodriguez, Fabien Cardinaux, Samy Bengio and Johnny Mariéthoz, in: IEEE International Conference on Image Processing, ICIP, 2004

Estimates of Parameter Distributions for Optimal Action Selection, Christos Dimitrakakis and Samy Bengio, Idiap-RR-72-2004

Entropy Based Combination of Tandem Representations for Noise Robust ASR, Shajith Ikbal, Hemant Misra, Sunil Sivadas, Hynek Hermansky and Hervé Bourlard, in: Proceedings of the INTERSPEECH-ICSLP-04, 2004

Embedding motion in model-based stochastic tracking, Jean-Marc Odobez and Daniel Gatica-Perez, in: 17th Int. Conf. Pattern Recognition (ICPR), 2004

Embedding Motion in Model-Based Stochastic Tracking, Jean-Marc Odobez, Daniel Gatica-Perez and Silèye O. Ba, in: IEEE Transaction on Image Processing, 15(11), 2006

Effect of Segmentation Method on Video Retrieval Performance, David Grangier and Alessandro Vinciarelli, in: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo (ICME-05), 2005

Effect of Recognition Errors on Text Clustering, David Grangier and Alessandro Vinciarelli, Idiap-RR-82-2004

Effect of Recognition Errors on Information Retrieval Performance, Alessandro Vinciarelli, in: Proceedings of International Workshop on Frontiers in Handwriting Recognition, 2004

Detecting Group Interest-level in Meetings, Daniel Gatica-Perez, Iain A. McCowan, Dong Zhang and Samy Bengio, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005

Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, Norman Poh and Samy Bengio, in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005

Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, Norman Poh and Samy Bengio, in: Pattern Recognition Journal, 2005

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004

Browsing Recorded Meetings with Ferret, Pierre Wellner, Mike Flynn and Maël Guillemot, Idiap-RR-32-2004

Brain-Actuated Interaction, José del R. Millán, F. Renkens, J. Mouriño and W. Gerstner, in: Artificial Intelligence, 159(1-2), 2004

Boosting word error rates, Christos Dimitrakakis and Samy Bengio, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2005

Automatic Analysis of Multimodal Group Actions in Meetings, Iain A. McCowan, Daniel Gatica-Perez, Samy Bengio, Guillaume Lathoud, Mark Barnard and Dong Zhang, in: IEEE Transactions on Pattern Analysis and Machine Intelligence (to appear), 2004

Assessing Scene Structuring in Consumer Videos, Daniel Gatica-Perez, Napat Triroj, Jean-Marc Odobez, Alexander Loui and Ming-Ting Sun, in: Int. Conf. on Image and Video Retrieval (CIVR), 2004

Are two Classifiers performing equally? A treatment using Bayesian Hypothesis Testing, David Barber, Idiap-RR-57-2004

An Online Audio Indexing System, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, 2004

An Auxiliary Variational Method, Felix Agakov and David Barber, Idiap-RR-86-2004

Activity Report 2003, IDIAP, Idiap-Com-01-2004

A video package for Torch, Julien Tiphaigne and Sébastien Marcel, Idiap-Com-02-2004

A Study of the Effects of Score Normalisation Prior to Fusion in Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-69-2004

A Stable Switching Kalman Smoother, David Barber, Idiap-RR-89-2004

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, Guillaume Lathoud and Iain A. McCowan, in: Proceedings of the 2004 SAPA Workshop, 2004

A probabilistic framework for joint head tracking and pose estimation, Silèye O. Ba and Jean-Marc Odobez, in: 17th Int. Conf. Pattern Recognition (ICPR), 2004

A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, Norman Poh and Samy Bengio, in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005

A New Speech Recognition Baseline System for Numbers 95 Version 1.3 Based on Torch, Johnny Mariéthoz and Samy Bengio, Idiap-RR-16-2004

A Meeting Browser Evaluation Test, Pierre Wellner, Mike Flynn, Simon Tucker and Steve Whittaker, Idiap-RR-53-2004

A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Contrast Independent Features and Machine Learning Methods, Datong Chen, Jean-Marc Odobez and Jean-Philippe Thiran, in: Signal Processing: Image Communication, 19(3), 2004

A Gentle Hessian for Efficient Gradient Descent, Ronan Collobert and Samy Bengio, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004

A Generative Model for Music Transcription, A. T. Cemgil, B. Kappen and David Barber, in: IEEE Transactions on Speech and Audio Processing, 2004

You Are Wrong!---Automatic Detection of Interaction Errors from Brain Waves, Pierre W. Ferrez and José del R. Millán, in: Proceedings of the 19th International Joint Conference on Artificial Intelligence, 2005

Writer Identification for Smart Meeting Room Systems, Marcus Liwicki, Andreas Schlapbach, Horst Bunke, Samy Bengio, Johnny Mariéthoz and Jonas Richiardi, in: Seventh IAPR Workshop on Document Analysis Systems, DAS, 2006

Video Text Recognition using Sequential Monte Carlo and Error Voting Methods, Datong Chen and Jean-Marc Odobez, in: Pattern Recognition Letters, 26(9), 2005

Using Pitch as Prior Knowledge in Template-Based Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: Proceedings of ICASSP, 2006, 2006

Using Auxiliary Sources of Knowledge for Automatic Speech Recognition, Mathew Magimai-Doss, École Polytechnique Fédérale de Lausanne, Computer Science Department, 2005

Unsupervised Spectral Subtraction for Noise-Robust ASR, Guillaume Lathoud, Mathew Magimai-Doss, Bertrand Mesot and Hervé Bourlard, in: Proceedings of the 2005 IEEE ASRU Workshop, 2005

Two-Handed Gesture Recognition, Agnès Just and Sébastien Marcel, Idiap-RR-24-2005

Towards Explaining the Success (Or Failure) of Fusion in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-43-2005

Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition, Petr Fousek and Hynek Hermansky, Idiap-RR-64-2005

Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, Guillaume Lathoud, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of ICASSP 2006, 2006

The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), Hynek Hermansky, Petr Fousek and Mikko Lehtonen, in: Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005, 2005

The Multi-Channel Wall Street Journal Audio Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments, Mike Lincoln, Iain A. McCowan, Jithendra Vepa and Hari Krishna Maganti, Idiap-RR-69-2005

The AMI Meeting Corpus: a Pre-Announcement, Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Maël Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain A. McCowan, Wilfried Post, Dennis Reidsma and Pierre Wellner, in: Machine Learning for Multimodal Interaction: Second International Workshop, MLMI'2005, 2005

Stable Directed Belief Propagation in Gaussian DAGs using the auxiliary variable trick, David Barber and Peter Sollich, Idiap-RR-72-2005

Sports Event Recognition using Layered HMMs, Mark Barnard and Jean-Marc Odobez, Idiap-RR-07-2005

Speech Acquisition in Meetings with an Audio-Visual Sensor Array, Iain A. McCowan, Maganti Hari Krishna, Daniel Gatica-Perez, Darren Moore and Silèye O. Ba, in: Pro. IEEE ICME, 2005

Spectral Entropy Feature in Multi-stream for Robust ASR, Hemant Misra and Hervé Bourlard, Idiap-RR-45-2005

Spectral Entropy Feature in Full-Combination Multi-Stream for Robust ASR, Hemant Misra and Hervé Bourlard, in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2005

Sociometry Based Multiparty Audio Recordings Segmentation, Alessandro Vinciarelli, in: Proceedings of the IEEE Conference on Multimedia and Expo (ICME 2006), 2006

Semi-supervised Meeting Event Recognition with Adapted HMMs, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, in: Pro. IEEE ICME, 2005

Semi-supervised Adapted HMMs for Unusual Event Detection, Dong Zhang, Daniel Gatica-Perez, Samy Bengio and Iain A. McCowan, in: Pro. IEEE CVPR, 2005

Probabilistic Tagging of Unstructured Genealogical Records, Mike Perrow and David Barber, Idiap-RR-86-2005

Person Authentication using Brainwaves (EEG) and Maximum A Posteriori Model Adaptation, Sébastien Marcel and José del R. Millán, in: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Special Issue on Biometrics, 2007

Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps, Norman Poh, Alvin Martin and Samy Bengio, in: IEEE Pattern Analysis and Machine intelligence, 2007

Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing in Supervised Learing, J-P. Pfister, T. Toyoizumi, David Barber and W. Gerstner, Idiap-RR-88-2005

Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing in Supervised Learing, J-P. Pfister, T. Toyoizumi, David Barber and W. Gerstner, 2005

On Variable-scale Piecewise Stationary Spectral Analysis of Speech Signals for Asr, Vivek Tyagi, Hervé Bourlard and Christian Wellekens, in: Speech Communication, 48(9), 2006

On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, Vivek Tyagi, Hervé Bourlard and Christian Wellekens, Idiap-RR-19-2005

On transforming statistical models for non-frontal face verification, Conrad Sanderson, Samy Bengio and Yongsheng Gao, in: Pattern Recognition (in press), 2005

[DOI]

On Accuracy/Robustness/Complexity Trade-Offs in Face Verification, Conrad Sanderson, Fabien Cardinaux and Samy Bengio, in: IEEE International Conference on Information Technology and Applications, ICITA, 2005

OCR Based Slide Retrieval, Nabil Daddaoua, Jean-Marc Odobez and Alessandro Vinciarelli, Idiap-RR-11-2005

Non-Invasive Estimation of Local Field Potentials for Neuroprosthesis Control, R. Grave de Peralta Menendez, S. L. González Andino, L. Perez, Pierre W. Ferrez and José del R. Millán, in: Cognitive Processing, Special Issue on Motor Planning in Humans and Neuroprosthesis Control, 6(1), 2005

Noisy Text Categorization, Alessandro Vinciarelli, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(12), 2005

Multiview Face Detection, Tiffany Sauquet, Yann Rodriguez and Sébastien Marcel, Idiap-RR-49-2005

Multi-stream ASR: An Oracle Perspective, Hemant Misra, Jithendra Vepa and Hervé Bourlard, in: Proceedings of ISCA International Conference on Spoken Language Processing (ICSLP), 2006

Multi-resolution RASTA filtering for TANDEM-based ASR, Hynek Hermansky and Petr Fousek, in: Proceedings of Interspeech 2005, 2005

Multimodal Multispeaker Probabilistic Tracking in Meetings, Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez and Iain A. McCowan, in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2005

Multimodal Integration for Meeting Group Action Segmentation and Recognition, Marc Al-Hames, Alfred Dielmann, Daniel Gatica-Perez, Stephan Reiter, Steve Renals and Dong Zhang, in: MLMI, 2005

Multimedia event modelling and recognition, Mark Barnard, École Polytechnique Fédérale de Lausanne, 2005

Multichannel Speech Enhancement in Cars: Explicit vs. Implicit Adaptation Control, Guillaume Lathoud, Julien Bourgeois and Jürgen Freudenberger, in: Proceedings of HSCMA 2005, 2005

Multi Channel Sequence Processing, Samy Bengio and Hervé Bourlard, Idiap-RR-04-2005

Monte Carlo Video Text Segmentation, Datong Chen, Jean-Marc Odobez and Jean-Philippe Thiran, in: International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI), 19(5), 2005

Modeling Scenes with Local Descriptors and Latent Aspects, Pedro Quelhas, Florent Monay, Jean-Marc Odobez, Daniel Gatica-Perez, Tinne Tuytelaars and Luc Van Gool, in: IEEE Int. Conf. on Computer Vision, 2005

Modeling Interactions from Email Communication, Daniel Gatica-Perez, Dong Zhang and Samy Bengio, in: Proc. IEEE International Conference on Multimedia & Expo (ICME,',','), 2006, 2006

Measuring the Performance of Face Localization Systems, Yann Rodriguez, Fabien Cardinaux, Samy Bengio and Johnny Mariéthoz, in: Image and Vision Computing Journal, 24(8), 2006

Machine Learning for Multimodal Interaction: First International Workshop, MLMI'2004, Springer-Verlag Heidelberg, 2005

Local Features and 1D-HMMs for Fast and Robust Face Authentication, Fabien Cardinaux, Idiap-RR-17-2005

Local Binary Patterns as an Image Preprocessing for Face Authentication, Guillaume Heusch, Yann Rodriguez and Sébastien Marcel, in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006

Lighting Normalization Algorithms for Face Verification, Guillaume Heusch, Fabien Cardinaux and Sébastien Marcel, Idiap-Com-03-2005

Learning influence among interacting Markov chains, Dong Zhang, Daniel Gatica-Perez, Samy Bengio and Deb Roy, in: NIPS, 2005

Kernelized Infomax Clustering, Felix Agakov and David Barber, Idiap-RR-73-2005

Joint Training of Multi-Stream HMMs, Samy Bengio, Idiap-RR-22-2005

Joint Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba, École Polytechnique Fédérale de Lausanne, Computer Science Department, 2005

Interfaces Cerebrales, José del R. Millán, in: Mente y Cerebro, 13(July), 2005

Inferring Document Similarity from Hyperlinks, David Grangier and Samy Bengio, in: ACM Conference on Information and Knowledge Management, 2005

Improving Speech Recognition Using a Data-Driven Approach, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: Proceedings of Interspeech, 2005, 2005

Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, Norman Poh and Samy Bengio, in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005

Improving Continuous Speech Recognition System Performance with Grapheme Modelling, Mathew Magimai-Doss, John Dines, Hervé Bourlard and Hynek Hermansky, Idiap-RR-16-2005

Implicit Control of Noise Canceller for Speech Enhancement, Julien Bourgeois, Jürgen Freudenberger and Guillaume Lathoud, in: Proceedings of INTERSPEECH 2005, 2005

How Do Correlation and Variance of Base-Experts Affect Fusion in Biometric Authentication Tasks?, Norman Poh and Samy Bengio, in: IEEE Trans. on Signal Processing, 2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System, Hamed Ketabdar, Hervé Bourlard and Samy Bengio, in: Proceedings MLMI workshop, 2005

Hierarchical approach for spotting keywords, Mikko Lehtonen, Idiap-RR-41-2005

Harmonic Plus Noise Model for Concatenative Speech Synthesis, D. Vandromme, Idiap-RR-37-2005

Gradient estimates of return distributions, Christos Dimitrakakis and Samy Bengio, in: PASCAL Workshop on Principled Methods of Trading Exploration and Exploitation, 2005

Generative Temporal ICA for Classification in Asynchronous BCI Systems, Silvia Chiappa and David Barber, in: The 2nd International IEEE EMBS Conference On Neural Engineering, 2005

Generative Temporal ICA for Classification in Asynchronous BCI Systems, Silvia Chiappa and David Barber, Idiap-RR-08-2005

Generative Independent Component Analysis for EEG Classification, Silvia Chiappa and David Barber, in: European Symposium on Artificial Neural Networks ESANN, 2005

From Meeting Recordings to Web Distribution: Description of the Process, Maël Guillemot and Bastien Crettol, Idiap-Com-05-2005

F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, Norman Poh and Samy Bengio, in: Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-05), 2005

Finding groups of people in Google news, Dhiraj Joshi and Daniel Gatica-Perez, in: ACM Int. Conf. on Human-Centered Multimedia (HCM), 2006

Face Authentication Based on Local Features and Generative Models, Fabien Cardinaux, École Polytechnique Fédérale de Lausanne, 2005

Extracting Information from Multimedia Meeting Collections, Daniel Gatica-Perez, Dong Zhang and Samy Bengio, in: 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005

Exploiting Hyperlinks to Learn a Retrieval Model, David Grangier and Samy Bengio, in: NIPS Workshop on Learning to Rank, 2005

Evaluation of Multiple Cues Head Pose Tracking Algorithm in Natural Environments, Silèye O. Ba and Jean-Marc Odobez, in: International Conference on Multimedia & Expo ICME 2005, 2005

Efficient Kalman Smoothing for Harmonic State-Space Models, David Barber, Idiap-RR-87-2005

Efficient Diffusion-based Illumination Normalization for Face Verification, Guillaume Heusch, Fabien Cardinaux and Sébastien Marcel, Idiap-RR-46-2005

EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, Norman Poh and Samy Bengio, in: Sixth International Workshop on Multiple Classifier System (MCS2005), 2005

Developing and Enhancing Posterior Based Speech Recognition Systems, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, in: Proceedings of Interspeech, 2005

Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, Francesco Camastra, Marco Spinetti and Alessandro Vinciarelli, in: Proceedings of International Conference on Pattern Recognition (ICPR), 2006

Continuous Microphone Array Speech Recognition on Wall Street Journal Corpus, Hari Krishna Maganti, Jithendra Vepa and Hervé Bourlard, Idiap-RR-47-2005

Construction and comparison of approximations for switching linear gaussian state space models, David Barber and Bertrand Mesot, Idiap-RR-06-2005

Construction and comparison of approximations for switching linear gaussian state space models, David Barber, Idiap-RR-71-2005

Constructing visual models with a latent space approach, Florent Monay, Pedro Quelhas, Daniel Gatica-Perez and Jean-Marc Odobez, in: the Springer series of Lecture Notes in Computer Science, 2006

Compensating User-Specific Information with User-Independent Information in Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-44-2005

Chord Representations for Probabilistic Models, Jean-François Paiement, Douglas Eck and Samy Bengio, Idiap-RR-58-2005

Can Chimeric Persons Be Used in Multimodal Biometric Authentication Experiments?, Norman Poh and Samy Bengio, Idiap-RR-20-2005

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?, Johnny Mariéthoz and Samy Bengio, Idiap-RR-61-2005

Benchmarking Non-Parametric Statistical Tests, Mikaela Keller, Samy Bengio and Siew Yeung Wong, in: Advances in Neural Information Processing Systems, NIPS 18. MIT Press, 2005

Bayesian Factorial Linear Gaussian State-Space Models for Biosignal Decomposition, Silvia Chiappa and David Barber, in: IEEE Signal Processing Letters, 2007

AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, Guillaume Lathoud, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag, 2005

Audio-visual probabilistic tracking of multiple speakers in meetings, Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez and Iain A. McCowan, in: IEEE Trans. on Audio, Speech, and Language Processing, accepted for publication., 2006

Application of Information Retrieval Techniques to Single Writer Documents, Alessandro Vinciarelli, in: Pattern Recognition Letters, 26(14-15), 2005

Activity Report 2004, IDIAP, Idiap-Com-01-2005

A Video Database for Head Pose Tracking Evaluation, Silèye O. Ba and Jean-Marc Odobez, Idiap-Com-04-2005

A Unified Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification, Johnny Mariéthoz and Samy Bengio, in: IEEE Signal Processing Letters, Volume 12, 12(7), 2005

A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR, Guillaume Lathoud, Mathew Magimai-Doss and Bertrand Mesot, in: Proceedings of INTERSPEECH 2005, 2005

A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, Guillaume Lathoud and Mathew Magimai-Doss, in: Proceedings of ICASSP 2005, 2005

A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, Silèye O. Ba and Jean-Marc Odobez, in: ACM ICMI Workshop on Multimodal Multiparty Meeting Processing (MMMP), 2005

A Probabilistic Model for Chord Progressions, Jean-François Paiement, Douglas Eck and Samy Bengio, in: Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR), 2005

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, Yves Grandvalet, Johnny Mariéthoz and Samy Bengio, in: Advances in Neural Information Processing Systems, NIPS 15, 2005

A Neural Network for Text Representation, Mikaela Keller and Samy Bengio, in: International Conference on Artificial Neural Networks, ICANN, 2005

A Meeting Browser Evaluation Test, Pierre Wellner, Mike Flynn, Simon Tucker and Steve Whittaker, in: CHI '92: Proceedings of the SIGCHI conference on Human factors in computing systems, Portland, OR, USA, ACM Press, 2005

A Kernel Classifier for Distributions, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-32-2005

A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, Jean-François Paiement, Douglas Eck, Samy Bengio and David Barber, in: Proceedings of the 22nd International Conference on Machine Learning, 2005

A Generative Model for Music Transcription, A. T. Cemgil, B. Kappen and David Barber, Idiap-RR-89-2005

A Discriminative Decoder for the Recognition of Phoneme Sequences, David Grangier and Samy Bengio, Idiap-RR-67-2005

Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, Petr Motlicek, Vijay Ullal and Hynek Hermansky, Idiap-RR-58-2006

Very High Frequency Oscillations (VHFO) as a Predictor of Movement Intentions, S. L. González Andino, R. Grave de Peralta Menendez, G. Thut, José del R. Millán, P. Morier and T. Landis, in: NeuroImage, 32(1), 2006

Using Posterior-Based Features in Template Matching for Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: International Conference on Spoken Language Processing, 2006

Using more informative posterior probabilities for speech recognition, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006

Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, Norman Poh and Samy Bengio, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006

User-Customized Password Speaker Verification Using Multiple Reference and Background Models, Mohamed Faouzi BenZeghiba and Hervé Bourlard, in: Speech Communication, 8, 2006

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, Hari Krishna Maganti, Petr Motlicek and Daniel Gatica-Perez, Idiap-RR-57-2006

Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels, Guillaume Lathoud, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-09-2006

Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, David Barber and Silvia Chiappa, in: NIPS, 2006

Two-Handed Gestures for Human-Computer Interaction, Agnès Just, École Polytechnique Fédérale de Lausanne, 2006

Two-Handed Gestures for Human-Computer Interaction, Agnès Just, Idiap-RR-73-2006

Tracking the Multi Person Wandering Visual Focus of Attention, Kevin C. Smith, Silèye O. Ba, Daniel Gatica-Perez and Jean-Marc Odobez, in: International Conference on Multimodal Interfaces (ICMI06), 2006

Tracking Attention for Multiple People: Wandering Visual Focus of Attention Estimation, Kevin C. Smith, Silèye O. Ba, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-40-2006

Towards using slide information to enhance speech transcription of meetings, Artem Peregoudov, Alessandro Vinciarelli and Hervé Bourlard, Idiap-RR-01-2006

Towards a Robust BCI: Error Potentials and Online Learning, Anna Buttfield, Pierre W. Ferrez and José del R. Millán, in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, 14(2), 2006

The segmentation of multi-channel meeting recordings for automatic speech recognition, John Dines, Jithendra Vepa and Thomas Hain, in: Int. Conf. on Spoken Language Processing (Interspeech ICSLP), 2006

The more you learn, the less you store: memory\--controlled incremental SVM, Andrzej Pronobis and Barbara Caputo, Idiap-RR-51-2006

The More you Learn, the Less you Store: Memory-Controlled Incremental SVM, Andrzej Pronobis and Barbara Caputo, in: Proceedings of International Cognitive Vision Workshop (ICVW) 2006), 2006

The Juicer LVCSR Decoder - User Manual for Juicer version 0.5.0, Darren Moore, Idiap-Com-03-2006

The BCI Competition III: Validating Alternative Approaches to Actual BCI Problems, B. Blankertz, K. -R. Müller, D. Krusienski, G. Schalk, J. R. Wolpaw, A. Schlögl, Gert Pfurtscheller, José del R. Millán, M. Schroeder and N. Birbaumer, in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, 14(2), 2006

Switching Linear Dynamical Systems for Noise Robust Speech Recognition, Bertrand Mesot and David Barber, Idiap-RR-08-2006

SVM-based Transfer of Visual Knowledge Across Robotic Platforms, Jie Luo, Andrzej Pronobis and Barbara Caputo, Idiap-RR-65-2006

Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis, Jithendra Vepa and Simon King, in: IEEE Trans. on Audio, Speech and Language Processing, 14(5), 2006

Spiking Neuron Networks A survey, Hélène Paugam-Moisy, Idiap-RR-11-2006

Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array, Hari Krishna Maganti, Daniel Gatica-Perez and Iain A. McCowan, Idiap-RR-24-2006

Speech Coding based on Spectral Dynamics, Petr Motlicek, Hynek Hermansky, Harinath Garudadri and Naveen Srinivasamurthy, in: Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2006

Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, Hari Krishna Maganti and Daniel Gatica-Perez, Idiap-RR-29-2006

Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, Hari Krishna Maganti and Daniel Gatica-Perez, in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2006

Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, Guillaume Lathoud, Idiap-RR-77-2006

Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, Guillaume Lathoud, Ecole Polytechnique Fédérale de Lausanne, 2006

Sociometry Based Multiparty Audio Recordings Summarization, Alessandro Vinciarelli, in: Proceedings of International Conference on Pattern Recognition (ICPR 2006), 2006

Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, Alessandro Vinciarelli, F. Fernàndez and Sarah Favre, Idiap-RR-75-2006

Robust-to-Illumination Face Localisation using Active Shape Models and Local Binary Patterns, Sébastien Marcel, Jean Keomany and Yann Rodriguez, Idiap-RR-47-2006

Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, Norman Poh, Samy Bengio and Arun Ross, in: Multimodal User Authentication (MMUA), 2006

Recognizing People's Focus of Attention from Head Poses: a Study, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-42-2006

Probabilistic Graphical Models for Human Interaction Analysis, Dong Zhang, Idiap-RR-78-2006

Probabilistic Graphical Models for Human Interaction Analysis, Dong Zhang, École Polytechnique Fédérale de Lausanne, 2006

Prior Knowledge in Kernel Methods, Alexei Pozdnoukhov, École Polytechnique Fédérale de Lausanne, 2006

Posterior Based Keyword Spotting with A Priori Thresholds, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, in: International Conference on Spoken Language Processing (ICSLP), 2006

ORGIDIAP : le couteau suisse pour la gestion d'une entreprise, Jonathan Rey and Frank Formaz, Idiap-Com-05-2006

Online statistical estimation for vehicle control, Christos Dimitrakakis, Idiap-RR-13-2006

Online Classifier Adaptation in Brain-Computer Interfaces, Anna Buttfield and José del R. Millán, Idiap-RR-16-2006

On the Recent Use of Local Binary Patterns for Face Authentication, Sébastien Marcel, Yann Rodriguez and Guillaume Heusch, Idiap-RR-34-2006

Observations on Multi-Band Asynchrony in Distant Speech Recordings, Guillaume Lathoud, Idiap-RR-74-2006

Non-Invasive Brain-Actuated Control of a Mobile Robot by Human EEG, José del R. Millán, F. Renkens, J. Mouriño and W. Gerstner, in: 2006 IMIA Yearbook of Medical Informatics, Schattauer Verlag, 2006

Nearly optimal exploration-exploitation decision thresholds, Christos Dimitrakakis, in: Int. Conf. on Artificial Neural Networks (ICANN), 2006

Natural Scene Image Modeling using Color and Texture Visterms., Pedro Quelhas and Jean-Marc Odobez, in: Conference on Image and Video Retrieval CIVR, 2006

Multi-system Biometric Authentication: Optimal Fusion and User-Specific Information, Norman Poh, École Polytechnique Fédérale de Lausanne, 2006

Multi-stream Processing for Noise Robust Speech Recognition, Hemant Misra, École Polytechnique Fédérale de Lausanne, 2006

Multi-Person Tracking in Meetings: A Comparative Study, Kevin C. Smith, Sascha Schreiber, Vítezslav Beran, Igor Potúcek, Gerhard Rigoll and Daniel Gatica-Perez, in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006

Model Adaptation for Sentence Unit Segmentation from Speech, Sébastien Cuendet, Idiap-RR-64-2006

Melanoma Recognition using Kernel Classifiers, Elisabetta La Torre, Barbara Caputo and Tatiana Tommasi, Idiap-RR-53-2006

Master Thesis: Integration of the Harmonic plus Noise Model (HNM) into the Hidden Markov Model-Based Speech Synthesis System (HTS), Coralie Hemptinne, Idiap-RR-69-2006

Managing IDIAP Inventory (Computers, Components, Software and Licences), Jonathan Rey and Frank Formaz, Idiap-Com-04-2006

Machine Learning Approaches to Text Representation using Unlabeled Data, Mikaela Keller, Ecole Polytechnique Fédérale de Lausanne, 2006

Learning to Retrieve Images from Text Queries with a Discriminative Model, David Grangier, Florent Monay and Samy Bengio, in: International Workshop on Adaptive Multimedia Retrieval (AMR), 2006

Kernel Methods for Melanoma Recognition, Tatiana Tommasi, Elisabetta La Torre and Barbara Caputo, in: Proceedings of Workshop on Computer Vision Approaches to Medical Image Analysis (CVAMIA) 2006), 2006

Juicer: A Weighted Finite-State Transducer speech decoder, Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng and Thomas Hain, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine LEarning Algorithms MLMI'06, 2006

Investigating Lexical Substitution Scoring for Subtitle Generation, Oren Glickman, Ido Dagan, Mikaela Keller, Samy Bengio and Walter Daelemans, in: Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL)., 2006

Integrating co-occurrence and spatial contexts on patch-based scene segmentation, Florent Monay, Pedro Quelhas, Jean-Marc Odobez and Daniel Gatica-Perez, in: Beyond Patches Workshop, in conjunction with CVPR, 2006

Infinite Models for Speaker Clustering, Fabio Valente, in: International Conference on Spoken Language Processing, 2006

Indexation de Documents Manuscrits, Alessandro Vinciarelli, in: Proceedings du Colloque International Francophone sur l'Ecrit et le Document (CIFED06), 2006

Incremental Learning for Place Recognition in Dynamic Environments, Jie Luo, Andrzej Pronobis, Barbara Caputo and Patric Jensfelt, Idiap-RR-52-2006

Identifying unexpected words using in-context and out-of-context phoneme posteriors, Hamed Ketabdar and Hynek Hermansky, Idiap-RR-68-2006

Hand Posture Classification and Recognition using the Modified Census Transform, Agnès Just, Yann Rodriguez and Sébastien Marcel, in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006

Further Applications of Sector-Based Detection and Short-Term Clustering, Guillaume Lathoud, Idiap-RR-26-2006

Face Detection and Verification using Local Binary Patterns, Yann Rodriguez, Idiap-RR-79-2006

Face Detection and Verification using Local Binary Patterns, Yann Rodriguez, École Polytechnique Fédérale de Lausanne, 2006

Face Authentication Using Adapted Local Binary Pattern Histograms, Yann Rodriguez and Sébastien Marcel, in: 9th European Conference on Computer Vision (ECCV), 2006

Exploring Contextual Information in a Layered Framework for Group Action Recognition, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006

Estimating the Confidence Interval of Expected Performance Curve in Biometric Authentication Using Joint Bootstrap, Norman Poh and Samy Bengio, Idiap-RR-25-2006

Ensembles for Sequence Learning, Christos Dimitrakakis, École Polytechnique Fédérale de Lausanne, 2006

EEG Classification using Generative Independent Component Analysis, Silvia Chiappa and David Barber, in: Neurocomputing, 2006

Discrmininant Models for Text-independent Speaker Verification, Johnny Mariéthoz, Idiap-RR-70-2006

Discriminative Kernel-Based Phoneme Sequence Recognition, Joseph Keshet, Samy Bengio, Dan Chazan, Shai Shalev-Shwartz and Yoram Singer, Idiap-RR-14-2006

Discriminant linear processing of time-frequency plane, Fabio Valente and Hynek Hermansky, in: International Conference on Spoken Language Processing, 2006

Detection and Application of Influence Rankings in Small Group Meetings, Rutger Rienks, Dong Zhang, Daniel Gatica-Perez and Wilfried Post, in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006

Detecting Intentional Mental Transitions in an Asynchronous BCI, Ferran Galán, Francesc Oliva, Joan Guàrdia, Pierre W. Ferrez and José del R. Millán, Idiap-RR-43-2006

Detecting Abandoned Luggage Items in a Public Space, Kevin C. Smith, Pedro Quelhas and Daniel Gatica-Perez, in: IEEE Performance Evaluation of Tracking and Surveillance Workshop (PETS), 2006

Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, Fabio Valente and Hynek Hermansky, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007

Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, Sébastien Marcel, Johnny Mariéthoz, Yann Rodriguez and Fabien Cardinaux, in: Workshop on Multimodal User Authentication (MMUA), 2006

Audio Coding Based on Long Temporal Segments: Experiments With Quantization of Excitation Signal, Vijay Ullal and Petr Motlicek, Idiap-RR-46-2006

Audio Coding Based on Long Temporal Contexts, Petr Motlicek, Hynek Hermansky, Harinath Garudadri and Naveen Srinivasamurthy, Idiap-RR-30-2006

Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations, Artem Peregoudov, Alessandro Vinciarelli and Hervé Bourlard, Idiap-RR-56-2006

Application of Information Retrieval Technologies to Presentation Slides, Alessandro Vinciarelli and Jean-Marc Odobez, in: IEEE Transactions on Multimedia, 8(5), 2006

Annotation of face detection: description of XML format and files, Sébastien Marcel, Yann Rodriguez, Maël Guillemot and Andrei Popescu-Belis, Idiap-Com-06-2006

Analyzing Group Interactions in Conversations: a Review, Daniel Gatica-Perez, in: IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI), 2006

Analysis and Classification of EEG Signals using Probabilistic Models for Brain Computer Interfaces, Silvia Chiappa, École Polytechnique Fédérale de Lausanne, 2006

[DOI]
[URL]

An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007

Activity Report 2005, IDIAP, Idiap-Com-01-2006

Active Shape Models Using Local Binary Patterns, Jean Keomany and Sébastien Marcel, Idiap-RR-07-2006

A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, Silèye O. Ba and Jean-Marc Odobez, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006

A Neural Network to Retrieve Images from Text Queries, David Grangier and Samy Bengio, in: International Conference on Artificial Neural Networks (ICANN), 2006

A Multitask Learning Approach to Document Representation using Unlabeled Data, Mikaela Keller and Samy Bengio, Idiap-RR-44-2006

A Max Kernel For Text-Independent Speaker Verification Systems, Johnny Mariéthoz and Samy Bengio, in: Second Workshop on Multimodal User Authentication, MMUA, 2006

A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition, Octavian Cheng, John Dines and Mathew Magimai-Doss, Idiap-RR-62-2006

A Bayesian Alternative to Gain Adaptation in Autoregressive Hidden Markov Models, Bertrand Mesot and David Barber, Idiap-RR-55-2006

2D Multi-Person Tracking: A Comparative Study in AMI Meetings, Kevin C. Smith, Sascha Schreiber, Vítezslav Beran, Igor Potúcek, Gerhard Rigoll and Daniel Gatica-Perez, in: Classification of Events, Activities, and Relationships (CLEAR) 2006, 2006

Towards Brain-Computer Interfacing, R. Grave de Peralta Menendez, S. L. González Andino, Pierre W. Ferrez and José del R. Millán, The MIT Press, 2007

The IDIAP Brain-Computer Interface: An Asynchronous Multi-Class Approach, José del R. Millán, Pierre W. Ferrez and Anna Buttfield, in: Towards Brain-Computer Interfacing, The MIT Press, 2007

Tapping the Mind or Resonating Minds?, José del R. Millán, in: European Visions for the Knowledge Age, Cheshire Henbury, 2007

Speech Recognition based on Template Matching and Phone Posterior Probabilities, Cédric Gaudard, Guillermo Aradilla and Hervé Bourlard, Idiap-Com-02-2007

Sparse Probabilistic Classifiers, Romain Hérault and Yves Grandvalet, in: International Conference on Machine Learning (ICML), 2007

Short-Term Spatio-Temporal Clustering Applied to Multiple Moving Speakers, Guillaume Lathoud and Jean-Marc Odobez, in: IEEE Transactions on Audio, Speech and Language Processing, 15(5):15, 2007

Scalable Wide-band Audio Codec based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, Idiap-RR-16-2007

Role Recognition in Broadcast News Using Social Network Analysis and Duration Distribution Modeling, Alessandro Vinciarelli, in: IEEE Transactions on Multimedia, 2007

On Confusions in a Phoneme Recognizer, Andrew Lovitt, Joel Praveen Pinto and Hynek Hermansky, 2007

Non-Invasive Estimates of Local Field Potentials for Brain-Computer Interfaces, R. Grave de Peralta Menendez, S. L. González Andino, Pierre W. Ferrez and José del R. Millán, in: Towards Brain-Computer Interfacing, The MIT Press, 2007

Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, Fabio Valente, Jithendra Vepa and Hynek Hermansky, Idiap-RR-09-2007

More Efficiency in Multiple Kernel Learning, Alain Rakotomamonjy, Francis Bach, Stéphane Canu and Yves Grandvalet, in: International Conference on Machine Learning (ICML), 2007

Learning the structure of image collections with latent aspect models, Florent Monay, École Polytechnique Fédérale de Lausanne, 2007

Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, David Grangier and Samy Bengio, Idiap-RR-15-2007

Joint Bi-Modal Face and Speaker Authentication using Explicit Polynomial Expansion, Sébastien Marcel, Idiap-RR-14-2007

Hierarchical Neural Networks Feature Extraction for LVCSR system, Fabio Valente, Jithendra Vepa, Christian Plahl, Christian Gollan, Hynek Hermansky and Ralf Schlüter, Idiap-RR-08-2007

Feature Selection Methods on Distributed Linear Inverse Solutions for a Non-Invasive Brain-Machine Interface, Laurent Uldry, Pierre W. Ferrez and José del R. Millán, Idiap-Com-04-2007

Face Authentication with Salient Local Features and Static Bayesian Network, Guillaume Heusch and Sébastien Marcel, Idiap-RR-04-2007

Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting, Joel Praveen Pinto, Andrew Lovitt and Hynek Hermansky, 2007

Error-Related EEG Potentials in Brain-Computer Interfaces, Pierre W. Ferrez and José del R. Millán, in: Towards Brain-Computer Interfacing, The MIT Press, 2007

Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics, John Dines and Jithendra Vepa, Idiap-RR-13-2007

Correcting Confusion Matrices for Phone Recognizers, Andrew Lovitt, Idiap-Com-03-2007

Confidence-based Cue Integration for Visual Place Recognition, Andrzej Pronobis and Barbara Caputo, Idiap-RR-17-2007

Adaptation in Brain-Computer Interfaces, José del R. Millán, Anna Buttfield, C. Vidaurre, M. Krauledat, A. Schlögl, P. Shenoy, B. Blankertz, R.P.N. Rao, R. Cabeza, Gert Pfurtscheller and K. -R. Müller, in: Towards Brain-Computer Interfacing, The MIT Press, 2007

A study of phoneme and grapheme based context-dependent ASR systems, John Dines and Mathew Magimai-Doss, Idiap-RR-12-2007

A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems, Johnny Mariéthoz and Samy Bengio, in: Pattern Recognition, 2007

Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, Alessandro Vinciarelli and Sarah Favre, Idiap-RR-30-2007

COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-51-2007

AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-31-2007

Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, Fabio Valente and Hynek Hermansky, Idiap-RR-45-2007

Recognition and Understanding of Meetings The AMI and AMIDA Projects, Steve Renals, Thomas Hain and Hervé Bourlard, Idiap-RR-46-2007

A Thousand Words in a Scene, Pedro Quelhas, Jean-Marc Odobez, Daniel Gatica-Perez and Tinne Tuytelaars, Idiap-RR-40-2005

Exploiting Contextual Information for Improved Phoneme Recognition, Joel Praveen Pinto, B. Yegnanarayana, Hynek Hermansky and Mathew Magimai-Doss, Idiap-RR-65-2007

Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, Xavier Perrin, Ricardo Chavarriaga, Roland Siegwart and José del R. Millán, Idiap-RR-26-2007

A Generative Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, Idiap-RR-70-2007

A Distance Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, Idiap-RR-33-2008

The Projectron: a Bounded Kernel-Based Perceptron, Francesco Orabona, Joseph Keshet and Barbara Caputo, Idiap-RR-30-2008

A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, Jean-Marc Odobez and Silèye O. Ba, Idiap-RR-20-2007

Modeling semantic aspects for cross-media image indexing, Florent Monay and Daniel Gatica-Perez, Idiap-RR-56-2005

Non-linear Spectral Contrast Stretching for In-car Speech Recognition, Weifeng Li and Hervé Bourlard, Idiap-RR-53-2007

Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, Hayley Hung, Dinesh Babu Jayagopi, Chuohao Yeo, Gerald Friedland, Silèye O. Ba, Jean-Marc Odobez, Kannan Ramchandran, Nikki Mirghafori and Daniel Gatica-Perez, Idiap-RR-29-2007

A Discriminative Kernel-based Model to Rank Images from Text Queries, David Grangier and Samy Bengio, Idiap-RR-38-2007

Hierarchical Penalization, Marie Szafranski, Yves Grandvalet and Pierre Morizet-Mahoudeaux, Idiap-RR-76-2007

The use of brain-computer interfacing for ambient intelligence, Gangadhar Garipelli, Ferran Galán, Ricardo Chavarriaga, Pierre W. Ferrez, Eileen Lew and José del R. Millán, Idiap-RR-61-2007

Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, Idiap-RR-48-2007

Feature Extraction for Multi-class BCI using Canonical Variates Analysis, Ferran Galán, Pierre W. Ferrez, Francesc Oliva, Joan Guàrdia and José del R. Millán, Idiap-RR-23-2007

To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, Ricardo Chavarriaga, Pierre W. Ferrez and José del R. Millán, Idiap-RR-37-2007

A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, Xavier Perrin, Ricardo Chavarriaga, Céline Ray, Roland Siegwart and José del R. Millán, Idiap-RR-78-2007

Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, Hervé Bourlard and Steve Renals, Idiap-RR-27-2008

Characterizing the EEG Correlates of Exploratory Behavior, Nicolas Bourdaud, Ricardo Chavarriaga, Ferran Galán and José del R. Millán, Idiap-RR-28-2008

Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-50-2007

Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-21-2007

Detection and Recognition of Number Sequences in Spoken Utterances, Guillermo Aradilla and Jitendra Ajmera, Idiap-RR-42-2007

Posterior-Based Features and Distances in Template Matching for Speech Recognition, Guillermo Aradilla and Hervé Bourlard, Idiap-RR-41-2007

A graphical tool for monitoring Oz objects activity, Jean-Luc Cochard and Dinh Van Linh Nguyen, in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995

ETC\_vérif, a Prototype of a Cooperative Automatic Speech Recognition System, Jean-Luc Cochard and Murielle Vial, in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995

Role Recognition in Radio Programs using Social Affiliation Networks and Mixtures of Discrete Distributions: an Approach Inspired by Social Cognition, Alessandro Vinciarelli and Sarah Favre, Idiap-RR-40-2007

Mapping Nonverbal Communication into Social Status: Automatic Recognition of Journalists and Non-journalists in Radio News, Alessandro Vinciarelli, Idiap-RR-33-2007

Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, Alessandro Vinciarelli, F. Fernàndez and Sarah Favre, in: IEEE International Conference on Multimedia and Expo (ICME), 2007

Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, Alessandro Vinciarelli and Sarah Favre, in: ACM International Conference on Multimedia, 2007

COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008

AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Automatic Speech Recognition and Understanding Workshop, 2007

Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-26-2008

On the Combination of Auditory and Modulation Frequency Channels for ASR applications, Fabio Valente and Hynek Hermansky, Idiap-RR-12-2008

Hierarchical Neural Networks Feature Extraction for LVCSR system, Fabio Valente, Jithendra Vepa, Christian Plahl, Christian Gollan, Hynek Hermansky and Ralf Schlüter, in: Interspeech 2007, 2007

Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, Fabio Valente, Jithendra Vepa and Hynek Hermansky, in: Interspeech 2007, 2007

Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, Fabio Valente and Hynek Hermansky, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008

Hilbert Envelope Based Specto-Temporal Features for Phoneme Recognition in Telephone Speech, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-18-2008

Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, Idiap-RR-17-2008

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-05-2008

Discriminative Cue Integration for Medical Image Annotation, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, Idiap-RR-64-2007

The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks, Georg Thimm, Perry Moerland and Emile Fiesler, in: Neural Computation, 8(02), 1996

Evaluating pruning methods, Georg Thimm and Emile Fiesler, in: 1995 International Symposium on Artificial Neural Networks (ISANN'95), 1995

Gain Elimination form Backpropagation Neural Networks, Georg Thimm, Emile Fiesler and Perry Moerland, in: Proceedings of the International Conference on Neural Networks, IEEE, Perth, IEEE, 1995

High Order and Multilayer Perceptron Initialization, Georg Thimm and Emile Fiesler, Idiap-RR-07-1994

Weight Initialization for High Order and Multilayer Perceptrons, Georg Thimm and Emile Fiesler, in: Proceedings of the '94 SIPAR--Workshop on Parallel and Distributed Computing, SI Group for Parallel Systems, 1994

Modular Object-Oriented Neural Network Simulators and Topology Generalizations, Georg Thimm, R. Grau and Emile Fiesler, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN 94), Sorrento, Italy, Springer-Verlag, 1994

Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, G. S. V. S. Sivaram and Hynek Hermansky, Idiap-RR-25-2008

Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, G. S. V. S. Sivaram and Hynek Hermansky, Idiap-RR-24-2008

Time Resolved Polarimetry on an Optical Fiber Ammeter, Indu Saxena and R. B. Torbert, in: Journal of the European Optical Society, 5, 1996

Optical Multilayer Perceptrons based on Liquid Crystal Devices, Indu Saxena, Emile Fiesler, N. Collings and A. R. Pourzand, in: Optics and Information, Cercle SFO/SEE d'Opto-informatique, Mulhouse, France, European Optical Society (EOS), 1995

Adaptive Multilayer Optical Neural Network with Optical Thresholding, Indu Saxena and Emile Fiesler, in: Optical Engineering, 34(08), 1995

Adaptive Multilayer Optical Neural Network Design, Indu Saxena and Emile Fiesler, Idiap-RR-04-1994

Recognition and Understanding of Meetings The AMI and AMIDA Projects, Steve Renals, Thomas Hain and Hervé Bourlard, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'07, 2007

A Thousand Words in a Scene, Pedro Quelhas, Jean-Marc Odobez, Daniel Gatica-Perez and Tinne Tuytelaars, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007

Confidence-based Cue Integration for Visual Place Recognition, Andrzej Pronobis and Barbara Caputo, in: IEEE International Conference on Intelligent RObot Systems (IROS), 2007

Boolean Logic Inspired High Order Perceptron Construction, Andrea De Pol, Georg Thimm and Emile Fiesler, in: SIPAR Workshop'95 Parallel and Distributed Systems, SIPAR SI Group for Parallel Systems, Biel School of Engineering, Computer Science Department, 1995

Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, Joel Praveen Pinto and Hynek Hermansky, Idiap-RR-20-2008

Reverse Correlation for analyzing MLP Posterior Features in ASR, Joel Praveen Pinto, G. S. V. S. Sivaram and Hynek Hermansky, Idiap-RR-13-2008

Comparing Different Word Lattice Rescoring Approaches Towards Keyword Spotting, Joel Praveen Pinto, Hervé Bourlard, Zacharie De Greve and Hynek Hermansky, Idiap-RR-32-2007

Significance of Contextual Information in Phoneme Recognition, Joel Praveen Pinto, S. R. Mahadeva Prasanna, B. Yegnanarayana and Hynek Hermansky, Idiap-RR-28-2007

Exploiting Contextual Information for Improved Phoneme Recognition, Joel Praveen Pinto, Hynek Hermansky, B. Yegnanarayana and Mathew Magimai-Doss, in: "IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)", 2008

Silence Models in Weighted Finite-State Transducers, Philip N. Garner, Idiap-RR-19-2008

A Weighted Finite State Transducer tutorial, Philip N. Garner, Idiap-Com-03-2008

Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, Xavier Perrin, Ricardo Chavarriaga, Roland Siegwart and José del R. Millán, in: 3rd European Conference on Mobile Robots (ECMR 2007), 2007

A supervised learning approach based on STDP and polychronization in spiking neuron networks, Hélène Paugam-Moisy, R. Martinez and Samy Bengio, in: European Symposium on Artificial Neural Networks, ESANN, 2007

A Data-driven Approach to Speech/Non-speech Detection, Sree Hari Krishnan Parthasarathi and Hynek Hermansky, Idiap-RR-23-2008

Exploiting contextual information for speech/non-speech detection, Sree Hari Krishnan Parthasarathi and Hynek Hermansky, Idiap-RR-22-2008

Exploiting temporal context for speech/non-speech detection, Sree Hari Krishnan Parthasarathi, Petr Motlicek and Hynek Hermansky, Idiap-RR-21-2008

A Generative Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, in: NIPS Workshop on Brain, Music and Cognition, 2007

A Distance Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, in: 25th International Conference on Machine Learning (ICML), 2008

On-line Independent Support Vector Machines for Cognitive Systems, Francesco Orabona, Claudio Castellini, Barbara Caputo, Jie Luo and Giulio Sandini, Idiap-RR-63-2007

The Projectron: a Bounded Kernel-Based Perceptron, Francesco Orabona, Joseph Keshet and Barbara Caputo, in: Int. Conf. on Machine Learning, 2008

Indoor Place Recognition using Online Independent Support Vector Machines, Francesco Orabona, Claudio Castellini, Barbara Caputo, Jie Luo and Giulio Sandini, in: 18th British Machine Vision Conference (BMVC07), 2007

A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, Jean-Marc Odobez and Silèye O. Ba, in: International Conference on Multi-Media & Expo (ICME07), 2007

Analyzing Flickr Groups, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-03-2008

Detecting queues at vending machines: a statistical layered approach, Xavier Naturel and Jean-Marc Odobez, Idiap-RR-04-2008

Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes, Petr Motlicek, Hynek Hermansky, Sriram Ganapathy and Harinath Garudadri, in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2007

LP-TRAPs in all senses, Petr Motlicek, Idiap-RR-66-2007

Non-uniform QMF Decomposition for Wide-band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, Idiap-RR-43-2007

Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding, Petr Motlicek, Hynek Hermansky, Sriram Ganapathy and Harinath Garudadri, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007

Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, Petr Motlicek, Vijay Ullal and Hynek Hermansky, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, Hari Krishna Maganti, Petr Motlicek and Daniel Gatica-Perez, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007

Neural Networks with Adaptive Learning Rate and Momentum Terms, Miguel Moreira and Emile Fiesler, Idiap-RR-04-1995

Modeling semantic aspects for cross-media image indexing, Florent Monay and Daniel Gatica-Perez, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007

The Effects of Optical Thresholding in Backpropagation Neural Networks, Perry Moerland, Emile Fiesler and Indu Saxena, in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'95 and NeuroNimes'95), ENNS, Paris, France, EC2 & Cie, 1995

Results on the Steepness in Backpropagation Neural Networks, Perry Moerland, Georg Thimm and Emile Fiesler, in: Proceedings of the '94 SIPAR-Workshop on Parallel and Distributed Computing, SI Group for Parallel Systems, 1994

Non-Invasive Brain-Machine Interaction, José del R. Millán, Pierre W. Ferrez, Ferran Galán, Eileen Lew and Ricardo Chavarriaga, in: International Journal of Pattern Recognition and Artificial Intelligence, 2008

Brain-Controlled Robots, José del R. Millán, in: IEEE Intelligent Systems, 2008

Brain-Computer Interfaces for HCI and Games, A. Nijholt, D. Tan, B. Allison, José del R. Millán, M. Moore and B. Graimann, in: Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, 2008

High-Resolution EEG Techniques for Brain-Computer Interface Applications, F. Cincotti, D. Mattia, F. Aloise, S. Bufalari, L. Astolfi, F. De Vico Fallani, A. Tocci, L. Bianchi, M. G. Marciani, S. Gao, José del R. Millán and F. Babiloni, in: Journal of Neuroscience Methods, 2007

An Asynchronous and Non-Invasive Brain-Actuated Wheelchair, Ferran Galán, Marnix Nuttin, Eileen Lew, Pierre W. Ferrez, G. Vanacker, Johan Philips, H. Van Brussel and José del R. Millán, in: Proceedings of the 13th International Symposium on Robotics Research, 2007

Augmenting Astronaut's Capabilities through Brain-Machine Interfaces, M. Broschart, Christina de Negueruela, José del R. Millán and C. Menon, in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007

Adaptive Shared Control of a Brain-Actuated Simulated Wheelchair, Johan Philips, José del R. Millán, G. Vanacker, Eileen Lew, Ferran Galán, Pierre W. Ferrez, H. Van Brussel and Marnix Nuttin, in: Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, 2007

Brain-Machine Interfaces through Control of Electroencephalographic Signals and Vibrotactile Feedback, F. Aloise, N. Caporusso, D. Mattia, F. Babiloni, L. Kauhanen, José del R. Millán, Marnix Nuttin, M. G. Marciani and F. Cincotti, in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007

Vibrotactile Feedback in the Context of Mu-Rhythm based BCI, F. Cincotti, L. Kauhanen, F. Aloise, T. Palomäki, N. Caporusso, P. Jylänki, D. Mattia, F. Babiloni, G. Vanacker, Marnix Nuttin, M. G. Marciani and José del R. Millán, in: Proceedings of the 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2007

Context-based Filtering for Assisted Brain-Actuated Wheelchair Driving, G. Vanacker, José del R. Millán, Eileen Lew, Pierre W. Ferrez, Ferran Galán, Johan Philips, H. Van Brussel and Marnix Nuttin, in: Computational Intelligence and Neuroscience, 2007, 2007

Vibrotactile Feedback for Brain-Computer Interface Operation, F. Cincotti, L. Kauhanen, F. Aloise, T. Palomäki, C. Caporusso, P. Jylänki, D. Mattia, F. Babiloni, G. Vanacker, Marnix Nuttin, M. G. Marciani and José del R. Millán, in: Computational Intelligence and Neuroscience, 2007, 2007

Non-Invasive Brain-Actuated Interaction, José del R. Millán, Pierre W. Ferrez, Ferran Galán, Eileen Lew and Ricardo Chavarriaga, in: Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence, 2007

Prospects on Brain-Machine Interfaces for Space System Control, C. Menon, Christina de Negueruela, José del R. Millán, O. Tonet, F. Carpi, M. Broschart, Pierre W. Ferrez, Anna Buttfield, P. Dario, L. Citi, C. Laschi, M. Tombini, F. Sepulveda, R. Poli, R. Palaniappan, F. Tecchio, P. M. Rossini and D. de Rossi, in: Proceedings of the 57th International Astronautical Conference, 2006

Haptic Feedback Compared with Visual Feedback for BCI, L. Kauhanen, T. Palomäki, P. Jylänki, F. Aloise, Marnix Nuttin and José del R. Millán, in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006

Inference in Switching Linear Dynamical Systems Applied to Noise Robust Speech Recognition of Isolated Digits, Bertrand Mesot, Idiap-RR-35-2008

A Bayesian Switching Linear Dynamical System for Scale-Invariant robust speech extraction, Bertrand Mesot and David Barber, Idiap-RR-52-2007

Google Portrait, Sébastien Marcel, Philip Abbet and Maël Guillemot, Idiap-Com-07-2007

On the Recent Use of Local Binary Patterns for Face Authentication, Sébastien Marcel, Yann Rodriguez and Guillaume Heusch, in: International Journal on Image and Video Processing Special Issue on Facial Image Processing, 2007

Traitement préliminaire de l'image d'un texte manuscrit en vue de sa reconnaissance: une méthode de sur-segmentation, Gilbert Maître, Stéphane Brunet and Gianni Pante, in: 4eme Colloque National sur l'A?crit et le Document (CNED'96), 1996

Experiments with robust similarity measures for OCR, Gilbert Maître, Idiap-RR-03-1995

Object Category Detection using Audio-visual Cues, Jie Luo, Barbara Caputo, Alon Zweig, Joerg-Henrik Back and Joern Anemueller, Idiap-RR-58-2007

Incremental Learning for Place Recognition in Dynamic Environments, Jie Luo, Andrzej Pronobis, Barbara Caputo and Patric Jensfelt, in: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS07), 2007

Object Category Detection using Audio-visual Cues, Jie Luo, Barbara Caputo, Alon Zweig, Joerg-Henrik Back and Joern Anemueller, in: International Conference on Computer Vision Systems (ICVS08), 2008

SVM-based Transfer of Visual Knowledge Across Robotic Platforms, Jie Luo, Andrzej Pronobis and Barbara Caputo, in: International Conference on Computer Vision Systems (ICVS07), 2007

Visual Speech Recognition using Active Shape Models and Hidden Markov Models, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96), 1996

The Anterior Cingulate Cortex, Perruchoud Loise, Idiap-Com-02-2008

A Neural Network based Regression Approach for Recognizing Simultaneous Speech, Weifeng Li, Kenichi Kumatani, John Dines, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-10-2008

Neural Network based Regression for Robust Overlapping Speech Recognition using Microphone Arrays, Weifeng Li, John Dines, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-09-2008

Effective post-processing for single-channel frequency-domain speech enhancement, Weifeng Li, Idiap-RR-71-2007

Robust overlapping speech recognition based on neural networks, Weifeng Li, John Dines and Mathew Magimai-Doss, Idiap-RR-55-2007

MLP-based Log Spectral Energy Mapping for Robust Overlapping Speech Recognition, Weifeng Li, Mathew Magimai-Doss, John Dines and Hervé Bourlard, Idiap-RR-54-2007

Non-linear Spectral Contrast Stretching for In-car Speech Recognition, Weifeng Li and Hervé Bourlard, in: Interspeech-Eurospeech # to appear in html, 2007

Non-Invasive Brain Computer Interface for Mental Control of a Simulated Wheelchair, Eileen Lew, Marnix Nuttin, Pierre W. Ferrez, A. Degeest, Anna Buttfield, G. Vanacker and José del R. Millán, in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006

Dynamical Dirichlet Mixture Model, Le Chen, David Barber and Jean-Marc Odobez, Idiap-RR-02-2007

Kernel Methods for Melanoma Recognition, Elisabetta La Torre, Tatiana Tommasi and Barbara Caputo, in: Medical Informatics in Europe (MIE), 2006

Local velocity-adapted motion events for spatio-temporal recognition, Ivan Laptev, Barbara Caputo and Tony Lindberg, in: Computer Vision and Image Undertanding, 108(3), 2007

Adaptive Beamforming with a Maximum Negentropy Criterion, Kenichi Kumatani, John McDonough, Barbara Rauch, Philip N. Garner, Weifeng Li and John Dines, Idiap-RR-29-2008

Maximum Negentropy Beamforming, Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-07-2008

Adaptive Beamforming with a Maximum Negentropy Criterion, Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-06-2008

Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-02-2008

Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-77-2007

Adaptive Beamforming with a Minimum Mutual Information Criterion, Kenichi Kumatani, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, John McDonough and Matthias Wölfel, Idiap-RR-74-2007

Minimum Mutual Information Beamforming for Simultaneous Active Speakers, Kenichi Kumatani, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, John McDonough and Matthias Wölfel, Idiap-RR-73-2007

Unsupervised Learning for Information Distillation, Kamand Kamangar, Idiap-RR-47-2007

Discriminatove Keyword Spotting, Joseph Keshet, David Grangier and Samy Bengio, Idiap-RR-31-2008

Theoretical Foundations for Large-Margin Kernel-Based Continuous Speech Recognition, Joseph Keshet, Idiap-RR-44-2007

Human-Centered Computing: Toward a Human Revolution, Alejandro Jaimes, Daniel Gatica-Perez, Nicu Sebe and Thomas S. Huang, Idiap-RR-57-2007

Human-centered Computing: Toward a Human Revolution, Alejandro Jaimes, Daniel Gatica-Perez, Nicu Sebe and Thomas S. Huang, in: IEEE Computer, 40(5), 2007

Automatic Word Recognition in Cars, Gérard Chollet and Chafic Mokbel, in: IEEE Speech and Audio Processing, 1995

Définition et évaluation d'un protocole de négociation dans un système multi-agents de reconnaissance de la parole, Murielle Vial, Idiap-RR-02-1995

Lexical filtrering by means of prosodic information, Frédéric Béchet, Philippe Langlais and Henri Méloni, in: International Congress of Phonetic Sciences, 1995

The use of prosodic agents in a cooperative automatic speech recognition system, Philippe Langlais and Jean-Luc Cochard, in: International Congress of Phonetic Sciences, 1995

A study of Intra- and Inter-Speaker Variability in the Voices of Twins for Speaker Verification, Gérard Chollet and M. Homayounpour, in: International Congress of Phonetic Sciences, 1995

Neural nets approaches to Speaker Verification: comparison with Second Order Statistical Measure, Gérard Chollet and M. Homayounpour, in: ICASSP, 1995

Environnement multi-agents de reconnaissance automatique de la parole en continu, Jean-Luc Cochard and Philippe Froidevaux, in: Actes des 3emes Journees Francophones sur l'Intelligence Artificielle Distribuee et les Systemes Multi-agents, 1995

ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, Hayley Hung, Daniel Gatica-Perez, Yan Huang and Gerald Friedland, Idiap-RR-60-2007

Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, Hayley Hung, Dinesh Babu Jayagopi, Chuohao Yeo, Gerald Friedland, Silèye O. Ba, Jean-Marc Odobez, Kannan Ramchandran, Nikki Mirghafori and Daniel Gatica-Perez, in: "", 2007

A Novel Statistical Generative Model Dedicated To Face Recognition, Guillaume Heusch and Sébastien Marcel, Idiap-RR-39-2007

Face Authentication with Salient Local Features and Static Bayesian Network, Guillaume Heusch and Sébastien Marcel, in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007

VoicePhone: An Interactive Vocal Server for Telephone Numbers, Hans Jongebloed, Idiap-Com-04-1996

Swiss-French Polyphone: a Telephone Speech Database to develop Interactive Voice Servers, Gérard Chollet, Jean-Luc Cochard, Philippe Langlais and R. van Kommer, in: Linguistic Databases, 1995

A Discriminative Kernel-based Model to Rank Images from Text Queries, David Grangier and Samy Bengio, in: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), X, 2008

Machine Learning for Information Retrieval, David Grangier, Idiap-RR-34-2008

Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, David Grangier and Samy Bengio, in: International Conference on Speech Communication and Technology (INTERSPEECH), 2007

A Discriminative Approach for the Retrieval of Images from Text Queries, David Grangier, Florent Monay and Samy Bengio, in: European Conference on Machine Learning (ECML), 2006

Hierarchical Penalization, Marie Szafranski, Yves Grandvalet and Pierre Morizet-Mahoudeaux, in: Advances in Neural Information Processing Systems 21, 2007

The use of brain-computer interfacing for ambient intelligence, Gangadhar Garipelli, Ferran Galán, Ricardo Chavarriaga, Pierre W. Ferrez, Eileen Lew and José del R. Millán, in: In the book, Constructing Ambient Intelligence: AmI-07 Workshops Proceedings, Max M\:uhlh\:auser, Alois Ferscha, and Erwin Aitenbichler (Eds.,',','), LNCS, Springer Verlag, 2008., 2007

Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, Idiap-RR-16-2008

Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008

Feature Extraction for Multi-Class BCI using Canonical Variates Analysis, Ferran Galán, Pierre W. Ferrez, Francesc Oliva, Joan Guàrdia and José del R. Millán, in: Proceedings of the IEEE International Symposium on Intelligent Signal Processing, 2007

Visuo-Spatial Attention Frame Recognition for Brain-Computer Interfaces, Ferran Galán, J. Palix, Ricardo Chavarriaga, Pierre W. Ferrez, Eileen Lew, C. -A. Hauert and José del R. Millán, in: Proceedings of the 1st International Conference on Cognitive Neurodynamics, 2007

Stationary Features and Cat Detection, Francois Fleuret and Donald Geman, Idiap-RR-56-2007

Neural Network Classification and Formalization, Emile Fiesler, in: Computer Standards & Interfaces, 16(03), 1994

Neural Network Formalization, Emile Fiesler, Idiap-RR-01-1992

Error-Related EEG Potentials Generated during Simulated Brain-Computer Interaction, Pierre W. Ferrez and José del R. Millán, in: IEEE Trans. on Biomedical Engineering, 55(3), 2008

High Frequency Bands and Estimated Local Field Potentials to Improve Single-Trial Classification of Electroencephalographic Signals, Pierre W. Ferrez, Ferran Galán, Anna Buttfield, S. L. González Andino, R. Grave de Peralta and José del R. Millán, in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006

Reliability in a Multi-agent Spoken Language Recognition System, Jean-Luc Cochard and Olivier Oppizzi, in: 4th European Conference on Speech Communication and Technology, 1995

Microprosodic study of isolated French word corpora, Philippe Langlais, in: 4th European Conference on Speech Communication and Technology, 1995

Discrimination of the voices of twins and siblings for speaker verification, Gérard Chollet and M. Homayounpour, in: 4th European Conference on Speech Communication and Technology, 1995

Non-Ontogenic Sparse Neural Networks, D. Elizondo, Emile Fiesler and Jerzy Korczak, in: Proceedings of the International Conference on Neural Networks, IEEE, IEEE, 1995

Keyword Spotting on Word Lattices, De Greve Zacharie and Joel Praveen Pinto, Idiap-RR-22-2007

Ontogenic High Order Cauchy Machines, S. Cuche and Emile Fiesler, in: Proceedings of the SIPAR Workshop '95: Parallel and Distributed Systems, Biel School of Engineering, 1995

Validating Different Flexible Vocabulary Approaches on the Swiss French PolyPhone and PolyVar databases, Andrei Constantinescu, Olivier Bornet, Gilles Caloz and Gérard Chollet, in: Proceedings of ICSLP 96, 1996

Un système prédictif de la structuration syntaxico-rythmique d'un énoncé à l'aide d'informations prosodiques, Philippe Langlais, Henri Méloni and Jean-Luc Cochard, in: Proceedings of JEP'96: XXIemes Journees d'etude sur la Parole, 1996

Towards a Multi-agents Approach for Understanding Speech, Murielle Vial and Jean-Luc Cochard, Idiap-Com-05-1996

Un interface de recherche documentaire: I de r, version 2.0, Jean-Luc Cochard, Idiap-RR-04-1993

Un interface d'indexation documentaire: I d'i, version 2.0, Jean-Luc Cochard, Idiap-RR-03-1993

Un interface d'indexation documentaire: I d'i, version 1.4, Jean-Luc Cochard, Idiap-RR-01-1993

Une technique efficace de traitement en Prolog de la morphologie flexionnelle du français, Jean-Luc Cochard, Idiap-RR-04-1992

Un environnement d'analyse linguistique robuste: CPD, version 1.7, Jean-Luc Cochard, Idiap-RR-03-1992

To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, Ricardo Chavarriaga, Pierre W. Ferrez and José del R. Millán, in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007

A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, Xavier Perrin, Ricardo Chavarriaga, Céline Ray, Roland Siegwart and José del R. Millán, in: 3rd ACM/IEEE Conf on Human-Robot Interaction (HRI08), 2008

Online Classifier Adaptation in High Frequency EEG, Anna Buttfield, Pierre W. Ferrez and José del R. Millán, in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006

Apprentissage de prototypes de caractères à partir de l'image d'un texte manuscrit et avec l'aide d'un opérateur, Stéphane Brunet, Idiap-RR-01-1995

A system for the off-line recognition of handwritten text, Thomas M. Breuel, in: International Conference on Pattern Recognition (ICPR,',','), Jerusalem, 1994

Recognition of Handprinted Digits using Optimal Bounded Error Matching, Thomas M. Breuel, in: International Conference on Document Analysis and Retrieval (ICDAR,',','), Tsukuba Science City, Japan, 1993

Design and Implementation of a System for the Recognition of Handwritten Responses on US Census Forms, Thomas M. Breuel, in: IAPR Workshop on Document Analysis Systems, 1994

Higher-Order Statistics in Visual Object Recognition, Thomas M. Breuel, in: Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 1993

Handwriting Recognition, Thomas M. Breuel, in: Second Asian Conference on Computer Vision (ACCV'95,',','), Singapore, 1995

Finding Lines under Bounded Error, Thomas M. Breuel, Idiap-RR-11-1993

An RBF Network that Learns Some Aspects of Perceptual Organization, Thomas M. Breuel, Idiap-RR-10-1993

The 3D Indexing Problem, Thomas M. Breuel, Idiap-RR-08-1993

Geometric Matching in Computer Vision--Algorithms and Open Problems, Thomas M. Breuel, Idiap-RR-07-1993

Recognition of Handprinted Digits, Thomas M. Breuel, Idiap-RR-06-1993

Higher-Order Statistics in Visual Object Recognition, Thomas M. Breuel, Idiap-RR-02-1993

Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, Hervé Bourlard and Steve Renals, in: LangTech 2008, 2008

Characterizing the EEG Correlates of Exploratory Behavior, Nicolas Bourdaud, Ricardo Chavarriaga, Ferran Galán and José del R. Millán, in: IEEE Transactions on Neural Systems & Rehabilitation Engineering, 2008

Biometric Person Authentication IS A Multiple Classifier Problem, Samy Bengio and Johnny Mariéthoz, Idiap-RR-03-2007

Do Backpropagation trained neural networks have normal weight distributions?, I Bellido and Emile Fiesler, in: International Conference on Artificial neural Networks, 1993

Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, Silèye O. Ba and Jean-Marc Odobez, in: International Conference on Multi-media & Expo, 2008

Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008

Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, Silèye O. Ba and Jean-Marc Odobez, in: Classification of Events, Activities and Relationship Evaluation and Workshop, 2007

Detection and Recognition of Number Sequences in Spoken Utterances, Guillermo Aradilla and Jitendra Ajmera, in: 2nd Workshop on Speech in Mobile and Pervasive Environments (SiMPE), 2007

Posterior Features Applied to Speech Recognition Tasks with Limited Training Data, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-15-2008

Using KL-based Acoustic Models in a Large Vocabulary Recognition Task, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-14-2008

Posterior-Based Features and Distances in Template Matching for Speech Recognition, Guillermo Aradilla and Hervé Bourlard, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007

Swiss PolyPhone and PolyVar: Building Databases for Speech Recognition and Speaker Verification, Andrei Constantinescu and Gérard Chollet, in: Proceedings of The 3rd Slovenian-German and 2nd SDRV Workshop, Speech and Image Understanding, 1996

Machine Learning for Audio, Image and Video Analysis, Francesco Camastra and Alessandro Vinciarelli, Springer Verlag, 2008