Publication list - Idiap Publications

Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, in: International Joint Conference on Biometrics, 2011

An Audio Visual Corpus for Emergent Leader Analysis, Dairazalia Sanchez-Cortes, Oya Aran and Daniel Gatica-Perez, in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011

Robustness of Group Delay Representations for Noisy Speech Signals, Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan and Hema A Murthy, Idiap-RR-36-2011

Robustness of Group Delay Representations for Noisy Speech Signals, Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan and Hema A Murthy, in: IJST (Springer), 14(4), 2011

Privacy-Sensitive Audio Features for Conversational Speech Processing, Sree Hari Krishnan Parthasarathi, Ecole Polytechnique Fédérale de Lausanne, 2011

Joint Optimization of Hidden Conditional Random Fields and Non Linear Feature Extraction, Antoine Vinel, Trinh-Minh-Tri Do and Thierry Artieres, in: Proceedings of International Conference on Document Analysis and Recognition, 2011

Boosting Localized Features for Speaker and Speech Recognition, Anindya Roy, Ecole Polytechnique Fédérale de Lausanne (EPFL), 2011

Multi-camera Open Space Human Activity Discovery for Anomaly Detection, Remi Emonet, Jagannadan Varadarajan and Jean-Marc Odobez, in: 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Information Bottleneck Features for HMM/GMM Speaker Diarization of Meeting Recordings, Sree Harsha Yella and Fabio Valente, in: Interspeech, Florence, Italy, pages 953-956, 2011

Continuous Speech Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-35-2011

Multimodal Cue Detection Engine for Orchestrated Entertainment, Danil Korchagin, Stefan Duffner, Petr Motlicek and Carl Scheffler, Idiap-RR-34-2011

Improving Microphone Array Speech Recognition with Cochlear Implant-like Spectrally Reduced Speech, Cong-Thanh Do, Mohammad J. Taghizadeh and Philip N. Garner, Idiap-RR-40-2011

Competition on Counter Measures to 2-D Facial Spoofing Attacks, Murali Mohan Chakka, André Anjos, Sébastien Marcel, Roberto Tronci, Daniele Muntoni, Gianluca Fadda, Maurizio Pili, Nicola Sirena, Gabriele Murgia, Marco Ristori, Fabio Roli, Junjie Yan, Dong Yi, Zhen Lei, Zhiwei Zhang, Stan Z. Li, William Robson Schwartz, Anderson Rocha, Helio Pedrini, Javier Lorenzo-Navarro, Modesto Castrillón-Santana, Jukka Maatta, Abdenour Hadid and Matti Pietikainen, in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011

Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, David Imseng, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011

Comparing machines and humans on a visual categorization test, Francois Fleuret, Ting Li, Charles Dubout, Emma K. Wampler, Steven Yantis and Donald Geman, in: Proceedings of the National Academy of Sciences, 2011

Boosting with Maximum Adaptive Sampling, Charles Dubout and Francois Fleuret, in: Proceedings of the Neural Information Processing Systems Conference, 2011

Detection-Based Multi-Human Tracking Using a CRF Model, Alexandre Heili, Cheng Chen and Jean-Marc Odobez, in: The Eleventh IEEE International Workshop on Visual Surveillance, 2011

A Joint Estimation of Head and Body Orientation Cues in Surveillance Video, Cheng Chen, Alexandre Heili and Jean-Marc Odobez, in: IEEE International Workshop on Socially Intelligent Surveillance and Monitoring, 2011

Smartphone usage in the wild: a large-scale analysis of applications and context, Trinh-Minh-Tri Do, Jan Blom and Daniel Gatica-Perez, in: 13th International Conference on Multimodal Interaction, 2011

Building 'directional corpora' for unbiased contrastive analysis, Bruno Cartoni and Thomas Meyer, in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 29-30, 2011

Disambiguating discourse connectives using parallel corpora: senses vs. translations, Thomas Meyer, Charlotte Roze, Bruno Cartoni, Laurence Danlos, Sandrine Zufferey and Andrei Popescu-Belis, in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 104-105, 2011

A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation, Thomas Meyer, Andrei Popescu-Belis, Jeevanthi Liyanapathirana and Bruno Cartoni, in: Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?", Hamburg, Germany, pages 5, 2011

Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011

VlogSense: Conversational Behavior and Social Attention in YouTube, Joan-Isaac Biel and Daniel Gatica-Perez, in: Transactions on Multimedia Computing, Communications and Applications, 2011

HEAT: Iterative Relevance Feedback with One Million Images, Nicolae Suditu and Francois Fleuret, in: Proceedings of the IEEE International Conference on Computer Vision, pages 2118-2125, 2011

A Just-in-Time Document Retrieval System for Dialogues or Monologues, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, Portland, OR, pages 350-352, 2011

A Speech-based Just-in-Time Retrieval System using Semantic Search, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), Portland, OR, pages 80-86, 2011

Learning from Images with Captions Using the Maximum Margin Set Algorithm, Jie Luo, Francesco Orabona, Barbara Caputo and Vittorio Ferrari, Idiap-RR-30-2011

People-Centric Mobile Sensing with a Pragmatic Twist: from Behavioral Data Points to Active User Involvement, Jan Blom, Daniel Gatica-Perez and N. Kiukkonen, in: International Conference on Human-Computer Interaction with Mobile Devices and Services, 2011

A Large-Scale Database of Images and Captions for Automatic Face Naming, Mert Ozcan, Jie Luo, Vittorio Ferrari and Barbara Caputo, in: Proceedings of the 22nd British Machine Vision Conference, 2011

A Large-Scale Database of Images and Captions for Automatic Face Naming, Mert Ozcan, Jie Luo, Vittorio Ferrari and Barbara Caputo, Idiap-RR-26-2011

Multiclass Transfer Learning from Unconstrained Priors, Jie Luo, Tatiana Tommasi and Barbara Caputo, in: Proceedings of the 13th International Conference on Computer Vision, 2011

Multiclass Transfer Learning from Unconstrained Priors, Jie Luo, Tatiana Tommasi and Barbara Caputo, Idiap-RR-25-2011

Searching the Past: An Improved Shape Descriptor to Retrieve Maya Hieroglyphs, Edgar Roman-Rangel, Carlos Pallan, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proceedings of the ACM International Conference in Multimedia, Scottsdale, USA, ACM, 2011

New World, New Worlds: Visual Analysis of Pre-Columbian Pictorial Collections, Daniel Gatica-Perez, Edgar Roman-Rangel, Jean-Marc Odobez and Carlos Pallan, in: Proceedings of the International Workshop on Multimedia for Cultural Heritage, Modena, Italy, Springer CCIS series book, 2011

Joint Adaptive Colour Modelling and Skin, Hair and Clothing Segmentation Using Coherent Probabilistic Index Maps, Carl Scheffler and Jean-Marc Odobez, in: British Machine Vision Conference, British Machine Vision Association, Dundee, UK, 2011

Speech Enhancement using Beta-order MMSE Spectral Amplitude Estimator with Laplacian Prior, Hamid Reza Abutalebi, Mehdi Rashidinejad, Hervé Bourlard and Ali Akbar Tadaion, Idiap-RR-24-2011

Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pages 5192-5195, 2011

Improving Articulatory Feature and Phoneme Recognition using Multitask Learning, Ramya Rasipuram and Mathew Magimai-Doss, in: Artificial Neural Networks and Machine Learning - ICANN 2011, pages 299-306, Springer Berlin / Heidelberg, 2011

Inferring truth from multiple annotators for social interaction analysis, Gokul Chittaranjan, Oya Aran and Daniel Gatica-Perez, in: Neural Information Processing Systems (NIPS) Workshop on Modeling Human Communication Dynamics (HCD), pages 4, 2011

Who's Who with Big-Five: Analyzing and Classifying Personality Traits with Smartphones, Gokul Chittaranjan, Jan Blom and Daniel Gatica-Perez, in: International Symposium on Wearable Computing, pages 8, 2011

Exploiting observers' judgements for nonverbal group interaction analysis, Gokul Chittaranjan, Oya Aran and Daniel Gatica-Perez, in: IEEE Conference on Automatic Face and Gesture Recognition, pages 6, IEEE, 2011

An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, Mohammad J. Taghizadeh, Philip N. Garner, Hervé Bourlard, Hamid Reza Abutalebi and Afsaneh Asaei, in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011

Intuitive Recipes for Uncertainty Decoding with SNR Features for Noise Robust ASR, Georgios Skoumas and Philip N. Garner, Idiap-RR-23-2011

Privacy-sensitive recognition of group conversational context with sociometers, Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland and Daniel Gatica-Perez, in: Springer Multimedia Systems Journal, 2011

Multi-party Speech Recovery Exploiting Structured Sparsity Models, Afsaneh Asaei, Mohammad J. Taghizadeh, Hervé Bourlard and Volkan Cevher, in: Proceedings of Interspeech, 2011

Model-based Compressive Sensing for Multi-party Distant Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Volkan Cevher, in: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech, Cong-Thanh Do, Dominique Pastor and André Goalic, in: Speech Communication, 2011

Grapheme-based Automatic Speech Recognition using KL-HMM, Mathew Magimai-Doss, Ramya Rasipuram, Guillermo Aradilla and Hervé Bourlard, in: Proceedings of Interspeech, 2011