Publication list - Idiap Publications

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011

Morphodynamic profiling to explore spatio-temporal signaling networks regulating neurite outgrowth, L. Fusco, Kevin C. Smith, F. Benmansour, Riwal Lefort, Francois Fleuret, Pascal Fua and O. Pertz, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Automated Quantification of Morphodynamics for High-Throughput Live Cell Imaging Datasets, German Gonzalez, L. Fusco, Riwal Lefort, F. Benmansour, Pascal Fua and Kevin C. Smith, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Machine learning techniques to analyse complex, computer vision-extracted, dynamic cellular phenotypes, Riwal Lefort, L. Fusco, F. Benmansour, Kevin C. Smith, O. Pertz and Francois Fleuret, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Hierarchical Tandem Features for ASR in Mandarin, Joel Pinto, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, 2011

Look at who's talking, M. Cristani, A. Pesarin, Alessandro Vinciarelli, M. Crocco and V. Murino, in: Proceedings of International Conference on Ambient Intelligence, pages 68-76, 2011

Recent Developments in Social Signal Processing, Albert Ali Salah, Maja Pantic and Alessandro Vinciarelli, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 380-385, 2011

Understanding Social Signals in Multi-party Conversations: Automatic Recognition of Socio-Emotional Roles in the AMI Meeting Corpus, Fabio Valente, Alessandro Vinciarelli, Sree Harsha Yella and A. Sapru, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 374-379, 2011

Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, M. Cristani, G. Paggetti, Alessandro Vinciarelli, L. Bazzani, G. Menegaz and V. Murino, in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011

Introduction to Sequence Analysis for Human Behavior Understanding, Hugues Salamin and Alessandro Vinciarelli, in: "Computer Analysis of Human Behavior" by A.Salah and T.Gevers (eds.), pages 21-40, Springer Verlag, 2011

Social Signal Processing: The Research Agenda, Maja Pantic, R. Cowie, F. D'Errico, Dirk Heylen, M. Mehu, C. Pelachaud, I. Poggi, M. Schroeder and Alessandro Vinciarelli, in: "Visual Analysis of Humans" by T.B.Moeslund, A.Hilton, V.Krueger and L.Sigal (eds.), pages 511-538, Springer Verlag, 2011

Analysis of Verbal and Nonverbal Communication and Enactment: The Processing Issues, A. Esposito, Alessandro Vinciarelli, K. Vicsi, C. Pelachaud and A. Nijholt, Springer Verlag, 2011

Open-ended Learning of Visual and Multi-modal Patterns, Jie Luo, Ecole polytechnique fédérale de Lausanne, 2011

A Bimodal Sound Source Model for Vehicle Tracking in Traffic Monitoring, Patrick Marmaroli, Jean-Marc Odobez, Xavier Falourd and Hervé Lissek, in: European Signal Processing Conference, 2011

Torch7: A Matlab-like Environment for Machine Learning, Ronan Collobert, Koray Kavukcuoglu and Clément Farabet, in: BigLearn, NIPS Workshop, 2011

Learning Structured Embeddings of Knowledge Bases, Antoine Bordes, Jason Weston, Ronan Collobert and Yoshua Bengio, in: Conference on Artificial Intelligence, 2011

Deep Learning for Efficient Discriminative Parsing, Ronan Collobert, in: International Conference on Artificial Intelligence and Statistics, 2011

Natural Language Processing (Almost) from Scratch, Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu and Pavel Kuksa, in: Journal of Machine Learning Research, 12:2493-2537, 2011

Fast Human Detection from Joint Appearance and Foreground Feature Subset Covariances, Jian Yao and Jean-Marc Odobez, in: Computer Vision and Image Understanding, 115(10):1414-1426, 2011

Broadband Beampattern for Multi-Channel Speech Acquisition and Distant Speech Recognition, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, Idiap-RR-39-2011

Finding Audio-Visual Events in Informal Social Gatherings, Xavier Alameda-Pineda, Vasil Khalidov, Radu Horaud and Florence Forbes, in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011

Engagement-based Multi-party Dialog with a Humanoid Robot, David Klotz, Johannes Wienke, Julia Peltason, Britta Wrede, Sebastian Wrede, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011

Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-38-2011

Hand Gesture Analysis, Cem Keskin, Oya Aran and Lale Akarun, in: Computer Analysis of Human Behavior, pages 125-149, Springer London, 2011

Analysis of Group Conversations: Modeling Social Verticality, Oya Aran and Daniel Gatica-Perez, in: Computer Analysis of Human Behavior, pages 293-322, Springer London, 2011

Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis, John Dines, Hui Liang, Lakshmi Saheer, Matthew Gibson, William Byrne, Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Simon King, Mirjam Wester, Teemu Hirsimäki, Reima Karhila and Mikko Kurimo, in: Computer Speech and Language, 2011

Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, Remi Emonet, in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011

Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus, Fabio Valente and Alessandro Vinciarelli, in: Proceedings of Interspeech, 2011

Analysis and Comparison of Recent MLP Features for LVCSR Systems, Fabio Valente, Mathew Magimai-Doss and Wen Wang, in: Proceedings of Interspeech 2011, 2011

Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features, Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Suman Ravuri and Wen Wang, in: IEEE Transactions on Audio, Speech, and Language Processing, 19(8), 2011

Data-driven extraction of spectral-dynamics based posteriors, Fabio Valente, in: Handbook of Natural Language Processing and Machine Translation, Springer, 2011

Speaker Diarization of Meetings based on Speaker Role N-gram Models, Fabio Valente, Deepu Vijayasenan and Petr Motlicek, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011

Multistream Speaker Diarization through Information Bottleneck System Outputs Combination, Deepu Vijayasenan, Fabio Valente and Petr Motlicek, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011

Current trends in multilingual speech processing, Hervé Bourlard, John Dines, Mathew Magimai-Doss, Philip N. Garner, David Imseng, Petr Motlicek, Hui Liang, Lakshmi Saheer and Fabio Valente, in: Sadhana, 36(5):885-915, 2011

Transferring Activities: Updating Human Behavior Analysis, Fabian Nater, Tatiana Tommasi, Helmut Grabner, Luc Van Gool and Barbara Caputo, in: Visual Surveillance Workshop at ICCV, 2011

Using Modality Replacement to Facilitate Communication between Visually and Hearing-Impaired People, K. Moustakas, D. Tzovaras, L. Dybkjaer, N. Bernsen and Oya Aran, in: IEEE Multimedia, 18(2):26-37, 2011

Domain-specific language model adaptation: a case study, Gwénolé Lecorvé, Petr Motlicek and John Dines, Idiap-Com-01-2013

Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, in: International Joint Conference on Biometrics, 2011

An Audio Visual Corpus for Emergent Leader Analysis, Dairazalia Sanchez-Cortes, Oya Aran and Daniel Gatica-Perez, in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011

Robustness of Group Delay Representations for Noisy Speech Signals, Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan and Hema A Murthy, Idiap-RR-36-2011

Robustness of Group Delay Representations for Noisy Speech Signals, Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan and Hema A Murthy, in: IJST (Springer), 14(4), 2011

Privacy-Sensitive Audio Features for Conversational Speech Processing, Sree Hari Krishnan Parthasarathi, Ecole Polytechnique Fédérale de Lausanne, 2011

Joint Optimization of Hidden Conditional Random Fields and Non Linear Feature Extraction, Antoine Vinel, Trinh-Minh-Tri Do and Thierry Artieres, in: Proceedings of International Conference on Document Analysis and Recognition, 2011

Boosting Localized Features for Speaker and Speech Recognition, Anindya Roy, Ecole Polytechnique Fédérale de Lausanne (EPFL), 2011

Multi-camera Open Space Human Activity Discovery for Anomaly Detection, Remi Emonet, Jagannadan Varadarajan and Jean-Marc Odobez, in: 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings, Sree Harsha Yella and Fabio Valente, in: Interspeech, Florence, Italy, pages 953-956, 2011

Continuous Speech Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-35-2011

Multimodal Cue Detection Engine for Orchestrated Entertainment, Danil Korchagin, Stefan Duffner, Petr Motlicek and Carl Scheffler, Idiap-RR-34-2011

Improving Microphone Array Speech Recognition with Cochlear Implant-like Spectrally Reduced Speech, Cong-Thanh Do, Mohammad J. Taghizadeh and Philip N. Garner, Idiap-RR-40-2011

Competition on Counter Measures to 2-D Facial Spoofing Attacks, Murali Mohan Chakka, André Anjos, Sébastien Marcel, Roberto Tronci, Daniele Muntoni, Gianluca Fadda, Maurizio Pili, Nicola Sirena, Gabriele Murgia, Marco Ristori, Fabio Roli, Junjie Yan, Dong Yi, Zhen Lei, Zhiwei Zhang, Stan Z.Li, William Robson Schwartz, Anderson Rocha, Helio Pedrini, Javier Lorenzo-Navarro, Modesto Castrillón-Santana, Jukka Maatta, Abdenour Hadid and Matti Pietikainen, in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011