Publication list - Idiap Publications

Update cookies preferences

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 87 |

Understanding Social Signals in Multi-party Conversations: Automatic Recognition of Socio-Emotional Roles in the AMI Meeting Corpus, Fabio Valente, Alessandro Vinciarelli, Sree Harsha Yella and A. Sapru, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 374-379, 2011

Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, M. Cristani, G. Paggetti, Alessandro Vinciarelli, L. Bazzani, G. Menegaz and V. Murino, in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011

Conversation Analysis at Work: Detection of Conflict in Competitive Discussions through Automatic Turn-Organization Analysis, A. Pesarin, M. Cristani, V. Murino and Alessandro Vinciarelli, in: Cognitive Processing, 2012

Bridging the Gap Between Social Animal and Unsocial Machine: A Survey of Social Signal Processing, Alessandro Vinciarelli, Maja Pantic, Dirk Heylen, C. Pelachaud, I. Poggi, F. D'Errico and M. Schroeder, in: IEEE Transactions on Affective Computing, 2012

Automatic Role Recognition in Multiparty Conversations: an Approach Based on Turn Organization, Prosody and Conditional Random Fields, Hugues Salamin and Alessandro Vinciarelli, in: IEEE Transactions on Multimedia, 2012

Introduction to Sequence Analysis for Human Behavior Understanding, Hugues Salamin and Alessandro Vinciarelli, in: "Computer Analysis of Human Behavior" by A.Salah and T.Gevers (eds.), pages 21-40, Springer Verlag, 2011

Social Signal Processing: The Research Agenda, Maja Pantic, R. Cowie, F. D'Errico, Dirk Heylen, M. Mehu, C. Pelachaud, I. Poggi, M. Schroeder and Alessandro Vinciarelli, in: "Visual Analysis of Humans" by T.B.Moeslund, A.Hilton, V.Krueger and L.Sigal (eds.), pages 511-538, Springer Verlag, 2011

Analysis of Verbal and Nonverbal Communication and Enactment: The Processing Issues, A. Esposito, Alessandro Vinciarelli, K. Vicsi, C. Pelachaud and A. Nijholt, Springer Verlag, 2011

Open-ended Learning of Visual and Multi-modal Patterns, Jie Luo, Ecole polytechnique fédérale de Lausanne, 2011

attachment

A Bimodal Sound Source Model for Vehicle Tracking in Traffic Monitoring, Patrick Marmaroli, Jean-Marc Odobez, Xavier Falourd and Hervé Lissek, in: European Signal Processing Conference, 2011

attachment

Torch7: A Matlab-like Environment for Machine Learning, Ronan Collobert, Koray Kavukcuoglu and Clément Farabet, in: BigLearn, NIPS Workshop, 2011

attachment

Learning Structured Embeddings of Knowledge Bases, Antoine Bordes, Jason Weston, Ronan Collobert and Yoshua Bengio, in: Conference on Artificial Intelligence, 2011

attachment

Deep Learning for Efficient Discriminative Parsing, Ronan Collobert, in: International Conference on Artificial Intelligence and Statistics, 2011

attachment

Natural Language Processing (Almost) from Scratch, Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu and Pavel Kuksa, in: Journal of Machine Learning Research, 12:2493-2537, 2011

attachment

Fast Human Detection from Joint Appearance and Foreground Feature Subset Covariances, Jian Yao and Jean-Marc Odobez, in: Computer Vision and Image Understanding, 115(10):1414-1426, 2011

attachment

Evaluation of Meeting Support Technology, Simon Tucker and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 237-252, Cambridge University Press, 2012

User Requirements for Meeting Support Technology, Denis Lalanne and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 210-221, Cambridge University Press, 2012

Multimodal Signal Processing for Meetings: an Introduction, Andrei Popescu-Belis and Jean Carletta, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012

attachment

BROADBAND BEAMPATTERN FOR MULTI-CHANNEL SPEECH ACQUISITION AND DISTANT SPEECH RECOGNITION, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, Idiap-RR-39-2011

attachment

Finding Audio-Visual Events in Informal Social Gatherings, Xavier Alameda-Pineda, Vasil Khalidov, Radu Horaud and Florence Forbes, in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011

attachment

Engagement-based Multi-party Dialog with a Humanoid Robot, David Klotz, Johannes Wienke, Julia Peltason, Britta Wrede, Sebastian Wrede, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011

attachment

Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-38-2011

attachment

OM-2: An Online Multi-class Multi-kernel Learning Algorithm, Jie Luo, Francesco Orabona, Marco Fornoni, Barbara Caputo and Nicolo Cesa-Bianchi, in: In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop, 2010

attachment

Hand Gesture Analysis, Cem Keskin, Oya Aran and Lale Akarun, in: Computer Analysis of Human Behavior,, pages 125-149, Springer London, 2011

Analysis of Group Conversations: Modeling Social Verticality, Oya Aran and Daniel Gatica-Perez, in: Computer Analysis of Human Behavior, pages 293-322, Springer London, 2011

A Nonverbal Behavior Approach to Identify Emergent Leaders in Small Groups, Dairazalia Sanchez-Cortes, Oya Aran, Marianne Schmid Mast and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 14(3-2):816-832, 2012

attachment

[DOI]

Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis, John Dines, Hui Liang, Lakshmi Saheer, Matthew Gibson, William Byrne, Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Simon King, Mirjam Wester, Teemu Hirsimäki, Reima Karhila and Mikko Kurimo, in: Computer Speech and Language, 2011

attachment

[DOI]
[URL]

Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, Remi Emonet, in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011

Speaker Diarization, Fabio Valente and Gerald Friedland, in: Multimodal Signal Processing: Human Interactions in Meetings, Cambridge University Press, 2012

[URL]

Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus, Fabio Valente and Alessandro Vinciarelli, in: Proceedings of Interspeech, 2011

attachment

Analysis and Comparison of Recent MLP Features for LVCSR Systems, Fabio Valente, Mathew Magimai-Doss and Wen Wang, in: Proceedings of Interspeech 2011, 2011

attachment

Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Speech Communication, 54(1), 2012

[DOI]

Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features, Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Suman Ravuri and Wen Wang, in: IEEE Transactions on Audio, Speech, and Language Processing, 19(8), 2011

[DOI]

Data-driven extraction of spectral-dynamics based posteriors, Fabio Valente, in: Handbook of Natural Language Processing and Machine Translation Handbook of Natural Language Processing and Machine Translation, Springer, 2011

[URL]

Speaker Diarization of Meetings based on Speaker Role N-gram Models, Fabio Valente, Deepu Vijayasenan and Petr Motlicek, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011

attachment

MULTISTREAM SPEAKER DIARIZATION THROUGH INFORMATION BOTTLENECK SYSTEM OUTPUTS COMBINATION, Deepu Vijayasenan, Fabio Valente and Petr Motlicek, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011

attachment

Current trends in multilingual speech processing, Hervé Bourlard, John Dines, Mathew Magimai-Doss, Philip N. Garner, David Imseng, Petr Motlicek, Hui Liang, Lakshmi Saheer and Fabio Valente, in: Sadhana, 36(5):885–915, 2011

attachment

[DOI]
[URL]

Transcribing meetings with the AMIDA systems, Thomas Hain, Lukas Burget, John Dines, Philip N. Garner, Frantisek Grezl, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiat, Mike Lincoln and Vincent Wan, in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):486--498, 2012

attachment

[DOI]
[URL]

Overview of the CLEF 2009 medical image annotation track, Tatiana Tommasi, Barbara Caputo, Petra Welter, Mark O. Güld and Thomas M Deserno, in: Workshop of the Cross-Language Evaluation Forum, Corfu, Greece, pages 85-93, Springer Berlin Heidelberg, 2009

attachment

[DOI]

Object Recognition using Visuo-Affordance Maps, Arjan Gijsberts, Tatiana Tommasi, Giorgio Metta and Barbara Caputo, in: International Conference on Intelligent Robots and Systems, Taipei, pages 1572-1578, IEEE, 2010

attachment

[DOI]

Towards a quantitative measure of rareness, Tatiana Tommasi and Barbara Caputo, in: DIRAC Workshop at the European Conference on Machine Learning, pages 129-136, Springer Berlin Heidelberg, 2010

attachment

[DOI]

Transferring Activities: Updating Human Behavior Analysis, Fabian Nater, Tatiana Tommasi, Helmut Grabner, Luc Van Gool and Barbara Caputo, in: Visual Surveillance Workshop at ICCV, 2011

attachment

Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Proceedings of IEEE Computer Vision and Pattern Recognition Conference, San Francisco, CA, pages 3081-3088, IEEE, 2010

attachment

[DOI]

Using Modality Replacement to Facilitate Communication between Visually and Hearing-Impaired People, K. Moustakas, D. Tzovaras, L. Dybkjaer, N. Bernsen and Oya Aran, in: IEEE Multimedia, 18(2):26-37, 2011

[DOI]

Domain-specific language model adaptation: a case study, Gwénolé Lecorvé, Petr Motlicek and John Dines, Idiap-Com-01-2013

attachment

VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, Lakshmi Saheer, Hui Liang, John Dines and Philip N. Garner, Idiap-RR-12-2012

attachment

Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, Idiap-RR-11-2012

attachment

Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, in: International Joint Conference on Biometrics, 2011

An Audio Visual Corpus for Emergent Leader Analysis, Dairazalia Sanchez-Cortes, Oya Aran and Daniel Gatica-Perez, in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011

Robustness of Group Delay Representations for Noisy Speech Signals, Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan and Hema A Murthy, Idiap-RR-36-2011

attachment

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 87 |

processing time: 0.0005 seconds.