logo Idiap Research Institute        
All publications sorted by recency
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |

The ICSI RT-09 Speaker Diarization System, Gerald Friedland, Adam Janin, David Imseng, Xavier Anguera, Luke Gottlieb, Marijn Huijbregts, Mary Tai Knox and Oriol Vinyals, in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371--381, 2012
[DOI]
The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011
attachment
A tree-based distance between distributions: application to classification of neurons, Riwal Lefort and Francois Fleuret, in: ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing, 2012
Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, Afsaneh Asaei, Michael E. Davies, Hervé Bourlard and Volkan Cevher, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012
attachment
Using KL-divergence and multilingual information to improve ASR for under-resourced languages, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012
attachment
Look at who's talking, M. Cristani, A. Pesarin, Alessandro Vinciarelli, M. Crocco and V. Murino, in: Proceedings of International Conference on Ambient Intelligence, pages 68-76, 2011
Recent Developments in Social Signal Processing, Albert Ali Salah, Maja Pantic and Alessandro Vinciarelli, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 380-385, 2011
Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, M. Cristani, G. Paggetti, Alessandro Vinciarelli, L. Bazzani, G. Menegaz and V. Murino, in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011
Introduction to Sequence Analysis for Human Behavior Understanding, Hugues Salamin and Alessandro Vinciarelli, in: "Computer Analysis of Human Behavior" by A.Salah and T.Gevers (eds.), pages 21-40, Springer Verlag, 2011
Social Signal Processing: The Research Agenda, Maja Pantic, R. Cowie, F. D'Errico, Dirk Heylen, M. Mehu, C. Pelachaud, I. Poggi, M. Schroeder and Alessandro Vinciarelli, in: "Visual Analysis of Humans" by T.B.Moeslund, A.Hilton, V.Krueger and L.Sigal (eds.), pages 511-538, Springer Verlag, 2011
Open-ended Learning of Visual and Multi-modal Patterns, Jie Luo, Ecole polytechnique fédérale de Lausanne, 2011
attachment
Deep Learning for Efficient Discriminative Parsing, Ronan Collobert, in: International Conference on Artificial Intelligence and Statistics, 2011
attachment
Fast Human Detection from Joint Appearance and Foreground Feature Subset Covariances, Jian Yao and Jean-Marc Odobez, in: Computer Vision and Image Understanding, 115(10):1414-1426, 2011
attachment
Evaluation of Meeting Support Technology, Simon Tucker and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 237-252, Cambridge University Press, 2012
User Requirements for Meeting Support Technology, Denis Lalanne and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 210-221, Cambridge University Press, 2012
Multimodal Signal Processing for Meetings: an Introduction, Andrei Popescu-Belis and Jean Carletta, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012
attachment
Finding Audio-Visual Events in Informal Social Gatherings, Xavier Alameda-Pineda, Vasil Khalidov, Radu Horaud and Florence Forbes, in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011
attachment
Engagement-based Multi-party Dialog with a Humanoid Robot, David Klotz, Johannes Wienke, Julia Peltason, Britta Wrede, Sebastian Wrede, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011
attachment
OM-2: An Online Multi-class Multi-kernel Learning Algorithm, Jie Luo, Francesco Orabona, Marco Fornoni, Barbara Caputo and Nicolo Cesa-Bianchi, in: In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop, 2010
attachment
Hand Gesture Analysis, Cem Keskin, Oya Aran and Lale Akarun, in: Computer Analysis of Human Behavior,, pages 125-149, Springer London, 2011
Analysis of Group Conversations: Modeling Social Verticality, Oya Aran and Daniel Gatica-Perez, in: Computer Analysis of Human Behavior, pages 293-322, Springer London, 2011
Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, Remi Emonet, in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011
Speaker Diarization, Fabio Valente and Gerald Friedland, in: Multimodal Signal Processing: Human Interactions in Meetings, Cambridge University Press, 2012
[URL]
Data-driven extraction of spectral-dynamics based posteriors, Fabio Valente, in: Handbook of Natural Language Processing and Machine Translation Handbook of Natural Language Processing and Machine Translation, Springer, 2011
[URL]
Speaker Diarization of Meetings based on Speaker Role N-gram Models, Fabio Valente, Deepu Vijayasenan and Petr Motlicek, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011
attachment
MULTISTREAM SPEAKER DIARIZATION THROUGH INFORMATION BOTTLENECK SYSTEM OUTPUTS COMBINATION, Deepu Vijayasenan, Fabio Valente and Petr Motlicek, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011
attachment
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |