logo Idiap Research Institute        
All publications
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 |

2012
The ICSI RT-09 Speaker Diarization System, Gerald Friedland, Adam Janin, David Imseng, Xavier Anguera, Luke Gottlieb, Marijn Huijbregts, Mary Tai Knox and Oriol Vinyals, in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371--381, 2012
[DOI]
A tree-based distance between distributions: application to classification of neurons, Riwal Lefort and Francois Fleuret, in: ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing, 2012
Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, Afsaneh Asaei, Michael E. Davies, Hervé Bourlard and Volkan Cevher, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012
attachment
Using KL-divergence and multilingual information to improve ASR for under-resourced languages, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012
attachment
Evaluation of Meeting Support Technology, Simon Tucker and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 237-252, Cambridge University Press, 2012
User Requirements for Meeting Support Technology, Denis Lalanne and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 210-221, Cambridge University Press, 2012
Multimodal Signal Processing for Meetings: an Introduction, Andrei Popescu-Belis and Jean Carletta, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012
attachment
Speaker Diarization, Fabio Valente and Gerald Friedland, in: Multimodal Signal Processing: Human Interactions in Meetings, Cambridge University Press, 2012
[URL]
Multimodal Cue Detection Engine for Orchestrated Entertainment, Danil Korchagin, Stefan Duffner, Petr Motlicek and Carl Scheffler, in: Proceedings International Conference on MultiMedia Modeling, Klagenfurt, Austria, 2012
attachment
A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 7(1):241-254, 2012
attachment
A real-time deformable detector., Karim Ali, Francois Fleuret, David Hasler and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012
attachment
2011
Bayesian Approaches to Uncertainty in Speech Processing, Philip N. Garner, School of Computing Sciences, University of East Anglia, 2011
attachment
The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011
attachment
Look at who's talking, M. Cristani, A. Pesarin, Alessandro Vinciarelli, M. Crocco and V. Murino, in: Proceedings of International Conference on Ambient Intelligence, pages 68-76, 2011
Recent Developments in Social Signal Processing, Albert Ali Salah, Maja Pantic and Alessandro Vinciarelli, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 380-385, 2011
Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, M. Cristani, G. Paggetti, Alessandro Vinciarelli, L. Bazzani, G. Menegaz and V. Murino, in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011
Introduction to Sequence Analysis for Human Behavior Understanding, Hugues Salamin and Alessandro Vinciarelli, in: "Computer Analysis of Human Behavior" by A.Salah and T.Gevers (eds.), pages 21-40, Springer Verlag, 2011
Social Signal Processing: The Research Agenda, Maja Pantic, R. Cowie, F. D'Errico, Dirk Heylen, M. Mehu, C. Pelachaud, I. Poggi, M. Schroeder and Alessandro Vinciarelli, in: "Visual Analysis of Humans" by T.B.Moeslund, A.Hilton, V.Krueger and L.Sigal (eds.), pages 511-538, Springer Verlag, 2011
Open-ended Learning of Visual and Multi-modal Patterns, Jie Luo, Ecole polytechnique fédérale de Lausanne, 2011
attachment
Deep Learning for Efficient Discriminative Parsing, Ronan Collobert, in: International Conference on Artificial Intelligence and Statistics, 2011
attachment
Fast Human Detection from Joint Appearance and Foreground Feature Subset Covariances, Jian Yao and Jean-Marc Odobez, in: Computer Vision and Image Understanding, 115(10):1414-1426, 2011
attachment
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 |