Publication list - Idiap Publications

Update cookies preferences

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |

Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition., Philip N. Garner, Idiap-RR-15-2011

attachment

Learning a 3D Human Pose Distance Metric from Geometric Pose Descriptor, Cheng Chen, in: IEEE Transactions on Visualization and Computer Graphics, 17(11):1676-1689, 2011

attachment

3D human pose recovery from image by efficient visual feature selection, Cheng Chen, Yi Yang, Feiping Nie and Jean-Marc Odobez, in: Computer Vision and Image Understanding, 115(3), 2011

attachment

Parts-Based Face Verification using Local Frequency Bands, Chris McCool and Sébastien Marcel, Idiap-RR-06-2011

attachment

Automatic Time Skew Detection and Correction, Danil Korchagin, in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011

attachment

Analyzing ancient Maya glyph collections with Contextual Shape Descriptors, Edgar Roman-Rangel, Carlos Pallan, Jean-Marc Odobez and Daniel Gatica-Perez, in: International Journal of Computer Vision, 94(1):101-117, 2011

attachment

[DOI]

Integrating Articulatory Features using Kullback-Leibler Divergence based Acoustic Model for Phoneme Recognition, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-02-2011

attachment

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Transactions on Audio Speech and Language Processing, 19(2), 2011

[DOI]

Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, Stefan Duffner and Jean-Marc Odobez, Idiap-RR-01-2011

attachment

The TA2 Database - A Multi-Modal Database from Home Entertainment, Stefan Duffner, Petr Motlicek and Danil Korchagin, in: International Conference on Signal Acquisition and Processing, Singapore, 2011

attachment

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai-Doss and John Dines, Idiap-RR-13-2011

attachment

Discovering Routines from Large-Scale Human Locations using Probabilistic Topic Models, Katayoun Farrahi and Daniel Gatica-Perez, in: ACM Transactions on Intelligent Systems and Technology, 2(1), 2011

attachment

Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Volkan Cevher, Idiap-RR-04-2011

attachment

On-line unsupervised adaptation for face verification using Gaussian Mixture Models with multiple user models, Niklas Johansson, Chris McCool and Sébastien Marcel, Idiap-RR-07-2011

attachment

Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, Laurent El Shafey, Roy Wallace and Sébastien Marcel, Idiap-RR-37-2011

attachment

Estimating Dominance in Multi-Party Meetings Using Speaker Diarization, Hayley Hung, Yan Huang, Gerald Friedland and Daniel Gatica-Perez, in: IEEE Transactions on Audio, Speech, and Language Processing, 19(4):847-860, 2011

attachment

Just-in-Time Multimodal Association and Fusion from Home Entertainment, Danil Korchagin, Petr Motlicek, Stefan Duffner and Hervé Bourlard, Idiap-RR-10-2011

attachment

Learning from Candidate Labeling Sets, Jie Luo and Francesco Orabona, Idiap-RR-27-2011

attachment

Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-12-2011

attachment

Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 33(1):101-116, 2011

attachment

Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Mathew Magimai-Doss, Hynek Hermansky and Hervé Bourlard, in: IEEE Transcations on Audio, Speech, and Language Processing, 19(2):225-241, 2011

attachment

Face Detection using Ferns, Venkatesh Bala Subburaman and Sébastien Marcel, Idiap-Com-01-2011

attachment

The Robot Vision Track at ImageCLEF 2010, Andrzej Pronobis, Marco Fornoni, Henrik I. Christensen and Barbara Caputo, in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010

attachment

[URL]

Extracting Motifs from Time Series Generated by Concurrent Activities., Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS workshop on Learning and Planning from Batch Time Series Data, 2010

attachment

Leveraging speaker diarization for meeting recognition from distant microphones, Andreas Stolcke, Gerald Friedland and David Imseng, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390--4393, 2010

OM-2: An Online Multi-class Multi-kernel Learning Algorithm, Jie Luo, Francesco Orabona, Marco Fornoni, Barbara Caputo and Nicolo Cesa-Bianchi, in: In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop, 2010

attachment

Object Recognition using Visuo-Affordance Maps, Arjan Gijsberts, Tatiana Tommasi, Giorgio Metta and Barbara Caputo, in: International Conference on Intelligent Robots and Systems, Taipei, pages 1572-1578, IEEE, 2010

attachment

[DOI]

Towards a quantitative measure of rareness, Tatiana Tommasi and Barbara Caputo, in: DIRAC Workshop at the European Conference on Machine Learning, pages 129-136, Springer Berlin Heidelberg, 2010

attachment

[DOI]

Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, in: Proceedings of IEEE Computer Vision and Pattern Recognition Conference, San Francisco, CA, pages 3081-3088, IEEE, 2010

attachment

[DOI]

Estimating Cohesion in Small Groups Using Audio-Visual Nonverbal Behavior, Hayley Hung and Daniel Gatica-Perez, in: IEEE Trans. on Multimedia, Special Issue on Multimodal Affective Interaction, 12(6):563 - 575, 2010

attachment

Delineating Trees in Noisy 2D Images and 3D Image Stacks, German Gonzalez, Engin Turetken, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2799–2806, 2010

Joint Cascade Optimization Using a Product Of Boosted Classifiers, Leonidas Lefakis and Francois Fleuret, in: Proceedings of the Neural Information Processing Systems Conference, pages 1315–1323, 2010

Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010

attachment

Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives, Andrei Popescu-Belis, Jonathan Kilgour, Peter Poller, Alexandre Nanchen, Erik Boertjes and Joost de Wit, in: Proceedings of the 33rd Annual ACM SIGIR Conference, Geneva, Switzerland, pages 703, 2010

[DOI]

Socially Intelligent Surveillance and Monitoring: Analysing Social Dimensions of Physical Space, V. Murino, M. Cristani and Alessandro Vinciarelli, in: Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, San Francisco, pages 51-58, 2010

attachment

Mobile Social Signal Processing: vision and research issues, Alessandro Vinciarelli, Roderick Murray-Smith and Hervé Bourlard, in: Proceedings of the International Workshop on Mobile HCI, Lisbon, pages 513-516, 2010

attachment

www.sspnet.eu: A Web Portal for Social Signal Processing, Alessandro Vinciarelli and Maja Pantic, in: IEEE Signal Processing Magazine, 27(4):142-144, 2010

attachment

Human Behavior Understanding, Alessandro Vinciarelli, Springer Verlag, 2010

View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, Stéphanie Lefèvre and Jean-Marc Odobez, in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010

attachment

Identifying Emergent Leadership in Small Groups using Nonverbal Communicative Cues, Dairazalia Sanchez-Cortes, Oya Aran, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, Beijing, ACM New York, NY, USA ©2010, 2010

attachment

[DOI]

Speech Enhancement using an Improved MMSE Estimator with Laplacian Prior, Mehdi Rashidinejad, Hamid Reza Abutalebi and Ali Akbar Tadaion, in: Proceedings of 5th International Symposium on Telecommunications, 2010

attachment

Single Channel Speech Separation with a Frame-based Pitch Range Estimation Method in Modulation Frequency, Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanianzadeh and Hamid Sheikhzadeh, in: Proceedings of 5th International Symposium on Telecommunications, 2010

attachment

Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain, Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanianzadeh and Hamid Sheikhzadeh, in: Proceedings of 5th International Symposium on Telecommunications, 2010

attachment

Social Network Analysis for Automatic Role Recognition, Sarah Favre, Ecole Polytechnique Fédérale de Lausanne, 2010

attachment

Discovering Human Places of Interest from Multimodal Mobile Phone Data, Raul. Montoliu and Daniel Gatica-Perez, in: Proceedings of 9th International Conference on on Mobile and Ubiquitous Multimedia (MUM,',','), Limassol, Cyprus, 2010

attachment

Feature distribution modelling techniques for 3D face recognition, Chris McCool, Jordi Sanchez-Riera and Sébastien Marcel, in: Pattern Recognition Letters, 31:1324-1330, 2010

attachment

An Information Theoretic Approach to Speaker Diarization of Meeting Recordings, Deepu Vijayasenan, Ecole polytechnique fédérale de Lausanne, 2010

attachment

Hierarchical and Parallel Processing of Auditory and Modulation Frequencies for Automatic Speech Recognition, Fabio Valente, in: Speech Communication, 52(10):790-800, 2010

[DOI]

Multi-Stream Speech Recognition based on Dempster-Shafer Combination Rule, Fabio Valente, in: Speech Communication, 52(3):213-222, 2010

[DOI]

VARIATIONAL BAYESIAN SPEAKER DIARIZATION OF MEETING RECORDINGS, Fabio Valente, Petr Motlicek and Deepu Vijayasenan, in: Proceedings of ICASSP, 2010

attachment

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |

processing time: 0.0004 seconds.