All publications
2011
Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition., , Idiap-RR-15-2011 |
Learning a 3D Human Pose Distance Metric from Geometric Pose Descriptor, , in: IEEE Transactions on Visualization and Computer Graphics, 17(11):1676-1689, 2011 |
3D human pose recovery from image by efficient visual feature selection, , , and , in: Computer Vision and Image Understanding, 115(3), 2011 |
Parts-Based Face Verification using Local Frequency Bands, and , Idiap-RR-06-2011 |
Automatic Time Skew Detection and Correction, , in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011 |
Analyzing ancient Maya glyph collections with Contextual Shape Descriptors, , , and , in: International Journal of Computer Vision, 94(1):101-117, 2011 |
Integrating Articulatory Features using Kullback-Leibler Divergence based Acoustic Model for Phoneme Recognition, and , Idiap-RR-02-2011 |
An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, , and , in: IEEE Transactions on Audio Speech and Language Processing, 19(2), 2011 |
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , Idiap-RR-01-2011 |
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , in: International Conference on Signal Acquisition and Processing, Singapore, 2011 |
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , Idiap-RR-13-2011 |
Discovering Routines from Large-Scale Human Locations using Probabilistic Topic Models, and , in: ACM Transactions on Intelligent Systems and Technology, 2(1), 2011 |
Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition, , and , Idiap-RR-04-2011 |
On-line unsupervised adaptation for face verification using Gaussian Mixture Models with multiple user models, , and , Idiap-RR-07-2011 |
Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, , and , Idiap-RR-37-2011 |
Estimating Dominance in Multi-Party Meetings Using Speaker Diarization, , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(4):847-860, 2011 |
Just-in-Time Multimodal Association and Fusion from Home Entertainment, , , and , Idiap-RR-10-2011 |
Learning from Candidate Labeling Sets, and , Idiap-RR-27-2011 |
Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, , , and , Idiap-RR-12-2011 |
Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, and , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 33(1):101-116, 2011 |
Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(2):225-241, 2011 |
Face Detection using Ferns, and , Idiap-Com-01-2011 |
2010
The Robot Vision Track at ImageCLEF 2010, , , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
Extracting Motifs from Time Series Generated by Concurrent Activities., , and , in: NIPS workshop on Learning and Planning from Batch Time Series Data, 2010 |
Leveraging speaker diarization for meeting recognition from distant microphones, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390-4393, 2010 |
OM-2: An Online Multi-class Multi-kernel Learning Algorithm, , , , and , in: Proceedings of CVPR 2010, Online Learning for Computer Vision Workshop, 2010 |
Object Recognition using Visuo-Affordance Maps, , , and , in: International Conference on Intelligent Robots and Systems, Taipei, pages 1572-1578, IEEE, 2010 |
Towards a quantitative measure of rareness, and , in: DIRAC Workshop at the European Conference on Machine Learning, pages 129-136, Springer Berlin Heidelberg, 2010 |
Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer, , and , in: Proceedings of IEEE Computer Vision and Pattern Recognition Conference, San Francisco, CA, pages 3081-3088, IEEE, 2010 |
Estimating Cohesion in Small Groups Using Audio-Visual Nonverbal Behavior, and , in: IEEE Trans. on Multimedia, Special Issue on Multimodal Affective Interaction, 12(6):563-575, 2010 |
Delineating Trees in Noisy 2D Images and 3D Image Stacks, , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2799–2806, 2010 |
Joint Cascade Optimization Using a Product Of Boosted Classifiers, and , in: Proceedings of the Neural Information Processing Systems Conference, pages 1315–1323, 2010 |
Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010 |
Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives, , , , , and , in: Proceedings of the 33rd Annual ACM SIGIR Conference, Geneva, Switzerland, pages 703, 2010 |
Socially Intelligent Surveillance and Monitoring: Analysing Social Dimensions of Physical Space, , and , in: Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, San Francisco, pages 51-58, 2010 |
Mobile Social Signal Processing: vision and research issues, , and , in: Proceedings of the International Workshop on Mobile HCI, Lisbon, pages 513-516, 2010 |
www.sspnet.eu: A Web Portal for Social Signal Processing, and , in: IEEE Signal Processing Magazine, 27(4):142-144, 2010 |
Human Behavior Understanding, , Springer Verlag, 2010 |
View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, and , in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010 |
Identifying Emergent Leadership in Small Groups using Nonverbal Communicative Cues, , , and , in: Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, Beijing, ACM, New York, NY, USA, 2010 |
Speech Enhancement using an Improved MMSE Estimator with Laplacian Prior, , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
Single Channel Speech Separation with a Frame-based Pitch Range Estimation Method in Modulation Frequency, , , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain, , , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
Social Network Analysis for Automatic Role Recognition, , Ecole Polytechnique Fédérale de Lausanne, 2010 |
Discovering Human Places of Interest from Multimodal Mobile Phone Data, and , in: Proceedings of 9th International Conference on Mobile and Ubiquitous Multimedia (MUM), Limassol, Cyprus, 2010 |
Feature distribution modelling techniques for 3D face recognition, , and , in: Pattern Recognition Letters, 31:1324-1330, 2010 |
An Information Theoretic Approach to Speaker Diarization of Meeting Recordings, , Ecole Polytechnique Fédérale de Lausanne, 2010 |
Hierarchical and Parallel Processing of Auditory and Modulation Frequencies for Automatic Speech Recognition, , in: Speech Communication, 52(10):790-800, 2010 |
Multi-Stream Speech Recognition based on Dempster-Shafer Combination Rule, , in: Speech Communication, 52(3):213-222, 2010 |
Variational Bayesian Speaker Diarization of Meeting Recordings, , and , in: Proceedings of ICASSP, 2010 |