All publications
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |
2011
Multi-party Speech Recovery Exploiting Structured Sparsity Models, , , and , Idiap-RR-22-2011 |
|
Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2011 |
|
Contextual grouping: discovering real-life interaction types from longitudinal Bluetooth data, and , in: 12th International Conference on Mobile Data Management, 2011 |
|
Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions, and , in: EURASIP Journal on Advances in Signal Processing, 2011 |
[DOI] |
Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, , Idiap-RR-20-2011 |
|
Social Focus of Attention as a Time Function Derived from Multimodal Signals, and , Idiap-RR-09-2011 |
|
HEAT: Iterative Relevance Feedback with One Million Images, and , Idiap-RR-33-2011 |
|
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , Idiap-RR-31-2011 |
|
When Users Meet Technology: The Meeting Browser Development Helix, , and , Idiap-RR-05-2011 |
|
Multiple Object Tracking using K-Shortest Paths Optimization, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011 |
FlowBoost - Appearance Learning from Sparsely Annotated Video, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2011 |
Using object affordances to improve object recognition, , , , and , in: IEEE Transaction on Autonomous Mental Development, 2011 |
|
Towards semi-supervised learning of semantic spatial concepts for mobile robots, and , in: Journal of Physical Agents, 2011 |
|
Phoneme Recognition using Boosted Binary Features, , and , in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011 |
|
Posterior Features for Template-based ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, 2011 |
|
Towards semi-supervised learning of semantic spatial concepts, and , in: IEEE International Conference on Robotics and Automation, 2011 |
|
Towards semi-supervised learning of semantic spatial concepts, and , Idiap-RR-03-2011 |
|
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prag, CZ, pages 5012-5015, 2011 |
|
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , Idiap-RR-08-2011 |
|
Call me Guru: user categories and large-scale behavior in YouTube, and , in: Social Media Computing, Springer, 2011 |
|
Flickr Groups: Multimedia Communities for Multimedia Analysis, and , in: Internet Multimedia Search and Mining, Bentham Science Publishers, 2011 |
Computational modeling of face-to-face social interaction using nonverbal behavioral cues, , Ecole Polytechnique Fédérale de Lausanne, 2011 |
|
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 525-530, IEEE, 2011 |
|
Finding Information in Multimedia Records of Meetings, , and , Idiap-RR-32-2011 |
|
Automatic Identification of Discourse Markers in Multiparty Dialogues: An In-Depth Study of Like and Well, and , in: Computer Speech and Language, 25(3):499-518, 2011 |
[DOI] |
Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition., , Idiap-RR-15-2011 |
|
Learning a 3D Human Pose Distance Metric from Geometric Pose Descriptor, , in: IEEE Transactions on Visualization and Computer Graphics, 17(11):1676-1689, 2011 |
|
3D human pose recovery from image by efficient visual feature selection, , , and , in: Computer Vision and Image Understanding, 115(3), 2011 |
|
Parts-Based Face Verification using Local Frequency Bands, and , Idiap-RR-06-2011 |
|
Automatic Time Skew Detection and Correction, , in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Analyzing ancient Maya glyph collections with Contextual Shape Descriptors, , , and , in: International Journal of Computer Vision, 94(1):101-117, 2011 |
[DOI] |
Integrating Articulatory Features using Kullback-Leibler Divergence based Acoustic Model for Phoneme Recognition, and , Idiap-RR-02-2011 |
|
An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, , and , in: IEEE Transactions on Audio Speech and Language Processing, 19(2), 2011 |
[DOI] |
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , Idiap-RR-01-2011 |
|
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , in: International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , Idiap-RR-13-2011 |
|
Discovering Routines from Large-Scale Human Locations using Probabilistic Topic Models, and , in: ACM Transactions on Intelligent Systems and Technology, 2(1), 2011 |
|
Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition, , and , Idiap-RR-04-2011 |
|
On-line unsupervised adaptation for face verification using Gaussian Mixture Models with multiple user models, , and , Idiap-RR-07-2011 |
|
Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, , and , Idiap-RR-37-2011 |
|
Estimating Dominance in Multi-Party Meetings Using Speaker Diarization, , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(4):847-860, 2011 |
|
Just-in-Time Multimodal Association and Fusion from Home Entertainment, , , and , Idiap-RR-10-2011 |
|
Learning from Candidate Labeling Sets, and , Idiap-RR-27-2011 |
|
Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, , , and , Idiap-RR-12-2011 |
|
Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, and , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 33(1):101-116, 2011 |
|
Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, , , , and , in: IEEE Transcations on Audio, Speech, and Language Processing, 19(2):225-241, 2011 |
|
Face Detection using Ferns, and , Idiap-Com-01-2011 |
|
2010
The Robot Vision Track at ImageCLEF 2010, , , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
[URL] |
Extracting Motifs from Time Series Generated by Concurrent Activities., , and , in: NIPS workshop on Learning and Planning from Batch Time Series Data, 2010 |
|
Leveraging speaker diarization for meeting recognition from distant microphones, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390--4393, 2010 |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |