All publications
2015
3D Gaze Estimation from Remote RGB-D Sensors, , École Polytechnique Fédérale de Lausanne, 2015 |
[DOI] |
Head Nod Detection from a Full 3D Model, , and , in: Proceedings of the ICCV 2015, pages 528-536, 2015 |
|
Posterior-Based Multi-Stream Formulation To Combine Multiple Grapheme-to-Phoneme Conversion Techniques, and , Idiap-RR-33-2015 |
|
HMM-based Non-native Accent Assessment using Posterior Features, , and , Idiap-RR-32-2015 |
|
EUMSSI team at the MediaEval Person Discovery Challenge, , , and , in: Working Notes Proceedings of the MediaEval 2015 Workshop, Wurzen, Germany, 2015 |
[URL] |
Towards utterance-based neural network adaptation in acoustic modeling, , , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015 |
|
Deciphering the Silent Participant. On the Use of Audio-Visual Cues for the Classification of Listener Categories in Group Discussions, , , and , in: Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, ACM, Seattle, Washington, USA, pages 107-114, ACM, 2015 |
[DOI] |
Biometrics systems under spoofing attack: an evaluation methodology and lessons learned, , , and , in: IEEE Signal Processing Magazine, 2015 |
|
Palm Vein Database and Experimental Framework for Reproducible Research, and , in: IEEE International Conference of the Biometrics Special Interest Group, pages 1-7, 2015 |
[DOI] [URL] |
Finger vein Liveness Detection Using Motion Magnification, , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-7, 2015 |
[DOI] |
I would hire you in a minute: Thin slices of nonverbal behavior in job interviews, and , in: Proceedings of the ACM International Conference on Multimodal Interaction (ICMI), pages 51-58, 2015 |
|
Personality Trait Classification via Co-Occurrent Multiparty Multimodal Event Discovery, , and , in: Proceedings of the ACM International Conference on Multimodal Interaction, Seattle, Washington, USA, pages 15-22, ACM, 2015 |
[DOI] |
Learning bimanual end-effector poses from demonstrations using task-parameterized dynamical systems, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 464-470, 2015 |
|
Learning Optimal Controllers in Human-robot Cooperative Transportation Tasks with Position and Force Constraints, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 1024-1030, 2015 |
|
Learning the Stiffness of a Continuous Soft Manipulator from Multiple Demonstrations, , , and , in: Intelligent Robotics and Applications, pages 185-195, Springer, 2015 |
[DOI] [URL] |
Overlapping Speech, Utterance Duration and Affective Content in HHI and HCI - an Comparison, , , , and , in: 6th IEEE Conference on Cognitive Infocommunications, Gyor, pages 83-88, 2015 |
[DOI] |
Enabling speech applications using Ad-Hoc Microphone Arrays, , École Polytechnique Fédérale de Lausanne, 2015 |
|
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology, , , , , and , in: Proceedings of the ACM International Conference on Multimedia, pages 159-168, 2015 |
|
Statistical Models in Automatic Speech Recognition, , University of Fribourg, Department of Mathematics, 2015 |
|
Exploring Dataset Similarities using PCA-based Feature Selection, , , and , in: Proceedings of ACII, Xi'an, pages 387-393, IEEE, 2015 |
[DOI] |
Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies (Extended Abstract), , , , , , and , in: Proceedings of ACII, Xi'an, pages 470-476, IEEE, 2015 |
[DOI] |
Annotators' agreement and spontaneous emotion classification performance, and , in: Proceedings of Interspeech, pages 1546-1550, 2015 |
|
Pronoun Translation and Prediction with or without Coreference Links, , and , in: Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), Lisbon, Portugal, pages 94–100, 2015 |
|
Spatial Sound Localization via Multipath Euclidean Distance Matrix Recovery, , , , and , in: IEEE Journal of Selected Topics in Signal Processing, 9(5):802-814, 2015 |
|
Automatic social role recognition and its application in structuring multiparty interactions, , EPFL, 2015 |
|
HAVC-II - Idiap Private Cloud (Technical Inside-Out), , Idiap-Com-01-2015 |
|
On the Vulnerability of Speaker Verification to Realistic Voice Spoofing, , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-8, IEEE, 2015 |
[DOI] [URL] |
Exploiting foreign resources for DNN-based ASR, , , , and , in: EURASIP Journal on Audio, Speech, and Music Processing (2015:17), 2015 |
[DOI] |
Syntactic Parsing of Morphologically Rich Languages Using Deep Neural Networks, and , Idiap-RR-25-2015 |
|
Joint RNN-Based Greedy Parsing and Word Composition, and , in: Proceedings of ICLR 2015, 2015 |
|
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , in: Proceedings of Interspeech, ISCA, Dresden, pages 11-15, ISCA, 2015 |
|
A Deeper Look at Dataset Bias, , , and , in: German Conference on Pattern Recognition, Aachen, Germany, pages 504–516, Springer International Publishing, 2015 |
[DOI] |
Periocular Biometrics in Mobile Environment, and , in: IEEE Seventh International Conference on Biometrics: Theory, Applications and Systems, Arlington, USA, pages 1-7, IEEE, 2015 |
[DOI] |
Simple Image Description Generator via a Linear Phrase-based Model, , and , Idiap-RR-22-2015 |
|
"The Sum of Its Parts": Joint Learning of Word and Phrase Representations with Autoencoders, and , Idiap-RR-21-2015 |
|
Rehabilitation of Count-based Models for Word Vector Representations, and , in: Computational Linguistics and Intelligent Text Processing, pages 417-429, Springer International Publishing, 2015 |
From Image-level to Pixel-level Labeling with Convolutional Networks, and , in: Computer Vision and Pattern Recognition (CVPR), Boston, MA, pages 1713-1721, IEEE, 2015 |
[DOI] [URL] |
Phrase-based Image Captioning, , and , in: International Conference on Machine Learning (ICML), Lille, France, pages 2085–2094, JMLR, 2015 |
[URL] |
DexROV: Dexterous Undersea Inspection and Maintenance in Presence of Communication Latencies, , , , , , , , , , , , , , , and , in: IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV), pages 218-223, 2015 |
Dictionary Learning for Sparse Representation of Neural Network Exemplars in Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015, pages 1093, 2015 |
|
Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-Based Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015 |
|
DNN-based Speech Synthesis: Importance of input features and training data, , and , in: International Conference on Speech and Computer, SPECOM, pages 193-200, Springer Berlin Heidelberg, 2015 |
[DOI] |
Objective Intelligibility Assessment of Text-to-Speech Systems Through Utterance Verification, , , and , in: Proceedings of Interspeech, Dresden, Germany, pages 3501-3505, 2015 |
[URL] |
Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, , , and , in: Proceedings of Interspeech 2015, pages 3105-3109, 2015 |
|
Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, , , and , Idiap-RR-20-2015 |
|
KL-HMM Based Speaker Diarization System for Meetings, and , in: Proceedings of ICASSP 2015, pages 4435-4439, 2015 |
|
Combining SGMM Speaker Vectors and KL-HMM Approach for Speaker Diarization, , and , in: Proceedings of ICASSP 2015, pages 4834-4837, 2015 |
|
Employment of Subspace Gaussian Mixture Models in Speaker Recognition, , , and , in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015 |
[URL] |
Exploiting foreign resources for DNN-based ASR, , , , and , Idiap-RR-27-2015 |
|
Sparse Modeling of Posterior Exemplars for Keyword Detection, , , and , in: Proceedings of Interspeech, pages 3690-3694, 2015 |
|