Publication list - Idiap Publications

Update cookies preferences

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 |

A Savitzky-Golay Filtering Perspective of Dynamic Feature Computation, S. R. Krishnan, Mathew Magimai-Doss and C. S. Seelamantula, in: IEEE Signal Processing Letters, 20(3):281 -- 284, 2013

[DOI]

A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013

attachment

Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, Vasil Khalidov, Florence Forbes and Radu Horaud, in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013

attachment

Proceedings of the ACL Workshop on Discourse in Machine Translation (DiscoMT 2013), Bonnie Webber, Andrei Popescu-Belis, Katja Markert and Jorg Tiedemann, Association for Computational Linguistics, 2013

[URL]

On Modeling Context-dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-43-2013

attachment

3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, Kenneth Alberto Funes Mora, in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013

[DOI]

Context Aware Addressee Estimation for Human Robot Interaction, Samira Sheikhi, Dinesh Babu Jayagopi, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013

Leveraging the robot dialog state for visual focus of attention recognition, Samira Sheikhi, Vasil Khalidov, David Klotz, Britta Wrede and Jean-Marc Odobez, in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013

Hire Me: Computational inference of hirability in employment interviews based on nonverbal behavior, Laurent Son Nguyen, Denise Frauendorfer, Marianne Schmid Mast and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 16(4):1018 - 1031, 2014

attachment

[DOI]

Multimodal Analysis of Body Communication Cues in Employment Interviews, Laurent Son Nguyen, Alvaro Marcos-Ramiro, Marta Marron-Romera and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction Proceedings, 2013

attachment

Evaluating Intra- and Crosslingual Adaptation for Non-native Speech Recognition in a Bilingual Environment, Gyorgy Szaszak and Philip N. Garner, in: Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications, IEEE, Budapest, Hungary, pages 357-361, 2013

attachment

Adaptive Sampling for Large Scale Boosting, Charles Dubout and Francois Fleuret, in: Journal of Machine Learning Research, 15:1431-1453, 2014

attachment

Is Deep Learning Really Necessary for Word Embeddings?, Rémi Lebret, Joël Legrand and Ronan Collobert, Idiap-RR-44-2013

attachment

Introduction to the Special Issue on Learning Semantics, Antoine Bordes, Léon Bottou, Ronan Collobert, Dan Roth, Jason Weston and Luke Zettlemoyer, in: Machine Learning, 2013

[DOI]

Recurrent Convolutional Neural Networks for Scene Labeling, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-41-2013

attachment

End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, Idiap-RR-40-2013

attachment

Re-Identification for Improved People Tracking, Francois Fleuret, Horesh Ben Shitrit and Pascal Fua, in: Person Re-Identification, pages 311-336, Springer, 2014

Using the Europarl corpus for cross-linguistic research, Bruno Cartoni, Sandrine Zufferey and Thomas Meyer, in: Belgian Journal of Linguistics(27):23 – 42, 2013

[URL]

Stable Myoelectric Control of a Hand Prosthesis using Non-Linear Incremental Learning, Arjan Gijsberts, Rashida Bohra, David Sierra González, Alexander Werner, Markus Nowak, Barbara Caputo, Maximo A. Roa and Claudio Castellini, in: Frontiers in Neurorobotics, 8, 2014

[DOI]

The Movement Error Rate for Evaluation of Machine Learning Methods for sEMG-based Hand Movement Classification, Arjan Gijsberts, Manfredo Atzori, Claudio Castellini, Henning Müller and Barbara Caputo, in: Transactions on Neural Systems and Rehabilitation Engineering:735 - 744, 2014

[DOI]

Characterization of a Benchmark Database for Myoelectric Movement Classification, Manfredo Atzori, Arjan Gijsberts, Ilja Kuzborskij, Simone Heynen, Anne-Gabrielle Mittaz Hager, Olivier Deriaz, Claudio Castellini, Henning Müller and Barbara Caputo, in: Transactions on Neural Systems and Rehabilitation Engineering, 23:73-83, 2014

[DOI]

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013

attachment

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, Idiap-RR-39-2013

attachment

ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013

attachment

[DOI]

ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, Idiap-RR-38-2013

attachment

FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, Petr Motlicek, Daniel Povey and Martin Karafiat, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013

attachment

[DOI]

FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, Petr Motlicek, Daniel Povey and Martin Karafiat, Idiap-RR-37-2013

attachment

Manifold Sparse Beamforming, Baran Gözcü, Afsaneh Asaei and Volkan Cevher, in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013

attachment

[DOI]

Convexity in source separation: Models, geometry, and algorithms, Michael McCoy, Volkan Cevher, Quoc Tran Dinh, Afsaneh Asaei and Luca Baldassarre, in: IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications, 2013

attachment

Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, Elie Khoury, Paul Gay and Jean-Marc Odobez, Idiap-RR-31-2013

attachment

An Open-source State-of-the-art Toolbox for Broadcast News Diarization, Mickael Rouvier, Gregor Dupuy, Paul Gay, Elie Khoury, Teva Merlin and Sylvain Meignier, Idiap-RR-33-2013

attachment

Gesture control interface for immersive panoramic displays, Marcel Alcoverro, Xavier Suau, Adolfo Lopez-Mendez, Josep R. Morros, Javier Ruiz-Hidalgo, Albert Gil and Josep R. Casas, in: Multimedia Tools and Applications, 1380-7501:1-27, 2013

[DOI]

Exploiting Scene Cues for Dropped Object Detection, Adolfo Lopez-Mendez, Florent Monay and Jean-Marc Odobez, in: 9th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications., 2014

attachment

Creative Applications of Human Behavior Understanding. HBU 2013: 1-14, Albert Ali Salah, Hayley Hung, Oya Aran and Hatice Gunes, in: Human Behavior Understanding, pages 1-14, 2013

Inferring Mood in Ubiquitous Conversational Video, Dairazalia Sanchez-Cortes, Joan-Isaac Biel, Shiro Kumano, Junji Yamato, Kazuhiro Otsuka and Daniel Gatica-Perez, in: 12th International Conference on Mobile and Ubiquitous Multimedia, Luleå, Sweden, ACM Press, 2013

attachment

Model-based Sparse Component Analysis for Reverberant Speech Localization, Afsaneh Asaei, Hervé Bourlard, Mohammad J. Taghizadeh and Volkan Cevher, in: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1439 - 1443, IEEE, 2014

attachment

[DOI]

Score Calibration in Face Recognition, Miranti I. Mantasari, Manuel Günther, Roy Wallace, Rahim Saedi, Sébastien Marcel and David Van Leeuwen, Idiap-RR-01-2014

attachment

Combining Vocal Tract Length Normalization with Hierarchical Linear Transformations, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, in: IEEE Journal of Selected Topics in Signal Processing - Special Issue on Statistical Parametric Speech Synthesis, 8(2):262 - 272, 2014

attachment

[DOI]

Broadcasting oneself: Visual Discovery of Vlogging Styles, Oya Aran, Joan-Isaac Biel and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 16(1):201-215, 2014

attachment

[DOI]

One of a Kind: Inferring Personality Impressions in Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013

attachment

Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013

attachment

Multiclass Latent Locally Linear Support Vector Machines, Marco Fornoni, Barbara Caputo and Francesco Orabona, in: JMLR W&CP, Volume 29: Asian Conference on Machine Learning, Canberra, Australia, pages 229-244, 2013

attachment

[URL]

A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, Kenneth Alberto Funes Mora, Laurent Son Nguyen, Daniel Gatica-Perez and Jean-Marc Odobez, in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013

attachment

[DOI]

Unsupervised methods for activity analysis and detection of abnormal events, Remi Emonet and Jean-Marc Odobez, in: Intelligent Video Surveillance Systems (ISTE), Wiley, 2013

attachment

[DOI]

Temporal Analysis of Motif Mixtures using Dirichlet Processes, Remi Emonet, Jagannadan Varadarajan and Jean-Marc Odobez, in: IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), 36(1), 2014

attachment

Observation of Vehicle Axles Through Pass-by Noise: A Strategy of Microphone Array Design, Patrick Marmaroli, M. Carmona, Xavier Falourd, Hervé Lissek and Jean-Marc Odobez, in: IEEE Trans. on Intelligent Transportation Systems, 2013

attachment

Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, Alexandre Heili, Adolfo Lopez Mendez and Jean-Marc Odobez, Idiap-RR-06-2014

attachment

Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, Elie Khoury, Laurent El Shafey, Chris McCool, Manuel Günther and Sébastien Marcel, in: Image and Vision Computing:1147-1160, 2014

[DOI]
[URL]

On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, Elie Khoury, Manuel Günther, Laurent El Shafey and Sébastien Marcel, in: Biometric Technologies in Forensic Science, Nijmegen, The Netherlands, 2013

attachment

Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project, Hervé Bourlard, Marc Ferras, Nikolaos Pappas, Andrei Popescu-Belis, Steve Renals, Fergus McInnes, Peter Bell, Sandy Ingram and Maël Guillemot, in: Workshop on Speech, Language and Audio in Multimedia, 2013

attachment

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 |

processing time: 0.0004 seconds.