Idiap Research Institute
All publications

2014
Combining Vocal Tract Length Normalization with Hierarchical Linear Transformations, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, in: IEEE Journal of Selected Topics in Signal Processing - Special Issue on Statistical Parametric Speech Synthesis, 8(2):262-272, 2014
Broadcasting oneself: Visual Discovery of Vlogging Styles, Oya Aran, Joan-Isaac Biel and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 16(1):201-215, 2014
Temporal Analysis of Motif Mixtures using Dirichlet Processes, Remi Emonet, Jagannadan Varadarajan and Jean-Marc Odobez, in: IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), 36(1), 2014
A Survey of Personality Computing, Alessandro Vinciarelli and Gelareh Mohammadi, in: IEEE Transactions on Affective Computing, 5(3):273-291, 2014
Structured Sparsity Models for Reverberant Speech Separation, Afsaneh Asaei, Mohammad Golbabaee, Hervé Bourlard and Volkan Cevher, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2014
2013
Transfer in Inverse Reinforcement Learning for Multiple Strategies, Ajay Kumar Tanwani and Aude Billard, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan, pages 3244-3250, IEEE, 2013
MLP-based Factor Analysis for Tandem Speech Recognition, Marc Ferras and Hervé Bourlard, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2013
Hi YouTube! Personality Impressions and Verbal Content in Social Video, Joan-Isaac Biel, Daniel Gatica-Perez, John Dines and Vagia Tsminiaki, in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013
Object Classification and Detection in High Dimensional Feature Space, Charles Dubout, Programme doctoral en Informatique, Communications et Information, 2013
Speech Processing, Mathew Magimai.-Doss, in: Interactive Multimodal Information Management, pages 221-245, EPFL Press, 2013
A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai.-Doss, Friedrich Faubel and Dietrich Klakow, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013
Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, Vasil Khalidov, Florence Forbes and Radu Horaud, in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013
3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, Kenneth Alberto Funes Mora, in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013
Context Aware Addressee Estimation for Human Robot Interaction, Samira Sheikhi, Dinesh Babu Jayagopi, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013
Leveraging the robot dialog state for visual focus of attention recognition, Samira Sheikhi, Vasil Khalidov, David Klotz, Britta Wrede and Jean-Marc Odobez, in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013
Evaluating Intra- and Crosslingual Adaptation for Non-native Speech Recognition in a Bilingual Environment, Gyorgy Szaszak and Philip N. Garner, in: Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications, IEEE, Budapest, Hungary, pages 357-361, 2013
Using the Europarl corpus for cross-linguistic research, Bruno Cartoni, Sandrine Zufferey and Thomas Meyer, in: Belgian Journal of Linguistics, 27:23-42, 2013
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Lyon, France, pages 510-514, ISCA, 2013
Accent Adaptation Using Subspace Gaussian Mixture Models, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013
Feature and Score Level Combination of Subspace Gaussians in LVCSR Task, Petr Motlicek, Daniel Povey and Martin Karafiat, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013
Manifold Sparse Beamforming, Baran Gözcü, Afsaneh Asaei and Volkan Cevher, in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013
Convexity in source separation: Models, geometry, and algorithms, Michael McCoy, Volkan Cevher, Quoc Tran Dinh, Afsaneh Asaei and Luca Baldassarre, in: IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications, 2013
Inferring Mood in Ubiquitous Conversational Video, Dairazalia Sanchez-Cortes, Joan-Isaac Biel, Shiro Kumano, Junji Yamato, Kazuhiro Otsuka and Daniel Gatica-Perez, in: 12th International Conference on Mobile and Ubiquitous Multimedia, Luleå, Sweden, ACM Press, 2013
One of a Kind: Inferring Personality Impressions in Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013
Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013
Multiclass Latent Locally Linear Support Vector Machines, Marco Fornoni, Barbara Caputo and Francesco Orabona, in: JMLR W&CP, Volume 29: Asian Conference on Machine Learning, Canberra, Australia, pages 229-244, 2013
A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, Kenneth Alberto Funes Mora, Laurent Son Nguyen, Daniel Gatica-Perez and Jean-Marc Odobez, in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013
Unsupervised methods for activity analysis and detection of abnormal events, Remi Emonet and Jean-Marc Odobez, in: Intelligent Video Surveillance Systems (ISTE), Wiley, 2013