logo Idiap Research Institute        
Project SNSF-MULTI
Name: SNSF-MULTI

Publications of project SNSF-MULTI
| 1 | 2 |

2014
2013
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013
attachment
Multilingual speech recognition A posterior based approach, David Imseng, École Polytechnique Fédérale de Lausanne (EPFL), 2013
attachment
Speaker adaptive Kullback-Leibler divergence based hidden Markov models, David Imseng and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013
attachment
2012
A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 7(1):241-254, 2012
attachment
Boosting localized binary features for speech recognition, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: Symposium on Machine Learning in Speech and Language Processing (MLSLP), 2012
attachment
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012
attachment
Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA, 2012
Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, Marco Fornoni and Barbara Caputo, in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012
attachment
MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263--268, 2012
attachment
Overview of the ImageCLEF 2012 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea and Barbara Caputo, in: Working Notes of the ImageCLEF 2012 Laboratory, 2012
attachment
Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012
attachment
Sampling techniques for audio-visual tracking and head pose estimation, Jean-Marc Odobez and Oswald Lanz, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 84-102, Cambridge University Press, 2012
attachment
Using KL-divergence and multilingual information to improve ASR for under-resourced languages, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012
attachment
2011
An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Transactions on Audio Speech and Language Processing, 19(2), 2011
[DOI]
Boosting Localized Features for Speaker and Speech Recognition, Anindya Roy, Ecole Polytechnique Federale de Lausanne (EPFL), 2011
attachment
Discovering Routines from Large-Scale Human Locations using Probabilistic Topic Models, Katayoun Farrahi and Daniel Gatica-Perez, in: ACM Transactions on Intelligent Systems and Technology, 2(1), 2011
attachment
Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011
attachment
Flickr Groups: Multimedia Communities for Multimedia Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: Internet Multimedia Search and Mining, Bentham Science Publishers, 2011
Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai.-Doss and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prag, CZ, pages 5012-5015, 2011
attachment
Phoneme Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011
attachment
Sensing the `Health State` of our Society, Anmol Madan, Manuel Cebrian, Sai Moturu, Katayoun Farrahi and Alex Pentland, in: IEEE Pervasive Computing, Special Issue on Large-Scale Opportunistic Sensing, 2011
attachment
Towards semi-supervised learning of semantic spatial concepts, Jesus Martinez-Gomez and Barbara Caputo, in: IEEE International Conference on Robotics and Automation, 2011
attachment
2010
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010
attachment
BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010
attachment
Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010
attachment
Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai.-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010
attachment
Introducing Crossmodal Biometrics:Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010
attachment
Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, Radu-Andrei Negoescu, Alexander Loui and Daniel Gatica-Perez, in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010
Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, Katayoun Farrahi and Daniel Gatica-Perez, in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010
attachment
Probabilistic Mining of Socio-Geographic Routines from Mobile Phone Data, Katayoun Farrahi and Daniel Gatica-Perez, in: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 4(4), 2010
attachment
Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai.-Doss, in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010
attachment
Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010
attachment
2009
Contextual classification of image patches with latent aspect models, Florent Monay, Pedro Quelhas, Jean-Marc Odobez and Daniel Gatica-Perez, in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009
attachment
Flickr Hypergroups, Radu-Andrei Negoescu, Brett Adams, Dinh Phung, Svetha Venkatesh and Daniel Gatica-Perez, in: Proceedings of the 17th ACM International Conference on Multimedia, 2009
attachment
Posterior features applied to speech recognition tasks with user-defined vocabulary, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai.-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009
attachment
Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings, Sarah Favre, in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), 2009
attachment
Topic Models for Scene Analysis and Abnormality Detection, Jagannadan Varadarajan and Jean-Marc Odobez, in: 9th International Workshop in Visual Surveillance, IEEE, Kyoto, Japan, IEEE, 2009
attachment
Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Hynek Hermansky and Mathew Magimai.-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009
attachment
2008
Analyzing Flickr Groups, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: Proc. of the Intl. Conf. on Image and Video Retrieval, ACM, 2008
Topickr: Flickr Groups and Users Reloaded, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008
| 1 | 2 |