logo Idiap Research Institute        
Project SNSF-MULTI
Name: SNSF-MULTI

Publications of SNSF-MULTI sorted by journal and type
| 1 | 2 |


Publications of type Idiap-RR


2013


2012


2011


2010


2009


2008


Publications of type Idiap-Com


2012


IEEE Transactions on Audio, Speech, and Language Processing


Speech Communication


IEEE Transactions on Audio, Speech, and Language Processing


IEEE Transactions on Information Forensics and Security

A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 7(1):241-254, 2012
attachment

Speech Communication


ACM Transactions on Intelligent Systems and Technology

Discovering Routines from Large-Scale Human Locations using Probabilistic Topic Models, Katayoun Farrahi and Daniel Gatica-Perez, in: ACM Transactions on Intelligent Systems and Technology, 2(1), 2011
attachment

IEEE Pervasive Computing, Special Issue on Large-Scale Opportunistic Sensing

Sensing the `Health State` of our Society, Anmol Madan, Manuel Cebrian, Sai Moturu, Katayoun Farrahi and Alex Pentland, in: IEEE Pervasive Computing, Special Issue on Large-Scale Opportunistic Sensing, 2011
attachment

IEEE Transactions on Audio Speech and Language Processing

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Transactions on Audio Speech and Language Processing, 19(2), 2011
[DOI]

IEEE Transactions on Audio, Speech, and Language Processing


Journal of Physical Agents


IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING

Probabilistic Mining of Socio-Geographic Routines from Mobile Phone Data, Katayoun Farrahi and Daniel Gatica-Perez, in: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 4(4), 2010
attachment

IEEE Transactions on Multimedia


EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision

Contextual classification of image patches with latent aspect models, Florent Monay, Pedro Quelhas, Jean-Marc Odobez and Daniel Gatica-Perez, in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009
attachment

Multimodal Signal Processing: Human Interactions in Meetings (2012)

Sampling techniques for audio-visual tracking and head pose estimation, Jean-Marc Odobez and Oswald Lanz, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 84-102, Cambridge University Press, 2012
attachment

Internet Multimedia Search and Mining (2011)

Flickr Groups: Multimedia Communities for Multimedia Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: Internet Multimedia Search and Mining, Bentham Science Publishers, 2011

Human Behavior Understanding (2014)


Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013) (2013)

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013
attachment

Working Notes, CLEF 2013 (2013)


Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2013)

Speaker adaptive Kullback-Leibler divergence based hidden Markov models, David Imseng and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013
attachment

Working Notes of the ImageCLEF 2012 Laboratory (2012)


Symposium on Machine Learning in Speech and Language Processing (MLSLP) (2012)

Boosting localized binary features for speech recognition, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: Symposium on Machine Learning in Speech and Language Processing (MLSLP), 2012
attachment

Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages (2012)

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012
attachment

IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA (2012)

Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA, 2012

Proceedings of Interspeech (2012)


Proceedings of the British Machine Vision Conference (2012)

Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, Marco Fornoni and Barbara Caputo, in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012
attachment

Proceedings of the 2012 IEEE Workshop on Spoken Language Technology (2012)

MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263--268, 2012
attachment

Working Notes of the ImageCLEF 2012 Laboratory (2012)

Overview of the ImageCLEF 2012 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea and Barbara Caputo, in: Working Notes of the ImageCLEF 2012 Laboratory, 2012
attachment

Proceedings of Interspeech (2012)

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012
attachment

Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2012)

Using KL-divergence and multilingual information to improve ASR for under-resourced languages, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012
attachment

IEEE Conference on Computer Vision and Pattern Recognition (2011)


IAPR IEEE International Joint Conference on Biometrics (2011)

Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011
attachment

Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2011)

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai.-Doss and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prag, CZ, pages 5012-5015, 2011
attachment

Interspeech (2011)


Pervasive (2011)


IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011 (2011)

Phoneme Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011
attachment

IEEE International Conference on Robotics and Automation (2011)

Towards semi-supervised learning of semantic spatial concepts, Jesus Martinez-Gomez and Barbara Caputo, in: IEEE International Conference on Robotics and Automation, 2011
attachment

CLEF 2010 Notebook Papers/LABs/Workshops (2010)


NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions (2010)

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010
attachment

2010 IEEE International Conference on Acoustics, Speech and Signal Processing (2010)

BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010
attachment

20th International Conference on Pattern Recognition, Istanbul, Turkey (2010)

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010
attachment

ICASSP 2010 (2010)


Proceedings of Interspeech (2010)

Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai.-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010
attachment

IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems (2010)

Introducing Crossmodal Biometrics:Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010
attachment

Proc. of the 18th Intl. Conf. on Multimedia (2010)

Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, Radu-Andrei Negoescu, Alexander Loui and Daniel Gatica-Perez, in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010

2010 IEEE Second International Conference on Social Computing, SIN Symposium (2010)

Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, Katayoun Farrahi and Daniel Gatica-Perez, in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010
attachment

BMVC 2010 (2010)


CLEF 2010 Notebook Papers/LABs/Workshops (2010)


Proceedings of Interspeech (2010)

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai.-Doss, in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010
attachment

ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland (2010)

Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010
attachment

Proceedings of the 17th ACM International Conference on Multimedia (2009)

Flickr Hypergroups, Radu-Andrei Negoescu, Brett Adams, Dinh Phung, Svetha Venkatesh and Daniel Gatica-Perez, in: Proceedings of the 17th ACM International Conference on Multimedia, 2009
attachment

British Machine Vision Conference 2009 (2009)


Proceedings of Interspeech 2009 (2009)


ICMI-MLMI (2009)


Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2009)

Posterior features applied to speech recognition tasks with user-defined vocabulary, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai.-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009
attachment

Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII) (2009)

Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings, Sarah Favre, in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), 2009
attachment

Proceedings of ICMI-MLMI 2009 (2009)


9th International Workshop in Visual Surveillance (2009)

Topic Models for Scene Analysis and Abnormality Detection, Jagannadan Varadarajan and Jean-Marc Odobez, in: 9th International Workshop in Visual Surveillance, IEEE, Kyoto, Japan, IEEE, 2009
attachment

Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2009)

Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Hynek Hermansky and Mathew Magimai.-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009
attachment

Proc. of the Intl. Conf. on Image and Video Retrieval (2008)

Analyzing Flickr Groups, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: Proc. of the Intl. Conf. on Image and Video Retrieval, ACM, 2008

MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia (2008)

Topickr: Flickr Groups and Users Reloaded, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008

Publications of type Phdthesis


2014


2013

Multilingual speech recognition A posterior based approach, David Imseng, École Polytechnique Fédérale de Lausanne (EPFL), 2013
attachment

2012


2011

Boosting Localized Features for Speaker and Speech Recognition, Anindya Roy, Ecole Polytechnique Federale de Lausanne (EPFL), 2011
attachment

2010

| 1 | 2 |