SNSF-MULTI - Idiap Publications

Update cookies preferences

Name:

SNSF-MULTI

| 1 | 2 |

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, Idiap-RR-01-2013

attachment

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, Idiap-RR-39-2013

attachment

MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, Idiap-RR-03-2013

attachment

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, Idiap-RR-02-2013

attachment

Using out-of-language data to improve an under-resourced speech recognizer, David Imseng, Petr Motlicek, Hervé Bourlard and Philip N. Garner, Idiap-RR-09-2013

attachment

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, Idiap-RR-15-2012

attachment

Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-28-2012

attachment

Continuous Speech Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-35-2011

attachment

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai-Doss and John Dines, Idiap-RR-13-2011

attachment

LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-14-2011

attachment

Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-12-2011

attachment

Towards semi-supervised learning of semantic spatial concepts, Jesus Martinez-Gomez and Barbara Caputo, Idiap-RR-03-2011

attachment

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-22-2010

attachment

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-13-2010

attachment

Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-01-2010

attachment

Flickr Groups: Multimedia Communities for Multimedia Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-18-2010

attachment

Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-14-2010

attachment

Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-29-2010

attachment

Mining Human Location-Routines using a Multi-Level Topic Model, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-28-2010

attachment

Modeling and Understanding Flickr Communities through Topic-based Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-19-2010

attachment

Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, Idiap-RR-33-2010

attachment

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-15-2010

attachment

Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, Anindya Roy and Sébastien Marcel, Idiap-RR-28-2009

attachment

Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-12-2009

attachment

On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, Mathew Magimai-Doss, Guillermo Aradilla and Hervé Bourlard, Idiap-RR-24-2009

attachment

Speaker Change Detection with Privacy-Preserving Audio Cues, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez and Hervé Bourlard, Idiap-RR-23-2009

attachment

Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, Idiap-RR-29-2009

attachment

Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Hynek Hermansky and Mathew Magimai-Doss, Idiap-RR-69-2008

attachment

Decision tree clustering for KL-HMM, David Imseng and John Dines, Idiap-Com-01-2012

attachment

Applying multi- and cross-lingual stochastic phone space transformations to non-native speech recognition, David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai-Doss, in: IEEE Transactions on Audio, Speech, and Language Processing, 2013

attachment

[DOI]

Using out-of-language data to improve an under-resourced speech recognizer, David Imseng, Petr Motlicek, Hervé Bourlard and Philip N. Garner, in: Speech Communication, 2013

attachment

[DOI]
[URL]

Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, in: IEEE Transactions on Audio, Speech, and Language Processing, 2012

attachment

A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 7(1):241-254, 2012

attachment

Phase AutoCorrelation (PAC) features for noise robust speech recognition, Shajith Ikbal, Hemant Misra, Hynek Hermansky and Mathew Magimai-Doss, in: Speech Communication, 54(7):867–880, 2012

[DOI]

Discovering Routines from Large-Scale Human Locations using Probabilistic Topic Models, Katayoun Farrahi and Daniel Gatica-Perez, in: ACM Transactions on Intelligent Systems and Technology, 2(1), 2011

attachment

Sensing the `Health State` of our Society, Anmol Madan, Manuel Cebrian, Sai Moturu, Katayoun Farrahi and Alex Pentland, in: IEEE Pervasive Computing, Special Issue on Large-Scale Opportunistic Sensing, 2011

attachment

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Transactions on Audio Speech and Language Processing, 19(2), 2011

[DOI]

Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard and Mathew Magimai-Doss, in: IEEE Transactions on Audio, Speech, and Language Processing, 19(8), 2011

attachment

Towards semi-supervised learning of semantic spatial concepts for mobile robots, Jesus Martinez-Gomez and Barbara Caputo, in: Journal of Physical Agents, 2011

attachment

Probabilistic Mining of Socio-Geographic Routines from Mobile Phone Data, Katayoun Farrahi and Daniel Gatica-Perez, in: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 4(4), 2010

attachment

Modeling and Understanding Flickr Communities through Topic-based Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 12(5), 2010

[DOI]

Contextual classification of image patches with latent aspect models, Florent Monay, Pedro Quelhas, Jean-Marc Odobez and Daniel Gatica-Perez, in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009

attachment

Sampling techniques for audio-visual tracking and head pose estimation, Jean-Marc Odobez and Oswald Lanz, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 84-102, Cambridge University Press, 2012

attachment

Flickr Groups: Multimedia Communities for Multimedia Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: Internet Multimedia Search and Mining, Bentham Science Publishers, 2011

How Do You Like Your Virtual Agent?: Human-Agent Interaction Experience through Nonverbal Features and Personality Traits, Aleksandra Cerekovic, Oya Aran and Daniel Gatica-Perez, in: Human Behavior Understanding, pages 1-15, Springer, 2014

attachment

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013

attachment

Overview of the ImageCLEF 2013 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea, Miguel Cazorla and Barbara Caputo, in: Working Notes, CLEF 2013, 2013

attachment

Speaker adaptive Kullback-Leibler divergence based hidden Markov models, David Imseng and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

attachment

Baseline Multimodal Place Classifier for the 2012 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea and Barbara Caputo, in: Working Notes of the ImageCLEF 2012 Laboratory, 2012

attachment

Boosting localized binary features for speech recognition, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: Symposium on Machine Learning in Speech and Language Processing (MLSLP), 2012

attachment

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012

attachment

Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA, 2012

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

attachment

Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, Marco Fornoni and Barbara Caputo, in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012

attachment

MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263--268, 2012

attachment

Overview of the ImageCLEF 2012 Robot Vision Task, Jesus Martinez-Gomez, Ismael Garcia-Varea and Barbara Caputo, in: Working Notes of the ImageCLEF 2012 Laboratory, 2012

attachment

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

attachment

Using KL-divergence and multilingual information to improve ASR for under-resourced languages, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012

attachment

Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model, Remi Emonet, Jagannadan Varadarajan and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition, 2011

attachment

Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011

attachment

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai-Doss and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prag, CZ, pages 5012-5015, 2011

attachment

LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, in: Interspeech, 2011

attachment

Pervasive Sensing to Model Political Opinions in Face-to-Face Networks, Anmol Madan, Katayoun Farrahi, Daniel Gatica-Perez and Alex Pentland, in: Pervasive, San Francisco, 2011

attachment

Phoneme Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011

attachment

Towards semi-supervised learning of semantic spatial concepts, Jesus Martinez-Gomez and Barbara Caputo, in: IEEE International Conference on Robotics and Automation, 2011

attachment

A Multi Cue Discriminative Approach to Semantic Place Classification, Marco Fornoni, Jesus Martinez-Gomez and Barbara Caputo, in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010

attachment

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010

attachment

BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010

attachment

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010

attachment

Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, in: ICASSP 2010, 2010

attachment

Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010

attachment

Introducing Crossmodal Biometrics:Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010

attachment

Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, Radu-Andrei Negoescu, Alexander Loui and Daniel Gatica-Perez, in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010

Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, Katayoun Farrahi and Daniel Gatica-Perez, in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010

attachment

Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: BMVC 2010, Aberystwyth University, Aberystwyth, BMVA Press, 2010

attachment

The Robot Vision Track at ImageCLEF 2010, Andrzej Pronobis, Marco Fornoni, Henrik I. Christensen and Barbara Caputo, in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010

attachment

[URL]

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai-Doss, in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010

attachment

Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010

attachment

Flickr Hypergroups, Radu-Andrei Negoescu, Brett Adams, Dinh Phung, Svetha Venkatesh and Daniel Gatica-Perez, in: Proceedings of the 17th ACM International Conference on Multimedia, 2009

attachment

Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, Anindya Roy and Sébastien Marcel, in: British Machine Vision Conference 2009, 2009

attachment

Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, in: Proceedings of Interspeech 2009, 2009

attachment

Learning and Predicting Multimodal Daily Life Patterns from Cell Phones, Katayoun Farrahi and Daniel Gatica-Perez, in: ICMI-MLMI, 2009

attachment

Posterior features applied to speech recognition tasks with user-defined vocabulary, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009

attachment

Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings, Sarah Favre, in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), 2009

attachment

Speaker Change Detection with Privacy-Preserving Audio Cues, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez and Hervé Bourlard, in: Proceedings of ICMI-MLMI 2009, 2009

attachment

Topic Models for Scene Analysis and Abnormality Detection, Jagannadan Varadarajan and Jean-Marc Odobez, in: 9th International Workshop in Visual Surveillance, IEEE, Kyoto, Japan, IEEE, 2009

attachment

Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Hynek Hermansky and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009

attachment

Analyzing Flickr Groups, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: Proc. of the Intl. Conf. on Image and Video Retrieval, ACM, 2008

Topickr: Flickr Groups and Users Reloaded, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008

Saliency-based Representations and Multi-component Classifiers for Visual Scene Recognition, Marco Fornoni, École Polytechnique Fédérale de Lausanne (EPFL), 2014

attachment

Multilingual speech recognition A posterior based approach, David Imseng, École Polytechnique Fédérale de Lausanne (EPFL), 2013

attachment

Sequential Topic Models for Mining Recurrent Activities and their Relationships : Application to long term video recordings, Jagannadan Varadarajan, École Polytechnique Fédérale de Lausanne, 2012

attachment

Boosting Localized Features for Speaker and Speech Recognition, Anindya Roy, Ecole Polytechnique Federale de Lausanne (EPFL), 2011

attachment

Modeling and understanding communities in online social media using probabilistic methods, Radu-Andrei Negoescu, Ecole polytechnique fédérale de Lausanne, 2011

attachment

[DOI]
[URL]

An Information Theoretic Approach to Speaker Diarization of Meeting Recordings, Deepu Vijayasenan, Ecole polytechnique fédérale de Lausanne, 2010

attachment

| 1 | 2 |

processing time: 2.3993 seconds.