Publication list - Idiap Publications

Update cookies preferences

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 |

Implementation of VTLN for Statistical Speech Synthesis, Lakshmi Saheer, John Dines, Philip N. Garner and Hui Liang, Idiap-RR-32-2010

attachment

Neural conditional random fields, Trinh-Minh-Tri Do and Thierry Artieres, in: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna, Sardinia, Italy, JMLR: W&CP, 2010

attachment

Online-Batch Strongly Convex Multi Kernel Learning, Francesco Orabona, Jie Luo and Barbara Caputo, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

attachment

The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, Andrzej Pronobis, Jie Luo and Barbara Caputo, in: Image and Vision Computing, 2010

attachment

[DOI]

A Multimodal Corpus for Studying Dominance in Small Group Conversations, Oya Aran, Hayley Hung and Daniel Gatica-Perez, in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010

attachment

Multilayer Perceptron Based Hierarchical Acoustic Modeling for Automatic Speech Recognition, Joel Praveen Pinto, Ecole polytechnique fédérale de Lausanne, 2010

attachment

Joint Pose Estimator and Feature Learning for Object Detection, Karim Ali, Francois Fleuret, David Hasler and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2009

Evaluation of Probabilistic Occupancy Map People Detection for Surveillance Systems, Jerome Berclaz, Ali Shahrokni, Francois Fleuret, James Ferryman and Pascal Fua, in: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009

Analysis of Phone Posterior Feature Space Exploiting Class Specific Sparsity and MLP-based Similarity Measure, Afsaneh Asaei, Benjamin Picart and Hervé Bourlard, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

attachment

Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 33(1):101-116, 2011

attachment

Learning Large Margin Likelihood for Realtime Head Pose Tracking, Elisa Ricci and Jean-Marc Odobez, in: IEEE Int. Conference on Image Processing, Cairo, Egypt, IEEE, 2009

attachment

Structure and appearance features for robust 3D facial actions tracking, Stéphanie Lefèvre and Jean-Marc Odobez, in: IEEE Proc. Int. Conf. on Multimedia and Expo, IEEE, 2009

attachment

Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Mathew Magimai-Doss, Hynek Hermansky and Hervé Bourlard, in: IEEE Transcations on Audio, Speech, and Language Processing, 19(2):225-241, 2011

attachment

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-13-2010

attachment

Finding without searching, Andrei Popescu-Belis, Idiap-Com-01-2010

attachment

Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

attachment

BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010

attachment

Multistream Speaker Diarization beyond Two Acoustic Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: International Conference on Acoustics, Speech, and Signal Processing, 2010

attachment

AMIDA/Klewel Mini-Project, Petr Motlicek, Philip N. Garner, Maël Guillemot and Vincent Bozzo, Idiap-RR-03-2010

attachment

An Alternative Scanning Strategy to Detect Faces, Venkatesh Bala Subburaman and Sébastien Marcel, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

attachment

Canal9: A database of political debates for analysis of social interactions, Alessandro Vinciarelli, Alfred Dielmann, Sarah Favre and Hugues Salamin, in: Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing), Amsterdam, Netherlands, 2009

attachment

[DOI]

Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, in: ICASSP 2010, 2010

attachment

Using Audio and Visual Cues for Speaker Diarisation Initialisation, Giulia Garau and Hervé Bourlard, in: International Conference on Acoustics, Speech and Signal Processing, 2010

attachment

An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and Gerald Friedland, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010

attachment

On Improving Face Detection Performance by Modelling Contextual Information, Cosmin Atanasoaei, Chris McCool and Sébastien Marcel, Idiap-RR-43-2010

attachment

Automatic Temporal Alignment of AV Data with Confidence Estimation, Danil Korchagin, Philip N. Garner and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

attachment

MLP Based Hierarchical System for Task Adaptation in ASR, Joel Praveen Pinto, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009

attachment

VTLN Adaptation for Statistical Speech Synthesis, Lakshmi Saheer, Philip N. Garner, John Dines and Hui Liang, Idiap-RR-41-2009

attachment

VTLN Adaptation for Statistical Speech Synthesis, Lakshmi Saheer, Philip N. Garner, John Dines and Hui Liang, in: Proceedings of ICASSP, Dallas, Texas, 2010

attachment

Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, Gelareh Mohammadi and Alessandro Vinciarelli, Idiap-RR-05-2012

attachment

A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Hui Liang, John Dines and Lakshmi Saheer, Idiap-RR-05-2010

attachment

A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Hui Liang, John Dines and Lakshmi Saheer, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010

attachment

Bayesian Networks as Generative Models for Face Recognition, Guillaume Heusch, EPFL, 2009

attachment

A Multimedia Retrieval System Using Speech Input, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Erik Boertjes, Jean Carletta, Sandro Castronovo, Michal Fapso, Alexandre Nanchen, Theresa Wilson, Joost de Wit and Majid Yazdani, in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009

attachment

The FEMTI guidelines for contextual MT evaluation: principles and tools, Paula Estrella, Andrei Popescu-Belis and Margaret King, in: Linguistica Antverpiensia New Series, 8, 2009

Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions, Andrei Popescu-Belis, in: Multimodal Signal Processing for Human-Computer Interaction, Elsevier / Academic Press, 2009

Accessing a Large Multimodal Corpus using an Automatic Content Linking Device, Andrei Popescu-Belis, Jean Carletta, Jonathan Kilgour and Peter Poller, in: Multimodal Corpora: From Models of Natural Interaction to Systems and Applications, Springer-Verlag, 2009

attachment

[DOI]

User Interface Design in a Just-in-time Retrieval System for Meetings, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Mike Flynn, Sebastian Germesin, Alexandre Nanchen and Majid Yazdani, Idiap-RR-38-2009

attachment

On MLP-based Posterior Features for Template-based ASR, Serena Soldo, Mathew Magimai-Doss, Joel Praveen Pinto and Hervé Bourlard, Idiap-RR-37-2009

attachment

Memoirs of Togetherness from Audio Logs, Danil Korchagin, in: Proceedings International ICST Conference on User Centric Media, Venice, Italy, 2009

attachment

Autoregressive Models of Amplitude Modulations in Audio Compression, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2010

attachment

[URL]

Wide-Band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, in: EURASIP Journal on Audio Speech and Music Processing, 2010(856280), 2010

attachment

[DOI]
[URL]

MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: Audio Engineering Society (AES,',','), 127th Convention, Audio Engineering Society (AES), Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA;, 2009

attachment

[URL]

APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009

attachment

[URL]

APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, Idiap-RR-35-2009

attachment

MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-34-2009

attachment

Autoregressive Models of Amplitude Modulations in Audio Compression, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-33-2009

attachment

Wide-Band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-32-2009

attachment

On the vulnerability of face verification systems to hill-climbing attacks, Javier Galbally, Chris McCool, Julian Fierrez, Sébastien Marcel and Javier Ortega-Garcia, in: Pattern Recognition, 2009

Retrieving Ancient Maya Glyphs with Shape Context, Edgar Roman-Rangel, Carlos Pallan, Jean-Marc Odobez and Daniel Gatica-Perez, in: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, Kyoto, Japan, IEEE, 2009

attachment

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 |

processing time: 0.9794 seconds.