This site is powered by Aigaion - A PHP/Web based management system for shared and annotated bibliographies. For more information visit Aigaion.nl. SourceForge.hetLogo
All publications
| 1-50 | 51-100 | 101-150 | 151-200 | 201-250 | 251-300 | 301-350 | 351-400 | 401-450 | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | 701-750 | 751-800 | 801-850 | 851-900 | 901-950 | 951-1000 | 1001-1050 | 1051-1100 | 1101-1150 | 1151-1200 | 1201-1250 | 1251-1300 | 1301-1350 | 1351-1400 | 1401-1450 | 1451-1500 | 1501-1550 | 1551-1600 | 1601-1650 | 1651-1665 |


2010

Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010
BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010
Multistream Speaker Diarization beyond Two Acoustic Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: International Conference on Acoustics, Speech, and Signal Processing, 2010
An Alternative Scanning Strategy to Detect Faces, Venkatesh Bala Subburaman and Sébastien Marcel, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010
Using Audio and Visual Cues for Speaker Diarisation Initialisation, Giulia Garau and Hervé Bourlard, in: International Conference on Acoustics, Speech and Signal Processing, 2010
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and Gerald Friedland, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010
Automatic Temporal Alignment of AV Data with Confidence Estimation, Danil Korchagin, Philip N. Garner and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Hui Liang, John Dines and Lakshmi Saheer, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010
Autoregressive Models of Amplitude Modulations in Audio Compression, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2010
[URL]
Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010

2009

Canal9: A database of political debates for analysis of social interactions, Alessandro Vinciarelli, Alfred Dielmann, Sarah Favre and Hugues Salamin, in: Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing), Amsterdam, Netherlands, pages 1-4, 2009
[DOI]
MLP Based Hierarchical System for Task Adaptation in ASR, Joel Praveen Pinto, Mathew Magimai.-Doss and Hervé Bourlard, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, pages 365-370, 2009
A Multimedia Retrieval System Using Speech Input, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Erik Boertjes, Jean Carletta, Sandro Castronovo, Michal Fapso, Alexandre Nanchen, Theresa Wilson, Joost de Wit and Majid Yazdani, in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009
Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions, Andrei Popescu-Belis, in: Multimodal Signal Processing for Human-Computer Interaction, pages 183-203, Elsevier / Academic Press, 2009
Accessing a Large Multimodal Corpus using an Automatic Content Linking Device, Andrei Popescu-Belis, Jean Carletta, Jonathan Kilgour and Peter Poller, in: Multimodal Corpora: From Models of Natural Interaction to Systems and Applications, pages 189-206, Springer-Verlag, 2009
[DOI]
Memoirs of Togetherness from Audio Logs, Danil Korchagin, in: Proceedings International ICST Conference on User Centric Media, Venice, Italy, 2009
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, in: EURASIP journal on audio speech and music processing, Special Issue on, 2009
[URL]
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: Audio Engineering Society (AES), 127th Convention, Audio Engineering Society (AES), Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA;, 2009
[URL]
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, pages 341-344, 2009
[URL]
Retrieving Ancient Maya Glyphs with Shape Context, Edgar Roman-Rangel, Carlos Pallan, Jean-Marc Odobez and Daniel Gatica-Perez, in: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, Kyoto, Japan, IEEE, 2009
Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of the 17th ACM International Conference on Multimedia, pages 833-836, ACM, 2009
Out-of-Scene AV Data Detection, Danil Korchagin, in: Proceedings IADIS International Conference Applied Computing, Rome, Italy, pages 244-248, 2009
Bayesian Networks to Combine Intensity and Color Information in Face Recognition, Guillaume Heusch and Sébastien Marcel, in: International Conference on Biometrics, pages 414-423, Springer, 2009
Who's Doing What: Joint Modeling of Names and Verbs for Simultaneous Face and Pose Annotation, Jie Luo, Barbara Caputo and Vittorio Ferrari, in: Advances in Neural Information Processing Systems 22 (NIPS09), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2009
Topic Models for Scene Analysis and Abnormality Detection, Jagannadan Varadarajan and Jean-Marc Odobez, in: 9th International Workshop in Visual Surveillance, IEEE, Kyoto, Japan, IEEE, 2009
Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings, Sarah Favre, in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), pages 25-32, 2009
| 1-50 | 51-100 | 101-150 | 151-200 | 201-250 | 251-300 | 301-350 | 351-400 | 401-450 | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | 701-750 | 751-800 | 801-850 | 851-900 | 901-950 | 951-1000 | 1001-1050 | 1051-1100 | 1101-1150 | 1151-1200 | 1201-1250 | 1251-1300 | 1301-1350 | 1351-1400 | 1401-1450 | 1451-1500 | 1501-1550 | 1551-1600 | 1601-1650 | 1651-1665 |