Idiap Research Institute
All publications sorted by journal and type


7th International Conference on Language Resources and Evaluation (2010)


Proceedings of Interspeech (2010)

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai.-Doss, in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010

Proc. ACM Int. Conf. on Pervasive Services (ICPS), Berlin (2010)

Towards rich mobile phone datasets: Lausanne data collection campaign, N. Kiukkonen, J. Blom, O. Dousse, Daniel Gatica-Perez and J. K. Laurila, in: Proc. ACM Int. Conf. on Pervasive Services (ICPS), Berlin, 2010

Proceedings of Interspeech (2010)

Tracter: A Lightweight Dataflow Framework, Philip N. Garner and John Dines, in: Proceedings of Interspeech, Makuhari, Japan, 2010

International Conference on Acoustics, Speech and Signal Processing (2010)

Using Audio and Visual Cues for Speaker Diarisation Initialisation, Giulia Garau and Hervé Bourlard, in: International Conference on Acoustics, Speech and Signal Processing, 2010

Proceedings of ICASSP (2010)


Proc. Int. Conf. on Computer Vision Theory and Applications (2010)

View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, Stéphanie Lefèvre and Jean-Marc Odobez, in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010

ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland (2010)

Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010

Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI) (2010)

Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010

Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC (2010)

Voices of Vlogging, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010

Proceedings of ICASSP (2010)


Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction) (2009)

A Multimedia Retrieval System Using Speech Input, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Erik Boertjes, Jean Carletta, Sandro Castronovo, Michal Fapso, Alexandre Nanchen, Theresa Wilson, Joost de Wit and Majid Yazdani, in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009

International Conference on Developmental Learning (2009)


Proceeding of The 9th Asian Conference on Computer Vision (2009)

An online framework for learning novel concepts over multiple cues, Jie Luo, Francesco Orabona and Barbara Caputo, in: Proceeding of The 9th Asian Conference on Computer Vision, Xi'an, China, 2009

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09. (2009)

Applications of Signal Analysis Using Autoregressive Models for Amplitude Modulation, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09, IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009

10th Annual Conference of the International Speech Communication Association (2009)

Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, ISCA 2009, 2009

ACM International Conference on Multimedia (2009)


10th Annual Conference of the International Speech Communication Association (2009)

Automatic vs. human question answering over multimedia meeting recordings, Quoc Anh Le and Andrei Popescu-Belis, in: 10th Annual Conference of the International Speech Communication Association, 2009

International Conference on Biometrics (2009)


Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing) (2009)

Canal9: A database of political debates for analysis of social interactions, Alessandro Vinciarelli, Alfred Dielmann, Sarah Favre and Hugues Salamin, in: Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing), Amsterdam, Netherlands, 2009

Proceedings ICME 2009 (2009)


Proceedings ICMI-MLMI (2009)


Proceedings of the British Machine Vision Conference (2009)

Dynamic Partitioned Sampling For Tracking With Discriminative Features, Stefan Duffner, Jean-Marc Odobez and Elisa Ricci, in: Proceedings of the British Machine Vision Conference, London, 2009

12th International Conference on Text, Speech and Dialogue, TSD 2009 (2009)

Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer-Verlag, Berlin Heidelberg, 2009

Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance (2009)

Evaluation of Probabilistic Occupancy Map People Detection for Surveillance Systems, Jerome Berclaz, Ali Shahrokni, Francois Fleuret, James Ferryman and Pascal Fua, in: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009

Proceedings of the 17th ACM International Conference on Multimedia (2009)

Flickr Hypergroups, Radu-Andrei Negoescu, Brett Adams, Dinh Phung, Svetha Venkatesh and Daniel Gatica-Perez, in: Proceedings of the 17th ACM International Conference on Multimedia, 2009

British Machine Vision Conference 2009 (2009)


Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech) (2009)

Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, Fabio Valente, Mathew Magimai.-Doss, Christian Plahl and Ravuri Suman, in: Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009

Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS) (2009)

Hill-Climbing Attack to an Eigenface-Based Face Verification System, Javier Galbally, Chris McCool, Julian Fierrez, Sébastien Marcel and Javier Ortega-Garcia, in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009

Proceedings of IEEE Conference on Multimedia and Expo (2009)

Implicit Human Centered Tagging, Alessandro Vinciarelli, Nicolae Suditu and Maja Pantic, in: Proceedings of IEEE Conference on Multimedia and Expo, 2009

Proceedings of Interspeech 2009 (2009)


Proceedings of the ACM International Conference on Multimedia (2009)

Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, Giulia Garau, Silèye O. Ba, Hervé Bourlard and Jean-Marc Odobez, in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009

Proceedings of the IEEE International Conference on Computer Vision (2009)

Joint Pose Estimator and Feature Learning for Object Detection, Karim Ali, Francois Fleuret, David Hasler and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2009

10th Annual Conference of the International Speech Communication Association (2009)

KL Realignment for Speaker Diarization with Multiple Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: 10th Annual Conference of the International Speech Communication Association, 2009

ICMI-MLMI (2009)


IEEE Int. Conference on Image Processing, Cairo, Egypt (2009)

Learning Large Margin Likelihood for Realtime Head Pose Tracking, Elisa Ricci and Jean-Marc Odobez, in: IEEE Int. Conference on Image Processing, Cairo, Egypt, IEEE, 2009

Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2009)

Learning Rotational Features for Filament Detection, German Gonzalez, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2009

Audio Engineering Society (AES), 127th Convention (2009)

MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: Audio Engineering Society (AES), 127th Convention, Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA, 2009

Proceedings of Interspeech (2009)

Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, in: Proceedings of Interspeech, Brighton, U.K., 2009

Proceedings International ICST Conference on User Centric Media (2009)

Memoirs of Togetherness from Audio Logs, Danil Korchagin, in: Proceedings International ICST Conference on User Centric Media, Venice, Italy, 2009

Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (2009)

MLP Based Hierarchical System for Task Adaptation in ASR, Joel Praveen Pinto, Mathew Magimai.-Doss and Hervé Bourlard, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009

IEEE International conference on Robotics and Automation (2009)


International Conference on Audio, Speech and Signal Processing (2009)


Proceedings of International Conference on Acoustics, Speech and Signal Processing (2009)

Mutual Information Based Channel Selection for Speaker Diarization of Meetings Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009

Proceedings of International conference on acoustics speech and signal processing (2009)

Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of International conference on acoustics speech and signal processing, 2009

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2009)

Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, Weifeng Li, John Dines, Mathew Magimai.-Doss and Hervé Bourlard, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009

Proceedings IADIS International Conference Applied Computing (2009)

Out-of-Scene AV Data Detection, Danil Korchagin, in: Proceedings IADIS International Conference Applied Computing, Rome, Italy, 2009

Workshop of the Cross-Language Evaluation Forum (2009)

Overview of the CLEF 2009 medical image annotation track, Tatiana Tommasi, Barbara Caputo, Petra Welter, Mark O. Güld and Thomas M Deserno, in: Workshop of the Cross-Language Evaluation Forum, Corfu, Greece, pages 85-93, Springer Berlin Heidelberg, 2009

Proceedings of IEEE/IAPR International Conference on Biometrics (2009)

Parts-Based Face Verification using Local Frequency Bands, Chris McCool and Sébastien Marcel, in: Proceedings of IEEE/IAPR International Conference on Biometrics, 2009