All publications sorted by journal and type
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 |
INTERSPEECH (2012)
Speaker diarization of overlapping speech based on silence distribution in meeting recordings, and , in: INTERSPEECH, Portland, Oregon, USA, 2012 |
|
Ubicomp'12 (2012)
StressSense: Detecting Stress in Unconstrained Acoustic Environments using Smartphones, , , , , , , and , in: Ubicomp'12, Pittsburgh, 2012 |
|
SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition (2012)
Structured Sparse Coding for Microphone Array Location Calibration, , , and , in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012 |
|
Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech) (2012)
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , in: Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech), Portland, Oregon, 2012 |
|
Proceedings of Interspeech (2012)
Supervised and unsupervised Web-based language model domain adaptation, , , and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
|
Synthetic References for Template-based ASR using Posterior Features, , and , in: Proceedings of Interspeech, Portland, Oregon, USA, 2012 |
|
SAPA-SCALE Conference, International Speech Communication Association (2012)
Template-based ASR using Posterior features and synthetic references: comparing different TTS systems, , and , in: SAPA-SCALE Conference, International Speech Communication Association, 2012 |
|
Proceedings of AAAI International Conference on Weblogs and Social Media (2012)
The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012 |
|
NIST Speaker Recognition Conference (2012)
The I4U Submission to the 2012 NIST Speaker Recognition Evaluation, , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: NIST Speaker Recognition Conference, 2012 |
The Idiap Speaker Recognition Evaluation System at NIST SRE 2012, , and , in: NIST Speaker Recognition Conference, NIST, Orlando, USA, 2012 |
|
in Proceedings of INTERSPEECH (2012)
The INTERSPEECH 2012 Speaker Trait Challenge, , , , , , , , , , , and , in: in Proceedings of INTERSPEECH, 2012 |
Pervasive Computing (2012)
The Mobile Data Challenge: Big Data for Mobile Computing Research, , , , , , , , and , in: Pervasive Computing, Newcastle, 2012 |
|
Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA) (2012)
Translating English Discourse Connectives into Arabic: a Corpus-based Analysis and an Evaluation Metric, and , in: Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), 2012 |
|
European Conference on Computer Vision (2012)
Unsupervised Activity Analysis and Monitoring algorithms for Effective Surveillance Systems, , , , , , , and , in: European Conference on Computer Vision, 2012 |
|
RecSys, Recommendation Utility Evaluation (RUE 2012) (2012)
Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, and , in: RecSys, Recommendation Utility Evaluation (RUE 2012), Dublin, Ireland, pages 15-20, 2012 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2012)
Using KL-divergence and multilingual information to improve ASR for under-resourced languages, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012 |
|
Proceedings of the 14th ACM International Conference on Multimodal Interaction (2012)
Using Self-Context for Multimodal Detection of Head Nods in Face-to-Face Interactions, , and , in: Proceedings of the 14th ACM International Conference on Multimodal Interaction, 2012 |
|
Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra) (2012)
Using Sense-labeled Discourse Connectives for Statistical Machine Translation, and , in: Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra), Avignon, FR, pages 129--138, 2012 |
|
Proceedings of Interspeech (2012)
Using Sparse Classification Outputs as Feature Observations for Noise Robust ASR, , , , , and , in: Proceedings of Interspeech, 2012 |
|
IEEE International Conference on Computer Vision and Pattern Recognition (2012)
We are not Contortionists: Coupled Adaptive Learning for Head and Body Orientation Estimation in Surveillance Video, and , in: IEEE International Conference on Computer Vision and Pattern Recognition, 2012 |
|
European Signal Processing Conference (2011)
A Bimodal Sound Source Model for Vehicle Tracking in Traffic Monitoring, , , and , in: European Signal Processing Conference, 2011 |
|
Proceedings of the 19th European Signal Processing Conference (EUSIPCO) (2011)
A BSS-based Approach for Localization of Simultaneous Speakers in Reverberant Conditions, , , and , in: Proceedings of the 19th European Signal Processing Conference (EUSIPCO), 2011 |
|
Proceedings of International Symposium on Artificial Intelligence and Signal Processing (2011)
A Compressive Sensing Based Compressed Neural Network for Sound Source Localization, , and , in: Proceedings of International Symposium on Artificial Intelligence and Signal Processing, 2011 |
|
Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?" (2011)
A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation, , , and , in: Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?", Hamburg, Germany, pages 5, 2011 |
|
IEEE International Workshop on Socially Intelligent Surveillance and Monitoring (2011)
A Joint Estimation of Head and Body Orientation Cues in Surveillance Video, , and , in: IEEE International Workshop on Socially Intelligent Surveillance and Monitoring, 2011 |
|
SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session (2011)
A Just-in-Time Document Retrieval System for Dialogues or Monologues, , , and , in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, Portland, OR, pages 350-352, 2011 |
|
Proceedings of the 22nd British Machine Vision Conference (2011)
A Large-Scale Database of Images and Captions for Automatic Face Naming, , , and , in: Proceedings of the 22nd British Machine Vision Conference, 2011 |
|
Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics) (2011)
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), Portland, OR, pages 80-86, 2011 |
[URL] |
Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future (2011)
An Audio Visual Corpus for Emergent Leader Analysis, , and , in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011 |
The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays (2011)
An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, , , , and , in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011 |
|
Proceedings of Interspeech 2011 (2011)
Analysis and Comparison of Recent MLP Features for LVCSR Systems, , and , in: Proceedings of Interspeech 2011, 2011 |
|
Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (2011)
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , in: Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, Edinburgh, UK, 2011 |
|
1st International SystemsX.ch Conference on Systems Biology (2011)
Automated Quantification of Morphodynamics for High-Throughput Live Cell Imaging Datasets, , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
Proceedings International Conference on Signal Acquisition and Processing (2011)
Automatic Time Skew Detection and Correction, , in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Proceedings of the Neural Information Processing Systems Conference (2011)
Boosting with Maximum Adaptive Sampling, and , in: Proceedings of the Neural Information Processing Systems Conference, 2011 |
Proceedings of Corpus Linguistics Conference (2011)
Building 'directional corpora' for unbiased contrastive analysis, and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 29-30, 2011 |
|
AVSS (2011)
Combined Estimation of Location and Body Pose in Surveillance Video, , and , in: AVSS, 2011 |
|
Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA (2011)
Competition on Counter Measures to 2-D Facial Spoofing Attacks, , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011 |
|
12th International Conference on Mobile Data Management (2011)
Contextual grouping: discovering real-life interaction types from longitudinal Bluetooth data, and , in: 12th International Conference on Mobile Data Management, 2011 |
|
Proceedings of Interspeech (2011)
Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech, and , in: Proceedings of Interspeech, Florence, Italy, 2011 |
|
International Conference on Artificial Intelligence and Statistics (2011)
Deep Learning for Efficient Discriminative Parsing, , in: International Conference on Artificial Intelligence and Statistics, 2011 |
|
The Eleventh IEEE International Workshop on Visual Surveillance (2011)
Detection-Based Multi-Human Tracking Using a CRF Model, , and , in: The Eleventh IEEE International Workshop on Visual Surveillance, 2011 |
|
Proceedings of Corpus Linguistics Conference (2011)
Disambiguating discourse connectives using parallel corpora: senses vs. translations, , , , , and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 104-105, 2011 |
|
Proceedings of ACL-HLT 2011 Student Session (2011)
Disambiguating Temporal-Contrastive Discourse Connectives for Machine Translation, , in: Proceedings of ACL-HLT 2011 Student Session, Association for Computational Linguistics, Portland, OR, pages 46--51, 2011 |
|
Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue (2011)
Engagement-based Multi-party Dialog with a Humanoid Robot, , , , , , and , in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011 |
|
International Conference on Ambient Computing, Applications, Services and Technologies (2011)
Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, , in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011 |
IEEE Conference on Automatic Face and Gesture Recognition (2011)
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 525-530, IEEE, 2011 |
|
Exploiting observers' judgements for nonverbal group interaction analysis, , and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 6, IEEE, 2011 |
|
IEEE Conference on Computer Vision and Pattern Recognition (2011)
Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2011 |
|
Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (2011)
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011 |
|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 |