Publications of Idiap sorted by recency
Learning Goal-oriented Bimanual Dough Rolling Using Dynamic Heterogeneous Graph Based on Human Demonstration, , , , , , , and , in: Proc. IEEE Intl Conf. on Robotics and Biomimetics (ROBIO), 2024 |
A Minimum-Jerk Approach to Handle Singularities in Virtual Fixtures, , and , in: IEEE Robotics and Automation Letters (RA-L), 9(11):10256-10263, 2024 |
|
Intuitive Robot Programming, , , , and , in: Ergonomics in Robotics: Advances and Innovations, Springer, 2025 |
Impact of Speech Mode in Automatic Pathological Speech Detection, and , in: EUSIPCO, IEEE, 2024 |
[URL] |
Are there identifiable structural parts in the sentence embedding whole?, and , in: Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2024 |
Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification, and , in: Proceedings of the 9th Workshop on Representation Learning for NLP, 2024 |
[URL] |
Exploring Italian sentence embeddings properties through multi-tasking, , , and , in: Tenth Italian Conference on Computational Linguistics, 2024 |
Biologically Inspired Spiking Neural Networks for Speech Recognition, , EPFL/EDEE, 2024 |
[DOI] |
BLM-It - Blackbird Language Matrices for Italian: A CALAMITA Challenge, , , and , in: Proceedings of the 10th Italian Conference on Computational Linguistics, 2024 |
Parkinson's Disease Detection through Formant and F0 Analysis at Syllable Level, and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
|
Can We Learn to Select the Right Algorithm for OOD Generalization?, and , in: Out Of Distribution Generalization in Computer Vision, Workshop at ECCV, 2024 |
Neural Redshift: Random Networks are not Random Functions, , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 |
CulturePark: Boosting Cross-cultural Understanding in Large Language Models, , , , , and , in: Advances in Neural Information Processing Systems (NeurIPS), 2024 |
Robust Manipulation Primitive Learning via Domain Contraction, , , and , in: Proceedings of Conference on Robot Learning, 2024 |
|
Mirror-based Full-View Finger Vein Authentication with Illumination Adaptation, , , , and , in: IEEE Transactions on Circuits and Systems for Video Technology, 2024 |
[DOI] |
A Stochastic Approach to Contact-rich Manipulation, , Ecole Polytechnique Fédérale de Lausanne, 2024 |
|
Robot Learning using Tensor Networks, , Ecole Polytechnique Fédérale de Lausanne, 2024 |
[DOI] |
Annotator-centric Active Learning for Subjective NLP Tasks, , , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2024 |
Identifying Privacy Personas, and , in: Proceedings on Privacy Enhancing Technologies, 2025 |
|
GADePo: Graph-Assisted Declarative Pooling Transformers for Document-Level Relation Extraction, , , and , in: Proceedings of the 3rd Workshop on Knowledge Augmented Methods for NLP, Association for Computational Linguistics, 2024 |
[DOI] [URL] |
Strong and Efficient Baselines for Open Domain Conversational Question Answering, , and , in: Findings of EMNLP, Association for Computational Linguistics, 2023 |
[DOI] [URL] |
Posterior-based analysis of spatio-temporal features for Sign Language Assessment, , , , and , Idiap-RR-11-2024 |
|
Hardware-effective Approaches for Skill Extraction in Job Offers and Resumes, , , , , , and , in: The 4th Workshop on Recommender Systems for Human Resources, in conjunction with the 18th ACM Conference on Recommender Systems, 2024 |
[URL] |
DiffuCOMET: Contextual Commonsense Knowledge Diffusion, , , , , and , in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, Bangkok, Thailand, pages 4809–4831, Association for Computational Linguistics, 2024 |
[DOI] [URL] |
Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, and , Idiap-RR-09-2024 |
|
Nonparametric Variational Regularisation of Pretrained Transformers, and , in: First conference on Language Modelling, 2024 |
[URL] |
ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild, , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting, and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 |
[DOI] |
Recursive Forward Dynamics for Serial Kinematic Chains using Conformal Geometric Algebra, and , in: Proc. Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2024 |
Adversarial Robustness Analysis in Automatic Pathological Speech Detection Approaches, and , in: Interspeech, 2024 |
|
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , Idiap-RR-10-2024 |
|
Synergizing Natural Language Towards Enhanced Shared Autonomy, , and , in: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024 |
[URL] |
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks, and , in: Frontiers in Neuroscience, 18(1449181), 2024 |
[DOI] |
Missed Opportunities in Building Energy Performance Assessment, , , and , in: Journal of Sustainable Real Estate, 16(1), 2024 |
[DOI] |
CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition, , , and , in: 18th IEEE Int. Conference on Automatic Face and Gesture Recognition (FG), Istanbul, 2024 |
|
Sharingan: A Transformer Architecture for Multi-Person Gaze Following, , and , in: Int. Conference Computer Vision and Pattern Recognition (CVPR), Seattle, 2024 |
|
Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following, , , and , in: Int. Conf. Computer Vision and Pattern Recognition (CVPR), Workshop on Gaze Estimation and Prediction in the Wild, 2024 |
|
Segmenting Object Affordances: Reproducibility and Sensitivity to Scale, , , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Image-guided topic modeling for interpretable privacy classification, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Comparing Stability and Discriminatory Power of Hand-crafted Versus Deep Radiomics: A 3D-Printed Anthropomorphic Phantom Study, , , , , , , , , , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
GLoFool: global enhancements and local perturbations to craft adversarial images, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Explaining models relating objects and privacy, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024 |
[URL] |
Sparse multi-view hand-object reconstruction for unseen environments, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024 |
[URL] |
Open-Vocabulary Object 6D Pose Estimation, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 |
[URL] |
Test-time adaptation for 6D pose tracking, , and , in: Pattern Recognition, 152, 2024 |
[DOI] [URL] |
Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models, and , in: Acoustics, 6:470-488, 2024 |
[DOI] |
Refining Tuberculosis Detection in CXR Imaging: Addressing Bias in Deep Neural Networks via Interpretability, , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
Face Liveness Detection Competition (LivDet-Face) - 2024, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
Feature Representations for Automatic Meerkat Vocalization Classification, , , and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection, and , in: International Joint Conference on Biometrics, 2024 |
|
Towards Wine Tasting Activity Recognition for a Digital Sommelier, , , and , in: Proc. ACM Workshop on Exploring Innovative Technology for Commensality and Human-Food Interaction, 2024 |
Integrating large language models and ASR systems using confidence measures and prompting, , Idiap-Com-02-2024 |
|
On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis, and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
Sentiment Analysis using pretrained LLMs, , and , Idiap-RR-05-2024 |
|
A Novel and Responsible Dataset for Face Presentation Attack Detection on Mobile Devices, , , , , and , in: The IEEE International Joint Conference on Biometrics, Buffalo, New York, pages 8, 2024 |
|
Vascular Biometrics Experiments on Candy -- A New Contactless Finger-Vein Dataset, , , , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR), Calcutta (India), 2024 |
|
SWEET - An Open Source Modular Platform for Contactless Hand Vascular Biometric Experiments, , , , and , in: arXiv, 2024 |
[DOI] [URL] |
Assessing the Reliability of Biometric Authentication on Virtual Reality Devices, , and , in: Proceedings of IEEE International Joint Conference on Biometrics, 2024 |
|
Towards interfacing large language models with ASR systems using confidence measures and prompting, , , , and , in: Proceedings of Interspeech, pages 2980-2984, 2024 |
[DOI] |
Factors that Affect Personalization of Robots for Older Adults, , and , in: CONCATENATE Workshop at HRI 2023 in Stockholm, Sweden, 2023 |
[URL] |
A System for Human-Robot Teaming through End-User Programming and Shared Autonomy, , , , , and , in: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pages 231-239, 2024 |
[DOI] [URL] |
Generative AI Literacy: Twelve Defining Competencies, , and , in: ACM Digital Government: Research and Practice, 2024 |
[DOI] [URL] |
Feature Representations for Automatic Meerkat Vocalization Classification, , , and , Idiap-RR-06-2024 |
|
gafro: Geometric Algebra for Robotics, , and , in: IEEE Robotics and Automation Magazine, 2024 |
|
M3BAT: Unsupervised Domain Adaptation for Multimodal Mobile Sensing with Multi-Branch Adversarial Training, , and , in: PACM on Interactive, Mobile, Wearable, and Ubiquitous Technologies (IMWUT), 8(2):46, 2024 |
[DOI] |
Group Membership Verification via Nonlinear Sparsifying Transform Learning, , , , , , and , in: IEEE Access, 12:86739-86751, 2024 |
[DOI] [URL] |
Performing And Detecting Backdoor Attacks on Face Recognition Algorithms, , Ecole Polytechnique Fédérale de Lausanne, 2024 |
|
Logic Learning from Demonstrations for Multi-step Manipulation Tasks in Dynamic Environments, , , and , in: IEEE Robotics and Automation Letters, 2024 |
|
Modality Agnostic Heterogeneous Face Recognition with Switch Style Modulators, and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
Logic-Skill Programming: An Optimization-based Approach to Sequential Skill Planning, , , and , in: Proc. Robotics: Science and Systems (RSS), 2024 |
|
Configuration Space Distance Fields for Manipulation Planning, , , and , in: Robotics: Science and Systems (RSS), 2024 |
|
A Unified Model for Gaze Following and Social Gaze Prediction, , , and , in: The 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024 |
|
Representing Robot Geometry as Distance Fields: Applications to Whole-body Manipulation, , , and , in: IEEE International Conference on Robotics and Automation, 2024 |
|
Online Multi-Contact Receding Horizon Planning via Value Function Approximation, , , , , , , , , , and , in: IEEE Transactions on Robotics (T-RO), 2024 |
|
An Optimal Control Formulation of Tool Affordance Applied to Impact Tasks, , , and , in: IEEE Transactions on Robotics (T-RO), 2024 |
|
A Probabilistic Approach to Multi-Modal Adaptive Virtual Fixtures, , , , , , and , in: IEEE Robotics and Automation Letters (RA-L), 2024 |
|
Towards Robo-Coach: Robot Interactive Stiffness/Position Adaptation for Human Strength and Conditioning Training, , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2024 |
D-LGP: Dynamic Logic-Geometric Program for Reactive Task and Motion Planning, , and , in: IEEE International Conference on Robotics and Automation, 2024 |
|
Extending the Cooperative Dual-Task Space in Conformal Geometric Algebra, and , in: Proc. IEEE Intl Conf. on Robotics and Automation, 2024 |
|
Generalized Policy Iteration using Tensor Approximation for Hybrid Control, , and , in: International Conference on Learning Representations (ICLR), 2024 |
|
Normalizing Flows for Speaker and Language Recognition Backend, , , , and , in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024 |
|
DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews, , , , , and , in: Proceedings of the 6th Clinical Natural Language Processing Workshop, Association for Computational Linguistics, 2024 |
|
A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference, , and , in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024 |
Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models, , and , in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024 |
Why daylight should be a priority for urban planning, , , , , , , , , , , , , and , in: Journal of Urban Management, 2024 |
[DOI] [URL] |
From Modalities to Styles: Rethinking the Domain Gap in Heterogeneous Face Recognition, and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2024 |
|
On Learning to Classify Meerkat Calls, , Idiap-Com-01-2024 |
|
Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement, , , and , in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders, , , , and , in: Findings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions, , and , in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Comparing Data-driven and Handcrafted Features for Dimensional Emotion Recognition, , and , in: International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Developing 3D-Printed Wrist Splints for Distal Radius and Scaphoid Fractures, , , , , , and , in: Journal of Wrist Surgery, 2024 |
[DOI] [URL] |
Understanding the effects of language-specific class imbalance in multilingual fine-tuning, and , in: Findings of the European chapter of Association for Computational Linguistics, 2024 |
|
Generalization and Personalization of Machine Learning for Multimodal Mobile Sensing in Everyday Life, , EPFL, 2023 |
|
Vulnerability of Face Age Verification to Replay Attacks, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024 |
|
EdgeFace: Efficient Face Recognition Model for Edge Devices, , , , and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2024 |
|
PRIMIS: Privacy-Preserving Medical Image Sharing via Deep Sparsifying Transform Learning with Obfuscation, , , , , , , and , in: Journal of Biomedical Informatics, Elsevier, 150, 2024 |
[DOI] [URL] |
Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition, , and , in: 49th IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2024 |
[DOI] [URL] |
Heterogeneous Face Recognition Using Domain Invariant Units, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Transformers as Graph-to-Graph Models, , , and , in: Big Picture Workshop at EMNLP 2023, 2023 |
Nonparametric Variational Regularisation of Pretrained Transformers, and , in: ArXiv, 2023 |
[DOI] [URL] |
Safe Deep Neural Networks, , EPFL, 2024 |
|
Verification of an open-source Python library for the simulation of district heating networks with complex topologies, and , in: Energy, 2023 |
[DOI] [URL] |
Loose and Tight: Creative Formation but Rigid Use of Nominal Compounds in Conspiracist Texts, , and , in: The Journal of Creative Behavior, 2023 |
Absolute retinal blood flow in healthy eyes and in eyes with retinal vein occlusion, , , , , , , and , in: Microvascular Research, 152, 2024 |
[DOI] |
Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2024 |
|
Online Learning of Continuous Signed Distance Fields Using Piecewise Polynomials, , and , in: IEEE Robotics and Automation Letters (RA-L), 9(6):6020-6026, 2024 |
[DOI] [URL] |
Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024 |
|
From Zero Energy to Zero Power Buildings: a new paradigm for a sustainable transition of the building stock, , and , in: Sustainable Cities and Society, 2023 |
[DOI] [URL] |
Automatic Speech Analysis Framework for ATC Communication in HAAWAII, , , , , and , in: 13th SESAR Innovation Days, 2023 |
|
Content-based Objective Evaluation of Artificially Generated Sign Language Videos, , , , , and , in: ICASSP, 2024 |
|
Demystifying the Scribes behind the Voynich Manuscript using Computational Linguistic Techniques, , and , in: Proceedings of the 1st International Conference on the Voynich Manuscript, 2022 |
International Conference on the Voynich Manuscript 2022, , , , , , and , in: Proceedings of the International Conference on Historical Cryptology, 2023 |
UM-DFKI Maltese Speech Translation, , , , , , , , , and , in: Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), 2023 |
Findings of the IWSLT 2023 evaluation campaign, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the IWSLT conference, 2023 |
A Machine Learning Model for the Prediction of Building Hourly Heating Demand from CityGML Files: Training Workflow and Deployment as an API, , and , in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, pages 2932-2939, 2023 |
[DOI] [URL] |
Data-driven urban building energy modeling in Satom (CH): The energy savings potential and use of available renewable energy sources, , and , Politecnico di Torino, 2023 |
[URL] |
Meta-analysis informed machine learning: Supporting cytokine storm detection during CAR-T cell Therapy, , , , , , , , , and , in: Journal of Biomedical Informatics, 142, 2023 |
[DOI] |
Epidemiological and clinical analysis of Polish short-term and long-term travelers returning from tropical countries, , and , in: Travel Medicine and Infectious Disease, 55, 2023 |
[DOI] |
Defining the role of real-world data in cancer clinical research: the position of the European Organisation for Research and Treatment of Cancer, , and , in: European Journal of Cancer, 2023 |
A systematic review of biologically-informed deep learning models for cancer: fundamental trends for encoding and interpreting oncology data, , , , and , in: BMC Bioinformatics, 24(198), 2023 |
[DOI] |
Learning Lessons from the COVID-19 pandemic for Real World Evidence research in Oncology–shared perspectives from an international consortia, , and , in: ESMO Open, 2023 |
What do individuals with visual impairment need and want from a dialogue-based digital assistant?, , , , and , in: Clinical and Experimental Optometry, 2023 |
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports, , , , , and , in: Proceedings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023 |
Transformers, Tables and Frame Semantics, , , and , in: International Conference on Semantic Computing, 2023 |
SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data, , , , , and , in: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Association for Computational Linguistics, 2023 |
[DOI] [URL] |
Introduction to Mathematical Language Processing: Informal Proofs, Word Problems, and Supporting Tasks, and , in: Transactions of the ACL, 2023 |
A Canonical Context-preserving Representation for Open IE: Extracting Semantically Typed Relational Tuples from Complex Sentences, , , and , in: Knowledge-based Systems, 2023 |
Learning Disentangled Representations for Natural Language Definitions, , , and , in: Findings of the European chapter of Association for Computational Linguistics, 2023 |
|
Assessment of Subsidization Strategies for Multi-Objective Optimization of Energy Efficiency Measures for Building Renovation at District Scale, , , , and , in: Energies, 16(15), 2023 |
[DOI] |
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning, , , , and , in: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024 |
[URL] |
A Symbolic Framework for Systematic Evaluation of Mathematical Reasoning with Transformers, , , and , in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024 |
[URL] |
Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder, , , and , in: Transactions on Machine Learning Research (TMLR), 2024 |
[URL] |
Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup, , and , in: International Conference on Machine Learning (ICML), 2024 |
[URL] |
Learning diverse features in vision transformers for improved generalization, , , and , in: ICML 2023: The Second Workshop on Spurious Correlations, Invariance and Stability, 2023 |
[URL] |
Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks, , , , and , in: NeurIPS Workshop on Diffusion Models, 2023 |
[URL] |
Energy assessment of a district by integrating solar thermal in district heating network: a dynamic analysis approach, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
ZooPFL: Exploring Black-box Foundation Models for Personalized Federated Learning, , , , , , , , and , in: NeurIPS 2024 Workshop on Federated Learning, 2024 |
[URL] |
Shortcut Bias Mitigation via Ensemble Diversity Using Diffusion Probabilistic Models, , , , , and , in: Under review, 2023 |
[URL] |
Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines, , , , , , and , in: Under review, 2023 |
[URL] |
Potential for district heating networks from waste heat: an assessment tool and its application to sewage treatment plants in the Canton of Zurich, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Suggesting disease associations for overlooked metabolites using literature from metabolic neighbors, , , , , , , , and , in: GigaScience, 12:13, 2023 |
[DOI] |
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation, , , , , , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, 2023 |
[URL] |
Enhancing Multi-modal Classification of Violent Events using Image Captioning, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), CEUR Workshop Proceedings, Jaén, Spain, 2023 |
[URL] |
Robust Face Presentation Attack Detection with Multi-channel Neural Networks, and , in: Handbook of Biometric Anti-Spoofing, Springer, 2023 |
|
Attacking Face Recognition with T-shirts: Database, Vulnerability Assessment and Detection, and , in: IEEE Access, 2023 |
|
Learning to Abstract with Nonparametric Variational Information Bottleneck, , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing, 2023 |
[URL] |
Human-Robot Collaboration in a Sanding Task, , , , , , , , and , in: Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 2023 |
|
Towards Improved Replicability of Human Studies in Human-Robot Interaction: Recommendations for Formalized Reporting, , , , , , , , , , and , in: Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, pages 629-633, 2023 |
|
Combating COVID-19 with charisma: Evidence on governor speeches in the United States, , , , , , and , in: The Leadership Quarterly, 2023 |
[DOI] [URL] |
Diversity and neocolonialism in Big Data research: Avoiding extractivism while struggling with paternalism, , , , , , and , in: Big Data & Society, 2023 |
[DOI] |
Integrated transcriptome landscape of ALS identifies genome instability linked to TDP-43 pathology, , , , , , , , , , , , , and , in: Nature Communications, 2023 |
RNA at a breaking point? Cytoplasmic cleavage and other post-transcriptional RNA processing in neurodevelopment and disease, , and , in: Frontiers in Molecular Neuroscience, 2023 |
The predicted RNA-binding protein regulome of axonal mRNAs, , , and , in: Genome Research, 2023 |
Robust Execution of Assembly Policies Using a Pose Invariant Task Representation, , , , , and , in: 20th International Conference on Ubiquitous Robots (UR), Honolulu, HI, USA, IEEE, 2023 |
|
Tensor Train for Global Optimization Problems in Robotics, , , and , in: International Journal of Robotics Research, 43(6):811-839, 2024 |
[DOI] |
EdgeFace: Efficient Face Recognition Model for Edge Devices, , , , and , Idiap-RR-01-2024 |
|
Whole-Body Ergodic Exploration with a Manipulator Using Diffusion, , and , in: IEEE Robotics and Automation Letters, 8(12):8581-8587, 2023 |
[DOI] [URL] |
The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation, and , in: Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 4408-4423, Association for Computational Linguistics, 2023 |
[DOI] |
Enhancing user acceptance in automated systems with human-centric lighting: the role of visual comfort, personality, and preference, , , , , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph Transformer, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023 |
|
Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training, , , , , , and , in: Proc. 13th SESAR Innovation Days, Seville, Spain, 2023 |
[DOI] [URL] |
A Multitask and Kernel approach for Learning to Push Objects with a Task-Parameterized Deep Q-Network, , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023 |
|
Reactive Anticipatory Robot Skills with Memory, , and , in: Robotic Research, pages 436-451, Springer, 2023 |
|
Programming industrial robots from few demonstrations, , in: Human-Robot Collaboration: Unlocking the potential for industrial applications, pages 9-37, Institution of Engineering and Technology (IET), 2023 |
Efficient Grapevine Structure Estimation in Vineyards Conditions, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, pages 712-720, 2023 |
[URL] |
Mitigating Demographic Bias in Face Recognition via Regularized Score Calibration, and , in: IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, IEEE/CVF, 2024 |
|
Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, , , , , and , in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023 |
|
Coordinated Multi-Robot Shared Autonomy Based on Scheduling and Demonstrations, , , , , , , and , in: IEEE Robotics and Automation Letters, 8(12):8335-8342, 2023 |
[DOI] [URL] |
The Suisse Romande Local News Dataset, and , Idiap-Com-03-2023 |
|
Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding, , , , , , , , , , and , in: Aerospace, 10(10):898, 2023 |
[DOI] [URL] |
An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain, , , , , , and , in: Aerospace, 10(10):876, 2023 |
[DOI] [URL] |
The Idiap Speech Synthesis System for the Blizzard Challenge 2023, , , and , in: Proc. 18th Blizzard Challenge Workshop, 2023 |
[DOI] |
Modeling Structured Data in Attention-based Models, , EPFL, 2023 |
[URL] |
ProGAP: Progressive Graph Neural Networks with Differential Privacy Guarantees, and , in: The 17th ACM International Conference on Web Search and Data Mining, 2024 |
|
BLESS: Benchmarking Large Language Models on Sentence Simplification, , , , , , and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023 |
|
Can Language Models Learn Analogical Reasoning? Investigating Training Objectives and Comparisons to Human Performance, and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, Association for Computational Linguistics, 2023 |
|
Development and comparison of adaptive data-driven models for thermal comfort assessment and control, , , , , and , in: Total Environment Research Themes, 8, 2023 |
[DOI] [URL] |
Benefits of Max Pooling in Neural Networks: Theoretical and Experimental Evidence, , and , in: Transactions on Machine Learning Research, 2023 |
Practical computational imaging by use of spatiotemporal light modulation: from simulations to applications in biological microscopy, , EPFL, 2023 |
[DOI] |
Affordance segmentation of hand-occluded containers from exocentric images, , , , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
Black-box Attacks on Image Activity Prediction and its Natural Language Explanations, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
Privacy-Preserving Machine Learning on Graphs, , EPFL, 2023 |
[DOI] |
Document-level Text Simplification with Coherence Evaluation, , , and , in: Proceedings of the Second Workshop on Text Simplification, Accessibility and Readability, 2023 |
|
Generalizable Automatic Classification of Sleep Stages, , Idiap-Com-02-2023 |
|
From Nano to Macro: An overview of the IEEE Bio Image and Signal Processing Technical Committee, , , , , , , , and , in: IEEE Signal Processing Magazine, 40(4):61-71, 2023 |
[DOI] [URL] |
SynthDistill: Face Recognition with Knowledge Distillation from Synthetic Data, , and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
[DOI] |
Multi-image deconvolution of thermal images with a boundary condition weighting scheme, , , , and , in: Target and Background Signatures IX, International Society for Optics and Photonics, Amsterdam, pages 149-158, SPIE, 2023 |
[DOI] [URL] |
The Unconstrained Ear Recognition Challenge 2023: Maximizing Performance and Minimizing Bias, and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
EFaR 2023: Efficient Face Recognition Competition, , , , and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
The DCU-EPFL Enhanced Dependency Parser at the IWPT 2021 Shared Task, , , , and , in: Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies, Online, pages 204-212, Association for Computational Linguistics, 2021 |
|
Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers, , , , and , in: Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, Seattle, USA, pages 89–100, 2022 |
|
Characterizing Swiss Alpine Lakes: from Wikipedia to Citizen Science, and , in: ACM Journal on Computing and Sustainable Societies, 2023 |
|
Urban Crowdsourcing Platforms across the World: A Systematic Review, and , in: ACM Digital Government: Research and Practice, 2023 |
[DOI] [URL] |
Understanding the Social Context of Eating with Multimodal Smartphone Sensing: The Role of Country Diversity, , and , in: 25th ACM International Conference on Multimodal Interaction, 2023 |
[DOI] [URL] |
Health Talk: Understanding Practices of Popular Professional YouTubers, , , , and , in: Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia, 2022 |
Novel Methods For Detection And Analysis Of Atypical Aspects In Speech, , École Polytechnique Fédérale de Lausanne, 2023 |
[DOI] |
Claim-Dissector: An Interpretable Fact-Checking System with Joint Re-ranking and Veracity Prediction, , and , in: Association for Computational Linguistics, Findings of the Association for Computational Linguistics: ACL 2023:10184–10205, 2023 |
[URL] |
ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023 |
|
Toward responsible face datasets: modeling the distribution of a disentangled latent space for sampling face images from demographic groups, , and , in: IEEE International Joint Conference on Biometrics, 2023 |
|
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers, , , , and , in: Aerospace, 10(5), 2023 |
[DOI] [URL] |
Approximating Optimal Morphing Attacks using Template Inversion, , and , in: IEEE International Joint Conference on Biometrics, 2023 |
[DOI] |
Validating Automatic Speech Recognition and Understanding for Pre-Filling Radar Labels-Increasing Safety While Reducing Air Traffic Controllers' Workload, , , , , , , and , in: Aerospace, 10(6):538, 2023 |
[DOI] |
Learning Joint Space Reference Manifold for Reliable Physical Assistance, , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 10412-10417, 2023 |
[DOI] |
A Geometric Optimal Control Approach for Imitation and Generalization of Manipulation Skills, , , , and , in: Robotics and Autonomous Systems, 2023 |
VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, , , and , in: Proc. IEEE International Conference on Robotics and Automation (ICRA), 2023 |
|
PAAQ: Paired Alternating AcQuisitions for Virtual High Frame Rate Multichannel Cardiac Fluorescence Microscopy, , , and , in: Biological Imaging, 3:e20, 2023 |
[DOI] |
Efficient compressed sensing reconstruction for 3D fluorescence microscopy using OptoMechanical Modulation Tomography (OMMT) with a 1+2D regularization, and , in: Optics Express, 31(20):31718-31733, 2023 |
[DOI] |
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes, , , and , in: IEEE International Joint Conference on Biometrics, 2023 |
|
A VAE for Transformers with Nonparametric Variational Information Bottleneck, and , in: The Eleventh International Conference on Learning Representations, 2023 |
[URL] |
Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, and , in: IJCB, 2023 |
|
Idiap Scientific Report 2022, , , , , , , , , , , , , , , , , and , Idiap-RR-05-2023 |
|
Predicting is not understanding: Recognizing and addressing underspecification in machine learning, , and , in: European Conference on Computer Vision, pages 458-476, Springer, 2022 |
On matching data and model in LF-MMI-based dysarthric speech recognition, , École polytechnique fédérale de Lausanne, 2023 |
[DOI] [URL] |
Text Representation Learning for Low Cost Natural Language Understanding, , École polytechnique fédérale de Lausanne, 2023 |
[DOI] [URL] |
HyperMixer: An MLP-based Low Cost Alternative to Transformers, , , , , , and , in: Proc. of the 61st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Toronto, Canada, pages 15632-15654, 2023 |
[DOI] |
Demonstration-guided Optimal Control for Long-term Non-prehensile Planar Manipulation, , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 4999-5005, 2023 |
[DOI] |
Verification of PyDHN - a Python library for the thermo-hydraulic simulation of district heating networks - through the DESTEST, , and , in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, IBPSA, 2023 |
[DOI] [URL] |
Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers’ Workload, , , , , , , , , , , and , in: Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023), Eurocontrol (Europe), FAA (U.S.), Savannah, Georgia, USA, 2023 |
[URL] |
Diffusion Transformer for Adaptive Text-to-Speech, and , in: Proc. 12th ISCA Speech Synthesis Workshop (SSW 12), 2023 |
[DOI] |
Intelligent Technologies: Concepts, Applications, and Future Directions, Volume 2, and , Springer, volume 1098, 2023 |
[DOI] |
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, , , and , in: Proceedings of Interspeech, 2023 |
|
Periscope: A Robotic Camera System to Support Remote Physical Collaboration, , , , and , in: Proceedings of the ACM on Human Computer Interaction, 2023 |
|
Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report, , , , , and , in: Proceedings of Interspeech, pages 4573-4577, 2023 |
[DOI] [URL] |
Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, and , in: Proceedings of Interspeech, pages 156-160, 2023 |
[DOI] [URL] |
HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition, , , and , in: Proc. Interspeech 2023, Ireland, 2023 |
|
Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding, and , in: Proc. INTERSPEECH 2023, pages 1109-1113, 2023 |
[DOI] |
Attacking Face Recognition with T-shirts: Database, Vulnerability Assessment and Detection, and , Idiap-RR-08-2023 |
|
Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, and , Idiap-RR-09-2023 |
|
Implementing contextual biasing in GPU decoder for online ASR, , , , , , and , in: Proc. Interspeech 2023, 2023 |
|
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice, , , and , in: Proc. Interspeech 2023, 2023 |
[URL] |
Development of 3D-printed Patient-Specific Anatomical Braces (PSAB) for Distal Radius and Scaphoid Fractures, , , , , , and , in: Journal of Wrist Surgery, 2023 |
Keep Sensors in Check: Disentangling Country-Level Generalization Issues in Mobile Sensor-Based Models with Diversity Scores, , , and , in: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023 |
|
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?, and , in: Proceedings of Interspeech, 2023 |
|
Implementing contextual biasing in GPU decoder for online ASR, , , , , , and , Idiap-RR-02-2023 |
|
Geometric Algebra for Optimal Control with Applications in Manipulation Tasks, and , in: IEEE Transactions on Robotics, 2023 |
|
Approximating Optimal Morphing Attacks using Template Inversion, , and , Idiap-RR-07-2023 |
|
The rise of artificial intelligence reading of chest X-rays for enhanced TB diagnosis and elimination, , , , , , , , and , in: The International Journal of Tuberculosis and Lung Disease, 27(5):367-372, 2023 |
[DOI] [URL] |
Referencing in YouTube Knowledge Communication Videos, and , in: ACM International Conference on Interactive Media Experiences (IMX '23), June 2023, Nantes, France, 2023 |
|
Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework, and , in: 2nd ACM International Workshop on Multimedia AI against Disinformation (MAD '23), June 12, 2023, Thessaloniki, Greece, 2023 |
|
Framing the News: From Human Perception to Large Language Model Inferences, and , in: International Conference on Multimedia Retrieval (ICMR '23), June 12-15, 2023, Thessaloniki, Greece, 2023 |
|
Learning and Optimization of Anticipatory Feedback Controllers for Robot Manipulation, , École Polytechnique Fédérale de Lausanne, 2023 |
[DOI] |
Automatic identification of storytelling responses to past-behavior interview questions via machine learning, , , , , and , in: International Journal of Selection and Assessment, 2023 |
|
ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), pages 17408-17419, 2023 |
[DOI] |
A lexical-availability-based framework from short communications for automatic personality identification, , , , and , in: Cognitive Systems Research, 79:126-137, 2023 |
[DOI] [URL] |
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, , , , , , , , and , in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023 |
|
Sparse Autoencoders for Speech Modeling and Recognition, , École polytechnique fédérale de Lausanne, 2023 |
[DOI] |
Stop Wasting my FLOPS: Improving the Efficiency of Deep Learning Models, , École Polytechnique Fédérale de Lausanne, 2022 |
[DOI] |
Automatic pathological speech assessment, , École polytechnique fédérale de Lausanne, 2022 |
[DOI] |
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, , , and , Idiap-RR-03-2023 |
|
Quantified Canine: Inferring Dog Personality From Wearables, , , , , and , in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI) 2023, Association for Computing Machinery, 2023 |
[DOI] |
Complex Daily Activities, Country-Level Diversity, and Smartphone Sensing: A Study in Denmark, Italy, Mongolia, Paraguay, and UK, , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI), Association for Computing Machinery, 2023 |
[DOI] |
Ranking parameters in urban energy models for various building forms and climates using sensitivity analysis, , , and , in: Building Simulation, 2022 |
[DOI] |
Situated Participatory Design: A Method for In Situ Design of Robotic Interaction with Older Adults, , and , in: CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023 |
[DOI] [URL] |
A Bayesian approach to machine learning model comparison, , Idiap-Com-01-2023 |
|
On Interventional Probing in High Dimensions: An NLI Case Study, , , and , in: Findings of the 17th European Chapter of the Association for Computational Linguistics, 2023 |
Graph Refinement for Coreference Resolution, and , in: Findings of the Association for Computational Linguistics: ACL 2022, 2022 |
Why Scholars Are Diagramming Neural Network Models, , and , in: 13th International Conference on the Theory and Application of Diagrams, 2022 |
Shallow Discourse Parsing for Open Information Extraction and Text Simplification, , and , in: 3rd Workshop on Computational Approaches to Discourse (CODI) @ COLING, 2022 |
digital ECMT cancer trial matching tool, an open source research application to support oncologists in the identification of precision medicine clinical trials, , and , in: JCO Clinical Cancer Informatics, 2022 |
Assessing the communication gap between AI models and healthcare professionals: explainability, utility and trust in AI-driven clinical decision-making, , , , , , and , in: Artificial Intelligence, 2022 |
Patient Attrition in Molecular Tumour Boards: A Systematic Review, , , , , and , in: British Journal of Cancer, 2022 |
Symmetry-induced Disentanglement on Graphs, , and , in: Advances in Neural Information Processing Systems 35, 2022 |
Transformers and the representation of biomedical background knowledge, , , , , and , in: Computational Linguistics, 2022 |
Towards energy hubs: an innovative Geographic Information System based approach for cluster definition, , , and , in: ICREC 2022 Conference Proceedings, 2022 |
An exploratory interplay between daylight, general and task lighting for visual comfort and electricity savings in a personal office space, , , , , , and , in: Proceedings of ISES and IEA SHC International Conference on Solar Energy for Buildings and Industry, Kassel, Germany, 2022 |
Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator, , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition, , , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
Integrating daylight with general and task lighting: A longitudinal in-the-wild study in individual and open space working areas, , , , , , and , in: Solar Energy Advances, 2, 2022 |
[DOI] [URL] |
Identification of existing tools and workflows for solar neighborhood planning, , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: SHC Task 63: Solar Neighborhood Planning, Subtask C: Solar Planning Tools, IEA, 2022 |
[DOI] |
Natural Language Processing in Healthcare, , , , and , Taylor & Francis Groups, 2022 |
[DOI] [URL] |
Response Burden and Dropout in a Probability-Based Online Panel Study – A Comparison between an App and Browser-Based Design, , , , and , in: Journal of Official Statistics, 2022 |
[DOI] [URL] |
Differentiation of motor speech disorders through the seven deviance scores from MonPaGe-2.0.s, , and , in: Brain Sciences, 12(11):1471-1487, 2022 |
IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
Meta-analysis of the amyotrophic lateral sclerosis spectrum uncovers genome instability, , , , , , , , , , , and , in: BioRxiv, 2022 |
IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
The RNA Binding proteome of axonal mRNAs in sympathetic neurons, , and , in: BioRxiv, 2022 |
Physiological intron retaining transcripts in the cytoplasm abound during human motor neurogenesis, , , , , , , , and , in: Genome Research, 2022 |
Efficient Training of Low-Curvature Neural Networks, , , and , in: NeurIPS 2022, 2022 |
[URL] |
VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, , , and , Idiap-RR-04-2023 |
|
Passive Bimanual Skills Learning from Demonstration with Motion Graph Attention Networks, , , , and , in: IEEE Robotics and Automation Letters (RA-L), 7(2):4917-4923, 2022 |
Robot Cooking with Stir-fry: Bimanual Non-prehensile Manipulation of Semi-fluid Objects, , , , , , and , in: IEEE Robotics and Automation Letters (RA-L), 7(2):5159-5166, 2022 |
|
Vision-Language Pretraining: Current Trends and the Future, , and , in: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2022 |
[URL] |
SelecMix: Debiased Learning by Mixing up Contradicting Pairs, , , , , , and , in: ICML Workshop on Spurious Correlations, Invariance and Stability, 2022 |
EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering, , , , and , in: arXiv, 2022 |
ID and OOD performance are sometimes inversely correlated on real-world datasets, , , and , in: Advances in Neural Information Processing Systems (NeurIPS), 2023 |
SelecMix: Debiased Learning by Contradicting-pair Sampling, , , , , , and , in: Advances in Neural Information Processing Systems, 2022 |
Reasoning over vision and language: Exploring the benefits of supplemental knowledge, , , and , in: arXiv, 2022 |
Bayesian Recurrent Units and the Forward Backward Algorithm, and , in: Proc. Interspeech 2022, pages 4137-4141, 2022 |
[DOI] |
Readback Error Detection by Automatic Speech Recognition and Understanding -- Results of HAAWAII Project for Isavia’s Enroute Airspace, , , , , , , , , and , in: 11th SESAR Innovation Days, SESAR, pages 9, 2022 |
|
How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, , , , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Towards Smart Pruning: ViNet, a Deep-Learning Approach for Grapevine Structure Estimation, , , and , in: Computers and Electronics in Agriculture, 207:107736, 2023 |
[DOI] [URL] |
Your Day in Your Pocket: Complex Activity Recognition from Smartphone Accelerometers, , and , in: EAI Pervasive Health, 2022 |
|
Health Talk: Understanding Practices of Popular Professional YouTubers, , , , and , in: Int. Conf. on Mobile and Ubiquitous Multimedia, 2022 |
|
GAP: Differentially Private Graph Neural Networks with Aggregation Perturbation, , , and , in: 32nd USENIX Security Symposium (USENIX Security 23), 2023 |
|
Mechanical Artifacts in Optical Projection Tomography: Classification and Automatic Calibration, , , , and , in: Opt. Continuum, 1(12):2577-2589, 2022 |
[DOI] |
TextGraphs 2022 Shared Task on Natural Language Premise Selection, , , , and , in: Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing, 2022 |
[URL] |
Decomposing Natural Logic Inferences for Neural NLI, , , , and , in: BlackBoxNLP: Workshop on analyzing and interpreting neural networks for NLP, 2022 |
Diff-Explainer: Differentiable Convex Optimization for Explainable Multi-Hop Inference, , , , and , in: Transactions of the Association for Computational Linguistics, 2022 |
[DOI] |
Case-Based Abductive Natural Language Inference, , and , in: Proceedings of the 29th International Conference on Computational Linguistics, 2022 |
[URL] |
Generalization and Personalization of Mobile Sensing-Based Mood Inference Models: An Analysis of College Students in Eight Countries, , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 6(4), 2022 |
[DOI] |
What Do Compressed Multilingual Machine Translation Models Forget?, , , , , and , in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022 |
|
SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages, , , , , and , in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022 |
|
Imitation of Manipulation Skills Using Multiple Geometries, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022 |
|
Prepended Domain Transformer: Heterogeneous Face Recognition without Bells and Whistles, , and , in: IEEE Transactions on Information Forensics and Security, 2022 |
|
Innovators@SMM4H'22: An Ensembles Approach for self-reporting of COVID-19 Vaccination Status Tweets, , , , and , in: International Conference on Computational Linguistics (COLING 2022), 2022 |
|
Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers, , , and , in: International Conference on Language Resources and Evaluation (LREC 2022), 2022 |
|
Innovators@SMM4H'22: An Ensembles Approach for Stance and Premise Classification of COVID-19 Health Mandates Tweets, , , , and , in: International Conference on Computational Linguistics (COLING 2022), 2022 |
Automatic Summarization for Creative Writing: Denoising Auto-Encoder based Pipeline Method for Generating Summary of Movie Scripts, , , , and , in: Automatic Summarization for Creative Writing, International Conference on Computational Linguistics (COLING 2022), 2022 |
|
Automatic Minuting: A Pipeline Method for Generating Minutes, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology, 2022 |
|
An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology, 2022 |
|
An Empirical Comparison of Semantic Similarity Methods for Analyzing down-streaming Automatic Minuting task, , , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In ACL Anthology Proceedings, 2022 |
|
Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, , , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
Two Simple and Domain-independent Approaches for Early Detection of Anorexia, , , and , in: Early Detection of Mental Health Disorders by Social Media Monitoring: The First Five Years of the eRisk Project, pages 159-182, Springer International Publishing, 2022 |
[DOI] [URL] |
Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, , , and , in: ACM International Conference on Multimodal Interaction (ICMI Companion), 2022 |
[DOI] |
Towards Accessible Sign Language Learning and Assessment, , , and , in: ACM International Conference on Multimodal Interaction, Bangalore, India, pages 626-631, 2022 |
[DOI] |
Claim-Dissector: An Interpretable Fact-Checking System with Joint Re-ranking and Veracity Prediction, , and , Idiap-Com-03-2022 |
[URL] |
Leveraging Events Sub-Categories for Violent-Events Detection in Social Media, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022 |
[URL] |
UNSL at eRisk 2022: Decision policies with history for early classification, , , and , in: CEUR Workshop Proceedings, 2022 |
[URL] |
IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, , , , , , and , Idiap-RR-12-2022 |
|
IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, , , , , , and , Idiap-RR-13-2022 |
|
A surrogate gradient spiking baseline for speech command recognition, and , in: Frontiers in Neuroscience, 2022 |
[DOI] [URL] |
Local estimation of parametric point spread functions in thermal images via convolutional neural networks, , , and , in: SPIE sensors + imaging, Target and Background Signatures VIII, Berlin, Germany, pages 1227009 1-8, SPIE, 2022 |
[DOI] [URL] |
Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese, , , , and , in: Proceedings of the workshop on Deep Learning for Low-Resource NLP, 2022 |
[URL] |
Speech Modeling Using Sparse Autoencoders, and , Idiap-RR-11-2022 |
|
A Systems Approach Towards Remote Health-Monitoring in Older Adults: Introducing a Zero-Interaction Digital Exhaust, , , , , , , , , , , , and , in: npj Digital Medicine, 5(Article 116), 2022 |
|
An anomaly detection approach for backdoored neural networks: face recognition as a case study, and , in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), Darmstadt, Germany, 2022 |
|
On the detection of morphing attacks generated by GANs, and , in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), 2022 |
|
HyperMixer: An MLP-based Green AI Alternative to Transformers, , , , , , and , in: arXiv, 2022 |
A Variational AutoEncoder for Transformers with Nonparametric Variational Information Bottleneck, and , in: arXiv, 2022 |
[DOI] [URL] |
DHgeN: Automated Generation of District Heating Network Layouts for Feasibility Studies, and , in: -, 2022 |
|
Fairness Index Measures to Evaluate Bias in Biometric Recognition, and , in: International Conference on Pattern Recognition Workshops, 2022 |
|
Towards Lifelong Human Assisted Speaker Diarization, , , , , , , , , , , , and , in: Computer Speech & Language, 2022 |
[DOI] [URL] |
Reactive Anticipatory Robot Skills with Memory, , and , in: The International Symposium on Robotics Research, 2022 |
|
Modeling and Optimal Control of the Open Torque-Controlled Quadruped Robot Solo-12, , Idiap-Com-02-2022 |
|
On the detection of morphing attacks generated by GANs, and , Idiap-RR-07-2022 |
|
An anomaly detection approach for backdoored neural networks: face recognition as a case study, and , Idiap-RR-08-2022 |
[URL] |
Face Anthropometry Aware Audio-visual Age Verification, and , in: ACM Multimedia, 2022 |
|
Learning to Guide Online Multi-Contact Receding Horizon Planning, , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022 |
Pulmonary Tuberculosis Screening from Radiological Signs on Chest X-Ray Images Using Deep Models, , and , in: Union World Conference on Lung Health, The Union, 2022 |
Classifying the Social Media Author Profile Through a Multimodal Representation, , , and , in: Intelligent Technologies: Concepts, Applications, and Future Directions. Studies in Computational Intelligence, Springer, 2022 |
[DOI] [URL] |
drozBot: Using Ergodic Control to Draw Portraits, , and , in: IEEE Robotics and Automation Letters, 7, 2022 |
[DOI] [URL] |
Memory of Motion for Initializing Optimization in Robotics, , École Polytechnique Fédérale de Lausanne, 2022 |
|
Data Privacy Concerns as a Source of Resistance to Complete Mobile Data Collection Tasks via a Smartphone App, , , , and , in: Journal of Survey Statistics and Methodology, 2022 |
|
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, , , , and , in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022 |
[DOI] |
A Comparative Study Of Simulation Tools To Model The Solar Irradiation On Building Façades, , , , , , , , , , , , , , , and , in: Proceedings of SWC 2021: ISES Solar World Congress, ISES, 2021 |
[DOI] [URL] |
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering, , and , in: Proceedings of Interspeech, 2022 |
|
Conversational Speech Recognition Needs Data? Experiments with Austrian German, , , and , in: Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, pages 4684-4691, 2022 |
[URL] |
Using synthetic fingerprint images to test the performance of an AFIS system, , Université de Lausanne, 2022 |
|
Autoencoders Reloaded, and , in: Springer Biological Cybernetics, 2022 |
[DOI] [URL] |
Adversarial-free speaker identity-invariant representation learning for automatic dysarthric speech classification, and , in: Annual Conference of the International Speech Communication Association, 2022 |
|
Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech, , , , , , , , , and , in: Annual Conference of the International Speech Communication Association, pages 2188-2192, 2022 |
[DOI] |
Perceptual classification of motor speech disorders: the role of severity, speech task, and listener's expertise, , , and , in: Journal of Speech, Language, and Hearing Research, 2022 |
Sensing Eating Events in Context: A Smartphone-Only Approach, , , , , and , in: IEEE Access, 10, 2022 |
[DOI] [URL] |
How Did Europe’s Press Cover Covid-19 Vaccination News? A Five-Country Analysis, and , in: MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, 2022 |
[DOI] [URL] |
Borrowing from yourself: Faster future video segmentation with partial channel update, and , in: International Conference on Pattern Recognition, 2022 |
|
Saving energy by maximising daylight and minimising the impact on occupants: an automatic lighting system approach, , , , , , , and , in: Energy and Buildings, 2022 |
[DOI] |
Visually Grounded Interpretation of Noun-Noun Compounds in English, , , and , in: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, Association for Computational Linguistics, 2022 |
State-of-the-art retinal vessel segmentation with minimalistic models, , , , , and , in: Nature Scientific Reports, 12(6174), 2022 |
[DOI] |
A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022 |
|
Hierarchical Multi-task learning framework for Isometric-Speech Language Translation, , , and , in: ACL, 2022 |
|
IDIAP Submission@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text, and , in: ACL, 2022 |
|
IDIAP Submission@LT-EDI-ACL2022: Homophobia/Transphobia Detection in social media comments, and , in: ACL Proceedings, 2022 |
|
IDIAP Submission@LT-EDI-ACL2022 : Hope Speech Detection for Equality, Diversity and Inclusion, and , in: ACL, 2022 |
|
IDIAP_TIET@LT-EDI-ACL2022 : Hope Speech Detection in Social Media using Contextualized BERT with Attention Mechanism, , and , in: ACL, 2022 |
|
PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models, , , , , , and , in: ACL, 2022 |
|
The societal and ethical relevance of computational creativity, , and , in: Proceedings of the International Conference on Computational Creativity, 2020 |
Compositionality in English deverbal compounds: The role of the head, , and , in: The role of constituents in multiword expressions. Phraseology and Multiword Expressions, Language Science Press, Berlin, 2020 |
Compound or phrase or in between? Testing Linguistic Criteria for Compoundhood in English, and , in: Word Structure, 13(2):250-281, 2020 |
Biomarker identification using dynamic time warping analysis: a longitudinal cohort study of COVID-19 patients in a UK tertiary hospital, , , , , and , in: BMJ Open, 2022 |
Voyager: Data Discovery for Onboarding in Data Science, , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2022 |
Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective, , , , and , in: Findings of the ACL, 2022 |
To be or not to be an Integer? Encoding Variables for Mathematical Text, , , , and , in: Findings of the ACL, 2022 |
Establishment of CORONET, COVID-19 Risk in Oncology Evaluation Tool, to Identify Cancer Patients at Low Versus High Risk of Severe Complications of COVID-19 Infection Upon Presentation to Hospital, , , and , in: Clinical Cancer Informatics, 2022 |
Active Learning by Feature Mixing, , , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022 |
GeoNeRF: Generalizing NeRF with Geometry Priors, , and , in: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2022 |
[URL] |
End-to-End Accented Speech Recognition, , and , in: International Conference on Speech and Language Processing, Interspeech, ISCA, Graz, Austria, pages 2140-2144, 2019 |
[DOI] |
Generating Exact Lattices in The WFST Framework, , , , , , , , , , , , and , in: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, The 37th International Conference on Acoustics, Speech, and Signal Processing, Kyoto, Japan, pages 4213-4216, IEEE Signal Processing Society, 2012 |
[DOI] |
Efficient Depth-based Deep Learning Methods for Multi-Party Pose Estimation, , École polytechnique fédérale de Lausanne, 2021 |
[DOI] |
Gradient-based Methods for Deep Model Interpretability, , École polytechnique fédérale de Lausanne, 2021 |
[DOI] |
Learning strategies and representations for intuitive robot learning from demonstration, , EPFL, 2021 |
|
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, , , , and , Idiap-RR-06-2022 |
|
A two-step approach to leverage contextual data: speech recognition in air-traffic communications, , , , and , in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 |
|
Are GAN-based Morphs Threatening Face Recognition?, , , and , in: International Conference on Acoustics, Speech and Signal Processing, 2022 |
|
Custom attribution loss for improving generalization and interpretability of deepfake detection, , and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
Domain-Adversarial Based Model with Phonological Knowledge for Cross-Lingual Speech Recognition, , , , , and , in: Electronics, 10(24):1-15, 2021 |
[DOI] [URL] |
Experimental investigation on STFT phase Representations for deep learning-based dysarthric speech detection, and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
Domain-Specific Adaptation of CNN for Detecting Face Presentation Attacks in NIR, , , , , , , and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2022 |
|
From Key Positions to Optimal Basis Functions for Probabilistic Adaptive Control, , and , in: IEEE Robotics and Automation Letters, 2022 |
|
An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, , and , in: Proc. of Workshop on Emerging paradigms for robotic manipulation: from the lab to the productive world, ICRA, 2021 |
Modeling Source and System characteristics using Zero Frequency Filtering for Voice Activity Detection, , and , Idiap-Internal-RR-80-2021 |
Analysis of Vector Representations in Maintenance Logs in the Industry: Towards an Information Retrieval System, , , and , in: Journal of Research in Computing Science, 2021 |
Topic analysis and tracking from Mexico's President daily press briefing, , and , in: Journal of Research in Computing Science, 2021 |
|
Improving Generalization of Deepfake Detection with Data Farming and Few-Shot Learning, and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021 |
|
Automatic processing pipeline for collecting and annotating air-traffic voice communication data, , , , , , , , , and , in: Proceedings of 9th OpenSky Symposium 2020, OpenSky Network, Brussels, Belgium, pages 1-9, MDPI, 2021 |
|
Multi-Adversarial Learning for Cross-Lingual Word Embeddings, , and , in: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Online, pages 463-472, 2021 |
Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning, , , and , in: Proceedings of the 25th Conference on Computational Natural Language Learning, Online, pages 337-348, Association for Computational Linguistics, 2021 |
DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6039-6048, 2021 |
[URL] |
Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement, and , in: Transactions of the Association for Computational Linguistics (2021), 9:18, 2021 |
[DOI] [URL] |
Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation, and , Idiap-RR-21-2021 |
ParsiNLU: A Suite of Language Understanding Challenges for Persian, , , , , , , , , , , , , , , , , , , , , and , in: TACL, 2021 |
|
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers, , and , in: NeurIPS, 2021 |
|
Fairness in Biometrics: a figure of merit to assess biometric verification systems, and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021 |
[DOI] |
What is SemEval evaluating? A Systematic Analysis of Evaluation Campaigns in NLP, , , and , in: EMNLP, 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021 |
Longitudinal characterisation of haematological and biochemical parameters in cancer patients prior to and during COVID-19 reveals features associated with outcome, , , and , in: ESMO Open, 2021 |
Wave comparisons of clinical characteristics and outcomes of COVID-19 admissions - Exploring the impact of treatment and strain dynamics, , , , , , and , in: Journal of Clinical Virology, 2022 |
Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning, , , , , , , and , in: 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, pages 1-9, IEEE, 2021 |
[DOI] |
Number and quality of diagrams in scholarly publications is associated with number of citations, , and , in: Diagrams, 2021 |
Structuralist analysis for neural network system diagrams, , and , in: Diagrams, 2021 |
Scholarly AI system diagrams as an access point to mental models, , and , in: Diagrams, 2021 |
Similarity-Based Equational Inference in Physics, and , in: Physical Review Research, 2021 |
Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders, and , in: The 2021 Conference on Empirical Methods in Natural Language Processing, 2021 |
Hybrid Autoregressive Inference for Scalable Multi-hop Explanation Regeneration, , , and , in: Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022 |
Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety, , , , , , , , , , , , , and , in: Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021), The United States Federal Aviation Administration (FAA), EUROCONTROL, pages 10, 2021 |
[URL] |
On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning, , , and , in: Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021 |
Automated and unbiased discrimination of ALS from control tissue at single cell resolution, , , , , , , , and , in: Brain Pathology, 2021 |
Cytoplasmic cleavage of IMPA1 3' UTR is necessary for maintaining axon integrity, , , , , , , , , , and , in: Cell Reports, 2021 |
Aberrant cytoplasmic intron retention is a blueprint for RNA binding protein mislocalization in VCP-related amyotrophic lateral sclerosis, , , , , , , , and , in: Brain, 2021 |
Image-based deep learning reveals the responses of human motor neurons to stress and VCP-related ALS, , , and , in: Neuropathology and Applied Neurobiology, 2021 |
A Comprehensive Evaluation on Multi-channel Biometric Face Presentation Attack Detection, , and , Idiap-RR-02-2022 |
|
Robust Face Presentation Attack Detection with Multi-channel Neural Networks, and , Idiap-RR-03-2022 |
|
Bilateral Teleoperation with Object-Adaptive Mapping, , , , and , in: Complex & Intelligent Systems, 2021 |
|
Learning from Demonstration using Products of Experts: Applications to Manipulation and Task Prioritization, , and , in: International Journal of Robotics Research, 41(2):163-188, 2022 |
|
Motion Mappings for Continuous Bilateral Teleoperation, , , , , and , in: IEEE Robotics and Automation Letters, 6(3):5048-5055, 2021 |
|
Sequential Robot Imitation Learning from Observations, , , , and , in: International Journal of Robotics Research (IJRR), 2021 |
Tensor-variate mixture of experts for proportional myographic control of a robotic hand, , and , in: Robotics and Autonomous Systems, 142:103812, 2021 |
|
Editorial: Artificial Intelligence and Human Movement in Industries and Creation, , , , and , in: Frontiers in Robotics and AI, 8:712521, 2021 |
|
Towards an Engagement-Aware Attentive Artificial Listener for Multi-Party Interactions, , , , , and , in: Frontiers in Robotics and AI, 8:189, 2021 |
[DOI] [URL] |
Multimodal Neural Machine Translation System for English to Bengali, , , , , , and , in: Proceedings of the First Workshop on Multimodal Machine Translation for Low Resource Languages (MMTLRL 2021), Online (Virtual Mode), pages 31-39, INCOMA Ltd., 2021 |
[URL] |
Automatic Dialect Detection for Low Resource Santali Language, , , , , and , in: Proceedings of International Conference on Information Technology (OCIT), 2021 |
|
Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), , , , , , , , and , in: Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, pages 218–223, Association for Computational Linguistics, 2021 |
[DOI] [URL] |
Unshuffling data for improved generalization in visual question answering, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Beyond question-based biases: Assessing multimodal shortcut learning in visual question answering, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Perspectives and limitations of visible-thermal image pair synthesis via generative adversarial networks, , , , and , in: Security + Defence, Target and Background Signatures VII, Proc. of SPIE, online only, pages 1186509-1–1186509-8, SPIE, 2021 |
[DOI] [URL] |
Zurich Like New: Analyzing Open Urban Multimodal Data, , and , in: Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data, 2021 |
|
Professional YouTubers’ health videos as research material: Formulating a multi-method design in health psychology, , , , and , in: Methods in Psychology, Special Issue on Innovations in Qualitative Research, 5, 2021 |
|
A Sensor-Driven Visit Detection System in Older Adults’ Homes: Towards Digital Late-Life Depression Marker Extraction, , , , , , , , , , and , in: IEEE Journal of Biomedical And Health Informatics, 26(4):1560-1569, 2021 |
[DOI] [URL] |
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10-13, 2021 |
[DOI] |
Open-Set Speaker Identification pipeline in live criminal investigations, and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021 |
|
ROXSD: a Simulated Dataset of Communication in Organized Crime, , , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021 |
|
Extreme Learning Machines with feature selection using GA for effective prediction of fetal heart disease: A Novel Approach, , , and , in: Informatica, 45(3), 2021 |
[DOI] [URL] |
Optimization of robot configurations for motion planning in industrial riveting, , , and , in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021 |
|
Optimal Control Combining Emulation and Imitation to Acquire Physical Assistance Skills, , and , in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021 |
|
Implementation of machine learning techniques for the quasi real-time blind and electric lighting optimization in a controlled experimental facility, , , , , and , in: Journal of Physics: Conference Series, IOP Publishing, 2021 |
[DOI] [URL] |
Machine learning techniques for the daylight and electric lighting performance predictions, , and , in: Proceedings of Building Simulation 2021, 2021 |
Trajectory Prediction with Compressed 3D Environment Representation using Tensor Train Decomposition, , , and , in: International Conference on Advanced Robotics, 2021 |
|
Social Robot Co-Design Canvases: A Participatory Design Framework, , , and , in: ACM Transactions on Human-Robot Interaction, 11(1), 2022 |
[DOI] [URL] |
Application of Urban Scale Energy Modelling and Multi-Objective Optimization Techniques for Building Energy Renovation at District Scale, , , and , in: Sustainability, 13(20), 2021 |
[DOI] [URL] |
BERTraffic: A Robust BERT-Based Approach for Speaker Change Detection and Role Identification of Air-Traffic Communications, , , , , , and , Idiap-RR-15-2021 |
District heating network modelling for future integration of solar thermal energy, , , and , in: Journal of Physics: Conference Series, pages 012089, IOP Publishing, 2021 |
[DOI] |
Adjustable Deterministic Pseudonymization of Speech, , and , in: Computer, Speech & Language, 72, 2022 |
[DOI] |
An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021 |
|
Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, Colombia, Cartagena, IEEE, 2021 |
|
Novel Methods for Incorporating Prior Knowledge for Automatic Speech Assessment, , École polytechnique fédérale de Lausanne (EPFL), 2021 |
|
Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers, , and , in: International Conference in Computer Vision - Workshops, 2021 |
|
Classifier Implementation for Spontaneous EEG Activity during Schizophrenic Psychosis, , , , and , in: Computacion y Sistemas (CyS), 25(3), 2021 |
[URL] |
Fusion of Acoustic and Linguistic Information Using Supervised Autoencoder for Improved Emotion Recognition, , and , in: 2nd Multimodal Sentiment Analysis Challenge (MuSe '21), October 24, 2021, Virtual Event, China, 2021 |
[DOI] |
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, , , , and , in: Proceedings of the 2021 International Conference on Multimodal Interaction, ACM, 2021 |
[DOI] |
Development of a lung segmentation algorithm for analog imaged chest X-Ray: preliminary results, , , , and , in: XV Brazilian Congress on Computational Intelligence, Joinville, Brazil, 2021 |
[URL] |
Vein Enhancement with Deep Auto-Encoders to improve Finger Vein Recognition, , and , in: Biometrics Special Interest Group (BIOSIG 2021), 2021 |
|
Probabilistic Iterative LQR for Short Time Horizon MPC, and , in: International Conference on Intelligent Robots and Systems, pages 579-585, 2021 |
[DOI] |
Multimodal Neural Machine Translation System for English to Bengali, , , , , , and , Idiap-RR-13-2021 |
|
Multi-channel Face Presentation Attack Detection Using Deep Learning, and , in: Deep Learning-Based Face Analytics, Springer International Publishing, 2021 |
|
Improving Generalization of Deepfake Detection by Training for Attribution, , and , in: International Workshop on Multimedia Signal Processing, 2021 |
|
Deep Learning Approaches for Auditory Perception in Robotics, , École polytechnique fédérale de Lausanne, 2021 |
|
Adjustable Deterministic Pseudonymization of Speech, , and , Idiap-RR-12-2021 |
|
Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, , , and , Idiap-RR-11-2021 |
|
Modeling and Inferring Attention between Humans or for Human-Robot Interactions, , Ecole Polytechnique Federale de Lausanne, 2021 |
[DOI] [URL] |
Supervised Speech Representation Learning for Parkinson's Disease Classification, and , in: ITG Conference on Speech Communication, 2021 |
|
Overview of the 8th Workshop on Asian Translation, , , , , , , , , , , , , , , and , in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 1-45, Association for Computational Linguistics, 2021 |
[URL] |
NLPHut's Participation at WAT2021, , , , , , , and , in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 146-154, Association for Computational Linguistics, 2021 |
[URL] |
Applying Attention-Based Models for Detecting Cognitive Processes and Mental Health Conditions, , , and , in: Cognitive Computation:18, 2021 |
[DOI] [URL] |
Active tuberculosis detection from frontal chest X-ray images, , Idiap-Com-01-2021 |
[URL] |
Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, , and , in: Proceedings of Interspeech, 2021 |
[DOI] |
Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR, , , , , , and , Idiap-RR-22-2021 |
|
LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021 |
[URL] |
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, , and , in: Proceedings of Interspeech, 2021 |
[URL] |
Examining the Social Context of Alcohol Drinking in Young Adults with Smartphone Sensing, , , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 5(3):26, 2021 |
[DOI] |
Rethinking the Role of Gradient-based Attribution Methods for Model Interpretability, and , in: International Conference on Learning Representations, 2021 |
|
Improving callsign recognition with air-surveillance data in air-traffic communication, , , and , Idiap-RR-20-2021 |
[URL] |
An Efficient Image-to-Image Translation HourGlass-based Architecture for Object Pushing Policy Learning, , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021 |
|
Face Liveness Detection Competition (LivDet-Face) - 2021, , , , , , , , , and , in: International Joint Conference on Biometrics, 2021 |
Robust Unsupervised Gaze Calibration using Conversation and Manipulation Attention Priors, and , in: ACM Transactions on Multimedia Computing, Communications, and Applications, 18(1):26, 2022 |
[DOI] [URL] |
PROMPT: Probabilistic Motion Primitives based Trajectory Planning, , , and , in: Proceedings of Robotics: Science and Systems, 2021 |
[DOI] [URL] |
AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, Toronto, Canada, pages 7328–7332, 2021 |
|
Multi-task Single Channel Speech Enhancement Using Speech Presence Probability As A Secondary Task Training Target, , and , in: European Signal Processing Conference, EUSIPCO 2021, 2021 |
|
An Objective Evaluation Framework for Pathological Speech Synthesis, , , , , and , in: Proceedings of ITG Conference on Speech Communication, 2021 |
|
Improving Emotional TTS with an Emotion Intensity Input from Unsupervised Extraction, and , in: 11th ISCA Speech Synthesis Workshop, 2021 |
[URL] |
Boosting of contextual information in ASR for air-traffic call-sign recognition, , , , , , , and , in: Interspeech 2021, 2021 |
|
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , in: Interspeech 2021, 2021 |
[URL] |
Applying Attention Based Models for Detecting Cognitive Processes and Mental Health Conditions, , , and , Idiap-RR-01-2022 |
|
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, , , , , and , in: Proceedings of Interspeech 2021, International Speech Communication Association (ISCA), 2021 |
|
Handling acoustic variation in dysarthric speech recognition systems through model combination, and , in: Proceedings of Interspeech, 2021 |
|
Trust indicators and explainable AI: A study on user perceptions, , , , , , and , in: Proc. Int. Conf. on Human-Computer Interaction, Bari, Italy, 2021 |
|
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks, , , and , in: ACL, 2021 |
|
Variational Information Bottleneck for Effective Low-Resource Fine-Tuning, , and , in: ICLR, 2021 |
|
Unequivocal cardiac phase sorting from alternating ramp- and pulse-illuminated microscopy image sequences, , , , and , in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), pages 868-872, 2021 |
[DOI] [URL] |
Contactless Sleep Monitoring for Early Detection of Health Deteriorations in Community-Dwelling Older Adults: Exploratory Study, , , , , , , , , and , in: JMIR Mhealth Uhealth, 9(6), 2021 |
|
Declarative Variables in Online Dating: A Mixed-Method Analysis of a Mimetic-Distinctive Mechanism, , and , in: Proceedings of the ACM on Human-Computer Interaction, 5(CSCW1), 2021 |
|
Identification of F1 and F2 in speech using modified zero frequency filtering, and , in: Proceedings of Interspeech, 2021 |
|
On Modeling Glottal Source Information for Phonation Assessment in Parkinson’s Disease, , , , and , in: Proceedings of Interspeech, 2021 |
|
ROXANNE Research Platform: Automate criminal investigations, , , , , and , in: Interspeech Show and Tell 2021, 2021 |
|
Phoneme based Respiratory Analysis of Read Speech, , , and , in: Proceedings of European Signal Processing Conference (EUSIPCO), 2021 |
|
Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction, , and , in: Proceedings of Interspeech 2021, 2021 |
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, , , , and , Idiap-RR-19-2021 |
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, , , , , and , Idiap-RR-09-2021 |
|
Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances, , , , , , , and , in: Interspeech, 2021 |
|
Speech Activity Detection Based on Multilingual Speech Recognition System, , and , in: Interspeech, 2021 |
|
Supporting Context Monotonicity Abstractions in Neural NLI Models, , , , and , in: Natural Logic Meets Machine Learning Workshop, 2021 |
[URL] |
On the use of automatically generated synthetic image datasets for benchmarking face recognition, , and , in: International Joint Conference on Biometrics (IJCB 2021), 2021 |
|
Ergodic Exploration using Tensor Train: Applications in Insertion Tasks, , and , in: IEEE Transactions on Robotics, 38(2):906-921, 2022 |
[DOI] [URL] |
Does My Representation Capture X? Probe-Ably, , , , and , in: 59th Annual Meeting of the Association for Computational Linguistics (Demonstration track), 2021 |
[URL] |
Do Natural Language Explanations Represent Valid Logical Arguments? Verifying Entailment in Explainable NLI Gold Standards, , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
Switching Contexts: Transportability Measures for NLP, , , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
Encoding Explanatory Knowledge for Zero-shot Science Question Answering, , , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
Unification-based Reconstruction of Multi-hop Explanations for Science Questions, , and , in: 16th conference of the European Chapter of the Association for Computational Linguistics, 2021 |
[URL] |
Explainable Inference Over Grounding-Abstract Chains for Science Questions, , and , in: 59th Annual Meeting of the Association for Computational Linguistics (ACL Findings), 2021 |
|
On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, and , in: International Joint Conference on Biometrics (IJCB 2021), 2021 |
|
Supervised Speech Representation Learning for Parkinson's Disease Classification, and , Idiap-RR-08-2021 |
|
BertOdia: BERT pre-training for low resource Odia language, , , , , and , Idiap-RR-16-2021 |
|
NLPHut’s Participation at WAT2021, , , , , , , and , Idiap-RR-10-2021 |
|
Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization, , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022 |
|
The Theory, Practice, and Ethical Challenges of Designing a Diversity-Aware Platform for Social Relations, , , , , , , and , in: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, pages 11, ACM, 2021 |
[DOI] |
Ten seconds of my nights: exploring methods to measure brightness, loudness and attendance and their associations with alcohol use from video clips, , , , , and , in: PLOS ONE, 2021 |
[DOI] |
Subjective and objective evaluation of deepfake videos, and , in: International Conference on Acoustics, Speech, and Signal Processing, 2021 |
|
Visual Focus of Attention Estimation in 3D Scene with an Arbitrary Number of Targets, and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 9, IEEE, 2021 |
|
Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings, , , , and , in: Neural Networks, 141:211-224, 2021 |
[DOI] |
Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), , , , , , , , and , Idiap-RR-07-2021 |
|
Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling, and , in: Arxiv, 2021 |
|
Optics Versus Computation: Influence of Illumination and Reconstruction Model Accuracy in Focal-Plane-Scanning Optical Projection Tomography, and , in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), Nice, France, pages 567-570, IEEE, 2021 |
[DOI] |
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , Idiap-RR-14-2021 |
[URL] |
Explainable Phonology-based Approach for Sign Language Recognition and Assessment, , Ecole Polytechnique Fédérale de Lausanne, 2021 |
|
Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:1303-1317, 2021 |
[DOI] [URL] |
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, , and , Idiap-RR-04-2021 |
|
Semantic Behavior Analysis of COVID-19 Patients: A Collaborative Framework, , , and , in: Machine Learning for Healthcare Applications, John Wiley & Sons, Inc. USA and Scrivener Publishing LLC, USA, 2021 |
[URL] |
A Laser-based Dual-arm System for Precise Control of Collaborative Robots, , and , in: IEEE International Conference on Robotics and Automation, 2021 |
|
Estimating Nonplanar Flow from 2D Motion-blurred Widefield Microscopy Images via Deep Learning, and , in: International Symposium on Biomedical Imaging, 2021 |
|
Natural Language Inference over Tables: Enabling Explainable Data Exploration on Data Lakes, , , and , in: 18th Extended Semantic Web Conference (ESWC), 2021 |
[URL] |
Explainable Natural Language Reasoning via Conceptual Unification, , and , in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |
[URL] |
STAR: Cross-modal Statement Representation for Selecting Relevant Mathematical Premises, and , in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |
Cost–effective Variational Active Entity Resolution, , , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2021 |
[URL] |
Signal-to-signal neural networks for improved spike estimation from calcium imaging data, , , and , in: PLoS Computational Biology, 17(3):1-19, 2021 |
[DOI] |
A Bayesian Interpretation of the Light Gated Recurrent Unit, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
[DOI] |
Accurate Nod and 3D Gaze Estimation for Social Interaction Analysis, , EDEE, EPFL, 2020 |
|
Whole Body Model Predictive Control with a Memory of Motion: Experiments on a Torque-Controlled Talos, , , , , , , , , , , and , in: IEEE International Conference on Robotics and Automation, 2021 |
|
Learning Constrained Distributions of Robot Configurations with Generative Adversarial Network, , , and , in: IEEE Robotics and Automation Letters, 2021 |
|
A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET, , and , in: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Toronto, Ontario, Canada, 2021 |
|
Learning Optimal Impedance Control During Complex 3D Arm Movements, , , , and , in: IEEE Robotics and Automation Letters (RA-L), 6(2):1248-1255, 2021 |
[DOI] [URL] |
Cross Modal Focal Loss for RGBD Face Anti-Spoofing, and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 |
|
Probabilistic Adaptive Control for Robust Behavior Imitation, , and , in: IEEE Robotics and Automation Letters, 2021 |
|
Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech, , , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
|
Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention, and , in: Transactions on Machine Learning Research, 2023 |
[URL] |
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , Idiap-RR-01-2023 |
[URL] |
Evaluation of Urban Scale Building Energy-Use Models and Tools – Application for the City of Fribourg, Switzerland, , , and , in: Sustainability, 13(7), 2021 |
[DOI] [URL] |
Discourse Phenomena in Machine Translation, , École polytechnique fédérale de Lausanne, 2020 |
|
One More Bite? Inferring Food Consumption Level of College Students Using Smartphone Sensing and Self-Reports, , , , , , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 5(1), 2021 |
|
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, , , , , , , , , , , , , , , and , in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020 |
[DOI] [URL] |
Challenges for Using Impact Regularizers to Avoid Negative Side Effects, , and , in: SafeAI 2021 - AAAI's Workshop on Artificial Intelligence Safety, 2021 |
|
Exact Preimages of Neural Network Aircraft Collision Avoidance Systems, and , in: Machine Learning for Engineering Modeling, Simulation, and Design Workshop at Neural Information Processing Systems 2020, 2020 |
|
Predicting the Causal Effect Relationship Between COPD and Cardio Vascular Diseases, , , and , in: Informatica, 44(4), 2020 |
[DOI] [URL] |
Unsupervised Representation Learning for Gaze Estimation, and , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020 |
|
Paraspeckle components NONO and PSPC1 are not mislocalized from motor neuron nuclei in sporadic ALS, , , , , and , in: Brain, 2020 |
[URL] |
Mammary epithelial morphogenesis in 3D combinatorial microenvironments, , , and , in: Scientific Reports, 10(1), 2020 |
[URL] |
Author Profiling in Social Media with Multimodal Information, , , and , in: Computacion y Sistemas (CyS), 24(3), 2020 |
[URL] |
SPARSE AUTOENCODERS TO ENHANCE SPEECH RECOGNITION, and , Idiap-RR-10-2022 |
|
Real-Time Segmentation Networks should be Latency Aware, and , in: Asian Conference on Computer Vision, 2020 |
|
Fairness in Biometrics: a figure of merit to assess biometric verification systems, and , in: arXiv, 2020 |
|
Subspace-based Learning for Automatic Dysarthric Speech Detection, , and , in: IEEE Signal Processing Letters, 2020 |
COMPARISON OF SUBWORD SEGMENTATION METHODS FOR OPEN-VOCABULARY END-TO-END SPEECH RECOGNITION, , , and , Idiap-RR-34-2020 |
|
Smartphone Sensing for the Well-being of Young Adults: A Review, and , in: IEEE Access, 2021 |
[DOI] [URL] |
LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, , and , Idiap-RR-40-2020 |
[URL] |
Fast Transformers with Clustered Attention, , and , in: Proceedings of the International Conference on Neural Information Processing Systems, 2020 |
Partially-supervised Mention Detection, and , in: Proceedings of the Third Workshop on Computational Models of Reference, Anaphora and Coreference, 2020 |
|
The Unstoppable Rise of Computational Linguistics in Deep Learning, , in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, pages 6294-6306, Association for Computational Linguistics, 2020 |
[DOI] [URL] |
Evaluation of 1-Year in-Home Monitoring Technology by Home-Dwelling Older Adults, Family Caregivers, and Nurses, , , , , , , and , in: Frontiers in Public Health, 8:9, 2020 |
[DOI] [URL] |
BertAA: BERT fine-tuning for Authorship Attribution, , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
Free annotated data for deep learning in microscopy? A hitchhiker's guide, and , in: Photoniques(104):30-33, 2020 |
[DOI] [URL] |
Aliasing mitigation in optical microscopy of dynamic biological samples by use of temporally modulated color illumination and a standard RGB camera, and , in: Journal of Biomedical Optics, 25(10):106505, 2020 |
[DOI] [URL] |
An Integrated and strategic evaluation of automatic blind controls to achieve energy and occupant's comfort objectives, and , in: Proceedings of the 5th IBPSA-England Conference on Building Simulation and Optimization (Virtual), Loughborough, UK, 2020 |
[URL] |
Detection of Similar Languages and Dialects Using Deep Supervised Autoencoders, , , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
Overview of the 7th Workshop on Asian Translation, , , , , , , , , , , and , in: Proceedings of the 7th Workshop on Asian Translation, Association for Computational Linguistics, 2020 |
[URL] |
Generative adversarial training of product of policies for robust and adaptive movement primitives, , and , in: Proc. Conference on Robot Learning (CoRL), 2020 |
|
Learning, Generating and Adapting Wave Gestures for Expressive Human-Robot Interaction, , and , in: Proc. ACM/IEEE Intl Conf. on Human-Robot Interaction (HRI), pages 386-388, 2020 |
[DOI] [URL] |
Context is Everything: Using a Smartphone App to Capture Young People's Drinking Behaviours, Cognitions, Environments, and Consequences, , La Trobe University, Melbourne, Australia, 2020 |
[DOI] |
Do different drinks make you feel different emotions? Examination of young adolescents' beverage-specific alcohol expectancies using the Alcohol Expectancy Task, , , and , in: Addictive Behaviors, 2020 |
[DOI] [URL] |
Fun/intoxication pre-drinking motives lead indirectly to more alcohol-related consequences via increased alcohol consumption on a given night, , , and , in: Addictive Behaviors, 2020 |
[DOI] [URL] |
Alone or With Others? Understanding Eating Episodes of College Students with Mobile Sensing, , and , in: 19th International Conference on Mobile and Ubiquitous Multimedia, Essen, Germany, pages 162–166, Association for Computing Machinery, 2020 |
[DOI] [URL] |
Protecting Mobile Food Diaries from Getting too Personal, , and , in: 19th International Conference on Mobile and Ubiquitous Multimedia, Essen, Germany, pages 212–222, Association for Computing Machinery, 2020 |
[DOI] [URL] |
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: MLSLP-18 Proceedings, Hyderabad, 2018 |
[URL] |
On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, and , Idiap-RR-30-2020 |
|
Spectro-temporal sparsity characterization for dysarthric speech detection, and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28:1210-1222, 2020 |
|
Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, , , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020 |
Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, , , and , Idiap-RR-26-2020 |
|
An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, , and , Idiap-RR-03-2021 |
|
Assisted teleoperation in changing environments with a mixture of virtual guides, , and , in: Advanced Robotics, 34(18):1157-1170, 2020 |
[DOI] [URL] |
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition, , , , , , , , , , and , in: Proceedings of Interspeech, pages 2182-2186, 2020 |
|
Graph-to-Graph Transformer for Transition-based Dependency Parsing, and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, ACL, Online, pages 3278–3289, Association for Computational Linguistics, 2020 |
[URL] |
ODIANLP's Participation in WAT2020, , , , , , , , and , in: Proceedings of the 7th Workshop on Asian Translation, ACL Anthology, 2020 |
|
On Joint Optimization of Automatic Speaker Verification and Anti-spoofing in the Embedding Space, , , , and , in: IEEE Transactions on Information Forensics and Security, 16:1579-1593, 2021 |
[DOI] |
Vulnerability Analysis of Face Morphing Attacks from Landmarks and Generative Adversarial Networks, , , and , Idiap-RR-38-2020 |
|
Inferring Highly-dense Representations for Clustering Broadcast Media Content, , , and , in: The Prague Bulletin of Mathematical Linguistics, 2020 |
[URL] |
AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, , and , Idiap-RR-32-2020 |
|
Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement, and , in: Transactions of the Association for Computational Linguistics, 2020 |
[URL] |
Graph-to-Graph Transformer for Transition-based Dependency Parsing, and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2020 |
[URL] |
Shooting shots: Estimating alcoholic drink sizes in real life using event-level reports and annotations of close-up pictures, , , and , in: Drug and Alcohol Review, 2020 |
[DOI] [URL] |
A t-distribution based operator for enhancing out of distribution robustness of neural network classifiers, and , in: IEEE Signal Processing Letters, 27:1070-1074, 2020 |
[DOI] |
Plug and Play Autoencoders for Conditional Text Generation, , , , and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Online, 2020 |
|
Automatic Discrimination of Apraxia of Speech and Dysarthria using a Minimalistic Set of Handcrafted Features, , , and , in: Interspeech, 2020 |
|
Adaptive Ensemble-based Optimisation for Petrophysical Inversion, and , in: Mathematical Geosciences, 2020 |
[DOI] [URL] |
A Phonology-based Approach for Isolated Sign Production Assessment in Sign Language, , , and , in: Companion Publication of the 2020 International Conference on Multimodal Interaction (ICMI '20 Companion), 2020 |
|
Idiap and UAM Participation at MEX-A3T Evaluation Campaign, , , , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020) co-located with 36th Conference of the Spanish Society for Natural Language Processing (SEPLN 2020), pages 6, CEUR Workshop Proceedings, 2020 |
[URL] |
The High-Quality Wide Multi-Channel Attack (HQ-WMCA) database, , , , and , Idiap-RR-22-2020 |
|
The Little W-Net That Could: State-of-the-Art Retinal Vessel Segmentation with Minimalistic Models, , , , , and , in: Cornell University Pre-print Server, 2020 |
[URL] |
Taming GANs with Lookahead, , , and , Idiap-RR-20-2020 |
[URL] |
Deep Generative Models and Applications, and , EPFL, 2020 |
[DOI] [URL] |
Active Illumination and Computational Methods for Temporal and Spectral Super-Resolution Microscopy, , EPFL, 2020 |
[DOI] |
Deepfake detection: humans vs. machines, and , Idiap-RR-36-2020 |
|
Iris Liveness Detection Competition (LivDet-Iris) – The 2020 Edition, , , , , , , , , , , , , and , in: International Joint Conference on Biometrics (IJCB 2020), 2020 |
[URL] |
Robot skills learning with Riemannian manifolds: Leveraging geometry-awareness in robot learning, optimization and control, , Ecole Polytechnique Fédérale de Lausanne, 2020 |
|
Understanding Applicants' Reactions to Asynchronous Video Interviews through Self-Reports and Nonverbal Cues, , , , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction (ICMI), Utrecht, 2020 |
|
Product of experts for robot learning from demonstration, , EPFL, 2020 |
Face Recognition systems: performance evaluation and bias analysis, , Idiap-Com-04-2020 |
|
Deep Learning of Charisma, , Idiap-Com-03-2020 |
|
Planning and control of robot manipulation tasks, , Idiap-Com-01-2022 |
|
Machine Learning for Adverse Event Detection in Latent Tuberculosis Infection Treatment, , Idiap-Com-02-2020 |
|
Automatic Speech Recognition Engines Adapted for Embedded Platforms, , Idiap-Com-01-2020 |
|
Detection of disguised speech in forensic science by humans and automatic systems, , Université de Lausanne Ecole des Sciences Criminelles, 2020 |
|
Automatic Speech Recognition Benchmark for Air-Traffic Communications, , , , and , in: Proc. Interspeech 2020, pages 2297-2301, 2020 |
[DOI] |
An HMM Approach with Inherent Model Selection for Sign Language and Gesture Recognition, , and , in: Proceedings of the International Conference on Language Resources and Evaluation LREC 2020, 2020 |
|
Temporal resolution doubling in fluorescence light-sheet microscopy via a hue-encoded shutter and regularization, , , and , in: OSA Continuum, 3(8), 2020 |
|
Smartphone Multi-modal Biometric Authentication: Database and Evaluation, , , , , , , , and , Idiap-RR-17-2020 |
[URL] |
Learning One Class Representations for Face Presentation Attack Detection using Multi-channel Convolutional Neural Networks, and , in: IEEE Transactions on Information Forensics and Security, 2020 |
|
The MuMMER data set for Robot Perception in multi-party HRI Scenarios, , , and , in: Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, 2020 |
|
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention, , , and , in: Proceedings of International Conference on Machine Learning, 2020 |
A Bayesian Approach to Recurrence in Neural Networks, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(8):2527-2537, 2021 |
[DOI] |
Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks, , , , , and , in: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, pages 7499-7503, 2020 |
[DOI] |
WatchNet++: Efficient and accurate depth-based network for detecting people attacks and intrusion, , , and , in: Machine Vision and Applications, 2020 |
|
Plug and Play Autoencoders for Conditional Text Generation, , , , and , Idiap-RR-24-2020 |
|
Optimizer Benchmarking Needs to Account for Hyperparameter Tuning, , , , and , in: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, 2020 |
[URL] |
Gradient Alignment in Deep Neural Networks, and , Idiap-RR-14-2020 |
|
Deep Models and Shortwave Infrared Information to Detect Face Presentation Attacks, , , , and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2020 |
|
Active Improvement of Control Policies with Bayesian Gaussian Mixture Model, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020 |
Idiap & UAM participation at GermEval 2020: Classification and Regression of Cognitive and Motivational Style from Text, , , , and , in: Proceedings of the GermEval 2020 Shared Task on the Classification and Regression of Cognitive and Motivational style from Text, 2020 |
[URL] |
Idiap Submission to Swiss-German Language Detection Shared Task, , , , and , in: Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), CEUR Workshop Proceedings, 2020 |
[URL] |
Generating Master Faces for Use in Performing Wolf Attacks on Face Recognition Systems, , , and , in: International Joint Conference on Biometrics, 2020 |
|
Automatic pathological speech intelligibility assessment exploiting subspace-based analyses, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28:1717-1728, 2020 |
[DOI] |
End-to-End Bias Mitigation by Modelling Biases in Corpora, , and , in: ACL, 2020 |
|
Comparison of Subword Segmentation Methods for Open-vocabulary ASR using a Difficulty Metric, , , and |
|
OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation, , , , , and , European Language Resources Association (ELRA), 2020 |
[URL] |
On quantifying the quality of acoustic models in hybrid DNN-HMM ASR, , and , in: Speech Communication, 119:24-35, 2020 |
[DOI] |
Parametric study of URBAN morphology on building solar energy potential in Singapore context, , , , and , in: Urban Climate, 33(100624), 2020 |
[DOI] [URL] |
CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR, and , in: IEEE International Conference on Image Processing, 2020 |
|
Neural Network based End-to-End Query by Example Spoken Term Detection, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020 |
|
pyannote.audio: neural building blocks for speaker diarization, , , , , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020 |
[URL] |
Idiap Submission to Swiss-German Language Detection Shared Task, , , , and , Idiap-RR-11-2020 |
|
Spatially-Variant CNN-Based Point Spread Function Estimation for Blind Deconvolution and Depth Estimation in Optical Microscopy, and , in: IEEE Transactions on Image Processing, 29:5848-5861, 2020 |
[DOI] |
Understanding the performance gap: a machine learning approach on residential buildings in Turin, Italy, , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages, , and , in: Computer Speech and Language, 65, 2021 |
[DOI] [URL] |
Competitive Neural Layer-based Method to Identify People with High Risk for Diabetic Foot, , , , , and , in: Computers in Biology and Medicine, 120, 2020 |
[DOI] [URL] |
ManiGaze: a Dataset for Evaluating Remote Gaze Estimator in Object Manipulation Situations, , and , in: Symposium on Eye Tracking Research and Applications, Stuttgart, Germany, pages 5, ACM, 2020 |
[DOI] |
Epileptic seizure detection: a comparative study between deep and traditional machine learning techniques, , , , and , in: Journal of Integrative Neuroscience, 19(1):1-9, 2020 |
[URL] |
Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement, and , in: Transactions of the Association for Computational Linguistics (under submission), 2020 |
Tractable Approaches to Learning and Planning in High Dimensions, , EPFL, 2014 |
[DOI] |
Theory and Algorithms for Hypothesis Transfer Learning, , EPFL, 2018 |
[DOI] |
Variational Inference with Mixture Model Approximation for Applications in Robotics, , and , in: International Conference on Robotics and Automation, 2020 |
|
Gaussians on Riemannian Manifolds for Robot Learning and Adaptive Control, , in: IEEE Robotics and Automation Magazine (RAM), 2020 |
|
Memory of Motion for Warm-starting Trajectory Optimization, , , and , in: IEEE Robotics and Automation Letters, 5(2):2594-2601, 2020 |
[DOI] |
A memory of motion for visual predictive control tasks, , and , in: International Conference on Robotics and Automation, 2020 |
|
Learning How to Walk: Warm-starting Optimal Control Solver with Memory of Motion, , , , and , in: International Conference on Robotics and Automation, 2020 |
|
Sparse and Low-rank Modeling for Automatic Speech Recognition, , EPFL, 2019 |
[DOI] |
Trustworthy speaker recognition with minimal prior knowledge using neural networks, , Ecole polytechnique fédérale de Lausanne (EPFL), 2019 |
[DOI] [URL] |
Towards Multilingual Sign Language Recognition, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |
|
Detection of S1 and S2 locations in phonocardiogram signals using zero frequency filter, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |
|
SYNTHETIC SPEECH REFERENCES FOR AUTOMATIC PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT, , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020 |
|
Dysarthric Speech Recognition with Lattice-Free MMI, and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020 |
[DOI] [URL] |
Youth nightlife at home: towards a feminist conceptualisation of home, , , , , , and , in: Children's Geographies, 2020 |
[DOI] [URL] |
Learning Trajectory Distributions for Assisted Teleoperation and Path Planning, , , , , , and , in: Frontiers in Robotics and AI, 6:89, 2019 |
[DOI] [URL] |
Plucking Motions for Tea Harvesting Robots Using Probabilistic Movement Primitives, , , and , in: IEEE International Conference on Robotics and Automation, 2020 |
Low-latency speaker spotting with online diarization and detection, , , , , , , , and , in: The Speaker and Language Recognition Workshop (Odyssey), 2018 |
|
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019 |
[URL] |
INCREMENTAL TRANSFER LEARNING IN TWO-PASS INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, , , and , in: Proceedings of ICASSP 2019, pages 6291-6295, 2019 |
Implicit discourse relation classification with syntax-aware contextualized word representations, , , and , in: Proceedings of the 32nd International Florida Artificial Intelligence Research Society Conference, 2019 |
SATokE: How can Syntax-Aware Contextualized Word Representations Benefit Implicit Discourse Relation Classification?, , , and , in: Proc. 2019 Conference sur l'Apprentissage automatique, 2019 |
Weakly-Supervised Concept-based Adversarial Learning for Cross-lingual Word Embeddings, , and , in: Proc. 2019 Conference on Empirical Methods in Natural Language Processing, 2019 |
Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, pages 795-799, 2019 |
Learning Entailment-Based Sentence Embeddings from Natural Language Inference, , and , Idiap-RR-20-2019 |
[URL] |
Learning an event sequence embedding for event-based deep stereo, , , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2019 |
Reducing Noise in GAN Training with Variance Reduced Extragradient, , , and , in: Proceedings of the International Conference on Neural Information Processing Systems, 2019 |
Uncertainty-aware imitation learning using kernelized movement primitives, , , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019 |
|
A Non-Euclidean Gradient Descent Framework for Non-Convex Matrix Factorization, , , , , and , in: IEEE Transactions on Signal Processing, 2018 |
|
Real-Time DCT Learning-based Reconstruction of Neural Signals, , and , in: EUSIPCO, 2018 |
|
Learning-Based Compressive MRI, , , , , , and , in: IEEE Transactions on Medical Imaging, 2018 |
|
On the Tunability of Optimizers in Deep Learning, , , , and , Idiap-RR-19-2019 |
[URL] |
Extractive Odia Text Summarization System: An OCR based Approach, , Idiap-RR-02-2020 |
|
Reinforcement learning of trajectory distributions: Applications in assisted teleoperation and motion planning, , , , , and , in: IEEE International Conference on Intelligent Robots and Systems, 2019 |
SCALAR: Simultaneous Calibration of 2-D Laser and Robot Kinematic Parameters Using Planarity and Distance Constraints, , and , in: IEEE Transactions on Automation Science and Engineering, 16(4):1971-1979, 2019 |
[DOI] |
Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task, , and , in: Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), Hong Kong, pages 27-33, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
Adaptive Design of Experiments for Conservative Estimation of Excursion Sets, , , , and , in: Technometrics, 2019 |
[DOI] [URL] |
Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, , , , , and , in: Proceedings of APSIPA ASC 2019, 2019 |
A Differential Approach for Gaze Estimation, , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(3):1092-1098, 2021 |
[DOI] [URL] |
Self-attention for Speech Emotion Recognition, , and , in: Proc. Interspeech 2019, 2019 |
[DOI] |
Broadcast Media Content Categorization Using Low-Resolution Concepts, , , , and , Idiap-RR-06-2021 |
|
Building energy models with Morphological urban-scale parameters: a case study in Turin, , , , , and , in: Proceedings of 4th Building Simulation Applications Conference - BSA 2019, 2019 |
[URL] |
The Simulation of Mean Radiant Temperature in Outdoor Conditions: A Review of Architectural Tools Calculation Assumptions, , , and , in: Proceedings of Building Simulation 2019: 16th Conference of IBPSA, 2019 |
OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation, , , , , and , Idiap-RR-08-2020 |
|
Mixture Models for the Analysis, Edition, and Synthesis of Continuous Time Series, , in: Mixture Models and Applications, pages 39-57, Springer, 2019 |
[DOI] |
Interactive Generation of Calligraphic Trajectories from Gaussian Mixtures, , and , in: Mixture Models and Applications, pages 23-38, Springer, 2019 |
[DOI] |
A Survey on Policy Search Algorithms for Learning Robot Controllers in a Handful of Trials, , , , and , in: IEEE Trans. on Robotics, 32(2):328-347, 2020 |
[DOI] [URL] |
Improving dual-arm assembly by master-slave compliance, , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation, pages 8676-8682, 2019 |
|
Bayesian Gaussian mixture model for robotic policy imitation, and , in: IEEE Robotics and Automation Letters, 4(4):4452-4458, 2019 |
[DOI] [URL] |
Daylighting simulation for external Venetian blinds based on HDR sky luminance monitoring with matrix algebraic approach, , and , in: Energy Procedia, 158:2677-2682, 2019 |
[DOI] |
Performance assessment of the BTDF data compression based on wavelet transforms in daylighting simulation, , and , in: Solar Energy, 2019 |
[DOI] |
CityLearn v1.0: An OpenAI Gym Environment for Demand Response with Deep Reinforcement Learning, , , and , in: Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, New York, USA, pages 356-357, ACM, 2019 |
[DOI] |
TOWARDS MULTILINGUAL SIGN LANGUAGE RECOGNITION, , and , Idiap-RR-16-2019 |
|
Retrofitting, district heating and energy storage: neighborhood energy planning, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
A smart luminaire in an office environment: impact on light distribution, user interactions and comfort, , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Multi-agent reinforcement learning for adaptive demand response in smart cities, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Daylight regulated by automated external Venetian blinds based on HDR sky luminance mapping in winter, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
A morphological based PV generation and energy consumption predictive model for Singapore neighbourhood, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
CO2 experimental measurements towards the development of a predictive framework using user actions in smart buildings, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Multi-scale sequential network for semantic text segmentation and localization, , and , in: Pattern Recognition Letters, 129:63-69, 2020 |
[DOI] [URL] |
Can Your Face Detector Do Anti-spoofing? Face Presentation Attack Detection with a Multi-Channel Face Detector, and , Idiap-RR-12-2020 |
|
Language Independent Query by Example Spoken Term Detection, , École Polytechnique Fédérale de Lausanne, 2019 |
|
Idiap submission to the NIST SRE 2019 Speaker Recognition Evaluation, , , , and , Idiap-RR-15-2019 |
|
Overview of the 6th Workshop on Asian Translation, , in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 1–35, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
Idiap NMT System for WAT 2019 Multimodal Translation Task, and , in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 175–180, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
Improving the conditioning of the optimization criterion in acoustic multi-channel equalization using shorter reshaping filters, and , in: EURASIP Journal on Advances in Signal Processing(11), 2018 |
|
Selecting, Planning, and Rewriting: A Modular Approach for Data-to-Document Generation and Translation, , and , in: WNGT EMNLP, 2019 |
|
CHALLENGES IN BROADCAST MEDIA CONTENT CATEGORIZATION, , and , Idiap-RR-02-2021 |
|
Vulnerability of Face Recognition to Deep Morphing, and , in: International Conference on Biometrics for Borders, 2019 |
|
Idiap Abstract Text Summarization System for German Text Summarization Task, and , in: Proceedings of the 4th edition of the Swiss Text Analytics Conference, 2019 |
[URL] |
Multilingual Bottleneck Features for Query by Example Spoken Term Detection, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2019 |
|
Understanding Raw Waveform based CNN through Low-rank Spectro-Temporal Decoupling, , and , Idiap-RR-11-2019 |
|
Subunits Inference and Lexicon Development Based on Pairwise Comparison of Utterances and Signs, and , in: Information, 10:298, 2019 |
[DOI] [URL] |
Super-gaussianity of Speech Spectral Coefficients as a Potential Biomarker for Dysarthric Speech Detection, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2019 |
Joint acoustic localization and dereverberation through plane wave decomposition and sparse regularization, , , , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(12):1893-1905, 2019 |
[DOI] [URL] |
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , Idiap-RR-01-2020 |
|
Full-Gradient Representation for Neural Network Visualization, and , in: Advances in Neural Information Processing Systems, 2019 |
[URL] |
Idiap NMT System for WAT 2019 Multimodal Translation Task, and , Idiap-RR-04-2020 |
|
Multispectral Deep Embeddings As a Countermeasure To Custom Silicone Mask Presentation Attacks, , and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2019 |
|
Abstract Text Summarization: A Low Resource Challenge, and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), Hong Kong, China, pages 5, Association for Computational Linguistics (ACL), 2019 |
|
Learning One Class Representations for Presentation Attack Detection using Multi-channel Convolutional Neural Networks, and , Idiap-RR-15-2020 |
|
A Comprehensive Experimental and Reproducible Study on Selfie Biometrics in Multistream and Heterogeneous Settings, , and , in: IEEE Transactions on Biometrics, Behavior and Identity Science, 2019 |
[DOI] [URL] |
Discovering Eating Routines in Context with a Smartphone App, , , and , in: UbiComp/ISWC '19 Adjunct: Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers, London, pages 422-429, 2019 |
[DOI] |
Validity of pervasive computing based continuous physical activity assessment in community-dwelling old and oldest-old, , , , , , , , , , , and , in: Scientific Reports, 9(9662), 2019 |
|
German News Article Classification: A Multichannel CNN Approach, , and , Idiap-RR-09-2020 |
|
Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, , , , and , Idiap-RR-05-2021 |
[URL] |
Temporal Super-Resolution Microscopy Using a Hue-Encoded Shutter, , , and , in: Optical Society of America Biomedical Optics Express, 10(09):4727-4741, 2019 |
[DOI] [URL] |
Generalized temporal sampling with active illumination in optical microscopy, and , in: Proceedings of the SPIE Conference Optics and Photonics, Wavelets and Sparsity XVIII, San Diego, California, United States, SPIE, 2019 |
|
Processing Megapixel Images with Deep Attention-Sampling Models, and , in: Proceedings of International Conference on Machine Learning, 2019 |
[URL] |
The Speed Submission to DIHARD II: Contributions & Lessons Learned, , , , , , , , , , , , , and , Idiap-RR-14-2019 |
|
Tampered Speaker Inconsistency Detection with Phonetically Aware Audio-visual Features, , , , , , , and , in: International Conference on Machine Learning, 2019 |
|
Vulnerability assessment and detection of Deepfake videos, and , in: IAPR International Conference on Biometrics, 2019 |
|
Understanding and Visualizing Raw Waveform-based CNNs, , , and , in: Proceedings of Interspeech, 2019 |
|
CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, , and , in: International Conference on Learning Representations, New Orleans, Louisiana, USA, 2019 |
[URL] |
Automated Daylighting Control System based on Sky Luminance Monitoring and Lighting Computing, , and , EPFL, 2019 |
[DOI] |
Spoken language identification using language bottleneck features, , , , , and , in: Proceedings of TSD, 2019 |
|
The contexts of heavy drinking: A systematic review of the combinations of context-related factors associated with heavy drinking occasions, , , , and , in: PLOS ONE, 14(7):29, 2019 |
[DOI] [URL] |
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, , , , and , in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pages 7040-7044, IEEE, 2019 |
[DOI] [URL] |
Spectral Subspace Analysis for Automatic Assessment of Pathological Speech Intelligibility, , and , in: Proceedings of Interspeech, Graz, Austria, pages 3038-3042, 2019 |
|
End-to-end Accented Speech Recognition, , and , Idiap-RR-04-2022 |
|
Split-pane electrochromic window control based on an embedded photometric device with real-time daylighting computing, , , , and , in: Building and Environment, 2019 |
[DOI] |
Using Speech Production Knowledge for Raw Waveform Modelling based Styrian Dialect Identification, and , in: Proceedings of Interspeech, 2019 |
|
Idiap Abstract Text Summarization System for German Text Summarization Task, and , Idiap-RR-03-2020 |
|
BookTubing Across Regions: Examining Differences based on Nonverbal and Verbal Cues, , and , in: Proc. ACM Int. Conf. on Interactive Experiences for Television and Online Video (TVX), Salford, England, 2019 |
[DOI] |
Automated Eye-sight Venetian blinds based on an embedded photometric device with real-time daylighting computing, , and , in: Applied Energy, 252, 2019 |
[DOI] [URL] |
Deep Residual Output Layers for Neural Language Generation, and , in: Proceedings of the 36th International Conference on Machine Learning (ICML), 2019 |
|
The Role of Sex and Age on Pre-drinking: An Exploratory International Comparison of 27 Countries, , , , and , in: Alcohol and Alcoholism, 54(4):378–385, 2019 |
[DOI] |
Domain Adaptation in Multi-Channel Autoencoder based Features for Robust Face Anti-Spoofing, , and , in: International Conference on Biometrics 2019, IEEE, 2019 |
|
Biometric Face Presentation Attack Detection with Multi-Channel Convolutional Neural Network, , , , , and , in: IEEE Transactions on Information Forensics and Security, 2019 |
|
Processing Megapixel Images with Deep Attention-Sampling Models, and , Idiap-RR-07-2019 |
[URL] |
A solar-based sustainable urban design: The effects of city-scale street-canyon geometry on solar access in Geneva, Switzerland, , , , , and , in: Applied Energy, 240:173-190, 2019 |
[DOI] |
Deep Pixel-wise Binary Supervision for Face Presentation Attack Detection, and , in: International Conference on Biometrics, 2019 |
|
The emotional entanglements of smartphones in the field: On emotional discomfort, power relations, and research ethics, , , , , and , in: Area, 52(1), 2020 |
[DOI] [URL] |
Multimodal Person Recognition in Audio-Visual Streams, , EPFL, 2019 |
[DOI] |
Custom Silicone Face Masks - Vulnerability of Commercial Face Recognition Systems & Presentation Attack Detection, , , , , , and , in: Proceedings of 7th IAPR/IEEE International Workshop on Biometrics and Forensics, 2019 |
|
A Deep Learning Approach for Robust Head Pose Independent Eye Movements Recognition from Videos, , and , in: 2019 ACM Symposium on Eye Tracking Research and Applications, pages 5, ACM, 2019 |
[DOI] |
Conditions for the finiteness of the moments of the volume of level sets, , , and , in: Electronic Communications in Probability, 24(17), 2019 |
[DOI] [URL] |
Contaminant source localization via Bayesian global optimization, , , and , in: Hydrology and Earth System Sciences, 23:351-369, 2019 |
[DOI] [URL] |
On the choice of the low-dimensional domain for global optimization via random embeddings, , and , in: Journal of Global Optimization, 2019 |
[DOI] [URL] |
Pathological Speech Intelligibility Assessment Based on the Short-Time Objective Intelligibility Measure, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, UK, pages 6405--6409, 2019 |
|
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019 |
[DOI] |
Spoken language identification using language bottleneck features, , , , , and , Idiap-RR-08-2019 |
|
Empirical Evaluation and Combination of Punctuation Prediction Models Applied to Broadcast News, and , in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019 |
|
A Learning-Based Framework for Quantized Compressed Sensing, , and , in: A Learning-Based Framework for Quantized Compressed Sensing, 2019 |
|
End-to-End Acoustic Modeling using Convolutional Neural Networks for HMM-based Automatic Speech Recognition, , and , in: Speech Communication, 108:15--32, 2019 |
[DOI] |
Improving Children Speech Recognition through Feature Learning from Raw Speech Signal, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Learning voice source related information for depression detection, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Segment-level training of ANNs based on acoustic confidence measures for hybrid HMM/ANN Speech Recognition, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Automatic Diagnosis of Alzheimer's Disease Using Neural Network Language Models, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
HMM-based Approaches to Model Multichannel Information in Sign Language inspired from Articulatory Features-based Speech Processing, , , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
GILE: A Generalized Input-Label Embedding for Text Classification, and , in: Transactions of the Association for Computational Linguistics (TACL), 2019 |
|
Capturing drinking and nightlife behaviours and their social and physical context with a smartphone application - investigation of users' experience and reactivity, , , , , , , and , in: Addiction Research and Theory, 28(1):62-75, 2020 |
[DOI] [URL] |
Adaptation of Assistant Based Speech Recognition to New Domains and Its Acceptance by Air Traffic Controllers, , , , , , , , , and , in: Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems, San Diego, California, USA, pages 820 - 826, 2019 |
[DOI] |
Building Blocks of Assistant Based Speech Recognition for Air Traffic Management Applications, , , , , , , and , in: Conference: SESAR Innovation Days 2018, European Union, Eurocontrol, Salzburg, Austria, SESARJU, 2018 |
[URL] |
Iterative Learning of Speech Recognition Models for Air Traffic Control, , , , , , and , in: Proceedings of Interspeech 2018, ISCA, Hyderabad, India, pages 3519-3523, 2018 |
[DOI] |
CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, , and , Idiap-RR-06-2019 |
[URL] |
Multi-Spectral Widefield Microscopy of the Beating Heart through Post-Acquisition Synchronization and Unmixing, , , and , in: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), Venice, Italy, pages 1382-1385, 2019 |
[DOI] |
Leveraging Convolutional Pose Machines for Fast and Accurate Head Pose Estimation, , and , in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, pages 1089-1094, IEEE, 2018 |
|
Improving speech embedding using crossmodal transfer learning with audio-visual data, and , in: Multimedia Tools and Applications, 78(11):15681-15704, 2019 |
[DOI] |
Voice Presentation Attack Detection Using Convolutional Neural Networks, , , , , and , in: Handbook of Biometric Anti-Spoofing, pages 391--415, Springer, 2019 |
[URL] |
Multilingual bottleneck features for subword modeling in zero-resource languages, and , in: Proc. Interspeech, pages 2668-2672, 2018 |
[DOI] |
Language model domain adaptation for automatic speech recognition, , and , Idiap-RR-05-2020 |
|
Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching, , and , in: Proceedings of the international conference on Neural Information Processing Systems, 2018 |
Geodesic Convolutional Shape Optimization, , , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Kronecker Recurrent Units, , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Enhancing Trust in eAssessment - the TeSLA System Solution, , , , and , in: Technology Enhanced Assessment Conference, 2018 |
|
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: Proc. Interspeech 2018, pages 3147-3151, 2018 |
[DOI] |
SCALAR - Simultaneous Calibration of 2D Laser And Robot's Kinematic Parameters Using Three Planar Constraints, , and , in: International Conference on Intelligent Robots, 2018 |
|
Recent Advances in Face Presentation Attack Detection, , , and , in: Handbook of Biometric Anti-Spoofing, Springer, 2019 |
[URL] |
Data-Driven Movement Subunit Extraction from Skeleton Information for Modeling Signs and Gestures, , and , Idiap-RR-02-2019 |
|
An Introduction to Vein Presentation Attacks and Detection, , and , in: Handbook of Biometric Anti-Spoofing, Springer International Publishing, 2019 |
[DOI] [URL] |
DeepFakes: a New Threat to Face Recognition? Assessment and Detection, and , Idiap-RR-18-2018 |
|
A Cross-database Study of Voice Presentation Attack Detection, and , in: Handbook of Biometric Anti-Spoofing: Presentation Attack Detection, 2nd Edition, Springer, 2018 |
Profile extrema for visualizing and quantifying uncertainties on excursion regions. Application to coastal flooding, , , and , in: Technometrics, 61(4):474-493, 2019 |
[DOI] [URL] |
A supermartingale approach to Gaussian process based sequential design of experiments, , and , in: Bernoulli, 25(4A):2883-2919, 2019 |
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, , and , Idiap-Com-01-2019 |
|
Dexterous Underwater Manipulation from Distant Onshore Locations, , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE Robotics and Automation Magazine, 2018 |
|
Learning from Demonstration (Programming by Demonstration), , in: Encyclopedia of Robotics, Springer, 2019 |
[DOI] [URL] |
Small Variance Asymptotics for Non-Parametric Online Robot Learning, and , in: International Journal of Robotics Research (IJRR), 38(1):3-22, 2019 |
|
Learning Task Priorities from Demonstrations, , , and , in: IEEE Transactions on Robotics, 35(1):78-94, 2019 |
[DOI] [URL] |
Learning from demonstration for semi-autonomous teleoperation, and , in: Autonomous Robots, 43(3):713-726, 2019 |
[DOI] [URL] |
Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models, , , , , , , and , in: 13th Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018 |
|
Bimanual Skill Learning with Pose and Joint Space Constraints, , , and , in: Proc. of the IEEE-RAS Intl Conf. on Humanoid Robots (Humanoids), 2018 |
|
Probabilistic Learning of Torque Controllers from Kinematic and Force Constraints, , , , and , in: Proc. of the IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 6552-6559, 2018 |
|
Heterogeneous Face Recognition Using Domain Specific Units, , and , in: IEEE Transactions on Information Forensics and Security:13, 2019 |
[DOI] |
Investigating Time Delay Neural Network (TDNN) for Language Modeling in Low Resource Automatic Speech Recognition, , , and , Idiap-RR-13-2019 |
|
Stacked Neural Networks with Parameter Sharing for Multilingual Language Modeling, , , , , and , Idiap-RR-12-2019 |
|
Designing second order recurrent neural networks for prosody modelling, , Idiap-RR-16-2018 |
|
Fusing TensorFlow with building energy simulation for intelligent energy management in smart cities, , , and , in: Sustainable Cities and Society, 2018 |
[DOI] |
Semi-supervised Adaptation of Assistant Based Speech Recognition Models for different Approach Areas, , , , , , , , , , and , in: 37th AIAA/IEEE Digital Avionics Systems Conference, AIAA/IEEE, London, 2018 |
[URL] |
Empirical Evaluation and Combination of Punctuation Prediction Models Applied to Broadcast News, and , Idiap-RR-01-2019 |
|
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, , , , and , Idiap-RR-05-2019 |
|
Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes, , and , in: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2018 |
|
Facing Employers and Customers: What Do Gaze and Expressions Tell About Soft Skills?, , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 121-126, Association for Computing Machinery, 2018 |
[DOI] |
Vlogging Over Time: Longitudinal Impressions and Behavior in YouTube, , , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 37-46, 2018 |
[DOI] |
UNICITY: A depth maps database for people detection in security airlocks, , , , , , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance Workshop, 2018 |
|
WatchNet: Efficient and Depth-based Network for People Detection in Video Surveillance Systems, , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance, Auckland, New Zealand, pages 109-114, 2018 |
|
Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation, , , and , in: Transactions of the Association for Computational Linguistics (TACL), 2018 |
|
Phonetic Subspace Features for Improved Query by Example Spoken Term Detection, , and , in: Speech Communication, 103:27-36, 2018 |
[DOI] |
Investigating Depth Domain Adaptation for Efficient Human Pose Estimation, , , and , in: European Conference on Computer Vision - Workshops, 2018 |
|
Cross-lingual Adaptation of a CTC-based multilingual Acoustic Model, , and , in: Speech Communication, 104:39-46, 2018 |
[DOI] |
Modeling Dyadic and Group Impressions with Inter-Modal and Inter-Person Features, , , and , in: ACM Transactions on Multimedia Computing, Communications, and Applications, 15(1), 2019 |
|
Mi Casa es su Casa? Examining Airbnb Hospitality Exchange Practices in a Developing Economy, , , , , , and , in: ACM Transactions on Social Computing, 2(1), 2019 |
|
Beyond Weight Tying: Learning Joint Input-Output Embeddings for Neural Machine Translation, , and , in: Proceedings of the Third Conference on Machine Translation (WMT), 2018 |
|
Looking South: Learning Urban Perception in Developing Cities, , and , in: ACM Transactions on Social Computing, 2018 |
|
Document-Level Neural Machine Translation with Hierarchical Attention Networks, , , and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018 |
|
Stochastic Variance Reduced Gradient Optimization of Generative Adversarial Networks, , , and , in: International Conference on Machine Learning (ICML) workshop on Theoretical Foundations and Applications of Deep Generative Models, 2018 |
Single-channel late reverberation power spectral density estimation using denoising autoencoders, and , in: Proc. Annual Conference of the International Speech Communication Association, Hyderabad, India, 2018 |
|
Iterative alternating least-squares approach to jointly estimate the RETFs and the diffuse PSD, , and , in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018 |
|
Statistical modeling of speech spectral coefficients in patients with Parkinson's disease, and , in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018 |
|
Modelling glottal source information for depression detection, , and , Idiap-RR-13-2018 |
|
Word Sense Consistency in Statistical and Neural Machine Translation, , École Polytechnique Fédérale de Lausanne, 2018 |
|
Supervised Gaze Bias Correction for Gaze Coding in Interactions, and , in: ECEM COGAIN Symposium, pages 3, 2017 |
|
A Differential Approach for Gaze Estimation with Calibration, , , and , in: 29th British Machine Vision Conference, 2018 |
|
Knowledge Transfer with Jacobian Matching, and , in: Proceedings of the International Conference on Machine Learning, 2018 |
[URL] |
Not All Samples Are Created Equal: Deep Learning with Importance Sampling, and , in: Proceedings of International Conference on Machine Learning, 2018 |
|
Gradient-based spectral visualization of CNNs using raw waveforms, , , and , Idiap-RR-11-2018 |
|
A Tale of Two Interactions: Inferring Performance in Hospitality Encounters from Cross-Situation Social Sensing, , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2(129), 2018 |
|
Spoofing Deep Face Recognition With Custom Silicone Masks, , and , in: Proceedings of BTAS2018, 2018 |
|
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , Idiap-RR-10-2018 |
|
Special issue on robot learning for human-robot collaboration, , , , and , in: Autonomous Robots, 42(5):953-956, 2018 |
[DOI] [URL] |
Programming by Demonstration for Shared Control with an Application in Teleoperation, , and , in: IEEE Robotics and Automation Letters (RA-L), 3(3):1848-1855, 2018 |
[DOI] [URL] |
Flexible Automation Driven by Demonstration: Leveraging Strategies that Simplify Robotics, , , , , , and , in: IEEE Robotics and Automation Magazine (RAM), 25(2):18-27, 2018 |
[DOI] [URL] |
A Brief Survey on the Role of Dimensionality Reduction in Manipulation Learning and Control, , and , in: IEEE Robotics and Automation Letters (RA-L), 3(3):2608-2615, 2018 |
[DOI] [URL] |
Learning Control, and , in: Humanoid Robotics: a Reference, pages 1261-1312, Springer, 2019 |
[DOI] [URL] |
SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, , , , , , , , , and , in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018 |
[DOI] [URL] |
Phonological Posterior Hashing for Query by Example Spoken Term Detection, , and , in: Proceedings of Interspeech, 2018 |
|
CNN based Query by Example Spoken Term Detection, , and , in: Proceedings of Interspeech, 2018 |
|
Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation by Use of Convolutional Neural Networks, and , in: 2018 25th IEEE International Conference on Image Processing (ICIP), pages 3818-3822, IEEE, 2018 |
[DOI] |
Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization, and , in: Proceedings of Interspeech, Hyderabad, India, pages 2257-2261, 2018 |
[DOI] |
On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, India, pages 1116-1120, 2018 |
|
On Learning to Identify Genders from Raw Speech Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, India, pages 287-291, 2018 |
[DOI] |
Speaker Inconsistency Detection in Tampered Video, and , in: European Signal Processing Conference, 2018 |
|
Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech, , , , , and , in: Proceedings of Interspeech 2018, Hyderabad, India, pages 292-296, 2018 |
[DOI] |
Implementing Fusion Techniques for the Classification of Paralinguistic Information, , , and , in: Proceedings of Interspeech 2018, pages 526-530, 2018 |
|
Analysis of Language Dependent Front-End for Speaker Recognition, , and , in: Proceedings of Interspeech 2018, Hyderabad, India, pages 1101-1105, 2018 |
[DOI] |
Experimental evaluation of speech enhancement methods in remote microphone systems for hearing aids, , , , and , in: Proc. EuroNoise 2018, Crete, Greece, pages 351-358, 2018 |
|
How to Tell Ancient Signs Apart? Recognizing and Visualizing Maya Glyphs with CNNs, , and , in: ACM Journal on Computing and Cultural Heritage (JOCCH), 11(4):20, 2018 |
[DOI] |
Warped Gaussian processes and derivative-based sequential design for functions with heterogeneous variations, , , and , in: SIAM/ASA Journal on Uncertainty Quantification, 6(3):991-1018, 2018 |
Implementing gender-dependent vowel-level analysis for boosting speech-based depression recognition, , , and , in: Proceedings of Interspeech 2017, 2017 |
|
Not All Samples Are Created Equal: Deep Learning with Importance Sampling, and , Idiap-RR-12-2018 |
|
Local Affine Approximations for Improving Knowledge Transfer, and , Idiap-Com-01-2018 |
[URL] |
WILDTRACK: A Multi-camera HD Dataset for Dense Unscripted Pedestrian Detection, , , , , , , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 5030-5039, 2018 |
[DOI] |
Knowledge Transfer with Jacobian Matching, and , Idiap-RR-04-2018 |
[URL] |
SGAN: An Alternative Training of Generative Adversarial Networks, and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 9407-9415, IEEE, 2018 |
[DOI] |
Implémentation d'un algorithme de réduction de taille des réseaux de neurones, , Idiap-RR-03-2018 |
|
Sequential Design of Computer Experiments, , in: Wiley StatsRef: Statistics Reference Online, Wiley, 2018 |
Analysis of eigenvalue decomposition-based late reverberation power spectral density estimation, and , in: IEEE Transaction on Acoustics, Speech and Language Processing, 26(6):1106-1118, 2018 |
|
Complexity reduction of eigenvalue decomposition-based diffuse power spectral density estimators using the power method, , and , in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 451-455, 2018 |
|
Joint late reverberation and noise power spectral density estimation in a spatially homogeneous noise field, and , in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 441-445, 2018 |
|
Self-Attentive Residual Decoder for Neural Machine Translation, , , and , in: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018 |
|
Learning embeddings: efficient algorithms and applications, , École Polytechnique Fédérale de Lausanne, 2018 |
[DOI] |
Novel Algorithms for Clustering, , École polytechnique fédérale de Lausanne, 2018 |
[DOI] |
Semi-blind spatially-variant deconvolution in optical microscopy with local Point Spread Function estimation by use of Convolutional Neural Networks, and , Idiap-RR-07-2018 |
|
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia, pages 74-79, 2018 |
[DOI] |
Towards directly modeling raw speech signal for speaker verification using CNNs, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, Canada, pages 4884-4888, 2018 |
|
Nasal Speech Sounds Detection Using Connectionist Temporal Classification, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2018 |
|
Generative Models for Learning Robot Manipulation Skills from Humans, , Ecole Polytechnique Federale de Lausanne, 2018 |
[DOI] |
DrinkSense: Characterizing Youth Drinking Behavior using Smartphones, , , , , and , in: IEEE Transactions on Mobile Computing, 2018 |
|
Cross-Eyed 2017: Cross-Spectral Iris/Periocular Recognition Competition, , , , , , , , , , , , , and , in: IEEE/IAPR International Joint Conference on Biometrics, Denver, Colorado, USA, IEEE, 2017 |
A Poisson regression approach to model monthly hail occurrence in Northern Switzerland using large-scale environmental variables, , and , in: Atmospheric Research, 203:261-274, 2018 |
[DOI] |
Theories and Models of Teams and Groups, , , , and , in: Small Group Research, 48(5):544--567, 2017 |
[DOI] |
SMILE Swiss German Sign Language Dataset, , , , , , , , , , , and , in: Language Resources and Evaluation Conference, 2018 |
Active Online Anomaly Detection using Dirichlet Process Mixture Model and Gaussian Process Classification, , , , and , in: IEEE Winter Conference on Applications of Computer Vision (WACV), Washington, 2017 |
|
Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, , , , , , , , , and , in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athens, Greece, pages 32-39, IEEE Computer Society, 2017 |
[DOI] [URL] |
On the Use of Convolutional Neural Networks for Speech Presentation Attack Detection, , , , and , in: International Conference on Identity, Security and Behavior Analysis, 2018 |
|
Deep Multi-Camera People Detection, and , in: Proceedings of the IEEE International Conference on Machine Learning and Applications, 2017 |
K-Medoids For K-Means Seeding, and , in: Proceedings of the international conference on Neural Information Processing Systems, 2017 |
Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition, , , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017 |
Multi-Modal Mean-Fields via Cardinality-Based Clamping, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017 |
Boosted Exudate Segmentation in Retinal Images using Residual Nets, , , and , in: Proceedings of the MICCAI Workshop on Ophthalmic Medical Image Analysis, 2017 |
Geometric calibration of Colour and Stereo Surface Imaging System of ESA's Trace Gas Orbiter, , , , , , , and , in: Advances in Space Research, 2018 |
Non-Markovian Globally Consistent Multi-Object Tracking, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection, , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction, , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Exploratory Study on Direct Prediction of Diabetes using Deep Residual Networks, , , and , in: Proceedings of the thematic conference on computational vision and medical image processing, 2017 |
Towards Weakly Supervised Acoustic Subword Unit Discovery and Lexicon Development Using Hidden Markov Models, , and , in: Speech Communication, 96:168-183, 2018 |
[DOI] |
DNN based speaker embedding using content information for text-dependent speaker verification, , , and , Idiap-RR-06-2018 |
|
Check Out This Place: Inferring Ambiance from Airbnb Photos, , , and , in: IEEE transactions on Multimedia, 20(6):1499-1511, 2018 |
[DOI] [URL] |
Development of the Geographical Proportional-to-size Street-Intercept Sampling (GPSIS) method for recruiting urban nightlife-goers in an entire city, , , , , , , and , in: International Journal of Social Research Methodology, 20(6):721-736, 2017 |
[DOI] |
Examining Linguistic Content and Skill Impression Structure for Job Interview Analytics in Hospitality, and , in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017 |
|
Learning Autonomous Behaviours for the Body of a Flexible Surgical Robot, , and , in: Autonomous Robots, 41(2):333-347, 2017 |
[DOI] [URL] |
Robot Learning with Task-Parameterized Generative Models, , in: Robotics Research, pages 111-126, Springer, 2018 |
[DOI] [URL] |
Supervisory teleoperation with online learning and optimal control, and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1534-1540, IEEE, 2017 |
[URL] |
Trajectory and Foothold Optimization using Low-Dimensional Models for Rough Terrain Locomotion, , , , , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1096-1103, IEEE, 2017 |
[URL] |
Learning Task-Space Synergies using Riemannian Geometry, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Vancouver, Canada, pages 73-78, IEEE, 2017 |
[URL] |
Generating Calligraphic Trajectories with Model Predictive Control, , and , in: Proc. 43rd Conf. on Graphics Interface, Edmonton, AL, Canada, pages 132-139, 2017 |
[DOI] |
Dynamic Graffiti Stylisation with Stochastic Optimal Control, , and , in: Intl Workshop on movement and computing (MOCO), London, UK, pages 1-8, ACM, 2017 |
[DOI] [URL] |
Visual Analysis of Maya Glyphs via Crowdsourcing and Deep Learning, , École Polytechnique Fédérale de Lausanne, 2017 |
[DOI] |
Deeply Vulnerable -- a study of the robustness of face recognition to presentation attacks, , and , in: IET (The Institution of Engineering and Technology) -- Biometrics:1--13, 2017 |
[DOI] |
Planification adaptative d'expériences numériques par paquets en contexte non stationnaire pour une étude de fissuration mécanique, , , , and , in: 23eme Congres Francais de Mecanique, 28 aout - 1er septembre 2017, Lille, France (FR), AFM, 2017 |
[URL] |
Non-parametric warping via local scale estimation for non-stationary Gaussian process modelling, , , and , in: Wavelets and Sparsity XVII, pages 1039421, International Society for Optics and Photonics, 2017 |
[DOI] [URL] |
Towards directly modeling raw speech signal for speaker verification using CNNs, , and , Idiap-RR-30-2017 |
|
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , Idiap-RR-02-2018 |
|
Estimating orthant probabilities of high dimensional Gaussian vectors with an application to set estimation, and , in: Journal of Computational and Graphical Statistics, 27(2):255-267, 2018 |
[DOI] [URL] |
On uncertainty quantification in hydrogeology and hydrogeophysics, , , , and , in: Advances in Water Resources, 110:166–181, 2017 |
[DOI] [URL] |
Towards Quantifying the Entropy of Fingervein Patterns across Different Feature Extractors, and , in: 2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis (ISBA), 2018 |
|
Nasal Speech Sounds Detection Using Connectionist Temporal Classification, and , Idiap-RR-28-2017 |
|
Combining Electromyography and Tactile Myography to Improve Hand and Wrist Activity Detection in Prostheses, , , and , in: Technologies, 5(4), 2017 |
|
On Job Training: Automated Interpersonal Behavior Assessment & Real-Time Feedback, , in: Proceedings of the 25th ACM International Conference on Multimedia, 2017 |
|
Bites'n'Bits: Inferring Eating Behavior from Contextual Mobile Data, , , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (PACM IMWUT), 1(4):125-157, 2017 |
|
Cognitive Speech Coding: Examining the Impact of Cognitive Speech Processing on Speech Compression, , and , in: IEEE Signal Processing Magazine, 35(3):97-109, 2018 |
[DOI] |
Direct inversion algorithm for focal plane scanning optical projection tomography, and , in: Biomedical Optics Express, 2017 |
|
What you can't see can help you -- extended-range imaging for 3D-mask presentation attack detection, and , in: Proceedings of the 16th International Conference on Biometrics Special Interest Group, Darmstadt (Germany), Gesellschaft fuer Informatik e.V. (GI), 2017 |
|
Evaluating Attention Networks for Anaphora Resolution, , , and , Idiap-RR-27-2017 |
|
Towards Document-Level Neural Machine Translation, , Idiap-RR-25-2017 |
|
A Domain Adaptation Approach to Improve Speaker Turn Embedding Using Face Representation, and , in: ACM International Conference on Multimodal Interaction, Glasgow, Scotland, ACM, 2017 |
|
Long-Term Spectral Statistics for Voice Presentation Attack Detection, , , and , in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 25(11):2098-2111, 2017 |
|
How May I Help You? Behavior and Impressions in Hospitality Service Encounters, , and , in: Proceedings of 19th ACM International Conference on Multimodal Interaction, 2017 |
|
Elderly People Living Alone: Detecting Home Visits with Ambient and Wearable Sensing, , , and , in: Proceedings of MMHealth, 2017 |
|
Venues in Social Media: Examining Ambiance Perception Through Scene Semantics, , and , in: Proceedings of the 25th ACM International Conference on Multimedia, ACM, 2017 |
|
Multilingual Hierarchical Attention Networks for Document Classification, and , in: Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP), pages 1015-1025, 2017 |
|
A reproducible study on remote heart rate measurement, , and , in: arXiv, 2017 |
[URL] |
Biometrics: In Search of Identity and Security (Q & A), , , , , and , in: IEEE MultiMedia, PP, 2017 |
[DOI] |
Maya Codical Glyph Segmentation: A Crowdsourcing Approach, , and , in: IEEE Transactions on Multimedia, 20(3):711-725, 2018 |
[DOI] [URL] |
On the Generalization of Fused Systems in Voice Presentation Attack Detection, , , , and , in: 16th International Conference of the Biometrics Special Interest Group, 2017 |
|
Presentation attack detection in voice biometrics, and , in: User-Centric Privacy and Security in Biometrics, The Institution of Engineering and Technology, 2017 |
|
Impact of score fusion on voice biometrics and presentation attack detection in cross-database evaluations, and , in: IEEE Journal of Selected Topics in Signal Processing, 11(4):695-705, 2017 |
[DOI] |
Improving speaker turn embedding by crossmodal transfer learning from face embedding, and , in: ICCV Workshop on Computer Vision for Audio-Visual Media, 2017 |
|
Perceptual Information Loss due to Impaired Speech Production, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017 |
|
Perceptual Information Loss due to Impaired Speech Production, , and , Idiap-RR-20-2017 |
|
On Modeling the Synergy Between Acoustic and Lexical Information for Pronunciation Lexicon Development, , École polytechnique fédérale de Lausanne (EPFL), 2017 |
[DOI] |
A Generative Model for Intention Recognition and Manipulation Assistance in Teleoperation, and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017 |
|
NeuroSpeech: An open-source software for Parkinson's speech analysis, , , , , , , , , , , , , , and , in: Digital Signal Processing, 2017 |
[DOI] |
A Sub-Quadratic Exact Medoid Algorithm, and , in: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017 |
Template-matching for Text-dependent Speaker Verification, , , and , in: Speech Communication, 2017 |
|
Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), and , in: Proceedings of the Third Workshop on Discourse in Machine Translation (DiscoMT), Copenhagen, Denmark, Association for Computational Linguistics (ACL), 2017 |
|
End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, , and , in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017 |
|
Sense-Aware Statistical Machine Translation using Adaptive Context-Dependent Clustering, , and , in: Proceedings of Second Conference on Machine Translation (WMT17), 2017 |
|
Learning Manipulability Ellipsoids for Task Compatibility in Robot Manipulation, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3183-3189, 2017 |
[URL] |
Gaussian Mixture Regression on Symmetric Positive Definite Matrices Manifolds: Application to Wrist Motion Estimation with sEMG, and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 59-64, 2017 |
[URL] |
Improving hand and wrist activity detection using tactile sensors and tensor regression methods on Riemannian manifolds, , and , in: Proc. of the Myoelectric Control Symposium, 2017 |
[URL] |
Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models, , and , Idiap-RR-26-2017 |
|
An Investigation of Deep Neural Networks for Multilingual Speech Recognition Training and Adaptation, , and , in: Proc. of Interspeech, 2017 |
|
Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, , , , , and , in: Computer Speech and Language, 2017 |
|
Supervised Gaze Bias Correction for Gaze Coding in Interactions, and , Idiap-RR-23-2017 |
|
MAAYA: Multimedia Methods to Support Maya Epigraphic Analysis, , , , , , and , in: Arqueologia computacional: Nuevos enfoques para el analisis y la difusion del patrimonio cultural, INAH-RedTDPC, 2017 |
|
Towards large scale multimedia indexing: A case study on person discovery in broadcast news, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
Shape Representations for Maya Codical Glyphs: Knowledge-driven or Deep?, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
Bob Speaks Kaldi, , , , and , in: Proc. of Interspeech, 2017 |
|
Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, , , , , and , Idiap-RR-16-2017 |
|
An Approach for Imitation Learning on Riemannian Manifolds, , , , and , in: IEEE Robotics and Automation Letters (RA-L), 2(3):1240-1247, 2017 |
[DOI] [URL] |
Learning adaptive dressing assistance from human demonstration, and , in: Robotics and Autonomous Systems, 93:61-75, 2017 |
[DOI] [URL] |
Insiders and Outsiders: Comparing Urban Impressions between Population Groups, , and , in: International Conference on Multimedia Retrieval, ACM, 2017 |
[DOI] |
Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment, , , and , in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017 |
|
Topic and Sentiment in Phrase-Based Statistical Machine Translation, , and , Idiap-RR-10-2017 |
|
A Posterior-Based Multi-Stream Formulation for G2P Conversion, and , in: IEEE Signal Processing Letters, 2017 |
|
Object Detection with Active Sample Harvesting, , École Polytechnique Fédérale de Lausanne, 2017 |
|
Large-Scale Image Segmentation with Convolutional Networks, , Sciences et Techniques de l’Ingénieur (STI), 2017 |
|
Using Coreference Links to Improve Spanish-to-English Machine Translation, and , in: Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), Valencia, Spain, pages 30-40, Association for Computational Linguistics (ACL), 2017 |
|
Using Coreference Links to Improve Spanish-to-English Machine Translation, and , Idiap-RR-07-2017 |
|
Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
|
Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, and , Idiap-RR-06-2017 |
|
Multilingual Hierarchical Attention Networks for Document Classification, and , Idiap-RR-17-2017 |
[URL] |
Explicit Document Modeling through Weighted Multiple-Instance Learning, and , in: Journal of Artificial Intelligence Research (JAIR), 58:591-626, 2017 |
|
Multilingual Visual Sentiment Concept Clustering and Analysis, , , , , , and , in: International Journal of Multimedia Information Retrieval, 2017 |
|
Real-time Multiple Head Tracking Using Texture and Colour Cues, and , Idiap-RR-02-2017 |
|
Intonation Modelling for Speech Synthesis and Emphasis Preservation, , École Polytechnique Fédérale de Lausanne, 2017 |
[DOI] |
The SIWIS French Speech Synthesis Database – Design and recording of a high quality French database for speech synthesis, , , and , Idiap-RR-03-2017 |
|
On the Impact of Non-modal Phonation On Phonological Features, , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Multi-view Representation Learning Via GCCA for Multimodal Analysis of Parkinson's Disease, , , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Towards Weakly Supervised Acoustic Subword Unit Discovery and Lexicon Development Using Hidden Markov Models, , and , Idiap-RR-15-2017 |
|
A MultiPath Network for Object Detection, , , , , , and , in: Proceedings of the British Machine Vision Conference, BMVA Press, 2016 |
[URL] |
Learning to Refine Object Segments, , , and , in: Computer Vision - ECCV 2016, Amsterdam, pages 75-91, Springer, 2016 |
[DOI] [URL] |
Unsupervised Interpretable Pattern Discovery in Time Series Using Autoencoders, , , and , in: IAPR Int. Workshops on Structural and Syntactic Pattern Recognition (SSPR), 2016 |
|
CRF-Based Context Modeling for Person Identification in Broadcast Videos, , , and , in: Frontiers in ICT: Computer Image Analysis, 3, 2016 |
|
Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, , and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
|
Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, , and , Idiap-RR-08-2017 |
|
Manual and automatic labeling of discourse connectives for machine translation (Keynote paper), , in: TextLink: Structuring Discourse in Multilingual Europe (Handbook of the Second Action Conference), Budapest, Hungary, pages 16-20, 2016 |
[URL] |
Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction, , , , , , , , and , in: Proceedings of WMT 2016 (First Conference on Machine Translation), Association for Computational Linguistics, Berlin, Germany, pages 525–542, 2016 |
[URL] |
Comparing Two Strategies for Query Expansion in a News Monitoring System, and , in: Natural Language Processing and Information Systems: 21st International Conference on Applications of Natural Language to Information Systems, NLDB 2016, pages 267-275, Springer-Verlag, 2016 |
[DOI] |
Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens, , , , , , and , in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, Tokyo, Japan, pages 21-28, ACM, 2016 |
[DOI] |
Exploiting Sequence Information for Text-dependent Speaker Verification, , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, pages 5370-5374, 2017 |
|
Content Normalization for Text-independent Speaker Verification, , , and , Idiap-RR-31-2017 |
|
Exploiting Sequence Information for Text-dependent Speaker Verification, , , and , Idiap-RR-04-2017 |
|
Template-matching for Text-dependent Speaker Verification, , , and , Idiap-RR-32-2017 |
|
Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, , , and , in: Proceedings of Interspeech 2016, pages 2199-2203, 2016 |
Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, , , and , Idiap-RR-09-2018 |
|
Computational Analysis of Urban Places Using Mobile Crowdsensing, , École Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
Long Term Spectral Statistics for Voice Presentation Attack Detection, , , and , Idiap-RR-11-2017 |
|
Online Inference in Bayesian Non-Parametric Mixture Models under Small Variance Asymptotics, and , in: NIPS workshop on Advances in Approximate Bayesian Inference, Barcelona, Spain, pages 1-5, 2016 |
[URL] |
Learning From Humans, , and , in: Handbook of Robotics, pages 1995-2014, Springer, 2016 |
[DOI] [URL] |
Online motion synthesis with minimal intervention control and formal safety guarantees, , , and , in: Proc. IEEE Intl Conf. on Systems, Man, and Cybernetics, Budapest, Hungary, 2016 |
|
Dexterous Undersea Interventions with Far Distance Onshore Supervision: the DexROV Project, , , , , , , , , , , , , , , , , , , , , , and , in: IFAC Conference on Control Applications in Marine Systems (CAMS), Trondheim, Norway, pages 414-419, 2016 |
[DOI] [URL] |
Stochastic learning and control in multiple coordinate systems, , in: Intl Workshop on Human-Friendly Robotics, Genoa, Italy, pages 1-5, 2016 |
|
Scalable greedy algorithms for transfer learning, , and , in: Computer Vision and Image Understanding, 2016 |
Fast Rates by Transferring from Auxiliary Hypotheses, and , in: Machine Learning, 2016 |
Maya Codical Glyph Segmentation: A Crowdsourcing Approach, , and , Idiap-RR-01-2017 |
|
Redundant Hash Addressing for Large-Scale Query by Example Spoken Query Detection, , and , Idiap-RR-31-2016 |
|
Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), and , Idiap-RR-29-2016 |
|
Information Theoretic Analysis of Production-Perception Efficiency: Case Study of Speech Pathology, , and , Idiap-RR-30-2016 |
|
What TripAdvisor Can't Tell: Crowdsourcing Urban Impressions for Whole Cities, , and , in: Digital Polis, L'Oeil d'Or (translated to French), 2018 |
|
SenseCityVity: Mobile Crowdsourcing, Urban Awareness, and Collective Action in Mexico, , , , , , , , , and , in: IEEE Pervasive Computing, Special Issue on Smart Cities, 16(2):44-53, 2017 |
|
Cognitive speech coding, and , Idiap-RR-27-2016 |
|
Learning assistive teleoperation behaviors from demonstration, and , in: Proc. IEEE International Symposium on Safety, Security and Rescue Robotics, pages 258-263, 2016 |
|
Learning dynamic graffiti strokes with a compliant robot, , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3981-3986, 2016 |
[URL] |
Implementation of the Standard I-vector System for the Kaldi Speech Recognition Toolkit, , , and , Idiap-RR-26-2016 |
|
Nested Mini-Batch K-Means, and , in: Proceedings of NIPS, 2016 |
Speech vocoding for laboratory phonology, , and , in: Computer Speech and Language, 2016 |
|
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , in: 9th ISCA Speech Synthesis Workshop, 2016 |
|
Human versus Machine Attention in Document Classification: A Dataset with Crowdsourced Annotations, and , in: Proceedings of the EMNLP 2016 Workshop on Natural Language Processing for Social Media, Austin, USA, 2016 |
|
On the impact of non-modal phonation on phonological features, , , , , , , , , , , , , and , Idiap-RR-28-2016 |
|
Anomaly detection in elderly daily behavior in ambient sensing environments, , , and , in: Proceedings of the 7th Int. Workshop on Human Behavior Understanding, ACM Multimedia, Amsterdam, Netherlands, 2016 |
|
The REPLAY-MOBILE Face Presentation-Attack Database, , , and , in: Proceedings of the International Conference of the Biometrics Special Interest Group, 2016 |
|
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, , , and , in: IEEE/ACM Trans. on Audio, Speech and Language Processing, 2016 |
|
Feature mapping using far-field microphones for distant speech recognition, , , and , Idiap-RR-20-2016 |
|
InnerView: Learning Place Ambiance from Social Media Images, , and , in: Proceedings of the 24th ACM International Conference on Multimedia, ACM, 2016 |
[DOI] |
The Night is Young: Urban Crowdsourcing of Nightlife Patterns, , , , , , and , in: Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, ACM, 2016 |
[DOI] |
Sparse Hidden Markov Models for Exemplar-based Speech Recognition Using Deep Neural Network Posterior Features, , and , Idiap-RR-19-2016 |
|
Explicit Suggestion of Query Terms for News Search using Topic Models and Word Embeddings, and , Idiap-RR-21-2016 |
|
Feature mapping using far-field microphones for distant speech recognition, , , and , in: Speech Communication, 83:1-9, 2016 |
[DOI] [URL] |
Efficient Posterior Exemplar Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , in: Interspeech, San Francisco, CA, 2016 |
|
On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, , and , in: Speech Communication, 84:36-45, 2016 |
[DOI] [URL] |
PAoS Markers: Trajectory Analysis of Selective Phonological Posteriors for Assessment of Progressive Apraxia of Speech, , and , in: Proceedings of the 7th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), 2016 |
|
Word Sequence Modeling using Deep Learning: an End-to-end Approach and its Applications, , EPFL, 2016 |
[DOI] |
Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification, , and , in: International Conference of the Biometrics Special Interest Group (BIOSIG), 2016 |
|
Transferring Neural Representations for Low-dimensional Indexing of Maya Hieroglyphic Art, , , , , , , , and , in: Proc. ECCV Workshop on Computer Vision for Art Analysis, Amsterdam, pages 842-855, Springer, 2016 |
[DOI] [URL] |
Emphasis Recreation for TTS using Intonation Atoms, and , in: 9th ISCA Speech Synthesis Workshop, pages 14-20, 2016 |
[DOI] |
Learning Controllers for Reactive and Proactive Behaviors in Human-Robot Collaboration, , , and , in: Frontiers in Robotics and AI, 3(30):1-11, 2016 |
[DOI] |
Learning Physical Collaborative Robot Behaviors from Human Demonstrations, , , , and , in: IEEE Trans. on Robotics, 32(3):513-527, 2016 |
[DOI] [URL] |
Variable Duration Movement Encoding with Minimal Intervention Control, , and , in: Proc. of the IEEE Intl Conf. on Robotics and Automation (ICRA), pages 497-503, 2016 |
|
Joint Operation of Voice Biometrics and Presentation Attack Detection, and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016 |
[URL] |
Overview of BTAS 2016 Speaker Anti-spoofing Competition, , , , , , , , , , , , , , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016 |
[URL] |
Cross-database evaluation of audio-based spoofing detection systems, and , in: Interspeech, San Francisco, USA, 2016 |
[URL] |
Inter-task System Fusion for Speaker Recognition, , , , and , in: Proceedings of INTERSPEECH, 2016 |
|
Scalable Metric Learning via Weighted Approximate Rank Component Analysis, and , in: ECCV 2016, 2016 |
|
Fast K-Means with Accurate Bounds, and , in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016 |
Phrase Representations for Multiword Expressions, and , in: Proceedings of the 12th Workshop on Multiword Expressions, 2016 |
|
Neural Network-based Word Alignment through Score Aggregation, , and , in: Proceedings of the ACL 1st Conference on Machine Translation, 2016 |
|
Deep Neural Networks for Syntactic Parsing of Morphologically Rich Languages, and , in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016 |
|
A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition, , , , and , in: IEEE Signal Processing Letters, 23(4):527-531, 2016 |
|
Building Word Embeddings for Solving Natural Language Processing, , École Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
On ANOVA Decompositions of Kernels and Gaussian Random Field Paths, , , , and , in: Monte Carlo and Quasi-Monte Carlo Methods, pages 315-330, Springer International Publishing, 2016 |
[DOI] |
Design of Computer Experiments Using Competing Distances Between Set-Valued Inputs, , , and , in: mODa 11 - Advances in Model-Oriented Design and Analysis, pages 123-131, Springer International Publishing, 2016 |
[DOI] |
End-to-End Acoustic Modeling using Convolutional Neural Networks for Automatic Speech Recognition, , and , Idiap-RR-18-2016 |
|
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, , and , in: Interspeech, 2016 |
|
HMM-based Non-native Accent Assessment using Posterior Features, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
|
When Naïve Bayes Nearest Neighbors Meet Convolutional Neural Networks, , and , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016 |
|
Pronoun Language Model and Grammatical Heuristics for Aiding Pronoun Prediction, and , in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, ACL, 2016 |
|
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , Idiap-RR-22-2016 |
|
Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery, and , in: Proceedings of Interspeech, 2016 |
|
Modeling Unvoiced Sounds In Statistical Parametric Speech Synthesis with a Continuous Vocoder, , , and , in: Proc. of EUSIPCO, Budapest, Hungary, 2016 |
|
PhonVoc: A Phonetic and Phonological Vocoding Toolkit, and , in: Interspeech, San Francisco, USA, 2016 |
|
Sound Pattern Matching for Automatic Prosodic Event Detection, , , , and , in: Interspeech, San Francisco, USA, 2016 |
|
Improving Pronoun Translation by Modeling Coreference Uncertainty, and , in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, 2016 |
|
The SIWIS database: a multilingual speech database with acted emphasis, , , , , , , , , , , and , in: Proceedings of Interspeech, San Francisco, USA, pages 1532-1535, 2016 |
[DOI] |
Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
|
Simultaneous temporal superresolution and denoising for cardiac fluorescence microscopy, , , and , in: IEEE Transactions on Computational Imaging, 2016 |
[DOI] [URL] |
Proceedings of the 16th International Conference on Multimodal Interaction, ICMI 2014, Istanbul, Turkey, November 12-16, 2014, , , , , and , ACM, 2014
Brief Introduction to the Special Issue on Behavior Understanding for Arts and Entertainment, , , , and , in: ACM Transactions on Interactive Intelligent Systems, 5(2):6, 2015 |
[DOI] |
Rapport with Virtual Agents: What do Human Social Cues and Personality Explain?, , and , in: IEEE Transactions on Affective Computing, 8(3):382-395, 2017 |
[DOI] |
Predicting the Performance in Decision-Making Tasks: From Individual Cues to Group Interaction, and , in: IEEE Transactions on Multimedia, 18(4):643-658, 2016 |
[DOI] [URL] |
High-slope terrain locomotion for torque-controlled quadruped robots, , , , , and , in: Autonomous Robots, 2016 |
[DOI] [URL] |
Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences, , , , and , in: Proceedings of the IEEE International Conference on Robotics and Automation, 2016
Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, , and , in: Data & Knowledge Engineering Journal, 2016 |
Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, , and , Idiap-RR-16-2016 |
|
Towards End-to-End Speech Recognition, , École Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
Importance Sampling Tree for Large-scale Empirical Expectation, , and , in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016
A Sub-Quadratic Exact Medoid Algorithm, and , Idiap-RR-19-2017 |
|
Heterogeneous Face Recognition using Inter-Session Variability Modelling, and , in: IEEE Computer Society Workshop on Biometrics, Las Vegas - USA, IEEE, 2016 |
|
A Contextual Language Model to Improve Machine Translation of Pronouns by Re-ranking Translation Hypotheses, and , in: European Association for Machine Translation, 2016 |
ISWC 2013--Wearables are Here to Stay, , , and , in: IEEE Pervasive Computing, 13(1):14-18, 2014 |
[DOI] |
System Fusion and Speaker Linking for Longitudinal Diarization of TV Shows, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5495-5499, IEEE, 2016 |
|
Quantifying uncertainties on excursion sets under a Gaussian random field prior, , , and , in: SIAM/ASA J. Uncertainty Quantification, 4(1):850-874, 2016 |
[DOI] [URL] |
Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework, , and , in: Speech Communication, 80, 2016 |
[DOI] |
Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5690-5694, IEEE, 2016 |
|
Information Theoretic Clustering for Unsupervised Domain-Adaptation, , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5580-5584, IEEE, 2016 |
|
Deep Neural Network Based Posteriors for Text-dependent Speaker Verification, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5050-5054, IEEE, 2016 |
|
Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , Idiap-RR-10-2016 |
|
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, , , and , Idiap-RR-11-2016 |
|
"Can you hear me now?" --- Automatic assessment of background noise intrusiveness and speech intelligibility in telecommunications, , Sciences et Techniques de l’Ingénieur (STI), 2016 |
[DOI] |
Overview of BTAS 2016 Speaker Anti-spoofing Competition, , , , , , , , , , , , , , , and , Idiap-RR-24-2016 |
[URL] |
Joint Operation of Voice Biometrics and Presentation Attack Detection, and , Idiap-RR-25-2016 |
[URL] |
Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph, , and , in: Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), New York, NY, ACM Press, 2016
Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation, and , in: Proceedings of the 10th Language Resources and Evaluation Conference (LREC), Portoroz, Slovenia, 2016 |
|
Tracking Interacting Objects Using Intertwined Flows, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016 |
Principled Parallel Mean-Field Inference for Discrete Random Fields, , , and , in: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, 2016
The SIWIS database: a multilingual speech database with acted emphasis, , , , , , , , , , , and , Idiap-RR-13-2016 |
|
Probabilistic Amplitude Demodulation features in Speech Synthesis for Improving Prosody, , and , Idiap-RR-12-2016 |
|
On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, , and , Idiap-RR-07-2016 |
[URL] |
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, , and , Idiap-RR-06-2016 |
|
Low-Rank Representation For Enhanced Deep Neural Network Acoustic Models, , Idiap-RR-05-2016 |
|
Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, , , and , Idiap-RR-04-2016 |
|
Sound Pattern Matching for Automatic Prosodic Event Detection, , , , and , Idiap-RR-03-2016 |
|
Multilingual Visual Sentiment Concept Matching, , , , , , and , in: Proceedings of the International Conference on Multimedia Retrieval (ICMR), 2016 |
|
Large Scale Hard Sample Mining with Monte Carlo Tree Search, and , in: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, 2016 |
|
Deep Dynamic Neural Networks for Multimodal Gesture Segmentation and Recognition, , , , , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016 |
|
Wiki-LDA: A Mixed-Method Approach for Effective Interest Mining on Twitter Data, , , and , in: Proceedings of CSEDU 2016, 2016 |
|
Learning Robot Manipulation Tasks with Task-Parameterized Semi-Tied Hidden Semi-Markov Model, and , in: IEEE Robotics and Automation Letters, 1(1):235-242, 2016 |
[DOI] [URL] |
Learning Explainable User Sentiment and Preferences for Information Filtering, , École Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
A Point-Spread-Function-Aware Filtered Backprojection Algorithm for Focal-Plane-Scanning Optical Projection Tomography, and , in: 2016 IEEE International Symposium on Biomedical Imaging, 2016 |
An Analysis of Rhythmic Staccato-Vocalization Based on Frequency Demodulation for Laughter Detection in Conversational Meetings, , , and , Idiap-RR-02-2016 |
|
On Learning Grapheme-to-Phoneme Relationships through the Acoustic Speech Signal, and , in: The Phonetician, 109–110:6-23, 2014 |
|
Global Optimization with Sparse and Local Gaussian Process Models, and , in: Machine Learning, Optimization, and Big Data, pages 185-196, Springer International Publishing, 2015 |
[DOI] |
Differentiating the Multipoint Expected Improvement for Optimal Batch Design, , and , in: Machine Learning, Optimization, and Big Data, pages 37-48, Springer International Publishing, 2015 |
[DOI] |
Sparse Subspace Modeling for Query by Example Spoken Term Detection, , and , Idiap-RR-01-2016 |
|
Trustworthy Biometric Verification under Spoofing Attacks: Application to the Face Mode, , École Polytechnique Fédérale de Lausanne, 2015 |
[URL] |
Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT 2015), , , and , Association for Computational Linguistics, 2015 |
[URL] |
Klewel Webcast: from Research to Growing Company, , , and , in: IEEE Multimedia, 22(4):94-99, 2015 |
|
Computer vision profiling of neurite outgrowth dynamics reveals spatio-temporal modularity of Rho GTPase signaling, , , , , , , , , and , in: Journal of Cell Biology, 212(1):91-111, 2016 |
[DOI] |
Combining dynamic head pose-gaze mapping with the robot conversational state for attention recognition in human-robot interactions, and , in: Pattern Recognition Letters, 66:81-90, 2015 |
|
Integration of Real-Time Speech Processing Technologies for Online Gaming, , and , Idiap-Com-01-2016 |
|
Transfer Learning through Greedy Subset Selection, , and , in: Image Analysis and Processing - ICIAP 2015, Genoa, Italy, pages 3-14, Springer International Publishing, 2015 |
[DOI] |
Robot Learning with Task-Parameterized Generative Models, , in: Proc. Intl Symp. on Robotics Research, 2015 |
|
Learned Minimal Intervention Control Synthesis based on Hidden Semi-Markov Models, , and , in: Proc. of the 8th Intl Workshop on Human-Friendly Robotics, pages 17, 2015 |
|
Jointly Informative Feature Selection, and , in: Journal of Machine Learning Research, 2016 |
Kullback-Leibler Proximal Variational Inference, , , and , in: Proceedings of the International Conference on Neural Information Processing Systems, pages 3402-3410, 2015 |
|
Loud and Trendy: Crowdsourcing Impressions of Social Ambiance in Popular Indoor Urban Places, and , in: Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia, pages 211-220, ACM, 2015 |
[DOI] [URL] |
CommuniSense: Crowdsourcing Road Hazards in Nairobi, , , , , , and , in: Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services, Copenhagen, Denmark, pages 445-456, ACM, 2015 |
[DOI] [URL] |
Looking at Cities in Mexico with Crowds, , and , in: Proceedings of the 2015 Annual Symposium on Computing for Development, London, United Kingdom, pages 127-135, ACM, 2015 |
[DOI] [URL] |
Learning to Segment Object Candidates, , and , in: Advances in Neural Information Processing Systems, Montreal, Canada, pages 1990-1998, Curran Associates, Inc., 2015 |
[URL] |
On degeneracy and invariances of random fields paths with applications in Gaussian process modelling, , and , in: Journal of Statistical Planning and Inference, 170:117-128, 2016 |
[DOI] |
Evaluating Shape Representations for Maya Glyph Classification, , and , in: ACM Journal on Computing and Cultural Heritage (JOCCH), 9(3), 2016 |
Ancient Maya Writings as High-Dimensional Data: a Visualization Approach, , , and , in: Digital Humanities (DH), Krakow, 2016 |
|
On Compressibility of Neural Network Phonological Features for Low Bit Rate Speech Coding, , and , in: Proceedings of Interspeech, pages 418-422, ISCA, 2015 |
|
Intonation atom based emphasis transfer, and , Idiap-RR-14-2016 |
|
TDOA Matrices: Algebraic Properties and their Application to Robust Denoising with Missing Data, , , and , in: IEEE Transactions on Signal Processing, 64(20):5242-5254, 2016 |
[DOI] [URL] |
Information Theoretic Clustering for Unsupervised Domain-Adaptation, , and , Idiap-RR-09-2016 |
|
Deep Neural Network Based Posteriors for Text-dependent Speaker Verification, , , and , Idiap-RR-08-2016 |
|
Pronunciation Lexicon Development for Under-Resourced Languages Using Automatically Derived Subword Units: A Case Study on Scottish Gaelic, , and , in: 4th Biennial Workshop on Less-Resourced Languages, 2015 |
|
Happy and Agreeable? Multi-Label Classification of Impressions in Social Video, , and , in: International Conference on Mobile and Ubiquitous Multimedia, Linz, Austria, pages 109-120, ACM, 2015 |
[DOI] [URL] |
Towards Multiple Pronunciation Generation in Acoustic G2P Conversion Framework, , and , Idiap-RR-34-2015 |
|
3D Gaze Estimation from Remote RGB-D Sensors, , École Polytechnique Fédérale de Lausanne, 2015 |
[DOI] |
Head Nod Detection from a Full 3D Model, , and , in: Proceedings of the ICCV 2015, pages 528-536, 2015 |
|
Posterior-Based Multi-Stream Formulation To Combine Multiple Grapheme-to-Phoneme Conversion Techniques, and , Idiap-RR-33-2015 |
|
HMM-based Non-native Accent Assessment using Posterior Features, , and , Idiap-RR-32-2015 |
|
Predicting the intrusiveness of noise through sparse coding with auditory kernels, and , in: Speech Communication, 76:186-200, 2016 |
[DOI] [URL] |
Fast K-Means with Accurate Bounds, and , Idiap-RR-17-2016 |
|
A Tutorial on Task-Parameterized Movement Learning and Retrieval, , in: Intelligent Service Robotics, 9(1):1-29, 2016 |
[DOI] [URL] |
Towards utterance-based neural network adaptation in acoustic modeling, , , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015 |
|
Gaze Estimation in the 3D Space Using RGB-D sensors. Towards Head-Pose And User Invariance, and , in: International Journal of Computer Vision, 118(2):194-216, 2016 |
[DOI] [URL] |
Deciphering the Silent Participant. On the Use of Audio-Visual Cues for the Classification of Listener Categories in Group Discussions, , , and , in: Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, Washington, USA, pages 107-114, ACM, 2015 |
[DOI] |
Within- and Cross-Database Evaluations for Gender Classification via BeFIT Protocols, , and , in: International Workshop on Multimedia Signal Processing, pages 1-6, 2014 |
[DOI] [URL] |
Palm Vein Database and Experimental Framework for Reproducible Research, and , in: IEEE International Conference of the Biometrics Special Interest Group, pages 1-7, 2015 |
[DOI] [URL] |
Finger vein Liveness Detection Using Motion Magnification, , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-7, 2015 |
[DOI] |
Personality Trait Classification via Co-Occurrent Multiparty Multimodal Event Discovery, , and , in: Proceedings of the ACM International Conference on Multimodal Interaction, Seattle, Washington, USA, pages 15-22, ACM, 2015 |
[DOI] |
Learning bimanual end-effector poses from demonstrations using task-parameterized dynamical systems, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 464-470, 2015 |
|
Learning Optimal Controllers in Human-robot Cooperative Transportation Tasks with Position and Force Constraints, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 1024-1030, 2015 |
|
Learning the Stiffness of a Continuous Soft Manipulator from Multiple Demonstrations, , , and , in: Intelligent Robotics and Applications, pages 185-195, Springer, 2015 |
[DOI] [URL] |
Overlapping Speech, Utterance Duration and Affective Content in HHI and HCI - a Comparison, , , , and , in: 6th IEEE Conference on Cognitive Infocommunications, Gyor, pages 83-88, 2015 |
[DOI] |
Enabling speech applications using Ad-Hoc Microphone Arrays, , École Polytechnique Fédérale de Lausanne, 2015 |
|
Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization, , , , , and , in: IEEE Transactions on Signal Processing, 64(3):567-579, 2016 |
[DOI] |
Adaptive Sentiment-Aware One-Class Collaborative Filtering, and , in: Expert Systems with Applications, 43:23-41, 2016 |
[DOI] [URL] |
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology, , , , , and , in: Proceedings of the ACM International Conference on Multimedia, pages 159-168, 2015 |
|
Statistical Models in Automatic Speech Recognition, , University of Fribourg, Department of Mathematics, 2015 |
|
Exploring Dataset Similarities using PCA-based Feature Selection, , , and , in: Proceedings of ACII, Xi'an, pages 387-393, IEEE, 2015 |
[DOI] |
Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies (Extended Abstract), , , , , , and , in: Proceedings of ACII, Xi'an, pages 470-476, IEEE, 2015 |
[DOI] |
Annotators' agreement and spontaneous emotion classification performance, and , in: Proceedings of Interspeech, pages 1546-1550, 2015 |
|
Pronoun Translation and Prediction with or without Coreference Links, , and , in: Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), Lisbon, Portugal, pages 94–100, 2015 |
|
Spatial Sound Localization via Multipath Euclidean Distance Matrix Recovery, , , , and , in: IEEE Journal of Selected Topics in Signal Processing, 9(5):802-814, 2015 |
|
Computational Methods for Underdetermined Convolutive Speech Localization and Separation via Model-based Sparse Component Analysis, , , and , in: Speech Communication, 76:201-217, 2016 |
|
Automatic social role recognition and its application in structuring multiparty interactions, , EPFL, 2015 |
|
Articulatory feature based continuous speech recognition using probabilistic lexical modeling, and , in: Computer Speech and Language, 36:233-259, 2016 |
[DOI] |
HAVC-II - Idiap Private Cloud (Technical Inside-Out), , Idiap-Com-01-2015 |
|
On the Vulnerability of Speaker Verification to Realistic Voice Spoofing, , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-8, IEEE, 2015 |
[DOI] [URL] |
Exploiting foreign resources for DNN-based ASR, , , , and , in: EURASIP Journal on Audio, Speech, and Music Processing (2015:17), 2015 |
[DOI] |
Syntactic Parsing of Morphologically Rich Languages Using Deep Neural Networks, and , Idiap-RR-25-2015 |
|
Joint RNN-Based Greedy Parsing and Word Composition, and , in: Proceedings of ICLR 2015, 2015 |
|
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , in: Proceedings of Interspeech, Dresden, pages 11-15, ISCA, 2015 |
|
Face Recognition Systems Under Spoofing Attacks, , , and , Idiap-RR-18-2020 |
|
A Deeper Look at Dataset Bias, , , and , in: German Conference on Pattern Recognition, Aachen, Germany, pages 504–516, Springer International Publishing, 2015 |
[DOI] |
Rewards-driven control of robot arm by decoding EEG signals, , and , in: Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE, pages 1658-1661, IEEE, 2014 |
[DOI] [URL] |
Simple Image Description Generator via a Linear Phrase-based Model, , and , Idiap-RR-22-2015 |
|
Transfer in Inverse Reinforcement Learning for Multiple Strategies, and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan, pages 3244-3250, IEEE, 2013 |
[DOI] [URL] |
Autonomous reinforcement learning with experience replay, and , in: Neural Networks, 41:156-167, 2013 |
[DOI] [URL] |
"The Sum of Its Parts": Joint Learning of Word and Phrase Representations with Autoencoders, and , Idiap-RR-21-2015 |
|
Rehabilitation of Count-based Models for Word Vector Representations, and , in: Computational Linguistics and Intelligent Text Processing, pages 417-429, Springer International Publishing, 2015 |
From Image-level to Pixel-level Labeling with Convolutional Networks, and , in: Computer Vision and Pattern Recognition (CVPR), Boston, MA, pages 1713-1721, IEEE, 2015 |
[DOI] [URL] |
Phrase-based Image Captioning, , and , in: International Conference on Machine Learning (ICML), Lille, France, pages 2085–2094, JMLR, 2015 |
[URL] |
DexROV: Dexterous Undersea Inspection and Maintenance in Presence of Communication Latencies, , , , , , , , , , , , , , , and , in: IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV), pages 218-223, 2015 |
Dictionary Learning for Sparse Representation of Neural Network Exemplars in Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, pages 1093, 2015 |
|
Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-Based Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015 |
|
DNN-based Speech Synthesis: Importance of input features and training data, , and , in: International Conference on Speech and Computer, SPECOM, pages 193-200, Springer Berlin Heidelberg, 2015 |
[DOI] |
Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, , , and , Idiap-RR-20-2015 |
|
KL-HMM Based Speaker Diarization System for Meetings, and , in: Proceedings of ICASSP 2015, pages 4435-4439, 2015 |
|
Combining SGMM Speaker Vectors and KL-HMM Approach for Speaker Diarization, , and , in: Proceedings of ICASSP 2015, pages 4834-4837, 2015 |
|
Employment of Subspace Gaussian Mixture Models in Speaker Recognition, , , and , in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015 |
[URL] |
Exploiting foreign resources for DNN-based ASR, , , , and , Idiap-RR-27-2015 |
|
Sparse Modeling of Posterior Exemplars for Keyword Detection, , , and , in: Proceedings of Interspeech, pages 3690-3694, 2015 |
|
Weighted Correlation based Atom Decomposition Intonation Modelling, , , and , in: Proceedings of Interspeech, Dresden, Germany, pages 1601-1605, 2015 |
|
Automatic Recognition of Emergent Social Roles in Small Group Interactions, and , in: IEEE Transactions on Multimedia, 17(5):746-760, 2015 |
[DOI] |
Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition, , , , and , in: Proceedings of Interspeech, pages 741-745, 2015 |
|
Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 1191-1195, ISCA, 2015 |
|
Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, and , Idiap-RR-14-2015 |
|
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , in: Proceedings of Interspeech, 2015 |
|
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , Idiap-RR-12-2015 |
|
Computational Analysis Of Behavior In Employment Interviews And Video Resumes, , École Polytechnique Fédérale de Lausanne, 2015 |
|
Probability Occupancy Maps for Occluded Depth Images, , and , in: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pages 2829-2837, 2015 |
Machine learning-based tools to model and to remove the off-target effect, , , and , in: Pattern Analysis and Applications, 20(1):87-100, 2017 |
[DOI] |
Adaptive relevance feedback for large-scale image retrieval, and , in: Multimedia Tools and Applications, 75(12):6777-6807, 2016 |
[DOI] |
Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-based Speech Recognition, , and , in: Speech Communication: Special Issue on Advances in Sparse Modeling and Low-rank Modeling for Speech Processing, 76:230–244, 2016 |
[DOI] |
Dynamic structure and protein expression of the live embryonic heart captured by 2-photon light sheet microscopy and retrospective registration, , , , , and , in: Biomedical Optics Express, 6(6):2056-2066, 2015 |
[DOI] [URL] |
An Empirical Model of Emphatic Word Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 573-577, ISCA, 2015 |
|
Acoustic Data-Driven Grapheme-to-Phoneme Conversion in the Probabilistic Lexical Modeling Framework, , and , Idiap-RR-10-2015 |
|
Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, , , , , and , Idiap-RR-09-2015 |
|
Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, , , , , and , in: Proceedings of the ACL-IJCNLP 2015 Student Research Workshop, Beijing, China, pages 8-15, 2015 |
|
Disambiguating Discourse Connectives for Statistical Machine Translation, , and , in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(7):1184-1197, 2015 |
[DOI] |
In the Mood for Vlog: Multimodal Inference in Conversational Social Video, , , and , in: ACM Transactions on Interactive Intelligent Systems, 5(2), 2015 |
[DOI] |
Speech vocoding for laboratory phonology, , and , Idiap-RR-07-2015 |
|
Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition; Comparison with the Envelope-Variance Measure, , , , and , Idiap-RR-30-2015 |
|
Learning Feature Mapping using Deep Neural Network Bottleneck Features for Distant Large Vocabulary Speech Recognition, , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4540-4544, 2015 |
[DOI] |
Joint Speaker Verification and Anti-Spoofing in the i-Vector Space, , , , and , in: IEEE Transactions on Information Forensics and Security, 10(4):821-832, 2015 |
[DOI] |
Gender Classification by LUT based boosting of Overlapping Block Patterns, , and , in: Scandinavian Conference on Image Analysis, pages 530-542, Springer International Publishing, 2015 |
[DOI] [URL] |
An Empirical Model of Emphatic Word Detection, and , Idiap-RR-11-2015 |
|
Reconstruction of Images from Gabor Graphs with Applications in Facial Image Processing, , , and , in: Journal of Wavelets, Multiresolution and Information Processing, 13(4):25, 2015 |
[DOI] |
Incremental Syllable-Context Phonetic Vocoding, , , , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(6), 2015 |
[URL] |
Learning linearly separable features for speech recognition using convolutional neural networks, , and , Idiap-RR-24-2015 |
[URL] |
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , Idiap-RR-23-2015 |
|
On the Application of Automatic Subword Unit Derivation and Pronunciation Generation for Under-Resourced Language ASR: A Study on Scottish Gaelic, , and , Idiap-RR-13-2015 |
|
Query Refinement Using Conversational Context: a Method and an Evaluation Resource, and , in: Proceedings of NLDB 2015 (20th International Conference on Applications of Natural Language to Information Systems), Passau, Germany, pages 89-102, Springer-Verlag Berlin, 2015 |
[DOI] |
Integrated Pronunciation Learning for Automatic Speech Recognition Using Probabilistic Lexical Modeling, , and , in: International Conference on Acoustics, Speech and Signal Processing, South Brisbane, QLD, pages 5176-5180, 2015 |
[DOI] |
An HMM-Based Formalism for Automatic Subword Unit Derivation and Pronunciation Generation, and , in: International Conference on Acoustics, Speech and Signal Processing, pages 4639-4643, IEEE, 2015 |
[DOI] |
Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, , and , in: International Conference on Acoustics, Speech and Signal Processing, South Brisbane, QLD, pages 4295-4299, IEEE, 2015 |
|
On the Vulnerability of Palm Vein Recognition to Spoofing Attacks, and , in: The 8th IAPR International Conference on Biometrics (ICB), pages 319-325, 2015 |
[DOI] [URL] |
The 1st Competition on Counter Measures to Finger Vein Spoofing Attacks, , , , , , , , , and , in: The 8th IAPR International Conference on Biometrics (ICB), pages 513-518, 2015 |
[DOI] [URL] |
A New Identity for the Least-square Solution of Overdetermined Set of Linear Equations, , and , Idiap-RR-35-2015 |
|
Analysis of Small Groups, , and , in: Social Signal Processing, pages 349-367, Cambridge University Press. Editors J. Burgoon, N. Magnenat-Thalmann, M. Pantic, and A. Vinciarelli, 2017 |
[DOI] |
Twitter Sentiment Analysis (Almost) from Scratch, , and , Idiap-RR-15-2016 |
|
N-gram-Based Low-Dimensional Representation for Document Classification, and , in: International Conference on Learning Representations, 2015 |
[URL] |
Phrase-based Image Captioning, , and , Idiap-RR-08-2015 |
|
Estimation of Divergence-Free 3D Cardiac Blood Flow in a Zebrafish Larva Using Multi-View Microscopy, and , in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, IEEE, Brooklyn, NY, USA, pages 385-388, 2015 |
[DOI] |
Atom Decomposition-based Intonation Modelling, , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4744-4748, IEEE, 2015 |
[DOI] |
Intensity-Based Point-Spread-Function-Aware Registration for Multi-View Applications in Optical Microscopy, , and , in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, pages 306-309, IEEE, 2015 |
[DOI] |
Phonological Vocoding Using Artificial Neural Networks, , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4844-4848, IEEE, 2015 |
[DOI] |
On the use of client identity information for face anti-spoofing, and , in: IEEE Transactions on Information Forensics and Security, Special Issue on Biometric Anti-spoofing, 10(4):787-796, 2015 |
|
On Application Of Non-Negative Matrix Factorization for Ad Hoc Microphone Array Calibration from Incomplete Noisy Distances, , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2694-2698, IEEE, 2015 |
[DOI] |
Novel GCC-PHAT Model in Diffuse Sound Field for Microphone Array Pairwise Distance Based Calibration, , , , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2669-2673, 2015 |
|
Robust Microphone Placement for Source Localization from Noisy Distance Measurements, , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2579-2583, IEEE, 2015 |
[DOI] |
Building context-dependent DNN acoustic models using Kullback-Leibler divergence-based state tying, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2015 |
|
What Your Face Vlogs About: Expressions of Emotion and Big-Five Traits Impressions in YouTube, , , and , in: IEEE Transactions on Affective Computing, 2014 |
|
The Workshop on Computational Personality Recognition 2014, , , , , and , in: Proceedings of the ACM International Conference on Multimedia, 2014 |
|
Mining Crowdsourced First Impressions in Online Social Video, and , in: IEEE Transactions on Multimedia, 16(7), 2014 |
|
Preliminary Work on Speaker Adaptation for DNN-Based Speech Synthesis, , and , Idiap-RR-02-2015 |
|
Automatic Blinking Detection towards Stress Discovery, , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 307-310, ACM New York, 2014 |
[DOI] |
Capturing Upper Body Motion in Conversation: an Appearance Quasi-Invariant Approach, , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 327-334, ACM New York, 2014 |
[DOI] |
Signal Processing in the Workplace, , in: IEEE Signal Processing Magazine, 32(1):121-125, 2015 |
|
Leveraging Colour Segmentation for Upper-Body Detection, and , in: Pattern Recognition, 47(6):2222-2230, 2014 |
|
Detecting and Labeling Speakers on Overlapping Speech using Vector Taylor Series, , and , in: INTERSPEECH, 2014 |
|
Multi-source Posteriors for Speech Activity Detection on Public Talks, and , in: INTERSPEECH, 2014 |
|
Diarizing Large Corpora using Multi-modal Speaker Linking, , , and , in: INTERSPEECH 2014, 2014 |
|
LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images, , , and , in: Proceedings of the International Conference on 3D vision, pages 517–524, 2014 |
Tracking Interacting Objects Optimally Using Integer Programming, , , and , in: Proceedings of the European Conference on Computer Vision, pages 17-32, 2014 |
|
Phoneme Background Model for Information Bottleneck based Speaker Diarization, , and , in: Interspeech, Singapore, 2014 |
|
Artificial neural network features for speaker diarization, , and , in: IEEE Spoken Language Technology workshop, South Lake Tahoe, USA, 2014 |
|
Overlapping speech detection using long-term conversational features for speaker diarization in meeting room conversations, and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22(12):1688-1700, 2014 |
|
Evaluation Databases, , , and , in: Handbook of Biometric Anti-Spoofing, pages 247-278, Springer-Verlag, 2014 |
[DOI] |
LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images, , , and , Idiap-RR-22-2014 |
|
Employment of Subspace Gaussian Mixture Models in Speaker Recognition, , , and , Idiap-RR-16-2015 |
|
Development of Bilingual ASR System for MediaParl Corpus, , , and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014 |
|
Development of Bilingual ASR System for MediaParl Corpus, , , and , Idiap-RR-21-2014 |
|
Sample Distillation for Object Detection and Image Classification, , and , in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014 |
|
Efficient Sample Mining for Object Detection, and , in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014 |
|
Keyword Extraction and Clustering for Document Recommendation in Conversations, and , in: IEEE/ACM Transactions on Audio Speech and Language Processing, 23(4):746-759, 2015 |
[DOI] |
Otomatik İşaret Dili Tanıma ve Türk İşaret Dili için Bilgisayar Uygulamaları, , , , and , in: Ellerle Konusmak: Turk Isaret Dili Arastirmalari / Research on Turkish Sign Language, pages 471-498, Koc University Press, 2016 |
Modeling Annotator Behaviors for Crowd Labeling, , , and , in: Neurocomputing, 160:141–156, 2015 |
[DOI] |
Discourse connectives: theoretical models and empirical validations in humans and computers, and , in: Papers dedicated to Jacques Moeschler, University of Geneva, 2014 |
[URL] |
ROCKIT: Roadmap for Conversational Interaction Technologies, , , , , , , , , , , , , , and , in: Proceedings of the 2014 Workshop on Roadmapping the Future of Multimodal Interaction Research including Business Opportunities and Challenges (RFMIR '14), pages 39-42, ACM, 2014 |
[DOI] |
Syllabic Pitch Tuning for Neutral-to-Emotional Voice Conversion, , and , Idiap-RR-31-2015 |
|
Transfer Learning through Greedy Subset Selection, , and , Idiap-RR-26-2015 |
|
Incremental Syllable-Context Phonetic Vocoding, , , , and , Idiap-RR-05-2015 |
|
Phonological vocoding using artificial neural networks, , and , Idiap-RR-04-2015 |
|
Acoustic and Lexical Resource Constrained ASR using Language-Independent Acoustic Model and Language-Dependent Probabilistic Lexical Model, and , in: Speech Communication, 68:23–40, 2015 |
[DOI] [URL] |
Impact of Eye Detection Error on Face Recognition Performance, , , , , and , in: IET Biometrics, 2015 |
[URL] |
A Skill Transfer Approach for Continuum Robots - Imitation of Octopus Reaching Motion with the STIFF-FLOP Robot, , , and , in: Proc. of the AAAI Symp. on Knowledge, Skill, and Behavior Transfer in Autonomous Robots, Arlington, VA, USA, pages 49-52, 2014 |
[URL] |
Skills Learning in Robots by Interaction with Users and Environment, , in: Proc. of the Intl Conf. on Ubiquitous Robots and Ambient Intelligence (URAI), Kuala Lumpur, Malaysia, pages 161-162, 2014 |
[URL] |
Combining SGMM Speaker Vectors and KL-HMM Approach for Speaker Diarization, , and , Idiap-RR-17-2015 |
|
KL-HMM Based Speaker Diarization System for Meetings, and , Idiap-RR-19-2015 |
|
Articulatory Feature based Continuous Speech Recognition using Probabilistic Lexical Modeling, and , Idiap-RR-19-2014 |
|
Who Will Get the Grant? A Multimodal Corpus for the Analysis of Conversational Behaviours in Group Interviews, , , , and , in: International Conference on Multimodal Interaction, Understanding and Modeling Multiparty, Multimodal Interactions Workshop, Istanbul, Turkey, ACM, 2014 |
[DOI] |
Joint Phoneme Segmentation Inference and Classification using CRFs, , and , in: Global Conference on Signal and Information Processing, Atlanta, GA, pages 587-591, IEEE, 2014 |
[DOI] |
Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, , and , Idiap-RR-18-2014 |
|
Learning by Imitation with the STIFF-FLOP Surgical Robot: A Biomimetic Approach Inspired by Octopus Movements, , , and , in: Robotics and Biomimetics, 1(13):1-15, 2014 |
[URL] |
Learning adaptive movements from demonstration and self-guided exploration, , and , in: Proc. IEEE Intl Conf. on Development and Learning and on Epigenetic Robotics (ICDL-EpiRob), Genoa, Italy, pages 160-165, 2014 |
|
Learning Force and Position Constraints in Human-robot Cooperative Transportation, , and , in: Proc. IEEE Intl Symposium on Robot and Human Interactive Communication (Ro-Man), Edinburgh, Scotland, UK, pages 619-624, 2014 |
|
Saliency-based Representations and Multi-component Classifiers for Visual Scene Recognition, , École Polytechnique Fédérale de Lausanne (EPFL), 2014 |
|
Emergent Power Hierarchies and Group Performance, , , and , in: International Journal of Psychology, 50(5):392–396, 2015 |
[DOI] [URL] |
The MAAYA Project: Multimedia Analysis and Access for Documentation and Decipherment of Maya Epigraphy, , , , , , and , in: Proc. Digital Humanities Conference, Lausanne, 2014 |
|
Is That a Jaguar? Segmenting Ancient Maya Glyphs via Crowdsourcing, , and , in: Proc. ACM Int. Workshop on Crowdsourcing for Multimedia, Orlando, pages 37-40, ACM New York, 2014 |
[DOI] |
Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array, , , , , , and , Idiap-RR-17-2014 |
|
Grapheme-based Automatic Speech Recognition using Probabilistic Lexical Modeling, , École Polytechnique Fédérale de Lausanne, 2014 |
[DOI] |
A Probabilistic Kernel Method for Human Mobility Prediction with Smartphones, , , and , in: Pervasive and Mobile Computing, 2014 |
|
The SP2 SCOPES Project on Speech Prosody, , , , , , , , and , in: DOGS2014 - Digital speech and image processing, 2014 |
|
Automatic Speech Recognition and Translation of a Swiss German Dialect: Walliserdeutsch, , and , in: Proceedings of Interspeech, 2014 |
|
Overview of the ImageCLEF 2014 Domain Adaptation Task, and , in: ImageCLEF 2014: Overview and analysis of the results, 2014 |
|
The Young and the City: Crowdsourcing Urban Awareness in a Developing Country, , and , in: Proceedings of the First International Conference on IoT in Urban Space, pages 74-79, 2014 |
[DOI] [URL] |
Effect of nonverbal behavioral patterns on the performance of small groups, and , in: ICMI Workshop on Understanding and Modeling Multiparty Multimodal Interactions, Istanbul, Turkey, 2014 |
|
Modified Group Delay Feature Based Total Variability Space Modelling for Speaker Recognition, , and , in: International Journal of Speech Technology, 18(1):17-23, 2014 |
[DOI] |
Feature Switching in the i-vector Framework for Speaker Verification, , , , and , in: Proc. of Interspeech 2014, pages 5, 2014 |
Cross-Database Evaluation With an Open Finger Vein Sensor, , , and , in: IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BioMS), Rome, Italy, pages 30-35, IEEE, 2014 |
[DOI] |
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues, , , , , , , , , , , , , and , in: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, European Language Resources Association (ELRA), 2014 |
[URL] |
How Do You Like Your Virtual Agent?: Human-Agent Interaction Experience through Nonverbal Features and Personality Traits, , and , in: Human Behavior Understanding, pages 1-15, Springer, 2014 |
|
Inferring Visual Attention and Addressee in Human Robot Interaction, , École Polytechnique Fédérale de Lausanne (EPFL), 2014 |
|
Biometrics Evaluation Under Spoofing Attacks, , and , in: IEEE Transactions on Information Forensics and Security, 9(12):2264-2276, 2014 |
[DOI] |
Automatic Maya Hieroglyph Retrieval Using Shape and Context Information, , , , and , in: ACM MM, pages 4, 2014 |
[URL] |
Evaluation Methodologies, , and , in: Handbook of Biometric Antispoofing, Springer, 2014 |
Exemplar-based Sparse Representation for Posterior Features, , and , Idiap-RR-11-2014 |
|
Weakly Supervised Object Segmentation with Convolutional Neural Networks, and , Idiap-RR-13-2014 |
|
Ad Hoc Microphone Array Calibration: Euclidean Distance Matrix Completion Algorithm and Theoretical Guarantees, , , , and , in: Signal Processing, 107:123–140, 2015 |
[DOI] |
Explaining the Stars: Weighted Multiple-Instance Learning for Aspect-Based Sentiment Analysis, and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 2014 |
|
Human Tracking and Pose Estimation in Open Spaces, , École Polytechnique Fédérale de Lausanne (EPFL), 2014 |
|
On the Vulnerability of Finger Vein Recognition to Spoofing, , and , in: IEEE International Conference of the Biometrics Special Interest Group (BIOSIG), Darmstadt, Germany, pages 1-10, IEEE, 2014 |
|
Phoneme Background Model for Information Bottleneck based Speaker Diarization, , and , in: Interspeech 2014, 2014 |
Information Bottleneck based Speaker Diarization of Meetings using Non-speech as Side Information, and , in: ICASSP, Florence, IT, pages 96-100, IEEE, 2014 |
[DOI] |
Inferring social relationships in a phone call from a single party's speech, , and , in: ICASSP, Florence, IT, pages 4843-4847, IEEE, 2014 |
[DOI] |
Detecting speaker roles and topic changes in multiparty conversations using latent topic models, and , in: Proceedings of Interspeech, 2014 |
|
Dynamic Programming Boosting for Discriminative Macro-Action Discovery, and , in: International Conference on Machine Learning, 2014 |
|
Jointly Informative Feature Selection, and , in: International Conference on Artificial Intelligence and Statistics, pages 567–575, 2014 |
|
Audio-Visual Gender Recognition in Uncontrolled Environment Using Variability Modeling Techniques, , and , in: International Joint Conference on Biometrics, Clearwater, Florida, USA, pages 1-8, IEEE, 2014 |
[DOI] [URL] |
Improving Head and Body Pose Estimation through Semi-supervised Manifold Alignment, , , , and , in: International Conference on Image Processing, 2014 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , in: Transactions on Image Processing, 2014 |
|
On Recognition of Non-Native Speech Using Probabilistic Lexical Model, and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), 2014 |
|
Posterior-based Sparse Representation for Automatic Speech Recognition, , , and , in: Proceedings of Interspeech, 2014 |
|
Raw Speech Signal-based Continuous Speech Recognition using Convolutional Neural Networks, , and , Idiap-RR-15-2014 |
|
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, , , and , Idiap-RR-10-2014 |
|
Recurrent Greedy Parsing with Neural Networks, and , in: Proceedings of ECML 2014, pages 130-144, Springer Berlin Heidelberg, 2014 |
[DOI] |
Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras, and , in: IEEE Computer Vision and Pattern Recognition Conference, Columbus, Ohio, USA, pages 1773-1780, IEEE, 2014 |
[DOI] |
Topic-Level Extractive Summarization of Lectures and Meetings Using a Snippet Similarity Graph, and , Idiap-RR-09-2014 |
|
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, , , and , in: Interspeech, 2014 |
|
Syllable-based Regional Swiss French Accent Identification using Prosodic Features, , , and , in: Nouveaux cahiers de linguistique francaise, 2014 |
|
Dialect Levelling in Finnish: A Universal Speech Attribute Approach, , , , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
Introducing I-Vectors for Joint Anti-spoofing and Speaker Verification, , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
|
Comparison of Two Methods for Unsupervised Person Identification in TV Shows, , , , and , in: 12th International Workshop on Content-Based Multimedia Indexing, 2014 |
|
Enhanced Diffuse Field Model for Ad Hoc Microphone Array Calibration, , and , in: Signal Processing, 101:242-255, 2014 |
|
Modeling Overlapping Speech using Vector Taylor Series, , and , in: Odyssey: The Speaker and Language Recognition Workshop, Joensuu, Finland, 2014 |
|
MLP-based Factor Analysis for Tandem Speech Recognition, and , in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
Enforcing Topic Diversity in a Document Recommender for Conversations, and , in: Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics), Dublin, Ireland, pages 746-759, IEEE, 2014 |
|
Recurrent Convolutional Neural Networks for Scene Labeling, and , in: 31st International Conference on Machine Learning (ICML), Beijing, China, pages 82-90, JMLR, 2014 |
[URL] |
Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, , , and , in: Proceedings of the 4th Joint Workshop on Hands-free speech communication and Microphone Arrays, Villers-les-Nancy, pages 1 - 5, IEEE, 2014 |
[DOI] |
Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, , , and , in: Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014 |
Face identification from overlaid texts using Local Face Recurrent Patterns and CRF models, , , , and , in: IEEE International Conference on Image Processing 2014, Paris, IEEE, 2014 |
|
Scene Recognition with Naive Bayes Non-linear Learning, and , in: Proceedings of the 22nd International Conference on Pattern Recognition, Stockholm, pages 3404 - 3409, IEEE, 2014 |
[DOI] |
Spoofing Face Recognition with 3D Masks, and , in: IEEE Transactions on Information Forensics and Security:1084-1097, 2014 |
[DOI] |
A task-parameterized probabilistic model with minimal intervention control, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3339 - 3344, IEEE, 2014 |
[DOI] |
Null space redundancy learning for a flexible surgical robot, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 2443 - 2448, IEEE, 2014 |
[DOI] |
Learning from demonstrations with partially observable task parameters, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3309 - 3314, IEEE, 2014 |
[DOI] |
Mode of Teaching Based Segmentation and Annotation of Video Lectures, , and , in: International Workshop on Content-Based Multimedia Indexing, 2014 |
|
Improving Speaker Diarization using social role information, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2014 |
|
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, , , and , in: Speech Prosody, 2014 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , Idiap-RR-05-2014 |
|
Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, , and , Idiap-RR-18-2015 |
|
Hierarchical speaker clustering methods for the NIST i-vector Challenge, , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
Multi-Source Adaptive Learning for Fast Control of Prosthetics Hand, , and , in: Proceedings of the International Conference on Pattern Recognition, Stockholm, pages 2769 - 2774, 2014 |
[DOI] |
Learning to Learn, from Transfer Learning to Domain Adaptation: A Unifying Perspective, and , in: Proceedings of the Computer Vision and Pattern Recognition, Columbus, OH, pages 1442-1449, IEEE, 2014 |
[DOI] |
Sparse Gammatone Signal Model Predicts Perceived Noise Intrusiveness, and , Idiap-RR-07-2014 |
|
On Modeling Context-Dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, , and , in: International Conference on Acoustics, Speech, and Signal Processing, Florence, IT, pages 7659 - 7663, IEEE, 2014 |
[DOI] |
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , in: Speech Prosody, 2014 |
|
Swiss French Regional Accent Identification, , , , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
Scalable Probabilistic Models for Face and Speaker Recognition, , École Polytechnique Fédérale de Lausanne (EPFL), 2014 |
[URL] |
Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation, , , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, pages 7639-7643, IEEE, 2014 |
[DOI] |
Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322 - 2326, IEEE, 2014 |
[DOI] |
A Conditional Random field approach for audio-visual people diarization, , , , and , in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 116 - 120, IEEE, 2014 |
[DOI] |
EYEDIAP: A Database for the Development and Evaluation of Gaze Estimation Algorithms from RGB and RGB-D Cameras, , and , in: Proceedings of the ACM Symposium on Eye Tracking Research and Applications, Safety Harbor, Florida, United States of America, ACM, 2014 |
[DOI] |
Cross-linguistic annotation of narrativity for English/French verb tense disambiguation, and , in: 9th Edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling, , and , in: The Ninth Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
Multimodal Reranking of Content-based Recommendations for Hyperlinking Video Snippets, , , and , in: ACM International Conference on Multimedia Retrieval, 2014 |
|
Acoustic and Lexical Resource Constrained ASR using Language-Independent Acoustic Model and Language-Dependent Probabilistic Lexical Model, and , Idiap-RR-02-2014 |
|
Word Embeddings through Hellinger PCA, and , in: 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014 |
|
Hi YouTube! Personality Impressions and Verbal Content in Social Video, , , and , in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013 |
|
Theoretical Analysis of Euclidean Distance Matrix Completion for Ad hoc Microphone Array Calibration, , Idiap-RR-20-2014 |
|
EYEDIAP Database: Data Description and Gaze Tracking Evaluation Benchmarks, , and , Idiap-RR-08-2014 |
|
The Robot Vision Track at ImageCLEF 2010, , , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
[URL] |
Combining Content with User Preferences for Non-Fiction Multimedia Recommendation: A Study on TED Lectures, and , in: Multimedia Tools and Applications, Special Issue on Content Based Multimedia Indexing, 74(4):1175-1197, 2015 |
[DOI] |
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , Idiap-RR-03-2014 |
|
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, , , and , Idiap-RR-04-2014 |
|
What to Show? Automatic Stream Selection Among Multiple Sensors, , and , in: International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2014 |
|
Object Classification and Detection in High Dimensional Feature Space, , Programme doctoral en Informatique, Communications et Information, 2013 |
|
Clustering flood events from water quality time-series using Latent Dirichlet Allocation model, , , , , , , , , and , in: Water Resources Research, 2013 |
[DOI] |
Speech Processing, , in: Interactive Multimodal Information Management, pages 221-245, EPFL Press, 2013 |
Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition, , and , in: Proceedings of IEEE TENCON, 2013 |
|
A Savitzky-Golay Filtering Perspective of Dynamic Feature Computation, , and , in: IEEE Signal Processing Letters, 20(3):281-284, 2013 |
[DOI] |
A Probabilistic Framework for Multiple Speaker Localization, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013 |
|
Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, , and , in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013 |
|
Proceedings of the ACL Workshop on Discourse in Machine Translation (DiscoMT 2013), , , and , Association for Computational Linguistics, 2013 |
[URL] |
On Modeling Context-dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, , and , Idiap-RR-43-2013 |
|
3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, , in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013 |
[DOI] |
Context Aware Addressee Estimation for Human Robot Interaction, , , and , in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013 |
Leveraging the robot dialog state for visual focus of attention recognition, , , , and , in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013 |
Multimodal Analysis of Body Communication Cues in Employment Interviews, , , and , in: 15th ACM International Conference on Multimodal Interaction Proceedings, 2013 |
|
Evaluating Intra- and Crosslingual Adaptation for Non-native Speech Recognition in a Bilingual Environment, and , in: Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications, IEEE, Budapest, Hungary, pages 357-361, 2013 |
|
Adaptive Sampling for Large Scale Boosting, and , in: Journal of Machine Learning Research, 15:1431-1453, 2014 |
|
Is Deep Learning Really Necessary for Word Embeddings?, , and , Idiap-RR-44-2013 |
|
Introduction to the Special Issue on Learning Semantics, , , , , and , in: Machine Learning, 2013 |
[DOI] |
Recurrent Convolutional Neural Networks for Scene Labeling, and , Idiap-RR-41-2013 |
|
End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks, , and , Idiap-RR-40-2013 |
|
Re-Identification for Improved People Tracking, , and , in: Person Re-Identification, pages 311-336, Springer, 2014 |
Using the Europarl corpus for cross-linguistic research, , and , in: Belgian Journal of Linguistics, 27:23-42, 2013 |
[URL] |
Stable Myoelectric Control of a Hand Prosthesis using Non-Linear Incremental Learning, , , , , , , and , in: Frontiers in Neurorobotics, 8, 2014 |
[DOI] |
The Movement Error Rate for Evaluation of Machine Learning Methods for sEMG-based Hand Movement Classification, , , , and , in: Transactions on Neural Systems and Rehabilitation Engineering:735 - 744, 2014 |
[DOI] |
Characterization of a Benchmark Database for Myoelectric Movement Classification, , , , , , , , and , in: Transactions on Neural Systems and Rehabilitation Engineering, 23:73-83, 2014 |
[DOI] |
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013 |
|
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , Idiap-RR-39-2013 |
|
Accent Adaptation Using Subspace Gaussian Mixture Models, , , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013 |
[DOI] |
Accent Adaptation Using Subspace Gaussian Mixture Models, , , and , Idiap-RR-38-2013 |
|
Feature and Score Level Combination of Subspace Gaussians in LVCSR Task, , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013 |
[DOI] |
Feature and Score Level Combination of Subspace Gaussians in LVCSR Task, , and , Idiap-RR-37-2013 |
|
Manifold Sparse Beamforming, , and , in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013 |
[DOI] |
Convexity in source separation: Models, geometry, and algorithms, , , , and , in: IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications, 2013 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , Idiap-RR-31-2013 |
|
An Open-source State-of-the-art Toolbox for Broadcast News Diarization, , , , , and , Idiap-RR-33-2013 |
|
Gesture control interface for immersive panoramic displays, , , , , , and , in: Multimedia Tools and Applications, 1380-7501:1-27, 2013 |
[DOI] |
Exploiting Scene Cues for Dropped Object Detection, , and , in: 9th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2014 |
|
Creative Applications of Human Behavior Understanding, , , and , in: Human Behavior Understanding, pages 1-14, 2013 |
Inferring Mood in Ubiquitous Conversational Video, , , , , and , in: 12th International Conference on Mobile and Ubiquitous Multimedia, Luleå, Sweden, ACM Press, 2013 |
|
Model-based Sparse Component Analysis for Reverberant Speech Localization, , , and , in: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1439 - 1443, IEEE, 2014 |
[DOI] |
Combining Vocal Tract Length Normalization with Hierarchical Linear Transformations, , , and , in: IEEE Journal of Selected Topics in Signal Processing - Special Issue on Statistical Parametric Speech Synthesis, 8(2):262 - 272, 2014 |
[DOI] |
Broadcasting oneself: Visual Discovery of Vlogging Styles, , and , in: IEEE Transactions on Multimedia, 16(1):201-215, 2014 |
[DOI] |
One of a Kind: Inferring Personality Impressions in Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Multiclass Latent Locally Linear Support Vector Machines, , and , in: JMLR W&CP, Volume 29: Asian Conference on Machine Learning, Canberra, Australia, pages 229-244, 2013 |
[URL] |
A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, , , and , in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013 |
[DOI] |
Unsupervised methods for activity analysis and detection of abnormal events, and , in: Intelligent Video Surveillance Systems (ISTE), Wiley, 2013 |
[DOI] |
Temporal Analysis of Motif Mixtures using Dirichlet Processes, , and , in: IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), 36(1), 2014 |
|
Observation of Vehicle Axles Through Pass-by Noise: A Strategy of Microphone Array Design, , , , and , in: IEEE Trans. on Intelligent Transportation Systems, 2013 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , Idiap-RR-06-2014 |
|
Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, , , , and , in: Image and Vision Computing:1147-1160, 2014 |
[DOI] [URL] |
On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, , , and , in: Biometric Technologies in Forensic Science, Nijmegen, The Netherlands, 2013 |
|
Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project, , , , , , , , and , in: Workshop on Speech, Language and Audio in Multimedia, 2013 |
|
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, , and , in: Proceedings of Interspeech, 2013 |
|
Idiap at MediaEval 2013: Search and Hyperlinking Task, , , and , in: MediaEval 2013 Workshop, Barcelona, Spain, CEUR-WS.org, 2013 |
|
Probabilistic Lexical Modeling and Unsupervised Training for Zero-Resourced ASR, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013 |
|
Reservoir Boosting: Between Online and Offline Ensemble Learning, and , in: Proceedings of the international conference on Neural Information Processing Systems, 2013 |
|
Multi-Commodity Network Flow for Tracking Multiple People, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013 |
Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition, , , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013 |
|
Biometrics Evaluation under Spoofing Attacks, , and , Idiap-RR-12-2014 |
|
A Survey of Personality Computing, and , in: IEEE Transactions on Affective Computing, 5(3):273-291, 2014 |
|
Interactive Multimodal Information Management, and , EPFL Press, 2013 |
Interactive Multimodal Information Management: Shaping the Vision, and , in: Interactive Multimodal Information Management, pages 1-17, EPFL Press, 2013 |
|
Real-Time Audio-Visual Analysis for Multiperson Videoconferencing, , , , , , , , and , in: Advances in Multimedia, 2013:21, 2013 |
[DOI] [URL] |
Automatic Staging of Audio with Emotions, and , in: International Conference on Affective Computing and Intelligent Interaction, 2013 |
Understanding Factors in Emotion Perception, and , Idiap-RR-28-2013 |
|
Understanding Factors in Emotion Perception, and , in: ISCA Speech Synthesis Workshop, 2013 |
|
Inferring social activities with mobile sensor networks, , , , and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
From Big Smartphone Data to Worldwide Research: The Mobile Data Challenge, , , , , , , and , in: Pervasive and Mobile Computing, 9(6):752–771, 2013 |
|
Multi-factor Segmentation for Topic Visualization and Recommendation: the MUST-VIS System, , , , , , , and , in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 365-368, ACM, 2013 |
[DOI] [URL] |
Revisiting the Generality of the Rank-based Human Mobility Model, and , in: Proceedings of the 2013 ACM Conference on Pervasive and Ubiquitous Computing Adjunct Publication, Zurich, Switzerland, pages 1209-1218, ACM, 2013 |
[DOI] [URL] |
Speaking Swiss: Languages and Venues in Foursquare, and , in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 501-504, ACM, 2013 |
[DOI] [URL] |
On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, , , and , Idiap-RR-35-2013 |
|
Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, and , in: International Conference of the Biometrics Special Interest Group, Darmstadt, Germany, 2013 |
|
Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, and , in: Biometrics: Theory, Applications and Systems, Washington DC, USA, 2013 |
|
Investigating time-sensitive topic model approaches for action recognition, , and , Idiap-RR-26-2013 |
|
Similarity Learning Over Large Collaborative Networks, , and , EPFL, 2013 |
|
Euclidean Distance Matrix Completion for Ad-hoc Microphone Array Calibration, , , and , in: Proceedings IEEE International Conference On Digital Signal Processing, 2013 |
|
The vernissage corpus: a conversational human-robot-interaction dataset, , , , , , , , , and , in: Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction, 2013 |
|
Word Embeddings through Hellinger PCA, and , Idiap-RR-29-2013 |
|
Given that, Should I Respond? Contextual Addressee Estimation in Multi-Party Human-Robot Interactions, and , in: Proceedings of Human Robot Interaction (HRI) Conference, 2013 |
|
Discovering Temporal Patterns in Water Quality Time Series, Focusing on Floods with the LDA method, , , , , , , and , in: European Geosciences Union, 2013 |
|
Time-Sensitive Topic Models for Action Recognition in Videos, , and , in: IEEE International Conference on Image Processing, 2013 |
|
Learning to Rank on Network Data, , and , in: Mining and Learning with Graphs, 2013 |
|
Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, and , in: International Joint Conference on Artificial Intelligence, 2013 |
|
Automatic Speech Indexing System of Bilingual Video Parliament Interventions, , , , , and , Idiap-RR-25-2013 |
|
Deformable Part Models with Individual Part Scaling, and , in: British Machine Vision Conference, 2013 |
|
Are ACT's scores increasing with better translation quality?, , in: Are ACT's scores increasing with better translation quality?, pages 6, 2013 |
|
Accelerated Training of Linear Object Detectors, and , in: CVPR 2013 Workshop on Structured Prediction, 2013 |
[URL] |
Medical image annotation, , in: Interactive Multimodal Information Management, EPFL Press, 2013 |
|
Learning to learn new models of human activities in indoor settings, , , and , in: Interactive Multimodal Information Management, EPFL Press, 2013 |
|
Overview of the ImageCLEF 2013 Robot Vision Task, , , and , in: Working Notes, CLEF 2013, 2013 |
|
Investigating the Impact of Language Style and Vocal Expression on Social Roles of Participants in Professional Meetings, and , in: Affective Computing and Intelligent Interaction, Geneva, pages 324-329, IEEE, 2013 |
[DOI] |
Noise Intrusiveness Factors in Speech Telecommunications, , , and , in: Proceedings of the AIA-DAGA 2013 International Conference on Acoustics, Merano, Italy, pages 436-439, 2013 |
|
Multilingual speech recognition: A posterior based approach, , École Polytechnique Fédérale de Lausanne (EPFL), 2013 |
|
Mining Conversational Social Video, , EPFL, 2013 |
|
Detecting Narrativity to Improve English to French Translation of Simple Past Verbs, , and , in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 33-42, 2013 |
|
Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, , and , Idiap-RR-24-2013 |
|
Where and What: Using Smartphones to Predict Next Locations and Applications in Daily Life, and , in: Pervasive and Mobile Computing, 2013 |
|
Automatic Social Role Recognition In Professional Meetings Using Conditional Random Fields, and , in: Proceedings of Interspeech, 2013 |
|
Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, and , Idiap-RR-23-2013 |
|
Recurrent Convolutional Neural Networks for Scene Parsing, and , Idiap-RR-22-2013 |
|
Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, and , in: Proceedings of Interspeech, 2013 |
|
Machine Translation with Many Manually Labeled Discourse Connectives, and , in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 43-50, 2013 |
|
Implicitation of Discourse Connectives in (Machine) Translation, and , in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 19-26, 2013 |
|
Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, , and , in: Proc. of Interspeech 2013, Lyon, France, 2013 |
|
Person Independent 3D Gaze Estimation From Remote RGB-D Cameras, and , in: International Conference on Image Processing, Melbourne, Australia, IEEE, 2013 |
[DOI] |
Automatic Personality Perception: Inferring Personality Traits from Nonverbal Vocal Behavior, , Electrical Engineering Department, EPFL, 2013 |
|
Who is Persuasive? The Role of Perceived Personality and Communication Modality in Social Multimedia, , , , and , in: International Conference on Multimodal Interaction, 2013 |
A Survey on Perceived Speaker Traits: Personality, Likability, Pathology and the First Challenge, , , , , , , , , , and , in: Computer Speech and Language, 29(1):100-131, 2015 |
[DOI] |
Diverse Keyword Extraction from Conversations, and , in: Proceedings of the ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Short Papers, Sofia, Bulgaria, pages 651-657, ACL, 2013 |
|
Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, and , in: Proc. of Interspeech 2013, 2013 |
An Open-source State-of-the-art Toolbox for Broadcast News Diarization, , , , , and , in: INTERSPEECH, 2013 |
|
Unsupervised Methods for Activity Analysis and Detection of Abnormal Events, and , Idiap-RR-21-2013 |
|
Analyse non supervisée d'activités en vidéo surveillance pour l'analyse de scène et la détection d'événements anormaux, and , Idiap-RR-20-2013 |
[URL] |
Extracting Motifs from Time Series Generated by Concurrent Activities, , and , in: NIPS workshop on Learning and Planning from Batch Time Series Data, 2010 |
|
I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: INTERSPEECH, Lyon, France, 2013 |
|
Stability and Hypothesis Transfer Learning, and , in: International Conference on Machine Learning, 2013 |
|
Annotating the meaning of discourse connectives by looking at their translation: The translation-spotting technique, , and , in: Dialogue & Discourse, 4(2):65-86, 2013 |
[DOI] |
Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, , , , and , Idiap-RR-30-2013 |
|
The 2013 Face Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: The 6th IAPR International Conference on Biometrics, 2013 |
|
The 2013 Speaker Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: The 6th IAPR International Conference on Biometrics, 2013 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , in: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, Dallas, Texas, USA, pages 97-104, ACM, 2013 |
|
Exploiting Accelerometers to Improve Movement Classification for Prosthetics, and , in: International Conference on Rehabilitation Robotics, 2013 |
|
Anti-spoofing in action: joint operation with a verification system, , and , in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Workshop on Biometrics, Portland, Oregon, 2013 |
|
The 2nd competition on counter measures to 2D face spoofing attacks, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: International Conference of Biometrics 2013, Madrid, Spain, 2013 |
|
Sentiment Analysis of User Comments for One-Class Collaborative Filtering over TED Talks, and , in: 36th ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, ACM, 2013 |
|
Probabilistic Lexical Modeling and Grapheme-based Automatic Speech Recognition, and , Idiap-RR-15-2013 |
|
From N to N+1: Multiclass Transfer Incremental Learning, , and , in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013 |
|
Applying multi- and cross-lingual stochastic phone space transformations to non-native speech recognition, , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 2013 |
[DOI] |
Combining Content with User Preferences for TED Lecture Recommendation, and , in: Proceedings of the 11th International Workshop on Content Based Multimedia Indexing, Veszprém, Hungary, IEEE, 2013 |
|
The 2nd Competition on Counter Measures to 2D Face Spoofing Attacks, , and , Idiap-RR-18-2013 |
|
Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, and , Idiap-RR-14-2013 |
|
The 2013 Speaker Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , Idiap-RR-32-2013 |
|
The 2013 Face Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , and , Idiap-RR-36-2013 |
|
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, , and , Idiap-RR-13-2013 |
|
Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, and , Idiap-RR-27-2013 |
|
Anti-spoofing in action: joint operation with a verification system, , and , Idiap-RR-19-2013 |
|
Evaluating Shape Descriptors for Detection of Maya Hieroglyphs, , and , in: Proc. Mexican Conf. on Pattern Recognition, Queretaro, 2013 |
|
Statistical models for HMM/ANN hybrids, and , Idiap-RR-11-2013 |
|
From Foursquare to my Square: Learning Check-in Behavior from Multiple Sources, , and , in: The 7th International AAAI Conference on Weblogs and Social Media, 2013 |
|
Bias Adaptation for Vocal Tract Length Normalization, , , and , Idiap-RR-12-2013 |
|
Connectionist Speech Recognition - A Hybrid Approach, and , Kluwer Academic Publishers, 1994 |
|
Adaptation Experiments on French MediaParl ASR, , Idiap-RR-10-2013 |
|
Grapheme and Multilingual Posterior Features for Under-Resourced Speech Recognition: A Study on Scottish Gaelic, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
Improved Overlap Speech Diarization of Meeting Recordings using Long-term Conversational Features, and , in: ICASSP, 2013 |
|
Enhancing State Mapping-Based Cross-Lingual Speaker Adaptation using Phonological Knowledge in a Data-Driven Manner, and , Idiap-RR-08-2013 |
|
Speaker adaptive Kullback-Leibler divergence based hidden Markov models, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
A Scalable Formulation of Probabilistic Linear Discriminant Analysis: Applied to Face Recognition, , , and , Idiap-RR-07-2013 |
[URL] |
On the (Un)importance of the Contextual Factors In HMM-Based Speech Synthesis, , and , in: Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, pages 8140 - 8143, 2013 |
|
Convolutional Pitch Target Approximation Model for Speech Synthesis, and , Idiap-RR-05-2013 |
|
Fast Object Detection with Entropy-Driven Evaluation, , , and , in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013 |
KL-HMM and Probabilistic Lexical Modeling, and , Idiap-RR-04-2013 |
|
Who Wants To Be A Millionaire? (II), , and , Idiap-Com-02-2013 |
|
Learning Categories from Few Examples with Multi Model Knowledge Transfer, , and , Idiap-RR-16-2013 |
|
Learning to Learn by Exploiting Prior Knowledge, , EDIC, 2013 |
|
The Places of Our Lives: Visiting Patterns and Automatic Labeling from Longitudinal Smartphone Data, and , in: IEEE Transactions on Mobile Computing, 2013 |
|
Using out-of-language data to improve an under-resourced speech recognizer, , , and , Idiap-RR-09-2013 |
|
Using out-of-language data to improve an under-resourced speech recognizer, , , and , in: Speech Communication, 2013 |
[DOI] [URL] |
Statistical Shape Descriptors for Ancient Maya Hieroglyphs Analysis, , École Polytechnique Fédérale de Lausanne, 2012 |
|
Distinguishing the Popularity Between Topics: A System for Up-to-date Opinion Retrieval and Mining in the Web, , and , in: Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics, LNCS, Samos, Greece, ACM, 2013 |
[URL] |
Assessing the Accuracy of Discourse Connective Translations: Validation of an Automatic Metric, and , in: 14th International Conference on Intelligent Text Processing and Computational Linguistics, University of the Aegean, Samos, Greece, pages 236-247, Springer, 2013 |
[DOI] |
Regularized Bundle Methods for Convex and Non-Convex Risks, and , in: Journal of Machine Learning Research, 13:3539-3583, 2012 |
|
Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, and , Idiap-RR-42-2013 |
|
Body communicative cue extraction for conversational analysis, , , , and , in: Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, 2013 |
|
Unified Framework Of Feature Based Adaptation For Statistical Speech Synthesis And Recognition, , École Polytechnique Fédérale de Lausanne (EPFL), 2012 |
|
Data-Driven Enhancement of State Mapping-Based Cross-Lingual Speaker Adaptation, , Idiap-RR-38-2012 |
|
FaceTube: predicting personality from facial expressions of emotion in online conversational video, , and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2012 |
|
Speaker Diarization and Linking of Large Corpora, and , in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012 |
|
Robot-to-group Interaction in a Vernissage: Architecture & Dataset for Multi-party Dialog, , , , , , , and , in: Proceedings of 5th International Conference on Cognitive Systems, 2012 |
|
Implementing Neural Networks Efficiently, , and , in: Neural Networks: Tricks of the Trade, Springer, 2012 |
|
Deep Learning via Semi-Supervised Embedding, , , and , in: Neural Networks: Tricks of the Trade, Springer, 2012 |
|
A Method, Apparatus and Computer Program for Determining the Location of a Plurality of Speech Sources, , and , in: 2012US-13/654055, 2012 |
[URL] |
Structured Sparse Acoustic Modeling for Speech Separation, , , and , in: Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2013 |
|
Model-based Sparse Component Analysis for Multiparty Distant Speech Recognition, , École Polytechnique Fédérale de Lausanne, 2013 |
|
A Multipath Sparse Beamforming Method, , , and , in: Signal Processing with Adaptive Sparse Structured Representations SPARS, 2013 |
|
Unsupervised Activity Analysis and Monitoring algorithms for Effective Surveillance Systems, , , , , , , and , in: European Conference on Computer Vision, 2012 |
|
A Track Creation and Deletion Framework for Long-Term Online Multi-Face Tracking, and , in: IEEE Transactions on Image Processing, 2013 |
|
Sampling techniques for audio-visual tracking and head pose estimation, and , in: Multimodal Signal Processing: Human Interactions in Meetings, pages 84-102, Cambridge University Press, 2012 |
|
Parameter Estimation and Contextual Adaptation for a Multi-Object Tracking CRF Model, and , in: IEEE Workshop on Performance Evaluation of Tracking and Surveillance, 2013 |
|
Recognizing the Visual Focus of Attention for Human Robot Interaction, , and , in: IEEE International Conference on Intelligent Robots and Systems (IROS) - Human Behavior Understanding Workshop (IROS-HBU), 2012 |
|
Investigating the Midline Effect for Visual Focus of Attention Recognition, and , in: Int. Conf. on Multimodal Interaction (ICMI), Santa Monica, 2012 |
|
The I4U Submission to the 2012 NIST Speaker Recognition Evaluation, , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: NIST Speaker Recognition Conference, 2012 |
Together Anywhere, Together Anytime, Technologies for Intimate Interactions, , , and , Centrum Wiskunde & Informatica, 2012 |
Improving Acoustic Based Keyword Spotting Using LVCSR Lattices, , and , in: Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, Japan, pages 4413-4416, 2012 |
Improving Acoustic Based Keyword Spotting Using LVCSR Lattices, , and , Idiap-RR-36-2012 |
|
ICB 2013 - Competition on speaker recognition in mobile environment using the MOBIO database: The Evaluation Plan, , and , Idiap-Com-04-2012 |
|
I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , Idiap-RR-34-2013 |
|
The Idiap Speaker Recognition Evaluation System at NIST SRE 2012, , and , in: NIST Speaker Recognition Conference, NIST, Orlando, USA, 2012 |
|
Automatic Social Role Recognition In Professional Meetings, and , Idiap-RR-35-2012 |
|
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , in: Proceedings of the 21st International Conference on Pattern Recognition, 2012 |
|
Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, , and , in: Proceedings of the 11th International Conference of the Biometrics Special Interest Group, Darmstadt, Germany, pages 397-408, GI-Edition, 2012 |
|
Grapheme and Multilingual Posterior Features For Under-Resource Speech Recognition: A Study on Scottish Gaelic, , and , Idiap-RR-34-2012 |
|
Modeling dominance effects on nonverbal behaviors using Granger causality, , , , , and , in: Proceedings of International Conference on Multimodal Interaction, ICMI 2012, Santa Monica, CA, 2012 |
|
The TA2 Database – A Multi-Modal Database From Home Entertainment, , and , in: International Journal of Computer and Electrical Engineering, 4(5):670-673, 2012 |
[URL] |
Real-time model learning using Incremental Sparse Spectrum Gaussian Process Regression, and , in: Neural Networks, 2012 |
A Simple Continuous Pitch Estimation Algorithm, , and , in: IEEE Signal Processing Letters, 20(1):102-105, 2013 |
[URL] |
treeKL: A distance between high dimension empirical distributions, and , in: Pattern Recognition Letters, 34(2):140-145, 2013 |
|
On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, , and , in: Proceedings of the 11th International Conference of the Biometrics Special Interest Group, 2012 |
|
On the (Un)importance of the Contextual Factors in HMM-Based Speech Synthesis and Coding, , and , Idiap-RR-06-2013 |
|
Data-Driven Enhancement of State Mapping-Based Cross-Lingual Speaker Adaptation, , École Polytechnique Fédérale de Lausanne, 2012 |
|
Improving Object Classification using Pose Information, , , and , Idiap-RR-30-2012 |
|
Leveraging speaker diarization for meeting recognition from distant microphones, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390-4393, 2010 |
Checking In or Checked In: Comparing Large-Scale Manual and Automatic Location Disclosure Patterns, , and , in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012 |
Socio-Technical Network Analysis from Wearable Interactions, , and , in: International Symposium on Wearable Computers, 2012 |
|
A Sequential Topic Model for Mining Recurrent Activities from Long Term Video Logs, , and , in: International Journal of Computer Vision, 103(1):100-126, 2013 |
|
Macro-Action Discovery Based on Change Point Detection and Boosting, and , in: International Conference on Machine Learning and Applications, 2012 |
|
Joint Similarity Learning for Predicting Links in Networks with Multiple-type Links, and , Idiap-RR-29-2015 |
|
An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, , and , in: Computer Vision - ECCV 2012. Workshops and Demonstrations, Idiap Research Institute, Heidelberg, pages 547-556, Springer Berlin, 2012 |
[DOI] [URL] |
Overview of the ImageCLEF 2012 Robot Vision Task, , and , in: Working Notes of the ImageCLEF 2012 Laboratory, 2012 |
|
Baseline Multimodal Place Classifier for the 2012 Robot Vision Task, , and , in: Working Notes of the ImageCLEF 2012 Laboratory, 2012 |
|
MediaParl: Bilingual mixed language accented speech database, , , , , and , Idiap-RR-03-2013 |
|
MediaParl: Bilingual mixed language accented speech database, , , , , and , in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263-268, 2012 |
|
Beyond Dataset Bias: Multi-task Unaligned Shared Knowledge Transfer, , , and , in: Asian Conference on Computer Vision, 2012 |
|
Face Recognition with Disparity Corrected Gabor Phase Differences, , and , in: Artificial Neural Networks and Machine Learning, Heidelberg, pages 411-418, Springer Berlin, 2012 |
[DOI] |
An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, , and , Idiap-RR-29-2012 |
|
Exact Acceleration of Linear Object Detectors, and , in: Proceedings of the European Conference on Computer Vision, 2012 |
|
Empirical validations of multilingual annotation schemes for discourse relations, , , and , in: 8th Joint ACL-ISO Workshop on Interoperable Semantic Annotation, 2012 |
|
Predicting the Conflict Level in Television Political Debates: an Approach Based on Crowdsourcing, Nonverbal Communication and Gaussian Processes, , , and , in: ACM Multimedia, 2012 |
Collecting data for socially intelligent surveillance and monitoring approaches: the case of conflict in competitive conversations, , , and , in: International Symposium on Communications, Control, and Signal Processing, 2012 |
|
Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 2012 |
|
Crowdsourcing Micro-Level Multimedia Annotations: The Challenges of Evaluation and Interface, , , and , in: Proceedings of International ACM Workshop on Crowdsourcing for Multimedia, 2012 |
Annotation and Recognition of Personality Traits in Spoken Conversations from the AMI Meetings Corpus, , and , in: Proceedings of Interspeech 2012, 2012 |
|
DiarTk: An Open Source Toolkit for Research in Multistream Speaker Diarization and its Application to Meetings Recordings, and , in: Proceedings of Interspeech, 2012 |
|
Detecting and Labeling Folk Literature in Spoken Cultural Heritage Archives using Structural and Prosodic Features, and , in: IEEE Content Based Multimedia Indexing, 2012 |
|
Speaker Diarization of Meetings based on large TDOA feature vectors, and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2012 |
|
Translating English Discourse Connectives into Arabic: a Corpus-based Analysis and an Evaluation Metric, and , in: Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), 2012 |
|
An Agent-Based Focused Crawling Framework for Topic- and Genre-Related Web Document Discovery, , and , in: 24th IEEE International Conference on Tools with Artificial Intelligence, Athens, Greece, IEEE, 2012 |
[URL] |
Linking Speaking and Looking Behavior Patterns with Group Composition, Perception, and Performance, , , , and , in: Proceedings of the International Conference on Multimodal Interaction (ICMI), Santa Monica, USA, 2012 |
|
Sequential Topic Models for Mining Recurrent Activities and their Relationships: Application to long term video recordings, , École Polytechnique Fédérale de Lausanne, 2012 |
|
Iterative Relevance Feedback with Adaptive Exploration/Exploitation Trade-off, and , in: Proceedings of the 21st ACM Conference on Information and Knowledge Management, pages 1323-1331, 2012 |
|
Machine Translation of Labeled Discourse Connectives, , , and , in: Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), pages 10, 2012 |
|
Using Sparse Classification Outputs as Feature Observations for Noise Robust ASR, , , , , and , in: Proceedings of Interspeech, 2012 |
|
Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR, , , , , and , in: Proceedings of Interspeech, 2012 |
|
Baseline System for Automatic Speech Recognition with French GlobalPhone Database, and , Idiap-RR-26-2012 |
|
Reading Companion: The Technical and Social Design of an Automated Reading Tutor, , , , , and , in: Workshop on Child, Computer and Interaction, Portland, Oregon, U.S.A., 2012 |
|
The Vernissage Corpus: A Multimodal Human-Robot-Interaction Dataset, , , , , , , , , and , Idiap-RR-33-2012 |
|
From Research to Reality: Evaluation of a Single-Computer Real-Time LVCSR System for Speech-Based Retrieval, , , and , Idiap-RR-12-2017 |
|
Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, and , in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012 |
|
Leveraging over prior knowledge for online learning of visual categories, , , and , in: Proceedings of the British Machine Vision Conference, 2012 |
|
Contextual Conditional Models for Smartphone-based Human Mobility Prediction, and , in: Proceedings of the 14th ACM International Conference on Ubiquitous Computing, 2012 |
|
Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, and , in: RecSys, Recommendation Utility Evaluation (RUE 2012), Dublin, Ireland, pages 15-20, 2012 |
|
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , Idiap-RR-23-2012 |
|
Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, , , and , Idiap-RR-20-2012 |
|
Building the NinaPro Database: a Resource for the Biorobotics Community, , , , , , , , and , in: Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2012 |
|
Who Wants To Be A Millionaire?, , , and , Idiap-Com-03-2012 |
|
Integrating Posterior Features and Self-Organizing Maps for Isolated Word Recognition without Dynamic Programming, and , Idiap-RR-17-2012 |
|
The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012 |
|
From Speech to Personality: Mapping Voice Quality and Intonation into Personality Differences, , , and , in: Proceedings of ACM Multimedia 2012, 2012 |
|
Microphone Array Beampattern Characterization for Hands-free Speech Applications, , and , in: IEEE 7th Sensor Array and Multichannel Signal Processing Workshop (SAM), Hoboken, NJ, USA, pages 473-476, 2012 |
|
Sparsity in Topic Models, , and , in: Practical Applications of Sparse Modeling: Biology, Signal Processing and Beyond, MIT Press, 2012 |
|
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , Idiap-RR-18-2012 |
|
Template-based ASR using Posterior features and synthetic references: comparing different TTS systems, , and , in: SAPA-SCALE Conference, International Speech Communication Association, 2012 |
|
Gaze Estimation From Multimodal Kinect Data, and , in: IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, Providence, RI, USA, 2012 |
[DOI] |
On Speaker-Independent Personality Perception and Prediction from Speech, , , , , and , in: Proceedings of INTERSPEECH 2012, 2012 |
|
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , in: Proceedings of Interspeech, Portland, Oregon, USA, 2012 |
|
Supervised and unsupervised Web-based language model domain adaptation, , , and , in: Proceedings of Interspeech, Portland, Oregon, USA, 2012 |
|
Structured Sparse Coding for Microphone Array Location Calibration, , , and , in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012 |
|
Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, and , in: Artificial Intelligence Journal, 194:176–202, 2013 |
[DOI] |
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , in: Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech), Portland, Oregon, 2012 |
|
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , Idiap-RR-16-2012 |
|
Integrating Language Identification to improve Multilingual Speech Recognition, , Idiap-RR-24-2012 |
|
Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation, and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
Emergent leaders through looking and speaking: from audio-visual data to multimodal recognition, , , , and , in: Journal on Multimodal User Interfaces, 2012 |
|
Automatic detection of conflict escalation in spoken conversations, , and , in: INTERSPEECH, ISCA, Portland, Oregon, USA, 2012 |
|
Speaker diarization of overlapping speech based on silence distribution in meeting recordings, and , in: INTERSPEECH, Portland, Oregon, USA, 2012 |
|
Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, and , Idiap-RR-14-2012 |
|
Extracting Informative Textual Parts from Web Pages Containing User-Generated Content, , and , in: 12th International Conference on Knowledge Management and Knowledge Technologies, ACM ICPS, Graz, Austria, pages 4:1-4:8, ACM, 2012 |
[URL] |
Robust triphone mapping for acoustic modeling, , and , Idiap-RR-02-2013 |
|
Robust triphone mapping for acoustic modeling, , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , Idiap-RR-01-2013 |
|
Synthetic References for Template-based ASR using Posterior Features, , and , in: Proceedings of Interspeech, Portland, Oregon, USA, 2012 |
|
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
Phase AutoCorrelation (PAC) features for noise robust speech recognition, , , and , in: Speech Communication, 54(7):867–880, 2012 |
[DOI] |
A Survey on Language Modeling using Neural Networks, and , Idiap-RR-32-2012 |
|
Notes on Probabilistic Linear Discriminant Analysis, and , Idiap-Com-03-2013 |
|
Bob: a free signal processing and machine learning toolbox for researchers, , , , , and , Idiap-RR-25-2012 |
|
Assessing Sparse Coding Methods for Contextual Shape Indexing of Maya Hieroglyphs, , and , in: Journal of Multimedia, 7(2):179-192, 2012 |
|
Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, Providence, Rhode Island, USA, 2012 |
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , Idiap-RR-21-2012 |
|
Supervised and unsupervised Web-based language model domain adaptation, , , and , Idiap-RR-22-2012 |
|
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , in: Actes de la conference conjointe JEP-TALN-RECITAL 2012, Grenoble, France, pages 193-200, ATALA/AFCP, 2012 |
|
Audiovisual Diarization Of People In Video Content, , and , in: Multimedia Tools and Applications, 2012 |
|
Combining transcription-based and acoustic-based speaker identifications for broadcast news, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012 |
|
Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, , , , , , , , , , , , , and , in: IEEE ICME Workshop on Hot Topics in Mobile Multimedia, 2012 |
|
Session Variability Modelling for Face Authentication, , , , and , Idiap-RR-17-2013 |
|
Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, , , , , , , , , , , , , and , Idiap-RR-13-2012 |
|
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns, , , , and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), pages 5, 2012 |
|
Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis, , and , in: IEEE Transactions on Audio, Speech and Language Processing, 2012 |
|
Combining Vocal Tract Length Normalization with Hierarchical Linear Transformations, , , and , in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012 |
|
On the Challenge of Classifying 52 Hand Movements from Surface Electromyography, , and , in: 34th Annual Conference of the IEEE Engineering in Medicine & Biology Society, 2012 |
|
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, , and , Idiap-RR-15-2012 |
|
Alternative search techniques for face detection using location estimation and binary features, , École Polytechnique Fédérale de Lausanne, 2012 |
|
Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012 |
|
Bayesian Approaches to Uncertainty in Speech Processing, , School of Computing Sciences, University of East Anglia, 2011 |
|
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies, and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), Istanbul, TR, pages 6, 2012 |
|
Using Sense-labeled Discourse Connectives for Statistical Machine Translation, and , in: Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra), Avignon, FR, pages 129-138, 2012 |
|
Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012 |
|
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, , and , in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60-67, 2012 |
|
Progress report of a project in very low bit-rate speech coding, , and , Idiap-RR-08-2012 |
|
From Nonverbal Cues to Perception: Personality and Social Attractiveness, , , , and , in: LNCS Proceedings on Cognitive Behavioural Systems, Springer, 2012 |
Automatic Attribution of Personality Traits Based on Prosodic Features, and , in: IEEE Transactions on Affective Computing, 2012 |
|
Translation Error Spotting from a User's Point of View, , Idiap-RR-31-2012 |
|
Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
|
Decision tree clustering for KL-HMM, and , Idiap-Com-01-2012 |
|
Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, , , and , Idiap-RR-07-2012 |
|
Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, , , and , in: IEEE Transactions on Robotics, 2012 |
[DOI] |
Transfer Learning of Visual Concepts across Robots: a Discriminative Approach, , and , Idiap-RR-06-2012 |
|
The INTERSPEECH 2012 Speaker Trait Challenge, , , , , , , , , , , and , in: Proceedings of INTERSPEECH, 2012 |
The ICSI RT-09 Speaker Diarization System, , , , , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371-381, 2012 |
[DOI] |
The Kaldi Speech Recognition Toolkit, , , , , , , , , , , , and , in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011 |
|
A tree-based distance between distributions: application to classification of neurons, and , in: ICASSP 2012: IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
Morphodynamic profiling to explore spatio-temporal signaling networks regulating neurite outgrowth, , , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
Automated Quantification of Morphodynamics for High-Throughput Live Cell Imaging Datasets, , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
Machine learning techniques to analyse complex, computer vision-extracted, dynamic cellular phenotypes, , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
|
Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, , , and , in: IEEE Transactions on Information Forensics and Security, 7(2):553-562, 2012 |
|
Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, , , and , Idiap-RR-03-2012 |
|
Using KL-divergence and multilingual information to improve ASR for under-resourced languages, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869-4872, 2012 |
|
Hierarchical Tandem Features for ASR in Mandarin, , and , in: Proceedings of Interspeech, 2011 |
Look at who's talking, , , , and , in: Proceedings of International Conference on Ambient Intelligence, pages 68-76, 2011 |
Recent Developments in Social Signal Processing, , and , in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 380-385, 2011 |
Understanding Social Signals in Multi-party Conversations: Automatic Recognition of Socio-Emotional Roles in the AMI Meeting Corpus, , , and , in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 374-379, 2011 |
Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, , , , , and , in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011 |
Conversation Analysis at Work: Detection of Conflict in Competitive Discussions through Automatic Turn-Organization Analysis, , , and , in: Cognitive Processing, 2012 |
Bridging the Gap Between Social Animal and Unsocial Machine: A Survey of Social Signal Processing, , , , , , and , in: IEEE Transactions on Affective Computing, 2012 |
Automatic Role Recognition in Multiparty Conversations: an Approach Based on Turn Organization, Prosody and Conditional Random Fields, and , in: IEEE Transactions on Multimedia, 2012 |
Introduction to Sequence Analysis for Human Behavior Understanding, and , in: "Computer Analysis of Human Behavior" by A. Salah and T. Gevers (eds.), pages 21-40, Springer Verlag, 2011 |
Social Signal Processing: The Research Agenda, , , , , , , , and , in: "Visual Analysis of Humans" by T.B. Moeslund, A. Hilton, V. Krueger and L. Sigal (eds.), pages 511-538, Springer Verlag, 2011 |
Analysis of Verbal and Nonverbal Communication and Enactment: The Processing Issues, , , , and , Springer Verlag, 2011 |
Open-ended Learning of Visual and Multi-modal Patterns, , École Polytechnique Fédérale de Lausanne, 2011 |
|
A Bimodal Sound Source Model for Vehicle Tracking in Traffic Monitoring, , , and , in: European Signal Processing Conference, 2011 |
|
Torch7: A Matlab-like Environment for Machine Learning, , and , in: BigLearn, NIPS Workshop, 2011 |
|
Learning Structured Embeddings of Knowledge Bases, , , and , in: Conference on Artificial Intelligence, 2011 |
|
Deep Learning for Efficient Discriminative Parsing, , in: International Conference on Artificial Intelligence and Statistics, 2011 |
|
Natural Language Processing (Almost) from Scratch, , , , , and , in: Journal of Machine Learning Research, 12:2493-2537, 2011 |
|
Fast Human Detection from Joint Appearance and Foreground Feature Subset Covariances, and , in: Computer Vision and Image Understanding, 115(10):1414-1426, 2011 |
|
Evaluation of Meeting Support Technology, and , in: Multimodal Signal Processing: Human Interactions in Meetings, pages 237-252, Cambridge University Press, 2012 |
User Requirements for Meeting Support Technology, and , in: Multimodal Signal Processing: Human Interactions in Meetings, pages 210-221, Cambridge University Press, 2012 |
Multimodal Signal Processing for Meetings: an Introduction, and , in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012 |
|
Broadband Beampattern for Multi-Channel Speech Acquisition and Distant Speech Recognition, , and , Idiap-RR-39-2011 |
|
Finding Audio-Visual Events in Informal Social Gatherings, , , and , in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011 |
|
Engagement-based Multi-party Dialog with a Humanoid Robot, , , , , , and , in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011 |
|
Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, and , Idiap-RR-38-2011 |
|
OM-2: An Online Multi-class Multi-kernel Learning Algorithm, , , , and , in: Proceedings of CVPR 2010, Online Learning for Computer Vision Workshop, 2010 |
|
Analysis of Group Conversations: Modeling Social Verticality, and , in: Computer Analysis of Human Behavior, pages 293-322, Springer London, 2011 |
A Nonverbal Behavior Approach to Identify Emergent Leaders in Small Groups, , , and , in: IEEE Transactions on Multimedia, 14(3-2):816-832, 2012 |
[DOI] |
Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis, , , , , , , , , , , , and , in: Computer Speech and Language, 2011 |
[DOI] [URL] |
Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, , in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011 |
Speaker Diarization, and , in: Multimodal Signal Processing: Human Interactions in Meetings, Cambridge University Press, 2012 |
[URL] |
Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus, and , in: Proceedings of Interspeech, 2011 |
|
Analysis and Comparison of Recent MLP Features for LVCSR Systems, , and , in: Proceedings of Interspeech 2011, 2011 |
|
Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, , and , in: Speech Communication, 54(1), 2012 |
[DOI] |
Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features, , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(8), 2011 |
[DOI] |
Data-driven extraction of spectral-dynamics based posteriors, , in: Handbook of Natural Language Processing and Machine Translation, Springer, 2011 |
[URL] |
Speaker Diarization of Meetings based on Speaker Role N-gram Models, , and , in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011 |
|
Multistream Speaker Diarization Through Information Bottleneck System Outputs Combination, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011 |
|
Overview of the CLEF 2009 medical image annotation track, , , , and , in: Workshop of the Cross-Language Evaluation Forum, Corfu, Greece, pages 85-93, Springer Berlin Heidelberg, 2009 |
[DOI] |
Object Recognition using Visuo-Affordance Maps, , , and , in: International Conference on Intelligent Robots and Systems, Taipei, pages 1572-1578, IEEE, 2010 |
[DOI] |
Towards a quantitative measure of rareness, and , in: DIRAC Workshop at the European Conference on Machine Learning, pages 129-136, Springer Berlin Heidelberg, 2010 |
[DOI] |
Transferring Activities: Updating Human Behavior Analysis, , , , and , in: Visual Surveillance Workshop at ICCV, 2011 |
|
Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer, , and , in: Proceedings of IEEE Computer Vision and Pattern Recognition Conference, San Francisco, CA, pages 3081-3088, IEEE, 2010 |
[DOI] |
Using Modality Replacement to Facilitate Communication between Visually and Hearing-Impaired People, , , , and , in: IEEE Multimedia, 18(2):26-37, 2011 |
[DOI] |
Domain-specific language model adaptation: a case study, , and , Idiap-Com-01-2013 |
|
VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, , , and , Idiap-RR-12-2012 |
|
Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, , , and , Idiap-RR-11-2012 |
|
Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, , , and , in: International Joint Conference on Biometrics, 2011 |
An Audio Visual Corpus for Emergent Leader Analysis, , and , in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011 |
Robustness of Group Delay Representations for Noisy Speech Signals, , and , Idiap-RR-36-2011 |
|
Robustness of Group Delay Representations for Noisy Speech Signals, , and , in: IJST (Springer), 14(4), 2011 |
|
Privacy-Sensitive Audio Features for Conversational Speech Processing, , Ecole Polytechnique Fédérale de Lausanne, 2011 |
|
Human Interaction Discovery in Smartphone Proximity Networks, and , in: Personal and Ubiquitous Computing, 2012 |
|
Joint Optimization of Hidden Conditional Random Fields and Non Linear Feature Extraction, , and , in: Proceedings of International Conference on Document Analysis and Recognition, 2011 |
Mining Large-Scale Smartphone Data for Personality Studies, , and , in: Personal and Ubiquitous Computing, 2012 |
|
Boosting Localized Features for Speaker and Speech Recognition, , Ecole Polytechnique Federale de Lausanne (EPFL), 2011 |
|
Multi-camera Open Space Human Activity Discovery for Anomaly Detection, , and , in: 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011 |
|
Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings, and , in: Interspeech, Florence, Italy, pages 953-956, 2011 |
|
Continuous Speech Recognition using Boosted Binary Features, , and , Idiap-RR-35-2011 |
|
Multimodal Cue Detection Engine for Orchestrated Entertainment, , , and , in: Proceedings International Conference on MultiMedia Modeling, Klagenfurt, Austria, 2012 |
|
Multimodal Cue Detection Engine for Orchestrated Entertainment, , , and , Idiap-RR-34-2011 |
|
Estimating Cohesion in Small Groups Using Audio-Visual Nonverbal Behavior, and , in: IEEE Trans. on Multimedia, Special Issue on Multimodal Affective Interaction, 12(6):563-575, 2010 |
|
Competition on Counter Measures to 2-D Facial Spoofing Attacks, , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011 |
|
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011 |
|
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , Idiap-RR-01-2012 |
|
Comparing machines and humans on a visual categorization test, , , , , and , in: Proceedings of the National Academy of Sciences, 2011 |
Boosting with Maximum Adaptive Sampling, and , in: Proceedings of the Neural Information Processing Systems Conference, 2011 |
Detection-Based Multi-Human Tracking Using a CRF Model, , and , in: The Eleventh IEEE International Workshop on Visual Surveillance, 2011 |
|
A Joint Estimation of Head and Body Orientation Cues in Surveillance Video, , and , in: IEEE International Workshop on Socially Intelligent Surveillance and Monitoring, 2011 |
|
Smartphone usage in the wild: a large-scale analysis of applications and context, , and , in: 13th International Conference on Multimodal Interaction, 2011 |
|
Building 'directional corpora' for unbiased contrastive analysis, and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 29-30, 2011 |
|
Disambiguating discourse connectives using parallel corpora: senses vs. translations, , , , , and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 104-105, 2011 |
|
A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation, , , and , in: Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?", Hamburg, Germany, pages 5, 2011 |
|
A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers, , and , in: IEEE Transactions on Information Forensics and Security, 7(1):241-254, 2012 |
|
Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, , and , in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011 |
|
VlogSense: Conversational Behavior and Social Attention in YouTube, and , in: Transactions on Multimedia Computing, Communications and Applications, 2011 |
|
The Kaldi Speech Recognition Toolkit, , , , , , , , , , , , and , Idiap-RR-04-2012 |
|
HEAT: Iterative Relevance Feedback with One Million Images, and , in: Proceedings of the IEEE International Conference on Computer Vision, pages 2118-2125, 2011 |
Multimodal Signal Processing: Human Interactions in Meetings, , , and , Cambridge University Press, 2012 |
[URL] |
A Just-in-Time Document Retrieval System for Dialogues or Monologues, , , and , in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, Portland, OR, pages 350-352, 2011 |
|
Finding Information in Multimedia Records of Meetings, , and , in: IEEE Multimedia, 19(2):48-57, 2012 |
[DOI] [URL] |
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), Portland, OR, pages 80-86, 2011 |
[URL] |
Learning from Images with Captions Using the Maximum Margin Set Algorithm, , , and , Idiap-RR-30-2011 |
|
A Large-Scale Database of Images and Captions for Automatic Face Naming, , , and , in: Proceedings of the 22nd British Machine Vision Conference, 2011 |
|
A Large-Scale Database of Images and Captions for Automatic Face Naming, , , and , Idiap-RR-26-2011 |
|
Multiclass Transfer Learning from Unconstrained Priors, , and , in: Proceedings of the 13th International Conference on Computer Vision, 2011 |
|
Multiclass Transfer Learning from Unconstrained Priors, , and , Idiap-RR-25-2011 |
|
Joint Adaptive Colour Modelling and Skin, Hair and Clothing Segmentation Using Coherent Probabilistic Index Maps, and , in: British Machine Vision Conference, British Machine Vision Association, Dundee, UK, 2011 |
|
Speech Enhancement using Beta-order MMSE Spectral Amplitude Estimator with Laplacian Prior, , , and , Idiap-RR-24-2011 |
|
Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pages 5192-5195, 2011 |
[DOI] |
Improving Articulatory Feature and Phoneme Recognition using Multitask Learning, and , in: Artificial Neural Networks and Machine Learning - ICANN 2011, pages 299-306, Springer Berlin / Heidelberg, 2011 |
[DOI] [URL] |
Inferring truth from multiple annotators for social interaction analysis, , and , in: Neural Information Processing Systems (NIPS) Workshop on Modeling Human Communication Dynamics (HCD), pages 4, 2011 |
|
Who's Who with Big-Five: Analyzing and Classifying Personality Traits with Smartphones, , and , in: International Symposium on Wearable Computing, pages 8, 2011 |
|
Exploiting observers' judgements for nonverbal group interaction analysis, , and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 6, IEEE, 2011 |
|
An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, , , , and , in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011 |
|
Intuitive Recipes for Uncertainty Decoding with SNR Features for Noise Robust ASR, and , Idiap-RR-23-2011 |
|
Privacy-sensitive recognition of group conversational context with sociometers, , , and , in: Springer Multimedia Systems Journal, 2011 |
|
Multi-party Speech Recovery Exploiting Structured Sparsity Models, , , and , in: Proceedings of Interspeech, 2011 |
|
Model-based Compressive Sensing for Multi-party Distant Speech Recognition, , and , in: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 2011 |
|
A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech, , and , in: Speech Communication, 2011 |
[DOI] |
Grapheme-based Automatic Speech Recognition using KL-HMM, , , and , in: Proceedings of Interspeech, 2011 |
|
The MASH Project, , , and , in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011 |
|
Using a Wikipedia-based Semantic Relatedness Measure for Document Clustering., and , in: Graph-based Methods for Natural Language Processing, 2011 |
|
Humans as Feature Extractors: Combining Prosody and Personality Perception for Better Speaking Style Recognition, and , in: Proceeding of IEEE Int Conference on Systems, Man, and Cybernetics - Special Sessions, 2011 |
|
Tracking Multiple Objects under Global Appearance Constraints, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2011 |
A real-time deformable detector., , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012 |
|
Multitask Learning to Improve Articulatory Feature Estimation and Phoneme Recognition, and , Idiap-RR-21-2011 |
|
Sensing the 'Health State' of our Society, , , , and , in: IEEE Pervasive Computing, Special Issue on Large-Scale Opportunistic Sensing, 2011 |
|
Pervasive Sensing to Model Political Opinions in Face-to-Face Networks, , , and , in: Pervasive, San Francisco, 2011 |
|
A Probabilistic Approach to Socio-Geographic Reality Mining, , Ecole Polytechnique Fédérale de Lausanne, 2011 |
|
GroupUs: Smartphone Proximity Data and Human Interaction Type Mining, and , in: 15th annual International Symposium on Wearable Computers, San Francisco, USA, 2011 |
|
Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, , in: Proceedings European Signal Processing Conference, Barcelona, Spain, 2011 |
|
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , in: Proceedings of Interspeech, Florence, Italy, pages 537-540, 2011 |
|
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , Idiap-RR-19-2011 |
|
An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, , , , and , Idiap-RR-16-2011 |
|
Modeling and understanding communities in online social media using probabilistic methods, , Ecole polytechnique fédérale de Lausanne, 2011 |
[DOI] [URL] |
How Comparable are Parallel Corpora? Measuring the Distribution of General Vocabulary and Connectives, , , and , in: Proceedings of 4th Workshop on Building and Using Comparable Corpora, ACL, Portland, OR, pages 78-86, 2011 |
|
A BSS-based Approach for Localization of Simultaneous Speakers in Reverberant Conditions, , , and , in: Proceedings of the 19th European Signal Processing Conference (EUSIPCO), 2011 |
|
LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, , and , in: Interspeech, 2011 |
|
LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, , and , Idiap-RR-14-2011 |
|
Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, , and , Idiap-RR-28-2012 |
|
Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(8), 2011 |
|
A Compressive Sensing Based Compressed Neural Network for Sound Source Localization, , and , in: Proceedings of International Symposium on Artificial Intelligence and Signal Processing, 2011 |
|
Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, , , and , Idiap-RR-28-2011 |
|
Multilingual Annotation and Disambiguation of Discourse Connectives for Machine Translation, , , and , in: Proceedings of 12th SIGdial Meeting on Discourse and Dialogue, Association for Computational Linguistics, Portland, OR, pages 194-203, 2011 |
|
Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, and , in: Proceedings of the 28th International Conference on Machine Learning, 2011 |
|
Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, and , Idiap-RR-11-2011 |
|
You Are Known by How You Vlog: Personality Impressions and Nonverbal Behavior in YouTube, , and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Barcelona, 2011 |
|
Just-in-Time Multimodal Association and Fusion from Home Entertainment, , , and , in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011 |
|
Social Focus of Attention as a Time Function Derived from Multimodal Signals, and , in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011 |
|
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , in: Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, Edinburgh, UK, 2011 |
|
Disambiguating Temporal-Contrastive Discourse Connectives for Machine Translation, , in: Proceedings of ACL-HLT 2011 Student Session, Association for Computational Linguistics, Portland, OR, pages 46-51, 2011 |
|
Multi-party Speech Recovery Exploiting Structured Sparsity Models, , , and , Idiap-RR-22-2011 |
|
Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2011 |
|
Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions, and , in: EURASIP Journal on Advances in Signal Processing, 2011 |
[DOI] |
Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, , Idiap-RR-20-2011 |
|
Social Focus of Attention as a Time Function Derived from Multimodal Signals, and , Idiap-RR-09-2011 |
|
HEAT: Iterative Relevance Feedback with One Million Images, and , Idiap-RR-33-2011 |
|
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , Idiap-RR-31-2011 |
|
When Users Meet Technology: The Meeting Browser Development Helix, , and , Idiap-RR-05-2011 |
|
Verified Speaker Localization Utilizing Voicing Level in Split-bands, , , and , in: Signal Processing, 89(6):1038-1049, 2009 |
|
Multiple Object Tracking using K-Shortest Paths Optimization, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011 |
FlowBoost - Appearance Learning from Sparsely Annotated Video, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2011 |
Delineating Trees in Noisy 2D Images and 3D Image Stacks, , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2799–2806, 2010 |
Joint Cascade Optimization Using a Product Of Boosted Classifiers, and , in: Proceedings of the Neural Information Processing Systems Conference, pages 1315–1323, 2010 |
Using object affordances to improve object recognition, , , , and , in: IEEE Transaction on Autonomous Mental Development, 2011 |
|
Towards semi-supervised learning of semantic spatial concepts for mobile robots, and , in: Journal of Physical Agents, 2011 |
|
Phoneme Recognition using Boosted Binary Features, , and , in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011 |
|
Posterior Features for Template-based ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, 2011 |
|
Cue integration through discriminative accumulation, and , in: International Conference on Computer Vision and Pattern Recognition, 2004 |
|
Towards semi-supervised learning of semantic spatial concepts, and , in: IEEE International Conference on Robotics and Automation, 2011 |
|
Towards semi-supervised learning of semantic spatial concepts, and , Idiap-RR-03-2011 |
|
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prague, Czech Republic, pages 5012-5015, 2011 |
|
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , Idiap-RR-08-2011 |
|
Call me Guru: user categories and large-scale behavior in YouTube, and , in: Social Media Computing, Springer, 2011 |
|
Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010 |
|
Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives, , , , , and , in: Proceedings of the 33rd Annual ACM SIGIR Conference, Geneva, Switzerland, pages 703, 2010 |
[DOI] |
Socially Intelligent Surveillance and Monitoring: Analysing Social Dimensions of Physical Space, , and , in: Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, San Francisco, pages 51-58, 2010 |
|
Mobile Social Signal Processing: vision and research issues, , and , in: Proceedings of the International Workshop on Mobile HCI, Lisbon, pages 513-516, 2010 |
|
Human Behavior Understanding, , Springer Verlag, 2010 |
Computational modeling of face-to-face social interaction using nonverbal behavioral cues, , Ecole Polytechnique Fédérale de Lausanne, 2011 |
|
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 525-530, IEEE, 2011 |
|
Finding Information in Multimedia Records of Meetings, , and , Idiap-RR-32-2011 |
|
Automatic Identification of Discourse Markers in Multiparty Dialogues: An In-Depth Study of Like and Well, and , in: Computer Speech and Language, 25(3):499-518, 2011 |
[DOI] |
Multi-Person Bayesian Tracking with Multiple Cameras, and , in: Multi-camera networks: principles and applications, pages 363-388, Academic Press, 2009 |
|
View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, and , in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010 |
|
Identifying Emergent Leadership in Small Groups using Nonverbal Communicative Cues, , , and , in: Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, Beijing, ACM, New York, NY, USA, 2010 |
[DOI] |
Speech Enhancement using an Improved MMSE Estimator with Laplacian Prior, , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
|
Single Channel Speech Separation with a Frame-based Pitch Range Estimation Method in Modulation Frequency, , , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
|
Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain, , , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
|
Social Network Analysis for Automatic Role Recognition, , Ecole Polytechnique Fédérale de Lausanne, 2010 |
|
Learning a 3D Human Pose Distance Metric from Geometric Pose Descriptor, , in: IEEE Transactions on Visualization and Computer Graphics, 17(11):1676-1689, 2011 |
|
3D human pose recovery from image by efficient visual feature selection, , , and , in: Computer Vision and Image Understanding, 115(3), 2011 |
|
Discovering Human Places of Interest from Multimodal Mobile Phone Data, and , in: Proceedings of 9th International Conference on Mobile and Ubiquitous Multimedia (MUM), Limassol, Cyprus, 2010 |
|
Parts-Based Face Verification using Local Frequency Bands, and , Idiap-RR-06-2011 |
|
Feature distribution modelling techniques for 3D face recognition, , and , in: Pattern Recognition Letters, 31:1324-1330, 2010 |
|
An Information Theoretic Approach to Speaker Diarization of Meeting Recordings, , Ecole polytechnique fédérale de Lausanne, 2010 |
|
Hierarchical and Parallel Processing of Auditory and Modulation Frequencies for Automatic Speech Recognition, , in: Speech Communication, 52(10):790-800, 2010 |
[DOI] |
Multi-Stream Speech Recognition based on Dempster-Shafer Combination Rule, , in: Speech Communication, 52(3):213-222, 2010 |
[DOI] |
Variational Bayesian Speaker Diarization of Meeting Recordings, , and , in: Proceedings of ICASSP, 2010 |
|
A Comparative Study of MLP Front-ends for Mandarin ASR, , , , and , in: Proceedings of Interspeech, Japan, 2010 |
|
Improving Speech Processing through Social Signals: Automatic Speaker Segmentation of Political Debates using Role based Turn-Taking Patterns., and , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010 |
|
Social Signal Processing: Understanding Nonverbal Communication in Social Interactions, and , in: Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands), 2010 |
|
Automatic Time Skew Detection and Correction, , in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Integrating Articulatory Features using Kullback-Leibler Divergence based Acoustic Model for Phoneme Recognition, and , Idiap-RR-02-2011 |
|
Hierarchical Tandem Features for ASR in Mandarin, , and , Idiap-RR-39-2010 |
|
Automatic Time Skew Detection and Correction, , Idiap-RR-42-2010 |
|
Face detection using boosted Jaccard distance-based regression, , and , Idiap-RR-02-2012 |
|
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, , and , in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010 |
|
Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, , and , in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010 |
Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , in: Proceedings of Interspeech, 2010 |
An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, , and , in: IEEE Transactions on Audio Speech and Language Processing, 19(2), 2011 |
[DOI] |
Fast Bounding Box Estimation based Face Detection, and , in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010 |
[URL] |
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , Idiap-RR-01-2011 |
|
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , in: International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , Idiap-RR-37-2010 |
|
Recognizing conversational context in group interaction using privacy-sensitive mobile sensors, , , and , in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010 |
|
Learning from Candidate Labeling Sets, and , in: Advances in Neural Information Processing Systems 23 (NIPS10), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2010 |
|
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , Idiap-RR-13-2011 |
|
Discovering Routines from Large-Scale Human Locations using Probabilistic Topic Models, and , in: ACM Transactions on Intelligent Systems and Technology, 2(1), 2011 |
|
Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, and , in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010 |
|
Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition, , and , Idiap-RR-04-2011 |
|
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, , and , Idiap-RR-36-2010 |
|
Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, and , in: "Affective Computing and Interaction: Psychological, Cognitive and Neuroscientific Perspectives" by D. Gokcay & G. Yildirim (eds.), igi-global, 2010 |
|
The Voice of Personality: Mapping Nonverbal Vocal Behavior into Trait Attributions, , and , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010 |
|
More than Words: Inference of Socially Relevant Information from Nonverbal Vocal Cues in Speech, , , and , in: Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues, A. Esposito (ed.), LNCS, Springer, 2010 |
|
Automatic Role Recognition Based on Conversational and Prosodic Behaviour, , , and , in: Proceedings of the ACM International Conference on Multimedia, 2010 |
|
On-line unsupervised adaptation for face verification using Gaussian Mixture Models with multiple user models, , and , Idiap-RR-07-2011 |
|
Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, , and , Idiap-RR-37-2011 |
|
Measuring the gap between HMM-based ASR and TTS, , and , Idiap-RR-34-2010 |
|
Measuring the gap between HMM-based ASR and TTS, , and , in: IEEE Journal of Selected Topics in Signal Processing, in print, 2010 |
|
Tuning-Robust Initialization Methods for Speaker Diarization, and , in: IEEE Transactions on Audio, Speech, and Language Processing, 18(8):2028-2037, 2010 |
[DOI] |
A Multi Cue Discriminative Approach to Semantic Place Classification, , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
|
The Wolf Corpus: Exploring group behaviour in a competitive role-playing game, and , in: ACM Multimedia, 2010 |
|
Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Automatic nonverbal analysis of social interaction in small groups: A review, , in: Image and Vision Computing, Special Issue on Human Behavior, 27(12), 2009 |
|
You Are Fired! Nonverbal Role Analysis in Competitive Meetings, , and , in: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Taiwan, 2009 |
|
Modeling interest in face-to-face conversations from multimodal nonverbal behavior, , in: J.-P. Thiran, H. Bourlard, and F. Marques (Eds.), Multimodal Signal Processing, Academic Press, 2009 |
|
Voices of Vlogging, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010 |
|
Towards rich mobile phone datasets: Lausanne data collection campaign, , , , and , in: Proc. ACM Int. Conf. on Pervasive Services (ICPS), Berlin, 2010 |
|
Visual Attention, Speaking Activity, and Group Conversational Analysis in Multi-Sensor Environments, and , in: H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, 2010 |
Inferring competitive role patterns in reality TV show through nonverbal analysis, and , in: Multimedia Tools and Applications, Special issue on Social Media, 2010 |
|
Estimating Dominance in Multi-Party Meetings Using Speaker Diarization, , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(4):847-860, 2011 |
|
Mining group nonverbal conversational patterns using probabilistic topic models, and , in: IEEE Transactions on Multimedia, 2010 |
|
Modeling and Understanding Flickr Communities through Topic-based Analysis, and , in: IEEE Transactions on Multimedia, 12(5), 2010 |
[DOI] |
Predicting Remote Versus Collocated Group Interactions using Nonverbal Cues, , and , in: Proc. Int. Conf. on Multimodal Interfaces, Workshop on Multimodal Sensor-Based Systems and Mobile Phones for Social Computing, Cambridge, 2009 |
[DOI] |
Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, and , in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010 |
|
Hands Free Audio Analysis from Home Entertainment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, and , in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), Carnegie Mellon University, Pittsburgh, PA, USA, 2010 |
|
Towards a standard for dialogue act annotation, , , , , , , , , , , and , in: 7th International Conference on Language Resources and Evaluation, Malta, 2010 |
[URL] |
The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites, , , and , in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010 |
|
The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, , , and , Idiap-RR-26-2010 |
|
English Spoken Term Detection in Multilingual Recordings, , and , in: Proceedings of Interspeech, ISCA, Makuhari, Japan, 2010 |
|
Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Study of Jacobian Normalization for VTLN, , and , Idiap-RR-25-2010 |
|
Implementation of VTLN for Statistical Speech Synthesis, , , and , in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
|
Floor Holder Detection and End of Speaker Turn Prediction in Meetings, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010 |
|
Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, , and , Idiap-RR-20-2010 |
|
A Multi-class Classification Strategy for Fisher Scores: Application to Signer Independent Sign Language Recognition, and , in: Pattern Recognition, 43(5), 2010 |
[DOI] |
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010 |
|
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , Idiap-RR-17-2010 |
|
An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , Idiap-RR-16-2010 |
|
Towards mixed language speech recognition systems, , and , in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010 |
|
Hierarchical Multilayer Perceptron based Language Identification, , and , in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010 |
|
Audio–Visual Synchronisation for Speaker Diarisation, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010 |
|
An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, , and , Idiap-RR-22-2010 |
|
Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, and , in: 20th International Conference on Pattern Recognition, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010 |
|
Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , Idiap-RR-23-2010 |
|
Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior, and , Idiap-RR-12-2010 |
|
Mining Human Location-Routines using a Multi-Level Topic Model, and , Idiap-RR-28-2010 |
|
Probabilistic Mining of Socio-Geographic Routines from Mobile Phone Data, and , in: IEEE Journal of Selected Topics in Signal Processing, 4(4), 2010 |
|
Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, and , Idiap-RR-29-2010 |
|
Just-in-Time Multimodal Association and Fusion from Home Entertainment, , , and , Idiap-RR-10-2011 |
|
Learning from Candidate Labeling Sets, and , Idiap-RR-27-2011 |
|
Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, , and , Idiap-RR-33-2010 |
|
Hierarchical Multilayer Perceptron based Language Identification, , and , Idiap-RR-14-2010 |
|
Towards mixed language speech recognition systems, , and , Idiap-RR-15-2010 |
|
Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, , , and , Idiap-RR-12-2011 |
|
Hands Free Audio Analysis from Home Entertainment, , and , Idiap-RR-27-2010 |
|
Fast Bounding Box Estimation based Face Detection, and , Idiap-RR-38-2010 |
|
The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, , and , Idiap-RR-08-2010 |
|
Online-Batch Strongly Convex Multi Kernel Learning, , and , Idiap-RR-07-2010 |
|
OM-2: An Online Multi-class Multi-kernel Learning Algorithm, , , , and , Idiap-RR-06-2010 |
|
Implementation of VTLN for Statistical Speech Synthesis, , , and , Idiap-RR-32-2010 |
|
Neural conditional random fields, and , in: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna, Sardinia, Italy, JMLR: W&CP, 2010 |
|
Online-Batch Strongly Convex Multi Kernel Learning, , and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010 |
|
The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, , and , in: Image and Vision Computing, 2010 |
[DOI] |
A Multimodal Corpus for Studying Dominance in Small Group Conversations, , and , in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010 |
|
Multilayer Perceptron Based Hierarchical Acoustic Modeling for Automatic Speech Recognition, , Ecole polytechnique fédérale de Lausanne, 2010 |
|
Joint Pose Estimator and Feature Learning for Object Detection, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2009 |
Evaluation of Probabilistic Occupancy Map People Detection for Surveillance Systems, , , , and , in: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009 |
Analysis of Phone Posterior Feature Space Exploiting Class Specific Sparsity and MLP-based Similarity Measure, , and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, and , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 33(1):101-116, 2011 |
|
Learning Large Margin Likelihood for Realtime Head Pose Tracking, and , in: IEEE Int. Conference on Image Processing, Cairo, Egypt, IEEE, 2009 |
|
Structure and appearance features for robust 3D facial actions tracking, and , in: IEEE Proc. Int. Conf. on Multimedia and Expo, IEEE, 2009 |
|
Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(2):225-241, 2011 |
|
Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, and , Idiap-RR-13-2010 |
|
Finding without searching, , Idiap-Com-01-2010 |
|
Application of Out-Of-Language Detection To Spoken-Term Detection, and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, , and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010 |
|
Multistream Speaker Diarization beyond Two Acoustic Feature Streams, , and , in: International Conference on Acoustics, Speech, and Signal Processing, 2010 |
|
AMIDA/Klewel Mini-Project, , , and , Idiap-RR-03-2010 |
|
An Alternative Scanning Strategy to Detect Faces, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
Canal9: A database of political debates for analysis of social interactions, , , and , in: Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing), Amsterdam, Netherlands, 2009 |
[DOI] |
Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, , , and , in: ICASSP 2010, 2010 |
|
Using Audio and Visual Cues for Speaker Diarisation Initialisation, and , in: International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010 |
|
On Improving Face Detection Performance by Modelling Contextual Information, , and , Idiap-RR-43-2010 |
|
Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
MLP Based Hierarchical System for Task Adaptation in ASR, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009 |
|
VTLN Adaptation for Statistical Speech Synthesis, , , and , Idiap-RR-41-2009 |
|
VTLN Adaptation for Statistical Speech Synthesis, , , and , in: Proceedings of ICASSP, Dallas, Texas, 2010 |
|
Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, and , Idiap-RR-05-2012 |
|
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , Idiap-RR-05-2010 |
|
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010 |
|
A Multimedia Retrieval System Using Speech Input, , , , , , , , , , and , in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009 |
|
The FEMTI guidelines for contextual MT evaluation: principles and tools, , and , in: Linguistica Antverpiensia New Series, 8, 2009 |
Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions, , in: Multimodal Signal Processing for Human-Computer Interaction, Elsevier / Academic Press, 2009 |
Accessing a Large Multimodal Corpus using an Automatic Content Linking Device, , , and , in: Multimodal Corpora: From Models of Natural Interaction to Systems and Applications, Springer-Verlag, 2009 |
[DOI] |
User Interface Design in a Just-in-time Retrieval System for Meetings, , , , , , and , Idiap-RR-38-2009 |
|
On MLP-based Posterior Features for Template-based ASR, , , and , Idiap-RR-37-2009 |
|
Memoirs of Togetherness from Audio Logs, , in: Proceedings International ICST Conference on User Centric Media, Venice, Italy, 2009 |
|
Autoregressive Models of Amplitude Modulations in Audio Compression, , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 2010 |
[URL] |
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , , and , in: EURASIP Journal on Audio Speech and Music Processing, 2010(856280), 2010 |
[DOI] [URL] |
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , in: Audio Engineering Society (AES) 127th Convention, Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA, 2009 |
[URL] |
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA '09), IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009 |
[URL] |
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , Idiap-RR-35-2009 |
|
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , Idiap-RR-34-2009 |
|
Autoregressive Models of Amplitude Modulations in Audio Compression, , and , Idiap-RR-33-2009 |
|
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , and , Idiap-RR-32-2009 |
|
On the vulnerability of face verification systems to hill-climbing attacks, , , , and , in: Pattern Recognition, 2009 |
Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior, and , in: Proceedings of the 17th ACM International Conference on Multimedia, ACM, 2009 |
|
MOBIO Database for the ICPR 2010 Face and Speech Competition, and , Idiap-Com-02-2009 |
|
Out-of-Scene AV Data Detection, , in: Proceedings IADIS International Conference Applied Computing, Rome, Italy, 2009 |
|
Analysis of F0 and Cepstral Features for Robust Automatic Gender Recognition, and , Idiap-RR-30-2009 |
|
Bayesian Networks to Combine Intensity and Color Information in Face Recognition, and , in: International Conference on Biometrics, Springer, 2009 |
|
A novel statistical generative model dedicated to face recognition, and , in: Image & Vision Computing, 2009 |
|
Who's Doing What: Joint Modeling of Names and Verbs for Simultaneous Face and Pose Annotation, , and , in: Advances in Neural Information Processing Systems 22 (NIPS09), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2009 |
|
Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, and , in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010 |
|
Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, and , in: British Machine Vision Conference 2009, 2009 |
|
Topic Models for Scene Analysis and Abnormality Detection, and , in: 9th International Workshop in Visual Surveillance, IEEE, Kyoto, Japan, 2009 |
|
Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings, , in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), 2009 |
|
Flickr Hypergroups, , , , and , in: Proceedings of the 17th ACM International Conference on Multimedia, 2009 |
|
Multimodal Signal Processing: Methods and Techniques to Build Multimodal Interactive Systems, , and , Academic Press, 2009 |
Memoirs of Togetherness from Audio Logs, , Idiap-RR-36-2009 |
|
Learning and Predicting Multimodal Daily Life Patterns from Cell Phones, and , in: ICMI-MLMI, 2009 |
|
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, and , Idiap-RR-02-2010 |
|
Multimodal Data Flow Controller, , Idiap-Com-01-2009 |
|
Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, and , Idiap-RR-28-2009 |
|
Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , Idiap-RR-40-2009 |
|
Dynamic Partitioned Sampling For Tracking With Discriminative Features, , and , in: Proceedings of the British Machine Vision Conference, London, 2009 |
|
Application of Out-Of-Language Detection To Spoken-Term Detection, and , Idiap-RR-04-2010 |
|
Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, , and , in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009 |
|
Automatic Out-of-Language Detection Based on Confidence Measures Derived from LVCSR Word and Phone Lattices, , in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009 |
|
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer-Verlag, Berlin Heidelberg, 2009 |
|
Robust Speaker Diarization for Short Speech Recordings, and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, pages 432-437, 2009 |
|
Robust Speaker Diarization for Short Speech Recordings, and , Idiap-RR-26-2009 |
|
On the design of audio features robust to the album-effect for music information retrieval., , Ecole Polytechnique Fédérale de Lausanne, 2009 |
An online framework for learning novel concepts over multiple cues, , and , in: Proceedings of the 9th Asian Conference on Computer Vision, Xi'an, China, 2009 |
|
Discovering Group Nonverbal Conversational Patterns with Topics, and , in: Proceedings ICMI-MLMI, 2009 |
|
Speaker Change Detection with Privacy-Preserving Audio Cues, , , and , in: Proceedings of ICMI-MLMI 2009, 2009 |
|
The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories, and , in: British Machine Vision Conference, 2009 |
|
Hill-Climbing Attack to an Eigenface-Based Face Verification System, , , , and , in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009 |
|
Co-occurrence Models for Image Annotation and Retrieval, , Idiap-RR-22-2009 |
|
Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr, and , Idiap-RR-21-2009 |
|
Automatic Role Recognition in Multiparty Recordings: Using Social Affiliation Networks for Feature Extraction, , and , in: IEEE Transactions on Multimedia, 11(7), 2009 |
|
Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models, , and , in: ACM International Conference on Multimedia, 2009 |
|
Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, , , and , in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009 |
|
Visual Speaker Localization Aided by Acoustic Models, , and , in: ACM Multimedia, 2009 |
Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets, and , Idiap-RR-19-2009 |
|
Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity, , Idiap-RR-18-2009 |
|
You live, you learn, you forget: continuous learning of visual places with a forgetting mechanism, , and , in: International Conference on Robotic and Systems, 2009 |
Towards a theoretical framework for learning multi-modal patterns for embodied agents, , , , , , , and , in: International Conference on Image Analysis and Processing, 2009 |
|
A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems, , , and , in: International Conference on Developmental Learning, 2009 |
|
Model adaptation with least-square SVM for adaptive hand prosthetics, , , , and , in: IEEE International conference on Robotics and Automation, 2009 |
|
Bounded kernel-based perceptrons, , and , in: Journal of Machine Learning Research, Accepted for pub, 2009 |
Cue Integration for Medical Image Annotation, , and , in: Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Springer-Verlag, 2008 |
|
Investigation of kNN Classifier on Posterior Features Towards Application in Automatic Speech Recognition, , and , Idiap-RR-11-2010 |
|
Towards Life-long Learning for Cognitive Systems: Online Independent Support Vector Machine, , , , and , in: Pattern Recognition, Accepted for Pub, 2009 |
Classifying Material in the Real World, , , and , in: Image and Vision Computing, accepted for pub, 2009 |
COLD: The COsy Localization Database, and , in: International Journal of Robotics Research, 28(5), 2009 |
|
KL Realignment for Speaker Diarization with Multiple Feature Streams, , and , in: 10th Annual Conference of the International Speech Communication Association, 2009 |
Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, , and , in: Proceedings of International conference on acoustics speech and signal processing, 2009 |
Robustness of Phase based Features for Speaker Recognition, , and , in: Proceedings of Interspeech, 2009 |
|
Automatic vs. human question answering over multimedia meeting recordings, and , in: 10th Annual Conference of the International Speech Communication Association, 2009 |
|
Robustness of Phase based Features for Speaker Recognition, , and , Idiap-RR-14-2009 |
|
Automatic vs. human question answering over multimedia meeting recordings, and , Idiap-RR-13-2009 |
|
Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, , , and , in: Proceedings of Interspeech 2009, 2009 |
|
Comparing meeting browsers using a task-based evaluation method, , Idiap-RR-11-2009 |
|
Tuning-Robust Initialization Methods for Speaker Diarization, and , Idiap-RR-35-2010 |
|
A Novel Criterion for Classifiers Combination in Multistream Speech Recognition, , in: IEEE Signal Processing Letters, 16(7), 2009 |
[DOI] |
Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, , , and , in: Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009 |
|
KL Realignment for Speaker Diarization with Multiple Feature Streams, , and , Idiap-RR-24-2010 |
|
Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features, and , Idiap-RR-20-2009 |
|
Speaker Change Detection with Privacy-Preserving Audio Cues, , , and , Idiap-RR-23-2009 |
|
Out-of-Scene AV Data Detection, , Idiap-RR-31-2009 |
|
Novel initialization methods for Speaker Diarization, , Idiap-RR-07-2009 |
|
Steerable Features for Statistical 3D Dendrite Detection, , , , and , in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention, 2009 |
Automatic Temporal Alignment of AV Data, , and , Idiap-RR-39-2009 |
|
Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, , , and , Idiap-RR-12-2009 |
|
Characterising Conversational Group Dynamics Using Nonverbal Behaviour, , and , in: Proceedings ICME 2009, 2009 |
|
Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, and , Idiap-RR-29-2009 |
|
Visual Activity Context For Focus of Attention Estimation in Dynamic Meetings, , and , in: International Conference on Multimedia & Expo, 2009 |
|
An SVM Confidence-Based Approach to Medical Image Annotation, , and , in: Workshop of the Cross-Language Evaluation Forum, 2008 |
|
Learning Rotational Features for Filament Detection, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2009 |
Discriminative Keyword Spotting, , and , in: Speech Communication, 51(4), 2009 |
|
Parts-Based Face Verification using Local Frequency Bands, and , in: Proceedings of IEEE/IAPR International Conference on Biometrics, 2009 |
|
Modeling and Understanding Flickr Communities through Topic-based Analysis, and , Idiap-RR-19-2010 |
|
Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009 |
|
Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009 |
|
Visual activity context for focus of attention estimation in dynamic meetings, , and , Idiap-RR-02-2009 |
|
MULTI-MODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING COMPRESSED-DOMAIN VIDEO FEATURES, , and , in: International Conference on Audio, Speech and Signal Processing, 2009 |
|
Topickr: Flickr Groups and Users Reloaded, and , in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008 |
Analyzing Flickr Groups, and , in: Proc. of the Intl. Conf. on Image and Video Retrieval, ACM, 2008 |
Discriminative Keyword Spotting, , and , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009 |
A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition, , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009 |
A Kernel Wrapper for Phoneme Sequence Recognition, and , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009 |
A Large Margin Algorithm for Forced Alignment, , , and , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009 |
Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks, , , , , and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, 2009 |
|
Support Vector Machines with a Reject Option, , , and , in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008 |
|
MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009 |
|
An Information Theoretic Approach to Speaker Diarization of Meeting Data, , and , in: IEEE Transactions on Audio Speech and Language Processing, 17(7), 2009 |
[DOI] |
Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, , , and , in: International Conference on Multimodal Interfaces, Chania, Greece, 2008 |
|
Tracking the visual focus of attention for a varying number of wandering people, , , and , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 30(7), 2008 |
|
Multi-camera 3d person tracking with particle filter in a surveillance environment, and , in: 16th European Signal processing Conference (EUSIPCO), 2008 |
|
Detecting queues at vending machines: a statistical layered approach, and , in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008 |
|
Multi-camera multi-person 3d space tracking with mcmc in surveillance scenarios, and , in: European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2), Marseille, 2008 |
|
Fast human detection from videos using covariance features, and , in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008 |
|
Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis, , , , , , and , in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Beijing, 2008 |
|
Exploiting Contextual Information for Speech/Non-Speech Detection, , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, , , , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Recognizing Human Visual Focus of Attention from Head Pose in Meetings, and , in: IEEE Transactions on Systems, Man, Cybernetics, Part-B, 39(1), 2009 |
|
Contextual classification of image patches with latent aspect models, , , and , in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009 |
|
Bayesian Networks to Combine Intensity and Color Information in Face Recognition, and , Idiap-RR-27-2009 |
|
Model Adaptation with Least-Squares SVM for Adaptive Hand Prosthetics, , , , and , Idiap-RR-05-2009 |
|
CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach, , and , Idiap-RR-77-2008 |
|
Face Detection using Ferns, and , Idiap-Com-01-2011 |
|
Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , Idiap-RR-75-2008 |
|
MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, , and , Idiap-RR-74-2008 |
|
Enhancing posterior based speech recognition systems, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
|
Automatic Out-of-Language Detection based on Confidence Measures derived from LVCSR Word and Phone Lattices, , Idiap-RR-06-2009 |
|
Predicting Two Facets of Social Verticality in Meetings from Five-Minute Time Slices and Nonverbal Cues, , , and , in: Proceedings - ICMI 2008, 2008 |
|
Modeling Dominance in Group Conversations using NonVerbal Activity Cues, , , and , in: IEEE Transactions on Audio, Speech and Language Processing, 2008 |
|
Methods for Asynchronous and Non-Invasive EEG-Based Brain-Computer Interfaces. Towards Intelligent Brain-Actuated Wheelchairs, , University of Barcelona, 2008 |
|
Principled Detection-by-classification from Multiple Views, , and , in: Proceedings of the International Conference on Computer Vision Theory and Applications, 2008 |
Classification-based Probabilistic Modeling of Texture Transition for Fast Line Search Tracking and Delineation, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008 |
Multi-layer Boosting for Pattern Recognition, , in: Pattern Recognition Letters, 30, 2009 |
Multi-Camera People Tracking with a Probabilistic Occupancy Map, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(2), 2008 |
Multiple Object Tracking using Flow Linear Programming, , and , Idiap-RR-10-2009 |
|
Integrating audio and vision for robust automatic gender recognition, and , Idiap-RR-73-2008 |
|
Parts-Based Face Verification using Local Frequency Bands, and , Idiap-RR-03-2009 |
|
How does a dictation machine recognize speech?, , and , in: Applied Signal Processing--A MATLAB approach, Springer MA, 2008 |
|
How does a dictation machine recognize speech?, , and , Idiap-RR-72-2008 |
|
Entropy coding of Quantized Spectral Components in FDLP audio codec, , and , Idiap-RR-71-2008 |
|
Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , in: Journal of Acoustical Society of America - Express Letters, 2008 |
|
Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , Idiap-RR-70-2008 |
|
CLEF2007 Image Annotation Task: an SVM-based Cue Integration Approach, , and , in: Proceedings of ImageCLEF 2007 -LNCS, 2007 |
|
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , in: Proceedings of the International Conference on Multimodal Interfaces, 2008 |
|
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , Idiap-RR-41-2010 |
|
Biologically Motivated Audio-Visual Cue Integration for Object, , , , , , , , and , in: Proceedings of the first International Conference on Cognitive Systems, 2008 |
|
SVM-based Discriminative Accumulation Scheme for Place Recognition, , and , in: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA08), 2008 |
|
Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
|
Probabilistic models for music, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
[URL] |
Machine Learning for Information Retrieval, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
|
ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007 |
Investigating Automatic Dominance Estimation in Groups From Visual Attention and Speaking Activity, , , , and , in: International Conference on Multi-modal Interfaces, 2008 |
|
Towards Audio-Visual On-line Diarization Of Participants In Group Meetings, and , in: European Conference on Computer Vision Workshop on Multi-camera and Multi-modal Sensor Fusion, 2008 |
|
Kernel Based Text-Independent Speaker Verification, , and , Idiap-RR-68-2008 |
|
Towards Robust Place Recognition for Robot Localization, , , , , and , in: IEEE International Conference on Robotics and Automation, 2008 |
|
Towards Robust Place Recognition for Robot Localization, , , , , and , Idiap-RR-40-2010 |
|
Class specific object recognition using kernel Gibbs distributions, , in: Electronic Letters on Computer Vision and Image Analysis, 7(2), 2008 |
|
Discriminative cue integration for medical image annotation, , and , in: Pattern Recognition Letters, 2008 |
|
Acoustic Models for Posterior Features in Speech Recognition, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
|
Acoustic Models for Posterior Features in Speech Recognition, , Idiap-RR-67-2008 |
|
Fast Recognition of Anticipation Related Potentials, , and , in: IEEE Transactions on Biomedical Engineering, 2008 |
|
SimpleMKL, , , and , in: Journal of Machine Learning Research, 9, 2008 |
|
Multi-layer Boosting for Pattern Recognition, , Idiap-RR-76-2008 |
|
Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, and , Idiap-RR-47-2008 |
|
Graphical representation of meetings on mobile devices, , and , in: MobileHCI 2008 (10th International Conference on Human-Computer Interaction with Mobile Devices and Services, Demonstrations Session), Amsterdam, 2008 |
|
Improving Contextual Quality Models for MT Evaluation Based on Evaluators' Feedback, , and , in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008 |
Task-based evaluation of meeting browsers: from BET task elicitation to user behavior analysis, , , and , in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008 |
|
Reference-based vs. task-based evaluation of human language technology, , in: LREC 2008 ELRA Workshop on Evaluation, ELRA, Marrakech, Morocco, 2008 |
|
The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings, , , , , , , and , in: Machine Learning for Multimodal Interaction V, Utrecht, Springer-Verlag, 2008 |
[DOI] |
Dimensionality of Dialogue Act Tagsets: An Empirical Analysis of Large Corpora, , in: Language Resources and Evaluation, 42(1), 2008 |
[DOI] |
Towards an Objective Test for Meeting Browsers: the BET4TQB Pilot Experiment, , , and , in: Machine Learning for Multimodal Interaction IV, Springer-Verlag, 2008 |
[DOI] |
Machine Learning for Multimodal Interaction V, and , Springer-Verlag, LNCS, volume 5237, 2008 |
[DOI] |
Machine Learning for Multimodal Interaction IV, , and , Springer-Verlag, LNCS, volume 4892, 2008 |
[DOI] |
Social Signal Processing: State-of-the-Art and Future Perspectives of an Emerging Domain, , , and , in: Proceedings of the ACM International Conference on Multimedia, 2008 |
|
Social Signals, their Function, and Automatic Analysis: A Survey, , , and , in: Proceedings of International Conference on Multimodal Interfaces (to appear), 2008 |
|
Fast Human Detection from Videos Using Covariance Features, and , Idiap-RR-68-2007 |
|
Multi-Layer Background Subtraction Based on Color and Texture, and , Idiap-RR-67-2007 |
|
Multi-Layer Background Subtraction Based on Color and Texture, and , in: CVPR 2007 Workshop on Visual Surveillance (VS2007), 2007 |
|
Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, and , John Wiley & Sons, 2008 |
Support Vector Machines with a Reject Option, , , and , Idiap-RR-01-2009 |
|
Discriminative Keyword Spotting, , and , in: Workshop on Non-Linear Speech Processing, Paris, France, 2007 |
|
Discriminative Kernel-Based Phoneme Sequence Recognition, , , , and , in: The 9th International Conference on Spoken Language Processing (INTERSPEECH), Pittsburgh, PA, 2006 |
|
Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, , , , and , in: ACM International Conference on Multimedia, Vancouver, Canada, 2008 |
|
Identifying Dominant People in Meetings from Audio-Visual Sensors, and , in: International Conference on Automatic Face and Gesture Recognition, Amsterdam, The Netherlands, 2008 |
|
Identifying Dominant People in Meetings from Audio-Visual Sensors, and , Idiap-RR-65-2008 |
|
Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, , , and , Idiap-RR-66-2008 |
|
Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, , , and , in: First IEEE Workshop on CVPR for Human Communicative Behavior Analysis, 2008 |
|
Predicting the Dominant Clique in Meetings through Fusion of Nonverbal Cues, , , and , in: ACM MM 2008, 2008 |
|
Topickr: Flickr Groups and Users Reloaded, and , Idiap-RR-61-2008 |
|
Stationary Features and Cat Detection, and , in: Journal of Machine Learning Research, 9, 2008 |
Automated Delineation of Dendritic Networks in Noisy Image Stacks, , and , in: Proceedings of the European Conference on Computer Vision, 2008 |
Multi-Camera Tracking and Atypical Motion Detection with Behavioral Maps, , and , in: Proceedings of the European Conference on Computer Vision, 2008 |
What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, and , Idiap-RR-49-2008 |
|
What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, and , in: ACM International Conference on Multimedia (ACMMM), 2008 |
|
Discovering Human Routines from Cell Phone Data with Topic Models, and , in: IEEE International Symposium on Wearable Computers (ISWC), 2008 |
|
Discovering Human Routines from Cell Phone Data with Topic Models, and , Idiap-RR-32-2008 |
|
Daily Routine Classification from Mobile Phone Data, and , in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), 2008 |
|
Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, , and , Idiap-RR-64-2008 |
|
Calibration from statistical properties of the visual world, , and , Idiap-RR-63-2008 |
|
Calibration from statistical properties of the visual world, , and , in: European Conf. on Computer Vision, 2008 |
|
Predicting the dominant clique in meetings through fusion of nonverbal cues, , , and , Idiap-RR-08-2008 |
|
Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps, , and , Idiap-RR-60-2005 |
|
Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, and , Idiap-RR-59-2005 |
|
Optimisation de réseaux de neurones, , EPFL, Lausanne, Switzerland, 1995 |
Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, , , , and , Idiap-RR-57-2008 |
|
Understanding Metro Station Usage Using Closed Circuit Television Cameras Analysis, , , , , , and , Idiap-RR-38-2008 |
|
The COLD Database, , , , and , Idiap-RR-49-2007 |
|
Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, and , Idiap-RR-75-2007 |
|
Analysis of Confusion Matrix to Combine Evidence for Phoneme Recognition, , , and , Idiap-RR-27-2007 |
|
Classifying Materials in the Real World, , , and , Idiap-RR-69-2007 |
|
A System for the Off-Line Recognition of Handwritten Text, , Idiap-RR-02-1994 |
|
View-Based Recognition, , Idiap-RR-09-1993 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , in: Interspeech 2008, 2008 |
|
Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, , and , in: Interspeech 2008, 2008 |
|
Melanoma Recognition Using Representative and Discriminative Kernel Classifiers, , and , in: International Workshop on Computer Vision Applications for Medical Image Analysis, 2006 |
|
A Discriminative Approach to Robust Visual Place Recognition, , , and , in: IEEE International Conference on Intelligent Robots and Systems (IROS), 2006 |
|
Biometric Person Authentication Is A Multiple Classifier Problem, and , in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007 |
|
Spin Glass Models of Markov Random Fields, , in: International Journal on Image, Systems and Technology, 16(5), 2006 |
|
Neural Network Initialization, and , in: From Natural to Artificial Neural Computation, Springer Verlag, 1995 |
Les domaines d'application des technologies vocales, , in: Fondements et perspectives en traitement automatique de la parole, GDR-PRC Communication Homme-Machine, 1995 |
A Hybrid Approach to Continuous Speech Recognition, and , in: The handbook of brain theory and neural networks, The MIT Press, 1995 |
Assessment of speaker verification systems, and , in: Spoken Language Resources and Assessment, EAGLES Handbook, 1995 |
Handwriting Recognition, , in: Recent Developments in Computer Vision, Springer, 1995 |
Applying Handwriting Recognition to US Census Forms, , in: Recent Developments in Computer Vision, Springer, 1995 |
|
An All-Optical Forward Propagation Multilayer Neural Network, and , in: From Natural to Artificial Neural Computation, Springer Verlag, 1995 |
Composite Kernel Learning, , and , Idiap-RR-59-2008 |
|
Composite Kernel Learning, , and , in: Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), Omnipress, 2008 |
|
Joint Head Tracking and Pose Estimation for Visual Focus of Attention Recognition, , École Polytechnique Fédérale de Lausanne, 2007 |
|
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , Idiap-RR-40-2008 |
|
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , in: AES 124th Convention, Audio Engineering Society, 2008 |
|
Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, , , , , , and , Idiap-RR-53-2008 |
|
Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, , , , , , and , in: Proceedings of the 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008 |
|
A Brain-Actuated Wheelchair: Asynchronous and Non-Invasive Brain-Computer Interfaces for Continuous Control of Robots, , , , , , and , in: Clinical Neurophysiology, 2008 |
|
Error-related EEG potentials in brain-computer interfaces, , École Polytechnique Fédérale de Lausanne, 2007 |
|
EEG-Based Brain-Computer Interaction: Improved Accuracy by Automatic Single-Trial Error Detection, and , in: Advances in Neural Information Processing Systems 21, 2007 |
|
Simultaneous Real-Time Detection of Motor Imagery and Error-Related Potentials for Improved BCI Accuracy, and , in: Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008 |
|
Daily Routine Classification from Mobile Phone Data, and , Idiap-RR-62-2007 |
|
Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, , , , and , in: Int Conf Spatial Cognition 2008, 2008 |
|
Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, , , , and , Idiap-RR-48-2008 |
|
Asynchronous detection and classification of oscillatory brain activity, , and , Idiap-RR-36-2008 |
|
Asynchronous detection and classification of oscillatory brain activity, , and , in: 16th European Signal Processing Conference, 2008 |
|
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , in: Workshop on Searching Spontaneous Conversational Speech at SIGIR, 2008 |
|
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , in: Proceedings of Interspeech, 2008 |
|
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , Idiap-RR-45-2008 |
|
Silence Models in Weighted Finite-State Transducers, , in: Interspeech, 2008 |
|
Predictive Models for Music, , and , Idiap-RR-51-2008 |
|
Probabilistic Models for Melodic Prediction, , and , Idiap-RR-50-2008 |
|
In-Context Phone Posteriors as Complementary Features for Tandem ASR, and , in: ICSLP'08, 2008 |
|
Hierarchical Integration of Phonetic and Lexical Knowledge in Phone Posterior Estimation, and , in: ICASSP'08, 2008 |
|
Enhanced Phone Posteriors for Improving Speech Recognition Systems, and , Idiap-RR-39-2008 |
|
Recognition of Anticipatory Behavior from Human EEG, , and , Idiap-RR-52-2008 |
|
Recognition of Anticipatory Behavior from Human EEG, , and , in: Proceedings of the 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008 |
|
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , in: INTERSPEECH 2008, 2008 |
|
Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , in: MLMI 2008, 2008 |
|
Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, , and , in: Interspeech 2008, 2008 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , in: Interspeech 2008, 2008 |
|
Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, , and , in: IEEE Signal Processing Letters, 2008 |
|
Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , Idiap-RR-42-2008 |
|
Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, , and , Idiap-RR-41-2008 |
|
Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, , and , in: EUSIPCO 2008, 2008 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , in: Interspeech 2008, 2008 |
|
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , in: Proc. 16th European Signal Processing Conference (EUSIPCO), 2008 |
|
Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, , in: Int. Conf. on Music Information Retrieval (ISMIR), 2008 |
|
Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, , Idiap-RR-46-2008 |
|
Reverse Correlation for analyzing MLP Posterior Features in ASR, , and , in: 11th International Conference on Text, Speech, and Dialogue, 2008 |
|
An Information Theoretic Approach to Speaker Diarization of Meeting Data, , and , Idiap-RR-58-2008 |
|
Scene image classification and segmentation with quantized local descriptors and latent aspect modeling, , École Polytechnique Fédérale de Lausanne, 2007 |
|
Bayesian methods for visual multi-object tracking with applications to human activity recognition, , École Polytechnique Fédérale de Lausanne, 2007 |
|
Client / World Model Synchronous Alignment for Speaker Verification, , , and , Idiap-RR-23-1999 |
|
Benchmarking Non-Parametric Statistical Tests, , and , Idiap-RR-38-2005 |
|
Multi-resolution Spectral Entropy Based Feature for Robust ASR, , , and , Idiap-RR-37-2004 |
|
Off-Line Cursive Script Recognition Based on Continuous Density HMM, and , Idiap-RR-25-1999 |
|
Discriminant linear processing of time-frequency plane, and , Idiap-RR-20-2006 |
|
Text Segmentation and Recognition in Complex Background Based on Markov Random Field, , and , Idiap-RR-17-2002 |
|
Nonlinear Spectral Transformations for Robust Speech Recognition, , and , Idiap-RR-36-2003 |
|
Hand Posture Classification and Recognition using the Modified Census Transform, , and , Idiap-RR-02-2006 |
|
Test of several external posterior weighting functions for multiband Full Combination ASR, and , Idiap-RR-27-2000 |
|
Extracting Information from Multimedia Meeting Collections, , and , Idiap-RR-50-2005 |
|
Machine Learning Approaches to Text Representation using Unlabeled Data, , Idiap-RR-76-2006 |
|
Motion likelihood and proposal modeling in Model-Based Stochastic Tracking, and , Idiap-RR-61-2004 |
|
Low cost duration modelling for noise robust speech recognition, , and , Idiap-RR-08-2002 |
|
Indexing spoken audio by LSA and SOMs, , Idiap-RR-06-2000 |
|
On the Complexity of Recognizing Regions Computable by Two-Layered Perceptrons, , Idiap-RR-03-1998 |
|
New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, , , and , Idiap-RR-29-2004 |
|
Combining multiple tracking algorithms for improved general performance, , and , Idiap-RR-13-2000 |
|
A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, and , Idiap-RR-35-2005 |
|
Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, , , , and , Idiap-RR-09-2004 |
|
Intrinsic dimension estimation of data: an approach based on Grassberger-Procaccia's algorithm, and , Idiap-RR-33-2000 |
|
Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework, , and , Idiap-RR-26-2001 |
|
Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis, and , Idiap-RR-34-2005 |
|
Automatic Speech Recognition: an Auditory Perspective, , and , Idiap-RR-17-1998 |
A State-of-the-art Neural Network for Robust Face Verification, , and , Idiap-RR-36-2002 |
|
Robust Speech Recognition and Feature Extraction Using HMM2, , , and , Idiap-RR-42-2001 |
|
Face Verification Using Synthesized Non-Frontal Models, and , Idiap-RR-60-2003 |
|
Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, , and , Idiap-RR-05-2001 |
|
Investigating Lexical Substitution Scoring for Subtitle Generation, , , , and , Idiap-RR-36-2006 |
|
Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, , Idiap-RR-51-2002 |
|
Robust Speaker Change Detection, , and , Idiap-RR-39-2002 |
|
Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, , and , Idiap-RR-09-2002 |
|
Speechreading using Probabilistic Models, and , Idiap-RR-12-1997 |
|
Handwritten Digit Recognition with Binary Optical Perceptron, , , and , Idiap-RR-15-1997 |
|
Video OCR for Sport Video Annotation and Retrieval, and , Idiap-RR-28-2001 |
|
Online Policy Adaptation for Ensemble Classifiers, and , Idiap-RR-69-2003 |
|
Exploring Contextual Information in a Layered Framework for Group Action Recognition, , and , Idiap-RR-41-2006 |
|
Taking on the Curse of Dimensionality in Joint Distributions Using Neural Networks, and , Idiap-RR-01-2000 |
|
Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, , , and , Idiap-RR-20-2004 |
|
LP-TRAP: Linear predictive temporal patterns, , and , Idiap-RR-59-2004 |
|
Nearly optimal exploration-exploitation decision thresholds, , Idiap-RR-12-2006 |
|
From missing data to maybe useful data: soft data modelling for noise robust ASR, , and , Idiap-RR-06-2001 |
|
Estimating the Intrinsic Dimension of Data with a Fractal-Based Method, and , Idiap-RR-02-2002 |
|
Multiple Hypotheses Video OCR, and , Idiap-RR-28-2000 |
|
On Use of Task Independent Training Data in Tandem Feature Extraction, and , Idiap-RR-57-2003 |
|
On the Use of Speech and Face Information for Identity Verification, and , Idiap-RR-10-2004 |
|
A Multi-sample Multi-source Model for Biometric Authentication, , and , Idiap-RR-14-2002 |
|
A Statistical Significance Test for Person Authentication, and , Idiap-RR-83-2003 |
|
Audio-Visual Person Verification, , , , and , Idiap-RR-18-1998 |
|
Comparison and Combination of Features in a Hybrid HMM/MLP and a HMM/GMM Speech Recognition System, , , , and , Idiap-RR-48-2003 |
|
Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, , and , Idiap-RR-04-2006 |
|
Truncation Confusion Patterns in Onset Consonants, , Idiap-RR-05-2007 |
|
Constructing visual models with a latent space approach, , , and , Idiap-RR-14-2005 |
|
Adapted Generative Models For Face Verification, , and , Idiap-RR-76-2003 |
|
A New Margin-Based Criterion for Efficient Gradient Descent, and , Idiap-RR-16-2003 |
|
Tracking the Multi Person Wandering Visual Focus of Attention, , , and , Idiap-RR-80-2005 |
|
On the Complexity of Recognizing Iterated Differences of Polyhedra, , Idiap-RR-10-1997 |
|
An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, , , , , , , , , , , and , Idiap-RR-24-1999 |
Using pitch frequency information in speech recognition, , and , Idiap-RR-23-2003 |
|
Juicer: A Weighted Finite-State Transducer speech decoder, , , , , and , Idiap-RR-21-2006 |
|
On Confusions in a Phoneme Recognizer, , and , Idiap-RR-10-2007 |
|
Hierarchical Multi-Stream Posterior Based Speech Recognition System, , and , Idiap-RR-25-2005 |
|
A Hierarchical Keyframe User Interface for Browsing Video over the Internet, , , and , Idiap-Com-02-2003 |
|
Using Auxiliary Sources of Knowledge for Automatic Speech Recognition, , Idiap-RR-90-2005 |
|
Using more informative posterior probabilities for speech recognition, , , and , Idiap-RR-91-2005 |
|
Towards Predicting Optimal Subsets of Base-Experts in Biometric Authentication Task, and , Idiap-RR-17-2004 |
|
Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, , , and , Idiap-RR-19-2000 |
|
Entropy-based Multi-stream Combination, , and , Idiap-RR-31-2002 |
|
Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , Idiap-RR-61-2006 |
|
Multi-Modal Data Fusion for Person Authentication using SVM, , Idiap-RR-07-1998 |
|
HMM2 - A Novel Approach to HMM Emission Probability Estimation, , and , Idiap-RR-30-2000 |
|
Multi-Person Tracking in Meetings: A Comparative Study, , , , , and , Idiap-RR-38-2006 |
|
Finding groups of people in Google news, and , Idiap-RR-68-2005 |
|
Multiscale Facial Expression Recognition using Convolutional Neural Networks, , Idiap-RR-52-2002 |
|
Automatic Facial Expression Analysis: A Survey, and , Idiap-RR-19-1999 |
|
Combinatorial Approach for Data Binarization, and , Idiap-RR-08-1999 |
|
Sociometry Based Multiparty Audio Recordings Segmentation, , Idiap-RR-78-2005 |
|
Experimental Protocol on the BANCA Database, , , , , , , and , Idiap-RR-05-2002 |
|
Robust Speech Recognition based on Multi-Stream Features, , and , Idiap-RR-01-1997 |
|
Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, and , Idiap-RR-26-2004 |
|
Modeling Individual and Group Actions in Meetings With Layered HMMs, , , , and , Idiap-RR-33-2004 |
|
Measuring the Performance of Face Localization Systems, , , and , Idiap-RR-53-2005 |
|
Tracking People in Meetings with Particles, , , , and , Idiap-RR-71-2004 |
|
Spectral Entropy Based Feature for Robust ASR, , , and , Idiap-RR-56-2003 |
|
Text Identification in Complex Background using SVM, , and , Idiap-RR-20-2001 |
|
Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, , and , Idiap-RR-63-2003 |
|
A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems, and , Idiap-RR-77-2005 |
|
Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model, and , Idiap-RR-17-2001 |
|
The Expected Performance Curve, , and , Idiap-RR-85-2003 |
|
A new normalization technique for cursive handwritten words, and , Idiap-RR-32-2000 |
|
Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, and , Idiap-RR-50-2006 |
|
Text detection and recognition in images and video sequences, , Idiap-RR-44-2003 |
|
Face Authentication Using Adapted Local Binary Pattern Histograms, and , Idiap-RR-06-2006 |
|
A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, , and , Idiap-RR-26-2005 |
|
Boosting Pixel-based Classifiers for Face Verification, and , Idiap-RR-65-2003 |
|
Audio visual speech recognition, , , , , , , and , Idiap-RR-35-2000 |
|
Modeling Human Interaction in Meetings, , , , , , , and , Idiap-RR-59-2002 |
|
Segmenting Multiple Concurrent Speakers Using Microphone Arrays, , and , Idiap-RR-21-2003 |
|
Localized mixtures of experts, , Idiap-RR-14-1998 |
Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, and , Idiap-RR-44-2004 |
|
Non-Linear Variance Reduction Techniques in Biometric Authentication, and , Idiap-RR-26-2003 |
|
Multimodal Integration for Meeting Group Action Segmentation and Recognition, , , , , and , Idiap-RR-31-2005 |
|
Microphone Array Speech Recognition: Experiments on Overlapping Speech in Meetings, and , Idiap-RR-41-2002 |
|
Improving Face Verification using Skin Color Information, and , Idiap-RR-44-2001 |
|
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, and , Idiap-RR-11-1998 |
|
Automatic Analysis of Multimodal Group Actions in Meetings, , , , , and , Idiap-RR-27-2003 |
|
Detecting Abandoned Luggage Items in a Public Space, , and , Idiap-RR-39-2006 |
|
Microphone Array Post-filter based on Noise Field Coherence, and , Idiap-RR-40-2001 |
|
Speech Recognition Using Advanced HMM2 Features, , and , Idiap-RR-24-2001 |
|
Analyzing Group Interactions in Conversations: a Review, , Idiap-RR-63-2006 |
|
Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, , and , Idiap-RR-11-2002 |
|
Scalability Analysis of Audio-Visual Person Identity Verification, , , and , Idiap-RR-04-2003 |
|
2D Multi-Person Tracking: A Comparative Study in AMI Meetings, , , , , and , Idiap-RR-37-2006 |
|
Effect of Segmentation Method on Video Retrieval Performance, and , Idiap-RR-83-2004 |
|
Text Enhancement with Asymmetric Filter for Video OCR, , and , Idiap-RR-19-2001 |
|
Clustering And Segmenting Speakers And Their Locations In Meetings, , and , Idiap-RR-55-2003 |
|
MAP Combination of Multi-Stream HMM or HMM/ANN Experts, , and , Idiap-RR-14-2001 |
|
Linking Objects in Videos by Importance Sampling, and , Idiap-RR-20-2002 |
|
Improving Face Authentication Using Virtual Samples, , and , Idiap-RR-40-2002 |
|
A Symmetric Transformation for LDA-based Face Verification, , Idiap-RR-67-2003 |
|
Detecting Group Interest-level in Meetings, , , and , Idiap-RR-51-2004 |
|
An Implicit Motion Likelihood for Tracking with Particle Filters, , and , Idiap-RR-15-2003 |
|
A Parallel Mixture of SVMs for Very Large Scale Problems, , and , Idiap-RR-12-2001 |
|
On Performance Evaluation of Face Detection and Localization Algorithms, , , and , Idiap-RR-80-2003 |
|
Improved Pairwise Coupling Classification With Correcting Classifiers, and , Idiap-RR-09-1997 |
|
Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, and , Idiap-RR-01-2004 |
|
User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, and , Idiap-RR-10-2002 |
|
Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, and , Idiap-RR-05-1997 |
|
Improving Speech Recognition Using a Data-Driven Approach, , and , Idiap-RR-66-2005 |
|
Robust speech recognition based on multi-stream processing, , Idiap-RR-41-2001 |
|
Multi-stream ASR: Oracle Test and Embedded Training, , and , Idiap-RR-62-2005 |
|
Object Localization in Metric Spaces for Video Linking, and , Idiap-RR-09-2003 |
|
Modeling Interactions from Email Communication, , , and , Idiap-RR-51-2005 |
|
On Automatic Annotation of Images with Latent Space Models, and , Idiap-RR-31-2003 |
|
An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, , and , Idiap-RR-60-2006 |
|
Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, and , Idiap-RR-46-2001 |
|
Phase AutoCorrelation (PAC) derived Robust Speech Features, , and , Idiap-RR-38-2002 |
|
Face Authentication Based on Local Features and Generative Models, , Idiap-RR-85-2005 |
|
Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models, , and , Idiap-RR-22-2003 |
|
Information Fusion and Person Verification Using Speech & Face Information, and , Idiap-RR-33-2002 |
|
A Probabilistic Model for Chord Progressions, , and , Idiap-RR-57-2005 |
|
Assessing Scene Structuring in Consumer Videos, , , , and , Idiap-RR-11-2004 |
|
Microphone Array Post-filter for Diffuse Noise Field, and , Idiap-RR-39-2001 |
|
Nonlinear Feature Transformations for Noise Robust Speech Recognition, , Idiap-RR-70-2004 |
|
Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, , and , Idiap-RR-25-2002 |
|
An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, , Idiap-RR-26-2002 |
|
Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting, , and , Idiap-RR-11-2007 |
|
Phoneme-Grapheme Based Speech Recognition System, , , and , Idiap-RR-37-2003 |
|
Multi-stream Processing for Noise Robust Speech Recognition, , Idiap-RR-28-2006 |
|
PhD Thesis: Speech Analysis with Production Constraints, , Idiap-RR-35-2001 |
|
The AMI Meeting Corpus: A Pre-announcement, , , , , , , , , , , , , , , , and , Idiap-RR-82-2005 |
|
An Investigation of Spectral Subband Centroids for Speaker Authentication, , and , Idiap-RR-62-2003 |
|
Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, , , and , Idiap-RR-54-2003 |
|
Boosting word error rates, and , Idiap-RR-49-2004 |
|
Offline Recognition of Large Vocabulary Cursive Handwritten Text, , and , Idiap-RR-01-2003 |
|
Gradient estimates of return, and , Idiap-RR-29-2005 |
|
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , Idiap-RR-19-2004 |
|
Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, , and , Idiap-RR-79-2005 |
|
Robust Face Analysis using Convolutional Neural Networks, , Idiap-RR-48-2001 |
|
A Comparative Study of Adaptation Methods for Speaker Verification, and , Idiap-RR-34-2001 |
|
PLSA-based Image Auto-Annotation: Constraining the Latent Space, and , Idiap-RR-30-2004 |
|
Local Binary Patterns as an Image Preprocessing for Face Authentication, , and , Idiap-RR-76-2005 |
|
A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, and , Idiap-RR-54-2004 |
|
Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, and , Idiap-RR-21-2000 |
|
Increasing Speech Recognition Noise Robustness with HMM2, , and , Idiap-RR-36-2001 |
|
Multimodal Multispeaker Probabilistic Tracking in Meetings, , , and , Idiap-RR-66-2004 |
|
Speaker Normalization using HMM2, , and , Idiap-RR-15-2002 |
|
Bayesian Factorial Linear Gaussian State-Space Models for Biosignal Decomposition, and , Idiap-RR-84-2005 |
|
Developing and Enhancing Posterior Based Speech Recognition Systems, , , and , Idiap-RR-23-2005 |
|
A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, , , and , Idiap-RR-33-2005 |
|
Modelling Auxiliary Features in Tandem Systems, , , and , Idiap-RR-21-2004 |
|
Modeling Scenes with Local Descriptors and Latent Aspects, , , , , and , Idiap-RR-79-2004 |
|
Confidence Evaluation for Risk Prediction, , and , Idiap-RR-22-2001 |
|
Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, , Idiap-RR-49-2001 |
|
Infinite Models for Speaker Clustering, , Idiap-RR-19-2006 |
|
Continuous Audio-Visual Speech Recognition, and , Idiap-RR-02-1998 |
|
Indexing Audio Documents by using Latent Semantic Analysis and SOM, , Idiap-RR-13-1999 |
|
Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, , and , Idiap-RR-52-2003 |
|
Using Pitch as Prior Knowledge in Template-Based Speech Recognition, , and , Idiap-RR-65-2005 |
|
EEG pattern recognition through multi-stream evidence combination, , and , Idiap-RR-31-2001 |
|
Writer Identification for Smart Meeting Room Systems, , , , , and , Idiap-RR-70-2005 |
|
Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, and , Idiap-RR-10-2001 |
|
EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, and , Idiap-RR-01-2005 |
|
Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, , , , , and , Idiap-RR-02-2000 |
|
On Performance / Robustness / Complexity Trade-Offs in Face Verification, , and , Idiap-RR-74-2004 |
|
A Neural Network for Text Representation, and , Idiap-RR-12-2005 |
|
Video Indexing and Similarity Retrieval by Largest Common Subgraph Detection using Decision Trees, , and , Idiap-RR-15-2000 |
|
Learning the Decision Function for Speaker Verification, and , Idiap-RR-40-2000 |
|
Spectral Entropy Feature in Full-Combination Multi-stream for Robust ASR, and , Idiap-RR-10-2005 |
|
Evaluation of Formant-Like Features for ASR, , , , , and , Idiap-RR-04-2002 |
|
Multi-Modal Audio-Visual Event Recognition for Football Analysis, , and , Idiap-RR-12-2003 |
|
On the Decomposition of Polychotomies into Dichotomies, and , Idiap-RR-08-1996 |
|
On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, , and , Idiap-RR-09-2005 |
|
A Color and Gradient Local Descriptor Fusion Scheme For Object Recognition, and , Idiap-RR-71-2003 |
|
Tangent Vector Kernels for Invariant Image Classification with SVMs, and , Idiap-RR-75-2003 |
|
Fast latent semantic indexing of spoken documents by using self-organizing maps, , Idiap-RR-20-1999 |
|
A Discriminative Approach for the Retrieval of Images from Text Queries, , and , Idiap-RR-15-2006 |
|
Audio-visual probabilistic tracking of multiple speakers in meetings, , , and , Idiap-RR-27-2005 |
|
Semi-supervised Meeting Event Recognition with Adapted HMMs, , and , Idiap-RR-15-2005 |
|
Joint Speech and Speaker Recognition, , Idiap-RR-28-2005 |
|
Short-Term Spatio-Temporal Clustering of Sporadic and Concurrent Events, , and , Idiap-RR-14-2004 |
|
HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, and , Idiap-RR-49-2003 |
|
Person Authentication using Brainwaves (EEG) and Maximum A Posteriori Model Adaptation, and , Idiap-RR-81-2005 |
|
User Authentication via Adapted Statistical Models of Face Images, , and , Idiap-RR-38-2004 |
|
New Approaches Towards Robust and Adaptive Speech Recognition, , and , Idiap-RR-01-2001 |
|
Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, and , Idiap-RR-23-2004 |
|
An Optical Thresholding Perceptron, , , , and , Idiap-RR-16-1997 |
|
A Neural Network to Retrieve Images from Text Queries, and , Idiap-RR-33-2006 |
|
Client Dependent GMM-SVM Models for Speaker Verification, and , Idiap-RR-03-2003 |
|
Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, , and , Idiap-RR-44-2002 |
A Robust Speaker Clustering Algorithm, and , Idiap-RR-38-2003 |
|
Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, , , and , Idiap-RR-52-2005 |
|
Data binarization by discriminant elimination, , and , Idiap-RR-04-1999 |
|
A survey on Off-Line Cursive Word Recognition, , Idiap-RR-43-2000 |
|
Sector-Based Detection for Hands-Free Speech Enhancement in Cars, , and , Idiap-RR-67-2004 |
|
Noisy Text Categorization, , Idiap-RR-61-2003 |
|
Links between Perceptrons, MLPs and SVMs, and , Idiap-RR-06-2004 |
|
Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, , and , Idiap-RR-14-2000 |
|
Estimating the Quality of Face Localization for Face Verification, , , and , Idiap-RR-07-2004 |
|
A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, , , and , Idiap-RR-25-2003 |
|
Theme Topic Mixture Model: A Graphical Model for Document Representation, and , Idiap-RR-05-2004 |
|
Application of Information Retrieval Techniques to Single Writer Documents, , Idiap-RR-12-2004 |
|
A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Contrast Independent Features and Machine Learning Methods, and , Idiap-RR-42-2003 |
|
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, and , Idiap-RR-10-2006 |
|
Learning influence among interacting Markov chains, , , and , Idiap-RR-48-2005 |
|
Video Text Segmentation Using Particle Filters, and , Idiap-RR-43-2003 |
|
Statistical Transformation Techniques for Face Verification Using Faces Rotated in Depth, and , Idiap-RR-04-2004 |
|
Modeling Auxiliary Information in Bayesian Network Based ASR, , and , Idiap-RR-11-2001 |
|
Boosting HMMs with an application to speech recognition, and , Idiap-RR-41-2003 |
|
User-Customized Password Speaker Verification Using Multiple Reference and Background Models, and , Idiap-RR-41-2004 |
|
EEG Classification using Generative Independent Component Analysis, and , Idiap-RR-77-2004 |
|
Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering, and , Idiap-RR-48-2002 |
|
More Efficiency in Multiple Kernel Learning, , , and , Idiap-RR-18-2007 |
|
Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, and , Idiap-RR-59-2003 |
|
Analysis and Classification of EEG Signals using Probabilistic Models for Brain Computer Interfaces, , Idiap-RR-48-2006 |
[URL] |
Natural Scene Image Modeling using Color and Texture Visterms, and , Idiap-RR-17-2006 |
|
Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, and , Idiap-RR-15-2001 |
|
The segmentation of multi-channel meeting recordings for automatic speech recognition, , and , Idiap-RR-22-2006 |
|
Location Based Speaker Segmentation, and , Idiap-RR-43-2002 |
|
Online Policy Adaptation for Ensemble Classifiers, and , Idiap-RR-69-2003 |
|
A Frequency-Domain Silence Noise Model, , and , Idiap-RR-13-2005 |
|
Sociometry Based Multiparty Audio Recordings Summarization, , Idiap-RR-27-2006 |
|
Learning the structure of image collections with latent aspect models, , Idiap-RR-06-2007 |
|
HMM Mixtures (HMM2) for Robust Speech Recognition, , Idiap-RR-34-2003 |
|
A Probabilistic Framework for Joint Head Tracking and Pose Estimation, and , Idiap-RR-78-2003 |
|
Finding Structure in Consumer Videos by Probabilistic Hierarchical Clustering, , and , Idiap-RR-22-2002 |
|
Illumination-robust Pattern Matching Using Distorted Color Histograms, and , Idiap-RR-09-1998 |
|
Multi-resolution RASTA filtering for TANDEM-based ASR, and , Idiap-RR-18-2005 |
|
Hidden Markov Models and other Finite State Automata for Sequence Processing, and , Idiap-RR-37-2001 |
|
A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, and , Idiap-RR-68-2004 |
|
Local Machine Learning Models for Spatial Data Analysis, and , Idiap-RR-34-2000 |
|
Multimodal Authentication using Asynchronous HMMs, , Idiap-RR-02-2003 |
|
Fusion of Face and Speech Data for Person Identity Verification, , and , Idiap-RR-03-1999 |
|
Cursive Character Recognition by Learning Vector Quantization, and , Idiap-RR-47-2000 |
|
Role Recognition in Broadcast News Using Social Network Analysis and Duration Distribution Modeling, , Idiap-RR-35-2006 |
|
How Do Correlation and Variance of Base-Experts Affect Fusion in Biometric Authentication Tasks?, and , Idiap-RR-18-2004 |
|
Detection and Application of Influence Rankings in Small Group Meetings, , , and , Idiap-RR-49-2006 |
|
Confidence Measures for Multimodal Identity Verification, , , and , Idiap-RR-38-2001 |
|
Unsupervised Spectral Subtraction for Noise-Robust ASR, , , and , Idiap-RR-42-2005 |
|
Neural Networks in Automatic Speech Recognition, , , and , Idiap-RR-09-2001 |
|
Application of Information Retrieval Technologies to Presentation Slides, and , Idiap-RR-36-2005 |
|
Speech Coding based on Spectral Dynamics, , , and , Idiap-RR-05-2006 |
|
Evaluation of Multiple Cues Head Pose Tracking Algorithm in Indoor Environments, and , Idiap-RR-05-2005 |
|
Mixtures of Experts Estimate A Posteriori Probabilities, , Idiap-RR-07-1997 |
|
Effect of Recognition Errors on Information Retrieval Performance, , Idiap-RR-08-2004 |
|
Posterior Based Keyword Spotting with A Priori Thresholds, , , and , Idiap-RR-67-2006 |
|
Image Classification by Neural Networks for the Quality Control of Watches, , and , Idiap-RR-10-1996 |
|
Latent Semantic Indexing by Self-Organizing Map, and , Idiap-RR-12-1999 |
|
Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, and , Idiap-RR-18-2002 |
|
PLP$^2$: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns, , and , Idiap-RR-60-2004 |
|
Confusion matrix based posterior probabilities correction, and , Idiap-RR-53-2002 |
|
Recognition of Asymmetric Facial Action Unit Activities and Intensities, and , Idiap-RR-22-1999 |
|
Using Posterior-Based Features in Template Matching for Speech Recognition, , and , Idiap-RR-23-2006 |
|
Combining Neural Gas and Learning Vector Quantization for Cursive Character Recognition, and , Idiap-RR-18-2001 |
|
An Investigation of F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, and , Idiap-RR-46-2004 |
|
Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, and , Idiap-RR-58-2003 |
|
A supervised learning approach based on STDP and polychronization in spiking neuron networks, , and , Idiap-RR-54-2006 |
|
Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, and , Idiap-RR-53-2003 |
|
Sparse Probabilistic Classifiers, and , Idiap-RR-19-2007 |
|
Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, and , Idiap-RR-22-2000 |
|
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , Idiap-RR-28-2004 |
|
Semi-supervised Adapted HMMs for Unusual Event Detection, , and , Idiap-RR-80-2004 |
|
Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, , , and , Idiap-RR-47-2003 |
|
Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, , and , Idiap-RR-70-2003 |
|
Speech Acquisition in Meetings with an Audio-Visual Sensor Array, , , , and , Idiap-RR-03-2005 |
|
Embedding Motion in Model-Based Stochastic Tracking, , and , Idiap-RR-72-2003 |
|
Noisy Text Categorization, , Idiap-RR-03-2004 |
|
A Meeting Browser Evaluation Test, , , and , Idiap-RR-02-2005 |
|
On the Combination of Speech and Speaker Recognition, and , Idiap-RR-19-2003 |
|
On automatic annotation of meeting databases, , , , and , Idiap-RR-06-2003 |
|
An Online Audio Indexing System, , and , Idiap-RR-39-2003 |
|
Speech recognition with auxiliary information, , and , Idiap-RR-58-2002 |
Inferring Document Similarity from Hyper-links, and , Idiap-RR-21-2005 |
|
Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, , and , Idiap-RR-45-2001 |
|
HMM2- Extraction of Formant Features and their Use for Robust ASR, , and , Idiap-RR-42-2000 |
|
A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, and , Idiap-RR-15-2004 |
|
Audio-Visual Speaker Tracking with Importance Particle Filters, , , , and , Idiap-RR-37-2002 |
|
On Factorizing Spectral Dynamics for Robust Speech Recognition, , , and , Idiap-RR-32-2003 |
|
Large Scale Machine Learning, , Idiap-RR-42-2004 |
|
Acoustic-Labial Speaker Verification, , , and , Idiap-RR-13-1997 |
|
Text dependent speaker verification using binary classifiers, , and , Idiap-RR-08-1997 |
|
Towards Robust and Adaptive Speech Recognition Models, , and , Idiap-RR-01-2002 |
|
Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, , Idiap-RR-55-2004 |
|
Face Verification using MLP and SVM, and , Idiap-RR-21-2002 |
|
Conditional Gaussian Mixture Models for Environmental Risk Mapping, , and , Idiap-RR-12-2002 |
|
User-Customized Password HMM Based Speaker Verification, and , Idiap-RR-35-2002 |
|
Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, and , Idiap-RR-08-2000 |
|
Unknown-Multiple Speaker clustering using HMM, , , and , Idiap-RR-07-2002 |
|
Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, and , Idiap-RR-63-2004 |
|
Multiple Timescale Feature Combination towards Robust Speech Recognition, , Idiap-RR-29-2000 |
|
A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, , and , Idiap-RR-48-2000 |
|
Multimodal Group Action Clustering in Meetings, , , , and , Idiap-RR-24-2004 |
|
A Unified Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification, and , Idiap-RR-62-2004 |
|
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), , and , Idiap-RR-63-2005 |
|
Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, and , Idiap-RR-45-2002 |
|
Indexation de Documents Manuscrits, , Idiap-RR-31-2006 |
|
From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, , and , Idiap-RR-20-2000 |
|
Using RASTA in task independent TANDEM feature extraction, , and , Idiap-RR-22-2004 |
|
On Spectral Methods and the Structuring of Home Videos, , and , Idiap-RR-55-2002 |
|
Data utility modelling for mismatch reduction, , Idiap-RR-30-2001 |
|
Order Matters: A Distributed Sampling Method for Multi-Object Tracking, , Idiap-RR-25-2004 |
|
Integrating co-occurrence and spatial contexts on patch-based scene segmentation, , , and , Idiap-RR-30-2005 |
|
Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, , , and , Idiap-RR-24-2002 |
|
The Expected Performance Curve: a New Assessment Measure for Person Authentication, and , Idiap-RR-84-2003 |
|
Robust HMM-Based Speech/Music Segmentation, , and , Idiap-RR-33-2001 |
|
A neural network for classification with incomplete data, , Idiap-RR-23-2000 |
|
TRAP-TANDEM: Data-driven extraction of temporal features from speech, , Idiap-RR-50-2003 |
|
Learning to Retrieve Images from Text Queries with a Discriminative Model, , and , Idiap-RR-32-2006 |
|
Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, , , and , Idiap-RR-18-2006 |
|
Swiss French PolyPhone and PolyVar: telephone speech databases to model inter- and intra-speaker variability, , , , and , Idiap-RR-01-1996 |
|
Supervised Ontogenic Networks, and , in: Handbook of Neural Computation, 1996 |
Superceptron Construction, , and , in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996 |
Sun Workstation and SwissNet Platform for Speech Recognition and Speaker Verification over the Telephone, , , , and , in: Proceedings of Workstations und ihre Anwendungen, SIWORK'96, 1996 |
|
Statistical lip modelling for visual speech recognition, , and , in: Proceedings of the 8th European Signal Processing Conference (Eusipco'96), 1996 |
|
Speaker-Dependent Speech Recognition Based on Phone-Like Units Models --- Application to Voice Dialing, and , Idiap-RR-09-1996 |
|
Speaker identification by lipreading, , and , in: Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), 1996 |
|
Speechreading using shape and intensity information, , and , in: Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), 1996 |
|
Sparse Initial Topologies for High Order Perceptrons, , and , in: Proceedings of the International Conference on Neural Networks, IEEE, 1996 |
Semi-automatic HMM-based annotation of the PolyCOST Database, , , and , in: Application of speaker recognition techniques in telephony, COST250, 1996 |
Secured vocal access to telephone servers, , , , and , in: Proceedings of IVTTA 1996 IEEE Third Workshop Interactive Voice Technology for Telecommunications Applications, 1996 |
Secured vocal access to telephone servers, , , , and , Idiap-RR-04-1996 |
|
Reconnaissance et compréhension de la parole: évaluation et applications, , , , and , in: Fondements et perspectives en traitement automatique de la parole, AUPELF -- UREF, 1996 |
Présentation du Modèle DRM, , Idiap-Com-03-1996 |
|
Polycost Database, , and , 1996 |
Overcoming Inaccuracies in Optical Multilayer Perceptrons, , and , in: Proceedings of the First International Symposium on Neuro-Fuzzy Systems (AT'96), Lausanne, Switzerland, AATI, 1996 |
On Variations of the Convex Hull Operator, , Idiap-RR-06-1996 |
|
On the Power of Democratic Networks, , in: SIAM Journal of Discr. Math, 9(02), 1996 |
|
On the Complexity of the Class of Regions Computable by a Two-Layered Perceptron, , Idiap-RR-03-1996 |
|
New time-frequency derived cepstral coefficients for automatic speech recognition, and , in: Proceedings of the 8th European Signal Processing Conference (Eusipco'96), 1996 |
|
Neural Network Topologies, , in: Handbook of Neural Computation, 1996 |
Neural Network Pruning and Pruning Parameters, and , in: The 1st Workshop on Soft Computing, Dept. of Information Electronics Nagoya University, 1996 |
|
Multi-Stream Speech Recognition, , and , Idiap-RR-07-1996 |
|
Multi-modal person verification tools using speech and images, , in: European Conference on Multimedia Applications, Services and Techniques, 1996 |
Machine Recognition and Applications, , and , in: Speechreading by Humans and Machines, Springer Verlag, 1996 |
Locating and tracking facial speech features, , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR'96), IAPR, 1996 |
|
Learning to recognise talking faces, , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR'96), IAPR, 1996 |
|
Incorporation of Liquid-Crystal Light Valve Non-Linearities in Optical Multilayer Neural Networks, , and , in: Applied Optics, 35(26), 1996 |
|
Image Classification by Neural Networks for the Quality Control of Watches, , and , in: Proceedings ISAI /IFIS 1996, ITESM, Cancun, Mexico, ITESM, 1996 |
Hardware-Friendly Learning Algorithms for Neural Networks: An Overview, and , in: Proceedings of the Fifth International Conference on Microelectronics for Neural Networks and Fuzzy Systems: MicroNeuro'96, EPFL and CSEM, Lausanne, Switzerland, IEEE Computer Society Press, 1996 |
|
Handbook of Neural Computation, Institute of Physics and Oxford University Press, The Computational Intelligence Library, 1996 |
Generalized Cauchy Machines, and , in: Neurocomputing, 1996 |
Finding Lines Under Bounded Error, , in: Pattern Recognition, 29(01), 1996 |
Extended Cauchy Machines, and , in: Proceedings of the International Conference on Neural Information Processing, 1996 |
ETC_vérif : un environnement multi-agents de reconnaissance automatique de la parole en continu, and , in: Proceedings of JEP'96: XXIemes Journees d'etude sur la Parole, 1996 |
|
Datapump Full-Duplex, , , and , Idiap-Com-02-1996 |
|
Constructive Training Methods for Feedforward Neural Networks with Binary Weights, and , in: International Journal of Neural Systems, 7(2), 1996 |
|
Connectionist Quantization Functions, , and , in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996 |
|
Combining methods to improve speaker verification decision, , , and , Idiap-RR-02-1996 |
|
Combining methods to improve speaker verification decision, , , and , in: Proceedings of The Fourth International Conference on Spoken Language Processing, ICSLP, 1996 |
|
Bounds on the Degree of High Order Binary Perceptrons, , in: Proceedings of ESANN'96, D facto, 1996 |
|
Annulation d'écho sur une ligne téléphonique, , , and , Idiap-Com-06-1996 |
|
An Implementation of Logical Analysis of Data, , , , , and , Idiap-RR-05-1996 |
|
Amelioration des performances de verification du locuteur par combinaison de methodes, , , and , in: Journees d'etudes sur la parole, JEP, 1996 |
Active Shape Models for Visual Speech Feature Extraction, , and , in: Speechreading by Humans and Machines, Springer Verlag, 1996 |
|
A Review of MicroNeuro'96, February 12-14, 1996, Lausanne, Switzerland, , in: Neurocomputing, 12(04), 1996 |
A Method for All-Positive Optical Multilayer Perceptrons, , and , in: Proceedings of the Third IEEE International Conference on Electronics, Circuits, and Systems, University of Patras, Rhodos, Greece, IEEE, 1996 |
|
A Boolean Approach to Construct Neural Networks for Non-Boolean Problems, and , in: Proceedings of the 8th IEEE International Conference on Tools with Artificial Intelligence, IEEE, 1996 |
Zeolite cycle sequences, and , in: Zeolites, 19, 1997 |
Visual Speech and Speaker Recognition, , University of Sheffield, 1997 |
|
Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition, and , Idiap-RR-14-1997 |
|
Using Multiple Time Scales in a Multi-Stream Speech Recognition System, and , in: EUROSPEECH'97, 1997 |
|
Two neural network construction methods, and , in: Neural Processing Letters, 6(01), 1997 |
Towards Speaker Independent Continuous Speechreading, , in: Proceedings of the European Conference on Speech Communication and Technology, 1997 |
|
The 3-regular nets with 4 and 6 vertices per unit cell, , and , in: Zeitschrift fur Kristallographie, 212, 1997 |
SWISSCOM "AVIS" PROJECT (No. 392) Advanced Vocal Interfaces Services, , and , Idiap-Com-06-1997 |
|
Subband-Based Speech Recognition, and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
State-of-the-Art and Recent Progress in Hybrid HMM/ANN Speech Recognition, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
Speechreading using Probabilistic Models, and , in: Computer Vision and Image Understanding, 65(02), 1997 |
Speaker-Dependent Speech Recognition Based on Phone-Like Unit Model -- Application to Voice Dialing, and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
Speaker Verification in the Telephone Network : Research Activities in the CAVE Project, , , , , and , in: Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'97), 1997 |
|
Speaker Verification by Pairwise Coupling, , Idiap-Com-07-1997 |
Some Methods for Training Mixtures of Experts, , Idiap-Com-05-1997 |
|
Robust Speech Recognition based on Multi-Stream Features, , and , in: Proc. of the ESCA-NATO Workshop on Robust Speech Recognition for Unknown Communication Channels, 1997 |
|
Reconnaissance de caractères manuscrits à l'aide de réseaux neuromimétiques, , Idiap-RR-18-1997 |
|
Réalisation d'un Majordome vocal, , Idiap-Com-04-1997 |
|
Quantization and Pruning of Multilayer Perceptrons: Towards Compact Neural Networks, and , Idiap-Com-02-1997 |
|
Pruning of Neural Networks, and , Idiap-RR-03-1997 |
|
Person Authentication by Fusing Face and Speech Information, , , and , in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997 |
Optimization of high order perceptrons, , École Polytechnique Fédérale de Lausanne, 1997 |
|
Optimal Setting of Weights, Learning Rate, and Gain, and , Idiap-RR-04-1997 |
|
On the Decomposition of Polychotomies into Dichotomies, and , in: Proceedings of The Fourteenth International Conference on Machine Learning, Morgan Kaufmann, 1997 |
|
On the Complexity of Recognizing Iterated Differences of Polyhedra, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
|
Neural Network Adaptations to Hardware Implementations, and , Idiap-RR-17-1997 |
|
Neural Network Adaptations to Hardware Implementations, and , in: Handbook of Neural Computation, Institute of Physics Publishing and Oxford University Publishing, 1997 |
|
Mixtures of Experts Estimate A Posteriori Probabilities, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
|
Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, and , in: Eurospeech 97, 1997 |
|
Investigation of a possible process identity between DRM and Linear Filtering, , Idiap-RR-19-1997 |
|
Integrating Acoustic and Labial Information for Speaker Identification and Verification, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1997 |
Improved Pairwise Coupling Classification With Correcting Classifiers, and , in: Machine Learning: ECML-98, Springer, 1998 |
|
Hybrid HMM/ANN Systems for Training Independent Tasks: Experiments on 'Phonebook' and Related Improvements, , , , and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, and , in: International School on Neural Nets: Adaptive Processing of Temporal Information, Springer Verlag, 1997 |
|
High Order and Multilayer Perceptron Initialization, and , in: IEEE Transactions on Neural Networks, 8(02), 1997 |
Handwritten Digit Recognition with Binary Optical Perceptron, , , and , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
|
Fusion of audio and video information for multi modal person authentication, , , , and , in: Pattern Recognition Letters, 18(9), 1997 |
Fast Object Detection using MLP and FFT, , Idiap-RR-11-1997 |
|
Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems, , , and , in: EUROSPEECH'97, 1997 |
|
Ellipsometry, , in: Optical Metrology, Artech House, 1997 |
Discrete All-Positive Multilayer Perceptrons for Optical Implementation, , and , Idiap-RR-02-1997 |
|
Decision fusion in a multi-modal identity verification system using a multi-linear classifier, , and , Idiap-RR-06-1997 |
CRC Comprehensive Dictionary of Electrical Engineering, , CRC Press, 1997 |
Calendar of meetings (several issues), , in: Neurocomputing, 1997 |
An Optical Thresholding Perceptron, , , , and , in: Proceedings of the Workshop on Optics and Computer Science, Geneva, Switzerland, 1997 |
|
Adapting the 2-Class Recursive Deterministic Perceptron Neural Network to m Classes, , , and , in: Proceedings of the International Conference on Neural Networks, IEEE, 1997 |
Activity Report 1996, , , , and , Idiap-Com-01-1997 |
|
Acoustic-Labial Speaker Verification, , , and , in: Pattern Recognition Letters, 18(09), 1997 |
|
Acoustic-Labial Speaker Verification, , , and , in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997 |
A Connectionist System for Two-Dimensional Representation of Multivariate Location Data, and , in: Proceedings of the Fifth International Workshop on Artificial Intelligence for High Energy Physics, AIHENP, Lausanne, Switzerland, Elsevier Science, 1997 |
1997 NIST Evaluation: Text independent speaker detection (verification), and , Idiap-Com-03-1997 |
|
Voice-B System, , , , and , in: IEEE 4th Workshop on Interactive Voice Technology for Telecommunications Applications (IVTTA'98) September 29--30, Torino, Italy, 1998 |
Voice transformation, a tool for imposture of speaker verification, and , in: Proceedings of International Phonetic Science conference IPS98, Washington, 1998 |
Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database, and , in: Proc. 5th Int. Conf. on Spoken Language Processing, 1998 |
|
Text dependent speaker verification using binary classifiers, , and , in: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing --- ICASSP'98, IEEE, 1998 |
|
Support Vector Machine for Multiclass Classification, and , Idiap-RR-06-1998 |
|
Subband-Based Speech Recognition in Noisy Conditions: The Full Combination Approach, , and , Idiap-RR-15-1998 |
|
Speech pre-processing against intentional imposture in speaker recognition, and , in: Proceedings of ICSLP, Sydney, 1998 |
Speaker Verification: A Quick Overview, and , Idiap-RR-12-1998 |
|
Reconnaissance robuste de la parole par segmentation signal/bruit en sous-bandes, , , and , in: Neurosciences et Sciences de l'Ingenieur'98 - Munster, CNRS, 1998 |
|
Reconnaissance multi-bandes de la parole bruitée par couplage entre les niveaux primitifs et d'identification, , , and , in: Journees Etude Parole - Martigny, 1998 |
|
POLYCOST: a telephone-speech database for speaker recognition, , , and , in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998 |
Optimal Parameterization of Point Distribution Models, and , Idiap-RR-01-1998 |
|
On the Complexity of Recognizing Regions Computable by Two-Layered Perceptrons, , in: Annals of Mathematics and Artificial Intelligence, 1999 |
|
Multi-Modal Data Fusion for Person Authentication using SVM, , in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
|
Introduction à la reconnaissance de la parole et du locuteur, , Idiap-RR-13-1998 |
Interfacing of CASA and partial recognition based on a multistream technique, , , and , in: ICSLP'98, Sydney, 1998 |
|
Interfacing of CASA and Multistream recognition, , , and , in: TSD'98-Text, Speech and Dialog International Workshop, BRNO-Czech Republic, 1998 |
|
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP'98) Sydney, Australia, 1998 |
|
Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, and , in: Adaptive Processing of Sequences and Data Structures, Springer Verlag, 1998 |
Fast Multi-Scale Face Detection, , Idiap-Com-04-1998 |
|
Evaluation Protocol for the extended M2VTS Database (XM2VTSDB), and , Idiap-Com-05-1998 |
|
Evaluating the Complexity of Databases for Person Identification and Verification, , and , Idiap-RR-10-1998 |
|
Discrete All-Positive Multilayer Perceptrons for Optical Implementation, , and , in: Optical Engineering, 37(4), 1998 |
|
Decision fusion using a multi-linear classifier, , and , in: 1st International Conference on Multisource-Multisensor Data Fusion, 1998 |
Continuous Audio-Visual Speech Recognition, and , in: Proc. 5th European Conference on Computer Vision, Springer Verlag, 1998 |
|
Connectionist Techniques, and , in: Survey of the State of the Art in Human Language Technology, Cambridge University Press, 1998 |
Connectionist speech recognition, , in: Proceedings of IK'98, Interdisziplinares Kolleg, Spring School, Günne am Möhnesee, Germany, March 7--14, 1998 |
Confidence Measures in Hybrid HMM/ANN Speech Recognition, and , in: Proceedings of Workshop on Text, Speech and Dialog (TSD'98) Brno, Czech Republic, 1998 |
Combining Linear Dichotomizers to Construct Nonlinear Polychotomizers, and , Idiap-RR-05-1998 |
|
Combined 5x2cv $F$-Test for Comparing Supervised Classification Learning Algorithms, , Idiap-RR-04-1998 |
|
Classification using localized mixtures of experts, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999 |
|
Baseline System for Hybrid Speech Recognition on French (Experiments on BREF), , Idiap-Com-07-1998 |
|
Automatic Speech Recognition: an Auditory Perspective, , and , in: Speech Processing in the Auditory System, Springer Verlag, New York, 2000 |
Audio-Visual Person Verification, , , , and , in: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 1999, Fort Collins, USA, 1999 |
|
An overview of the cave project research activities in speaker verification, , , , , and , in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998 |
Acoustico-articulatory inversion of unequal-length tube models through lattice inverse filtering, , Idiap-RR-16-1998 |
|
A comparison of mixture models for density estimation, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999 |
|
A comparison of a priori threshold setting procedures for speaker verification in the CAVE project, , , , , , and , in: ICASSP 98, 1998 |
XM2VTSDB: The Extended M2VTS Database, , , , and , in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
Tracking Articulators in X-ray Movies of the Vocal Tract, , in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999 |
|
Towards introducing long-term statistics in MUSE for robust speech recognition, and , Idiap-RR-18-1999 |
|
Towards introducing long-term statistics in MUSE for robust speech recognition, and , in: Automatic Speech Recognition and Understanding (ASRU) workshop, 1999 |
|
The full combination sub-bands approach to noise robust HMM/ANN based ASR, , and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
The Elisa'99 Speaker Recognition and Tracking Systems, , , , , , , , , , , , , , , and , in: IEEE Workshop on Automatic Advanced Technologies, 1999 |
The ELISA Systems for the NIST'99 Evaluation in Speaker Detection and Tracking, , , , , , , , , , , , , , , , , , , , and , in: DSP Journal (Special Issue on the Nist Speaker Recognition Workshop), 1999 |
Synchronous Alignment, and , Idiap-RR-06-1999 |
|
Speech Reading, , in: Modern Interface Technology: The Leading Edge, Research Studies Press Ltd., 1999 |
Speaker verification experiments on the XM2VTS database, , Idiap-RR-02-1999 |
|
Segmentation of X-ray Image Sequences Showing the Vocal Tract (with tool documentation), , Idiap-RR-01-1999 |
|
Segmentation of X-ray Image Sequences Showing the Vocal Tract, , Idiap-RR-01-1999 |
|
Robust Person Verification based on Speech and Facial Images, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
Reconnaissance et Transformation de Locuteurs, , École Polytechnique Fédérale de Lausanne, 1999 |
|
Off-Line Cursive Script Recognition Based on Continuous Density HMM, and , in: Proceedings of 7th International Workshop on Frontiers in Handwriting Recognition, 2000 |
Numerical Experiments with Support Vector Machines, and , Idiap-RR-15-1999 |
|
Non-Stationary Multi-Channel (Multi-Stream) Processing Towards Robust and Adaptive ASR, , in: Proc. of the ESCA Workshop on Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
Multi-stream adaptive evidence combination for noise robust ASR, , , and , Idiap-RR-26-1999 |
|
Multi Modal Verification for Teleservices and Security Applications, , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE International Conference on Multimedia Computing and Systems, 1999 |
LPC-based inversion of the DRM articulatory model, , in: Proc. Eurospeech'99, 1999 |
|
Latent variable decomposition for posteriors or likelihood based subband ASR, , Idiap-Com-04-1999 |
|
Latent Semantic Indexing by Self-Organizing Map, and , in: ESCA ETRW workshop on Accessing Information in Spoken Audio, 1999 |
|
Iterative Posterior-Based Keyword Spotting Without Filler Models: Iterative Viterbi Decoding and One-Pass Approach, and , Idiap-RR-27-1999 |
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU'99) Workshop, 1999 |
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , Idiap-RR-16-1999 |
INtegrating SPEech acoustic and linguistic Constraints: Baseline System Development, , , and , Idiap-RR-21-1999 |
|
Indexing Audio Documents by using Latent Semantic Analysis and SOM, , in: Kohonen Maps, Elsevier, 1999 |
|
Incremental Enrollment of Speech Recognizers, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'99), Phoenix, Arizona, USA, 1999 |
Illumination-robust Pattern Matching Using Distorted Color Histograms, and , in: Pattern Recognition and Image Understanding, Infix, 1999 |
Fusion of Face and Speech Data for Person Identity Verification, , and , in: IEEE Transactions on Neural Networks, 10(05), 1999 |
|
Fast Face Detection using MLP and FFT, , and , in: Proc. Second International Conference on Audio and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
|
Extraction of Articulators in X-Ray Image Sequences, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
Experimental evaluation of text-dependent speaker verification on laboratory and field test databases in the M2VTS project, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
Evaluating the Complexity of Databases for Person Identification and Verification, , and , in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999 |
|
Environmental spatial data classification with Support Vector Machines, , , and , Idiap-RR-07-1999 |
|
Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, , , and , in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999 |
DynaBoost: Combining Boosted Hypotheses in a Dynamic Way, and , Idiap-RR-09-1999 |
|
Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR, , and , in: Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
|
Deliberate Imposture: a challenge for automatic speaker verification systems, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
Decision-Oriented Environmental Mapping with Radial Basis Function Neural Networks, , , , and , in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999 |
Data binarization by discriminant elimination, , and , in: Proceedings of the ICML-99 Workshop: From Machine Learning to Knowledge Discovery in Databases, 1999 |
|
Combining Wavelet-domain Hidden Markov Trees with Hidden Markov Models, , and , Idiap-RR-14-1999 |
|
Combinatorial Approach for Data Binarization, and , in: Principles of Data Mining and Knowledge Discovery: third european conference; proceedings / PKDD'99, Springer, 1999 |
|
CLIENT / WORLD MODEL SYNCHRONOUS ALIGNEMENT FOR SPEAKER VERIFICATION, , , and , in: 6th European Conference on Speech Communication and Technology (Eurospeech'99), 1999 |
|
Blind separation of delayed and superimposed acoustic sources : learning algorithms an experimental study, , , , and , in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999 |
An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, , , , , , , , , , , and , in: 6th European Conference on Speech Communication and Technology (Eurospeech'99), 1999 |
A new SNR-feature mapping for robust multistream speech recognition, and , in: Proc. Int. Congress on Phonetic Sciences (ICPhS), 1999 |
A measure of speech and pitch reliability from voicing, and , in: Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI), Scandinavian AI Society, 1999 |
A comparison of two strategies for ASR in additive noise : Missing Data and Spectral Subtraction, and , in: 6th European Conference on Speech Communication and Technology (Eurospeech'99), 1999 |
|
A comparison of two strategies for ASR in additive noise : Missing Data and Spectral Subtraction, and , Idiap-RR-17-1999 |
|
A comparison of noise reduction techniques for robust speech recognition, , Idiap-RR-10-1999 |
|
A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition, , and , in: Proc. European Conf. on Speech Communication and Technology (EUROSPEECH), 1999 |
A CASA front-end using the localisation cue for segregation and then cocktail-party speech recognition, , , and , in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999 |
Weighting schemes for audio-visual fusion in speech recognition, , , , and , Idiap-RR-44-2000 |
|
Video sequence matching via decision tree path following, , and , Idiap-RR-12-2000 |
|
Video Indexing and Similarity Retrieval by Largest Common Subgraph Detection using Decision Trees, , and , in: Pattern Recognition, 34(05), 2000 |
|
Various adaptive weighting schemes for large vocabulary robust audio-visual ASR, with particular reference to the cocktail party effect, , Idiap-Com-04-2000 |
Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, and , in: ICSLP, 2000 |
|
Traitement de la Parole, , , , and , Presses Polytechniques Universitaires Romandes, 2000 |
Thematic Indexing of Spoken Documents by Using Self-Organizing Maps, , Idiap-RR-05-2000 |
|
The use of Boolean concepts in general classification contexts, , Idiap-RR-46-2000 |
|
The use of Boolean concepts in general classification contexts, , Ecole Polytechnique Federale de Lausanne, 2000 |
|
Test of several external posterior weighting functions for multiband Full Combination ASR, and , in: Int. Conf. on Spoken Language Processing (ICSLP), 2000 |
Taking on the Curse of Dimensionality in Joint Distributions Using Neural Networks, and , in: IEEE Transactions on Neural Networks special issue on data mining and knowledge discovery, 2000 |
|
Support Vector Machines, Théorie et Application, , Idiap-Com-03-2000 |
|
Support Vector Machines for Large-Scale Regression Problems, and , Idiap-RR-17-2000 |
|
Spatial Data Mapping with Support Vector Regression, and , Idiap-RR-09-2000 |
|
Some applications of a priori knowledge in multi-stream HMM and HMM/ANN based ASR, , in: Phonus No. 5, Dec. 2000, ISSN 0949-1791, Proc. Workshop on Phonetics and Phonology in ASR, 2000 |
|
Robust multi-stream speech recognition based on the combined reliabilities of the speech signal and phonemes estimates, , Idiap-RR-36-2000 |
Relating LPC modeling to a factor-based articulatory model, , in: Proc. ICSLP 2000, 2000 |
|
Reconnaissance de la parole dans le bruit après renforcement fondé sur l'harmonicité, and , in: Proceedings of JEP'2000, no IDIAP RR, see RESPITE www, 2000 |
Recognition of Asymmetric Facial Action Unit Activities and Intensities, and , in: Proceedings of the International Conference on Pattern Recognition (ICPR 2000), 2000 |
|
Recent Developments in Speaker Verification at IDIAP, and , Idiap-RR-26-2000 |
|
Personal Voice Dialing over PC, and , Idiap-Com-05-2000 |
|
On the Convergence of SVMTorch, an Algorithm for Large-Scale Regression Problems, and , Idiap-RR-24-2000 |
|
Neural Networks in Automatic Speech Recognition, , , and , in: to be published in The Handbook of Brain Theory and Neural Networks, Bradford Books, The MIT Press, 2000 |
Neural Network Residual Stochastic Co-simulation for Environmental Data Analysis, , , , and , in: Neural Computation 2000, 2000 |
Multiple Timescale Feature Combination towards Robust Speech Recognition, , in: KONVENS 2000 / Sprachkommunikation, 2000 |
|
Multiple Hypotheses Video OCR, and , in: Proceedings of the 4th International Workshop on Document Analysis System, 2000 |
|
Multichannel signal separation for cocktail party speech recognition: a dynamic recurrent network, , , and , in: Int. Conf. on Spoken Language Processing (ICSLP), no IDIAP RR, see RESPITE www, 2000 |
Mixtures of latent variable models for density estimation and classification, , Idiap-RR-25-2000 |
|
Mixture Models for Unsupervised and Supervised Learning, , Idiap-RR-18-2000 |
|
Mixture Models for Unsupervised and Supervised Learning, , École Polytechnique Fédérale de Lausanne, Computer Science Department, 2000 |
|
LPC modeling with speech production constraints, , in: Proc. 5th Speech Production Seminar, 2000 |
|
Local Machine Learning Models for Spatial Data Analysis, and , in: Journal of Geographic Information and Decision Analysis, 4(01), 2000 |
|
Language modeling based on neural clustering of words, , Idiap-Com-02-2000 |
|
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , in: Proceedings of the IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 2000 |
Inverse lattice filtering of speech with adapted non-uniform delays, and , in: Proc. ICSLP 2000, 2000 |
|
Indoor Radon Risk Assessment with Geostatistics and Artificial Neural Networks, , , , , , and , in: Geostatistical congress 2000, 2000 |
Indexing spoken audio by LSA and SOMs, , in: Proceedings of the European Signal Processing Conference EUSIPCO'2000, 2000 |
Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, , and , in: Proceedings of the Sixth ACM International Conference on Knowledge Discovery and Data Mining, ACM, Boston, MA, USA, 2000 |
|
HMM2- A Novel Approach to HMM Emission Probability Estimation, , and , in: International Conference on Spoken Language Processing (ICSLP 2000), 2000 |
|
Handwritten Digits Recognition, , Idiap-RR-07-2000 |
|
From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, , and , in: ISCA ITRW ASR2000, 2000 |
|
Fast latent semantic indexing of spoken documents by using self-organizing maps, , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'2000, 2000 |
|
Etudes comparatives des robustesses au bruit de l'approche 'Full Combination' et de son approximation, and , in: Journee d'Etudes sur la Parole, Aussois, 2000 |
|
Environmental Data Mapping with Support Vector Regression and Geostatistics, , and , Idiap-RR-10-2000 |
|
Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, , , and , in: Geostatistical congress 2000, 2000 |
Cursive Character Recognition by Learning Vector Quantization, and , in: Pattern Recognition Letters, 22(6), 2001 |
|
Comparison of Unsupervised and Supervised Training of RBF Neural Networks. Case Study: Mapping of Contamination Data, and , in: Neural Computation 2000, 2000 |
Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, and , in: ICSLP, 2000 |
|
Combining multiple tracking algorithms for improved general performance, , and , in: Pattern Recognition, 34(06), 2000 |
|
Blind acoustic source separation for cocktail party speech recognition, , , and , in: ICONIP, 7th IEEE Int. Conf. on Neural Information Processing, 2000 |
Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, , , , , and , in: ICASSP2000 - IEEE International Conference on Acoustics, Speech, and Signal Processing, 2000 |
|
Automatic Speech Recognition using Pitch Information in Dynamic Bayesian Networks, , and , Idiap-RR-41-2000 |
|
Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, , , and , in: 6th International Conference on Spoken Language Processing: ICSLP 2000 (Interspeech 2000), 2000 |
|
Auto-Association by Multilayer Perceptrons and Singular Value Decomposition, , Idiap-RR-16-2000 |
|
Audio-Visual Speech Modelling for Continuous Speech Recognition, and , in: IEEE Transactions on Multimedia, 2000 |
Audio visual speech recognition, , , , , , , and , Johns Hopkins University-CLSP, 2000 |
ASYMMETRIC FILTER FOR TEXT RECOGNITION IN VIDEO, and , Idiap-RR-37-2000 |
|
Approches génératives pour le traitement de séquences d'images: application à la reconnaissance dynamique des gestes de la main, , Idiap-RR-45-2000 |
|
An Introduction to Bayesian Network Theory and Usage, , Idiap-RR-03-2000 |
|
An EM Algorithm for HMMs with Emission Distributions Represented by HMMs, , and , Idiap-RR-11-2000 |
|
Advanced Spatial Data Analysis and Modelling with Support Vector Machines, , , and , Idiap-RR-31-2000 |
|
Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, and , in: Journee d'Etudes sur la Parole, Aussois, 2000 |
|
Activity Report 1999, , Idiap-Com-01-2000 |
|
A survey on Off-Line Cursive Word Recognition, , in: Pattern Recognition, 35(07), 2002 |
|
A Survey of Text Detection and Recognition in Images and Videos, and , Idiap-RR-38-2000 |
|
A new normalization technique for cursive handwritten words, and , in: Pattern Recognition Letters, 22(09), 2001 |
|
A neural network for classification with incomplete data: application to robust ASR, , , , and , in: Proc. ICSLP, 2000 |
|
A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, , and , in: ICSLP, 2000 |
|
A front-end using the harmonicity cue for speech enhancement in loud noise, , and , in: Int. Conf. on Spoken Language Processing (ICSLP), 2000 |
Video OCR for Sport Video Annotation and Retrieval, , and , in: Proceedings of the 8th IEEE International Conference on Mechatronics and Machine Vision in Practice, 2001 |
|
Using posterior probabilities for speech/music discrimination, , Idiap-RR-08-2001 |
|
User Customized HMM/ANN Based Speaker Verification, and , Idiap-RR-32-2001 |
|
Text Identification in Complex Background using SVM, , and , in: Proceedings of the Int. Conf. on computer vision and pattern recognition, 2001 |
Text Enhancement with Asymmetric Filter for Video OCR, , and , in: Proceedings of the 11th International Conference on Image Analysis and Processing, 2001 |
|
SVMTorch: Support Vector Machines for Large-Scale Regression Problems, and , in: Journal of Machine Learning Research, 1, 2001 |
|
Support Vector Machines for Classification and Mapping of Reservoir Data, , , , , and , Idiap-RR-04-2001 |
|
Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework, , and , in: Speech Communication, 40, 2003 |
|
Speech Recognition Using Advanced HMM2 Features, , and , in: Automatic Speech Recognition and Understanding Workshop, 2001 |
|
Speech Recognition Engine for Interactive Voice Response application on Windows, , Idiap-Com-10-2001 |
|
Speaker Verification Based On User-Customized Password, , and , Idiap-RR-13-2001 |
|
Signal modeling with Non Uniform Topology lattice filters, and , in: Proc. ICASSP 2001, 2001 |
|
Robust speech recognition based on multi-stream processing, , École Polytechnique Fédérale de Lausanne, 2001 |
|
Robust Speech Recognition and Feature Extraction Using HMM2, , , and , in: Computer Speech & Language, 17(2-3), 2003 |
Rebuilding Speech Recognition on Windows, , Idiap-Com-09-2001 |
|
Pronunciation models and their evaluation using confidence measures, and , Idiap-RR-29-2001 |
|
PhD Thesis: Speech Analysis with Production Constraints, , École Polytechnique Fédérale de Lausanne, 2001 |
|
Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, and , in: Proceedings of International Conference on Pattern Recognition, 2002 |
|
New Approaches Towards Robust and Adaptive Speech Recognition, , and , in: Advances in Neural Information Processing Systems 13, MIT Press, 2001 |
|
Multi-stream adaptive evidence combination for noise robust ASR, , , and , in: Speech Communication, 2001 |
Modeling Auxiliary Information in Bayesian Network Based ASR, , and , in: 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 2001 |
|
Microphone Array Post-filter for Diffuse Noise Field, and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2002 |
|
Microphone Array Post-filter based on Noise Field Coherence, and , in: IEEE Transactions on Speech and Audio Processing, 11(6), 2003 |
|
MAP Combination of Multi-Stream HMM or HMM/ANN Experts, , and , in: Proc. Eurospeech, 2001 |
|
Learning the Decision Function for Speaker Verification, and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2001 |
|
Intrinsic dimension estimation of data: an approach based on Grassberger-Procaccia's algorithm, and , in: Neural Processing Letters, 14(01), 2001 |
|
Improving Face Verification using Skin Color Information, and , in: Proceedings of the 16th International Conference on Pattern Recognition, IEEE Computer Society Press, 2002 |
|
IDIAP HMM/HMM2 System: Theoretical Basis and Software Specifications, , , and , Idiap-RR-27-2001 |
|
HMM2- Extraction of Formant Features and their Use for Robust ASR, , and , in: European Conference on Speech Communication and Technology (Eurospeech 2001), 2001 |
|
From missing data to maybe useful data: soft data modelling for noise robust ASR, , and , in: Proc. WISP, 2001 |
|
Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, , in: International IEEE Workshop on Neural Networks for Signal Processing (NNSP 02), 2002 |
|
Evaluation of SVM Binary Classification with Nonparametric Stochastic Simulations, , Idiap-RR-07-2001 |
|
Evaluation of Biometric Technology on XM2VTS, , and , Idiap-RR-21-2001 |
|
Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, and , in: EUROSPEECH, 2001 |
|
EPFL lab session 2/2: Introduction to Hidden Markov Models, , Idiap-Com-07-2001 |
|
EPFL lab session 1/2: Introduction to Gaussian statistics and pattern recognition, , Idiap-Com-06-2001 |
|
EEG pattern recognition through multi-stream evidence combination, , and , in: Proc. World Congress on Neuroinformatics, 2001 |
|
Development of a DTW based Speech Recognition System over the telephone line, , and , Idiap-Com-05-2001 |
|
Developement d'un systeme de demande interactif via le telephone (INFOVOX), , Idiap-Com-08-2001 |
|
Detection of Narrative Structure for Annotation of News Broadcasts, , and , Idiap-RR-03-2001 |
|
Data utility modelling for mismatch reduction, , in: Proc. CRAC (workshop on Consistent & Reliable Acoustic Cues for sound analysis), 2001 |
|
Confidence Evaluation for Risk Prediction, , and , in: 2001 Annual Conference of the IAMG, 2001 |
|
Comparison of Client Model Adaptation Schemes, and , Idiap-RR-25-2001 |
|
Combining Neural Gas and Learning Vector Quantization for Cursive Character Recognition, and , in: Neurocomputing, 51, 2003 |
|
Artifacts of the colour coherence vector and an alternative similarity measure, and , Idiap-RR-02-2001 |
|
Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model, and , in: Speech Communication, 2002 |
|
Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, , and , in: ICASSP, 2001 |
|
Activity Report 2000, , Idiap-Com-01-2001 |
|
A Pragmatic View of the Application of HMM2 for ASR, , and , Idiap-RR-23-2001 |
|
A Parallel Mixture of SVMs for Very Large Scale Problems, , and , in: Advances in Neural Information Processing Systems, NIPS 14, MIT Press, 2002 |
|
A Comparative Study of Adaptation Methods for Speaker Verification, and , in: International Conference on Spoken Language Processing ICSLP, 2002 |
|
Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, and , in: Pattern Recognition Letters, 23(8), 2002 |
|
Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, and , in: Proceedings of 8th International Conference on Frontiers on Handwriting Recognition, 2002 |
|
What is Better: GMM of Two Gaussians or Two Clusters With One Gaussian?, , Idiap-RR-56-2002 |
|
Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, and , in: Int. Conf. Image Processing 2002, 2002 |
|
User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, and , in: International Conference on Spoken Language Processing (ICSLP 2002), 2002 |
|
User-Customized Password HMM Based Speaker Verification, and , in: Proceedings of the COST275 Workshop on the Advent of Biometrics on the Internet, 2002 |
|
Unknown-Multiple Speaker clustering using HMM, , , and , in: ICSLP, 2002 |
|
Transforming the feature vectors to improve HMM based cursive word recognition systems, and , Idiap-RR-32-2002 |
|
Towards Robust and Adaptive Speech Recognition Models, , and , Idiap-RR-47-2002 |
|
Towards Robust and Adaptive Speech Recognition Models, , and , in: Mathematical Foundations of Speech Processing and Recognition, Springer-Verlag, 2002 |
|
Torch: a modular machine learning software library, , and , Idiap-RR-46-2002 |
|
TODE: A Decoder for Continuous Speech Recognition, , Idiap-Com-09-2002 |
|
The VidTIMIT Database, , Idiap-Com-06-2002 |
|
The MNIST Database of Handwritten upper-case letters, and , Idiap-Com-04-2002 |
|
The IDIAP Smart Meeting Room, , Idiap-Com-07-2002 |
|
The BANCA Database and Experimental Protocol for Speaker Verification, , , and , Idiap-RR-13-2002 |
|
The analysis of kernel ridge regression learning algorithm, , Idiap-RR-54-2002 |
|
Text Segmentation and Recognition in Complex Background Based on Markov Random Field, , and , in: Int. Conf. Pattern Recognition 2002, 2002 |
|
Text Detection and Recognition in Images and Videos, , and , Idiap-RR-61-2002 |
|
Structurally noise resistant classifier for multi-modal person verification, and , in: Pattern Recognition Letters, 24(16), 2003 |
Speech Processing & Text-Independent Automatic Person Verification, , Idiap-Com-08-2002 |
|
Speaker Normalization using HMM2, , and , in: Proceedings of the 2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP-02), 2002 |
|
SOM-Based Clustering for On-Line Fraud Behavior Classification: a Case Study, and , Idiap-RR-30-2002 |
|
Self-Organizing-Maps With BIC For Speaker Clustering, , Idiap-RR-60-2002 |
|
Scaling Large Learning Problems with Hard Parallel Mixtures, , and , in: International Workshop on Pattern Recognition with Support Vector Machines, SVM'2002, 2002 |
|
Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, , and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP), 2002 |
|
Robust Speaker Change Detection, , and , in: IEEE Signal Processing Letters (to appear), 2003 |
|
Robust HMM-Based Speech/Music Segmentation, , and , in: ICASSP, 2002 |
|
Robust Face Verification using Skin Color and Neural Networks, , Idiap-RR-49-2002 |
|
Robust Face Analysis using Convolutional Neural Networks, , in: Proceedings of the International Conference on Pattern Recognition (ICPR 02), 2002 |
|
Robot Navigation, , in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002 |
|
Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR, and , Idiap-RR-57-2002 |
|
Proceedings of the Twelfth IEEE Workshop on Neural Networks for Signal Processing (NNSP), IEEE Press, 2002 |
Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, , and , in: IEEE International Conference on Image Processing, 2002 |
|
Phase AutoCorrelation (PAC) derived Robust Speech Features, , and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
Online Policy Adaptation for Ensemble Algorithms, and , Idiap-RR-28-2002 |
|
Object Localization in Metric Spaces for Video Linking, and , in: IEEE Workshop on Motion and Video Computing, 2002 |
|
Noise Resistant Audio-Visual Verification via Structural Constraints, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
Noise PDF transformation in secondary feature processing, , Idiap-RR-29-2002 |
|
New Entropy Based Combination Rules in HMM/ANN Multi-stream ASR, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003 |
|
Multiscale Facial Expression Recognition using Convolutional Neural Networks, , in: Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 02), 2002 |
|
Modelling auxiliary information (pitch frequency) in hybrid HMM/ANN based ASR systems, , and , Idiap-RR-62-2002 |
|
Modeling Human Interaction in Meetings, , , , , , , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2003 |
|
Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, , and , in: International Conference on Pattern Recognition (ICPR 2002), 2002 |
|
Low cost duration modelling for noise robust speech recognition, , and , in: Proc. ICSLP, 2002 |
|
Linking Objects in Videos by Importance Sampling, and , in: IEEE International Conference on Multimedia and Expo, 2002 |
|
Increasing Speech Recognition Noise Robustness with HMM2, , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 02), 2002 |
|
Improved Unknown-Multiple Speaker clustering using HMM, , and , Idiap-RR-23-2002 |
|
Hybrid generative-discriminative models for speech and speaker recognition, and , Idiap-RR-06-2002 |
|
Hidden Markov Models and other Finite State Automata for Sequence Processing, and , in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002 |
|
Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, , in: International IEEE Conference on Multimodal Interfaces (ICMI 02), 2002 |
|
Handwriting Recognition Demo, , and , Idiap-Com-02-2002 |
|
Gestures for Multi-Modal Interfaces: A Review, , Idiap-RR-34-2002 |
|
Face Verification using MLP and SVM, and , in: XI Journees NeuroSciences et Sciences pour l'Ingenieur (NSI 2002), 2002 |
|
Extended BIC Criterion for Model Selection, and , Idiap-RR-42-2002 |
|
Evolution of the Mental States Operating a Brain-Computer Interface, , , and , in: Proceedings of the International Federation for Medical and Biological Engineering, 2002 |
|
Evaluation Protocols and Comparative Results for the Triesch Hand Posture Database, , Idiap-RR-50-2002 |
|
Evaluation of Formant-Like Features for ASR, , , , , and , in: International Conference on Spoken Language Processing (ICSLP 2002), 2002 |
|
Estimation of Conditional Distributions using Gaussian Mixture Models, , and , Idiap-RR-03-2002 |
|
Estimating the Intrinsic Dimension of Data with a Fractal-Based Method, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(10), 2002 |
|
Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, , , and , in: 2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP 2002), 2002 |
|
Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering, and , in: to be published in IEEE Signal Processing Letters, 2003 |
|
Confidence Measures for Multimodal Identity Verification, , , and , in: Information Fusion, 3(04), 2002 |
|
Conditional Gaussian Mixture Models for Environmental Risk Mapping, , and , in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002 |
|
Comparison of Support Vector Machine and Neural Network for Text Texture Verification, and , Idiap-RR-19-2002 |
|
Brain-Computer Interfaces, , in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002 |
|
Bagging Using the VMSE Cost Function, , Idiap-RR-27-2002 |
|
Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, , and , in: Seventh International Conference on Spoken Language Processing (ICSLP 2002), 2002 |
|
An information theoretic measure of sequence recognition performance, , Idiap-Com-03-2002 |
|
An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, , in: Advances in Neural Information Processing Systems, NIPS 15, MIT Press, 2003 |
|
Algorithms for Video Structuring, , and , Idiap-Com-05-2002 |
|
Activity Report 2001, , Idiap-Com-01-2002 |
|
A State-of-the-art Neural Network for Robust Face Verification, , and , in: Proceedings of the COST275 Workshop on The Advent of Biometrics on the Internet, 2002 |
|
A Parallel Mixture of SVMs for Very Large Scale Problems, , and , in: Neural Computation, 14(05), 2002 |
|
A New Method of Contrast Normalization for Verification of Extracted Video Text Having Complex Backgrounds, and , Idiap-RR-16-2002 |
|
A Multi-sample Multi-source Model for Biometric Authentication, , and , in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002 |
|
Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
Video Shot Clustering using Spectral Methods, , and , in: 3rd Workshop on Content-Based Multimedia Indexing (CBMI), 2003 |
|
Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, and , in: Pattern Recognition and Image Analysis: First Iberian Conference, IbPRIA 2003, Springer-Verlag LNCS, 2003 |
|
Variance Reduction Techniques in Biometric Authentication, and , Idiap-RR-17-2003 |
|
Using pitch frequency information in speech recognition, , and , in: Proceedings of Eurospeech, 2003 |
|
TRAP-TANDEM: Data-driven extraction of temporal features from speech, , in: large part published in Proceedings of ASRU-2003, 2003 |
|
Towards Computer Understanding of Human Interactions, , , and , Idiap-RR-45-2003 |
|
The Expected Performance Curve, , and , in: International Conference on Machine Learning, ICML, Workshop on ROC Analysis in Machine Learning, 2005 |
|
The BANCA Database and Evaluation Protocol, , , , , , , , , , , and , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
Textual Data Representation, and , Idiap-RR-74-2003 |
|
Text detection and recognition in images and video sequences, , École Polytechnique Fédérale de Lausanne, 2003 |
|
Studying Phase Synchrony for Classification of Mental Tasks in Brain Machine Interfaces, , , and , in: Proceedings of the Conference of the International Society for Brain Electromagnetic Topography, 2003 |
Speech & Face Based Biometric Authentication at IDIAP, , , , , , , and , Idiap-RR-13-2003 |
|
Speech & Face Based Biometric Authentication at IDIAP, , , , , , , and , in: Proceedings of the 2003 IEEE International Conference on Multimedia & Expo (ICME-03), 2003 |
Speech Recognition with Auxiliary Information, , École Polytechnique Fédérale de Lausanne, Computer Science Department, 2003 |
|
Speech Recognition with Auxiliary Information, , Idiap-RR-28-2003 |
|
Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, , and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
Spectral Structuring of Home Videos, , and , in: International Conference on Image and Video Retrieval (CIVR'03), Springer Verlag, 2003 |
|
Spectral Entropy Based Feature for Robust ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2004 |
|
Some Emerging Concepts in Speech Recognition, and , Idiap-RR-82-2003 |
|
Small Microphone Array: Algorithms and Hardware, and , Idiap-Com-07-2003 |
|
Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research, and , Idiap-RR-81-2003 |
|
Sequential Monte Carlo Video Text Segmentation, and , in: ICIP, 2003 |
|
Segmenting Multiple Concurrent Speakers Using Microphone Arrays, , and , in: Proceedings of Eurospeech 2003, 2003 |
|
Scaling Large Learning Problems with Hard Parallel Mixtures, , and , in: International Journal on Pattern Recognition and Artificial Intelligence (IJPRAI), 17(3), 2003 |
|
Scalability Analysis of Audio-Visual Person Identity Verification, , , and , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
Robust Features for Frontal Face Authentication in Difficult Image Conditions, and , Idiap-RR-05-2003 |
|
Robust Features for Frontal Face Authentication in Difficult Image Conditions, and , in: Proceedings of 4th International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA-03), 2003 |
Reconnaissance de gestes 3D bi-manuels, , , and , Idiap-RR-79-2003 |
|
Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, , and , in: Proc. of the sixth International Conference on Automatic Face and Gesture Recognition, 2004 |
|
Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, , and , in: the International Conference on Pattern Recognition (ICPR), 2004 |
|
Phoneme-Grapheme Based Speech Recognition System, , , and , in: Proceedings of IEEE ASRU, 2003 |
|
Online Policy Adaptation for Ensemble Classifiers, and , in: 12th European Symposium on Artificial Neural Networks, ESANN 04, 2004 |
Online Policy Adaptation for Ensemble Classifiers, and , in: Neurocomputing, 2005 |
|
On Use of Task Independent Training Data in Tandem Feature Extraction, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
On the Need for On-Line Learning in Brain-Computer Interfaces, , Idiap-RR-30-2003 |
|
On the Combination of Speech and Speaker Recognition, and , in: European Conference On Speech, Communication and Technology (EUROSPEECH'03), 2003 |
|
On Performance Evaluation of Face Detection and Localization Algorithms, , , and , in: 17th International Conference on Pattern Recognition (ICPR), 2004 |
|
On Multi-scale Fourier Transform Analysis of Speech Signals, and , Idiap-RR-33-2003 |
|
On Image Auto-Annotation with Latent Space Models, and , in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2003 |
|
On Factorizing Spectral Dynamics for Robust Speech Recognition, , , and , in: Eurospeech, 2003 |
|
On automatic annotation of meeting databases, , , , and , in: IEEE International Conference on Image Processing (ICIP), 2003 |
|
Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models, , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(6), 2004 |
|
Offline Recognition of Large Vocabulary Cursive Handwritten Text, , and , in: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), 2003 |
|
Offline Cursive Handwriting: From Word To Text Recognition, , Idiap-RR-24-2003 |
|
Non-Linear Variance Reduction Techniques in Biometric Authentication, and , in: Workshop on Multimodal User Authentication, 2003 |
|
Nonlinear Spectral Transformations for Robust Speech Recognition, , and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop 2003, 2003 |
|
Nonlinear Analysis of Cognitive and Motor-related EEG Signals, and , Idiap-RR-14-2003 |
Non-Invasive Brain-Actuated Control of a Mobile Robot, , , and , in: Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003 |
|
Noisy Text Categorization, , in: Proceedings of International Conference on Pattern Recognition (ICPR), 2004 |
|
Noise Robust Discriminative Models, and , Idiap-RR-40-2003 |
|
Multimodal Identity Verification at IDIAP, , Idiap-Com-04-2003 |
|
Multimodal Authentication using Asynchronous HMMs, , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
Multi-Modal Audio-Visual Event Recognition for Football Analysis, , and , in: Proc. IEEE Workshop on Neural Networks for Signal Processing (NNSP), 2003 |
|
Monte Carlo Video Text Segmentation, and , Idiap-RR-07-2003 |
|
Modélisation implicite du mouvement en suivi par filtrage de Monte Carlo séquentiel, and , in: GRETSI conference, Signal and Image Processing, 2003 |
|
Microphone Array Speech Recognition: Experiments on Overlapping Speech in Meetings, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, , , and , in: IEEE ASRU, 2003 |
|
Meeting Data Collection Specifications, , and , Idiap-Com-10-2003 |
|
Location Based Speaker Segmentation, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, , and , in: Proceedings of ICASSP, 2004 |
|
Internship Report : Summer 2003, , Idiap-Com-09-2003 |
|
Information Retrieval on Noisy Text, , and , Idiap-Com-08-2003 |
|
In Search of a Good BET, and , Idiap-Com-11-2003 |
|
Improving Face Verification using Symmetric Transformation, , Idiap-RR-68-2003 |
|
Improving Face Authentication Using Virtual Samples, , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003 |
|
IDIAP Demonstration Management, and , Idiap-Com-06-2003 |
|
Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
HMM Mixtures (HMM2) for Robust Speech Recognition, , Ecole Polytechnique Federale de Lausanne, 2003 |
|
HMM inference towards flexible speech recognition, , Idiap-Com-03-2003 |
From Samples to Objects in Kernel Methods, and , Idiap-RR-29-2003 |
|
Finding Structure in Home Videos by Probabilistic Hierarchical Clustering, , and , in: IEEE Transactions on Circuits and Systems for Video Technology, 13(6), 2003 |
|
Fast features for face authentication under illumination direction changes, and , in: Pattern Recognition Letters, 24(14), 2003 |
[DOI] |
Face Verification using LDA and MLP on the BANCA database, , Idiap-RR-66-2003 |
|
Face Verification Using Adapted Generative Models, , and , in: The 6th International Conference on Automatic Face and Gesture Recognition, FG2004, IEEE, 2004 |
|
Face Processing & Frontal Face Verification, , Idiap-RR-20-2003 |
|
Evaluation of formant-like features for automatic speech recognition, , , , , and , Idiap-RR-08-2003 |
|
Enhanced Performance of Multimodal Biometric Systems by Confidence Estimation, , Idiap-Com-05-2003 |
|
EEG-based BCI Systems and IDIAP EEG Database, and , Idiap-RR-64-2003 |
|
Direct Non-Invasive Brain Computer Interfaces, , , , and , in: Proceedings of the 9th International Conference on Functional Mapping of the Human Brain, 2003 |
Confusion Matrix Based Entropy Correction in Multi-stream Combination, and , in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2003 |
|
Conditional Gaussian Mixtures, , Idiap-RR-11-2003 |
|
Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, , and , Idiap-RR-10-2003 |
|
Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, , and , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, 2003 |
|
Comparison of different feature classifiers for brain computer interfaces, , , , , , , , and , in: Proceedings of the 1st International IEEE EMBS Conference on Neural Engineering, 2003 |
Comparison and Combination of Features in a Hybrid HMM/MLP and a HMM/GMM Speech Recognition System, , , , and , in: to be published in IEEE Transactions on Speech and Audio Processing(48), 2003 |
|
Clustering And Segmenting Speakers And Their Locations In Meetings, , and , in: ICASSP, 2004 |
|
Client Dependent GMM-SVM Models for Speaker Verification, and , in: International Conference on Artificial Neural Networks, ICANN/ICONIP 2003, Springer Verlag, 2003 |
|
Boosting Pixel-based Classifiers for Face Verification, and , in: Biometric Authentication Workshop of the 8th European Conference on Computer Vision, BIOAW2004, Springer-Verlag, 2004 |
|
Boosting HMMs with an application to speech recognition, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004 |
|
Automatic Speech Recognition using Dynamic Bayesian Networks with the Energy as an Auxiliary Variable, and , Idiap-RR-18-2003 |
|
Automatic Facial Expression Analysis: A Survey, and , in: Pattern Recognition, 36(1), 2003 |
|
Augmenting Frontal Face Models for Non-Frontal Verification, and , in: Proceedings of the 2003 Workshop on Multimodal User Authentication (MMUA'03), 2003 |
Audio-Visual Speaker Tracking with Importance Particle Filters, , , , and , in: IEEE International Conference on Image Processing (ICIP), 2003 |
|
Audio-Video Person Clustering in Video Databases, and , Idiap-RR-46-2003 |
|
Asynchronous BCI and Local Neural Classifiers: An Overview of the Adaptive Brain Interface Project, and , in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interface Technology, 11(2), 2003 |
|
An Investigation of Spectral Subband Centroids for Speaker Authentication, , and , in: Int'l Conf. on Biometric Authentication, 2004 |
|
An Implicit Motion Likelihood for Tracking with Particle Filters, , and , in: British Machine Vision Conference (BMVC), Springer Verlag, 2003 |
|
An Alternative To Silence Removal For Text-Independent Speaker Verification, and , Idiap-RR-51-2003 |
|
Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language Model, and , Idiap-RR-35-2003 |
|
Adaptive Brain Interfaces for Communication and Control, , in: Proceedings of the 10th International Conference on Human-Computer Interaction, 2003 |
|
Adaptive Brain Interfaces, , in: Communications of the ACM, 46(3), 2003 |
Activity Report 2002, , Idiap-Com-01-2003 |
|
A Symmetric Transformation for LDA-based Face Verification, , in: Proceedings of the 6th International Conference on Automatic Face and Gesture Recognition, IEEE Computer Society Press, 2004 |
|
A Statistical Significance Test for Person Authentication, and , in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004 |
|
A Robust Speaker Clustering Algorithm, and , in: IEEE Automatic Speech Recognition Understanding Workshop, 2003 |
|
A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, , , and , in: IEEE Int. Conf. on Computer Vision Workshop on Multimedia Technologies for E-Learning and Collaboration (ICCV-WOMTEC), 2003 |
|
A Hierarchical Keyframe User Interface for Browsing Video over the Internet, , , and , in: Proceedings of the 9th International Conference on Human-Computer Interaction (INTERACT-2003), IOS Press, 2003 |
|
Variational Information Maximization in Gaussian Channels, and , Idiap-RR-88-2004 |
|
Variational Information Maximization for Population Coding, , Idiap-RR-85-2004 |
|
Using RASTA in task independent TANDEM feature extraction, , and , in: Proceedings of ICSLP, 2004 |
|
User Authentication via Adapted Statistical Models of Face Images, , and , in: IEEE Transactions on Signal Processing, 2005 |
|
Unsupervised Location-Based Segmentation of Multi-Party Speech, , and , in: Proceedings of the 2004 ICASSP-NIST Meeting Recognition Workshop, 2004 |
|
Une application de reconnaissance du locuteur : le User-Customized Password Speaker Verification, , Idiap-Com-04-2004 |
|
Tracking People in Meetings with Particles, , , , and , in: Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), invited paper, 2005 |
|
Towards using hierarchical posteriors for flexible automatic speech recognition systems, , , , , and , Idiap-RR-58-2004 |
|
Theme Topic Mixture Model: A Graphical Model for Document Representation, and , in: Pascal Workshop on Text Mining and Understanding, 2004 |
|
The IDIAP Multimedia File Server, and , Idiap-Com-05-2004 |
|
The Expected Performance Curve: a New Assessment Measure for Person Authentication, and , in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004 |
|
The Auxiliary Variable Trick for deriving Kalman Smoothers, , Idiap-RR-87-2004 |
|
Text Detection and Recognition in Images and Videos, , and , in: Pattern Recognition, 37(3), 2004 |
Tangent Vector Kernels for Invariant Image Classification with SVMs, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, and , in: Proceedings of ICSLP, 2004 |
|
Stochastic techniques in deriving perceptual knowledge, , Idiap-RR-84-2004 |
|
Statistical Transformations of Frontal Models for Non-Frontal Face Verification, and , in: Proceedings of the IEEE International Conference on Image Processing (ICIP), 2004 |
|
Speech recognition with auxiliary information, , and , in: IEEE Trans. on Speech and Audio Processing, 4, 2004 |
Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, , , and , in: Proceedings of the INTERSPEECH-ICSLP-04, 2004 |
|
Significance Tests for Bizarre Measures in 2-Class Classification Tasks, , and , Idiap-RR-34-2004 |
|
Sequence Classification with Input-Output Hidden Markov Models, and , Idiap-RR-13-2004 |
Sector-Based Detection for Hands-Free Speech Enhancement in Cars, , and , in: EURASIP Journal on Applied Signal Processing, Special Issue on Advances in Multimicrophone Speech Processing, 2006 |
|
Robust Playfield Segmentation using MAP Adaptation, and , in: Proc. 17th International Conference on Pattern Recognition (ICPR 2004), 2004 |
|
Robust Audio Segmentation, , and , École Polytechnique Fédérale de Lausanne, 2004 |
|
Robust Audio Segmentation, , and , Idiap-RR-35-2004 |
|
Restoring Locomotion with a Thought Controlled Mobile Robot, , in: Proceedings of the 4th Forum of European Neuroscience, 2004 |
Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, , in: Proceedings of SST 2004 (10th Australian International Conference on Speech Science & Technology), Sydney, Australia, 2004 |
|
Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, and , in: International Conference on Spoken Language Processing (ICSLP 2004), 2004 |
|
PLSA-based Image Auto-Annotation: Constraining the Latent Space, and , in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2004 |
|
Phoneme vs Grapheme Based Automatic Speech Recognition, , , and , Idiap-RR-48-2004 |
|
Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, , , and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
Phase AutoCorrelation (PAC) Features for Noise Robust ASR, , , and , Idiap-RR-40-2004 |
Order Matters: A Distributed Sampling Method for Multi-Object Tracking, , in: British Machine Vision Conference (BMVC), 2004 |
|
On the Use of Information Retrieval Measures for Speech Recognition Evaluation, , , , , , and , Idiap-RR-73-2004 |
|
On the Need for On-Line Learning in Brain-Computer Interfaces, , in: Proceedings of the International Joint Conference on Neural Networks, 2004 |
|
On the Adequacy of Baseform Pronunciations and Pronunciation Variants, and , Idiap-RR-27-2004 |
|
On Local Features for Face Verification, and , Idiap-RR-36-2004 |
|
Nonlinear Feature Transformations for Noise Robust Speech Recognition, , Ecole Polytechnique Fédérale de Lausanne, 2004 |
|
Non-Invasive Brain-Actuated Control of a Mobile Robot by Human EEG, , , and , in: IEEE Trans. on Biomedical Engineering, Special Issue on Brain-Machine Interfaces, 51(6), 2004 |
|
Noisy Text Clustering, and , Idiap-RR-31-2004 |
|
Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, and , in: The Speaker and Recognition Workshop, 2004 |
|
New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, , , and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP), 2004 |
|
Multi-resolution Spectral Entropy Based Feature for Robust ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2005 |
|
Multimodal Speech Processing Using Asynchronous Hidden Markov Models, , in: Information Fusion, 5(2), 2004 |
|
Multimodal Group Action Clustering in Meetings, , , , and , in: ACM 2nd International Workshop on Video Surveillance & Sensor Networks in conjunction with 12th ACM International Conference on Multimedia, 2004 |
|
Modelling Auxiliary Features in Tandem Systems, , , and , in: Proceedings of ICSLP, 2004 |
|
Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, , , , and , in: the Second IEEE Workshop on Event Mining: Detection and Recognition of Events in Video, In Association with CVPR, 2004 |
|
Modeling Individual and Group Actions in Meetings With Layered HMMs, , , , and , in: IEEE Transactions on Multimedia, June, 2006, 2004 |
|
Making Retrieval Faster Through Document Clustering, and , Idiap-RR-02-2004 |
|
LP-TRAP: Linear predictive temporal patterns, , and , 2004 |
|
Links Between Perceptrons, MLPs and SVMs, and , in: International Conference on Machine Learning, ICML, 2004 |
|
Large Scale Machine Learning, , Université de Paris VI, 2004 |
|
Invariances in Kernel Methods: From Samples to Objects, and , Idiap-RR-56-2004 |
|
Improving Single Modal and Multimodal Biometric Authentication Using F-ratio Client-Dependent Normalisation, and , Idiap-RR-52-2004 |
|
Identity verification using speech and face information, and , in: Digital Signal Processing, 14(5), 2004 |
[DOI] |
HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition, , and , Idiap-RR-50-2004 |
|
HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, and , in: European Symposium on Artificial Neural Networks ESANN, 2004 |
|
HMM and IOHMM for the Recognition of Mono- and Bi-Manual 3D Hand Gestures, , and , Idiap-RR-39-2004 |
|
Fusion of Structural and Color Local Descriptors for Enhanced Object Recognition, and , in: Proceedings IEEE WIAMIS 2004 (5th International Workshop on Image Analysis for Multimedia Interactive Services), 21-23 April, 2004, Lisboa, Portugal, 2004 |
|
Face Authentication using Client-specific Matching Pursuit, , , and , Idiap-RR-78-2004 |
|
Evidences of Equal Error Rate Reduction in Biometric Authentication Fusion, and , Idiap-RR-43-2004 |
|
Evaluation of Formant-Like Features for Automatic Speech Recognition, , , , , and , in: Journal of the Acoustical Society of America (JASA), 116(3), 2004 |
|
Estimating the Quality of Face Localization for Face Verification, , , and , in: IEEE International Conference on Image Processing, ICIP, 2004 |
|
Estimates of Parameter Distributions for Optimal Action Selection, and , Idiap-RR-72-2004 |
|
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , in: Proceedings of the INTERSPEECH-ICSLP-04, 2004 |
|
Embedding motion in model-based stochastic tracking, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
Embedding Motion in Model-Based Stochastic Tracking, , and , in: IEEE Transactions on Image Processing, 15(11), 2006 |
Effect of Segmentation Method on Video Retrieval Performance, and , in: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo (ICME-05), 2005 |
|
Effect of Recognition Errors on Text Clustering, and , Idiap-RR-82-2004 |
|
Effect of Recognition Errors on Information Retrieval Performance, , in: Proceedings of International Workshop on Frontiers in Handwriting Recognition, 2004 |
|
Detecting Group Interest-level in Meetings, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005 |
|
Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, and , in: Pattern Recognition Journal, 2005 |
|
Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
Browsing Recorded Meetings with Ferret, , and , Idiap-RR-32-2004 |
|
Brain-Actuated Interaction, , , and , in: Artificial Intelligence, 159(1-2), 2004 |
|
Boosting word error rates, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2005 |
Automatic Analysis of Multimodal Group Actions in Meetings, , , , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence (to appear), 2004 |
|
Assessing Scene Structuring in Consumer Videos, , , , and , in: Int. Conf. on Image and Video Retrieval (CIVR), 2004 |
|
Are two Classifiers performing equally? A treatment using Bayesian Hypothesis Testing, , Idiap-RR-57-2004 |
|
An Online Audio Indexing System, , and , 2004 |
|
An Auxiliary Variational Method, and , Idiap-RR-86-2004 |
|
Activity Report 2003, , Idiap-Com-01-2004 |
|
A video package for Torch, and , Idiap-Com-02-2004 |
|
A Study of the Effects of Score Normalisation Prior to Fusion in Biometric Authentication Tasks, and , Idiap-RR-69-2004 |
|
A Stable Switching Kalman Smoother, , Idiap-RR-89-2004 |
|
A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, and , in: Proceedings of the 2004 SAPA Workshop, 2004 |
|
A probabilistic framework for joint head tracking and pose estimation, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
A New Speech Recognition Baseline System for Numbers 95 Version 1.3 Based on Torch, and , Idiap-RR-16-2004 |
|
A Meeting Browser Evaluation Test, , , and , Idiap-RR-53-2004 |
|
A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Contrast Independent Features and Machine Learning Methods, , and , in: Signal Processing: Image Communication, 19(3), 2004 |
|
A Gentle Hessian for Efficient Gradient Descent, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004 |
|
A Generative Model for Music Transcription, , and , in: IEEE Transactions on Speech and Audio Processing, 2004 |
|
You Are Wrong!---Automatic Detection of Interaction Errors from Brain Waves, and , in: Proceedings of the 19th International Joint Conference on Artificial Intelligence, 2005 |
|
Writer Identification for Smart Meeting Room Systems, , , , , and , in: Seventh IAPR Workshop on Document Analysis Systems, DAS, 2006 |
|
Video Text Recognition using Sequential Monte Carlo and Error Voting Methods, and , in: Pattern Recognition Letters, 26(9), 2005 |
|
Using Pitch as Prior Knowledge in Template-Based Speech Recognition, , and , in: Proceedings of ICASSP, 2006 |
|
Using Auxiliary Sources of Knowledge for Automatic Speech Recognition, , École Polytechnique Fédérale de Lausanne, Computer Science Department, 2005 |
|
Unsupervised Spectral Subtraction for Noise-Robust ASR, , , and , in: Proceedings of the 2005 IEEE ASRU Workshop, 2005 |
|
Two-Handed Gesture Recognition, and , Idiap-RR-24-2005 |
|
Towards Explaining the Success (Or Failure) of Fusion in Biometric Authentication, and , Idiap-RR-43-2005 |
|
Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition, and , Idiap-RR-64-2005 |
|
Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, , and , in: Proceedings of ICASSP 2006, 2006 |
|
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), , and , in: Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005, 2005 |
|
The Multi-Channel Wall Street Journal Audio Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments, , , and , Idiap-RR-69-2005 |
|
The AMI Meeting Corpus: a Pre-Announcement, , , , , , , , , , , , , , , , and , in: Machine Learning for Multimodal Interaction: Second International Workshop, MLMI'2005, 2005 |
|
Stable Directed Belief Propagation in Gaussian DAGs using the auxiliary variable trick, and , Idiap-RR-72-2005 |
|
Sports Event Recognition using Layered HMMs, and , Idiap-RR-07-2005 |
|
Speech Acquisition in Meetings with an Audio-Visual Sensor Array, , , , and , in: Proc. IEEE ICME, 2005 |
|
Spectral Entropy Feature in Multi-stream for Robust ASR, and , Idiap-RR-45-2005 |
|
Spectral Entropy Feature in Full-Combination Multi-Stream for Robust ASR, and , in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2005 |
|
Sociometry Based Multiparty Audio Recordings Segmentation, , in: Proceedings of the IEEE Conference on Multimedia and Expo (ICME 2006), 2006 |
|
Semi-supervised Meeting Event Recognition with Adapted HMMs, , and , in: Proc. IEEE ICME, 2005 |
|
Semi-supervised Adapted HMMs for Unusual Event Detection, , , and , in: Proc. IEEE CVPR, 2005 |
|
Probabilistic Tagging of Unstructured Genealogical Records, and , Idiap-RR-86-2005 |
|
Person Authentication using Brainwaves (EEG) and Maximum A Posteriori Model Adaptation, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, Special Issue on Biometrics, 2007 |
|
Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps, , and , in: IEEE Pattern Analysis and Machine intelligence, 2007 |
|
Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing in Supervised Learning, , , and , Idiap-RR-88-2005 |
|
Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing in Supervised Learning, , , and , 2005 |
|
On Variable-scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, , and , in: Speech Communication, 48(9), 2006 |
|
On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, , and , Idiap-RR-19-2005 |
|
On transforming statistical models for non-frontal face verification, , and , in: Pattern Recognition (in press), 2005 |
[DOI] |
On Accuracy/Robustness/Complexity Trade-Offs in Face Verification, , and , in: IEEE International Conference on Information Technology and Applications, ICITA, 2005 |
|
OCR Based Slide Retrieval, , and , Idiap-RR-11-2005 |
|
Non-Invasive Estimation of Local Field Potentials for Neuroprosthesis Control, , , , and , in: Cognitive Processing, Special Issue on Motor Planning in Humans and Neuroprosthesis Control, 6(1), 2005 |
|
Noisy Text Categorization, , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(12), 2005 |
|
Multiview Face Detection, , and , Idiap-RR-49-2005 |
|
Multi-stream ASR: An Oracle Perspective, , and , in: Proceedings of ISCA International Conference on Spoken Language Processing (ICSLP), 2006 |
|
Multi-resolution RASTA filtering for TANDEM-based ASR, and , in: Proceedings of Interspeech 2005, 2005 |
|
Multimodal Multispeaker Probabilistic Tracking in Meetings, , , and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2005 |
|
Multimodal Integration for Meeting Group Action Segmentation and Recognition, , , , , and , in: MLMI, 2005 |
|
Multimedia event modelling and recognition, , École Polytechnique Fédérale de Lausanne, 2005 |
Multichannel Speech Enhancement in Cars: Explicit vs. Implicit Adaptation Control, , and , in: Proceedings of HSCMA 2005, 2005 |
|
Multi Channel Sequence Processing, and , Idiap-RR-04-2005 |
|
Monte Carlo Video Text Segmentation, , and , in: International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI), 19(5), 2005 |
|
Modeling Scenes with Local Descriptors and Latent Aspects, , , , , and , in: IEEE Int. Conf. on Computer Vision, 2005 |
|
Modeling Interactions from Email Communication, , and , in: Proc. IEEE International Conference on Multimedia & Expo (ICME), 2006 |
|
Measuring the Performance of Face Localization Systems, , , and , in: Image and Vision Computing Journal, 24(8), 2006 |
|
Machine Learning for Multimodal Interaction: First International Workshop, MLMI'2004, Springer-Verlag Heidelberg, 2005 |
Local Features and 1D-HMMs for Fast and Robust Face Authentication, , Idiap-RR-17-2005 |
|
Local Binary Patterns as an Image Preprocessing for Face Authentication, , and , in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006 |
|
Lighting Normalization Algorithms for Face Verification, , and , Idiap-Com-03-2005 |
|
Learning influence among interacting Markov chains, , , and , in: NIPS, 2005 |
|
Kernelized Infomax Clustering, and , Idiap-RR-73-2005 |
|
Joint Training of Multi-Stream HMMs, , Idiap-RR-22-2005 |
|
Joint Speech and Speaker Recognition, , École Polytechnique Fédérale de Lausanne, Computer Science Department, 2005 |
|
Interfaces Cerebrales, , in: Mente y Cerebro, 13(July), 2005 |
|
Inferring Document Similarity from Hyperlinks, and , in: ACM Conference on Information and Knowledge Management, 2005 |
|
Improving Speech Recognition Using a Data-Driven Approach, , and , in: Proceedings of Interspeech, 2005 |
|
Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
Improving Continuous Speech Recognition System Performance with Grapheme Modelling, , , and , Idiap-RR-16-2005 |
|
Implicit Control of Noise Canceller for Speech Enhancement, , and , in: Proceedings of INTERSPEECH 2005, 2005 |
|
How Do Correlation and Variance of Base-Experts Affect Fusion in Biometric Authentication Tasks?, and , in: IEEE Trans. on Signal Processing, 2005 |
|
Hierarchical Multi-Stream Posterior Based Speech Recognition System, , and , in: Proceedings MLMI workshop, 2005 |
|
Hierarchical approach for spotting keywords, , Idiap-RR-41-2005 |
|
Harmonic Plus Noise Model for Concatenative Speech Synthesis, , Idiap-RR-37-2005 |
|
Gradient estimates of return distributions, and , in: PASCAL Workshop on Principled Methods of Trading Exploration and Exploitation, 2005 |
|
Generative Temporal ICA for Classification in Asynchronous BCI Systems, and , in: The 2nd International IEEE EMBS Conference On Neural Engineering, 2005 |
|
Generative Temporal ICA for Classification in Asynchronous BCI Systems, and , Idiap-RR-08-2005 |
|
Generative Independent Component Analysis for EEG Classification, and , in: European Symposium on Artificial Neural Networks ESANN, 2005 |
|
From Meeting Recordings to Web Distribution: Description of the Process, and , Idiap-Com-05-2005 |
|
F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, and , in: Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-05), 2005 |
|
Finding groups of people in Google news, and , in: ACM Int. Conf. on Human-Centered Multimedia (HCM), 2006 |
|
Face Authentication Based on Local Features and Generative Models, , École Polytechnique Fédérale de Lausanne, 2005 |
|
Extracting Information from Multimedia Meeting Collections, , and , in: 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005 |
|
Exploiting Hyperlinks to Learn a Retrieval Model, and , in: NIPS Workshop on Learning to Rank, 2005 |
|
Evaluation of Multiple Cues Head Pose Tracking Algorithm in Natural Environments, and , in: International Conference on Multimedia & Expo ICME 2005, 2005 |
|
Efficient Kalman Smoothing for Harmonic State-Space Models, , Idiap-RR-87-2005 |
|
Efficient Diffusion-based Illumination Normalization for Face Verification, , and , Idiap-RR-46-2005 |
|
EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, and , in: Sixth International Workshop on Multiple Classifier System (MCS2005), 2005 |
|
Developing and Enhancing Posterior Based Speech Recognition Systems, , , and , in: Proceedings of Interspeech, 2005 |
|
Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, , and , in: Proceedings of International Conference on Pattern Recognition (ICPR), 2006 |
|
Continuous Microphone Array Speech Recognition on Wall Street Journal Corpus, , and , Idiap-RR-47-2005 |
|
Construction and comparison of approximations for switching linear gaussian state space models, and , Idiap-RR-06-2005 |
|
Construction and comparison of approximations for switching linear gaussian state space models, , Idiap-RR-71-2005 |
|
Constructing visual models with a latent space approach, , , and , in: the Springer series of Lecture Notes in Computer Science, 2006 |
|
Compensating User-Specific Information with User-Independent Information in Biometric Authentication Tasks, and , Idiap-RR-44-2005 |
|
Chord Representations for Probabilistic Models, , and , Idiap-RR-58-2005 |
|
Can Chimeric Persons Be Used in Multimodal Biometric Authentication Experiments?, and , Idiap-RR-20-2005 |
|
Can a Professional Imitator Fool a GMM-Based Speaker Verification System?, and , Idiap-RR-61-2005 |
|
Benchmarking Non-Parametric Statistical Tests, , and , in: Advances in Neural Information Processing Systems, NIPS 18. MIT Press, 2005 |
|
Bayesian Factorial Linear Gaussian State-Space Models for Biosignal Decomposition, and , in: IEEE Signal Processing Letters, 2007 |
|
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , in: Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag, 2005 |
|
Audio-visual probabilistic tracking of multiple speakers in meetings, , , and , in: IEEE Trans. on Audio, Speech, and Language Processing, accepted for publication, 2006 |
|
Application of Information Retrieval Techniques to Single Writer Documents, , in: Pattern Recognition Letters, 26(14-15), 2005 |
|
Activity Report 2004, , Idiap-Com-01-2005 |
|
A Video Database for Head Pose Tracking Evaluation, and , Idiap-Com-04-2005 |
|
A Unified Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification, and , in: IEEE Signal Processing Letters, 12(7), 2005 |
|
A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR, , and , in: Proceedings of INTERSPEECH 2005, 2005 |
|
A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, and , in: Proceedings of ICASSP 2005, 2005 |
|
A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, and , in: ACM ICMI Workshop on Multimodal Multiparty Meeting Processing (MMMP), 2005 |
|
A Probabilistic Model for Chord Progressions, , and , in: Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR), 2005 |
|
A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, , and , in: Advances in Neural Information Processing Systems, NIPS 15, 2005 |
|
A Neural Network for Text Representation, and , in: International Conference on Artificial Neural Networks, ICANN, 2005 |
|
A Meeting Browser Evaluation Test, , , and , in: CHI '05: Proceedings of the SIGCHI conference on Human factors in computing systems, Portland, OR, USA, ACM Press, 2005 |
|
A Kernel Classifier for Distributions, and , Idiap-RR-32-2005 |
|
A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, , , and , in: Proceedings of the 22nd International Conference on Machine Learning, 2005 |
|
A Generative Model for Music Transcription, , and , Idiap-RR-89-2005 |
|
A Discriminative Decoder for the Recognition of Phoneme Sequences, and , Idiap-RR-67-2005 |
|
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , Idiap-RR-58-2006 |
|
Very High Frequency Oscillations (VHFO) as a Predictor of Movement Intentions, , , , , and , in: NeuroImage, 32(1), 2006 |
|
Using Posterior-Based Features in Template Matching for Speech Recognition, , and , in: International Conference on Spoken Language Processing, 2006 |
|
Using more informative posterior probabilities for speech recognition, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006 |
|
Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006 |
|
User-Customized Password Speaker Verification Using Multiple Reference and Background Models, and , in: Speech Communication, 8, 2006 |
|
Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, , and , Idiap-RR-57-2006 |
|
Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels, , and , Idiap-RR-09-2006 |
|
Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, and , in: NIPS, 2006 |
|
Two-Handed Gestures for Human-Computer Interaction, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Two-Handed Gestures for Human-Computer Interaction, , Idiap-RR-73-2006 |
|
Tracking the Multi Person Wandering Visual Focus of Attention, , , and , in: International Conference on Multimodal Interfaces (ICMI06), 2006 |
|
Tracking Attention for Multiple People: Wandering Visual Focus of Attention Estimation, , , and , Idiap-RR-40-2006 |
|
Towards using slide information to enhance speech transcription of meetings, , and , Idiap-RR-01-2006 |
|
Towards a Robust BCI: Error Potentials and Online Learning, , and , in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, 14(2), 2006 |
|
The segmentation of multi-channel meeting recordings for automatic speech recognition, , and , in: Int. Conf. on Spoken Language Processing (Interspeech ICSLP), 2006 |
|
The more you learn, the less you store: memory-controlled incremental SVM, and , Idiap-RR-51-2006 |
|
The More you Learn, the Less you Store: Memory-Controlled Incremental SVM, and , in: Proceedings of International Cognitive Vision Workshop (ICVW 2006), 2006 |
|
The Juicer LVCSR Decoder - User Manual for Juicer version 0.5.0, , Idiap-Com-03-2006 |
|
The BCI Competition III: Validating Alternative Approaches to Actual BCI Problems, , , , , , , , , and , in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, 14(2), 2006 |
Switching Linear Dynamical Systems for Noise Robust Speech Recognition, and , Idiap-RR-08-2006 |
|
SVM-based Transfer of Visual Knowledge Across Robotic Platforms, , and , Idiap-RR-65-2006 |
|
Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis, and , in: IEEE Trans. on Audio, Speech and Language Processing, 14(5), 2006 |
|
Spiking Neuron Networks: A survey, , Idiap-RR-11-2006 |
|
Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array, , and , Idiap-RR-24-2006 |
|
Speech Coding based on Spectral Dynamics, , , and , in: Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2006 |
|
Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, and , Idiap-RR-29-2006 |
|
Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2006 |
|
Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, , Idiap-RR-77-2006 |
|
Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Sociometry Based Multiparty Audio Recordings Summarization, , in: Proceedings of International Conference on Pattern Recognition (ICPR 2006), 2006 |
|
Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, , and , Idiap-RR-75-2006 |
|
Robust-to-Illumination Face Localisation using Active Shape Models and Local Binary Patterns, , and , Idiap-RR-47-2006 |
|
Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, , and , in: Multimodal User Authentication (MMUA), 2006 |
|
Recognizing People's Focus of Attention from Head Poses: a Study, and , Idiap-RR-42-2006 |
|
Probabilistic Graphical Models for Human Interaction Analysis, , Idiap-RR-78-2006 |
|
Probabilistic Graphical Models for Human Interaction Analysis, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Prior Knowledge in Kernel Methods, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Posterior Based Keyword Spotting with A Priori Thresholds, , , and , in: International Conference on Spoken Language Processing (ICSLP), 2006 |
|
ORGIDIAP : le couteau suisse pour la gestion d'une entreprise, and , Idiap-Com-05-2006 |
|
Online statistical estimation for vehicle control, , Idiap-RR-13-2006 |
|
Online Classifier Adaptation in Brain-Computer Interfaces, and , Idiap-RR-16-2006 |
|
On the Recent Use of Local Binary Patterns for Face Authentication, , and , Idiap-RR-34-2006 |
|
Observations on Multi-Band Asynchrony in Distant Speech Recordings, , Idiap-RR-74-2006 |
|
Non-Invasive Brain-Actuated Control of a Mobile Robot by Human EEG, , , and , in: 2006 IMIA Yearbook of Medical Informatics, Schattauer Verlag, 2006 |
Nearly optimal exploration-exploitation decision thresholds, , in: Int. Conf. on Artificial Neural Networks (ICANN), 2006 |
|
Natural Scene Image Modeling using Color and Texture Visterms, and , in: Conference on Image and Video Retrieval CIVR, 2006 |
|
Multi-system Biometric Authentication: Optimal Fusion and User-Specific Information, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Multi-stream Processing for Noise Robust Speech Recognition, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Multi-Person Tracking in Meetings: A Comparative Study, , , , , and , in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006 |
|
Model Adaptation for Sentence Unit Segmentation from Speech, , Idiap-RR-64-2006 |
|
Melanoma Recognition using Kernel Classifiers, , and , Idiap-RR-53-2006 |
|
Master Thesis: Integration of the Harmonic plus Noise Model (HNM) into the Hidden Markov Model-Based Speech Synthesis System (HTS), , Idiap-RR-69-2006 |
|
Managing IDIAP Inventory (Computers, Components, Software and Licences), and , Idiap-Com-04-2006 |
|
Machine Learning Approaches to Text Representation using Unlabeled Data, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Learning to Retrieve Images from Text Queries with a Discriminative Model, , and , in: International Workshop on Adaptive Multimedia Retrieval (AMR), 2006 |
|
Kernel Methods for Melanoma Recognition, , and , in: Proceedings of Workshop on Computer Vision Approaches to Medical Image Analysis (CVAMIA 2006), 2006 |
|
Juicer: A Weighted Finite-State Transducer speech decoder, , , , , and , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms MLMI'06, 2006 |
|
Investigating Lexical Substitution Scoring for Subtitle Generation, , , , and , in: Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL), 2006 |
|
Integrating co-occurrence and spatial contexts on patch-based scene segmentation, , , and , in: Beyond Patches Workshop, in conjunction with CVPR, 2006 |
|
Infinite Models for Speaker Clustering, , in: International Conference on Spoken Language Processing, 2006 |
|
Indexation de Documents Manuscrits, , in: Proceedings du Colloque International Francophone sur l'Ecrit et le Document (CIFED06), 2006 |
|
Incremental Learning for Place Recognition in Dynamic Environments, , , and , Idiap-RR-52-2006 |
|
Identifying unexpected words using in-context and out-of-context phoneme posteriors, and , Idiap-RR-68-2006 |
|
Hand Posture Classification and Recognition using the Modified Census Transform, , and , in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006 |
|
Further Applications of Sector-Based Detection and Short-Term Clustering, , Idiap-RR-26-2006 |
|
Face Detection and Verification using Local Binary Patterns, , Idiap-RR-79-2006 |
|
Face Detection and Verification using Local Binary Patterns, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Face Authentication Using Adapted Local Binary Pattern Histograms, and , in: 9th European Conference on Computer Vision (ECCV), 2006 |
|
Exploring Contextual Information in a Layered Framework for Group Action Recognition, , and , in: Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006 |
|
Estimating the Confidence Interval of Expected Performance Curve in Biometric Authentication Using Joint Bootstrap, and , Idiap-RR-25-2006 |
|
Ensembles for Sequence Learning, , École Polytechnique Fédérale de Lausanne, 2006 |
|
EEG Classification using Generative Independent Component Analysis, and , in: Neurocomputing, 2006 |
|
Discriminant Models for Text-independent Speaker Verification, , Idiap-RR-70-2006 |
|
Discriminative Kernel-Based Phoneme Sequence Recognition, , , , and , Idiap-RR-14-2006 |
|
Discriminant linear processing of time-frequency plane, and , in: International Conference on Spoken Language Processing, 2006 |
|
Detection and Application of Influence Rankings in Small Group Meetings, , , and , in: Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006 |
|
Detecting Intentional Mental Transitions in an Asynchronous BCI, , , , and , Idiap-RR-43-2006 |
|
Detecting Abandoned Luggage Items in a Public Space, , and , in: IEEE Performance Evaluation of Tracking and Surveillance Workshop (PETS), 2006 |
|
Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, , , and , in: Workshop on Multimodal User Authentication (MMUA), 2006 |
|
Audio Coding Based on Long Temporal Segments: Experiments With Quantization of Excitation Signal, and , Idiap-RR-46-2006 |
|
Audio Coding Based on Long Temporal Contexts, , , and , Idiap-RR-30-2006 |
|
Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations, , and , Idiap-RR-56-2006 |
|
Application of Information Retrieval Technologies to Presentation Slides, and , in: IEEE Transactions on Multimedia, 8(5), 2006 |
|
Annotation of face detection: description of XML format and files, , , and , Idiap-Com-06-2006 |
|
Analyzing Group Interactions in Conversations: a Review, , in: IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI), 2006 |
|
Analysis and Classification of EEG Signals using Probabilistic Models for Brain Computer Interfaces, , École Polytechnique Fédérale de Lausanne, 2006 |
|
An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Activity Report 2005, , Idiap-Com-01-2006 |
|
Active Shape Models Using Local Binary Patterns, and , Idiap-RR-07-2006 |
|
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, and , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006 |
|
A Neural Network to Retrieve Images from Text Queries, and , in: International Conference on Artificial Neural Networks (ICANN), 2006 |
|
A Multitask Learning Approach to Document Representation using Unlabeled Data, and , Idiap-RR-44-2006 |
|
A Max Kernel For Text-Independent Speaker Verification Systems, and , in: Second Workshop on Multimodal User Authentication, MMUA, 2006 |
|
A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition, , and , Idiap-RR-62-2006 |
|
A Bayesian Alternative to Gain Adaptation in Autoregressive Hidden Markov Models, and , Idiap-RR-55-2006 |
|
2D Multi-Person Tracking: A Comparative Study in AMI Meetings, , , , , and , in: Classification of Events, Activities, and Relationships (CLEAR) 2006, 2006 |
|
Towards Brain-Computer Interfacing, , , and , The MIT Press, 2007 |
The IDIAP Brain-Computer Interface: An Asynchronous Multi-Class Approach, , and , in: Towards Brain-Computer Interfacing, The MIT Press, 2007 |
Tapping the Mind or Resonating Minds?, , in: European Visions for the Knowledge Age, Cheshire Henbury, 2007 |
Speech Recognition based on Template Matching and Phone Posterior Probabilities, , and , Idiap-Com-02-2007 |
|
Sparse Probabilistic Classifiers, and , in: International Conference on Machine Learning (ICML), 2007 |
|
Short-Term Spatio-Temporal Clustering Applied to Multiple Moving Speakers, and , in: IEEE Transactions on Audio, Speech and Language Processing, 15(5):15, 2007 |
|
Scalable Wide-band Audio Codec based on Frequency Domain Linear Prediction, , , and , Idiap-RR-16-2007 |
|
Role Recognition in Broadcast News Using Social Network Analysis and Duration Distribution Modeling, , in: IEEE Transactions on Multimedia, 2007 |
|
On Confusions in a Phoneme Recognizer, , and , 2007 |
|
Non-Invasive Estimates of Local Field Potentials for Brain-Computer Interfaces, , , and , in: Towards Brain-Computer Interfacing, The MIT Press, 2007 |
Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, , and , Idiap-RR-09-2007 |
|
More Efficiency in Multiple Kernel Learning, , , and , in: International Conference on Machine Learning (ICML), 2007 |
|
Learning the structure of image collections with latent aspect models, , École Polytechnique Fédérale de Lausanne, 2007 |
|
Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, and , Idiap-RR-15-2007 |
|
Joint Bi-Modal Face and Speaker Authentication using Explicit Polynomial Expansion, , Idiap-RR-14-2007 |
|
Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , Idiap-RR-08-2007 |
|
Feature Selection Methods on Distributed Linear Inverse Solutions for a Non-Invasive Brain-Machine Interface, , and , Idiap-Com-04-2007 |
|
Face Authentication with Salient Local Features and Static Bayesian Network, and , Idiap-RR-04-2007 |
|
Error-Related EEG Potentials in Brain-Computer Interfaces, and , in: Towards Brain-Computer Interfacing, The MIT Press, 2007 |
Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics, and , Idiap-RR-13-2007 |
|
Correcting Confusion Matrices for Phone Recognizers, , Idiap-Com-03-2007 |
|
Confidence-based Cue Integration for Visual Place Recognition, and , Idiap-RR-17-2007 |
|
Adaptation in Brain-Computer Interfaces, , , , , , , , , , and , in: Towards Brain-Computer Interfacing, The MIT Press, 2007 |
A study of phoneme and grapheme based context-dependent ASR systems, and , Idiap-RR-12-2007 |
|
A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems, and , in: Pattern Recognition, 2007 |
|
Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, and , Idiap-RR-30-2007 |
|
Combination of Agglomerative and Sequential Clustering for Speaker Diarization, , and , Idiap-RR-51-2007 |
|
Agglomerative Information Bottleneck for Speaker Diarization of Meetings Data, , and , Idiap-RR-31-2007 |
|
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , Idiap-RR-45-2007 |
|
Recognition and Understanding of Meetings: The AMI and AMIDA Projects, , and , Idiap-RR-46-2007 |
|
A Thousand Words in a Scene, , , and , Idiap-RR-40-2005 |
|
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , Idiap-RR-65-2007 |
|
Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, , , and , Idiap-RR-26-2007 |
|
A Generative Model for Rhythms, , , and , Idiap-RR-70-2007 |
|
A Distance Model for Rhythms, , , and , Idiap-RR-33-2008 |
|
The Projectron: a Bounded Kernel-Based Perceptron, , and , Idiap-RR-30-2008 |
|
A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, and , Idiap-RR-20-2007 |
|
Modeling semantic aspects for cross-media image indexing, and , Idiap-RR-56-2005 |
|
Non-linear Spectral Contrast Stretching for In-car Speech Recognition, and , Idiap-RR-53-2007 |
|
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, , , , , , , , and , Idiap-RR-29-2007 |
|
A Discriminative Kernel-based Model to Rank Images from Text Queries, and , Idiap-RR-38-2007 |
|
Hierarchical Penalization, , and , Idiap-RR-76-2007 |
|
The use of brain-computer interfacing for ambient intelligence, , , , , and , Idiap-RR-61-2007 |
|
Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , Idiap-RR-48-2007 |
|
Feature Extraction for Multi-class BCI using Canonical Variates Analysis, , , , and , Idiap-RR-23-2007 |
|
To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, , and , Idiap-RR-37-2007 |
|
A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, , , , and , Idiap-RR-78-2007 |
|
Recognition and Understanding of Meetings: Overview of the European AMI and AMIDA Projects, and , Idiap-RR-27-2008 |
|
Characterizing the EEG Correlates of Exploratory Behavior, , , and , Idiap-RR-28-2008 |
|
Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, and , Idiap-RR-50-2007 |
|
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, and , Idiap-RR-21-2007 |
|
Detection and Recognition of Number Sequences in Spoken Utterances, and , Idiap-RR-42-2007 |
|
Posterior-Based Features and Distances in Template Matching for Speech Recognition, and , Idiap-RR-41-2007 |
|
A graphical tool for monitoring Oz objects activity, and , in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995 |
ETC_vérif, a Prototype of a Cooperative Automatic Speech Recognition System, and , in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995 |
Role Recognition in Radio Programs using Social Affiliation Networks and Mixtures of Discrete Distributions: an Approach Inspired by Social Cognition, and , Idiap-RR-40-2007 |
|
Mapping Nonverbal Communication into Social Status: Automatic Recognition of Journalists and Non-journalists in Radio News, , Idiap-RR-33-2007 |
|
Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, , and , in: IEEE International Conference on Multimedia and Expo (ICME), 2007 |
|
Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, and , in: ACM International Conference on Multimedia, 2007 |
|
Combination of Agglomerative and Sequential Clustering for Speaker Diarization, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Agglomerative Information Bottleneck for Speaker Diarization of Meetings Data, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2007 |
|
Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, , and , Idiap-RR-26-2008 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , Idiap-RR-12-2008 |
|
Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , in: Interspeech 2007, 2007 |
|
Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, , and , in: Interspeech 2007, 2007 |
|
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, , and , Idiap-RR-18-2008 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , Idiap-RR-17-2008 |
|
Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, , and , Idiap-RR-05-2008 |
|
Discriminative Cue Integration for Medical Image Annotation, , and , Idiap-RR-64-2007 |
|
The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks, , and , in: Neural Computation, 8(02), 1996 |
|
Evaluating pruning methods, and , in: 1995 International Symposium on Artificial Neural Networks (ISANN'95), 1995 |
|
Gain Elimination from Backpropagation Neural Networks, , and , in: Proceedings of the International Conference on Neural Networks, Perth, IEEE, 1995 |
High Order and Multilayer Perceptron Initialization, and , Idiap-RR-07-1994 |
|
Weight Initialization for High Order and Multilayer Perceptrons, and , in: Proceedings of the '94 SIPAR-Workshop on Parallel and Distributed Computing, SI Group for Parallel Systems, 1994 |
Modular Object-Oriented Neural Network Simulators and Topology Generalizations, , and , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN 94), Sorrento, Italy, Springer-Verlag, 1994 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , Idiap-RR-25-2008 |
|
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , Idiap-RR-24-2008 |
|
Time Resolved Polarimetry on an Optical Fiber Ammeter, and , in: Journal of the European Optical Society, 5, 1996 |
Optical Multilayer Perceptrons based on Liquid Crystal Devices, , , and , in: Optics and Information, Cercle SFO/SEE d'Opto-informatique, Mulhouse, France, European Optical Society (EOS), 1995 |
Adaptive Multilayer Optical Neural Network with Optical Thresholding, and , in: Optical Engineering, 34(08), 1995 |
Adaptive Multilayer Optical Neural Network Design, and , Idiap-RR-04-1994 |
|
Recognition and Understanding of Meetings: The AMI and AMIDA Projects, , and , in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'07, 2007 |
|
A Thousand Words in a Scene, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007 |
|
Confidence-based Cue Integration for Visual Place Recognition, and , in: IEEE International Conference on Intelligent Robot Systems (IROS), 2007 |
|
Boolean Logic Inspired High Order Perceptron Construction, , and , in: SIPAR Workshop'95 Parallel and Distributed Systems, SIPAR SI Group for Parallel Systems, Biel School of Engineering, Computer Science Department, 1995 |
|
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , Idiap-RR-20-2008 |
|
Reverse Correlation for analyzing MLP Posterior Features in ASR, , and , Idiap-RR-13-2008 |
|
Comparing Different Word Lattice Rescoring Approaches Towards Keyword Spotting, , , and , Idiap-RR-32-2007 |
|
Significance of Contextual Information in Phoneme Recognition, , , and , Idiap-RR-28-2007 |
|
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Silence Models in Weighted Finite-State Transducers, , Idiap-RR-19-2008 |
|
A Weighted Finite State Transducer tutorial, , Idiap-Com-03-2008 |
|
Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, , , and , in: 3rd European Conference on Mobile Robots (ECMR 2007), 2007 |
|
A supervised learning approach based on STDP and polychronization in spiking neuron networks, , and , in: European Symposium on Artificial Neural Networks, ESANN, 2007 |
|
A Data-driven Approach to Speech/Non-speech Detection, and , Idiap-RR-23-2008 |
|
Exploiting contextual information for speech/non-speech detection, and , Idiap-RR-22-2008 |
|
Exploiting temporal context for speech/non-speech detection, , and , Idiap-RR-21-2008 |
|
A Generative Model for Rhythms, , , and , in: NIPS Workshop on Brain, Music and Cognition, 2007 |
|
A Distance Model for Rhythms, , , and , in: 25th International Conference on Machine Learning (ICML), 2008 |
|
On-line Independent Support Vector Machines for Cognitive Systems, , , , and , Idiap-RR-63-2007 |
|
The Projectron: a Bounded Kernel-Based Perceptron, , and , in: Int. Conf. on Machine Learning, 2008 |
|
Indoor Place Recognition using Online Independent Support Vector Machines, , , , and , in: 18th British Machine Vision Conference (BMVC07), 2007 |
|
A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, and , in: International Conference on Multi-Media & Expo (ICME07), 2007 |
|
Analyzing Flickr Groups, and , Idiap-RR-03-2008 |
|
Detecting queues at vending machines: a statistical layered approach, and , Idiap-RR-04-2008 |
|
Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes, , , and , in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2007 |
|
LP-TRAPs in all senses, , Idiap-RR-66-2007 |
|
Non-uniform QMF Decomposition for Wide-band Audio Coding based on Frequency Domain Linear Prediction, , , and , Idiap-RR-43-2007 |
|
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding, , , and , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007 |
|
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Neural Networks with Adaptive Learning Rate and Momentum Terms, and , Idiap-RR-04-1995 |
|
Modeling semantic aspects for cross-media image indexing, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007 |
|
The Effects of Optical Thresholding in Backpropagation Neural Networks, , and , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'95 and NeuroNimes'95), ENNS, Paris, France, EC2 & Cie, 1995 |
Results on the Steepness in Backpropagation Neural Networks, , and , in: Proceedings of the '94 SIPAR-Workshop on Parallel and Distributed Computing, SI Group for Parallel Systems, 1994 |
Non-Invasive Brain-Machine Interaction, , , , and , in: International Journal of Pattern Recognition and Artificial Intelligence, 2008 |
|
Brain-Controlled Robots, , in: IEEE Intelligent Systems, 2008 |
|
Brain-Computer Interfaces for HCI and Games, , , , , and , in: Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, 2008 |
|
High-Resolution EEG Techniques for Brain-Computer Interface Applications, , , , , , , , , , , and , in: Journal of Neuroscience Methods, 2007 |
|
An Asynchronous and Non-Invasive Brain-Actuated Wheelchair, , , , , , , and , in: Proceedings of the 13th International Symposium on Robotics Research, 2007 |
|
Augmenting Astronaut's Capabilities through Brain-Machine Interfaces, , , and , in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007 |
|
Adaptive Shared Control of a Brain-Actuated Simulated Wheelchair, , , , , , , and , in: Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, 2007 |
|
Brain-Machine Interfaces through Control of Electroencephalographic Signals and Vibrotactile Feedback, , , , , , , , and , in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007 |
|
Vibrotactile Feedback in the Context of Mu-Rhythm based BCI, , , , , , , , , , , and , in: Proceedings of the 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2007 |
|
Context-based Filtering for Assisted Brain-Actuated Wheelchair Driving, , , , , , , and , in: Computational Intelligence and Neuroscience, 2007, 2007 |
|
Vibrotactile Feedback for Brain-Computer Interface Operation, , , , , , , , , , , and , in: Computational Intelligence and Neuroscience, 2007, 2007 |
|
Non-Invasive Brain-Actuated Interaction, , , , and , in: Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence, 2007 |
|
Prospects on Brain-Machine Interfaces for Space System Control, , , , , , , , , , , , , , , , , and , in: Proceedings of the 57th International Astronautical Conference, 2006 |
|
Haptic Feedback Compared with Visual Feedback for BCI, , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Inference in Switching Linear Dynamical Systems Applied to Noise Robust Speech Recognition of Isolated Digits, , Idiap-RR-35-2008 |
|
A Bayesian Switching Linear Dynamical System for Scale-Invariant robust speech extraction, and , Idiap-RR-52-2007 |
|
Google Portrait, , and , Idiap-Com-07-2007 |
|
On the Recent Use of Local Binary Patterns for Face Authentication, , and , in: International Journal on Image and Video Processing Special Issue on Facial Image Processing, 2007 |
|
Traitement préliminaire de l'image d'un texte manuscrit en vue de sa reconnaissance: une méthode de sur-segmentation, , and , in: 4eme Colloque National sur l'Écrit et le Document (CNED'96), 1996 |
Experiments with robust similarity measures for OCR, , Idiap-RR-03-1995 |
Object Category Detection using Audio-visual Cues, , , , and , Idiap-RR-58-2007 |
|
Incremental Learning for Place Recognition in Dynamic Environments, , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS07), 2007 |
|
Object Category Detection using Audio-visual Cues, , , , and , in: International Conference on Computer Vision Systems (ICVS08), 2008 |
|
SVM-based Transfer of Visual Knowledge Across Robotic Platforms, , and , in: International Conference on Computer Vision Systems (ICVS07), 2007 |
|
Visual Speech Recognition using Active Shape Models and Hidden Markov Models, , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96), 1996 |
|
The Anterior Cingulate Cortex, , Idiap-Com-02-2008 |
|
A Neural Network based Regression Approach for Recognizing Simultaneous Speech, , , , and , Idiap-RR-10-2008 |
|
Neural Network based Regression for Robust Overlapping Speech Recognition using Microphone Arrays, , , and , Idiap-RR-09-2008 |
|
Effective post-processing for single-channel frequency-domain speech enhancement, , Idiap-RR-71-2007 |
|
Robust overlapping speech recognition based on neural networks, , and , Idiap-RR-55-2007 |
|
MLP-based Log Spectral Energy Mapping for Robust Overlapping Speech Recognition, , , and , Idiap-RR-54-2007 |
|
Non-linear Spectral Contrast Stretching for In-car Speech Recognition, and , in: Interspeech-Eurospeech, 2007 |
|
Non-Invasive Brain Computer Interface for Mental Control of a Simulated Wheelchair, , , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Dynamical Dirichlet Mixture Model, , and , Idiap-RR-02-2007 |
|
Kernel Methods for Melanoma Recognition, , and , in: Medical Informatics in Europe (MIE), 2006 |
|
Local velocity-adapted motion events for spatio-temporal recognition, , and , in: Computer Vision and Image Understanding, 108(3), 2007 |
|
Adaptive Beamforming with a Maximum Negentropy Criterion, , , , , and , Idiap-RR-29-2008 |
|
Maximum Negentropy Beamforming, , , , and , Idiap-RR-07-2008 |
|
Adaptive Beamforming with a Maximum Negentropy Criterion, , , , and , Idiap-RR-06-2008 |
|
Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, , , , , and , Idiap-RR-02-2008 |
|
Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, , , , , and , Idiap-RR-77-2007 |
|
Adaptive Beamforming with a Minimum Mutual Information Criterion, , , , , and , Idiap-RR-74-2007 |
|
Minimum Mutual Information Beamforming for Simultaneous Active Speakers, , , , , and , Idiap-RR-73-2007 |
|
Unsupervised Learning for Information Distillation, , Idiap-RR-47-2007 |
|
Discriminative Keyword Spotting, , and , Idiap-RR-31-2008 |
|
Theoretical Foundations for Large-Margin Kernel-Based Continuous Speech Recognition, , Idiap-RR-44-2007 |
|
Human-Centered Computing: Toward a Human Revolution, , , and , Idiap-RR-57-2007 |
|
Human-centered Computing: Toward a Human Revolution, , , and , in: IEEE Computer, 40(5), 2007 |
|
Automatic Word Recognition in Cars, and , in: IEEE Speech and Audio Processing, 1995 |
Définition et évaluation d'un protocole de négociation dans un système multi-agents de reconnaissance de la parole, , Idiap-RR-02-1995 |
Lexical filtering by means of prosodic information, , and , in: International Congress of Phonetic Sciences, 1995 |
The use of prosodic agents in a cooperative automatic speech recognition system, and , in: International Congress of Phonetic Sciences, 1995 |
A study of Intra- and Inter-Speaker Variability in the Voices of Twins for Speaker Verification, and , in: International Congress of Phonetic Sciences, 1995 |
Neural nets approaches to Speaker Verification: comparison with Second Order Statistical Measure, and , in: ICASSP, 1995 |
Environnement multi-agents de reconnaissance automatique de la parole en continu, and , in: Actes des 3emes Journees Francophones sur l'Intelligence Artificielle Distribuee et les Systemes Multi-agents, 1995 |
Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies, , , and , Idiap-RR-60-2007 |
|
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, , , , , , , , and , 2007 |
|
A Novel Statistical Generative Model Dedicated To Face Recognition, and , Idiap-RR-39-2007 |
|
Face Authentication with Salient Local Features and Static Bayesian Network, and , in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007 |
|
VoicePhone: An Interactive Vocal Server for Telephone Numbers, , Idiap-Com-04-1996 |
|
Swiss-French Polyphone: a Telephone Speech Database to develop Interactive Voice Servers, , , and , in: Linguistic Databases, 1995 |
A Discriminative Kernel-based Model to Rank Images from Text Queries, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), X, 2008 |
|
Machine Learning for Information Retrieval, , Idiap-RR-34-2008 |
|
Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, and , in: International Conference on Speech Communication and Technology (INTERSPEECH), 2007 |
|
A Discriminative Approach for the Retrieval of Images from Text Queries, , and , in: European Conference on Machine Learning (ECML), 2006 |
|
Hierarchical Penalization, , and , in: Advances in Neural Information Processing Systems 21, 2007 |
|
The use of brain-computer interfacing for ambient intelligence, , , , , and , in: Constructing Ambient Intelligence: AmI-07 Workshops Proceedings, Max Mühlhäuser, Alois Ferscha, and Erwin Aitenbichler (Eds.), LNCS, Springer Verlag, 2008, 2007 |
|
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , Idiap-RR-16-2008 |
|
Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Feature Extraction for Multi-Class BCI using Canonical Variates Analysis, , , , and , in: Proceedings of the IEEE International Symposium on Intelligent Signal Processing, 2007 |
|
Visuo-Spatial Attention Frame Recognition for Brain-Computer Interfaces, , , , , , and , in: Proceedings of the 1st International Conference on Cognitive Neurodynamics, 2007 |
|
Stationary Features and Cat Detection, and , Idiap-RR-56-2007 |
|
Neural Network Classification and Formalization, , in: Computer Standards & Interfaces, 16(03), 1994 |
Neural Network Formalization, , Idiap-RR-01-1992 |
|
Error-Related EEG Potentials Generated during Simulated Brain-Computer Interaction, and , in: IEEE Trans. on Biomedical Engineering, 55(3), 2008 |
|
High Frequency Bands and Estimated Local Field Potentials to Improve Single-Trial Classification of Electroencephalographic Signals, , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Reliability in a Multi-agent Spoken Language Recognition System, and , in: 4th European Conference on Speech Communication and Technology, 1995 |
Microprosodic study of isolated French word corpora, , in: 4th European Conference on Speech Communication and Technology, 1995 |
Discrimination of the voices of twins and siblings for speaker verification, and , in: 4th European Conference on Speech Communication and Technology, 1995 |
Non-Ontogenic Sparse Neural Networks, , and , in: Proceedings of the International Conference on Neural Networks, IEEE, 1995 |
Keyword Spotting on Word Lattices, and , Idiap-RR-22-2007 |
|
Ontogenic High Order Cauchy Machines, and , in: Proceedings of the SIPAR Workshop '95: Parallel and Distributed Systems, Biel School of Engineering, 1995 |
Validating Different Flexible Vocabulary Approaches on the Swiss French PolyPhone and PolyVar databases, , , and , in: Proceedings of ICSLP 96, 1996 |
Un système prédictif de la structuration syntaxico-rythmique d'un énoncé à l'aide d'informations prosodiques, , and , in: Proceedings of JEP'96: XXIemes Journees d'etude sur la Parole, 1996 |
|
Towards a Multi-agents Approach for Understanding Speech, and , Idiap-Com-05-1996 |
|
Un interface de recherche documentaire: I de r, version 2.0, , Idiap-RR-04-1993 |
|
Un interface d'indexation documentaire: I d'i, version 2.0, , Idiap-RR-03-1993 |
|
Un interface d'indexation documentaire: I d'i, version 1.4, , Idiap-RR-01-1993 |
|
Une technique efficace de traitement en Prolog de la morphologie flexionnelle du français, , Idiap-RR-04-1992 |
|
Un environnement d'analyse linguistique robuste: CPD, version 1.7, , Idiap-RR-03-1992 |
|
To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, , and , in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007 |
|
A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, , , , and , in: 3rd ACM/IEEE Conf on Human-Robot Interaction (HRI08), 2008 |
|
Online Classifier Adaptation in High Frequency EEG, , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Apprentissage de prototypes de caractères à partir de l'image d'un texte manuscrit et avec l'aide d'un opérateur, , Idiap-RR-01-1995 |
|
A system for the off-line recognition of handwritten text, , in: International Conference on Pattern Recognition (ICPR), Jerusalem, 1994 |
Recognition of Handprinted Digits using Optimal Bounded Error Matching, , in: International Conference on Document Analysis and Recognition (ICDAR), Tsukuba Science City, Japan, 1993 |
Design and Implementation of a System for the Recognition of Handwritten Responses on US Census Forms, , in: IAPR Workshop on Document Analysis Systems, 1994 |
Higher-Order Statistics in Visual Object Recognition, , in: Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 1993 |
|
Handwriting Recognition, , in: Second Asian Conference on Computer Vision (ACCV'95), Singapore, 1995 |
Finding Lines under Bounded Error, , Idiap-RR-11-1993 |
|
An RBF Network that Learns Some Aspects of Perceptual Organization, , Idiap-RR-10-1993 |
|
The 3D Indexing Problem, , Idiap-RR-08-1993 |
|
Geometric Matching in Computer Vision--Algorithms and Open Problems, , Idiap-RR-07-1993 |
|
Recognition of Handprinted Digits, , Idiap-RR-06-1993 |
|
Higher-Order Statistics in Visual Object Recognition, , Idiap-RR-02-1993 |
|
Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, and , in: LangTech 2008, 2008 |
|
Characterizing the EEG Correlates of Exploratory Behavior, , , and , in: IEEE Transactions on Neural Systems & Rehabilitation Engineering, 2008 |
|
Biometric Person Authentication Is A Multiple Classifier Problem, and , Idiap-RR-03-2007 |
|
Do Backpropagation trained neural networks have normal weight distributions?, and , in: International Conference on Artificial Neural Networks, 1993 |
|
Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, and , in: International Conference on Multi-media & Expo, 2008 |
|
Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, and , in: Classification of Events, Activities and Relationship Evaluation and Workshop, 2007 |
|
Detection and Recognition of Number Sequences in Spoken Utterances, and , in: 2nd Workshop on Speech in Mobile and Pervasive Environments (SiMPE), 2007 |
|
Posterior Features Applied to Speech Recognition Tasks with Limited Training Data, , and , Idiap-RR-15-2008 |
|
Using KL-based Acoustic Models in a Large Vocabulary Recognition Task, , and , Idiap-RR-14-2008 |
|
Posterior-Based Features and Distances in Template Matching for Speech Recognition, and , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007 |
|
Swiss PolyPhone and PolyVar: Building Databases for Speech Recognition and Speaker Verification, and , in: Proceedings of The 3rd Slovenian-German and 2nd SDRV Workshop, Speech and Image Understanding, 1996 |
Machine Learning for Audio, Image and Video Analysis, and , Springer Verlag, 2008 |