All conference papers
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 |
2024
A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference, , and , in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024 |
A Human Perspective to AI-based Candidate Screening, , , , , , and , in: Proceedings of the 58th Hawaii International Conference on System Sciences (HICSS), 2024 |
A Novel and Responsible Dataset for Face Presentation Attack Detection on Mobile Devices, , , , , and , in: The IEEE International Joint Conference on Biometrics, Buffalo, New York, pages 8, 2024 |
|
A Symbolic Framework for Systematic Evaluation of Mathematical Reasoning with Transformers, , , and , in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024 |
[URL] |
A System for Human-Robot Teaming through End-User Programming and Shared Autonomy, , , , , and , in: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pages 231-239, 2024 |
[DOI] [URL] |
A Unified Model for Gaze Following and Social Gaze Prediction, , , and , in: The 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024 |
|
Adversarial Robustness Analysis in Automatic Pathological Speech Detection Approaches, and , in: Interspeech, 2024 |
|
Annotator-centric Active Learning for Subjective NLP Tasks, , , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2024 |
Are there identifiable structural parts in the sentence embedding whole?, and , in: Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2024 |
Assessing the Reliability of Biometric Authentication on Virtual Reality Devices, , and , in: Proceedings of IEEE International Joint Conference on Biometrics, 2024 |
|
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning, , , , and , in: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024 |
[URL] |
BLM-It - Blackbird Language Matrices for Italian: A CALAMITA Challenge, , , and , in: Proceedings of the 10th Italian Conference on Computational Linguistics, 2024 |
Breaking Template Protection: Reconstruction of Face Images from Protected Facial Templates, and , in: 18th International Conference on Automatic Face and Gesture Recognition (FG), 2024 |
|
Can We Learn to Select the Right Algorithm for OOD Generalization?, and , in: Out Of Distribution Generalization in Computer Vision, Workshop at ECCV, 2024 |
CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition, , , and , in: 18th IEEE Int. Conference on Automatic Face and Gesture Recognition (FG), Istanbul,, 2024 |
|
ChatGPT and biometrics: an assessment of face recognition, gender detection, and age estimation capabilities, , , , and , in: 2024 IEEE International Conference on Image Processing (ICIP), 2024 |
[DOI] [URL] |
ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild, , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
COMPARING DATA-DRIVEN AND HANDCRAFTED FEATURES FOR DIMENSIONAL EMOTION RECOGNITION, , and , in: International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Comparing Stability and Discriminatory Power of Hand-crafted Versus Deep Radiomics: A 3D-Printed Anthropomorphic Phantom Study, , , , , , , , , , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
Configuration Space Distance Fields for Manipulation Planning, , , and , in: Robotics: Science and Systems (RSS), 2024, 2024 |
|
CONTENT-BASED OBJECTIVE EVALUATION OF ARTIFICIALLY GENERATED SIGN LANGUAGE VIDEOS, , , , , and , in: ICASSP, 2024 |
|
CONTEXTUAL BIASING METHODS FOR IMPROVING RARE WORD DETECTION IN AUTOMATIC SPEECH RECOGNITION, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024 |
|
Cross-transfer Knowledge between Speech and Text Encoders to Evaluate Customer Satisfaction, , , , and , in: Interspeech, Kos Island, Greece, ISCA, 2024 |
|
D-LGP: Dynamic Logic-Geometric Program for Reactive Task and Motion Planning, , and , in: IEEE International Conference on Robotics and Automation, 2024 |
|
DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews, , , , , and , in: Proceedings of the 6th Clinical Natural Language Processing Workshop, Association for Computational Linguistics, 2024 |
|
Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition, , and , in: 49th IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2024 |
[DOI] [URL] |
Demographic Fairness Transformer for Bias Mitigation in Face Recognition, and , in: Proceedings of IEEE International Joint Conference on Biometrics (IJCB2024), 2024 |
|
Detecting Criminal Networks via Non-Content Communication Data Analysis Techniques from the TRACY Project, , , , , , , , , and , in: 15th EAI International Conference on Digital Forensics & Cyber Crime, 2024 |
|
Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction, , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024 |
|
DiffuCOMET: Contextual Commonsense Knowledge Diffusion, , , , , and , in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Bangkok, Thailand, pages 4809–4831, Association for Computational Linguistics, 2024 |
[DOI] [URL] |
Does Data-Efficient Generalization Exacerbate Bias in Foundation Models?, , , , , and , in: Proceedings of the 18th European Conference on Computer Vision, 2024 |
|
Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement, , , and , in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Entity Matching Across Small Networks Using Node Attributes, , , , , , , , , and , in: ECAI 2024 - 27th European Conference on Artificial Intelligence, October 19-24, 2024, Santiago de Compostela, Spain - Including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings, 2024 |
[DOI] |
Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models, , and , in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024 |
Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection, and , in: International Joint Conference on Biometrics, 2024 |
|
Explaining models relating objects and privacy, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024 |
[URL] |
Exploring Italian sentence embeddings properties through multi-tasking, , , and , in: Tenth Italian Conference on Computational Linguistics, 2024 |
Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement, , , and , in: Tenth Italian Conference on Computational Linguistics, 2024 |
Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following, , , and , in: Int. Conf. Computer Vision and Pattern Recognition (CVPR), Workshop on Gaze Estimation and Prediction in the Wild, 2024 |
|
Extending the Cooperative Dual-Task Space in Conformal Geometric Algebra, and , in: Proc. IEEE Intl Conf. on Robotics and Automation, 2024 |
|
Face Liveness Detection Competition (LivDet-Face) - 2024, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
Face Recognition Using Lensless Camera, , and , in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024 |
[DOI] [URL] |
Face Reconstruction from Partially Leaked Facial Embeddings, and , in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024 |
[DOI] [URL] |
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024 |
|
Feature Representations for Automatic Meerkat Vocalization Classification, , , and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2024 |
|
FRCSyn Challenge at WACV 2024: Face Recognition Challenge in the Era of Synthetic Data, , , , , and , in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, pages 892-901, 2024 |
[URL] |
GADePo: Graph-Assisted Declarative Pooling Transformers for Document-Level Relation Extraction, , , and , in: Proceedings of the 3rd Workshop on Knowledge Augmented Methods for NLP, Association for Computational Linguistics, 2024 |
[DOI] [URL] |
Generalized Policy Iteration using Tensor Approximation for Hybrid Control, , and , in: International Conference on Learning Representations (ICLR), 2024 |
|
GLoFool: global enhancements and local perturbations to craft adversarial images, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Hardware-effective Approaches for Skill Extraction in Job Offers and Resumes, , , , , , and , in: The 4th Workshop on Recommender Systems for Human Resources, in conjunction with the 18th ACM Conference on Recommender Systems, 2024 |
[URL] |
Heterogeneous Face Recognition Using Domain Invariant Units, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Human Interest or Conflict? Leveraging LLMs for Automated Framing Analysis in TV Shows, , and , in: ACM International Conference on Interactive Media Experiences, 2024 |
|
Image-guided topic modeling for interpretable privacy classification, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Impact of Speech Mode in Automatic Pathological Speech Detection, and , in: EUSIPCO, IEEE, 2024 |
[URL] |
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders, , , , and , in: Findings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Investigating Semantic Segmentation Models to Assist Visually Impaired People, , and , in: European Conference on Computer Vision - Workshops, 2024 |
|
Latent Enhancing AutoEncoder for Occluded Image Classification, , in: Proceedings of International Conference on Image Processing, 2024 |
|
Learning About Social Context from Smartphone Data: Generalization Across Countries and Daily Life Moments, , and , in: Proc. ACM Conference on Human Factors in Computing Systems, 2024 |
|
Learning Goal-oriented Bimanual Dough Rolling Using Dynamic Heterogeneous Graph Based on Human Demonstration, , , , , , , and , in: In Proc. IEEE Intl Conf. on Robotics and Biomimetics (ROBIO), 2024 |
Logic-Skill Programming: An Optimization-based Approach to Sequential Skill Planning, , , and , in: Proc. Robotics: Science and Systems (RSS), 2024 |
|
Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions, , , and , in: Experimental IR Meets Multilinguality, Multimodality, and Interaction - 14th International Conference of the CLEF Association, CLEF, 2024, Grenoble, France, September 9-12, 2024, Proceedings, 2024 |
|
Mitigating Demographic Bias in Face Recognition via Regularized Score Calibration, and , in: IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, IEEE/CVF, 2024 |
|
Modality Agnostic Heterogeneous Face Recognition with Switch Style Modulators, and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
Model Pairing Using Embedding Translation for Backdoor Attack Detection on Open-Set Classification Tasks, , , and , in: NeurIPS Safe Generative AI Workshop 2024, 2024 |
|
Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions, , and , in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, 2024 |
Neural Redshift: Random Networks are not Random Functions, , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 |
Neurocomputational model of speech recognition for pathological speech detection: a case study on Parkinson’s disease speech detection, and , in: Proceedings of Interspeech, Kos Island, Greece, pages 3590-3594, 2024 |
[DOI] [URL] |
Nonparametric Variational Regularisation of Pretrained Transformers, and , in: First conference on Language Modelling, 2024 |
[URL] |
Normalizing Flows for Speaker and Language Recognition Backend, , , , and , in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024 |
|
On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis, and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
Open-Vocabulary Object 6D Pose Estimation, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 |
[URL] |
OptoMechanical Modulation Tomography for Ungated Compressive Cardiac Light Sheet Microscopy, and , in: 2024 IEEE International Symposium on Biomedical Imaging (ISBI), Athens, Greece, pages 1--4, 2024 |
[DOI] [URL] |
OptoMechanical Modulation Tomography for Ungated Compressive Cardiac Light Sheet Microscopy, and , in: 2024 IEEE International Symposium on Biomedical Imaging (ISBI), Athens, Greece, pages 1--4, 2024 |
[DOI] [URL] |
Parametric point spread function estimation for thermal imaging systems using easy-to-manufacture random pattern targets, , , and , in: Target and Background Signatures X: Traditional Methods and Artificial Intelligence, pages 1319905-(1-9), SPIE, 2024 |
[DOI] [URL] |
Parkinson's Disease Detection through Formant and F0 Analysis at Syllable Level, and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
|
Predicting Heart Activity from Speech using Data-driven and Knowledge-based features, , and , in: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2024 |
|
Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, 2024 |
|
ProGAP: Progressive Graph Neural Networks with Differential Privacy Guarantees, and , in: The 17th ACM International Conference on Web Search and Data Mining, 2024 |
|
Recursive Forward Dynamics for Serial Kinematic Chains using Conformal Geometric Algebra, and , in: In Proc. Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2024 |
Refining Tuberculosis Detection in CXR Imaging: Addressing Bias in Deep Neural Networks via Interpretability, , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
Reliability Estimation of News Media Sources: Birds of a Feather Flock Together, , , and , in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics, 2024 |
|
Representing Robot Geometry as Distance Fields: Applications to Whole-body Manipulation, , , and , in: IEEE International Conference on Robotics and Automation, 2024 |
|
Robust Manipulation Primitive Learning via Domain Contraction, , , and , in: Proceedings of Conference on Robot Learning, 2024 |
|
ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Odyssey 2024: The Speaker and Language Recognition Workshop, pages 17-24, 2024 |
[DOI] [URL] |
σ-GPTs: A New Approach to Autoregressive Models., , and , in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024 |
|
Score Normalization for Demographic Fairness in Face Recognition, , , , and , in: IEEE International Joint Conference on Biometrics (IJCB 2024), 2024 |
|
SDFR: Synthetic Data for Face Recognition Competition, , , , and , in: 2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG), IEEE, 2024 |
[DOI] [URL] |
Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pages 3173-3183, 2024 |
[URL] |
Segmenting Object Affordances: Reproducibility and Sensitivity to Scale, , , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup, , and , in: International Conference on Machine Learning (ICML), 2024 |
[URL] |
Sharingan: A Transformer Architecture for Multi-Person Gaze Following, , and , in: Int. Conference Computer Vision and Pattern Recognition (CVPR), Seatle, 2024 |
|
Sparse multi-view hand-object reconstruction for unseen environments, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024 |
[URL] |
Speech and Language Recognition with Low-rank Adaptation of Pretrained Models, , , , and , in: Interspeech 2024, 2024 |
|
Suppressing Noise Disparity in Training Data for Automatic Pathological Speech Detection, and , in: IWAENC, 2024 |
SYLLABLE LEVEL FEATURES FOR PARKINSON'S DISEASE DETECTION FROM SPEECH, and , in: ICASSP, 2024 |
|
Synergizing Natural Language Towards Enhanced Shared Autonomy, , and , in: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024 |
[URL] |
Synthetic to Authentic: Transferring Realism to 3D Face Renderings for Boosting Face Recognition, , and , in: European Conference on Computer Vision Workshops, 2024 |
|
Test-time adaptation for automatic pathological speech detection in noisy environments, and , in: EUSIPCO, 2024 |
|
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, , , , , , , , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024 |
[URL] |
Towards interfacing large language models with ASR systems using confidence measures and prompting, , , , and , in: Proceedings of Interspeech, pages 2980-2984, 2024 |
[DOI] |
Towards Robo-Coach: Robot Interactive Stiffness/Position Adaptation for Human Strength and Conditioning Training, , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2024 |
Towards Wine Tasting Activity Recognition for a Digital Sommelier, , , and , in: Proc. ACM Workshop on Exploring Innovative Technology for Commensality and Human-Food Interaction, 2024 |
Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification, and , in: Proceedings of the 9th Workshop on Representation Learning for NLP, 2024 |
[URL] |
Understanding the effects of language-specific class imbalance in multilingual fine-tuning, and , in: Findings of the European chapter of Association for Computational Linguistics, 2024, 2024 |
|
Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities, and , in: NeurIPS Workshop on New Frontiers in Adversarial Machine Learning, 2024 |
|
Using Backbone Foundation Model for Evaluating Fairness in Chest Radiography Without Demographic Data, , and , in: Proceedings of the IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024 |
|
Vascular Biometrics Experiments on Candy -- A New Contactless Finger-Vein Dataset, , , , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR), Calcutta (India), 2024 |
|
Vulnerability of Face Age Verification to Replay Attacks, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024 |
|
ZooPFL: Exploring Black-box Foundation Models for Personalized Federated Learning, , , , , , , , and , in: NeurIPS 2024 Workshop on Federated Learning, 2024 |
[URL] |
2023
A benchmark for the simulation of meshed district heating networks based on anonymised monitoring data, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
A Machine Learning Model for the Prediction of Building Hourly Heating Demand from CityGML Files: Training Workflow and Deployment as an API, , and , in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, pages 2932 - 2939, 2023 |
[DOI] [URL] |
A Multitask and Kernel approach for Learning to Push Objects with a Task-Parameterized Deep Q-Network, , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023 |
|
A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence, , , and , in: Interspeech, Dublin, Ireland, ISCA, 2023 |
|
A VAE for Transformers with Nonparametric Variational Information Bottleneck, and , in: The Eleventh International Conference on Learning Representations, 2023 |
[URL] |
Affordance segmentation of hand-occluded containers from exocentric images, , , , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
Approximating Optimal Morphing Attacks using Template Inversion, , and , in: IEEE International Joint Conference on Biometric, 2023 |
[DOI] |
Automatic Speech Analysis Framework for ATC Communication in HAAWAII, , , , , and , in: 13th SESAR Innovation Days, 2023 |
|
Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers’ Workload, , , , , , , , , , , and , in: Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023), Eurocontrol (Europe), FAA (U.S.), Savannah, Georgia, USA, 2023 |
[URL] |
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Black-box Attacks on Image Activity Prediction and its Natural Language Explanations, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
Blackbox Face Reconstruction from Deep Facial Embeddings Using A Different Face Recognition Model, and , in: Proceedings of the IEEE International Conference on Image Processing (ICIP), Kuala Lumpur, Malaysia, pages 2435-2439, 2023 |
[DOI] [URL] |
BLESS: Benchmarking Large Language Models on Sentence Simplification, , , , , , and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023 |
|
Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, and , in: IJCB, 2023 |
|
Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding, and , in: Proc. INTERSPEECH 2023, pages 1109-1113, 2023 |
[DOI] |
Can Language Models Learn Analogical Reasoning? Investigating Training Objectives and Comparisons to Human Performance, and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, Association for Computational Linguistics, 2023 |
|
Can personalised hygienic masks be used to attack face recognition systems?, , , and , in: Proceedings of IEEE International Joint Conference on Biometrics (IJCB2023), 2023 |
|
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?, and , in: Proceedings of Interspeech, 2023 |
|
ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023 |
|
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice, , , and , in: Proc. Interspeech 2023, 2023 |
[URL] |
Complex Daily Activities, Country-Level Diversity, and Smartphone Sensing: A Study in Denmark, Italy, Mongolia, Paraguay, and UK, , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI), Association for Computing Machinery, 2023 |
[DOI] |
Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training, , , , , , and , in: Proc. 13th SESAR Innovation Days, Seville, Spain, 2023 |
[DOI] [URL] |
Data-driven Urban Building Energy Modeling with Machine Learning in Satom (CH), , and , in: 6th International IEEE Conference AND Workshop in Obuda on Electrical and Power Engineering, 2023 |
Demonstration-guided Optimal Control for Long-term Non-prehensile Planar Manipulation, , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 4999-5005, 2023 |
[DOI] |
Diffusion Transformer for Adaptive Text-to-Speech, and , in: Proc. 12th ISCA Speech Synthesis Workshop (SSW 12), 2023 |
[DOI] |
Document-level Text Simplification with Coherence Evaluation, , , and , in: Proceedings of the Second Workshop on Text Simplification, Accessibility and Readability, 2023 |
|
EFaR 2023: Efficient Face Recognition Competition, , , , and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, , , , , , , , and , in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023 |
|
Efficient Grapevine Structure Estimation in Vineyards Conditions, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, pages 712--720, 2023 |
[URL] |
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation, , , , , , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, 2023 |
[URL] |
Energy assessment of a district by integrating solar thermal in district heating network: a dynamic analysis approach, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Enhancing Multi-modal Classification of Violent Events using Image Captioning, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), CEUR Workshop Proceedings, Jaén, Spain, 2023 |
[URL] |
Enhancing user acceptance in automated systems with human-centric lighting: the role of visual comfort, personality, and preference, , , , , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), pages 17408-17419, 2023 |
[DOI] |
Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework, and , in: 2nd ACM International Workshop on Multimedia AI against Disinformation (MAD '23), June 12, 2023, Thessaloniki, Greece, 2023 |
|
Face Reconstruction from Facial Templates by Learning Latent Space of a Generator Network, and , in: Thirty-seventh Conference on Neural Information Processing Systems, 2023 |
[URL] |
Factors that Affect Personalization of Robots for Older Adults, , and , in: CONCATENATE Workshop at HRI 2023 in Stockholm, Sweden, 2023 |
[URL] |
Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, and , in: Proceedings of Interspeech, pages 156-160, 2023 |
[DOI] [URL] |
Findings of the IWSLT 2023 evaluation campaign, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the IWSLT conference, 2023 |
Framing the News: From Human Perception to Large Language Model Inferences, and , in: International Conference on Multimedia Retrieval (ICMR '23), June 12--15, 2023, Thessaloniki, Greece, 2023 |
|
Fully Automatic Grading of Retinal Vasculitis on Fluorescein Angiography Time-lapse from Real-world Data in Clinical Settings, , , , , , , and , in: 2023 IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS), L'Aquila, Italy, 2023, pages 689-693, 2023 |
[DOI] [URL] |
GAP: Differentially Private Graph Neural Networks with Aggregation Perturbation, , , and , in: 32nd USENIX Security Symposium (USENIX Security 23), 2023 |
|
How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, , , , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Human-Robot Collaboration in a Sanding Task, , , , , , , , and , in: Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 2023 |
|
HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition, , , and , in: Proc. Interspeech 2023, Ireland, 2023 |
|
HyperMixer: An MLP-based Low Cost Alternative to Transformers, , , , , , and , in: Proc. of the 61st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Toronto, Canada, pages 15632-15654, 2023 |
[DOI] |
ID and OOD performance are sometimes inversely correlated on real-world datasets, , , and , in: Advances in Neural Information Processing Systems (NeurIPS), 2023 |
Implementing contextual biasing in GPU decoder for online ASR, , , , , , and , in: Proc. Interspeech 2023, 2023 |
|
Implicit phonetic information modeling for speech emotion recognition, , and , in: Interspeech, Dublin, Ireland, ISCA, 2023 |
|
International Conference on the Voynich Manuscript 2022, , , , , , and , in: Proceedings of the International Conference on Historical Cryptology, 2023 |
Inversion of Deep Facial Templates using Synthetic Data, and , in: Proceedings of the IEEE International Joint Conference on Biometric, 2023 |
[DOI] [URL] |
Keep Sensors in Check: Disentangling Country-Level Generalization Issues in Mobile Sensor-Based Models with Diversity Scores, , , and , in: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023 |
|
Learning Disentangled Representations for Natural Language Definitions, , , and , in: In Findings of the European chapter of Association for Computational Linguistics, 2023 |
|
Learning diverse features in vision transformers for improved generalization, , , and , in: ICML 2023: The Second Workshop on Spurious Correlations, Invariance and Stability, 2023 |
[URL] |
Learning Joint Space Reference Manifold for Reliable Physical Assistance, , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 10412-10417, 2023 |
[DOI] |
Learning to Abstract with Nonparametric Variational Information Bottleneck, , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing, 2023 |
[URL] |
Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks, , , , and , in: NeurIPS Workshop on Diffusion Models, 2023 |
[URL] |
MLP-Hash: Protecting Face Templates via Hashing of Randomized Multi-Layer Perceptron, , and , in: Proceedings of the 31st European Signal Processing Conference, Helsinki, Finland, 2023 |
[DOI] [URL] |
Multi-image deconvolution of thermal images with a boundary condition weighting scheme, , , , and , in: Target and Background Signatures IX, International Society for Optics and Photonics, Amsterdam, pages 149-158, SPIE, 2023 |
[DOI] [URL] |
Multi-IVE: Privacy Enhancement of Multiple Soft-Biometrics in Face Embeddings, , , , , , , and , in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023 |
[URL] |
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports, , , , , and , in: Proceedings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023 |
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, , , and , in: Proceedings of Interspeech, 2023 |
|
On Interventional Probing in High Dimensions: An NLI Case Study, , , and , in: Findings of the 17th European Chapter of the Association for Computational Linguistics, 2023 |
Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, , , , , and , in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023 |
|
Potential for district heating networks from waste heat: an assessment tool and its application to sewage treatment plants in the Canton of Zurich, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Quantified Canine: Inferring Dog Personality From Wearables, , , , , and , in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI) 2023, Association for Computing Machinery, 2023 |
[DOI] |
Referencing in YouTube Knowledge Communication Videos, and , in: ACM International Conference on Interactive Media Experiences (IMX '23), June 2023, Nantes, France, 2023 |
|
Remote Cancelable Biometric System for Verification and Identification Applications, , , and , in: Proceedings of the International Conference of the Biometrics Special Interest Group (BIOSIG), 2023 |
[DOI] [URL] |
Robust Execution of Assembly Policies Using a Pose Invariant Task Representation, , , , , and , in: 20th International Conference on Ubiquitous Robots (UR), Honolulu, HI, USA, IEEE, 2023 |
|
RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question, , , , , , and , in: Association for Computational Linguistics: ACL 2023, Toronto, Canada, 2023 |
[URL] |
SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data, , , , , and , in: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Association for Computational Linguistics, 2023 |
[DOI] [URL] |
Shortcut Bias Mitigation via Ensemble Diversity Using Diffusion Probabilistic Models, , , , , and , in: Under review, 2023 |
[URL] |
Situated Participatory Design: A Method for In Situ Design of Robotic Interaction with Older Adults, , and , in: CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023 |
[DOI] [URL] |
SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph Transformer, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023 |
|
Strong and Efficient Baselines for Open Domain Conversational Question Answering, , and , in: Findings of EMNLP, Association for Computational Linguistics, 2023 |
[DOI] [URL] |
Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling, and , in: Procceedings of 8th Workshop on Representation Learning for NLP, 2023 |
[URL] |
SynthDistill: Face Recognition with Knowledge Distillation from Synthetic Data, , and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
[DOI] |
Template Inversion Attack against Face Recognition Systems using 3D Face Reconstruction, and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 19662-19672, 2023 |
[DOI] [URL] |
The AI4Autism Project: A Multimodal and Interdisciplinary Approach to Autism Diagnosis and Stratification, , , , , , , and , in: Companion Publication of the 25th International Conference on Multimodal Interaction, Paris, France, pages 414–425, Association for Computing Machinery, 2023 |
[DOI] [URL] |
The Idiap Speech Synthesis System for the Blizzard Challenge 2023, , , and , in: Proc. 18th Blizzard Challenge Workshop, 2023 |
[DOI] |
The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation, and , in: Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 4408-4423, Association for Computational Linguistics, 2023 |
[DOI] |
The Unconstrained Ear Recognition Challenge 2023: Maximizing Performance and Minimizing Bias, and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
Toward responsible face datasets: modeling the distribution of a disentangled latent space for sampling face images from demographic groups, , and , in: IEEE International Joint Conference on Biometrics, 2023 |
|
Towards Improved Replicability of Human Studies in Human-Robot Interaction: Recommendations for Formalized Reporting, , , , , , , , , , and , in: Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, pages 629-633, 2023 |
|
Towards learning emotion information from short segments of speech, , , , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes Island, Greece, IEEE, 2023 |
|
Transformers as Graph-to-Graph Models, , , and , in: Big Picture Workshop at EMNLP 2023, 2023 |
Transformers, Tables and Frame Semantics, , , and , in: International Conference on Semantic Computing, 2023 |
Understanding the Social Context of Eating with Multimodal Smartphone Sensing: The Role of Country Diversity, , and , in: 25th ACM International Conference on Multimodal Interaction, 2023 |
[DOI] [URL] |
Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report, , , , , and , in: Proceedings of Interspeech, pages 4573-4577, 2023 |
[DOI] [URL] |
Verification of PyDHN - a Python library for the thermo-hydraulic simulation of district heating networks - through the DESTEST, , and , in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, IBPSA, IBPSA, 2023 |
[DOI] [URL] |
VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, , , and , in: Proc. IEEE International Conference on Robotics and Automation (ICRA), 2023 |
|
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes, , , and , in: IEEE International Joint Conference on Biometrics, 2023 |
|
Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines, , , , , , and , in: Under review, 2023 |
[URL] |
2022
A Corpus and Evaluation for Predicting Semi-Structured Human Annotations, , , , and , in: Workshop on Generation, Evaluation and Metrics (GEM), 2022 |
|
A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022 |
|
A two-step approach to leverage contextual data: speech recognition in air-traffic communications, , , , and , in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 |
|
Active Learning by Feature Mixing, , , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022 |
Adversarial-free speaker identity-invariant representation learning for automatic dysarthric speech classification, and , in: Annual Conference of the International Speech Communication Association, 2022 |
|
An anomaly detection approach for backdoored neural networks: face recognition as a case study, and , in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), Darmstadt, Germany, 2022 |
|
An Empirical Comparison of Semantic Similarity Methods for Analyzing down-streaming Automatic Minuting task, , , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In ACL Anthology Proceedings, 2022 |
|
An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36) , In proceedings of ACL Anthology, 2022 |
|
An exploratory interplay between daylight, general and task lighting for visual comfort and electricity savings in a personal office space, , , , , , and , in: Proceedings of ISES and IEA SHC International Conference on Solar Energy for Buildings and Industry, Kassel, Germany, 2022 |
Are GAN-based Morphs Threatening Face Recognition?, , , and , in: International Conference on Acoustics, Speech and Signal Processing, 2022 |
|
Automatic Minuting: A Pipeline Method for Generating Minutes, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology, 2022 |
|
Automatic Summarization for Creative Writing: Denoising Auto-Encoder based Pipeline Method for Generating Summary of Movie Scripts, , , , and , in: Automatic Summarization for Creative Writing, International Conference on Computational Linguistics (COLING 2022), 2022 |
|
Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation, and , in: Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Association for Computational Linguistics, Online, pages 468–488, 2022 |
[URL] |
Bayesian Recurrent Units and the Forward Backward Algorithm, and , in: Proc. Interspeech 2022, pages 4137-4141, 2022 |
[DOI] |
Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, , , , and , in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
Borrowing from yourself: Faster future video segmentation with partial channel update, and , in: International Conference on Pattern Recognition, 2022 |
|
Case-Based Abductive Natural Language Inference, , and , in: Proceedings of the 29th International Conference on Computational Linguistics, 2022 |
[URL] |
Comparing Biosignal and Acoustic feature Representation for Continuous Emotion Recognition, , , , and , in: International Multimodal Sentiment Analysis Workshop and Challenge, 2022 |
|
Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track, , , and , in: Proceedings of the ICML Expressive Vocalizations Workshop held in conjunction with the 39th International Conference on Machine Learning, Maryland, USA, 2022 |
|
Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech, , , , , , , , , and , in: Annual Conference of the International Speech Communication Association, pages 2188-2192, 2022 |
[DOI] |
Conversational Speech Recognition Needs Data? Experiments with Austrian German, , , and , in: Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, pages 4684--4691, 2022 |
[URL] |
Custom attribution loss for improving generalization and interpretability of deepfake detection, , and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
Decomposing Natural Logic Inferences for Neural NLI, , , , and , in: BlackBoxNLP: Workshop on analyzing and interpreting neural networks for NLP, 2022 |
DeepCon: An End-to-End Multilingual Toolkit for Automatic Minuting of Multi-Party Dialogues, , , and , in: Special Interest Group on Discourse and Dialogue (SIGDIAL 2022), 2022 |
|
Demystifying the Scribes behind the Voynich Manuscript using Computational Linguistic Techniques, , and , in: Proceedings of the 1st International Conference on the Voynich Manuscript, 2022 |
DHgeN: Automated Generation of District Heating Network Layouts for Feasibility Studies, and , in: -, 2022 |
|
EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering, , , , and , in: arXiv, 2022 |
Efficient Training of Low-Curvature Neural Networks, , , and , in: NeurIPS 2022, 2022 |
[URL] |
Efficient Wind Speed Nowcasting with GPU-Accelerated Nearest Neighbors Algorithm, , and , in: Proceedings of SIAM Data Mining, Virginia US and Virtual, 2022 |
Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization, , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022 |
|
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, , , , and , in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022 |
[DOI] |
Experimental investigation on STFT phase Representations for deep learning-based dysarthric speech detection, and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
Face Anthropometry Aware Audio-visual Age Verification, and , in: ACM Multimedia, 2022 |
|
Face Reconstruction from Deep Facial Embeddings using a Convolutional Neural Network, , and , in: Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, IEEE, 2022 |
[DOI] [URL] |
Fairness Index Measures to Evaluate Bias in Biometric Recognition, and , in: International Conference on Pattern Recognition Workshops, 2022 |
|
From Undercomplete to Sparse Overcomplete Autoencoders to Improve LF-MMI Speech Recognition, and , in: Proceedings of Interspeech Conference, 2022 |
|
GeoNeRF: Generalizing NeRF with Geometry Priors, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2022 |
[URL] |
Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition, , , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
Graph Refinement for Coreference Resolution, and , in: Findings of Association for >Computational Linguistics: ACL 2022, 2022 |
Health Talk: Understanding Practices of Popular Professional YouTubers, , , , and , in: Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia, 2022 |
Hierarchical Multi-task learning framework for Isometric-Speech Language Translation, , , and , in: ACL, 2022 |
|
HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing, , , and , in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
How Did Europe’s Press Cover Covid-19 Vaccination News? A Five-Country Analysis, and , in: MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, 2022 |
[DOI] [URL] |
Hybrid Autoregressive Inference for Scalable Multi-hop Explanation Regeneration, , , and , in: Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022 |
Hybrid Protection of Biometric Templates by Combining Homomorphic Encryption and Cancelable Biometrics, , , , , and , in: Proceedings of the 2022 International Joint Conference on Biometrics (IJCB), Abu Dhabi, United Arab Emirates (UAE), IEEE, 2022 |
[DOI] [URL] |
IDIAP Submission@LT-EDI-ACL2022 : Hope Speech Detection for Equality, Diversity and Inclusion, and , in: ACL, 2022 |
|
IDIAP Submission@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text, and , in: ACL, 2022 |
|
IDIAP Submission@LT-EDI-ACL2022: Homophobia/Transphobia Detection in social media comments, and , in: ACL Proceedings, 2022 |
|
IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
IDIAP_TIET@LT-EDI-ACL2022 : Hope Speech Detection in Social Media using Contextualized BERT with Attention Mechanism, , and , in: ACL, 2022 |
|
Imitation of Manipulation Skills Using Multiple Geometries, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022 |
|
Indexing Protected Deep Face Templates by Frequent Binary Patterns, , , , and , in: Proceedings of the 2022 International Joint Conference on Biometrics (IJCB), Abu Dhabi, United Arab Emirates (UAE), IEEE, 2022 |
[DOI] [URL] |
Innovators@SMM4H'22: An Ensembles Approach for self-reporting of COVID-19 Vaccination Status Tweets, , , , and , in: International Conference on Computational Linguistics (COLING 2022), 2022 |
|
Innovators@SMM4H'22: An Ensembles Approach for Stance and Premise Classification of COVID-19 Health Mandates Tweets, , , , and , in: International Conference on Computational Linguistics (COLING 2022), 2022 |
Learning to Guide Online Multi-Contact Receding Horizon Planning, , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022 |
Leveraging Events Sub-Categories for Violent-Events Detection in Social Media, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022 |
[URL] |
Local estimation of parametric point spread functions in thermal images via convolutional neural networks, , , and , in: SPIE sensors + imaging, Target and Background Signatures VIII, Berlin, Germany, pages 1227009 1--8, SPIE, 2022 |
[DOI] [URL] |
Low-Level Physiological Implications of End-to-End Learning for Speech Recognition, and , in: Proc. Interspeech 2022, pages 749--753, 2022 |
[DOI] |
Modeling Of Pre-trained Neural Network Embeddings Learned From Raw Waveform For Covid-19 Infection Detection, , , and , in: Proceedings of ICASSP, 2022 |
|
Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers, , , , and , in: Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, Seattle, USA, pages 89–100, 2022 |
|
Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers, , , and , in: International Conference on Language Resources and Evaluation (LREC 2022), 2022 |
|
On Breathing Pattern Information in Synthetic Speech, and , in: Proceedings of Interspeech, 2022 |
|
On the detection of morphing attacks generated by GANs, and , in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), 2022 |
|
On-demand compute reduction with stochastic wav2vec 2.0, , , and , in: Proceedings of Interspeech, 2022 |
|
Paumer: Patch Pausing Transformer for Semantic Segmentation, , and , in: 33th British Machine Vision Conference 2022, London, UK, 21 - 24 November 2022, 2022 |
|
PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models, , , , , , and , in: ACL, 2022 |
|
Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese, , , , and , in: Proceedings of the workshop on Deep Learning for Low-Resource NLP, 2022 |
[URL] |
Predicting is not understanding: Recognizing and addressing underspecification in machine learning, , and , in: European Conference on Computer Vision, pages 458-476, Springer, 2022 |
Pulmonary Tuberculosis Screening from Radiological Signs on Chest X-Ray Images Using Deep Models, , and , in: Union World Conference on Lung Health, The Union, 2022 |
Reactive Anticipatory Robot Skills with Memory, , and , in: The International Symposium on Robotics Research, 2022 |
|
Readback Error Detection by Automatic Speech Recognition and Understanding -- Results of HAAWAII Project for Isavia’s Enroute Airspace, , , , , , , , , and , in: 11th SESAR Innovation Days, SESAR, pages 9, 2022 |
|
Reasoning over vision and language: Exploring the benefits of supplemental knowledge, , , and , in: arXiv, 2022 |
Residual Feature Pyramid Network for Enhancement of Vascular Patterns, and , in: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022 |
|
SelecMix: Debiased Learning by Contradicting-pair Sampling, , , , , , and , in: Advances in Neural Information Processing Systems, 2022 |
SelecMix: Debiased Learning by Mixing up Contradicting Pairs, , , , , , and , in: ICML Workshop on Spurious Correlations, Invariance and Stability, 2022 |
Shallow Discourse Parsing for Open Information Extraction and Text Simplification, , and , in: 3rd Workshop on Computational Approaches to Discourse (CODI) @ COLING, 2022 |
SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages, , , , , and , in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022 |
|
Speaker recognition on mono-channel telephony recordings, , , , and , in: The Speaker and Language Recognition Workshop, 2022 |
|
Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator, , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
Symmetry-induced Disentanglement on Graphs, , and , in: Advances in Neural Information Processing Systems 35, 2022 |
Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective, , , , and , in: Findings of the ACL, 2022 |
Team Innovators at SemEval-2022 for Task 8: Multi-Task Training with Hyperpartisan and Semantic Relation for Multi-Lingual News Article Similarity, , , , and , in: Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), NAACL 2022, 2022 |
|
TextGraphs 2022 Shared Task on Natural Language Premise Selection, , , , and , in: Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing, 2022 |
[URL] |
The Winning Approach for the Recommendation Systems Shared Task @REST_MEX 2022, , , , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022 |
[URL] |
To be or not to be an Integer? Encoding Variables for Mathematical Text, , , , and , in: Findings of the ACL, 2022 |
Towards Accessible Sign Language Learning and Assessment, , , and , in: ACM International Conference on Multimodal Interaction, Bangalore, INDIA, pages 626-631, 2022 |
[DOI] |
Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, , , and , in: ACM International Conference on Multimodal Interaction (ICMI Companion), 2022 |
[DOI] |
Towards energy hubs: an innovative Geographic Information System based approach for cluster definition, , , and , in: ICREC 2022 Conference Proceedings, 2022 |
UM-DFKI Maltese Speech Translation, , , , , , , , , and , in: Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), 2022 |
UNSL at eRisk 2022: Decision policies with history for early classification, , , and , in: CEUR Workshop Proceedings, 2022 |
[URL] |
Unsupervised Token-level Hallucination Detection from Summary Generation By-products, and , in: Workshop on Generation, Evaluation and Metrics (GEM), 2022 |
|
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering, , and , in: Proceedings of Interspeech, 2022 |
|
Vision-Language Pretraining: Current Trends and the Future, , and , in: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2022 |
[URL] |
Visually Grounded Interpretation of Noun-Noun Compounds in English, , , and , in: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, Association for Computational Linguistics, 2022 |
Voyager: Data Discovery for Onboarding in Data Science, , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2022 |
What Do Compressed Multilingual Machine Translation Models Forget?, , , , , and , in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022 |
|
Why Scholars Are Diagramming Neural Network Models, , and , in: 13th International Conference on the Theory and Application of Diagrams, 2022 |
Your Day in Your Pocket: Complex Activity Recognition from Smartphone Accelerometers, , and , in: EAI Pervasive Health, 2022 |
|
2021
A Bayesian Interpretation of the Light Gated Recurrent Unit, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
[DOI] |
A Comparative Study Of Simulation Tools To Model The Solar Irradiation On Building Façades, , , , , , , , , , , , , , , and , in: Proceedings of SWC 2021: ISES Solar World Congress, ISES, 2021 |
[DOI] [URL] |
A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET, , and , in: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Toronto, Ontario, Canada, 2021 |
|
A Laser-based Dual-arm System for Precise Control of Collaborative Robots, , and , in: IEEE International Conference on Robotics and Automation, 2021 |
|
A machine-learning model for the prediction of aggregated building heating demand from pan-European land-use maps, , and , in: Journal of Physics: Conference Series, 2021 |
[DOI] |
An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, , and , in: Proc. of Workshop on Emerging paradigms for robotic manipulation: from the lab to the productive world, ICRA, 2021 |
An Efficient Image-to-Image Translation HourGlass-based Architecture for Object Pushing Policy Learning, , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021 |
|
An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021 |
|
An Objective Evaluation Framework for Pathological Speech Synthesis, , , , , and , in: Proceedings of ITG Conference on Speech Communication, 2021 |
|
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, , , , and , in: Proceedings of the 2021 International Conference on Multimodal Interaction, ACM, 2021 |
[DOI] |
Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning, , , , , , , and , in: 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, pages 1-9, IEEE, 2021 |
[DOI] |
Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech, , , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
|
Automatic Dialect Detection for Low Resource Santali Language, , , , , and , in: Proceeding of International Conference on Information Technology (OCIT), 2021 |
|
AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, Toronto, Canada, pages 7328–7332, 2021 |
|
Automatic processing pipeline for collecting and annotating air-traffic voice communication data, , , , , , , , , and , in: Proceedings of 9th OpenSky Symposium 2020, OpenSky Network, Brussels, Belgium, pages 1-9, MDPI, 2021 |
|
Beyond question-based biases: Assessing multimodal shortcut learning in visual question answering, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Boosting of contextual information in ASR for air-traffic call-sign recognition, , , , , , , and , in: Interspeech 2021, 2021 |
|
Challenges for Using Impact Regularizers to Avoid Negative Side Effects, , and , in: SafeAI 2021 - AAAI's Workshop on Artificial Intelligence Safety, 2021 |
|
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers, , and , in: NeurIPS, 2021 |
|
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, , and , in: Proceedings of Interspeech, 2021 |
[URL] |
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , in: Interspeech 2021, 2021 |
[URL] |
Cost–effective Variational Active Entity Resolution, , , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2021 |
[URL] |
Cross Modal Focal Loss for RGBD Face Anti-Spoofing, and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 |
|
Deep Auto-Encoding and Biohashing for Secure Finger Vein Recognition, and , in: Proceedings of the 2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Toronto, Canada, 2021 |
[DOI] [URL] |
DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6039-6048, 2021 |
[URL] |
Development of a lung segmentation algorithm for analog imaged chest X-Ray: preliminary results, , , , and , in: XV Brazilian Congress on Computational Intelligence, Joinville, Brazil, 2021 |
[URL] |
Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders, and , in: The 2021 Conference on Empirical Methods in Natural Language Processing, 2021 |
District heating network modelling for future integration of solar thermal energy, , , and , in: Journal of Physics: Conference Series, pages 012089, IOP Publishing, 2021 |
[DOI] |
Do Natural Language Explanations Represent Valid Logical Arguments? Verifying Entailment in Explainable NLI Gold Standards, , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
Does My Representation Capture X? Probe-Ably, , , , and , in: 59th Annual Meeting of the Association for Computational Linguistics (Demonstration track), 2021 |
[URL] |
Encoding Explanatory Knowledge for Zero-shot Science Question Answering, , , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
Estimating Nonplanar Flow from 2D Motion-blurred Widefield Microscopy Images via Deep Learning, and , in: International Symposium on Biomedical Imaging, 2021, 2021 |
|
Explainable Inference Over Grounding-Abstract Chains for Science Questions, , and , in: 59th Annual Meeting of the Association for Computational Linguistics (ACL Findings), 2021 |
|
Explainable Natural Language Reasoning via Conceptual Unification, , and , in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |
[URL] |
Face Liveness Detection Competition (LivDet-Face) - 2021, , , , , , , , , and , in: International Joint Conference on Biometrics, 2021 |
Fusion of Acoustic and Linguistic Information Using Supervised Autoencoder for Improved Emotion Recognition, , and , in: 2nd Multimodal Sentiment Analysis Challenge (MuSe '21), October 24, 2021, Virtual Event, China, 2021 |
[DOI] |
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10--13, 2021 |
[DOI] |
Handling acoustic variation in dysarthric speech recognition systems through model combination, and , in: Proceedings of Interspeech, 2021 |
|
Identification of F1 and F2 in speech using modified zero frequency filtering, and , in: Proceedings of Interspeech, 2021 |
|
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Implementation of machine learning techniques for the quasi real-time blind and electric lighting optimization in a controlled experimental facility, , , , , and , in: Journal of Physics: Conference Series, IOP Publishing, 2021 |
[DOI] [URL] |
Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning, , , and , in: Proceedings of the 25th Conference on Computational Natural Language Learning, Online, pages 337-348, Association for Computational Linguistics, 2021 |
Improving Emotional TTS with an Emotion Intensity Input from Unsupervised Extraction, and , in: 11th ISCA Speech Synthesis Workshop, 2021 |
[URL] |
Improving Generalization of Deepfake Detection by Training for Attribution, , and , in: International Workshop on Multimedia Signal Processing, 2021 |
|
Intrinsically-Motivated Robot Learning of Bayesian Probabilistic Movement Primitives, and , in: ICRA workshop: "Towards Curious Robots: Modern Approaches for Intrinsically-Motivated Intelligent Behavior", 2021 |
|
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, , , , , and , in: Proceedings of Interspeech 2021, ISCA-International Speech Communication Association 2021, 2021 |
|
LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021 |
[URL] |
Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, Colombia, Cartagena, IEEE, 2021 |
|
Locally Private Graph Neural Networks, and , in: ACM Conference on Computer and Communications Security (CCS), 2021 |
|
Machine learning techniques for the daylight and electric lighting performance predictions, , and , in: Proceedings of Building Simulation 2021, 2021 |
Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates, , , , , , , , and , in: 11th SESAR Innovation Days, 2021 |
|
Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, , and , in: Proceedings of Interspeech, 2021 |
[DOI] |
Multi-Adversarial Learning for Cross-Lingual Word Embeddings, , and , in: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Online, pages 463-472, 2021 |
Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction, , and , in: Proceedings of Interspeech 2021, 2021 |
Multi-task Single Channel Speech Enhancement Using Speech Presence Probability As A Secondary Task Training Target, , and , in: European Signal Processing Conference, EUSIPCO 2021, 2021 |
|
Multimodal Neural Machine Translation System for English to Bengali, , , , , , and , in: Proceedings of the First Workshop on Multimodal Machine Translation for Low Resource Languages (MMTLRL 2021), Online (Virtual Mode), pages 31--39, INCOMA Ltd., 2021 |
[URL] |
Multitask adaptation with Lattice-Free MMI for multi-genre speech recognition of low resource languages, , and , in: Proceedings of Interspeech 2021, 2021 |
|
Natural Language Inference over Tables: Enabling Explainable Data Exploration on Data Lakes, , , and , in: 18th Extended Semantic Web Conference (ESWC), 2021 |
[URL] |
NLPHut's Participation at WAT2021, , , , , , , and , in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 146--154, Association for Computational Linguistics, 2021 |
[URL] |
On Modeling Glottal Source Information for Phonation Assessment in Parkinson’s Disease, , , , and , in: Proceedings of Interspeech, 2021 |
|
On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, and , in: International Joint Conference on Biometrics (IJCB 2021), 2021 |
|
On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning, , , and , in: Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021 |
On the Recognition Performance of BioHashing on state-of-the-art Face Recognition models, , and , in: Proceedings of the 13th IEEE International Workshop on Information Forensics and Security (WIFS), Montpellier, France, IEEE, 2021 |
[DOI] [URL] |
On The Relationship Between Speech-based Breathing Signal Prediction Evaluation Measures And Breathing Parameters Estimation, , , , and , in: Proc. of ICASSP, 2021 |
|
On the use of automatically generated synthetic image datasets for benchmarking face recognition, , and , in: International Joint Conference on Biometrics (IJCB 2021), 2021 |
|
Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), , , , , , , , and , in: Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, pages 218–223, Association for Computational Linguistics, 2021 |
[DOI] [URL] |
Open-Set Speaker Identification pipeline in live criminal investigations, and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021 |
|
Optics Versus Computation: Influence of Illumination and Reconstruction Model Accuracy in Focal-Plane-Scanning Optical Projection Tomography, and , in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), Nice, France, pages 567-570, IEEE, 2021 |
[DOI] |
Optimal Control Combining Emulation and Imitation to Acquire Physical Assistance Skills, , and , in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021 |
|
Optimization of robot configurations for motion planning in industrial riveting, , , and , in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021 |
|
Overview of the 8th Workshop on Asian Translation, , , , , , , , , , , , , , , and , in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 1--45, Association for Computational Linguistics, 2021 |
[URL] |
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks, , , and , in: ACL, 2021 |
|
Perspectives and limitations of visible-thermal image pair synthesis via generative adversarial networks, , , , and , in: Security + Defence, Target and Background Signatures VII, Proc. of SPIE, online only, pages 1186509-1--1186509-8, SPIE, 2021 |
[DOI] [URL] |
Phoneme based Respiratory Analysis of Read Speech, , , and , in: Proceedings of European Signal Processing Conference (EUSIPCO), 2021 |
|
Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers, , and , in: International Conference in Computer Vision - Workshops, 2021 |
|
Probabilistic Iterative LQR for Short Time Horizon MPC, and , in: International Conference on Intelligent Robots and Systems, pages 579-585, 2021 |
[DOI] |
PROMPT: Probabilistic Motion Primitives based Trajectory Planning, , , and , in: Proceedings of Robotics: Science and Systems, 2021 |
[DOI] [URL] |
Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety, , , , , , , , , , , , , and , in: Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021), The United States Federal Aviation Administration (FAA), EUROCONTROL, pages 10, 2021 |
[URL] |
Rethinking the Role of Gradient-based Attribution Methods for Model Interpretability, and , in: International Conference on Learning Representations, 2021 |
|
Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances, , , , , , , and , in: Interspeech, 2021 |
|
ROXANNE Research Platform: Automate criminal investigations, , , , , and , in: Interspeech Show and Tell 2021, 2021 |
|
ROXSD: a Simulated Dataset of Communication in Organized Crime, , , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021 |
|
Scholarly AI system diagrams as an access point to mental models, , and , in: Diagrams, 2021 |
Sentence-level Planning for Especially Abstractive Summarization, and , in: Proceedings of the Third Workshop on New Frontiers in Summarization, pages 1--14, Association for Computational Linguistics, 2021 |
[URL] |
Speech Activity Detection Based on Multilingual Speech Recognition System, , and , in: Interspeech, 2021 |
|
STAR: Cross-modal Statement Representation for Selecting Relevant Mathematical Premises, and , in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |
Structuralist analysis for neural network system diagrams, , and , in: Diagrams, 2021 |
Subjective and objective evaluation of deepfake videos, and , in: The international Conference on Acoustics, Speech, and Signal Processing, 2021 |
|
Supervised Speech Representation Learning for Parkinson's Disease Classification, and , in: ITG Conference on Speech Communication, 2021 |
|
Supporting Context Monotonicity Abstractions in Neural NLI Models, , , , and , in: Natural Logic Meets Machine Learning Workshop, 2021 |
[URL] |
Switching Contexts: Transportability Measures for NLP, , , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling, and , in: Arxiv, 2021 |
|
Test time Adaptation through Perturbation Robustness, and , in: Workshop on Distribution Shifts, 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021 |
|
The DCU-EPFL Enhanced Dependency Parser at the IWPT 2021 Shared Task, , , , and , in: Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies, Online, pages 204-212, Association for Computational Linguistics, 2021 |
|
The Theory, Practice, and Ethical Challenges of Designing a Diversity-Aware Platform for Social Relations, , , , , , , and , in: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, pages 11, ACM, 2021 |
[DOI] |
Trajectory Prediction with Compressed 3D Environment Representation using Tensor Train Decomposition, , , and , in: International Conference on Advanced Robotics, 2021 |
|
Trust indicators and explainable AI: A study on user perceptions, , , , , , and , in: Proc. Int. Conf. on Human-Computer Interaction, Bari, Italy, 2021 |
|
Uncertainty Reduction for Model Adaptation in Semantic Segmentation, and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 |
|
Unequivocal cardiac phase sorting from alternating ramp- and pulse-illuminated microscopy image sequences, , , , and , in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), pages 868-872, 2021 |
[DOI] [URL] |
Unification-based Reconstruction of Multi-hop Explanations for Science Questions, , and , in: 16th conference of the European Chapter of the Association for Computational Linguistics, 2021 |
[URL] |
Unshuffling data for improved generalization in visual question answering, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Variational Information Bottleneck for Effective Low-Resource Fine-Tuning, , and , in: ICLR, 2021 |
|
Vein Enhancement with Deep Auto-Encoders to improve Finger Vein Recognition, , and , in: Biometrics Special Interest Group (BIOSIG 2021), 2021 |
|
Visual Focus of Attention Estimation in 3D Scene with an Arbitrary Number of Targets, and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 9, IEEE, 2021 |
|
What is SemEval evaluating? A Systematic Analysis of Evaluation Campaigns in NLP, , , and , in: EMNLP, 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021 |
Whole Body Model Predictive Control with a Memory of Motion:Experiments on a Torque-Controlled Talos, , , , , , , , , , , and , in: IEEE International Conference on Robotics and Automation, 2021 |
|
Zurich Like New: Analyzing Open Urban Multimodal Data, , and , in: Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data, 2021 |
|
2020
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition, , , , , , , , , , and , in: Proceedings of Interspeech, pages 2182-2186, 2020 |
|
A memory of motion for visual predictive control tasks, , and , in: International Conference on Robotics and Automation, 2020 |
|
A Phonology-based Approach for Isolated Sign Production Assessment in Sign Language, , , and , in: Companion Publication of the 2020 International Conference on Multimodal Interaction (ICMI '20 Companion), 2020 |
|
Active Improvement of Control Policies with Bayesian Gaussian Mixture Model, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020 |
Alone or With Others? Understanding Eating Episodes of College Students with Mobile Sensing, , and , in: 19th International Conference on Mobile and Ubiquitous Multimedia, ACM, Essen, Germany, pages 162–166, Association for Computing Machinery, 2020 |
[DOI] [URL] |
An HMM Approach with Inherent Model Selection for Sign Language and Gesture Recognition, , and , in: Proceedings of the International Conference on Language Resources and Evaluation LREC 2020, 2020 |
|
An Integrated and strategic evaluation of automatic blind controls to achieve energy and occupant's comfort objectives, and , in: Proceedings of the 5th IBPSA-England Conference on Building Simulation and Optimization (Virtual), Loughborough, UK, 2020 |
[URL] |
Analysis and Transfer of Human Movement Manipulability in Industry-like Activities, , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020 |
|
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, , , , , , , , , , , , , , , and , in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020 |
[DOI] [URL] |
Automatic Discrimination of Apraxia of Speech and Dysarthria using a Minimalistic Set of Handcrafted Features, , , and , in: Interspeech, 2020 |
|
Automatic Speech Recognition Benchmark for Air-Traffic Communications, , , , and , in: Proc. Interspeech 2020, pages 2297-2301, 2020 |
[DOI] |
BertAA: BERT fine-tuning for Authorship Attribution, , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR, and , in: IEEE International Conference on Image Processing, 2020 |
|
DeepFocus: a Few-shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function, and , in: International Symposium on Biomedical Imaging, 2020 |
|
Detection of S1 and S2 locations in phonocardiogram signals using zero frequency filter, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |
|
Detection of Similar Languages and Dialects Using Deep Supervised Autoencoders, , , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
DOMAIN ADAPTATION FOR GENERALIZATION OF FACE PRESENTATION ATTACK DETECTION IN MOBILE SETTINGS WITH MINIMAL INFORMATION, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, IEEE, 2020 |
[URL] |
Dysarthric Speech Recognition with Lattice-Free MMI, and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020 |
[DOI] [URL] |
End-to-End Bias Mitigation by Modelling Biases in Corpora, , and , in: ACL, 2020 |
|
Estimating The Degree of Sleepiness by Integrating Articulatory Feature Knowledge In Raw Waveform Based CNNs, , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020 |
|
Exact Preimages of Neural Network Aircraft Collision Avoidance Systems, and , in: Machine Learning for Engineering Modeling, Simulation, and Design Workshop at Neural Information Processing Systems 2020, 2020 |
|
Fast Transformers with Clustered Attention, , and , in: Proceedings of the International Conference on Neural Information Processing Systems, 2020 |
Fourier movement primitives: an approach for learning rhythmic robot skills from demonstrations, , and , in: Robotics: Science and Systems, 2020 |
|
Generating Master Faces for Use in PerformingWolf Attacks on Face Recognition Systems, , , and , in: International Join Conference on Biometrics, 2020 |
|
Generative adversarial training of product of policies for robust and adaptive movement primitives, , and , in: In Proc. Conference on Robot Learning (CoRL), 2020 |
|
Graph-to-Graph Transformer for Transition-based Dependency Parsing, and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2020 |
[URL] |
Graph-to-Graph Transformer for Transition-based Dependency Parsing, and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, ACL, Online, pages 3278–3289, Association for Computational Linguistics, 2020 |
[URL] |
Idiap & UAM participation at GermEval 2020: Classification and Regression of Cognitive and Motivational Style from Text, , , , and , in: Proceedings of the GermEval 2020 Shared Task on the Classification and Regression of Cognitive and Motivational style from Text, 2020 |
[URL] |
Idiap and UAM Participation at MEX-A3T Evaluation Campaign, , , , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020) co-located with 36th Conference of the Spanish Society for Natural Language Processing (SEPLN 2020), pages 6, CEUR Workshop Proceedings, 2020 |
[URL] |
Idiap Submission to Swiss-German Language Detection Shared Task, , , , and , in: Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), CEUR Workshop Proceedings, 2020 |
[URL] |
IMPROVING CROSS-DATASET PERFORMANCE OF FACE PRESENTATION ATTACK DETECTION SYSTEMS USING FACE RECOGNITION DATASETS, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, IEEE, 2020 |
[URL] |
INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION, , , , , and , in: Proceedings of ICASSP 2020, 2020 |
|
Iris Liveness Detection Competition (LivDet-Iris) – The 2020 Edition, , , , , , , , , , , , , and , in: INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2020), 2020 |
[URL] |
Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System, , , , , and , in: In Proceedings of Interspeech 2020, pages 4746--4750, ISCA, 2020 |
|
Learning How to Walk: Warm-starting Optimal Control Solver with Memory of Motion, , , , and , in: International Conference on Robotics and Automation, 2020 |
|
Learning Urban Nightlife Routines from Mobile Data, , and , in: Proc. Int. Conf. on Mobile and Ubiquitous Multimedia, Essen, Germany, 2020 |
|
Learning, Generating and Adapting Wave Gestures for Expressive Human-Robot Interaction, , and , in: Proc. ACM/IEEE Intl Conf. on Human-Robot Interaction (HRI), pages 386-388, 2020 |
[DOI] [URL] |
ManiGaze: a Dataset for Evaluating Remote Gaze Estimator in Object Manipulation Situations, , and , in: Symposium on Eye Tracking Research and Applications, Stuttgart, Germany, pages 5, ACM, 2020 |
[DOI] |
ODIANLP's Participation in WAT2020, , , , , , , , and , in: Proceedings of the 7th Workshop on Asian Translation, ACL Anthology, 2020 |
|
Optimizer Benchmarking Needs to Account for Hyperparameter Tuning, , , , and , in: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, 2020 |
[URL] |
Overview of the 7th Workshop on Asian Translation, , , , , , , , , , , and , in: Proceedings of the 7th Workshop on Asian Translation, Association for Computational Linguistics, 2020 |
[URL] |
Partially-supervised Mention Detection, and , in: Proceedings of the Third Workshop on Computational Models of Reference, Anaphora and Coreference, 2020 |
|
Plucking Motions for Tea Harvesting Robots Using Probabilistic Movement Primitives, , , and , in: IEEE International Conference on Robotics and Automation, 2020 |
Plug and Play Autoencoders for Conditional Text Generation, , , , and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Online, 2020 |
|
Protecting Mobile Food Diaries from Getting too Personal, , and , in: 19th International Conference on Mobile and Ubiquitous Multimedia, Essen, Germany, pages 212–222, Association for Computing Machinery, 2020 |
[DOI] [URL] |
pyannote.audio: neural building blocks for speaker diarization, , , , , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020 |
[URL] |
Real-Time Segmentation Networks should be Latency Aware, and , in: Asian Conference on Computer Vision, 2020 |
|
Residual Pose: A Decoupled Approach for Depth-based 3D Human Pose Estimation, , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020 |
|
Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks, , , , , and , in: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, pages 7499-7503, 2020 |
[DOI] |
Supervised domain adaptation for text-independent speaker verification using limited data, , , and , in: Interspeech, pages 3815-3819, 2020 |
[URL] |
SYNTHETIC SPEECH REFERENCES FOR AUTOMATIC PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT, , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020 |
|
The MuMMER data set for Robot Perception in multi-party HRI Scenarios, , , and , in: Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, 2020 |
|
The societal and ethical relevance of computational Creativity, , and , in: Proceedings of the International Conference on Computational Creativity, 2020 |
The Unstoppable Rise of Computational Linguistics in Deep Learning, , in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Online, pages 6294-6306, Association for Computational Linguistics, 2020 |
[DOI] [URL] |
Towards Multilingual Sign Language Recognition, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |
|
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention, , , and , in: Proceedings of International Conference on Machine Learning, 2020 |
Understanding Applicants' Reactions to Asynchronous Video Interviews through Self-Reports and Nonverbal Cues, , , , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction (ICMI), Utrecht, 2020 |
|
Understanding Heavy Drinking at Night through Smartphone Sensing and Active Human Engagement, , , and , in: Proceedings of the 14th EAI International Conference on Pervasive Computing Technologies for Healthcare, 2020 |
|
Unsupervised Representation Learning for Gaze Estimation, and , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020 |
|
Variational Inference with Mixture Model Approximation for Applications in Robotics, , and , in: International Conference on Robotics and Automation, 2020 |
|
2019
#Drink Or #Drunk: Multimodal Signals and Drinking Practices on Instagram, , and , in: Proceedings of the 13th EAI International Conference on Pervasive Computing Technologies for Healthcare, Trento, Italy, 2019 |
|
A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION, , and , in: In Proceedings of ICASSP 2019, Brighton, ENGLAND, pages 5786-5790, 2019 |
|
A Deep Learning Approach for Robust Head Pose Independent Eye Movements Recognition from Videos, , and , in: 2019 ACM Symposium on Eye Tracking Research and Applications, pages 5, ACM, 2019 |
[DOI] |
A Learning-Based Framework for Quantized Compressed Sensing, , and , in: A Learning-Based Framework for Quantized Compressed Sensing, 2019 |
|
A morphological based PV generation and energy consumption predictive model for Singapore neighbourhood, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
A smart luminaire in an office environment: impact on light distribution, user interactions and comfort, , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Abstract Text Summarization: A Low Resource Challenge, and , in: In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), HongKong, China, pages 5, Association for Computational Linguistics (ACL), 2019 |
|
Adaptation of Assistant Based Speech Recognition to New Domains and Its Acceptance by Air Traffic Controllers, , , , , , , , , and , in: Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems, San Diego, California, USA, pages 820 - 826, 2019 |
[DOI] |
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019 |
[DOI] |
Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task, , and , in: Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), Hong Kong, pages 27-33, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019 |
[URL] |
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, , , , and , in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pages 7040-7044, IEEE, 2019 |
[DOI] [URL] |
AN INVESTIGATION OF MULTILINGUAL ASR USING END-TO-END LF-MMI, , and , in: International Conference on Acoustics, Speech and Signal Processing, 2019 |
|
ANALYZING UNCERTAINTIES IN SPEECH RECOGNITION USING DROPOUT, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
Automatic Diagnosis of Alzheimer's Disease Using Neural Network Language Models, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Bayesian Optimization Meets Riemannian Manifolds in Robot Learning, , , and , in: Conference on Robot Learning, 2019 |
|
BookTubing Across Regions: Examining Differences based on Nonverbal and Verbal Cues, , and , in: Proc. ACM Int. Conf. on Interactive Experiences for Television and Online Video (TVX), Salford, ENGLAND, 2019 |
[DOI] |
Building energy models with Morphological urban-scale parameters: a case study in Turin, , , , , and , in: Proceedings of 4th Building Simulation Applications Conference - BSA 2019, 2019 |
[URL] |
CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, , and , in: International Conference on Learning Representations, New Orleans, Louisiana, USA, 2019 |
[URL] |
CityLearn v1.0: An OpenAI Gym Environment for Demand Response with Deep Reinforcement Learning, , , and , in: Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, New-York, USA, pages 356-357, ACM, 2019 |
[DOI] |
CO2 experimental measurements towards the development of a predictive framework using user actions in smart buildings, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, , , , , and , in: Proceedings of APSIPA ASC 2019, 2019 |
Custom Silicone Face Masks - Vulnerability of Commercial Face Recognition Systems & Presentation Attack Detection, , , , , , and , in: Proceedings of 7th IAPR/IEEE International Workshop on Biometrics and Forensics, 2019 |
|
Daylight regulated by automated external Venetian blinds based on HDR sky luminance mapping in winter, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Deep Pixel-wise Binary Supervision for Face Presentation Attack Detection, and , in: International Conference on Biometrics, 2019 |
|
Deep Residual Output Layers for Neural Language Generation, and , in: Proceedings of the 36th International Conference on Machine Learning (ICML), 2019 |
|
Discovering Eating Routines in Context with a Smartphone App, , , and , in: Ubicomp/Iswc'19 Adjunct: Proceedings Of The 2019 Acm International Joint Conference On Pervasive And Ubiquitous Computing And Proceedings Of The 2019 Acm International Symposium On Wearable Computers, London, pages 422-429, 2019 |
[DOI] |
Domain Adaptation in Multi-Channel Autoencoder based Features for Robust Face Anti-Spoofing, , and , in: International Conference on Biometrics 2019, IEEE, 2019 |
|
EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, and , in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019 |
|
End-to-End Accented Speech Recognition, , and , in: International Conference on Speech and Language Processing, Interspeech, ISCA, Graz, Austria, pages 2140-2144, 2019 |
[DOI] |
Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition, , , and , in: Proc. of Interspeech 2019, 2019 |
Full-Gradient Representation for Neural Network Visualization, and , in: Advances in Neural Information Processing Systems, 2019 |
[URL] |
Generalized temporal sampling with active illumination in optical microscopy, and , in: Proceeding of the SPIE Conference Optics and Photonics, Wavelets and Sparsity XVIII, SPIE, San Diego, California, United States, SPIE, 2019 |
|
HMM-based Approaches to Model Multichannel Information in Sign Language inspired from Articulatory Features-based Speech Processing, , , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Idiap Abstract Text Summarization System for German Text Summarization Task, and , in: Proceedings of the 4th edition of the Swiss Text Analytics Conference, 2019 |
[URL] |
Idiap NMT System for WAT 2019 Multimodal Translation Task, and , in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 175–180, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
Implicit discourse relation classification with syntax-aware contextualized word representations, , , and , in: Proceedings of the 32nd International Florida Artificial Intelligence Research Society Conference, 2019 |
Improving Children Speech Recognition through Feature Learning from Raw Speech Signal, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Improving dual-arm assembly by master-slave compliance, , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation, pages 8676-8682, 2019 |
|
Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis, , and , in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019 |
INCREMENTAL TRANSFER LEARNING IN TWO-PASS INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, , , and , in: Proceedings of ICASSP 2019, pages 6291-6295, 2019 |
Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, pages 795--799, 2019 |
Learning an event sequence embedding for event-based deep stereo, , , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2019 |
Learning from demonstration with model-based Gaussian process, , and , in: Conference on Robot Learning, 2019 |
|
Learning voice source related information for depression detection, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Multi-agent reinforcement learning for adaptive demand response in smart cities, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Multi-Spectral Widefield Microscopy of the Beating Heart through Post-Acquisition Synchronization and Unmixing, , , and , in: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), Venice, Italy, pages 1382-1385, 2019 |
[DOI] |
Multilingual Bottleneck Features for Query by Example Spoken Term Detection, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2019 |
|
Neural VTLN for Speaker Adaptation in TTS, and , in: Proc. 10th ISCA Speech Synthesis Workshop, ISCA, Vienna, Austria, pages 6, 2019 |
[DOI] |
Open-Vocabulary Keyword Spotting With Audio And Text Embeddings, , , and , in: Proceedings of Interspeech 2019, 2019 |
[DOI] |
Overview of the 6th Workshop on Asian Translation, , in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 1–35, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT BASED ON THE SHORT-TIME OBJECTIVE INTELLIGIBILITY MEASURE, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, UK, pages 6405--6409, 2019 |
|
Processing Megapixel Images with Deep Attention-Sampling Models, and , in: Proceedings of International Conference on Machine Learning, 2019 |
[URL] |
Reducing Noise in GAN Training with Variance Reduced Extragradient, , , and , in: Proceedings of the international conference on Neural Information Processing Systems, 2019 |
Reinforcement learning of trajectory distributions: Applications in assisted teleoperation and motion planning, , , , , and , in: IEEE International Conference on Intelligent Robots and Systems, 2019 |
Retrofitting, district heating and energy storage: neighborhood energy planning, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
SARAL: A Low-Resource Cross-Lingual Domain-Focused Information Retrieval System for Effective Rapid Document Triage, , , , , , , , , , , and , in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. 2019, pages 19-24, 2019 |
SATokE: How can Syntax-Aware Contextualized Word Representations Benefit Implicit Discourse Relation Classification?, , , and , in: Ptroc. 2019 Conference sur l'Apprentissage automatique, 2019 |
Segment-level training of ANNs based on acoustic confidence measures for hybrid HMM/ANN Speech Recognition, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Selecting, Planning, and Rewriting: A Modular Approach for Data-to-Document Generation and Translation, , and , in: WNGT EMNLP, 2019 |
|
Self-attention for Speech Emotion Recognition, , and , in: Proc. Interspeech 2019, 2019 |
[DOI] |
Social Multimedia, Diversity, and Global South Cities: A Double Blind Side, , , and , in: Proc. ACM Workshop on Fairness, Accountability, and Transparency in Multimedia (FAT/MM), Nice, 2019 |
|
Spectral Subspace Analysis for Automatic Assessment of Pathological Speech Intelligibility, , and , in: Proceedings of Interspeech, Graz, Austria, pages 3038--3042, 2019 |
|
Spoken language identification using language bottleneck features, , , , , and , in: Proceedings of TSD, 2019 |
|
Super-gaussianity of Speech Spectral Coefficients as a Potential Biomarker for Dysarthric Speech Detection, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2019 |
Tampered Speaker Inconsistency Detection with Phonetically Aware Audio-visual Features, , , , , , , and , in: International Conference on Machine Learning, 2019 |
|
The Simulation of Mean Radiant Temperature in Outdoor Conditions: A Review of Architectural Tools Calculation Assumptions, , , and , in: Proceedings of Building Simulation 2019: 16th Conference of IBPSA, 2019 |
Unbiased semi-supervised LF-MMI training using dropout, , , and , in: Proceedings of Interspeech 2019, 2019 |
[DOI] |
Uncertainty-aware imitation learning using kernelized movement primitives, , , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019 |
|
Understanding and Visualizing Raw Waveform-based CNNs, , , and , in: Proceedings of Interspeech, 2019 |
|
Understanding the performance gap: a machine learning approach on residential buildings in Turin, Italy, , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Using Speech Production Knowledge for Raw Waveform Modelling based Styrian Dialect Identification, and , in: Proceedings of Interspeech, 2019 |
|
Virtual High-Framerate Microscopy of the Beating Heart via Sorting of Still Images, , , , and , in: 2019 IEEE 16th International Symposium on Biomedical Imaging, pages 312--315, 2019 |
|
Vulnerability assessment and detection of Deepfake videos, and , in: IAPR International Conference on Biometrics, 2019 |
|
Vulnerability of Face Recognition to Deep Morphing, and , in: International Conference on Biometrics for Borders, 2019 |
|
Weakly-Supervised Concept-based Adversarial Learning for Cross-lingual Word Embeddings, , and , in: Proc. 2019 Conference on Empirical Methods in Natural Language Processing, 2019 |
2018
A Differential Approach for Gaze Estimation with Calibration, , , and , in: 29TH BRITISH MACHINE VISION CONFERENCE, 2018 |
|
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: Proc. Interspeech 2018, pages 3147-3151, 2018 |
[DOI] |
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: MLSLP-18 Proceedings, Hyderabad, 2018 |
[URL] |
Ambiance in Social Media Venues: Visual Cue Interpretation by Machines and Crowds, , and , in: IEEE CVPR Workshop on Visual Understanding of Subjective Attributes, 2018 |
|
Analysis of Language Dependent Front-End for Speaker Recognition, , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 1101-1105, 2018 |
[DOI] |
Beyond Weight Tying: Learning Joint Input-Output Embeddings for Neural Machine Translation, , and , in: Proceedings of the Third Conference on Machine Translation (WMT), 2018 |
|
Bimanual Skill Learning with Pose and Joint Space Constraints, , , and , in: Proc. of the IEEE-RAS Intl Conf. on Humanoid Robots (Humanoids), 2018 |
|
Building Blocks of Assistant Based Speech Recognition for Air Traffic Management Applications, , , , , , , and , in: Conference: SESAR Innovation Days 2018, European Union, Eurocontrol, Salzburg, Austria, SESARJU, 2018 |
[URL] |
CNN based Query by Example Spoken Term Detection, , and , in: Proceedings of Interspeech, 2018 |
|
Complexity reduction of eigenvalue decomposition-based diffuse power spectral density estimators using the power method, , and , in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 451-455, 2018 |
|
CONTEXT-AWARE ATTENTION MECHANISM FOR SPEECH EMOTION RECOGNITION, , , and , in: IEEE Workshop on Spoken Language Technology, Athens, Greece, pages 126-131, 2018 |
[URL] |
Deep Multitask Gaze Estimation with a Constrained Landmark-Gaze Model, , and , in: European Conference on Computer Vision Workshop, 2018 |
|
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, AUSTRALIA, pages 74-79, 2018 |
[DOI] |
Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech, , , , , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 292-296, 2018 |
[DOI] |
DNN based speaker embedding using content information for text-dependent speaker verification, , , and , in: Proceedings of 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2018 |
|
Document-Level Neural Machine Translation with Hierarchical Attention Networks, , , and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018 |
|
Embedding Context-Dependent Variations of Prosodic Contours using Variational Encoding for Decomposing the Structure of Speech Prosody, , , , and , in: Workshop on Prosody and Meaning: Information Structure and Beyond, Aix-en-Provence, France, 2018 |
[URL] |
End-to-end text-dependent speaker verification using novel distance measures, , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, Aug 02-Sep 06, 2018, pages 3598-3602, 2018 |
[DOI] |
Enhancing Trust in eAssessment - the TeSLA System Solution, , , , and , in: Technology Enhanced Assessment Conference., 2018 |
|
Experimental evaluation of speech enhancement methods in remote microphone systems for hearing aids, , , , and , in: Proc. EuroNoise 2018, Crete, Greece, pages 351-358, 2018 |
|
Facing Employers and Customers: What Do Gaze and Expressions Tell About Soft Skills?, , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 121-126, ASSOC COMPUTING MACHINERY, 2018 |
[DOI] |
Far-field ASR Using Low-rank and Sparse Soft Targets from Parallel Data, , and , in: IEEE Workshop on Spoken Language Technology, Athens, GREECE, pages 581-587, IEEE, 2018 |
|
Fast cross-correlation based wrist vein recognition algorithm with rotation and translation compensation, , , and , in: Sixth International Workshop on Biometrics and Forensics, 2018 |
|
Fast Language Adaptation Using Phonological Information, , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 2459-2463, 2018 |
[DOI] |
Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models, , , , , , , and , in: 13th Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018 |
|
Geodesic Convolutional Shape Optimization, , , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Geometry-aware Control and Learning in Robotics, and , in: R:SS Pioneers Workshop, 2018 |
Geometry-aware Robot Manipulability Transfer, , and , in: R:SS Workshop on Learning and Inference in Robotics: Integrating Structure, Priors and Models, 2018 |
|
Geometry-aware Tracking of Manipulability Ellipsoids, , , and , in: Robotics: Science and Systems, Pittsburgh, USA, 2018 |
|
Implementing Fusion Techniques for the Classification of Paralinguistic Information, , , and , in: Proceedings of Interspeech 2018, pages 526-530, 2018 |
|
Investigating Depth Domain Adaptation for Efficient Human Pose Estimation, , , and , in: European Conference on Computer Vision - Workshops, 2018 |
|
Iterative alternating least-aquares approach to jointly estimate the RETFs and the diffuse PSD, , and , in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018 |
|
Iterative Learning of Speech Recognition Models for Air Traffic Control, , , , , , and , in: Proceedings of Interspeech 2018, ISCA, Hyderabad, India, pages 3519-3523, 2018 |
[DOI] |
Joining high-level symbolic planning with low-level motion primitives in adaptive HRI: application to dressing assistance, , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2018 |
Joint late reverberation and noise power spectral density estimation in a spatially homogeneous noise field, and , in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 441-445, 2018 |
|
Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, , and , in: Proceedings of Interspeech, pages 312--316, 2018 |
[DOI] |
Knowledge Transfer with Jacobian Matching, and , in: Proceedings of the International Conference on Machine Learning, 2018 |
[URL] |
Kronecker Recurrent Units, , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Leveraging Convolutional Pose Machines for Fast and Accurate Head Pose Estimation, , and , in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, SPAIN, pages 1089-1094, IEEE, 2018 |
|
Low-latency speaker spotting with online diarization and detection, , , , , , , , and , in: The Speaker and Language Recognition Workshop (Odyssey), 2018 |
|
Multilingual bottleneck features for subword modeling in zero-resource languages, and , in: Proc. Interspeech, pages 2668-2672, 2018 |
[DOI] |
NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2018 |
|
Not All Samples Are Created Equal: Deep Learning with Importance Sampling, and , in: Proceedings of International Conference on Machine Learning, 2018 |
|
On Effectiveness of Anomaly Detection Approaches against Unseen Presentation Attacks in Face Anti-Spoofing, , , and , in: The 11th IAPR International Conference on Biometrics (ICB 2018), 2018 |
|
On Learning to Identify Genders from Raw Speech Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 287-291, 2018 |
[DOI] |
On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 1116-1120, 2018 |
|
On the Use of Convolutional Neural Networks for Speech Presentation Attack Detection, , , , and , in: International Conference on Identity, Security and Behavior Analysis, 2018 |
|
Phonological Posterior Hashing for Query by Example Spoken Term Detection, , and , in: Proceedings of Interspeech, 2018 |
|
Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching, , and , in: Proceedings of the international conference on Neural Information Processing Systems, 2018 |
Probabilistic Learning of Torque Controllers from Kinematic and Force Constraints, , , , and , in: Proc. of the IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 6552-6559, 2018 |
|
Pulse-based Features for Face Presentation Attack Detection, and , in: Proceedings of BTAS 2018, special session on Image and Video Forensics in Biometrics, 2018 |
|
Real-time Convolutional Networks for Depth-based Human Pose Estimation, , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018 |
|
Real-Time DCT Learning-based Reconstruction of Neural Signals, , and , in: EUSIPCO, 2018 |
|
Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization, and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 2257-2261, 2018 |
[DOI] |
SCALAR - Simultaneous Calibration of 2D Laser And Robot's Kinematic Parameters Using Three Planar Constraints, , and , in: International Conference on Intelligent Robots, 2018 |
|
Self-Attentive Residual Decoder for Neural Machine Translation, , , and , in: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018 |
|
Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation by Use of Convolutional Neural Networks, and , in: 2018 25th IEEE International Conference on Image Processing (ICIP), pages 3818-3822, IEEE, 2018 |
[DOI] |
Semi-supervised Adaptation of Assistant Based Speech Recognition Models for different Approach Areas, , , , , , , , , , and , in: 37th AIAA/IEEE Digital Avionics Systems Conference, AIAA/IEEE, London, 2018 |
[URL] |
SGAN: An Alternative Training of Generative Adversarial Networks, and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 9407-9415, IEEE, 2018 |
[DOI] |
SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, , , , , , , , , and , in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018 |
[DOI] [URL] |
Single-channel late reverberation power spectral density estimation using denoising autoencoders, and , in: Proc. Annual Conference of the International Speech Communication Association, Hyderabad, India, 2018 |
|
SMILE Swiss German Sign Language Dataset, , , , , , , , , , , and , in: Language Resources and Evaluation Conference, 2018 |
Speaker Inconsistency Detection in Tampered Video, and , in: European Signal Processing Conference, 2018 |
|
Spoofing Deep Face Recognition With Custom Silicone Masks, , and , in: Proceedings of BTAS2018, 2018 |
|
Statistical modeling of speech spectral coefficients in patients with Parkinson's disease, and , in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018 |
|
Stochastic Variance Reduced Gradient Optimization of Generative Adversarial Networks, , , and , in: International Conference on Machine Learning (ICML) workshop on Theoretical Foundations and Applications of Deep Generative Models, 2018 |
Towards directly modeling raw speech signal for speaker verification using CNNs, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, CANADA, pages 4884-4888, 2018 |
|
Towards Quantifying the Entropy of Fingervein Patterns across Different Feature Extractors, and , in: 2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis (ISBA), 2018 |
|
UNICITY: A depth maps database for people detection in security airlocks, , , , , , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance Workshop, 2018 |
|
Vlogging Over Time: Longitudinal Impressions and Behavior in YouTube, , , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, EGYPT, pages 37-46, 2018 |
[DOI] |
WatchNet: Efficient and Depth-based Network for People Detection in Video Surveillance Systems, , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance, Auckland, NEW ZEALAND, pages 109-114, 2018 |
|
WILDTRACK: A Multi-camera HD Dataset for Dense Unscripted Pedestrian Detection, , , , , , , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 5030-5039, 2018 |
[DOI] |
Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes, , and , in: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2018 |
|
2017
#Healthy #Fondue #Dinner: Analysis and Inference of Food and Drink Consumption Patterns on Instagram, and , in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017 |
|
A Competition on Generalized Software-based Face Presentation Attack Detection in Mobile Scenarios, , , , , , , , , and , in: Proceedings of the International Joint Conference on Biometrics, 2017, 2017 |
|
A Context-Aware Speech recognition and Understanding System for Air Traffic Control Domain, , , , , and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, Okinawa, Japan, 2017 |
|
A Domain Adaptation Approach to Improve Speaker Turn Embedding Using Face Representation, and , in: ACM International Conference on Multimodal Interaction, Glasgow, Scotland, ACM, 2017 |
|
A Generative Model for Intention Recognition and Manipulation Assistance in Teleoperation, and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017 |
|
A Sub-Quadratic Exact Medoid Algorithm, and , in: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017 |
Active Online Anomaly Detection using Dirichlet Process Mixture Model and Gaussian Process Classification, , , , and , in: IEEE Winter Conference on Applications of Computer Vision (WACV), Washington, 2017 |
|
An Investigation of Deep Neural Networks for Multilingual Speech Recognition Training and Adaptation, , and , in: Proc. of Interspeech, 2017 |
|
BEAT: An Open-Science Web Platform, , and , in: Thirty-fourth International Conference on Machine Learning, Sydney, Australia, 2017 |
[URL] |
Bob Speaks Kaldi, , , , and , in: Proc. of Interspeech, 2017 |
|
Boosted Exudate Segmentation in Retinal Images using Residual Nets, , , and , in: Proceedings of the MICCAI Workshop on Ophthalmic Medical Image Analysis, 2017 |
Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, , and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
|
Content Normalization for Text-dependent Speaker Verification, , , and , in: Proc. of Interspeech, 2017 |
|
Continuously Reproducing Toolchains in Pattern Recognition and Machine Learning Experiments, , , , , and , in: Thirty-fourth International Conference on Machine Learning, Sidney, Australia, 2017 |
[URL] |
Cross-Eyed 2017: Cross-Spectral Iris/Periocular Recognition Competition., , , , , , , , , , , , , and , in: IEEE/IAPR International Joint Conference on Biometrics, Denver, Colorado, USA, IEEE, 2017 |
Deep Multi-Camera People Detection, and , in: Proceedings of the IEEE International Conference on Machine Learning and Applications, 2017 |
Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection, , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Dynamic Graffiti Stylisation with Stochastic Optimal Control, , and , in: Intl Workshop on movement and computing (MOCO), London, UK, pages 1-8, ACM, 2017 |
[DOI] [URL] |
Elderly People Living Alone: Detecting Home Visits with Ambient and Wearable Sensing, , , and , in: In Proceedings of MMHealth, 2017 |
|
End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, , and , in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017 |
|
Examining Linguistic Content and Skill Impression Structure for Job Interview Analytics in Hospitality, and , in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017 |
|
Exploiting Eigenposteriors for Semi-supervised Training of DNN Acoustic Models with Sequence Discrimination, , and , in: Proceedings of Interspeech, 2017 |
|
EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, pages 5370-5374, 2017 |
|
Exploratory Study on Direct Prediction of Diabetes using Deep Residual Networks, , , and , in: Proceedings of the thematic conference on computational vision and medical image processing, 2017 |
Gaussian Mixture Regression on Symmetric Positive Definite Matrices Manifolds: Application to Wrist Motion Estimation with sEMG, and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 59-64, 2017 |
[URL] |
Generating Calligraphic Trajectories with Model Predictive Control, , and , in: Proc. 43rd Conf. on Graphics Interface, Edmonton, AL, Canada, pages 132-139, 2017 |
[DOI] |
How May I Help You? Behavior and Impressions in Hospitality Service Encounters, , and , in: Proceddings of 19th ACM International Conference on Multimodal Interaction, 2017 |
|
Implementing gender-dependent vowel-level analysis for boosting speech-based depression recognition, , , and , in: Proceedings of Interspeech 2017, 2017 |
|
Improving hand and wrist activity detection using tactile sensors and tensor regression methods on Riemannian manifolds, , and , in: Proc. of the Myoelectric Control Symposium, 2017 |
[URL] |
Improving speaker turn embedding by crossmodal transfer learning from face embedding, and , in: ICCV Workshop on Computer Vision for Audio-Visual Media, 2017 |
|
Insiders and Outsiders: Comparing Urban Impressions between Population Groups, , and , in: International Conference on Multimedia Retrieval, ACM, 2017 |
[DOI] |
INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION, , , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, pages 5365-5369, 2017 |
[DOI] |
K-Medoids For K-Means Seeding, and , in: Proceedings of the international conference on Neural Information Processing Systems, 2017 |
Learning Manipulability Ellipsoids for Task Compatibility in Robot Manipulation, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3183-3189, 2017 |
[URL] |
Learning Task-Space Synergies using Riemannian Geometry, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Vancouver, Canada, pages 73-78, IEEE, 2017 |
[URL] |
Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models, , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Machine Learning of Controller Command Prediction Models from Recorded Radar Data and Controller Speech Utterances, , , , , , and , in: Proceedings of the 7th SESAR Innovation Days (SID), University of Belgrade, Belgrade, Serbia, 2017 |
|
Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
|
Multi-Modal Mean-Fields via Cardinality-Based Clamping, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017 |
Multi-view Representation Learning Via GCCA for Multimodal Analysis of Parkinson's Disease, , , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Multilingual Hierarchical Attention Networks for Document Classification, and , in: Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP), pages 1015-1025, 2017 |
|
Non-Markovian Globally Consistent Multi-Object Tracking, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Non-parametric warping via local scale estimation for non-stationary Gaussian process modelling, , , and , in: Wavelets and Sparsity XVII, pages 1039421, International Society for Optics and Photonics, 2017 |
[DOI] [URL] |
On Job Training: Automated Interpersonal Behavior Assessment & Real-Time Feedback, , in: Proceedings of the 25th ACM International Conference on Multimedia, 2017, 2017 |
|
On the Generalization of Fused Systems in Voice Presentation Attack Detection, , , , and , in: 16th International Conference of the Biometrics Special Interest Group, 2017 |
|
On the Impact of Non-modal Phonation On Phonological Features, , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Planification adaptative d'expériences numériques par paquets en contexte non stationnaire pour une étude de fissuration mécanique, , , , and , in: 23eme Congres Francais de Mecanique, 28 aout - 1er septembre 2017, Lille, France (FR), AFM, 2017 |
[URL] |
Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction, , and , in: Proceedings of the 12th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2017), 2017 |
|
Semi-supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control, , , , , and , in: Proceedings of Interspeech 2017, Stockholm, Sweden, pages 2406-2410, 2017 |
[DOI] |
Sense-Aware Statistical Machine Translation using Adaptive Context-Dependent Clustering, , and , in: Proceedings of Second Conference on Machine Translation (WMT17), 2017 |
|
Shape Representations for Maya Codical Glyphs: Knowledge-driven or Deep?, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition, , , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017 |
Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment, , , and , in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017 |
|
Subspace Regularized Dynamic Time Warping for Spoken Query Detection, , and , in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017 |
|
Supervised Gaze Bias Correction for Gaze Coding in Interactions, and , in: ECEM COGAIN Symposium, pages 3, 2017 |
|
Supervisory teleoperation with online learning and optimal control, and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1534-1540, IEEE, 2017 |
[URL] |
The SUMMA Platform Prototype, and , in: Proceedings of the EACL 2017 Software Demonstrations, Valencia, Spain, pages 116--119, 2017 |
[URL] |
Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, , , , , , , , , and , in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athenes, Greece, pages 32-39, IEEE Computer Society, 2017 |
[DOI] [URL] |
Towards large scale multimedia indexing: A case study on person discovery in broadcast news, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
Towards the Use of Social Interaction Conventions as Prior for Gaze Model Adaptation, , and , in: Proceedings of 19th ACM International Conference on Multimodal Interaction, pages 9, ACM, 2017 |
[DOI] |
Trajectory and Foothold Optimization using Low-Dimensional Models for Rough Terrain Locomotion, , , , , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1096-1103, IEEE, 2017 |
[URL] |
Using Coreference Links to Improve Spanish-to-English Machine Translation, and , in: Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), Valencia, Spain, pages 30-40, Association for Computational Linguistics (ACL), 2017 |
|
Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), and , in: Proceedings of the Third Workshop on Discourse in Machine Translation (DiscoMT), Denmark, Copenhagen, Association for Computational Linguistics (ACL), 2017 |
|
Venues in Social Media: Examining Ambiance Perception Through Scene Semantics, , and , in: Proceedings of the 25th ACM International Conference on Multimedia, ACM, 2017, 2017 |
|
Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction, , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
What you can't see can help you -- extended-range imaging for 3D-mask presentation attack detection, and , in: Proceedings of the 16th International Conference on Biometrics Special Interest Group., Darmstadt (Germany), Gesellschaft fuer Informatik e.V. (GI), 2017 |
|
2016
A Contextual Language Model to Improve Machine Translation of Pronouns by Re-ranking Translation Hypotheses, and , in: European Association for Machine Translation, 2016 |
A MultiPath Network for Object Detection, , , , , , and , in: Proceedings of the British Machine Vision Conference, BMVA Press, 2016 |
[URL] |
A Point-Spread-Function-Aware Filtered Backprojection Algorithm for Focal-Plane-Scanning Optical Projection Tomography, and , in: 2016 IEEE International Symposium on Biomedical Imaging, 2016 |
An agonist-antagonist pitch production model, and , in: Lecture Notes in Artificial Intelligence: 18th International Conference, SPECOM 2016, Budapest, Hungary, pages 84--91, 2016 |
|
Ancient Maya Writings as High-Dimensional Data: a Visualization Approach, , , and , in: Digital Humanities (DH), Krakow, 2016 |
|
Anomaly detection in elderly daily behavior in ambient sensing environments, , , and , in: Proceedings of the 7th Int. Workshop on Human Behavior Understanding, ACM Multimedia, 2016, Amsterdam, Netherlands, 2016 |
|
Assessing a Shape Descriptor for Analysis of Mesoamerican Hieroglyphics: A View Towards Practice in Digital Humanities, , and , in: Digital Humanities Conference (DH), Krakow, 2016 |
|
Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph, , and , in: Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), ACM, New York, NY, ACM Press, 2016 |
Comparing Two Strategies for Query Expansion in a News Monitoring System, and , in: Natural Language Processing and Information Systems: 21st International Conference on Applications of Natural Language to Information Systems, NLDB 2016, pages 267-275, Springer-Verlag, 2016 |
[DOI] |
Cross-database evaluation of audio-based spoofing detection systems, and , in: Interspeech, San Francisco, USA, 2016 |
[URL] |
DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5050-5054, IEEE, 2016 |
|
Deep Neural Networks for Syntactic Parsing of Morphologically Rich Languages, and , in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016 |
|
Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer, , , , , , , , , , , and , in: Lecture Notes in Artificial Intelligence: 18th International Conference, SPECOM 2016, Budapest, Hungary, pages 199--206, 2016 |
|
Dexterous Undersea Interventions with Far Distance Onshore Supervision: the DexROV Project, , , , , , , , , , , , , , , , , , , , , , and , in: IFAC Conference on Control Applications in Marine Systems (CAMS), Trondheim, Norway, pages 414-419, 2016 |
[DOI] [URL] |
Dites-Moi: Wearable Feedback on Conversational Behavior, , , and , in: Proceedings of the 15th International Conference on Mobile and Ubiquitous Multimedia, 2016 |
|
Efficient Posterior Exemplar Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , in: Interspeech, San Francisco, CA, 2016 |
|
Emphasis Recreation for TTS using Intonation Atoms, and , in: 9th ISCA Speech Synthesis Workshop, pages 14--20, 2016 |
[DOI] |
EUMSSI team at the MediaEval Person Discovery Challenge 2016, , and , in: MediaEval Benchmarking Initiative for Multimedia Evaluation, Hilversum, Netherlands, 2016 |
|
Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5690-5694, IEEE, 2016 |
|
Fast K-Means with Accurate Bounds, and , in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016 |
Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction, , , , , , , , and , in: Proceedings of WMT 2016 (First Conference on Machine Translation), Association for Computational Linguistics, Berlin, Germany, pages 525–542, 2016 |
[URL] |
Heterogeneous Face Recognition using Inter-Session Variability Modelling, and , in: IEEE Computer Society Workshop on Biometrics, Las Vegas - USA, IEEE, 2016 |
|
Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences, , , , and , in: Proceedings of the IEEE International Conference of Robotics and Automation, 2016 |
HMM-based Non-native Accent Assessment using Posterior Features, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
|
Human versus Machine Attention in Document Classification: A Dataset with Crowdsourced Annotations, and , in: Proceedings of the EMNLP 2016 Workshop on Natural Language Processing for Social Media, Austin, USA, 2016 |
|
Importance Sampling Tree for Large-scale Empirical Expectation, , and , in: Proceedings of the International Conference on Machine Learning (ICML), New-York, 2016 |
Improving Pronoun Translation by Modeling Coreference Uncertainty, and , in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, 2016 |
|
Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery, and , in: Proceedings of Interspeech, 2016 |
|
INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5580-5584, IEEE, 2016 |
|
InnerView: Learning Place Ambiance from Social Media Images, , and , in: Proceedings of the 24th ACM International Conference on Multimedia, ACM, 2016 |
[DOI] |
Inter-task System Fusion for Speaker Recognition, , , , and , in: Proceeedings of the INTERSPEECH, 2016 |
|
Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages, , , , and , in: Proceedings of the International Workshop on Spoken Language Translation, Seattle, WA, USA, 2016 |
|
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , in: 9th ISCA Speech Synthesis Workshop, 2016 |
|
Joint Operation of Voice Biometrics and Presentation Attack Detection, and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016 |
[URL] |
Large Scale Hard Sample Mining with Monte Carlo Tree Search, and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016 |
|
Learning assistive teleoperation behaviors from demonstration, and , in: Proc. IEEE International Symposium on Safety, Security and Rescue Robotics, pages 258-263, 2016 |
|
Learning dynamic graffiti strokes with a compliant robot, , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3981-3986, 2016 |
[URL] |
Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media, and , in: ACM Multimedia, Amsterdam, ACM, 2016 |
|
Learning to Refine Object Segments, , , and , in: Computer Vision - ECCV 2016, Amsterdam, pages 75-91, Springer, 2016 |
[DOI] [URL] |
Long-Term Time-Sensitive Costs for CRF-Based Tracking by Detection, , and , in: 2nd Workshop on Benchmarking Multi-target Tracking: MOTChallenge 2016, Amsterdam, 2016 |
|
Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, , , and , in: Interspeech, 2016 |
|
Manual and automatic labeling of discourse connectives for machine translation (Keynote paper), , in: TextLink: Structuring Discourse in Multilingual Europe (Handbook of the Second Action Conference), Budapest, Hungary, pages 16-20, 2016 |
[URL] |
Modeling Unvoiced Sounds In Statistical Parametric Speech Synthesis with a Continuous Vocoder, , , and , in: Proc. of EUSIPCO, Budapest, Hungary, 2016 |
|
Multilingual Visual Sentiment Concept Matching, , , , , , and , in: Proceedings of the International Conference on Multimedia Retrieval (ICMR), 2016 |
|
Nested Mini-Batch K-Means, and , in: Proceedings of NIPS, 2016 |
Neural Network-based Word Alignment through Score Aggregation, , and , in: Proceedings of the ACL 1st Conference on Machine Translation, 2016 |
|
Online Inference in Bayesian Non-Parametric Mixture Models under Small Variance Asymptotics, and , in: NIPS workshop on Advances in Approximate Bayesian Inference, Barcelona, Spain, pages 1-5, 2016 |
[URL] |
Online motion synthesis with minimal intervention control and formal safety guarantees, , , and , in: Proc. IEEE Intl Conf. on Systems, Man, and Cybernetics, Budapest, Hungary, 2016 |
|
Overview of BTAS 2016 Speaker Anti-spoofing Competition, , , , , , , , , , , , , , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016 |
[URL] |
PAoS Markers: Trajectory Analysis of Selective Phonological Posteriors for Assessment of Progressive Apraxia of Speech, , and , in: Proceeding on the 7th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), 2016 |
|
PhonVoc: A Phonetic and Phonological Vocoding Toolkit, and , in: Interspeech, San Francisco, USA, 2016 |
|
Phrase Representations for Multiword Expressions, and , in: Proceedings of the 12th Workshop on Multiword Expressions, 2016 |
|
Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification, , and , in: International Conference of the Biometrics Special Interest Group (BIOSIG), 2016 |
|
Principled Parallel Mean-Field Inference for Discrete Random Fields, , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016 |
Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
|
Pronoun Language Model and Grammatical Heuristics for Aiding Pronoun Prediction, and , in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, ACL, 2016 |
|
Scalable Metric Learning via Weighted Approximate Rank Component Analysis, and , in: ECCV 2016, 2016 |
|
Sound Pattern Matching for Automatic Prosodic Event Detection, , , , and , in: Interspeech, San Francisco, USA, 2016 |
|
Stochastic learning and control in multiple coordinate systems, , in: Intl Workshop on Human-Friendly Robotics, Genoa, Italy, pages 1-5, 2016 |
|
Stressful First Impressions in Job Interviews, , , , and , in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 325-332, 2016 |
|
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, , and , in: Interspeech, 2016 |
|
SYSTEM FUSION AND SPEAKER LINKING FOR LONGITUDINAL DIARIZATION OF TV SHOWS, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5495-5499, IEEE, 2016 |
|
Temporally Subsampled Detection for Accurate and Efficient Face Tracking and Diarization, , , and , in: International Conference on Pattern Recognition, Cancun, Mexico, IEEE, 2016 |
|
The Night is Young: Urban Crowdsourcing of Nightlife Patterns, , , , , , and , in: Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, ACM, 2016 |
[DOI] |
The REPLAY-MOBILE Face Presentation-Attack Database, , , and , in: Proceedings of the International Conference on Biometrics Special Interests Group, 2016 |
|
The SIWIS database: a multilingual speech database with acted emphasis, , , , , , , , , , , and , in: Proceedings of Interspeech, San Francisco, USA, pages 1532--1535, 2016 |
[DOI] |
Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens, , , , , , and , in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, Tokyo, Japan, pages 21-28, ACM, 2016 |
[DOI] |
Training on the Job: Behavioral Analysis of Job Interviews in Hospitality, , , , , and , in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 84-91, 2016 |
|
Transferring Neural Representations for Low-dimensional Indexing of Maya Hieroglyphic Art, , , , , , , , and , in: Proc. ECCV Workshop on Computer Vision for Art Analysis, Amsterdam, pages 842-855, Springer, 2016 |
[DOI] [URL] |
Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, , , and , in: Proceedings of Interspeech 2016, pages 2199-2203, 2016 |
Unified Prosody Model based on Atom Decomposition for Emphasis Detection, , , , , and , in: Proceedings of ETAI, 2016 |
|
Unsupervised Interpretable Pattern Discovery in Time Series Using Autoencoders, , , and , in: IAPR Int. Workshops on Structural and Syntactic Pattern Recognition (SSPR), 2016 |
|
Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation, and , in: Proceedings of the 10th Language Resources and Evaluation Conference (LREC), Portoroz, Slovenia, 2016 |
|
Variable Duration Movement Encoding with Minimal Intervention Control, , and , in: Proc. of the IEEE Intl Conf. on Robotics and Automation (ICRA), pages 497-503, 2016 |
|
When Naïve Bayes Nearest Neighbors Meet Convolutional Neural Networks, , and , in: Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, 2016 |
|
Wiki-LDA: A Mixed-Method Approach for Effective Interest Mining on Twitter Data, , , and , in: Proceedings of CSEDU 2016, 2016 |
|
2015
A Deeper Look at Dataset Bias, , , and , in: German Conference on Pattern Recognition, Aachen, Germany, pages 504–516, Springer International Publishing, 2015 |
[DOI] |
An Empirical Model of Emphatic Word Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 573-577, ISCA, 2015 |
|
An HMM-Based Formalism for Automatic Subword Unit Derivation and Pronunciation Generation, and , in: International Conference on Acoustics, Speech and Signal Processing, pages 4639-4643, IEEE, 2015 |
[DOI] |
An Investigation of Muscle Models for Physiologically Based Intonation Modelling, and , in: Proceedings of the 23rd Telecommunications Forum, pages 468--471, 2015 |
[DOI] |
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , in: Proceedings of Interspeech, ISCA, Dresden, pages 11-15, ISCA, 2015 |
|
Annotators' agreement and spontaneous emotion classification performance, and , in: Proceedings of Interspeech, pages 1546-1550, 2015 |
|
Atom Decomposition-based Intonation Modelling, , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4744--4748, IEEE, 2015 |
[DOI] |
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , in: Proceedings of Interspeech, 2015 |
|
Building context-dependent DNN acoustic models using Kullback-Leibler divergence-based state tying, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2015 |
|
Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition, , , , and , in: Proceedings of Interspeech, pages 741-745, 2015 |
|
COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, , and , in: Proceedings of ICASSP 2015, pages 4834-4837, 2015 |
|
CommuniSense: Crowdsourcing Road Hazards in Nairobi, , , , , , and , in: Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services, Copenhagen, Denmark, pages 445-456, ACM, 2015 |
[DOI] [URL] |
Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, , and , in: International Conference on Acoustics, Speech and Signal Procecssing, IEEE, South Brisbane, QLD, pages 4295 - 4299, IEEE, 2015 |
|
Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies (Extended Abstract), , , , , , and , in: Proceedings of ACII, Xi'an, pages 470-476, IEEE, 2015 |
[DOI] |
Deciphering the Silent Participant. On the Use of Audio-Visual Cues for the Classification of Listener Categories in Group Discussions, , , and , in: Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, ACM, Seattle, Washington, USA, pages 107--114, ACM, 2015 |
[DOI] |
DexROV: Dexterous Undersea Inspection and Maintenance in Presence of Communication Latencies, , , , , , , , , , , , , , , and , in: IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV), pages 218-223, 2015 |
Dictionary Learning for Sparse Representation of Neural Network Exemplars in Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015, pages 1093, 2015 |
|
EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, , , and , in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015 |
[URL] |
Estimation of Divergence-Free 3D Cardiac Blood Flow in a Zebrafish Larva Using Multi-View Microscopy, and , in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, IEEE, Brooklyn, NY, USA, pages 385-388, 2015 |
[DOI] |
EUMSSI team at the MediaEval Person Discovery Challenge, , , and , in: Working Notes Proceedings of the MediaEval 2015 Workshop, Wurzen, Germany, 2015 |
[URL] |
Exploring Dataset Similarities using PCA-based Feature Selection, , , and , in: Proceedings of ACII, Xi'an, pages 387-393, IEEE, 2015 |
[DOI] |
Finger vein Liveness Detection Using Motion Magnification, , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-7, 2015 |
[DOI] |
From Image-level to Pixel-level Labeling with Convolutional Networks, and , in: Computer Vision and Patter Recognition (CVPR), Boston, MA, pages 1713-1721, IEEE, 2015 |
[DOI] [URL] |
Gender Classification by LUT based boosting of Overlapping Block Patterns, , and , in: Scandinavian Conference on Image Analysis, pages 530-542, Springer International Publishing, 2015 |
[DOI] [URL] |
Head Nod Detection from a Full 3D Model, , and , in: Proceedings of the ICCV 2015, pages 528-536, 2015 |
|
I would hire you in a minute: Thin slices of nonverbal behavior in job interviews, and , in: Proceedings of the ACM International Conference on Multimodal Interaction (ICMI), pages 51-58, 2015 |
|
Integrated Pronunciation Learning for Automatic Speech Recognition Using Probabilistic Lexical Modeling, , and , in: International Conference on Acoustics, Speech and Signal Processing, South Brisbane, QLD, pages 5176-5180, 2015 |
[DOI] |
Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, , , and , in: Proceedings of Interspeech 2015, pages 3105-3109, 2015 |
|
Intensity-Based Point-Spread-Function-Aware Registration for Multi-View Applications in Optical Microscopy, , and , in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, pages 306-309, IEEE, 2015 |
[DOI] |
International Conference on Mobile and Ubiquitous Multimedia, , and , in: Happy and Agreeable? Multi-Label Classification of Impressions in Social Video, Linz, Austria, pages 109-120, ACM, 2015 |
[DOI] [URL] |
Joint RNN-Based Greedy Parsing and Word Composition, and , in: Proceedings of ICLR 2015, 2015 |
|
KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, and , in: Proceedings of ICASSP 2015, pages 4435-4439, 2015 |
|
Kullback-Leibler Proximal Variational Inference, , , and , in: Proceedings of the international conference on Neural Information Processing Systems, pages 3402-3410, 2015 |
|
Learned Minimal Intervention Control Synthesis based on Hidden Semi-Markov Models, , and , in: Proc. of the 8th Intl Workshop on Human-Friendly Robotics, pages 17, 2015 |
|
Learning bimanual end-effector poses from demonstrations using task-parameterized dynamical systems, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 464-470, 2015 |
|
Learning Feature Mapping using Deep Neural Network Bottleneck Features for Distant Large Vocabulary Speech Recognition, , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4540-4544, 2015 |
[DOI] |
Learning Optimal Controllers in Human-robot Cooperative Transportation Tasks with Position and Force Constraints, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 1024-1030, 2015 |
|
Learning to Segments Objects Candidates, , and , in: Advances in Neural Information Processing Systems, Montreal, Canada, pages 1990-1998, Curran Associates, Inc., 2015 |
[URL] |
Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, , , , , and , in: Proceedings of the ACL-IJCNLP 2015 Student Research Workshop, Beijing, China, pages 8-15, 2015 |
|
Looking at Cities in Mexico with Crowds, , and , in: Proceedings of the 2015 Annual Symposium on Computing for Development, London, United Kingdom, pages 127-135, ACM, 2015 |
[DOI] [URL] |
Loud and Trendy: Crowdsourcing Impressions of Social Ambiance in Popular Indoor Urban Places, and , in: Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia, pages 211-220, ACM, 2015 |
[DOI] [URL] |
N-gram-Based Low-Dimensional Representation for Document Classification, and , in: International Conference on Learning Representations, 2015 |
[URL] |
Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 1191-1195, ISCA, 2015 |
|
Novel GCC-PHAT Model in Diffuse Sound Field for Microphone Array Pairwise Distance Based Calibration, , , , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2669-2673, 2015 |
|
Objective Intelligibility Assessment of Text-to-Speech Systems Through Utterance Verification, , , and , in: Proceedings of Interspeech, Dresden, Germany, pages 3501-3505, 2015 |
[URL] |
Objective Speech Intelligibility Assessment through Comparison of Phoneme Class Conditional Probability Sequences, , and , in: 40th IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4924-4928, 2015 |
[DOI] |
On Application Of Non-Negative Matrix Factorization for Ad Hoc Microphone Array Calibration from Incomplete Noisy Distances, , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2694-2698, IEEE, 2015 |
[DOI] |
On Compressibility of Neural Network phonological Features for Low Bit Rate Speech Coding, , and , in: Proceeding of Interspeech, pages 418-422, ISCA, 2015 |
|
On the Vulnerability of Palm Vein Recognition to Spoofing Attacks, and , in: The 8th IAPR International Conference on Biometrics (ICB), pages 319 - 325, 2015 |
[DOI] [URL] |
On the Vulnerability of Speaker Verification to Realistic Voice Spoofing, , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-8, IEEE, 2015 |
[DOI] [URL] |
Overlapping Speech, Utterance Duration and Affective Content in HHI and HCI - an Comparison, , , , and , in: 6th IEEE Conference on Cognitive Infocommunications, Gyor, pages 83-88, 2015 |
[DOI] |
Palm Vein Database and Experimental Framework for Reproducible Research, and , in: IEEE International Conference of the Biometrics Special Interest Group, pages 1-7, 2015 |
[DOI] [URL] |
Periocular Biometrics in Mobile Environment, and , in: IEEE Seventh International Conference on Biometrics: Theory, Applications and Systems, Arlington, USA, pages 1-7, IEEE, 2015 |
[DOI] |
Personality Trait Classification via Co-Occurrent Multiparty Multimodal Event Discovery, , and , in: Proceedings of the ACM International Conference on Multimodal Interaction, Seattle, Washington, USA, pages 15-22, ACM, 2015 |
[DOI] |
Phonological Vocoding Using Artificial Neural Networks, , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4844-4848, IEEE, 2015 |
[DOI] |
Phrase-based Image Captioning, , and , in: International Conference on Machine Learning (ICML), Lille, France, pages 2085–2094, JMLR, 2015 |
[URL] |
Probability Occupancy Maps for Occluded Depth Images, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2829-2837, 2015 |
Pronoun Translation and Prediction with or without Coreference Links, , and , in: Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), Lisbon, Portugal, pages 94–100, 2015 |
|
Pronunciation Lexicon Development for Under-Resourced Languages Using Automatically Derived Subword Units: A Case Study on Scottish Gaelic, , and , in: 4th Biennial Workshop on Less-Resourced Languages, 2015 |
|
Query Refinement Using Conversational Context: a Method and an Evaluation Resource, and , in: Proceedings of NLDB 2015 (20th International Conference on Applications of Natural Language to Information Systems), Passau, Germany, pages 89-102, Springer-Verlag Berlin, 2015 |
[DOI] |
Robot Learning with Task-Parameterized Generative Models, , in: Proc. Intl Symp. on Robotics Research, 2015 |
|
Robust Microphone Placement for Source Localization from Noisy Distance Measurements, , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2579-2583, IEEE, 2015 |
[DOI] |
Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-Based Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015 |
|
Sparse Modeling of Posterior Exemplars for Keyword Detection, , , and , in: Proceedings of Interspeech, pages 3690-3694, 2015 |
|
The 1st Competition on Counter Measures to Finger Vein Spoofing Attacks, , , , , , , , , and , in: The 8th IAPR International Conference on Biometrics (ICB), pages 513 - 518, 2015 |
[DOI] [URL] |
Towards utterance-based neural network adaptation in acoustic modeling, , , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015 |
|
Transfer Learning through Greedy Subset Selection, , and , in: Image Analysis and Processing - ICIAP 2015, Genoa, Italy, pages 3-14, Springer International Publishing, 2015 |
[DOI] |
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology, , , , , and , in: Proceedings of the ACM International Conference on Multimedia, pages 159--168, 2015 |
|
Weighted Correlation based Atom Decomposition Intonation Modelling, , , and , in: Proceedings of Interspeech, Dresden, Germany, pages 1601--1605, 2015 |
|
2014
3D Gaze Tracking and Automatic Gaze Coding from RGB-D Cameras, and , in: IEEE Conference in Computer Vision and Pattern Recognition, Vision Meets Cognition Workshop, Columbus, Ohio, USA, 2014 |
|
A Conditional Random field approach for audio-visual people diarization, , , , and , in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 116 - 120, IEEE, 2014 |
[DOI] |
A Skill Transfer Approach for Continuum Robots - Imitation of Octopus Reaching Motion with the STIFF-FLOP Robot, , , and , in: In Proc. of the AAAI Symp. on Knowledge, Skill, and Behavior Transfer in Autonomous Robots, Arlington, VA, USA, pages 49-52, 2014 |
[URL] |
A task-parameterized probabilistic model with minimal intervention control, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3339 - 3344, IEEE, 2014 |
[DOI] |
Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, , , and , in: Proceeding of 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014 |
Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, , , and , in: Proceedings of the 4th Joint Workshop on Hands-free speech communication and Microphone Arrays, Villers-les-Nancy, pages 1 - 5, IEEE, 2014 |
[DOI] |
Artificial neural network features for speaker diarization, , and , in: IEEE Spoken Language Technology workshop, South Lake Tahoe, USA, 2014 |
|
Audio-Visual Gender Recognition in Uncontrolled Environment Using Variability Modeling Techniques, , and , in: International Joint Conference on Biometrics, Clearwater, Florida, USA, pages 1 - 8, IEEE, 2014 |
[DOI] [URL] |
Automated Bobbing and Phase Analysis to Measure Walking Entrainment, , , , , , and , in: IEEE International Conference on Image Processing (ICIP), Paris, 2014 |
|
Automatic Blinking Detection towards Stress Discovery, , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 307-310, ACM New York, 2014 |
[DOI] |
Automatic Maya Hieroglyph Retrieval Using Shape and Context Information, , , , and , in: ACM MM, pages 4, 2014 |
[URL] |
Automatic Speech Recognition and Translation of a Swiss German Dialect: Walliserdeutsch, , and , in: Proceedings of Interspeech, 2014 |
|
Capturing Upper Body Motion in Conversation: an Appearance Quasi-Invariant Approach, , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 327-334, ACM New York, 2014 |
[DOI] |
Comparison of Two Methods for Unsupervised Person Identification in TV Shows, , , , and , in: 12th International Workshop on Content-Based Multimedia Indexing, 2014 |
|
Cross-Database Evaluation With an Open Finger Vein Sensor, , , and , in: IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BioMS), Rome, Italy, pages 30-35, IEEE, 2014 |
[DOI] |
Cross-linguistic annotation of narrativity for English/French verb tense disambiguation, and , in: 9th Edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
Detecting and Labeling Speakers on Overlapping Speech using Vector Taylor Series, , and , in: INTERSPEECH, 2014 |
|
Detecting speaker roles and topic changes in multiparty conversations using latent topic models, and , in: Proceedings of Interspeech, 2014 |
|
Development of Bilingual ASR System for MediaParl Corpus, , , and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014 |
|
Dialect Levelling in Finnish: A Universal Speech Attribute Approach, , , , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
Diarizing Large Corpora using Multi-modal Speaker Linking, , , and , in: INTERSPEECH 2014, 2014 |
|
Dynamic Programming Boosting for Discriminative Macro-Action Discovery, and , in: International Conference on Machine Learning, 2014 |
|
Effect of nonverbal behavioral patterns on the performance of small groups, and , in: ICMI Workshop on Understanding and Modeling Multiparty Multimodal Interactions, Istanbul, Turkey, 2014 |
|
Efficient Sample Mining for Object Detection, and , in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014 |
|
Enforcing Topic Diversity in a Document Recommender for Conversations, and , in: Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics), Dublin, Ireland, pages 746-759, IEEE, 2014 |
|
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling, , and , in: The Ninth Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
Explaining the Stars: Weighted Multiple-Instance Learning for Aspect-Based Sentiment Analysis, and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 2014 |
|
Exploiting Scene Cues for Dropped Object Detection, , and , in: 9th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications., 2014 |
|
Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322 - 2326, IEEE, 2014 |
[DOI] |
EYEDIAP: A Database for the Development and Evaluation of Gaze Estimation Algorithms from RGB and RGB-D Cameras, , and , in: Proceedings of the ACM Symposium on Eye Tracking Research and Applications, Safety Harbor, Florida, United States of America, ACM, 2014 |
[DOI] |
Face identification from overlaid texts using Local Face Recurrent Patterns and CRF models, , , , and , in: IEEE International Conference on Image Processing 2014, Paris, IEEE, 2014 |
|
Feature Switching in the i-vector Framework for Speaker Verification, , , , and , in: Proc. of Interspeech 2014, pages 5, 2014 |
Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras, and , in: IEEE Computer Vision and Pattern Recognition Conference, Columbus, Ohio,USA, pages 1773-1780, IEEE, 2014 |
[DOI] |
Hierarchical speaker clustering methods for the NIST i-vector Challenge, , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
How Do You Like Your Virtual Agent?: Human-Agent Interaction Experience through Nonverbal Features and Personality Traits, , and , in: Human Behavior Understanding, pages 1-15, Springer, 2014 |
|
Importance of Prosody in Swiss French Accent for Speech Synthesis, and , in: Nouveaux cahiers de linguistique francaise, 2014 |
|
Improving Head and Body Pose Estimation through Semi-supervised Manifold Alignment, , , , and , in: International Conference on Image Processing, 2014 |
|
Improving Speaker Diarization using social role information, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2014 |
|
Inferring social relationships in a phone call from a single party's speech, , and , in: ICASSP, Florence, IT, pages 4843 - 4847, IEEE, 2014 |
[DOI] |
Information Bottleneck based Speaker Diarization of Meetings using Non-speech as Side Information, and , in: ICASSP, Florence, IT, pages 96 - 100, IEEE, 2014 |
[DOI] |
Introducing I-Vectors for Joint Anti-spoofing and Speaker Verification, , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
|
Is That a Jaguar? Segmenting Ancient Maya Glyphs via Crowdsourcing, , and , in: Proc. ACM Int. Workshop on Crowdsourcing for Multimedia, Orlando, pages 37-40, ACM New York, 2014 |
[DOI] |
Joint Phoneme Segmentation Inference and Classification using CRFs, , and , in: Global Conference on Signal and Information Processing, Atlanta, GA, pages 587 - 591, IEEE, 2014 |
[DOI] |
Jointly Informative Feature Selection, and , in: International Conference on Artificial Intelligence and Statistics, pages 567–575, 2014 |
|
Learning adaptive movements from demonstration and self-guided exploration, , and , in: Proc. IEEE Intl Conf. on Development and Learning and on Epigenetic Robotics (ICDL-EpiRob), Genoa, Italy, pages 160-165, 2014 |
|
Learning Force and Position Constraints in Human-robot Cooperative Transportation, , and , in: Proc. IEEE Intl Symposium on Robot and Human Interactive Communication (Ro-Man), Edinburgh, Scotland, UK, pages 619-624, 2014 |
|
Learning from demonstrations with partially observable task parameters, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3309 - 3314, IEEE, 2014 |
[DOI] |
Learning to Learn, from Transfer Learning to Domain Adaptation: A Unifying Perspective, and , in: Proceedings of the Computer Vision and Pattern Recognition, Columbus, OH, pages 1442-1449, IEEE, 2014 |
[DOI] |
LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images, , , and , in: Proceedings of the International Conference on 3D vision, pages 517–524, 2014 |
Mode of Teaching Based Segmentation and Annotation of Video Lectures, , and , in: International Workshop on Content-Based Multimedia Indexing, 2014 |
|
Model-based Sparse Component Analysis for Reverberant Speech Localization, , , and , in: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1439 - 1443, IEEE, 2014 |
[DOI] |
Modeling Overlapping Speech using Vector Taylor Series, , and , in: Odyssey: The Speaker and Language Recognition Workshop, Joensuu, Finland, 2014 |
|
Multi-Source Adaptive Learning for Fast Control of Prosthetics Hand, , and , in: Proceedings of the International Conference on Pattern Recognition, Stockholm, pages 2769 - 2774, 2014 |
[DOI] |
Multi-source Posteriors for Speech Activity Detection on Public Talks, and , in: INTERSPEECH, 2014 |
|
Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation, , , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, pages 7639-7643, IEEE, 2014 |
[DOI] |
Multimodal Reranking of Content-based Recommendations for Hyperlinking Video Snippets, , , and , in: ACM International Conference on Multimedia Retrieval, 2014 |
|
Null space redundancy learning for a flexible surgical robot, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 2443 - 2448, IEEE, 2014 |
[DOI] |
On Modeling Context-Dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, , and , in: International Conference on Acoustics, Speech, and Signal Processing, Florence, IT, pages 7659 - 7663, IEEE, 2014 |
[DOI] |
On Recognition of Non-Native Speech Using Probabilistic Lexical Model, and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), 2014 |
|
On the Vulnerability of Finger Vein Recognition to Spoofing, , and , in: IEEE International Conference of the Biometrics Special Interest Group (BIOSIG), Darmstadt, Germay, pages 1 - 10, IEEE, 2014 |
|
Overview of the ImageCLEF 2014 Domain Adaptation Task, and , in: ImageCLEF 2014: Overview and analysis of the results, 2014 |
|
Phoneme Background Model for Information Bottleneck based Speaker Diarization, , and , in: Interspeech 2014, 2014 |
Phoneme Background Model for Information Bottleneck based Speaker Diarization, , and , in: Interspeech, Singapore, 2014 |
|
Posterior-based Sparse Representation for Automatic Speech Recognition, , , and , in: Proceeding of Interspeech, 2014 |
|
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, , , and , in: Speech Prosody, 2014 |
|
Recurrent Convolutional Neural Networks for Scene Labeling, and , in: 31st International Conference on Machine Learning (ICML), Beijing, China, pages 82-90, JMLR, 2014 |
[URL] |
Recurrent Greedy Parsing with Neural Networks, and , in: Proceedings of ECML 2014, pages 130-144, Springer Berlin Heidelberg, 2014 |
[DOI] |
Rewards-driven control of robot arm by decoding EEG signals, , and , in: Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE, pages 1658-1661, IEEE, 2014 |
[DOI] [URL] |
ROCKIT: Roadmap for Conversational Interaction Technologies, , , , , , , , , , , , , , and , in: Proceedings of the 2014 Workshop on Roadmapping the Future of Multimodal Interaction Research including Business Opportunities and Challenges (RFMIR '14), pages 39-42, ACM, 2014 |
[DOI] |
Sample Distillation for Object Detection and Image Classification, , and , in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014 |
|
Scalable Probabilistic Models: Applied to Face Identification in the Wild, and , in: 8th European Biometrics Research and Industry Awards, European Association for Biometrics, Darmstadt, Germany, 2014 |
[URL] |
Scene Recognition with Naive Bayes Non-linear Learning, and , in: Proceedings of the 22nd International Conference on Pattern Recognition, Stockholm, pages 3404 - 3409, IEEE, 2014 |
[DOI] |
Skills Learning in Robots by Interaction with Users and Environment, , in: In Proc. of the Intl Conf. on Ubiquitous Robots and Ambient Intelligence (URAI), Kuala Lumpur, Malaysia, pages 161-162, 2014 |
[URL] |
SPEAR: An open source toolbox for speaker recognition based on Bob, , and , in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 1655 - 1659, 2014 |
[DOI] [URL] |
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, , , and , in: Interspeech, 2014 |
|
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , in: Speech Prosody, 2014 |
|
SWISS FRENCH REGIONAL ACCENT IDENTIFICATION, , , , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
Syllable-based Regional Swiss French Accent Identification using Prosodic Features, , , and , in: Nouveaux cahiers de linguistique francaise, 2014 |
|
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues, , , , , , , , , , , , , and , in: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, European Language Resources Association (ELRA), 2014 |
[URL] |
The MAAYA Project: Multimedia Analysis and Access for Documentation and Decipherment of Maya Epigraphy, , , , , , and , in: Proc. Digital Humanities Conference, Lausanne, 2014 |
|
The SP2 SCOPES Project on Speech Prosody, , , , , , , , and , in: DOGS2014 - Digital speech and image processing, 2014 |
|
The Workshop on Computational Personality Recognition 2014, , , , , and , in: Proceedings of the ACM International Conference on Multimedia, 2014 |
|
The Young and the City: Crowdsourcing Urban Awareness in a Developing Country, , and , in: Proceedings of the First International Conference on IoT in Urban Space, pages 74-79, 2014 |
[DOI] [URL] |
Tracking Interacting Objects Optimally Using Integer Programming, , , and , in: Proceedings of the European Conference on Computer Vision, pages 17-32, 2014 |
|
Translation and Prosody in Swiss Languages, , , , , , , , , , and , in: Nouveaux cahiers de linguistique francaise, 2014 |
|
What to Show? Automatic Stream Selection Among Multiple Sensors, , and , in: International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2014 |
|
Who Will Get the Grant ? A Multimodal Corpus for the Analysis of Conversational Behaviours in Group Interviews, , , , and , in: International Conference on Multimodal Interaction, Understanding and Modeling Multiparty, Multimodal Interactions Workshop, Istanbul, Turkey, ACM, 2014 |
[DOI] |
Within- and Cross- Database Evaluations for Gender Classification via BeFIT Protocols, , and , in: International Workshop on Multimedia Signal Processing, pages 1-6, 2014 |
[DOI] [URL] |
Word Embeddings through Hellinger PCA, and , in: 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014 |
|
2013
3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, , in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013 |
[DOI] |
A Multipath Sparse Beamfroming Method, , , and , in: Signal Processing with Adaptive Sparse Structured Representations SPARS, 2013 |
|
A Probabilistic Framework for Multiple Speaker Localization, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013 |
|
A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, , , and , in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013 |
[DOI] |
Accelerated Training of Linear Object Detectors, and , in: CVPR 2013 Workshop on Structured Prediction, 2013 |
[URL] |
ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, , , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013 |
[DOI] |
Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, , and , in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013 |
|
An Open-source State-of-the-art Toolbox for Broadcast News Diarization, , , , , and , in: INTERSPEECH, 2013 |
|
Anti-spoofing in action: joint operation with a verification system, , and , in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Workshop on Biometrics, Portland, Oregon, 2013 |
|
Are ACT's scores increasing with better translation quality?, , in: Are ACT's scores increasing with better translation quality?, pages 6, 2013 |
|
Assessing the Accuracy of Discourse Connective Translations: Validation of an Automatic Metric, and , in: 14th International Conference on Intelligent Text Processing and Computational Linguistics, University of the Aegean, Samos, Greece, pages 236-247, Springer, 2013 |
[DOI] |
Automatic Social Role Recognition In Professional Meetings Using Conditional Random Fields, and , in: Proceedings of Interspeech, 2013 |
|
Automatic Staging of Audio with Emotions, and , in: International Conference on Affective Computing and Intelligent Interaction, 2013 |
Body communicative cue extraction for conversational analysis, , , , and , in: Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, 2013 |
|
Can face anti-spoofing countermeasures work in a real world scenario?, , , and , in: International Conference on Biometrics, Madrid, Spain, 2013 |
[URL] |
Combining Content with User Preferences for TED Lecture Recommendation, and , in: Proceedings of the 11th International Workshop on Content Based Multimedia Indexing, Veszprém, Hungary, IEEE, 2013 |
|
Complementary Countermeasures for Detecting Scenic Face Spoofing Attacks, , , , and , in: International Conference on Biometrics, Madrid, Spain, 2013 |
[URL] |
Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, and , in: International Joint Conference on artificial intelligence, 2013 |
|
Context Aware Addressee Estimation for Human Robot Interaction, , , and , in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013 |
Creative Applications of Human Behavior Understanding. HBU 2013: 1-14, , , and , in: Human Behavior Understanding, pages 1-14, 2013 |
Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013 |
|
Deformable Part Models with Individual Part Scaling, and , in: British Machine Vision Conference, 2013 |
|
Detecting Narrativity to Improve English to French Translation of Simple Past Verbs, , and , in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 33-42, 2013 |
|
Discovering Temporal Patterns in Water Quality Time Series, Focusing on Floods with the LDA method, , , , , , , and , in: European Geosciences Union, 2013 |
|
Distinguishing the Popularity Between Topics: A System for Up-to-date Opinion Retrieval and Mining in the Web, , and , in: Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics, LNCS, Samos, Greece, ACM, 2013 |
[URL] |
Diverse Keyword Extraction from Conversations, and , in: Proceedings of the ACL 2013 (51th Annual Meeting of the Association for Computational Linguistics ), Short Papers, Sofia, Bulgaria, pages 651-657, ACL, 2013 |
|
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, , and , in: Proceedings of Interspeech, 2013 |
|
Euclidean Distance Matrix Completion for Ad-hoc Microphone Array Calibration, , , and , in: Proceedings IEEE International Conference On Digital Signal Processing, 2013 |
|
Evaluating Intra- and Crosslingual Adaptation for Non-native Speech Recognition in a Bilingual Environment, and , in: Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications, IEEE, Budapest, Hungary, pages 357-361, 2013 |
|
Evaluating Shape Descriptors for Detection of Maya Hieroglyphs, , and , in: in Proc. Mexican Conf. on Pattern Recognition, Queretaro, 2013 |
|
Exploiting Accelerometers to Improve Movement Classification for Prosthetics, and , in: International Conference on Rehabilitation Robotics, 2013 |
|
Fast Object Detection with Entropy-Driven Evaluation, , , and , in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013 |
FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013 |
[DOI] |
From Foursquare to my Square: Learning Check-in Behavior from Multiple Sources, , and , in: The 7th International AAAI Conference on Weblogs and Social Media, 2013 |
|
From N to N+1: Multiclass Transfer Incremental Learning, , and , in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , in: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, Dallas, Texas, USA, pages 97-104, ACM, 2013 |
|
Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition, , and , in: Proceedings of IEEE TENCON, 2013 |
|
Given that, Should I Respond? Contextual Addressee Estimation in Multi-Party Human-Robot Interactions, and , in: Proceedings of Human Robot Interaction (HRI) Conference, 2013 |
|
Grapheme and Multilingual Posterior Features for Under-Resourced Speech Recognition: A Study on Scottish Gaelic, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: INTERSPEECH, Lyon, France, 2013 |
|
Idiap at MediaEval 2013: Search and Hyperlinking Task, , , and , in: MediaEval 2013 Workshop, Barcelona, Spain, CEUR-WS.org, 2013 |
|
Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition, , , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013 |
|
Implicitation of Discourse Connectives in (Machine) Translation, and , in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 19-26, 2013 |
|
Improved Overlap Speech Diarization of Meeting Recordings using Long-term Conversational Features, and , in: ICASSP, 2013 |
Improved Overlap Speech Diarization of Meeting Recordings using Long-term Conversational Features, and , in: ICASSP, 2013 |
|
Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, and , in: Proceedings of Interspeech, 2013 |
|
Inferring Mood in Ubiquitous Conversational Video, , , , , and , in: 12th International Conference on Mobile and Ubiquitous Multimedia, Luleå, Sweden, ACM Press, 2013 |
|
Inferring social activities with mobile sensor networks, , , , and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Investigating the Impact of Language Style and Vocal Expression on Social Roles of Participants in Professional Meetings, and , in: Affective Computing and Intelligent Interaction, Geneva, pages 324-329, IEEE, 2013 |
[DOI] |
Learning to Rank on Network Data, , and , in: Mining and Learning with Graphs, 2013 |
|
Leveraging the robot dialog state for visual focus of attention recognition, , , , and , in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013 |
Machine Translation with Many Manually Labeled Discourse Connectives, and , in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 43-50, 2013 |
|
Manifold Sparse Beamforming, , and , in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013 |
[DOI] |
MLP-based Factor Analysis for Tandem Speech Recognition, and , in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
Multi-factor Segmentation for Topic Visualization and Recommendation: the MUST-VIS System, , , , , , , and , in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 365-368, ACM, 2013 |
[DOI] [URL] |
Multiclass Latent Locally Linear Support Vector Machines, , and , in: JMLR W&CP, Volume 29: Asian Conference on Machine Learning, Canberra, Australia, pages 229-244, 2013 |
[URL] |
Multimodal Analysis of Body Communication Cues in Employment Interviews, , , and , in: 15th ACM International Conference on Multimodal Interaction Proceedings, 2013 |
|
Noise Intrusiveness Factors in Speech Telecommunications, , , and , in: Proceedings of the AIA-DAGA 2013 International Conference on Acoustics, Merano, Italy, pages 436-439, 2013 |
|
On the (Un)importance of the Contextual Factors In HMM-Based Speech Synthesis, , and , in: Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, pages 8140 - 8143, 2013 |
|
On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, , , and , in: Biometric Technologies in Forensic Science, Nijmegen, The Netherlands, 2013 |
|
One of a Kind: Inferring Personality Impressions in Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Overview of the ImageCLEF 2013 Robot Vision Task, , , and , in: Working Notes, CLEF 2013, 2013 |
|
Parameter Estimation and Contextual Adaptation for a Multi-Object Tracking CRF Model, and , in: IEEE Workshop on Performance Evaluation of Tracking and Surveillance, 2013 |
|
Person Independent 3D Gaze Estimation From Remote RGB-D Cameras, and , in: International Conference on Image Processing, Melbourne, Australia, IEEE, 2013 |
[DOI] |
Probabilistic Lexical Modeling and Unsupervised Training for Zero-Resourced ASR, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013 |
|
Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project, , , , , , , , and , in: Workshop on Speech, Language and Audio in Multimedia, 2013 |
|
Reservoir Boosting : Between Online and Offline Ensemble Learning, and , in: Proceedings of the international conference on Neural Information Processing Systems, 2013 |
|
Revisiting the Generality of the Rank-based Human Mobility Model, and , in: Proceedings of the 2013 ACM Conference on Pervasive and Ubiquitous Computing Adjunct Publication, Zurich, Switzerland, pages 1209-1218, ACM, 2013 |
[DOI] [URL] |
Sentiment Analysis of User Comments for One-Class Collaborative Filtering over TED Talks, and , in: 36th ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, ACM, 2013 |
|
Speaker adaptive Kullback-Leibler divergence based hidden Markov models, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
Speaking Swiss: Languages and Venues in Foursquare, and , in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 501-504, ACM, 2013 |
[DOI] [URL] |
Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, and , in: International Conference of the Biometrics Special Interes Group, Darmstadt, Germany, 2013 |
|
Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, and , in: Biometrics: Theory, Applications and Systems, Washington DC, USA, 2013 |
|
Stability and Hypothesis Transfer Learning, and , in: International Conference on Machine Learning, 2013 |
|
Structured Sparse Acoustic Modeling for Speech Separation, , , and , in: Signal Processing with Adaptive Sparse Structured Representations SPARS, SPARS, 2013 |
|
Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, , and , in: Proc. of Interspeech 2013, Lyon, France, 2013 |
|
The 2013 Face Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: The 6th IAPR International Conference on Biometrics, 2013 |
|
The 2013 Speaker Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: The 6th IAPR International Conference on Biometrics, 2013 |
|
The 2nd competition on counter measures to 2D face spoofing attacks, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: International Conference of Biometrics 2013, Madrid, Spain, 2013 |
|
The vernissage corpus: a conversational human-robot-interaction dataset, , , , , , , , , and , in: Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction, 2013 |
|
Time-Sensitive Topic Models for Action Recognition in Videos, , and , in: IEEE International Conference on Image Processing, 2013 |
|
Transfer in Inverse Reinforcement Learning for Multiple Strategies, and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan, pages 3244-3250, IEEE, 2013 |
[DOI] [URL] |
Understanding Factors in Emotion Perception, and , in: ISCA Speech Synthesis Workshop, 2013 |
|
Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, and , in: Proc. of Interspeech 2013, 2013 |
Who is Persuasive? The Role of Perceived Personality and Communication Modality in Social Multimedia, , , , and , in: International Conference on Multimodal Interaction, 2013 |
2012
A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, , and , in: 13th International Workshop on Acoustic Signal Enhancement, pages 233-236, 2012 |
|
A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, , , and , in: 20th European Signal Processing Conference, 2012 |
|
A tree-based distance between distributions: application to classification of neurons, and , in: ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
|
An Agent-Based Focused Crawling Framework for Topic- and Genre-Related Web Document Discovery, , and , in: 24th IEEE International Conference on Tools with Artificial Intelligence, Athens, Greece, IEEE, 2012 |
[URL] |
An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, , and , in: Computer Vision - ECCV 2012. Workshops and Demonstrations, Idiap Research Institute, Heidelberg, pages 547-556, Springer Berlin, 2012 |
[DOI] [URL] |
Annotation and Recognition of Personality Traits in Spoken Conversations from the AMI Meetings Corpus, , and , in: Proceedings of Interspeech 2012, 2012 |
|
Assessing the Impact of Language Style on Emergent Leadership Perception from Ubiquitous Audio, , and , in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012 |
|
Automatic detection of conflict escalation in spoken conversations, , and , in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012 |
|
Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012 |
|
Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012, 2012 |
|
Baseline Multimodal Place Classifier for the 2012 Robot Vision Task, , and , in: Working Notes of the ImageCLEF 2012 Laboratory, 2012 |
|
Beyond Dataset Bias: Multi-task Unaligned Shared Knowledge Transfer, , , and , in: Asian Conference on Computer Vision, 2012 |
|
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , in: Proceedings of the 21st International Conference on Pattern Recognition, 2012 |
|
Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, , , , , , , , , , , , , and , in: IEEE ICME Workshop on Hot Topics in Mobile Multimedia, 2012 |
|
Bob: a free signal processing and machine learning toolbox for researchers, , , , , and , in: Proceedings of the ACM Multimedia Conference, 2012 |
[URL] |
Boosting localized binary features for speech recognition, , and , in: Symposium on Machine Learning in Speech and Language Processing (MLSLP), 2012 |
|
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, , and , in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012 |
|
Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA, 2012 |
Building the NinaPro Database: a Resource for the Biorobotics Community, , , , , , , , and , in: Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2012 |
|
Checking In or Checked In: Comparing Large-Scale Manual and Automatic Location Disclosure Patterns, , and , in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012 |
Collecting data for socially intelligent surveillance and monitoring approaches: the case of conflict in competitive conversations, , , and , in: International Symposium on Communications, Control, and Signal Processing, 2012 |
|
Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR, , , , , and , in: Proceedings of Interspeech, 2012 |
|
Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation, and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
COMBINING CEPSTRAL NORMALIZATION AND COCHLEAR IMPLANT-LIKE SPEECH PROCESSING FOR MICROPHONE ARRAY-BASED SPEECH RECOGNITION, , and , in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012 |
|
Combining transcription-based and acoustic-based speaker identifications for broadcast news, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012 |
|
COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH HIERARCHIAL LINEAR TRANSFORMATIONS, , , and , in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012 |
|
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
|
Contextual Conditional Models for Smartphone-based Human Mobility Prediction, and , in: Proceedings of the 14th ACM International Conference on Ubiquitous Computing, 2012 |
|
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
|
Crowdsourcing Micro-Level Multimedia Annotations: The Challenges of Evaluation and Interface, , , and , in: Proceedings of International ACM Workshop on Crowdsourcing for Multimedia, 2012 |
Detecting and Labeling Folk Literature in Spoken Cultural Heritage Archives using Structural and Prosodic Features, and , in: IEEE Content Based Multimedia Indexing, 2012 |
|
DiarTk : An Open Source Toolkit for Research in Multistream Speaker Diarization and its Application to Meetings Recordings, and , in: Proceedings of Interspeech, 2012 |
|
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns, , , , and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), pages 5, 2012 |
|
Empirical validations of multilingual annotation schemes for discourse relations, , , and , in: 8th Joint ACL-ISO Workshop on Interoperable Semantic Annotation, 2012 |
|
Exact Acceleration of Linear Object Detectors, and , in: Proceedings of the European Conference on Computer Vision, 2012 |
|
Experiences in the Creation of an Electromyography Database to Help Hand Amputated Persons, , , , , , and , in: Proceedings of the 24th European Medical Informatics Conference, 2012 |
|
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies, and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), Istanbul, TR, pages 6, 2012 |
|
Extracting Informative Textual Parts from Web Pages Containing User-Generated Content, , and , in: 12th International Conference on Knowledge Management and Knowledge Technologies, ACM ICPS, Graz, Austria, pages 4:1--4:8, ACM, 2012 |
[URL] |
Extracting Mobile Behavioral Patterns with the Distant N-Gram Topic Model, and , in: Proceedings of the IEEE International Symposium on Wearable Computers, Newcastle, 2012 |
|
Face Recognition with Disparity Corrected Gabor Phase Differences, , and , in: Artificial Neural Networks and Machine Learning, Heidelberg, pages 411-418, Springer Berlin, 2012 |
[DOI] |
Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, , and , in: Proceedings of the 11th International Conference of the Biometrics Special Interest Group, Darmstadt, Germany, pages 397-408, GI-Edition, 2012 |
|
FaceTube: predicting personality from facial expressions of emotion in online conversational video, , and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2012 |
|
From Speech to Personality: Mapping Voice Quality and Intonation into Personality Differences, , , and , in: in Proceedings of ACM Multimedia 2012, 2012 |
|
Gaze Estimation From Multimodal Kinect Data, and , in: IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, Providence, RI, USA, 2012 |
[DOI] |
Generating Exact Lattices in The WFST Framework, , , , , , , , , , , , and , in: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing., The 37th International Conference on Acoustics, Speech, and Signal Processing, Kyoto, JP, Kyoto, Japan, pages 4213-4216, IEEE Signal Processing Societ, 2012 |
[DOI] |
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , in: Actes de la conference conjointe JEP-TALN-RECITAL 2012, Grenoble, France, pages 193-200, ATALA/AFCP, 2012 |
|
IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, , and , in: Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, Japan, pages 4413-4416, 2012 |
Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, and , in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012 |
|
Investigating the Midline Effect for Visual Focus of Attention Recognition, and , in: Int Conf. on Multimodal Interaction (ICMI), Santa Monica, 2012 |
|
Iterative Relevance Feedback with Adaptive Exploration/Exploitation Trade-off, and , in: Proceedings of the 21st ACM Conference on Information and Knowledge Management, pages 1323-1331, 2012 |
|
Joint Detection and Localization of Multiple Speakers using a Probabilistic Interpretation of the Steered Response Power, , , and , in: Statistical and Perceptual Audition Workshop, 2012 |
|
LBP-TOP based countermeasure against face spoofing attacks, , , and , in: International Workshop on Computer Vision With Local Binary Pattern Variants - ACCV, pages 12, 2012 |
|
Leveraging over prior knowledge for online learning of visual categories, , , and , in: Proceedings of the British Machine Vision Conference, 2012 |
|
Linking Speaking and Looking Behavior Patterns with Group Composition, Perception, and Performance, , , , and , in: Proceedings of the International Conference on Multimodal Interaction (ICMI), Santa Monica, USA, 2012 |
|
Machine Translation of Labeled Discourse Connectives, , , and , in: Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), pages 10, 2012 |
|
Macro-Action Discovery Based on Change Point Detection and Boosting, and , in: International Conference on Machine Learning and Applications, 2012 |
|
MediaParl: Bilingual mixed language accented speech database, , , , , and , in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263--268, 2012 |
|
Microphone Array Beampattern Characterization for Hands-free Speech Applications, , and , in: IEEE 7th Sensor Array and Multichannel Signal Processing Workshop(SAM), Hoboken, NJ, USA, pages 473-476, 2012 |
|
Modeling dominance effects on nonverbal behaviors using granger causality, , , , , and , in: Proceedings of International Conference on Multimodal Interaction, ICMI 2012, Santa Monica, CA, 2012 |
|
Multimodal Cue Detection Engine for Orchestrated Entertainment, , , and , in: Proceedings International Conference on MultiMedia Modeling, Klagenfurt, Austria, 2012 |
|
On Speaker-Independent Personality Perception and Prediction from Speech, , , , , and , in: in Proceedings of INTERSPEECH 2012, 2012 |
|
On the Challenge of Classifying 52 Hand Movements from Surface Electromyography, , and , in: 34th Annual Conference of the IEEE Engineering in Medicine & Biology Society, 2012 |
|
On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, , and , in: Proceedings of the 11th International Conference of the Biometrics Special Interes Group, 2012 |
|
Overview of the ImageCLEF 2012 Robot Vision Task, , and , in: Working Notes of the ImageCLEF 2012 Laboratory, 2012 |
|
Predicting the Conflict Level in Television Political Debates: an Approach Based on Crowdsourcing, Nonverbal Communication and Gaussian Processes, , , and , in: ACM Multimedia, 2012 |
Reading Companion: The Technical and Social Design of an Automated Reading Tutor, , , , , and , in: Workshop on Child, Computer and Interaction, Portland, Oregon, U.S.A., 2012 |
|
Recognizing the Visual Focus of Attention for Human Robot Interaction, , and , in: IEEE International Conference on Intelligent Robots and Systems (IROS) - Human Behavior Understanding Workshop(IROS-HBU), 2012 |
|
Robot-to-group Interaction in a Vernissage: Architecture & Dataset for Multi-party Dialog, , , , , , , and , in: Proceedings of 5th International Conference on Cognitive Systems, 2012 |
|
Robust triphone mapping for acoustic modeling, , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
Socio-Technical Network Analysis from Wearable Interactions, , and , in: International Symposium on Wearable Computers, 2012 |
|
Speaker Diarization and Linking of Large Corpora, and , in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012 |
|
Speaker Diarization of Meetings based on large TDOA feature vectors, and , in: Proceedings of International Conference on Acoustic, Speech and Signal Processing, 2012 |
|
Speaker diarization of overlapping speech based on silence distribution in meeting recordings, and , in: INTERSPEECH, Portland, Oregon, USA, 2012 |
|
StressSense: Detecting Stress in Unconstrained Acoustic Environments using Smartphones, , , , , , , and , in: Ubicomp'12, Pittsburgh, 2012 |
|
Structured Sparse Coding for Microphone Array Location Calibration, , , and , in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012 |
|
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , in: Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech), Portland, Oregon, 2012 |
|
Supervised and unsupervised Web-based language model domain adaptation, , , and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
|
Synthetic References for Template-based ASR using Posterior Features, , and , in: Proceedings of Interspeech, Portland, Oregon, USA, 2012 |
|
Template-based ASR using Posterior features and synthetic references: comparing different TTS systems, , and , in: SAPA-SCALE Conference, International Speech Communication Association, 2012 |
|
The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012 |
|
The I4U Submission to the 2012 NIST Speaker Recognition Evaluation, , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: NIST Speaker Recognition Conference, 2012 |
The Idiap Speaker Recognition Evaluation System at NIST SRE 2012, , and , in: NIST Speaker Recognition Conference, NIST, Orlando, USA, 2012 |
|
The INTERSPEECH 2012 Speaker Trait Challenge, , , , , , , , , , , and , in: in Proceedings of INTERSPEECH, 2012 |
The Mobile Data Challenge: Big Data for Mobile Computing Research, , , , , , , , and , in: Pervasive Computing, Newcastle, 2012 |
|
Translating English Discourse Connectives into Arabic: a Corpus-based Analysis and an Evaluation Metric, and , in: Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), 2012 |
|
Unsupervised Activity Analysis and Monitoring algorithms for Effective Surveillance Systems, , , , , , , and , in: European Conference on Computer Vision, 2012 |
|
Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, and , in: RecSys, Recommendation Utility Evaluation (RUE 2012), Dublin, Ireland, pages 15-20, 2012 |
|
Using KL-divergence and multilingual information to improve ASR for under-resourced languages, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012 |
|
Using Self-Context for Multimodal Detection of Head Nods in Face-to-Face Interactions, , and , in: Proceedings of the 14th ACM International Conference on Multimodal Interaction, 2012 |
|
Using Sense-labeled Discourse Connectives for Statistical Machine Translation, and , in: Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra), Avignon, FR, pages 129--138, 2012 |
|
Using Sparse Classification Outputs as Feature Observations for Noise Robust ASR, , , , , and , in: Proceedings of Interspeech, 2012 |
|
We are not Contortionists: Coupled Adaptive Learning for Head and Body Orientation Estimation in Surveillance Video, and , in: IEEE International Conference on Computer Vision and Pattern Recognition, 2012 |
|
2011
A Bimodal Sound Source Model for Vehicle Tracking in Traffic Monitoring, , , and , in: European Signal Processing Conference, 2011 |
|
A BSS-based Approach for Localization of Simultaneous Speakers in Reverberant Conditions, , , and , in: Proceedings of the 19th European Signal Processing Conference (EUSIPCO), 2011 |
|
A Compressive Sensing Based Compressed Neural Network for Sound Source Localization, , and , in: Proceedings of International Symposium on Artificial Intelligence and Signal Processing, 2011 |
|
A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation, , , and , in: Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?", Hamburg, Germany, pages 5, 2011 |
|
A Joint Estimation of Head and Body Orientation Cues in Surveillance Video, , and , in: IEEE International Workshop on Socially Intelligent Surveillance and Monitoring, 2011 |
|
A Just-in-Time Document Retrieval System for Dialogues or Monologues, , , and , in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, Portland, OR, pages 350-352, 2011 |
|
A Large-Scale Database of Images and Captions for Automatic Face Naming, , , and , in: Proceedings of the 22nd British Machine Vision Conference, 2011 |
|
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), Portland, OR, pages 80-86, 2011 |
[URL] |
An Audio Visual Corpus for Emergent Leader Analysis, , and , in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011 |
An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, , , , and , in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011 |
|
Analysis and Comparison of Recent MLP Features for LVCSR Systems, , and , in: Proceedings of Interspeech 2011, 2011 |
|
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , in: Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, Edinburgh, UK, 2011 |
|
Automated Quantification of Morphodynamics for High-Throughput Live Cell Imaging Datasets, , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
Automatic Time Skew Detection and Correction, , in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Boosting with Maximum Adaptive Sampling, and , in: Proceedings of the Neural Information Processing Systems Conference, 2011 |
Building 'directional corpora' for unbiased contrastive analysis, and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 29-30, 2011 |
|
Combined Estimation of Location and Body Pose in Surveillance Video, , and , in: AVSS, 2011 |
|
Competition on Counter Measures to 2-D Facial Spoofing Attacks, , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011 |
|
Contextual grouping: discovering real-life interaction types from longitudinal Bluetooth data, and , in: 12th International Conference on Mobile Data Management, 2011 |
|
Counter-Measures to Photo Attacks in Face Recognition: a public database and a baseline, and , in: International Joint Conference on Biometrics 2011, 2011 |
[URL] |
Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech, and , in: Proceedings of Interspeech, Florence, Italy, 2011 |
|
Deep Learning for Efficient Discriminative Parsing, , in: International Conference on Artificial Intelligence and Statistics, 2011 |
|
Detection-Based Multi-Human Tracking Using a CRF Model, , and , in: The Eleventh IEEE International Workshop on Visual Surveillance, 2011 |
|
Disambiguating discourse connectives using parallel corpora: senses vs. translations, , , , , and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 104-105, 2011 |
|
Disambiguating Temporal-Contrastive Discourse Connectives for Machine Translation, , in: Proceedings of ACL-HLT 2011 Student Session, Association for Computational Linguistics, Portland, OR, pages 46--51, 2011 |
|
Engagement-based Multi-party Dialog with a Humanoid Robot, , , , , , and , in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011 |
|
Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, , in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011 |
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 525-530, IEEE, 2011 |
|
Exploiting observers' judgements for nonverbal group interaction analysis, , and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 6, IEEE, 2011 |
|
Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2011 |
|
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011 |
|
Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, , and , in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011 |
|
Finding Audio-Visual Events in Informal Social Gatherings, , , and , in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011 |
|
FlowBoost - Appearance Learning from Sparsely Annotated Video, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2011 |
Grapheme-based Automatic Speech Recognition using KL-HMM, , , and , in: Proceedings of Interspeech, 2011 |
|
GroupUs: Smartphone Proximity Data and Human Interaction Type Mining, and , in: 15th annual International Symposium on Wearable Computers, San Francisco, USA, 2011 |
|
HEAT: Iterative Relevance Feedback with One Million Images, and , in: Proceedings of the IEEE International Conference on Computer Vision, pages 2118-2125, 2011 |
Hierarchical Tandem Features for ASR in Mandarin, , and , in: Proceedings of Interspeech, 2011 |
How Comparable are Parallel Corpora? Measuring the Distribution of General Vocabulary and Connectives, , , and , in: Proceedings of 4th Workshop on Building and Using Comparable Corpora, ACL, Portland, OR, pages 78--86, 2011 |
|
Humans as Feature Extractors: Combining Prosody and Personality Perception for Better Speaking Style Recognition, and , in: Proceeding of IEEE Int Conference on Systems, Man, and Cybernetics - Special Sessions, 2011 |
|
Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, , in: Proceedings European Signal Processing Conference, Barcelona, Spain, 2011 |
|
Improving Articulatory Feature and Phoneme Recognition using Multitask Learning, and , in: Artificial Neural Networks and Machine Learning - ICANN 2011, pages 299-306, Springer Berlin / Heidelberg, 2011 |
[DOI] [URL] |
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , in: Proceedings of Interspeech, Florence, Italy, pages 537-540, 2011 |
|
Inferring truth from multiple annotators for social interaction analysis, , and , in: Neural Information Processing Systems (NIPS) Workshop on Modeling Human Communication Dynamics (HCD), pages 4, 2011 |
|
Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings, and , in: Interspeech, Florence, Italy, pages 953-956, 2011 |
|
Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pages 5192 - 5195, 2011 |
[DOI] |
Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, , , and , in: International Joint Conference on Biometrics, 2011 |
Joint Adaptive Colour Modelling and Skin, Hair and Clothing Segmentation Using Coherent Probabilistic Index Maps, and , in: British Machine Vision Conference, British Machine Vision Association, Dundee, UK, 2011 |
|
Joint Optimization of Hidden Conditional Random Fields and Non Linear Feature Extraction, , and , in: Proceedings of International Conference on Document Analysis and Recognition, 2011 |
Just-in-Time Multimodal Association and Fusion from Home Entertainment, , , and , in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011 |
|
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prag, CZ, pages 5012-5015, 2011 |
|
Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus, and , in: Proceedings of Interspeech, 2011 |
|
Learning Structured Embeddings of Knowledge Bases, , , and , in: Conference on Artificial Intelligence, 2011 |
|
Look at who's talking, , , , and , in: Proceedings of International Conference on Ambient Intelligence, pages 68-76, 2011 |
LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, , and , in: Interspeech, 2011 |
|
Machine learning techniques to analyse complex, computer vision-extracted, dynamic cellular phenotypes, , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
Model-based Compressive Sensing for Multi-party Distant Speech Recognition, , and , in: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 2011 |
|
Morphodynamic profiling to explore spatio-temporal signaling networks regulating neurite outgrowth, , , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
Multi-camera Open Space Human Activity Discovery for Anomaly Detection, , and , in: 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011 |
|
Multi-party Speech Recovery Exploiting Structured Sparsity Models, , , and , in: Proceedings of Interspeech, 2011 |
|
Multiclass Transfer Learning from Unconstrained Priors, , and , in: Proceedings of the 13th International Conference on Computer Vision, 2011 |
|
Multilingual Annotation and Disambiguation of Discourse Connectives for Machine Translation, , , and , in: Proceedings of 12th SIGdial Meeting on Discourse and Dialogue, Association for Computational Linguistics, Portland, OR, pages 194--203, 2011 |
|
MULTISTREAM SPEAKER DIARIZATION THROUGH INFORMATION BOTTLENECK SYSTEM OUTPUTS COMBINATION, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011 |
|
New world, New Worlds: Visual Analysis of Pre-Columbian Pictorial Collections., , , and , in: Proceedings of the International Workshop on Multimedia for Cultural Heritage, Modena, Italy., Springer CCIS series book, 2011 |
|
People-Centric Mobile Sensing with a Pragmatic Twist: from Behavioral Data Points to Active User Involvement, , and , in: International Conference on Human-Computer Interaction with Mobile Devices and Services, 2011 |
|
Pervasive Sensing to Model Political Opinions in Face-to-Face Networks, , , and , in: Pervasive, San Francisco, 2011 |
|
Phoneme Recognition using Boosted Binary Features, , and , in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011 |
|
Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation, and , in: Proceedings of Interspeech, Florence, Italy, 2011 |
|
Posterior Features for Template-based ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, 2011 |
|
Recent Developments in Social Signal Processing, , and , in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 380-385, 2011 |
Searching the Past: An Improved Shape Descriptor to Retrieve Maya Hieroglyphs., , , and , in: Proceedings of the ACM International Conference in Multimedia, Scottsdale, USA, ACM, 2011 |
|
Smartphone usage in the wild: a large-scale analysis of applications and context, , and , in: 13th International Conference on Multimodal Interaction, 2011 |
|
Social Focus of Attention as a Time Function Derived from Multimodal Signals, and , in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011 |
|
Speaker Diarization of Meetings based on Speaker Role N-gram Models, , and , in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011 |
|
Tasting Families of Features for Image Classification, and , in: International Conference on Computer Vision, 2011 |
|
The Kaldi Speech Recognition Toolkit, , , , , , , , , , , , and , in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011 |
|
The MASH Project, , , and , in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011 |
|
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , in: International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Torch7: A Matlab-like Environment for Machine Learning, , and , in: BigLearn, NIPS Workshop, 2011 |
|
Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, , , , , and , in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011 |
Towards semi-supervised learning of semantic spatial concepts, and , in: IEEE International Conference on Robotics and Automation, 2011 |
|
Tracking Multiple Objects under Global Appearance Constraints, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2011 |
Transferring Activities: Updating Human Behavior Analysis, , , , and , in: Visual Surveillance Workshop at ICCV, 2011 |
|
Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, and , in: Proceedings of the 28th International Conference on Machine Learning, 2011 |
|
Understanding Social Signals in Multi-party Conversations: Automatic Recognition of Socio-Emotional Roles in the AMI Meeting Corpus, , , and , in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 374-379, 2011 |
Using a Wikipedia-based Semantic Relatedness Measure for Document Clustering., and , in: Graph-based Methods for Natural Language Processing, 2011 |
|
Who's Who with Big-Five: Analyzing and Classifying Personality Traits with Smartphones, , and , in: International Symposium on Wearable Computing, pages 8, 2011 |
|
You Are Known by How You Vlog: Personality Impressions and Nonverbal Behavior in YouTube, , and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Barcelona, 2011 |
|
2010
A Comparative Study of MLP Front-ends for Mandarin ASR, , , , and , in: Proceedings of Interspeech, Japan, 2010 |
|
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010 |
|
A Multi Cue Discriminative Approach to Semantic Place Classification, , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
|
A Multimodal Corpus for Studying Dominance in Small Group Conversations, , and , in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010 |
|
A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, and , in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010 ), Carnegie Mellon University, Pittsburgh, PA, USA, 2010 |
|
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, , and , in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010 |
|
Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , in: Proceedings of Interspeech, 2010 |
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010 |
|
An Alternative Scanning Strategy to Detect Faces, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Analysis of Phone Posterior Feature Space Exploiting Class Specific Sparsity and MLP-based Similarity Measure, , and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Application of Out-Of-Language Detection To Spoken-Term Detection, and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Audio–Visual Synchronisation for Speaker Diarisation, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010 |
|
Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives, , , , , and , in: Proceedings of the 33rd Annual ACM SIGIR Conference, Geneva, Switzerland, pages 703, 2010 |
[DOI] |
Automatic Role Recognition Based on Conversational and Prosodic Behaviour, , , and , in: Proceedings of the ACM International Conference on Multimedia, 2010 |
|
Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, , and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010 |
|
By their apps you shall understand them: mining large-scale patterns of mobile phone usage, and , in: The 9th International Conference on Mobile and Ubiquitous Multimedia, 2010 |
|
Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010 |
|
Delineating Trees in Noisy 2D Images and 3D Image Stacks, , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2799–2806, 2010 |
Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain, , , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
|
Discovering Human Places of Interest from Multimodal Mobile Phone Data, and , in: Proceedings of 9th International Conference on on Mobile and Ubiquitous Multimedia (MUM,',','), Limassol, Cyprus, 2010 |
|
English Spoken Term Detection in Multilingual Recordings, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010 |
|
Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, , , and , in: ICASSP 2010, 2010 |
|
Extracting Motifs from Time Series Generated by Concurrent Activities., , and , in: NIPS workshop on Learning and Planning from Batch Time Series Data, 2010 |
|
Fast Bounding Box Estimation based Face Detection, and , in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010 |
[URL] |
Floor Holder Detection and End of Speaker Turn Prediction in Meetings, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010 |
|
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010, Istanbul, Turkey, 2010 |
|
Hands Free Audio Analysis from Home Entertainment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Hierarchical Multilayer Perceptron based Language Identification, , and , in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010 |
|
Identifying Emergent Leadership in Small Groups using Nonverbal Communicative Cues, , , and , in: Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, Beijing, ACM New York, NY, USA ©2010, 2010 |
[DOI] |
Implementation of VTLN for Statistical Speech Synthesis, , , and , in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
|
Improving Speech Processing trough Social Signals: Automatic Speaker Segmentation of Political Debates using Role based Turn-Taking Patterns., and , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010 |
|
Introducing Crossmodal Biometrics:Person Identification from Distinct Audio & Visual Streams, and , in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010 |
|
Joint Cascade Optimization Using a Product Of Boosted Classifiers, and , in: Proceedings of the Neural Information Processing Systems Conference, pages 1315–1323, 2010 |
Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, , and , in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010 |
Learning from Candidate Labeling Sets, and , in: Advances in Neural Information Processing Systems 23 (NIPS10), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2010 |
|
Leveraging speaker diarization for meeting recognition from distant microphones, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390--4393, 2010 |
Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, and , in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010 |
|
Mobile Social Signal Processing: vision and research issues, , and , in: Proceedings of the International Workshop on Mobile HCI, Lisbon, pages 513-516, 2010 |
|
Multistream Speaker Diarization beyond Two Acoustic Feature Streams, , and , in: International Conference on Acoustics, Speech, and Signal Processing, 2010 |
|
Neural conditional random fields, and , in: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna, Sardinia, Italy, JMLR: W&CP, 2010 |
|
Object Recognition using Visuo-Affordance Maps, , , and , in: International Conference on Intelligent Robots and Systems, Taipei, pages 1572-1578, IEEE, 2010 |
[DOI] |
OM-2: An Online Multi-class Multi-kernel Learning Algorithm, , , , and , in: In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop, 2010 |
|
Online-Batch Strongly Convex Multi Kernel Learning, , and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010 |
|
Personalising speech-to-speech translation in the EMIME project, , , , , , , , , , , , , , , , , , and , in: Proceedings of the ACL 2010 System Demonstrations, Association for Computational Linguistics, Uppsala, Sweden, 2010 |
[URL] |
Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, , and , in: BMVC 2010, Aberystwyth University, Aberystwyth, BMVA Press, 2010 |
|
Recognizing conversational context in group interaction using privacy-sensitive mobile sensors, , , and , in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010 |
|
Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer, , and , in: Proceedings of IEEE Computer Vision and Pattern Recognition Conference, San Francisco, CA, pages 3081-3088, IEEE, 2010 |
[DOI] |
Single Channel Speech Separation with a Frame-based Pitch Range Estimation Method in Modulation Frequency, , , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
|
Social Signal Processing: Understanding Nonverbal Communication in Social Interactions, and , in: Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands), 2010 |
|
Socially Intelligent Surveillance and Monitoring: Analysing Social Dimensions of Physical Space, , and , in: Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, San Francisco, pages 51-58, 2010 |
|
Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project, , , , , , , , , , , , , , , , , , and , in: Proceedings of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
|
Speech Enhancement using an Improved MMSE Estimator with Laplacian Prior, , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
|
The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites, , , and , in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010 |
|
The AMIDA 2009 Meeting Transcription System, , , , , , , , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
The Robot Vision Track at ImageCLEF 2010, , , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
[URL] |
The Voice of Personality: Mapping Nonverbal Vocal Behavior into Trait Attributions, , and , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010 |
|
The Wolf Corpus: Exploring group behaviour in a competitive role-playing game, and , in: ACM Multimedia, 2010 |
|
Towards a quantitative measure of rareness, and , in: DIRAC Workshop at the European Conference on Machine Learning, pages 129-136, Springer Berlin Heidelberg, 2010 |
[DOI] |
Towards a standard for dialogue act annotation, , , , , , , , , , , and , in: 7th International Conference on Language Resources and Evaluation, Malta, 2010 |
[URL] |
Towards mixed language speech recognition systems, , and , in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010 |
|
Towards rich mobile phone datasets: Lausanne data collection campaign, , , , and , in: Proc. ACM Int. Conf. on Pervasive Services (ICPS,',','), Berlin., 2010 |
|
Tracter: A Lightweight Dataflow Framework, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Using Audio and Visual Cues for Speaker Diarisation Initialisation, and , in: International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
VARIATIONAL BAYESIAN SPEAKER DIARIZATION OF MEETING RECORDINGS, , and , in: Proceedings of ICASSP, 2010 |
|
View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, and , in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010 |
|
Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, and , in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010 |
|
Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010 |
|
Voices of Vlogging, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010 |
|
VTLN Adaptation for Statistical Speech Synthesis, , , and , in: Proceedings of ICASSP, Dallas, Texas, 2010 |
|
2009
A Multimedia Retrieval System Using Speech Input, , , , , , , , , , and , in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009 |
|
A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems, , , and , in: International Conference on Developmental Learning, 2009 |
|
An online framework for learning novel concepts over multiple cues, , and , in: Proceeding of The 9th Asian Conference on Computer Vision, Xi'an, China, 2009 |
|
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009 |
[URL] |
Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, , and , in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, ISCA 2009, 2009 |
|
Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, , in: 10thAnnual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009 |
|
Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models, , and , in: ACM International Conference on Multimedia, 2009 |
|
Automatic vs. human question answering over multimedia meeting recordings, and , in: 10th Annual Conference of the International Speech Communication Association, 2009 |
|
Bayesian Networks to Combine Intensity and Color Information in Face Recognition, and , in: International Conference on Biometrics, Springer, 2009 |
|
Canal9: A database of political debates for analysis of social interactions, , , and , in: Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing), Amsterdam, Netherlands, 2009 |
[DOI] |
Characterising Conversationsal Group Dynamics Using Nonverbal Behaviour, , and , in: Proceedings ICME 2009, 2009 |
|
Discovering Group Nonverbal Conversational Patterns with Topics, and , in: Proceedings ICMI-MLMI, 2009 |
|
Dynamic Partitioned Sampling For Tracking With Discriminative Features, , and , in: Proceedings of the British Maschine Vision Conference, London, 2009 |
|
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
Evaluation of Probabilistic Occupancy Map People Detection for Surveillance Systems, , , , and , in: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009 |
Flickr Hypergroups, , , , and , in: Proceedings of the 17th ACM International Conference on Multimedia, 2009 |
|
Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, and , in: British Machine Vision Conference 2009, 2009 |
|
Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, , , and , in: Proceedings of the 10thAnnual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009 |
|
Hill-Climbing Attack to an Eigenface-Based Face Verification System, , , , and , in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009 |
|
Implicit Human Centered Tagging, , and , in: Proceedings of IEEE Conference on Multimedia and Expo, 2009 |
|
Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, , , and , in: Proceedings of Interspeech 2009, 2009 |
|
Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, , , and , in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009 |
|
Joint Pose Estimator and Feature Learning for Object Detection, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2009 |
KL Realignment for Speaker Diarization with Multiple Feature Streams, , and , in: 10th Annual Conference of the International Speech Communication Association, 2009 |
Learning and Predicting Multimodal Daily Life Patterns from Cell Phones, and , in: ICMI-MLMI, 2009 |
|
Learning Large Margin Likelihood for Realtime Head Pose Tracking, and , in: IEEE Int. Conference on Image Processing, Cairo, Egypt, IEEE, 2009 |
|
Learning Rotational Features for Filament Detection, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2009 |
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , in: Audio Engineering Society (AES,',','), 127th Convention, Audio Engineering Society (AES), Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA;, 2009 |
[URL] |
Measuring the gap between HMM-based ASR and TTS, , and , in: Proceedings of Interspeech, Brighton, U.K., 2009 |
|
Memoirs of Togetherness from Audio Logs, , in: Proceedings International ICST Conference on User Centric Media, Venice, Italy, 2009 |
|
MLP Based Hierarchical System for Task Adaptation in ASR, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009 |
|
Model adaptation with least-square SVM for adaptive hand prosthetics, , , , and , in: IEEE International conference on Robotics and Automation, 2009 |
|
MULTI-MODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING COMPRESSED-DOMAIN VIDEO FEATURES, , and , in: International Conference on Audio, Speech and Signal Processing, 2009 |
|
MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009 |
|
Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, , and , in: Proceedings of International conference on acoustics speech and signal processing, 2009 |
Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009 |
|
Out-of-Scene AV Data Detection, , in: Proceedings IADIS International Conference Applied Computing, Rome, Italy, 2009 |
|
Overview of the CLEF 2009 medical image annotation track, , , , and , in: Workshop of the Cross-Language Evaluation Forum, Corfu, Greece, pages 85-93, Springer Berlin Heidelberg, 2009 |
[DOI] |
Parts-Based Face Verification using Local Frequency Bands, and , in: in Proceedings of IEEE/IAPR International Conference on Biometrics, 2009 |
|
Posterior features applied to speech recognition tasks with user-defined vocabulary, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009 |
|
Predicting Remote Versus Collocated Group Interactions using Nonverbal Cues, , and , in: Proc. Int. Conf. on Multimodal Interfaces, Workshop on Multimodal Sensor-Based Systems and Mobile Phones for Social Computing,, Cambridge, 2009 |
[DOI] |
Real-Time ASR from Meetings, , , , , , , , and , in: Proceedings of Interspeech, Brighton, UK., 2009 |
|
Retrieving Ancient Maya Glyphs with Shape Context, , , and , in: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, Kyoto, Japan, IEEE, 2009 |
|
Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks, , , , , and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, 2009 |
|
Robust Speaker Diarization for Short Speech Recordings, and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, pages 432-437, 2009 |
|
Robustness of Phase based Features for Speaker Recognition, , and , in: Proceedings of Interspeech, 2009 |
|
SNR Features for Automatic Speech Recognition, , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009 |
|
Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings, , in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), 2009 |
|
Speaker Change Detection with Privacy-Preserving Audio Cues, , , and , in: Proceedings of ICMI-MLMI 2009, 2009 |
|
Speech recognition with speech synthesis models by marginalising over decision tree leaves, , and , in: Proceedings of Interspeech, Brighton, U.K., 2009 |
|
Steerable Features for Statistical 3D Dendrite Detection, , , , and , in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention, 2009 |
Structure and appearance features for robust 3D facial actions tracking, and , in: IEEE Proc. Int. Conf. on Multimedia and Expo, IEEE, 2009 |
|
The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories, and , in: British Machine Vision Conference, 2009 |
|
Topic Models for Scene Analysis and Abnormality Detection, and , in: 9th International Workshop in Visual Surveillance, IEEE, Kyoto, Japan, IEEE, 2009 |
|
Towards a theoretical framework for learning multi-modal patterns for embodied agents, , , , , , , and , in: International Conference on Image Analysis and Processing, 2009 |
|
Visual Activity Context For Focus of Attention Estimation in Dynamic Meetings, , and , in: International Conference on Multimedia & Expo, 2009 |
|
Visual Speaker Localization Aided by Acoustic Models, , and , in: ACM Multimedia, 2009 |
Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009 |
|
Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior, and , in: Proceedings of the 17th ACM International Conference on Multimedia, ACM, 2009 |
|
Who's Doing What: Joint Modeling of Names and Verbs for Simultaneous Face and Pose Annotation, , and , in: Advances in Neural Information Processing Systems 22 (NIPS09), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2009 |
|
YOU ARE FIRED! NONVERBAL ROLE ANALYSIS IN COMPETITIVE MEETINGS, , and , in: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP,',','), Taiwan., 2009 |
|
You live, you learn, you forget: continuous learning of visual places with a forgetting mechanism, , and , in: International Conference on Robotic and Systems, 2009 |
2008
A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, , , , and , in: 3rd ACM/IEEE Conf on Human-Robot Interaction (HRI08), 2008 |
|
A Distance Model for Rhythms, , , and , in: 25th International Conference on Machine Learning (ICML), 2008 |
|
Adaptive Beamforming with a Maximum Negentropy Criterion, , , , and , in: Proceedings of the Joint Workshop on Hands-free Speech Communication and Microphone Arrays, Italy, 2008 |
|
An SVM Confidence-Based Approach to Medical Image Annotation, , and , in: Workshop of the Cross-Language Evaluation Forum, 2008 |
|
Analyzing Flickr Groups, and , in: Proc. of the Intl. Conf. on Image and Video Retrieval, ACM, 2008 |
Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, , , , and , in: Int Conf Spatial Cognition 2008, 2008 |
|
Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, , , and , in: First IEEE Workshop on CVPR for Human Communicative Behavior Analysis, 2008 |
|
Asynchronous detection and classification of oscillatory brain activity, , and , in: 16 European Signal Processing Conference, 2008 |
|
Automated Delineation of Dendritic Networks in Noisy Image Stacks, , and , in: proceedings of the European Conference on Computer Vision, 2008 |
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , in: AES 124th Convention, Audio Engineering Society, 2008 |
|
Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree, , , , , , and , in: Advances in Neural Information Processing Systems 21, 2008 |
|
Biologically Motivated Audio-Visual Cue Integration for Object, , , , , , , , and , in: Proceedings of the first Internatinal Conference on Cognitive Systems, 2008 |
|
Brain-Computer Interfaces for HCI and Games, , , , , and , in: Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, 2008 |
|
Calibration from statistical properties of the visual world, , and , in: European Conf. on Computer Vision, 2008 |
|
COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , in: Proceedings of Interspeech, 2008 |
|
Composite Kernel Learning, , and , in: Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), Omnipress, 2008 |
|
Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, , , , , , and , in: In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008 |
|
Cue Integration for Medical Image Annotation, , and , in: Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Springer-Verlag, 2008 |
|
Daily Routine Classification from Mobile Phone Data, and , in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), 2008 |
|
Detecting queues at vending machines: a statistical layered approach, and , in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008 |
|
Discovering Human Routines from Cell Phone Data with Topic Models, and , in: IEEE International Symposium on Wearable Computers (ISWC), 2008 |
|
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , in: Proc. 16th European Signal Processing Conference (EUSIPCO), 2008 |
|
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , in: "IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)", 2008 |
|
Exploiting Contextual Information for Speech/Non-Speech Detection, , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , in: Workshop on Searching Spontaneous Conversational Speech at SIGIR, 2008 |
|
Fast human detection from videos using covariance features, and , in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008 |
|
Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, , , , , and , in: Proceedings of ICASSP 2008, Las Vegas, USA, 2008 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , in: Interspeech 2008, 2008 |
|
Graphical representation of meetings on mobile devices, , and , in: MobileHCI 2008 (10th International Conference on Human-Computer Interaction with Mobile Devices and Services, Demonstrations Session), Amsterdam, 2008 |
|
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Hierarchical Integration of Phonetic and Lexical Knowledge in Phone Posterior Estimation, and , in: ICASSP'08, 2008 |
|
Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , in: MLMI 2008, 2008 |
|
Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, , and , in: Interspeech 2008, 2008 |
|
Identifying Dominant People in Meetings from Audio-Visual Sensors, and , in: International Conference on Automatic Face and Gesture Recognition, Amsterdam, The Netherlands, 2008 |
|
Improving Contextual Quality Models for MT Evaluation Based on Evaluators' Feedback, , and , in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008 |
In-Context Phone Posteriors as Complementary Features for Tandem ASR, and , in: ICSLP'08, 2008 |
|
Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, , and , in: Interspeech 2008, 2008 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , in: Interspeech 2008, 2008 |
|
Investigating Automatic Dominance Estimation in Groups From Visual Attention and Speaking Activity, , , , and , in: International Conference on Multi-modal Interfaces, 2008 |
|
Maximum kurtosis beamforming with the generalized sidelobe canceller, , , , , and , in: Proceedings of INTERSPEECH, September 2008, Brisbane, Australia, 2008 |
|
Multi-camera 3d person tracking with particle filter in a surveillance environment, and , in: 16th European Signal processing Conference (EUSIPCO), 2008 |
|
Multi-camera multi-person 3d space tracking with mcmc in surveillance scenarios, and , in: European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2), Marseille, 2008 |
|
Multi-Camera Tracking and Atypical Motion Detection with Behavioral Maps, , and , in: proceedings of the European Conference on Computer Vision, 2008 |
Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Object Category Detection using Audio-visual Cues, , , , and , in: International Conference on Computer Vision Systems (ICVS08), 2008 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , in: Interspeech 2008, 2008 |
|
Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, , , , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Predicting the Dominant Clique in Meetings through Fusion of Nonverbal Cues, , , and , in: ACM MM 2008, 2008 |
|
Predicting Two Facets of Social Verticality in Meetings from Five-Minute Time Slices and Nonverbal Cues, , , and , in: Proceedings - ICMI 2008, 2008 |
|
Principled Detection-by-classification from Multiple Views, , and , in: proceedings of the International Conference on Computer Vision Theory and Applications, 2008 |
Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, and , in: LangTech 2008, 2008 |
|
Recognition of Anticipatory Behavior from Human EEG, , and , in: In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008 |
|
Reference-based vs. task-based evaluation of human language technology, , in: LREC 2008 ELRA Workshop on Evaluation, ELRA, Marrakech, Morocco, 2008 |
|
Reverse Correlation for analyzing MLP Posterior Features in ASR, , and , in: 11th International Conference on Text, Speech, and Dialogue, 2008 |
|
Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, , , , and , in: ACM International Conference on Multimedia, Vancouver, Canada, 2008 |
|
Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, , , and , in: International Conference on Multimodal Interfaces, Chania, Greece, 2008 |
|
Silence Models in Weighted Finite-State Transducers, , in: Interspeech, 2008 |
|
Simultaneous Real-Time Detection of Motor Imagery and Error-Related Potentials for Improved BCI Accuracy, and , in: Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008 |
|
Social Signal Processing: State-of-the-Art and Future Perspectives of an Emerging Domain, , , and , in: Proceedings of the ACM International Conference on Multimedia, 2008 |
|
Social Signals, their Function, and Automatic Analysis: A Survey, , , and , in: Proceedings of International Conference on Multimodal Interfaces (to appear), 2008 |
|
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , in: INTERSPEECH 2008, 2008 |
|
Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, , and , in: EUSIPCO 2008, 2008 |
|
Support Vector Machines with a Reject Option, , , and , in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008 |
|
SVM-based Discriminative Accumulation Scheme for Place Recognition, , and , in: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA08), 2008 |
|
Task-based evaluation of meeting browsers: from BET task elicitation to user behavior analysis, , , and , in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008 |
|
Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings, , , , , , , and , in: Machine Learning for Multimodal Interaction V, Utrecht, Springer-Verlag, 2008 |
[DOI] |
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , in: Proceedings of the International Conference on Multimodal Interfaces, 2008 |
|
The Projectron: a Bounded Kernel-Based Perceptron, , and , in: Int. Conf. on Machine Learning, 2008 |
|
Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, , in: "Int. Conf. on Music Information Retrieval (ISMIR)", 2008 |
|
Topickr: Flickr Groups and Users Reloaded, and , in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008 |
Towards Audio-Visual On-line Diarization Of Participants In Group Meetings, and , in: European Conference on Computer Vision Workshop on Multi-camera and Multi-modal Sensor Fusion, 2008 |
|
Towards Robust Place Recognition for Robot Localization, , , , , and , in: IEEE International Conference on Robotics ad Automation, 2008 |
|
Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis, , , , , , and , in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Bejing, 2008 |
|
Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, and , in: International Conference on Multi-media & Expo, 2008 |
|
What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, and , in: ACM International Conference on Multimedia (ACMMM), 2008 |
|
2007
A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, and , in: International Conference on Multi-Media & Expo (ICME07), 2007 |
|
A Generative Model for Rhythms, , , and , in: NIPS Workshop on Brain, Music and Cognition, 2007 |
|
A supervised learning approach based on STDP and polychronization in spiking neuron networks, , and , in: European Symposium on Artificial Neural Networks, ESANN, 2007 |
|
Adaptive Shared Control of a Brain-Actuated Simulated Wheelchair, , , , , , , and , in: Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, 2007 |
|
AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2007 |
|
An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
An Asynchronous and Non-Invasive Brain-Actuated Wheelchair, , , , , , , and , in: Proceedings of the 13th International Symposium on Robotics Research, 2007 |
|
Augmenting Astronaut's Capabilities through Brain-Machine Interfaces, , , and , in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007 |
|
Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, , , and , in: 3rd European Conference on Mobile Robots (ECMR 2007), 2007 |
|
Biometric Person Authentication IS A Multiple Classifier Problem, and , in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007 |
|
Brain-Machine Interfaces through Control of Electroencephalographic Signals and Vibrotactile Feedback, , , , , , , , and , in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007 |
|
Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, and , in: ACM International Conference on Multimedia, 2007 |
|
CLEF2007 Image Annotation Task: an SVM-based Cue Integration Approach, , and , in: Proceedings of ImageCLEF 2007 -LNCS, 2007 |
|
Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Confidence-based Cue Integration for Visual Place Recognition, and , in: IEEE International Conference on Intelligent RObot Systems (IROS), 2007 |
|
Detection and Recognition of Number Sequences in Spoken Utterances, and , in: 2nd Workshop on Speech in Mobile and Pervasive Environments (SiMPE), 2007 |
|
Discriminative Keyword Spotting, , and , in: Workshop on Non-Linear Speech Processing, Paris, France, 2007 |
|
EEG-Based Brain-Computer Interaction: Improved Accuracy by Automatic Single-Trial Error Detection, and , in: Advances in Neural Information Processing Systems 21, 2007 |
|
ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007 |
Face Authentication with Salient Local Features and Static Bayesian Network, and , in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007 |
|
Feature Extraction for Multi-Class BCI using Canonical Variates Analysis, , , , and , in: Proceedings of the IEEE International Symposium on Intelligent Signal Processing, 2007 |
|
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding, , , and , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007 |
|
Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , in: Interspeech 2007, 2007 |
|
Hierarchical Penalization, , and , in: Advances in Neural Information Processing Systems 21, 2007 |
|
Incremental Learning for Place Recognition in Dynamic Environments, , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS07), 2007 |
|
Indoor Place Recognition using Online Independent Support Vector Machines, , , , and , in: 18th British Machine Vision Conference (BMVC07), 2007 |
|
Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, and , in: International Conference on Speech Communication and Technology (INTERSPEECH), 2007 |
|
More Efficiency in Multiple Kernel Learning, , , and , in: International Conference on Machine Learning (ICML), 2007 |
|
Multi-Layer Background Subtraction Based on Color and Texture, and , in: CVPR 2007 Workshop on Visual Surveillance (VS2007), 2007 |
|
Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, , and , in: Interspeech 2007, 2007 |
|
Non-Invasive Brain-Actuated Interaction, , , , and , in: Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence, 2007 |
|
Non-linear Spectral Contrast Stretching for In-car Speech Recognition, and , in: Interspeech-Eurospeech # to appear in html, 2007 |
|
Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes, , , and , in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2007 |
|
On Confusions in a Phoneme Recognizer, , and , 2007 |
|
Posterior-Based Features and Distances in Template Matching for Speech Recognition, and , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007 |
|
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, and , in: Classification of Events, Activities and Relationship Evaluation and Workshop, 2007 |
|
Recognition and Understanding of Meetings The AMI and AMIDA Projects, , and , in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'07, 2007 |
|
Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, , and , in: IEEE International Conference on Multimedia and Expo (ICME), 2007 |
|
Sparse Probabilistic Classifiers, and , in: International Conference on Machine Learning (ICML), 2007 |
|
SVM-based Transfer of Visual Knowledge Across Robotic Platforms, , and , in: International Conference on Computer Vision Systems (ICVS07), 2007 |
|
The use of brain-computer interfacing for ambient intelligence, , , , , and , in: In the book, Constructing Ambient Intelligence: AmI-07 Workshops Proceedings, Max M\:uhlh\:auser, Alois Ferscha, and Erwin Aitenbichler (Eds.,',','), LNCS, Springer Verlag, 2008., 2007 |
|
To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, , and , in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007 |
|
Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, , , , , , , , and , in: "", 2007 |
|
Vibrotactile Feedback in the Context of Mu-Rhythm based BCI, , , , , , , , , , , and , in: Proceedings of the 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2007 |
|
Visuo-Spatial Attention Frame Recognition for Brain-Computer Interfaces, , , , , , and , in: Proceedings of the 1st International Conference on Cognitive Neurodynamics, 2007 |
|
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
2006
2D Multi-Person Tracking: A Comparative Study in AMI Meetings, , , , , and , in: Classification of Events, Activities, and Relationships (CLEAR) 2006, 2006 |
|
A Discriminative Approach for the Retrieval of Images from Text Queries, , and , in: European Conference on Machine Learning (ECML), 2006 |
|
A Discriminative Approach to Robust Visual Place Recognition, , , and , in: IEEE International Conference on Intelligent RObot Systems (IROS), 2006 |
|
A Max Kernel For Text-Independent Speaker Verification Systems, and , in: Second Workshop on Multimodal User Authentication, MMUA, 2006 |
|
A Neural Network to Retrieve Images from Text Queries, and , in: International Conference on Artificial Neural Networks (ICANN), 2006 |
|
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, and , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006 |
|
Analyzing Group Interactions in Conversations: a Review, , in: IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI), 2006 |
|
Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, , , and , in: Workshop on Multimodal User Authentication (MMUA), 2006 |
|
Constructing visual models with a latent space approach, , , and , in: the Springer series of Lecture Notes in Computer Science, 2006 |
|
Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, , and , in: Proceedings of International Conference on Pattern Recognition (ICPR), 2006 |
|
Detecting Abandoned Luggage Items in a Public Space, , and , in: IEEE Performance Evaluation of Tracking and Surveillance Workshop (PETS), 2006 |
|
Detection and Application of Influence Rankings in Small Group Meetings, , , and , in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006 |
|
Discriminant linear processing of time-frequency plane, and , in: International Conference on Spoken Language Processing, 2006 |
|
Discriminative Kernel-Based Phoneme Sequence Recognition, , , , and , in: The 9th International Conference on Spoken Language Processing (INTERSPEECH), Pittsburgh, PA, 2006 |
|
Exploring Contextual Information in a Layered Framework for Group Action Recognition, , and , in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006 |
|
Face Authentication Using Adapted Local Binary Pattern Histograms, and , in: 9th European Conference on Computer Vision (ECCV), 2006 |
|
Finding groups of people in Google news, and , in: ACM Int. Conf. on Human-Centered Multimedia (HCM), 2006 |
|
Hand Posture Classification and Recognition using the Modified Census Transform, , and , in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006 |
|
Haptic Feedback Compared with Visual Feedback for BCI, , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
High Frequency Bands and Estimated Local Field Potentials to Improve Single-Trial Classification of Electroencephalographic Signals, , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Indexation de Documents Manuscrits, , in: Proceedings du Colloque International Francophone sur l'Ecrit et le Document (CIFED06), 2006 |
|
Infinite Models for Speaker Clustering, , in: International Conference on Spoken Language Processing, 2006 |
|
Integrating co-occurrence and spatial contexts on patch-based scene segmentation, , , and , in: Beyond Patches Workshop, in conjunction with CVPR, 2006 |
|
Investigating Lexical Substitution Scoring for Subtitle Generation, , , , and , in: Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL)., 2006 |
|
Juicer: A Weighted Finite-State Transducer speech decoder, , , , , and , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine LEarning Algorithms MLMI'06, 2006 |
|
Kernel Methods for Melanoma Recognition, , and , in: Medical Informatics in Europe (MIE), 2006 |
|
Kernel Methods for Melanoma Recognition, , and , in: Proceedings of Workshop on Computer Vision Approaches to Medical Image Analysis (CVAMIA) 2006), 2006 |
|
Learning to Retrieve Images from Text Queries with a Discriminative Model, , and , in: International Workshop on Adaptive Multimedia Retrieval (AMR), 2006 |
|
Local Binary Patterns as an Image Preprocessing for Face Authentication, , and , in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006 |
|
Melanoma Recognition Using Representative and Discriminative Kernel Classifiers, , and , in: International Workshop on Computer Vision Applications for Medical Image Analysis, 2006 |
|
Modeling Interactions from Email Communication, , and , in: Proc. IEEE International Conference on Multimedia & Expo (ICME,',','), 2006, 2006 |
|
Multi-Person Tracking in Meetings: A Comparative Study, , , , , and , in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006 |
|
Multi-stream ASR: An Oracle Perspective, , and , in: Proceedings of ISCA International Conference on Spoken Language Processing (ICSLP), 2006 |
|
Natural Scene Image Modeling using Color and Texture Visterms., and , in: Conference on Image and Video Retrieval CIVR, 2006 |
|
Nearly optimal exploration-exploitation decision thresholds, , in: Int. Conf. on Artificial Neural Networks (ICANN), 2006 |
|
Non-Invasive Brain Computer Interface for Mental Control of a Simulated Wheelchair, , , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Online Classifier Adaptation in High Frequency EEG, , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Posterior Based Keyword Spotting with A Priori Thresholds, , , and , in: International Conference on Spoken Language Processing (ICSLP), 2006 |
|
Prospects on Brain-Machine Interfaces for Space System Control, , , , , , , , , , , , , , , , , and , in: Proceedings of the 57th International Astronautical Conference, 2006 |
|
Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, , and , in: Multimodal User Authentication (MMUA), 2006 |
|
Sociometry Based Multiparty Audio Recordings Segmentation, , in: Proceedings of the IEEE Conference on Multimedia and Expo (ICME 2006), 2006 |
|
Sociometry Based Multiparty Audio Recordings Summarization, , in: Proceedings of International Conference on Pattern Recognition (ICPR 2006), 2006 |
|
Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2006 |
|
Speech Coding based on Spectral Dynamics, , , and , in: Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2006 |
|
The More you Learn, the Less you Store: Memory-Controlled Incremental SVM, and , in: Proceedings of International Cognitive Vision Workshop (ICVW) 2006), 2006 |
|
The segmentation of multi-channel meeting recordings for automatic speech recognition, , and , in: Int. Conf. on Spoken Language Processing (Interspeech ICSLP), 2006 |
|
Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, , and , in: Proceedings of ICASSP 2006, 2006 |
|
Tracking the Multi Person Wandering Visual Focus of Attention, , , and , in: International Conference on Multimodal Interfaces (ICMI06), 2006 |
|
Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, and , in: NIPS, 2006 |
|
Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006 |
|
Using more informative posterior probabilities for speech recognition, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006 |
|
Using Pitch as Prior Knowledge in Template-Based Speech Recognition, , and , in: Proceedings of ICASSP, 2006, 2006 |
|
Using Posterior-Based Features in Template Matching for Speech Recognition, , and , in: International Conference on Spoken Language Processing, 2006 |
|
Writer Identification for Smart Meeting Room Systems, , , , , and , in: Seventh IAPR Workshop on Document Analysis Systems, DAS, 2006 |
|
2005
A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, , , and , in: Proceedings of the 22nd International Conference on Machine Learning, 2005 |
|
A Meeting Browser Evaluation Test, , , and , in: CHI '92: Proceedings of the SIGCHI conference on Human factors in computing systems, Portland, OR, USA, ACM Press, 2005 |
|
A Neural Network for Text Representation, and , in: International Conference on Artificial Neural Networks, ICANN, 2005 |
|
A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, , and , in: Advances in Neural Information Processing Systems, NIPS 15, 2005 |
|
A Probabilistic Model for Chord Progressions, , and , in: Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR), 2005 |
|
A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, and , in: ACM ICMI Workshop on Multimodal Multiparty Meeting Processing (MMMP), 2005 |
|
A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, and , in: Proceedings of ICASSP 2005, 2005 |
|
A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR, , and , in: Proceedings of INTERSPEECH 2005, 2005 |
|
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , in: Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag, 2005 |
|
Benchmarking Non-Parametric Statistical Tests, , and , in: Advances in Neural Information Processing Systems, NIPS 18. MIT Press, 2005 |
|
Boosting word error rates, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2005 |
Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
Detecting Group Interest-level in Meetings, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005 |
|
Developing and Enhancing Posterior Based Speech Recognition Systems, , , and , in: Proceedings of Interspeech, 2005 |
|
EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, and , in: Sixth International Workshop on Multiple Classifier System (MCS2005), 2005 |
|
Effect of Segmentation Method on Video Retrieval Performance, and , in: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo (ICME-05), 2005 |
|
Evaluation of Multiple Cues Head Pose Tracking Algorithm in Natural Environments, and , in: International Conference on Multimedia & Expo ICME 2005, 2005 |
|
Exploiting Hyperlinks to Learn a Retrieval Model, and , in: NIPS Workshop on Learning to Rank, 2005 |
|
Extracting Information from Multimedia Meeting Collections, , and , in: 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005 |
|
F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, and , in: Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-05), 2005 |
|
Generative Independent Component Analysis for EEG Classification, and , in: European Symposium on Artificial Neural Networks ESANN, 2005 |
|
Generative Temporal ICA for Classification in Asynchronous BCI Systems, and , in: The 2nd International IEEE EMBS Conference On Neural Engineering, 2005 |
|
Gradient estimates of return distributions, and , in: PASCAL Workshop on Principled Methods of Trading Exploration and Exploitation, 2005 |
|
Hierarchical Multi-Stream Posterior Based Speech Recognition System, , and , in: Proceedings MLMI workshop, 2005 |
|
Implicit Control of Noise Canceller for Speech Enhancement, , and , in: Proceedings of INTERSPEECH 2005, 2005 |
|
Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
Improving Speech Recognition Using a Data-Driven Approach, , and , in: Proceedings of Interspeech, 2005, 2005 |
|
Inferring Document Similarity from Hyperlinks, and , in: ACM Conference on Information and Knowledge Management, 2005 |
|
Learning influence among interacting Markov chains, , , and , in: NIPS, 2005 |
|
Machine Learning for Multimodal Interaction: First International Workshop, MLMI'2004, Springer-Verlag Heidelberg, 2005 |
Modeling Scenes with Local Descriptors and Latent Aspects, , , , , and , in: IEEE Int. Conf. on Computer Vision, 2005 |
|
Multi-resolution RASTA filtering for TANDEM-based ASR, and , in: Proceedings of Interspeech 2005, 2005 |
|
Multi-resolution Spectral Entropy Based Feature for Robust ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2005 |
|
Multichannel Speech Enhancement in Cars: Explicit vs. Implicit Adaptation Control, , and , in: Proceedings of HSCMA 2005, 2005 |
|
Multimodal Integration for Meeting Group Action Segmentation and Recognition, , , , , and , in: MLMI, 2005 |
|
Multimodal Multispeaker Probabilistic Tracking in Meetings, , , and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2005 |
|
On Accuracy/Robustness/Complexity Trade-Offs in Face Verification, , and , in: IEEE International Conference on Information Technology and Applications, ICITA, 2005 |
|
Semi-supervised Adapted HMMs for Unusual Event Detection, , , and , in: Pro. IEEE CVPR, 2005 |
|
Semi-supervised Meeting Event Recognition with Adapted HMMs, , and , in: Pro. IEEE ICME, 2005 |
|
Spectral Entropy Feature in Full-Combination Multi-Stream for Robust ASR, and , in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2005 |
|
Speech Acquisition in Meetings with an Audio-Visual Sensor Array, , , , and , in: Pro. IEEE ICME, 2005 |
|
The AMI Meeting Corpus: a Pre-Announcement, , , , , , , , , , , , , , , , and , in: Machine Learning for Multimodal Interaction: Second International Workshop, MLMI'2005, 2005 |
|
The Expected Performance Curve, , and , in: International Conference on Machine Learning, ICML, Workshop on ROC Analysis in Machine Learning, 2005 |
|
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), , and , in: Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005, 2005 |
|
Tracking People in Meetings with Particles, , , , and , in: Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS,',','), invited paper, 2005 |
|
Unsupervised Spectral Subtraction for Noise-Robust ASR, , , and , in: Proceedings of the 2005 IEEE ASRU Workshop, 2005 |
|
You Are Wrong!---Automatic Detection of Interaction Errors from Brain Waves, and , in: Proceedings of the 19th International Joint Conference on Artificial Intelligence, 2005 |
|
2004
A Gentle Hessian for Efficient Gradient Descent, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004 |
|
A probabilistic framework for joint head tracking and pose estimation, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, and , in: Proceedings of the 2004 SAPA Workshop, 2004 |
|
A Statistical Significance Test for Person Authentication, and , in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004 |
|
A Symmetric Transformation for LDA-based Face Verification, , in: Proceedings of the 6th International Conference on Automatic Face and Gesture Recognition, IEEE Computer Society Press, 2004 |
|
An Investigation of Spectral Subband Centroids for Speaker Authentication, , and , in: Int'l Conf. on Biometric Authentication, 2004 |
|
An Online Audio Indexing System, , and , 2004 |
|
Assessing Scene Structuring in Consumer Videos, , , , and , in: Int. Conf. on Image and Video Retrieval (CIVR), 2004 |
|
Boosting HMMs with an application to speech recognition, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004 |
|
Boosting Pixel-based Classifiers for Face Verification, and , in: Biometric Authentication Workshop of the 8th European Conference on Computer Vision, BIOAW2004, Springer-Verlag, 2004 |
|
Clustering And Segmenting Speakers And Their Locations In Meetings, , and , in: ICASSP, 2004 |
|
Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
Cue integration through discriminative accumulation, and , in: International Conference on Computer Vision and Pattern Recognition, 2004 |
|
Effect of Recognition Errors on Information Retrieval Performance, , in: Proceedings of International Workshop on Frontiers in Handwriting Recognition, 2004 |
|
Embedding motion in model-based stochastic tracking, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , in: Proceedings of the INTERSPEECH-ICSLP-04, 2004 |
|
Estimating the Quality of Face Localization for Face Verification, , , and , in: IEEE International Conference on Image Processing, ICIP, 2004 |
|
Face Verification Using Adapted Generative Models, , and , in: The 6th International Conference on Automatic Face and Gesture Recognition, FG2004, IEEE, 2004 |
|
Fusion of Structural and Color Local Descriptors for Enhanced Object Recognition, and , in: Proceedings IEEE WIAMIS 2004(5th International Workshop on Image Analysis for Multimedia Interactive Services,',','), 21-23 April, 2004, Lisboa, Portugal, 2004 |
|
HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, and , in: European Symposium on Artificial Neural Networks ESANN, 2004 |
|
Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, , and , in: Proceedings of ICASSP, 2004 |
|
Links Between Perceptrons, MLPs and SVMs, and , in: International Conference on Machine Learning, ICML, 2004 |
|
LP-TRAP: Linear predictive temporal patterns, , and , 2004 |
|
Modeling Individual and Group Actions in Meetings With Layered HMMs, , , , and , in: IEEE Transaction on Multimedia, June, 2006, 2004 |
|
Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, , , , and , in: the Second IEEE Workshop on Event Mining: Detection and Recognition of Events in Video, In Association with CVPR, 2004 |
|
Modelling Auxiliary Features in Tandem Systems, , , and , in: Proceedings of ICSLP, 2004 |
|
Multimodal Group Action Clustering in Meetings, , , , and , in: ACM 2nd International Workshop on Video Surveillance & Sensor Networks in conjunction with 12th ACM International Conference on Multimedia, 2004 |
|
New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, , , and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP), 2004 |
|
Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, and , in: The Speaker and Recognition Workshop, 2004 |
|
Noisy Text Categorization, , in: Proceedings of International Conference on Pattern Recognition (ICPR), 2004 |
|
On Performance Evaluation of Face Detection and Localization Algorithms, , , and , in: 17th International Conference on Pattern Recognition (ICPR), 2004 |
|
On the Need for On-Line Learning in Brain-Computer Interfaces, , in: Proceedings of the International Joint Conference on Neural Networks, 2004 |
|
On Use of Task Independent Training Data in Tandem Feature Extraction, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
Online Policy Adaptation for Ensemble Classifiers, and , in: 12th European Symposium on Artificial Neural Networks, ESANN 04, 2004 |
Order Matters: A Distributed Sampling Method for Multi-Object Tracking, , in: British Machine Vision Conference (BMVC), 2004 |
|
Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, , , and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
PLSA-based Image Auto-Annotation: Constraining the Latent Space, and , in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2004 |
|
Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, and , in: International Conference on Spoken Language Processing (ICSLP~2004), 2004 |
|
Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, , in: Proceedings of SST 2004 (10th Australian International Conference on Speech Science & Technology,',','), Sydney, Australia, 2004, 2004 |
|
Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, , and , in: the International Conference on Pattern Recognition (ICPR), 2004 |
|
Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, , and , in: Proc. of the sixth International Conference on Automatic Face and Gesture Recognition, 2004 |
|
Restoring Locomotion with a Thought Controlled Mobile Robot, , in: Proceedings of the 4th Forum of European Neuroscience, 2004 |
Robust Playfield Segmentation using MAP Adaptation, and , in: Proc. 17th International Conference on Pattern Recognition (ICPR 2004), 2004 |
|
Spectral Entropy Based Feature for Robust ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2004 |
|
Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, , , and , in: Proceedings of the INTERSPEECH-ICSLP-04, 2004 |
|
Statistical Transformations of Frontal Models for Non-Frontal Face Verification, and , in: Proceedings of the IEEE International Conference on Image Processing (ICIP), 2004 |
|
Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, and , in: Proceedings of ICSLP, 2004 |
|
Tangent Vector Kernels for Invariant Image Classification with SVMs, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
The Expected Performance Curve: a New Assessment Measure for Person Authentication, and , in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004 |
|
Theme Topic Mixture Model: A Graphical Model for Document Representation, and , in: Pascal Workshop on Text Mining and Understanding, 2004 |
|
Unsupervised Location-Based Segmentation of Multi-Party Speech, , and , in: Proceedings of the 2004 ICASSP-NIST Meeting Recognition Workshop, 2004 |
|
Using RASTA in task independent TANDEM feature extraction, , and , in: Proceedings of ICSLP, 2004, 2004 |
|
Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
2003
A Hierarchical Keyframe User Interface for Browsing Video over the Internet, , , and , in: Proceedings of the 9th International Conference on Human-Computer Interaction (INTERACT-2003), IOS Press, 2003 |
|
A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, , , and , in: IEEE Int. Conf. on Computer Vision Workshop on Multimedia Technologies for E-Learning and Collaboration (ICCV-WOMTEC), 2003 |
|
A Robust Speaker Clustering Algorithm, and , in: IEEE Automatic Speech Recognition Understanding Workshop, 2003 |
|
Adaptive Brain Interfaces for Communication and Control, , in: Proceedings of the 10th International Conference on Human-Computer Interaction, 2003 |
|
An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, , in: Advances in Neural Information Processing Systems, NIPS 15, MIT Press, 2003 |
|
An Implicit Motion Likelihood for Tracking with Particle Filters, , and , in: British Machine Vision Conference (BMVC), Springer Verlag, 2003 |
|
Audio-Visual Speaker Tracking with Importance Particle Filters, , , , and , in: IEEE International Conference on Image Processing (ICIP), 2003 |
|
Augmenting Frontal Face Models for Non-Frontal Verification, and , in: Proceedings of the 2003 Workshop on Multimodal User Authentication (MMUA'03), 2003 |
Client Dependent GMM-SVM Models for Speaker Verification, and , in: International Conference on Artificial Neural Networks, ICANN/ICONIP 2003, Springer Verlag, 2003 |
|
Comparison of different feature classifiers for brain computer interfaces, , , , , , , , and , in: Proceedings of the 1st International IEEE EMBS Conference on Neural Engineering, 2003 |
Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, , and , in: 4th International Conference on AUDIO- and VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 2003 |
|
Confusion Matrix Based Entropy Correction in Multi-stream Combination, and , in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2003 |
|
Direct Non-Invasive Brain Computer Interfaces, , , , and , in: Proceedings of the 9th International Conference on Functional Mapping of the Human Brain, 2003 |
Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
Improving Face Authetication Using Virtual Samples, , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003 |
|
Location Based Speaker Segmentation, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, , , and , in: IEEE ASRU, 2003 |
|
Microphone Array Speech Recognition : Experiments on Overlapping Speech in Meetings, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
Modeling Human Interaction in Meetings, , , , , , , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2003 |
|
Modélisation implicite du mouvement en suivi par filtrage de Monte Carlo séquentiel, and , in: GRETSI conference, Signal and Image Processing,, 2003 |
|
Multi-Modal Audio-Visual Event Recognition for Football Analysis, , and , in: Proc. IEEE Workshop on Neural Networks for Signal Processing (NNSP), 2003 |
|
Multimodal Authentication using Asynchronous HMMs, , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
New Entropy Based Combination Rules in HMM/ANN Multi-stream ASR, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003 |
|
Noise Resistant Audio-Visual Verification via Structural Constraints, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
Non-Invasive Brain-Actuated Control of a Mobile Robot, , , and , in: Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003 |
|
Non-Linear Variance Reduction Techniques in Biometric Authentication, and , in: Workshop on Multimodal User Authentication, 2003 |
|
Nonlinear Spectral Transformations for Robust Speech Recognition, , and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop 2003, 2003 |
|
Offline Recognition of Large Vocabulary Cursive Handwritten Text, , and , in: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), 2003 |
|
On automatic annotation of meeting databases, , , , and , in: IEEE International Conference on Image Processing (ICIP), 2003 |
|
On Factorizing Spectral Dynamics for Robust Speech Recognition, , , and , in: Eurospeech, 2003 |
|
On Image Auto-Annotation with Latent Space Models, and , in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2003 |
|
On the Combination of Speech and Speaker Recognition, and , in: European Conference On Speech, Communication and Technology (EUROSPEECH'03), 2003 |
|
Phase AutoCorrelation (PAC) derived Robust Speech Features, , and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
Phoneme-Grapheme Based Speech Recognition System, , , and , in: Proceedings of IEEE ASRU, 2003 |
|
Robust Features for Frontal Face Authentication in Difficult Image Conditions, and , in: Proceedings of 4th International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA-03), 2003 |
Scalability Analysis of Audio-Visual Person Identity Verification, , , and , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
Segmenting Multiple Concurrent Speakers Using Microphone Arrays, , and , in: Proceedings of Eurospeech 2003, 2003 |
|
Sequential Monte Carlo Video Text Segmentation, and , in: ICIP, 2003 |
|
Spectral Structuring of Home Videos, , and , in: International Conference on Image and Video Retrieval (CIVR'03), Springer Verlag, 2003 |
|
Speech & Face Based Biometric Authentication at IDIAP, , , , , , , and , in: Proceedings of the 2003 IEEE International Conference on Multimedia & Expo (ICME-03), 2003 |
Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, , and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
Studying Phase Synchrony for Classification of Mental Tasks in Brain Machine Interfaces, , , and , in: Proceedings of the Conference of the International Society for Brain Electromagnetic Topography, 2003 |
The BANCA Database and Evaluation Protocol, , , , , , , , , , , and , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
TRAP-TANDEM: Data-driven extraction of temporal features from speech, , in: large part published in Proceedings of ASRU-2003, 2003 |
|
Using pitch frequency information in speech recognition, , and , in: Proceedings of Eurospeech, 2003 |
|
Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, and , in: Pattern Recognition and Image Analysis: First Iberian Conference, IbPRIA 2003, Springer-Verlag LNCS, 2003 |
|
Video Shot Clustering using Spectral Methods, , and , in: 3rd Workshop on Content-Based Multimedia Indexing (CBMI), 2003 |
|
2002
A Comparative Study of Adaptation Methods for Speaker Verification, and , in: International Conference on Spoken Language Processing ICSLP, 2002 |
|
A Multi-sample Multi-source Model for Biometric Authentication, , and , in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002 |
|
A Parallel Mixture of SVMs for Very Large Scale Problems, , and , in: Advances in Neural Information Processing Systems, NIPS 14, MIT Press, 2002 |
|
A State-of-the-art Neural Network for Robust Face Verification, , and , in: Proceedings of the COST275 Workshop on The Advent of Biometrics on the Internet, 2002 |
|
Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, , and , in: Seventh International Conference on Spoken Language Processing (ICSLP~2002), 2002 |
|
Conditional Gaussian Mixture Models for Environmental Risk Mapping, , and , in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002 |
|
Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, , , and , in: 2002 IEEE International Workshop on Neural Networks for for Signal Processing (NNSP~2002), 2002 |
|
Evaluation of Formant-Like Features for ASR, , , , , and , in: International Conference on Spoken Language Processing (ICSLP 2002), 2002 |
|
Evolution of the Mental States Operating a Brain-Computer Interface, , , and , in: Proceedings of the International Federation for Medical and Biological Engineering, 2002 |
|
Face Verification using MLP and SVM, and , in: XI Journees NeuroSciences et Sciences pour l'Ingenieur (NSI 2002), 2002 |
|
Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, , in: International IEEE Workshop on Neural Networks for Signal Processing (NNSP 02), 2002 |
|
Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, , in: International IEEE Conference on Multimodal Interfaces (ICMI 02), 2002 |
|
Improving Face Verification using Skin Color Information, and , in: Proceedings of the 16th International Conference on Pattern Recognition, IEEE Computer Society Press, 2002 |
|
Increasing Speech Recognition Noise Robustness with HMM2, , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 02), 2002 |
|
Linking Objects in Videos by Importance Sampling, and , in: IEEE International Conference on Multimedia and Expo, 2002 |
|
Low cost duration modelling for noise robust speech recognition, , and , in: Proc. ICSLP, 2002 |
|
Microphone Array Post-filter for Diffuse Noise Field, and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2002 |
|
Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, , and , in: International Conference on Pattern Recognition (ICPR~2002), 2002 |
|
Mutliscale Facial Expression Recognition using Convolutional Neural Networks, , in: Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 02), 2002 |
|
Object Localization in Metric Spaces for Video Linking, and , in: IEEE Workshop on Motion and Video Computing, 2002 |
|
Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, and , in: Proceedings of International Conference on Pattern Recognition, 2002 |
|
Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, , and , in: IEEE International Conference on Image Processing, 2002 |
|
Robust Face Analysis using Convolutional Neural Networks, , in: Proceedings of the International Conference on Pattern Recognition (ICPR 02), 2002 |
|
Robust HMM-Based Speech/Music Segmentation, , and , in: ICASSP, 2002 |
|
Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, , and , in: Proceedings of International Conference on Speech and Language Processing (ICSLP), 2002 |
|
Scaling Large Learning Problems with Hard Parallel Mixtures, , and , in: International Workshop on Pattern Recognition with Support Vector Machines, SVM'2002, 2002 |
|
Speaker Normalization using HMM2, , and , in: Proceedings of the 2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP-02), 2002 |
|
Text Segmentation and Recognition in Complex Background Based on Markov Random Field, , and , in: Int. Conf. Pattern Recognition 2002, 2002 |
|
Unknown-Multiple Speaker clustering using HMM, , , and , in: ICSLP, 2002 |
|
User-Customized Password HMM Based Speaker Verification, and , in: Proceedings of the COST275 Workshop on the Advent of Biometrics on the Internet, 2002 |
|
User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, and , in: International Conference on Spoken Language Processing (ICSLP~2002), 2002 |
|
Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, and , in: Int. Conf. Image Processing 2002, 2002 |
|
Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, and , in: Proceedings of 8$^{th}$ International Conference on Frontiers on Handwriting Recognition, 2002 |
|
2001
Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, , and , in: ICASSP, 2001 |
|
Confidence Evaluation for Risk Prediction, , and , in: 2001 Annual Conference of the IAMG, 2001 |
|
Data utility modelling for mismatch reduction, , in: Proc. CRAC (workshop on Consistent & Reliable Acoustic Cues for sound analysis), 2001 |
|
EEG pattern recognition through multi-stream evidence combination, , and , in: Proc. World Congress on Neuroinformatics, 2001 |
|
Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, and , in: EUROSPEECH, 2001 |
|
From missing data to maybe useful data: soft data modelling for noise robust ASR, , and , in: Proc. WISP, 2001 |
|
HMM2- Extraction of Formant Features and their Use for Robust ASR, , and , in: European Conference on Speech Communication and Technology (Eurospeech 2001), 2001 |
|
Learning the Decision Function for Speaker Verification, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2001 |
|
MAP Combination of Multi-Stream HMM or HMM/ANN Experts, , and , in: Proc. Eurospeech, 2001 |
|
Modeling Auxiliary Information in Bayesian Network Based ASR, , and , in: 7th European Conference on Speech Communication and Technology (Eurospeech~2001), 2001 |
|
New Approaches Towards Robust and Adaptive Speech Recognition, , and , in: Advances in Neural Information Processing Systems 13, MIT Press, 2001 |
|
Signal modeling with Non Uniform Topology lattice filters, and , in: Proc. ICASSP 2001, 2001 |
|
Speech Recognition Using Advanced HMM2 Features, , and , in: Automatic Speech Recognition and Understanding Workshop, 2001 |
|
Text Enhancement with Asymmetric Filter for Video OCR, , and , in: Proceedings of the 11th International Conference on Image Analysis and Processing, 2001 |
|
Text Identification in Complex Background using SVM, , and , in: Proceedings of the Int. Conf. on computer vision and pattern recognition, 2001 |
Video OCR for Sport Video Annotation and Retrieval, , and , in: Proceedings of the 8th IEEE International Conference on Mechatronics and Machine Vision in Practice, 2001 |
|
2000
A front-end using the harmonicity cue for speech enhancement in loud noise, , and , in: Int. Conf. on Spoken Language Processing (ICSLP), 2000 |
A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, , and , in: ICSLP, 2000 |
|
A neural network for classification with incomplete data: application to robust ASR, , , , and , in: Proc. ICSLP, 2000 |
|
Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, and , in: Journee d'Etudes sur la Parole, Aussois, 2000 |
|
Audio visual speech recognition, , , , , , , and , Johns Hopkins University-CLSP, 2000 |
Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, , , and , in: 6th International Conference on Spoken Language Processing: ICSLP~2000 (Interspeech~2000), 2000 |
|
Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, , , , , and , in: ICASSP2000 - IEEE International Conference on Acoustics, Speech, and Signal Processing, 2000 |
|
Blind acoustic source separation for cocktail party speech recognition, , , and , in: ICONIP, 7th IEEE Int. Conf. on Neural Information Processing, 2000 |
Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, and , in: ICSLP, 2000 |
|
Comparison of Unsupervised and Supervised Training of RBF Neural Networks. Case Study: Mapping of Contamination Data, and , in: Neural Computation 2000, 2000 |
Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, , , and , in: Geostatistical congress 2000, 2000 |
Etudes comparatives des robustesses au bruit de l'approche 'Full Combination' et de son approximation, and , in: Journee d'Etudes sur la Parole, Aussois, 2000 |
|
Fast latent semantic indexing of spoken documents by using self-organizing maps, , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'2000, 2000 |
|
From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, , and , in: ISCA ITRW ASR2000, 2000 |
|
HMM2- A Novel Approach to HMM Emission Probability Estimation, , and , in: International Conference on Spoken Langugae Processing (ICSLP 2000), 2000 |
|
Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, , and , in: Proceedings of the Sixth ACM International Conference on Knowledge Discovery and Data Mining, ACM, Boston, MA, USA, 2000 |
|
Indexing spoken audio by LSA and SOMs, , in: Proceedings of the European Signal Processing Conference EUSIPCO'2000, 2000 |
Indoor Radon Risk Assessment with Geostatistics and Artificial Neural Networks, , , , , , and , in: Geostatistical congress 2000, 2000 |
Inverse lattice filtering of speech with adapted non-uniform delays, and , in: Proc. ICSLP 2000, 2000 |
|
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , in: Proceedings of the IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 2000 |
LPC modeling with speech production constraints, , in: Proc. 5th Speech Production Seminar, 2000 |
|
Multichannel signal separation for cocktail party speech recognition: a dynamic recurrent network, , , and , in: Int. Conf. on Spoken Language Processing (ICSLP), no IDIAP RR, see RESPITE www, 2000 |
Multiple Hypotheses Video OCR, and , in: Proceedings of the 4th International Workshop on Document Analysis System, 2000 |
|
Multiple Timescale Feature Combination towards Robust Speech Recognition, , in: KONVENS 2000 / Sprachkommunikation, 2000 |
|
Neural Network Residual Stochastic Co-simulation for Environmental Data Analysis, , , , and , in: Neural Computation 2000, 2000 |
Off-Line Cursive Script Recognition Based on Continuous Density HMM, and , in: Proceedings of 7th International Workshop on Frontiers in Handwriting Recognition, 2000 |
Recognition of Asymmetric Facial Action Unit Activities and Intensities, and , in: Proceedings of the International Conference on Pattern Recognition (ICPR 2000), 2000 |
|
Reconnaissance de la parole dans le bruit après renforcement fondé sur l'harmonicité, and , in: Proceedings of JEP'2000, no IDIAP RR, see RESPITE www, 2000 |
Relating LPC modeling to a factor-based articulatory model, , in: Proc. ICSLP 2000, 2000 |
|
Some applications of a priori knowledge in multi-stream HMM and HMM/ANN based ASR, , in: Phonus No.5,Dec.2000, ISSN 0949-1791, Proc. Workshop on Phonetics and Phonology in ASR, 2000 |
|
Test of several external posterior weighting functions for multiband Full Combination ASR, and , in: Int. Conf. on Spoken Language Processing (ICSLP), 2000 |
Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, and , in: ICSLP, 2000 |
|
1999
A CASA front-end using the localisation cue for segregation and then cocktail-party speech recognition, , , and , in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999 |
A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition, , and , in: Proc.\ European Conf.\ on Speech Communication and Technology (EUROSPEECH), 1999 |
A comparison of mixture models for density estimation, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999 |
|
A comparison of two strategies for ASR in additive noise : Missing Data and Spectral Subtraction, and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
A measure of speech and pitch reliability from voicing, and , in: Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI), Scandinavian AI Society, 1999 |
A new SNR-feature mapping for robust multistream speech recognition, and , in: Proc. Int. Congress on Phonetic Sciences (ICPhS), 1999 |
An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, , , , , , , , , , , and , in: 6th european conference on speech communication and technology --- eurospeech'99, 1999 |
Audio-Visual Person Verification, , , , and , in: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 1999, Fort Collins, USA, 1999 |
|
Blind separation of delayed and superimposed acoustic sources : learning algorithms an experimental study, , , , and , in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999 |
Classification using localized mixtures of experts, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999 |
|
CLIENT / WORLD MODEL SYNCHRONOUS ALIGNEMENT FOR SPEAKER VERIFICATION, , , and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
Combinatorial Approach for Data Binarization, and , in: Principles of Data Mining and Knowledge Discovery: third european conference; proceedings / PKDD'99, Springer, 1999 |
|
Data binarization by discriminant elimination, , and , in: Proceedings of the ICML-99 Workshop: From Machine Learning to Knowledge Discovery in Databases, 1999 |
|
Decision-Oriented Environmental Mapping with Radial Basis Function Neural Networks, , , , and , in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999 |
Deliberate Imposture: a challenge for automatic speaker verification systems, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR, , and , in: Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
|
Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, , , and , in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999 |
Evaluating the Complexity of Databases for Person Identification and Verification, , and , in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999 |
|
Experimental evaluation of text-dependent speaker verification on laboratory and field test databases in the M2VTS project, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
Extraction of Articulators in X-Ray Image Sequences, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
Fast Face Detection using MLP and FFT, , and , in: Proc. Second International Conference on Audio and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
|
Illumination-robust Pattern Matching Using Distorted Color Histograms, and , in: Pattern Recognition and Image Understanding, Infix, 1999 |
Incremental Enrollment of Speech Recognizers, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'99,',','), Phoenix, Arizona, USA, 1999 |
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU'99) Workshop, 1999 |
Latent Semantic Indexing by Self-Organizing Map, and , in: ESCA ETRW workshop on Accessing Information in Spoken Audio, 1999 |
|
LPC-based inversion of the DRM articulatory model, , in: Proc. Eurospeech'99, 1999 |
|
Multi Modal Verification for Teleservices and Security Applications, , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE International Conference on Multimedia Computing and Systems, 1999 |
Multi-Modal Data Fusion for Person Authentication using SVM, , in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
|
Non-Stationary Multi-Channel (Multi-Stream) Processing Towards Robust and Adaptive ASR, , in: Proc. of the ESCA Workshop on Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
Robust Person Verification based on Speech and Facial Images, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
The Elisa'99 Speaker Recognition and Tracking Systems, , , , , , , , , , , , , , , and , in: IEEE Workshop on Automatic Advanced Technologies, 1999 |
The full combination sub-bands approach to noise robust HMM/ANN based ASR, , and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
Towards introducing long-term statistics in MUSE for robust speech recognition, and , in: Automatic Speech Recognition and Understanding (ASRU) workshop, 1999 |
|
Tracking Articulators in X-ray Movies of the Vocal Tract, , in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999 |
|
XM2VTSDB: The Extended M2VTS Database, , , , and , in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
1998
A comparison of a priori threshold setting procedures for speaker verification in the CAVE project, , , , , , and , in: ICASSP 98, 1998 |
An overview of the cave project research activities in speaker verification, , , , , and , in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998 |
Confidence Measures in Hybrid HMM/ANN Speech Recognition, and , in: Proceedings of Workshop on Text, Speech and Dialog (TSD'98) Brno, Czech Republic, 1998 |
Connectionist speech recognition, , in: Proceedings of IK'98, Interdisziplinares Kolleg, Spring Scholl, Gunne am Mohnessee, Germany, March 7--14, 1998 |
Continuous Audio-Visual Speech Recognition, and , in: Proc. 5th European Conference on Computer Vision, Springer Verlag, 1998 |
|
Decision fusion using a multi-linear classifier, , and , in: 1st International Conference on Multisource-Multisensor Data Fusion, 1998 |
Improved Pairwise Coupling Classification With Correcting Classifiers, and , in: Machine Learning: ECML-98, Springer, 1998 |
|
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP'98) Sydney, Australia, 1998 |
|
Interfacing of CASA and Multistream recognition, , , and , in: TSD'98-Text, Speech and Dialog International Workshop, BRNO-Czech Republic, 1998 |
|
Interfacing of CASA and partial recognition based on a multistream technique, , , and , in: ICSLP'98, Sidney, 1998 |
|
POLYCOST: a telephone-speech database for speaker recognition, , , and , in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998 |
Reconnaissance multi-bandes de la parole bruitée par couplage entre les niveaux primitifs et d'identification, , , and , in: Journees Etude Parole - Martigny, 1998 |
|
Reconnaissance robuste de la parole par segmentation signal/bruit en sous-bandes, , , and , in: Neurosciences et Sciences de l'Ingenieur'98 - Munster, CNRS, 1998 |
|
Speech pre-processing against intentional imposture in speaker recognition, and , in: Proceedings of ICSLP, Sidney, 1998 |
Text dependent speaker verification using binary classifiers, , and , in: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing --- ICASSP'98, IEEE, IEEE, 1998 |
|
Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database, and , in: Proc. 5th Int. Conf. on Spoken Language Processing, 1998 |
|
Voice transformation, a tool for imposture of speaker verification, and , in: Proceedings of International Phonetic Science conference IPS98, Washington, 1998 |
Voice-B System, , , , and , in: IEEE 4th Workshop on Intercative Voice Technology for Telecommunications Applications (IVTTA'98) September 29--30, Torino, Italy, 1998 |
1997
A Connectionist System for Two-Dimensional Representation of Multivariate Location Data, and , in: Proceedings of the Fifth International Workshop on Artificial Intelligence for High Energy Physics, AIHENP, Lausanne, Switzerland, Elsevier Science, 1997 |
Acoustic-Labial Speaker Verification, , , and , in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997 |
Adapting the 2-Class Recursive Deterministic Perceptron Neural Network to m Classes, , , and , in: Proceedings of the International Conference on Neural Networks, IEEE, IEEE, 1997 |
An Optical Thresholding Perceptron, , , , and , in: Proceedings of the Workshop on Optics and Computer Science, Geneva, Switzerland, 1997 |
|
Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems, , , and , in: EUROSPEECH'97, 1997 |
|
Handwritten Digit Recognition with Binary Optical Perceptron, , , and , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
|
Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, and , in: International School on Neural Nets: Adaptive Processing of Temporal Information, Springer Verlag, 1997 |
|
Hybrid HMM/ANN Systems for Training Independent Tasks: Experiments on 'Phonebook' and Related Improvements, , , , and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
Integrating Acoustic and Labial Information for Speaker Identification and Verification, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1997 |
Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, and , in: Eurospeech 97, 1997 |
|
Mixtures of Experts Estimate A Posteriori Probabilities, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
|
On the Complexity of Recognizing Iterated Differences of Polyhedra, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
|
On the Decomposition of Polychotomies into Dichotomies, and , in: Proceedings of The Fourteenth International Conference on Machine Learning, Morgan Kaufmann, 1997 |
|
Person Authentication by Fusing Face and Speech Information, , , and , in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997 |
Robust Speech Recognition based on Multi-Stream Features, , and , in: Proc. of the ESCA-NATO Workshop on Robust Speech Recognition for Unknown Communication Channels, 1997 |
|
Speaker Verification in the Telephone Network : Research Activities in the CAVE Project, , , , , and , in: Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'97), 1997 |
|
Speaker-Dependent Speech Recognition Based on Phone-Like Unit Model -- Application to Voice Dialing, and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
State-of-the-Art and Recent Progress in Hybrid HMM/ANN Speech Recognition, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
Subband-Based Speech Recognition, and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
Towards Speaker Independent Continuous Speechreading, , in: Proceedings of the European Conference on Speech Communication and Technology, 1997 |
|
Using Multiple Time Scales in a Multi-Stream Speech Recognition System, and , in: EUROSPEECH'97, 1997 |
|
1996
A Boolean Approach to Construct Neural Networks for Non-Boolean Problems, and , in: Proceedings of the 8th IEEE International Conference on Tools with Artificial Intelligence, IEEE, 1996 |
A Method for All-Positive Optical Multilayer Perceptrons, , and , in: Proceedings of the Third IEEE International Conference on Electronics, Circuits, and Systems, University of Patras, Rhodos, Greece, IEEE, 1996 |
|
Amelioration des performances de verification du locuteur par combinaison de methodes, , , and , in: Journees d'etudes sur la parole, JEP, 1996 |
Bounds on the Degree of High Order Binary Perceptrons, , in: Proceedings of ESANN'96, D facto, 1996 |
|
Combining methods to improve speaker verification decision, , , and , in: Proceedings of The Fourth International Conference on Spoken Language Processing, ICSLP, ICSLP, 1996 |
|
Connectionist Quantization Functions, , and , in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996 |
|
ETC\_vérif : un environnement multi-agents de reconnaissance automatique de la parole en continu, and , in: Proceedings of JEP'96: XXIemes Journees d'etude sur la Parole, 1996 |
|
Extended Cauchy Machines, and , in: Proceedings of the International Conference on Neural Information Processing, 1996 |
Hardware-Friendly Learning Algorithms for Neural Networks: An Overview, and , in: Proceedings of the Fifth International Conference on Microelectronics for Neural Networks and Fuzzy Systems: MicroNeuro'96, EPFL and CSEM, Lausanne, Switzerland, IEEE Computer Society Press, 1996 |
|
Image Classification by Neural Networks for the Quality Control of Watches, , and , in: Proceedings ISAI /IFIS 1996, ITESM, Cancun, Mexico, ITESM, 1996 |
Learning to recognise talking faces, , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR'96), IAPR, 1996 |
|
Locating and tracking facial speech features, , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR'96), IAPR, 1996 |
|
Multi-modal person verification tools using speech and images, , in: European Conference on Multimedia Applications, Services and Techniques, 1996 |
Neural Network Pruning and Pruning Parameters, and , in: The 1st Workshop on Soft Computing, Dept. of Information Electronics Nagoya University, 1996 |
|
New time-frequency derived cepstral coefficients for automatic speech recognition, and , in: Proceedings of the 8th European Signal Processing Conference (Eusipco'96), 1996 |
|
Overcoming Inaccuracies in Optical Multilayer Perceptrons, , and , in: Proceedings of the First International Symposium on Neuro-Fuzzy Systems (AT'96), Lausanne, Switzerland, AATI, 1996 |
Polycost Database, , and , 1996 |
Secured vocal access to telephone servers, , , , and , in: Proceedings of IVTTA 1996 IEEE Third Workshop Interactive Voice Technology for Telecommunications Applications, 1996 |
Semi-automatic HMM-based annotation of the PolyCOST Database, , , and , in: Application of speaker recognition techniques in telephony, COST250, 1996 |
Sparse Initial Topologies for High Order Perceptrons, , and , in: Proceedings of the International Conference on Neural Networks, IEEE, 1996 |
Speachreading using shape and intensity information, , and , in: Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), 1996 |
|
Speaker identification by lipreading, , and , in: Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), 1996 |
|
Statistical lip modelling for visual speech recognition, , and , in: Proceedings of the 8th European Signal Processing Conference (Eusipco'96), 1996 |
|
Sun Workstation and SwissNet Platform for Speech Recognition and Speaker Verification over the Telephone, , , , and , in: Proceedings of Workstations und ihre Anwendungen, SIWORK'96, 1996 |
|
Superceptron Construction, , and , in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996 |
Swiss PolyPhone and PolyVar: Building Databases for Speech Recognition and Speaker Verification, and , in: Proceedings of The 3rd Slovenian-German and 2nd SDRV Workshop, Speech and Image Understanding, 1996 |
Traitement préliminaire de l'image d'un texte manuscrit en vue de sa reconnaissance: une méthode de sur-segmentation, , and , in: 4eme Colloque National sur l'A?crit et le Document (CNED'96), 1996 |
Un système prédictif de la structuration syntaxico-rythmique d'un énoncé à l'aide d'informations prosodiques, , and , in: Proceedings of JEP'96: XXIemes Journees d'etude sur la Parole, 1996 |
|
Validating Different Flexible Vocabulary Approaches on the Swiss French PolyPhone and PolyVar databases, , , and , in: Proceedings of ICSLP 96, 1996 |
Visual Speech Recognition using Active Shape Models and Hidden Markov Models, , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96), 1996 |
|
1995
A graphical tool for monitoring Oz objects activity, and , in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995 |
A study of Intra- and Inter-Speaker Variability in the Voices of Twins for Speaker Verification, and , in: International Congress of Phonetic Sciences, 1995 |
Boolean Logic Inspired High Order Perceptron Construction, , and , in: SIPAR Workshop'95 Parallel and Distributed Systems, SIPAR SI Group for Parallel Systems, Biel School of Engineering, Computer Science Department, 1995 |
|
Discrimination of the voices of twins and siblings for speaker verification, and , in: 4th European Conference on Speech Communication and Technology, 1995 |
Environnement multi-agents de reconnaissance automatique de la parole en continu, and , in: Actes des 3emes Journees Francophones sur l'Intelligence Artificielle Distribuee et les Systemes Multi-agents, 1995 |
ETC\_vérif, a Prototype of a Cooperative Automatic Speech Recognition System, and , in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995 |
Evaluating pruning methods, and , in: 1995 International Symposium on Artificial Neural Networks (ISANN'95), 1995 |
|
Gain Elimination form Backpropagation Neural Networks, , and , in: Proceedings of the International Conference on Neural Networks, IEEE, Perth, IEEE, 1995 |
Handwriting Recognition, , in: Second Asian Conference on Computer Vision (ACCV'95,',','), Singapore, 1995 |
Lexical filtrering by means of prosodic information, , and , in: International Congress of Phonetic Sciences, 1995 |
Microprosodic study of isolated French word corpora, , in: 4th European Conference on Speech Communication and Technology, 1995 |
Neural nets approaches to Speaker Verification: comparison with Second Order Statistical Measure, and , in: ICASSP, 1995 |
Non-Ontogenic Sparse Neural Networks, , and , in: Proceedings of the International Conference on Neural Networks, IEEE, IEEE, 1995 |
Ontogenic High Order Cauchy Machines, and , in: Proceedings of the SIPAR Workshop '95: Parallel and Distributed Systems, Biel School of Engineering, 1995 |
Optical Multilayer Perceptrons based on Liquid Crystal Devices, , , and , in: Optics and Information, Cercle SFO/SEE d'Opto-informatique, Mulhouse, France, European Optical Society (EOS), 1995 |
Reliability in a Multi-agent Spoken Language Recognition System, and , in: 4th European Conference on Speech Communication and Technology, 1995 |
Swiss-French Polyphone: a Telephone Speech Database to develop Interactive Voice Servers, , , and , in: Linguistic Databases, 1995 |
The Effects of Optical Thresholding in Backpropagation Neural Networks, , and , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'95 and NeuroNimes'95), ENNS, Paris, France, EC2 & Cie, 1995 |
The use of prosodic agents in a cooperative automatic speech recognition system, and , in: International Congress of Phonetic Sciences, 1995 |
1994
A system for the off-line recognition of handwritten text, , in: International Conference on Pattern Recognition (ICPR,',','), Jerusalem, 1994 |
Design and Implementation of a System for the Recognition of Handwritten Responses on US Census Forms, , in: IAPR Workshop on Document Analysis Systems, 1994 |
Modular Object-Oriented Neural Network Simulators and Topology Generalizations, , and , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN 94), Sorrento, Italy, Springer-Verlag, 1994 |
|
Results on the Steepness in Backpropagation Neural Networks, , and , in: Proceedings of the '94 SIPAR-Workshop on Parallel and Distributed Computing, SI Group for Parallel Systems, 1994 |
Weight Initialization for High Order and Multilayer Perceptrons, and , in: Proceedings of the '94 SIPAR--Workshop on Parallel and Distributed Computing, SI Group for Parallel Systems, 1994 |
1993
Do Backpropagation trained neural networks have normal weight distributions?, and , in: International Conference on Artificial neural Networks, 1993 |
|
Higher-Order Statistics in Visual Object Recognition, , in: Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 1993 |
|
Recognition of Handprinted Digits using Optimal Bounded Error Matching, , in: International Conference on Document Analysis and Retrieval (ICDAR,',','), Tsukuba Science City, Japan, 1993 |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 |