All conference papers
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 |
2024
A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference, , and , in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024 |
A Novel and Responsible Dataset for Face Presentation Attack Detection on Mobile Devices, , , , , and , in: The IEEE International Joint Conference on Biometrics, Buffalo, New York, pages 8, 2024 |
|
A System for Human-Robot Teaming through End-User Programming and Shared Autonomy, , , , , and , in: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pages 231-239, 2024 |
[DOI] [URL] |
A Unified Model for Gaze Following and Social Gaze Prediction, , , and , in: The 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024 |
|
Adversarial Robustness Analysis in Automatic Pathological Speech Detection Approaches, and , in: Interspeech, 2024 |
|
Assessing the Reliability of Biometric Authentication on Virtual Reality Devices, , and , in: Proceedings of IEEE International Joint Conference on Biometrics, 2024 |
|
Breaking Template Protection: Reconstruction of Face Images from Protected Facial Templates, and , in: 18th International Conference on Automatic Face and Gesture Recognition (FG), 2024 |
|
CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition, , , and , in: 18th IEEE Int. Conference on Automatic Face and Gesture Recognition (FG), Istanbul,, 2024 |
|
ChatGPT and biometrics: an assessment of face recognition, gender detection, and age estimation capabilities, , , , and , in: 2024 IEEE International Conference on Image Processing (ICIP), 2024 |
|
ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild, , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
COMPARING DATA-DRIVEN AND HANDCRAFTED FEATURES FOR DIMENSIONAL EMOTION RECOGNITION, , and , in: International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Comparing Stability and Discriminatory Power of Hand-crafted Versus Deep Radiomics: A 3D-Printed Anthropomorphic Phantom Study, , , , , , , , , , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
Configuration Space Distance Fields for Manipulation Planning, , , and , in: Robotics: Science and Systems (RSS), 2024, 2024 |
|
CONTENT-BASED OBJECTIVE EVALUATION OF ARTIFICIALLY GENERATED SIGN LANGUAGE VIDEOS, , , , , and , in: ICASSP, 2024 |
|
CONTEXTUAL BIASING METHODS FOR IMPROVING RARE WORD DETECTION IN AUTOMATIC SPEECH RECOGNITION, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024 |
|
Cross-transfer Knowledge between Speech and Text Encoders to Evaluate Customer Satisfaction, , , , and , in: Interspeech, Kos Island, Greece, ISCA, 2024 |
|
D-LGP: Dynamic Logic-Geometric Program for Reactive Task and Motion Planning, , and , in: IEEE International Conference on Robotics and Automation, 2024 |
|
DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews, , , , , and , in: Proceedings of the 6th Clinical Natural Language Processing Workshop, Association for Computational Linguistics, 2024 |
|
Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition, , and , in: 49th IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2024 |
[DOI] [URL] |
Demographic Fairness Transformer for Bias Mitigation in Face Recognition, and , in: Proceedings of IEEE International Joint Conference on Biometrics (IJCB2024), 2024 |
|
Detecting Criminal Networks via Non-Content Communication Data Analysis Techniques from the TRACY Project, , , , , , , , , and , in: 15th EAI International Conference on Digital Forensics & Cyber Crime, 2024 |
|
DiffuCOMET: Contextual Commonsense Knowledge Diffusion, , , , , and , in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Bangkok, Thailand, pages 4809–4831, Association for Computational Linguistics, 2024 |
[DOI] [URL] |
Does Data-Efficient Generalization Exacerbate Bias in Foundation Models?, , , , , and , in: Proceedings of the 18th European Conference on Computer Vision, 2024 |
|
Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement, , , and , in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Entity Matching Across Small Networks Using Node Attributes, , , , , , , , , and , in: ECAI 2024 - 27th European Conference on Artificial Intelligence, October 19-24, 2024, Santiago de Compostela, Spain - Including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings, 2024 |
|
Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models, , and , in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024 |
Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection, and , in: International Joint Conference on Biometrics, 2024 |
|
Explaining models relating objects and privacy, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024 |
[URL] |
Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following, , , and , in: Int. Conf. Computer Vision and Pattern Recognition (CVPR), Workshop on Gaze Estimation and Prediction in the Wild, 2024 |
|
Extending the Cooperative Dual-Task Space in Conformal Geometric Algebra, and , in: Proc. IEEE Intl Conf. on Robotics and Automation, 2024 |
|
Face Liveness Detection Competition (LivDet-Face) - 2024, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
Face Recognition Using Lensless Camera, , and , in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024 |
[DOI] [URL] |
Face Reconstruction from Partially Leaked Facial Embeddings, and , in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024 |
[DOI] [URL] |
Feature Representations for Automatic Meerkat Vocalization Classification, , , and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2024 |
|
FRCSyn Challenge at WACV 2024: Face Recognition Challenge in the Era of Synthetic Data, , , , , and , in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, pages 892-901, 2024 |
[URL] |
Generalized Policy Iteration using Tensor Approximation for Hybrid Control, , and , in: International Conference on Learning Representations (ICLR), 2024 |
|
GLoFool: global enhancements and local perturbations to craft adversarial images, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Heterogeneous Face Recognition Using Domain Invariant Units, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Human Interest or Conflict? Leveraging LLMs for Automated Framing Analysis in TV Shows, , and , in: ACM International Conference on Interactive Media Experiences, 2024 |
|
Image-guided topic modeling for interpretable privacy classification, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders, , , , and , in: Findings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Latent Enhancing AutoEncoder for Occluded Image Classification, , in: Proceedings of International Conference on Image Processing, 2024 |
|
Learning About Social Context from Smartphone Data: Generalization Across Countries and Daily Life Moments, , and , in: Proc. ACM Conference on Human Factors in Computing Systems, 2024 |
|
Logic-Skill Programming: An Optimization-based Approach to Sequential Skill Planning, , , and , in: Proc. Robotics: Science and Systems (RSS), 2024 |
|
Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions, , , and , in: Experimental IR Meets Multilinguality, Multimodality, and Interaction - 14th International Conference of the CLEF Association, CLEF, 2024, Grenoble, France, September 9-12, 2024, Proceedings, 2024 |
|
Mitigating Demographic Bias in Face Recognition via Regularized Score Calibration, and , in: IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, IEEE/CVF, 2024 |
|
Modality Agnostic Heterogeneous Face Recognition with Switch Style Modulators, and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions, , and , in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, 2024 |
Neurocomputational model of speech recognition for pathological speech detection: a case study on Parkinson’s disease speech detection, and , in: Proceedings of Interspeech, Kos Island, Greece, pages 3590-3594, 2024 |
[DOI] [URL] |
Nonparametric Variational Regularisation of Pretrained Transformers, and , in: First conference on Language Modelling, 2024 |
[URL] |
Normalizing Flows for Speaker and Language Recognition Backend, , , , and , in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024 |
On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis, and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
Open-Vocabulary Object 6D Pose Estimation, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 |
[URL] |
Predicting Heart Activity from Speech using Data-driven and Knowledge-based features, , and , in: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2024 |
|
Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, 2024 |
ProGAP: Progressive Graph Neural Networks with Differential Privacy Guarantees, and , in: The 17th ACM International Conference on Web Search and Data Mining, 2024 |
|
Recursive Forward Dynamics for Serial Kinematic Chains using Conformal Geometric Algebra, and , in: In Proc. Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2024 |
Refining Tuberculosis Detection in CXR Imaging: Addressing Bias in Deep Neural Networks via Interpretability, , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
Reliability Estimation of News Media Sources: Birds of a Feather Flock Together, , , and , in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics, 2024 |
|
Representing Robot Geometry as Distance Fields: Applications to Whole-body Manipulation, , , and , in: IEEE International Conference on Robotics and Automation, 2024 |
|
ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024 |
|
σ-GPTs: A New Approach to Autoregressive Models., , and , in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024 |
|
Score Normalization for Demographic Fairness in Face Recognition, , , , and , in: IEEE International Joint Conference on Biometrics (IJCB 2024), 2024 |
|
SDFR: Synthetic Data for Face Recognition Competition, , , , and , in: 2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG), IEEE, 2024 |
[DOI] [URL] |
Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pages 3173-3183, 2024 |
[URL] |
Segmenting Object Affordances: Reproducibility and Sensitivity to Scale, , , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Sharingan: A Transformer Architecture for Multi-Person Gaze Following, , and , in: Int. Conference Computer Vision and Pattern Recognition (CVPR), Seatle, 2024 |
|
Sparse multi-view hand-object reconstruction for unseen environments, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024 |
[URL] |
Speech and Language Recognition with Low-rank Adaptation of Pretrained Models, , , , and , in: Interspeech 2024, 2024 |
|
Suppressing Noise Disparity in Training Data for Automatic Pathological Speech Detection, and , in: IWAENC, 2024 |
SYLLABLE LEVEL FEATURES FOR PARKINSON'S DISEASE DETECTION FROM SPEECH, and , in: ICASSP, 2024 |
|
Synergizing Natural Language Towards Enhanced Shared Autonomy, , and , in: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024 |
[URL] |
Test-time adaptation for automatic pathological speech detection in noisy environments, and , in: EUSIPCO, 2024 |
|
Towards interfacing large language models with ASR systems using confidence measures and prompting, , , , and , in: Proceedings of Interspeech, pages 2980-2984, 2024 |
[DOI] |
Towards Robo-Coach: Robot Interactive Stiffness/Position Adaptation for Human Strength and Conditioning Training, , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2024 |
Towards Wine Tasting Activity Recognition for a Digital Sommelier, , , and , in: Proc. ACM Workshop on Exploring Innovative Technology for Commensality and Human-Food Interaction, 2024 |
Understanding the effects of language-specific class imbalance in multilingual fine-tuning, and , in: Findings of the European chapter of Association for Computational Linguistics, 2024, 2024 |
|
Using Backbone Foundation Model for Evaluating Fairness in Chest Radiography Without Demographic Data, , and , in: Proceedings of the IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024 |
|
Vascular Biometrics Experiments on Candy -- A New Contactless Finger-Vein Dataset, , , , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR), Calcutta (India), 2024 |
|
Vulnerability of Face Age Verification to Replay Attacks, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024 |
|
2023
A benchmark for the simulation of meshed district heating networks based on anonymised monitoring data, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
A Machine Learning Model for the Prediction of Building Hourly Heating Demand from CityGML Files: Training Workflow and Deployment as an API, , and , in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, pages 2932 - 2939, 2023 |
[DOI] [URL] |
A Multitask and Kernel approach for Learning to Push Objects with a Task-Parameterized Deep Q-Network, , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023 |
|
A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence, , , and , in: Interspeech, Dublin, Ireland, ISCA, 2023 |
|
A Symbolic Framework for Systematic Evaluation of Mathematical Reasoning with Transformers, , , and , in: Under review, 2023 |
[URL] |
A VAE for Transformers with Nonparametric Variational Information Bottleneck, and , in: The Eleventh International Conference on Learning Representations, 2023 |
[URL] |
Affordance segmentation of hand-occluded containers from exocentric images, , , , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
Approximating Optimal Morphing Attacks using Template Inversion, , and , in: IEEE International Joint Conference on Biometric, 2023 |
[DOI] |
Automatic Speech Analysis Framework for ATC Communication in HAAWAII, , , , , and , in: 13th SESAR Innovation Days, 2023 |
|
Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers’ Workload, , , , , , , , , , , and , in: Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023), Eurocontrol (Europe), FAA (U.S.), Savannah, Georgia, USA, 2023 |
[URL] |
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning, , , , and , in: Under review, 2023 |
[URL] |
Black-box Attacks on Image Activity Prediction and its Natural Language Explanations, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
Blackbox Face Reconstruction from Deep Facial Embeddings Using A Different Face Recognition Model, and , in: Proceedings of the IEEE International Conference on Image Processing (ICIP), Kuala Lumpur, Malaysia, pages 2435-2439, 2023 |
[DOI] [URL] |
BLESS: Benchmarking Large Language Models on Sentence Simplification, , , , , , and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023 |
|
Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, and , in: IJCB, 2023 |
|
Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding, and , in: Proc. INTERSPEECH 2023, pages 1109-1113, 2023 |
[DOI] |
Can Language Models Learn Analogical Reasoning? Investigating Training Objectives and Comparisons to Human Performance, and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, Association for Computational Linguistics, 2023 |
|
Can personalised hygienic masks be used to attack face recognition systems?, , , and , in: Proceedings of IEEE International Joint Conference on Biometrics (IJCB2023), 2023 |
|
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?, and , in: Proceedings of Interspeech, 2023 |
|
Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder, , , and , in: Under review, 2023 |
[URL] |
ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023 |
|
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice, , , and , in: Proc. Interspeech 2023, 2023 |
[URL] |
Complex Daily Activities, Country-Level Diversity, and Smartphone Sensing: A Study in Denmark, Italy, Mongolia, Paraguay, and UK, , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI), Association for Computing Machinery, 2023 |
[DOI] |
Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training, , , , , , and , in: Proc. 13th SESAR Innovation Days, Seville, Spain, 2023 |
[DOI] [URL] |
Data-driven Urban Building Energy Modeling with Machine Learning in Satom (CH), , and , in: 6th International IEEE Conference AND Workshop in Obuda on Electrical and Power Engineering, 2023 |
Demonstration-guided Optimal Control for Long-term Non-prehensile Planar Manipulation, , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 4999-5005, 2023 |
[DOI] |
Diffusion Transformer for Adaptive Text-to-Speech, and , in: Proc. 12th ISCA Speech Synthesis Workshop (SSW 12), 2023 |
[DOI] |
Document-level Text Simplification with Coherence Evaluation, , , and , in: Proceedings of the Second Workshop on Text Simplification, Accessibility and Readability, 2023 |
|
EFaR 2023: Efficient Face Recognition Competition, , , , and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, , , , , , , , and , in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023 |
|
Efficient Grapevine Structure Estimation in Vineyards Conditions, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, pages 712--720, 2023 |
[URL] |
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation, , , , , , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, 2023 |
[URL] |
Energy assessment of a district by integrating solar thermal in district heating network: a dynamic analysis approach, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Enhancing Multi-modal Classification of Violent Events using Image Captioning, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), CEUR Workshop Proceedings, Jaén, Spain, 2023 |
[URL] |
Enhancing user acceptance in automated systems with human-centric lighting: the role of visual comfort, personality, and preference, , , , , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), pages 17408-17419, 2023 |
[DOI] |
Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework, and , in: 2nd ACM International Workshop on Multimedia AI against Disinformation (MAD '23), June 12, 2023, Thessaloniki, Greece, 2023 |
|
Face Reconstruction from Facial Templates by Learning Latent Space of a Generator Network, and , in: Thirty-seventh Conference on Neural Information Processing Systems, 2023 |
[URL] |
Factors that Affect Personalization of Robots for Older Adults, , and , in: CONCATENATE Workshop at HRI 2023 in Stockholm, Sweden, 2023 |
[URL] |
Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, and , in: Proceedings of Interspeech, pages 156-160, 2023 |
[DOI] [URL] |
Findings of the IWSLT 2023 evaluation campaign, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the IWSLT conference, 2023 |
Framing the News: From Human Perception to Large Language Model Inferences, and , in: International Conference on Multimedia Retrieval (ICMR '23), June 12--15, 2023, Thessaloniki, Greece, 2023 |
|
Fully Automatic Grading of Retinal Vasculitis on Fluorescein Angiography Time-lapse from Real-world Data in Clinical Settings, , , , , , , and , in: 2023 IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS), L'Aquila, Italy, 2023, pages 689-693, 2023 |
[DOI] [URL] |
GAP: Differentially Private Graph Neural Networks with Aggregation Perturbation, , , and , in: 32nd USENIX Security Symposium (USENIX Security 23), 2023 |
|
How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, , , , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Human-Robot Collaboration in a Sanding Task, , , , , , , , and , in: Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 2023 |
|
HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition, , , and , in: Proc. Interspeech 2023, Ireland, 2023 |
|
HyperMixer: An MLP-based Low Cost Alternative to Transformers, , , , , , and , in: Proc. of the 61st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Toronto, Canada, pages 15632-15654, 2023 |
[DOI] |
ID and OOD performance are sometimes inversely correlated on real-world datasets, , , and , in: Advances in Neural Information Processing Systems (NeurIPS), 2023 |
Implementing contextual biasing in GPU decoder for online ASR, , , , , , and , in: Proc. Interspeech 2023, 2023 |
|
Implicit phonetic information modeling for speech emotion recognition, , and , in: Interspeech, Dublin, Ireland, ISCA, 2023 |
|
International Conference on the Voynich Manuscript 2022, , , , , , and , in: Proceedings of the International Conference on Historical Cryptology, 2023 |
Inversion of Deep Facial Templates using Synthetic Data, and , in: Proceedings of the IEEE International Joint Conference on Biometric, 2023 |
[DOI] [URL] |
Keep Sensors in Check: Disentangling Country-Level Generalization Issues in Mobile Sensor-Based Models with Diversity Scores, , , and , in: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023 |
|
Learning Disentangled Representations for Natural Language Definitions, , , and , in: In Findings of the European chapter of Association for Computational Linguistics, 2023 |
|
Learning diverse features in vision transformers for improved generalization, , , and , in: ICML 2023: The Second Workshop on Spurious Correlations, Invariance and Stability, 2023 |
[URL] |
Learning Joint Space Reference Manifold for Reliable Physical Assistance, , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 10412-10417, 2023 |
[DOI] |
Learning to Abstract with Nonparametric Variational Information Bottleneck, , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing, 2023 |
[URL] |
Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks, , , , and , in: Under review, 2023 |
[URL] |
MLP-Hash: Protecting Face Templates via Hashing of Randomized Multi-Layer Perceptron, , and , in: Proceedings of the 31st European Signal Processing Conference, Helsinki, Finland, 2023 |
[DOI] [URL] |
Multi-image deconvolution of thermal images with a boundary condition weighting scheme, , , , and , in: Target and Background Signatures IX, International Society for Optics and Photonics, Amsterdam, pages 149-158, SPIE, 2023 |
[DOI] [URL] |
Multi-IVE: Privacy Enhancement of Multiple Soft-Biometrics in Face Embeddings, , , , , , , and , in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023 |
[URL] |
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports, , , , , and , in: Proceedings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023 |
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, , , and , in: Proceedings of Interspeech, 2023 |
|
On Interventional Probing in High Dimensions: An NLI Case Study, , , and , in: Findings of the 17th European Chapter of the Association for Computational Linguistics, 2023 |
Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, , , , , and , in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023 |
|
Potential for district heating networks from waste heat: an assessment tool and its application to sewage treatment plants in the Canton of Zurich, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Quantified Canine: Inferring Dog Personality From Wearables, , , , , and , in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI) 2023, Association for Computing Machinery, 2023 |
[DOI] |
Referencing in YouTube Knowledge Communication Videos, and , in: ACM International Conference on Interactive Media Experiences (IMX '23), June 2023, Nantes, France, 2023 |
|
Remote Cancelable Biometric System for Verification and Identification Applications, , , and , in: Proceedings of the International Conference of the Biometrics Special Interest Group (BIOSIG), 2023 |
[DOI] [URL] |
Robust Execution of Assembly Policies Using a Pose Invariant Task Representation, , , , , and , in: 20th International Conference on Ubiquitous Robots (UR), Honolulu, HI, USA, IEEE, 2023 |
|
RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question, , , , , , and , in: Association for Computational Linguistics: ACL 2023, Toronto, Canada, 2023 |
[URL] |
Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup, , and , in: Under review, 2023 |
[URL] |
SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data, , , , , and , in: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Association for Computational Linguistics, 2023 |
[DOI] [URL] |
Shortcut Bias Mitigation via Ensemble Diversity Using Diffusion Probabilistic Models, , , , , and , in: Under review, 2023 |
[URL] |
Situated Participatory Design: A Method for In Situ Design of Robotic Interaction with Older Adults, , and , in: CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023 |
[DOI] [URL] |
SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph Transformer, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023 |
|
Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling, and , in: Procceedings of 8th Workshop on Representation Learning for NLP, 2023 |
[URL] |
SynthDistill: Face Recognition with Knowledge Distillation from Synthetic Data, , and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
[DOI] |
Template Inversion Attack against Face Recognition Systems using 3D Face Reconstruction, and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 19662-19672, 2023 |
[DOI] [URL] |
The AI4Autism Project: A Multimodal and Interdisciplinary Approach to Autism Diagnosis and Stratification, , , , , , , and , in: Companion Publication of the 25th International Conference on Multimodal Interaction, Paris, France, pages 414–425, Association for Computing Machinery, 2023 |
[DOI] [URL] |
The Idiap Speech Synthesis System for the Blizzard Challenge 2023, , , and , in: Proc. 18th Blizzard Challenge Workshop, 2023 |
[DOI] |
The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation, and , in: Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 4408-4423, Association for Computational Linguistics, 2023 |
[DOI] |
The Unconstrained Ear Recognition Challenge 2023: Maximizing Performance and Minimizing Bias, and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
Toward responsible face datasets: modeling the distribution of a disentangled latent space for sampling face images from demographic groups, , and , in: IEEE International Joint Conference on Biometrics, 2023 |
|
Towards Improved Replicability of Human Studies in Human-Robot Interaction: Recommendations for Formalized Reporting, , , , , , , , , , and , in: Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, pages 629-633, 2023 |
|
Towards learning emotion information from short segments of speech, , , , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes Island, Greece, IEEE, 2023 |
|
Transformers as Graph-to-Graph Models, , , and , in: Big Picture Workshop at EMNLP 2023, 2023 |
Transformers, Tables and Frame Semantics, , , and , in: International Conference on Semantic Computing, 2023 |
Understanding the Social Context of Eating with Multimodal Smartphone Sensing: The Role of Country Diversity, , and , in: 25th ACM International Conference on Multimodal Interaction, 2023 |
[DOI] [URL] |
Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report, , , , , and , in: Proceedings of Interspeech, pages 4573-4577, 2023 |
[DOI] [URL] |
Verification of PyDHN - a Python library for the thermo-hydraulic simulation of district heating networks - through the DESTEST, , and , in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, IBPSA, IBPSA, 2023 |
[DOI] [URL] |
VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, , , and , in: Proc. IEEE International Conference on Robotics and Automation (ICRA), 2023 |
|
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes, , , and , in: IEEE International Joint Conference on Biometrics, 2023 |
|
Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines, , , , , , and , in: Under review, 2023 |
[URL] |
ZooPFL: Exploring Black-box Foundation Models for Personalized Federated Learning, , , , , , , , and , in: Under review, 2023 |
[URL] |
2022
A Corpus and Evaluation for Predicting Semi-Structured Human Annotations, , , , and , in: Workshop on Generation, Evaluation and Metrics (GEM), 2022 |
|
A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022 |
|
A two-step approach to leverage contextual data: speech recognition in air-traffic communications, , , , and , in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 |
|
Active Learning by Feature Mixing, , , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022 |
Adversarial-free speaker identity-invariant representation learning for automatic dysarthric speech classification, and , in: Annual Conference of the International Speech Communication Association, 2022 |
|
An anomaly detection approach for backdoored neural networks: face recognition as a case study, and , in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), Darmstadt, Germany, 2022 |
|
An Empirical Comparison of Semantic Similarity Methods for Analyzing down-streaming Automatic Minuting task, , , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In ACL Anthology Proceedings, 2022 |
|
An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36) , In proceedings of ACL Anthology, 2022 |
|
An exploratory interplay between daylight, general and task lighting for visual comfort and electricity savings in a personal office space, , , , , , and , in: Proceedings of ISES and IEA SHC International Conference on Solar Energy for Buildings and Industry, Kassel, Germany, 2022 |
Are GAN-based Morphs Threatening Face Recognition?, , , and , in: International Conference on Acoustics, Speech and Signal Processing, 2022 |
|
Automatic Minuting: A Pipeline Method for Generating Minutes, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology, 2022 |
|
Automatic Summarization for Creative Writing: Denoising Auto-Encoder based Pipeline Method for Generating Summary of Movie Scripts, , , , and , in: Automatic Summarization for Creative Writing, International Conference on Computational Linguistics (COLING 2022), 2022 |
|
Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation, and , in: Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Association for Computational Linguistics, Online, pages 468–488, 2022 |
[URL] |
Bayesian Recurrent Units and the Forward Backward Algorithm, and , in: Proc. Interspeech 2022, pages 4137-4141, 2022 |
[DOI] |
Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, , , , and , in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
Borrowing from yourself: Faster future video segmentation with partial channel update, and , in: International Conference on Pattern Recognition, 2022 |
|
Case-Based Abductive Natural Language Inference, , and , in: Proceedings of the 29th International Conference on Computational Linguistics, 2022 |
[URL] |
Comparing Biosignal and Acoustic feature Representation for Continuous Emotion Recognition, , , , and , in: International Multimodal Sentiment Analysis Workshop and Challenge, 2022 |
|
Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track, , , and , in: Proceedings of the ICML Expressive Vocalizations Workshop held in conjunction with the 39th International Conference on Machine Learning, Maryland, USA, 2022 |
|
Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech, , , , , , , , , and , in: Annual Conference of the International Speech Communication Association, pages 2188-2192, 2022 |
[DOI] |
Conversational Speech Recognition Needs Data? Experiments with Austrian German, , , and , in: Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, pages 4684--4691, 2022 |
[URL] |
Custom attribution loss for improving generalization and interpretability of deepfake detection, , and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
Decomposing Natural Logic Inferences for Neural NLI, , , , and , in: BlackBoxNLP: Workshop on analyzing and interpreting neural networks for NLP, 2022 |
DeepCon: An End-to-End Multilingual Toolkit for Automatic Minuting of Multi-Party Dialogues, , , and , in: Special Interest Group on Discourse and Dialogue (SIGDIAL 2022), 2022 |
|
Demystifying the Scribes behind the Voynich Manuscript using Computational Linguistic Techniques, , and , in: Proceedings of the 1st International Conference on the Voynich Manuscript, 2022 |
DHgeN: Automated Generation of District Heating Network Layouts for Feasibility Studies, and , in: -, 2022 |
|
EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering, , , , and , in: arXiv, 2022 |
Efficient Training of Low-Curvature Neural Networks, , , and , in: NeurIPS 2022, 2022 |
[URL] |
Efficient Wind Speed Nowcasting with GPU-Accelerated Nearest Neighbors Algorithm, , and , in: Proceedings of SIAM Data Mining, Virginia US and Virtual, 2022 |
Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization, , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022 |
|
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, , , , and , in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022 |
[DOI] |
Experimental investigation on STFT phase Representations for deep learning-based dysarthric speech detection, and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
Face Anthropometry Aware Audio-visual Age Verification, and , in: ACM Multimedia, 2022 |
|
Face Reconstruction from Deep Facial Embeddings using a Convolutional Neural Network, , and , in: Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, IEEE, 2022 |
[DOI] [URL] |
Fairness Index Measures to Evaluate Bias in Biometric Recognition, and , in: International Conference on Pattern Recognition Workshops, 2022 |
|
From Undercomplete to Sparse Overcomplete Autoencoders to Improve LF-MMI Speech Recognition, and , in: Proceedings of Interspeech Conference, 2022 |
|
GeoNeRF: Generalizing NeRF with Geometry Priors, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2022 |
[URL] |
Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition, , , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
Graph Refinement for Coreference Resolution, and , in: Findings of Association for >Computational Linguistics: ACL 2022, 2022 |
Health Talk: Understanding Practices of Popular Professional YouTubers, , , , and , in: Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia, 2022 |
Hierarchical Multi-task learning framework for Isometric-Speech Language Translation, , , and , in: ACL, 2022 |
|
HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing, , , and , in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
How Did Europe’s Press Cover Covid-19 Vaccination News? A Five-Country Analysis, and , in: MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, 2022 |
[DOI] [URL] |
Hybrid Autoregressive Inference for Scalable Multi-hop Explanation Regeneration, , , and , in: Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022 |
Hybrid Protection of Biometric Templates by Combining Homomorphic Encryption and Cancelable Biometrics, , , , , and , in: Proceedings of the 2022 International Joint Conference on Biometrics (IJCB), Abu Dhabi, United Arab Emirates (UAE), IEEE, 2022 |
[DOI] [URL] |
IDIAP Submission@LT-EDI-ACL2022 : Hope Speech Detection for Equality, Diversity and Inclusion, and , in: ACL, 2022 |
|
IDIAP Submission@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text, and , in: ACL, 2022 |
|
IDIAP Submission@LT-EDI-ACL2022: Homophobia/Transphobia Detection in social media comments, and , in: ACL Proceedings, 2022 |
|
IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
IDIAP_TIET@LT-EDI-ACL2022 : Hope Speech Detection in Social Media using Contextualized BERT with Attention Mechanism, , and , in: ACL, 2022 |
|
Imitation of Manipulation Skills Using Multiple Geometries, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022 |
|
Indexing Protected Deep Face Templates by Frequent Binary Patterns, , , , and , in: Proceedings of the 2022 International Joint Conference on Biometrics (IJCB), Abu Dhabi, United Arab Emirates (UAE), IEEE, 2022 |
[DOI] [URL] |
Innovators@SMM4H'22: An Ensembles Approach for self-reporting of COVID-19 Vaccination Status Tweets, , , , and , in: International Conference on Computational Linguistics (COLING 2022), 2022 |
|
Innovators@SMM4H'22: An Ensembles Approach for Stance and Premise Classification of COVID-19 Health Mandates Tweets, , , , and , in: International Conference on Computational Linguistics (COLING 2022), 2022 |
Learning to Guide Online Multi-Contact Receding Horizon Planning, , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022 |
Leveraging Events Sub-Categories for Violent-Events Detection in Social Media, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022 |
[URL] |
Local estimation of parametric point spread functions in thermal images via convolutional neural networks, , , and , in: SPIE sensors + imaging, Target and Background Signatures VIII, Berlin, Germany, pages 1227009 1--8, SPIE, 2022 |
[DOI] [URL] |
Low-Level Physiological Implications of End-to-End Learning for Speech Recognition, and , in: Proc. Interspeech 2022, pages 749--753, 2022 |
[DOI] |
Modeling Of Pre-trained Neural Network Embeddings Learned From Raw Waveform For Covid-19 Infection Detection, , , and , in: Proceedings of ICASSP, 2022 |
|
Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers, , , , and , in: Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, Seattle, USA, pages 89–100, 2022 |
|
Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers, , , and , in: International Conference on Language Resources and Evaluation (LREC 2022), 2022 |
|
On Breathing Pattern Information in Synthetic Speech, and , in: Proceedings of Interspeech, 2022 |
|
On the detection of morphing attacks generated by GANs, and , in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), 2022 |
|
On-demand compute reduction with stochastic wav2vec 2.0, , , and , in: Proceedings of Interspeech, 2022 |
|
Paumer: Patch Pausing Transformer for Semantic Segmentation, , and , in: 33th British Machine Vision Conference 2022, London, UK, 21 - 24 November 2022, 2022 |
|
PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models, , , , , , and , in: ACL, 2022 |
|
Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese, , , , and , in: Proceedings of the workshop on Deep Learning for Low-Resource NLP, 2022 |
[URL] |
Predicting is not understanding: Recognizing and addressing underspecification in machine learning, , and , in: European Conference on Computer Vision, pages 458-476, Springer, 2022 |
Pulmonary Tuberculosis Screening from Radiological Signs on Chest X-Ray Images Using Deep Models, , and , in: Union World Conference on Lung Health, The Union, 2022 |
Reactive Anticipatory Robot Skills with Memory, , and , in: The International Symposium on Robotics Research, 2022 |
|
Readback Error Detection by Automatic Speech Recognition and Understanding -- Results of HAAWAII Project for Isavia’s Enroute Airspace, , , , , , , , , and , in: 11th SESAR Innovation Days, SESAR, pages 9, 2022 |
|
Reasoning over vision and language: Exploring the benefits of supplemental knowledge, , , and , in: arXiv, 2022 |
Residual Feature Pyramid Network for Enhancement of Vascular Patterns, and , in: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022 |
|
SelecMix: Debiased Learning by Contradicting-pair Sampling, , , , , , and , in: Advances in Neural Information Processing Systems, 2022 |
SelecMix: Debiased Learning by Mixing up Contradicting Pairs, , , , , , and , in: ICML Workshop on Spurious Correlations, Invariance and Stability, 2022 |
Shallow Discourse Parsing for Open Information Extraction and Text Simplification, , and , in: 3rd Workshop on Computational Approaches to Discourse (CODI) @ COLING, 2022 |
SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages, , , , , and , in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022 |
|
Speaker recognition on mono-channel telephony recordings, , , , and , in: The Speaker and Language Recognition Workshop, 2022 |
|
Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator, , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
Symmetry-induced Disentanglement on Graphs, , and , in: Advances in Neural Information Processing Systems 35, 2022 |
Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective, , , , and , in: Findings of the ACL, 2022 |
Team Innovators at SemEval-2022 for Task 8: Multi-Task Training with Hyperpartisan and Semantic Relation for Multi-Lingual News Article Similarity, , , , and , in: Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), NAACL 2022, 2022 |
|
TextGraphs 2022 Shared Task on Natural Language Premise Selection, , , , and , in: Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing, 2022 |
[URL] |
The Winning Approach for the Recommendation Systems Shared Task @REST_MEX 2022, , , , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022 |
[URL] |
To be or not to be an Integer? Encoding Variables for Mathematical Text, , , , and , in: Findings of the ACL, 2022 |
Towards Accessible Sign Language Learning and Assessment, , , and , in: ACM International Conference on Multimodal Interaction, Bangalore, INDIA, pages 626-631, 2022 |
[DOI] |
Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, , , and , in: ACM International Conference on Multimodal Interaction (ICMI Companion), 2022 |
[DOI] |
Towards energy hubs: an innovative Geographic Information System based approach for cluster definition, , , and , in: ICREC 2022 Conference Proceedings, 2022 |
UM-DFKI Maltese Speech Translation, , , , , , , , , and , in: Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), 2022 |
UNSL at eRisk 2022: Decision policies with history for early classification, , , and , in: CEUR Workshop Proceedings, 2022 |
[URL] |
Unsupervised Token-level Hallucination Detection from Summary Generation By-products, and , in: Workshop on Generation, Evaluation and Metrics (GEM), 2022 |
|
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering, , and , in: Proceedings of Interspeech, 2022 |
|
Vision-Language Pretraining: Current Trends and the Future, , and , in: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2022 |
[URL] |
Visually Grounded Interpretation of Noun-Noun Compounds in English, , , and , in: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, Association for Computational Linguistics, 2022 |
Voyager: Data Discovery for Onboarding in Data Science, , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2022 |
What Do Compressed Multilingual Machine Translation Models Forget?, , , , , and , in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022 |
|
Why Scholars Are Diagramming Neural Network Models, , and , in: 13th International Conference on the Theory and Application of Diagrams, 2022 |
Your Day in Your Pocket: Complex Activity Recognition from Smartphone Accelerometers, , and , in: EAI Pervasive Health, 2022 |
|
2021
A Bayesian Interpretation of the Light Gated Recurrent Unit, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
[DOI] |
A Comparative Study Of Simulation Tools To Model The Solar Irradiation On Building Façades, , , , , , , , , , , , , , , and , in: Proceedings of SWC 2021: ISES Solar World Congress, ISES, 2021 |
[DOI] [URL] |
A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET, , and , in: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Toronto, Ontario, Canada, 2021 |
|
A Laser-based Dual-arm System for Precise Control of Collaborative Robots, , and , in: IEEE International Conference on Robotics and Automation, 2021 |
|
A machine-learning model for the prediction of aggregated building heating demand from pan-European land-use maps, , and , in: Journal of Physics: Conference Series, 2021 |
[DOI] |
An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, , and , in: Proc. of Workshop on Emerging paradigms for robotic manipulation: from the lab to the productive world, ICRA, 2021 |
An Efficient Image-to-Image Translation HourGlass-based Architecture for Object Pushing Policy Learning, , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021 |
|
An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021 |
|
An Objective Evaluation Framework for Pathological Speech Synthesis, , , , , and , in: Proceedings of ITG Conference on Speech Communication, 2021 |
|
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, , , , and , in: Proceedings of the 2021 International Conference on Multimodal Interaction, ACM, 2021 |
[DOI] |
Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning, , , , , , , and , in: 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, pages 1-9, IEEE, 2021 |
[DOI] |
Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech, , , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
|
Automatic Dialect Detection for Low Resource Santali Language, , , , , and , in: Proceeding of International Conference on Information Technology (OCIT), 2021 |
|
AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, Toronto, Canada, pages 7328–7332, 2021 |
|
Automatic processing pipeline for collecting and annotating air-traffic voice communication data, , , , , , , , , and , in: Proceedings of 9th OpenSky Symposium 2020, OpenSky Network, Brussels, Belgium, pages 1-9, MDPI, 2021 |
|
Beyond question-based biases: Assessing multimodal shortcut learning in visual question answering, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Boosting of contextual information in ASR for air-traffic call-sign recognition, , , , , , , and , in: Interspeech 2021, 2021 |
|
Challenges for Using Impact Regularizers to Avoid Negative Side Effects, , and , in: SafeAI 2021 - AAAI's Workshop on Artificial Intelligence Safety, 2021 |
|
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers, , and , in: NeurIPS, 2021 |
|
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, , and , in: Proceedings of Interspeech, 2021 |
[URL] |
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , in: Interspeech 2021, 2021 |
[URL] |
Cost–effective Variational Active Entity Resolution, , , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2021 |
[URL] |
Cross Modal Focal Loss for RGBD Face Anti-Spoofing, and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 |
|
Deep Auto-Encoding and Biohashing for Secure Finger Vein Recognition, and , in: Proceedings of the 2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Toronto, Canada, 2021 |
[DOI] [URL] |
DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6039-6048, 2021 |
[URL] |
Development of a lung segmentation algorithm for analog imaged chest X-Ray: preliminary results, , , , and , in: XV Brazilian Congress on Computational Intelligence, Joinville, Brazil, 2021 |
[URL] |
Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders, and , in: The 2021 Conference on Empirical Methods in Natural Language Processing, 2021 |
District heating network modelling for future integration of solar thermal energy, , , and , in: Journal of Physics: Conference Series, pages 012089, IOP Publishing, 2021 |
[DOI] |
Do Natural Language Explanations Represent Valid Logical Arguments? Verifying Entailment in Explainable NLI Gold Standards, , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
Does My Representation Capture X? Probe-Ably, , , , and , in: 59th Annual Meeting of the Association for Computational Linguistics (Demonstration track), 2021 |
[URL] |
Encoding Explanatory Knowledge for Zero-shot Science Question Answering, , , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
Estimating Nonplanar Flow from 2D Motion-blurred Widefield Microscopy Images via Deep Learning, and , in: International Symposium on Biomedical Imaging, 2021, 2021 |
|
Explainable Inference Over Grounding-Abstract Chains for Science Questions, , and , in: 59th Annual Meeting of the Association for Computational Linguistics (ACL Findings), 2021 |
|
Explainable Natural Language Reasoning via Conceptual Unification, , and , in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |
[URL] |
Face Liveness Detection Competition (LivDet-Face) - 2021, , , , , , , , , and , in: International Joint Conference on Biometrics, 2021 |
Fusion of Acoustic and Linguistic Information Using Supervised Autoencoder for Improved Emotion Recognition, , and , in: 2nd Multimodal Sentiment Analysis Challenge (MuSe '21), October 24, 2021, Virtual Event, China, 2021 |
[DOI] |
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10--13, 2021 |
[DOI] |
Handling acoustic variation in dysarthric speech recognition systems through model combination, and , in: Proceedings of Interspeech, 2021 |
|
Identification of F1 and F2 in speech using modified zero frequency filtering, and , in: Proceedings of Interspeech, 2021 |
|
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Implementation of machine learning techniques for the quasi real-time blind and electric lighting optimization in a controlled experimental facility, , , , , and , in: Journal of Physics: Conference Series, IOP Publishing, 2021 |
[DOI] [URL] |
Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning, , , and , in: Proceedings of the 25th Conference on Computational Natural Language Learning, Online, pages 337-348, Association for Computational Linguistics, 2021 |
Improving Emotional TTS with an Emotion Intensity Input from Unsupervised Extraction, and , in: 11th ISCA Speech Synthesis Workshop, 2021 |
[URL] |
Improving Generalization of Deepfake Detection by Training for Attribution, , and , in: International Workshop on Multimedia Signal Processing, 2021 |
|
Intrinsically-Motivated Robot Learning of Bayesian Probabilistic Movement Primitives, and , in: ICRA workshop: "Towards Curious Robots: Modern Approaches for Intrinsically-Motivated Intelligent Behavior", 2021 |
|
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, , , , , and , in: Proceedings of Interspeech 2021, ISCA-International Speech Communication Association 2021, 2021 |
|
LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021 |
[URL] |
Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, Colombia, Cartagena, IEEE, 2021 |
|
Locally Private Graph Neural Networks, and , in: ACM Conference on Computer and Communications Security (CCS), 2021 |
|
Machine learning techniques for the daylight and electric lighting performance predictions, , and , in: Proceedings of Building Simulation 2021, 2021 |
Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates, , , , , , , , and , in: 11th SESAR Innovation Days, 2021 |
|
Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, , and , in: Proceedings of Interspeech, 2021 |
[DOI] |
Multi-Adversarial Learning for Cross-Lingual Word Embeddings, , and , in: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Online, pages 463-472, 2021 |
Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction, , and , in: Proceedings of Interspeech 2021, 2021 |
Multi-task Single Channel Speech Enhancement Using Speech Presence Probability As A Secondary Task Training Target, , and , in: European Signal Processing Conference, EUSIPCO 2021, 2021 |
|
Multimodal Neural Machine Translation System for English to Bengali, , , , , , and , in: Proceedings of the First Workshop on Multimodal Machine Translation for Low Resource Languages (MMTLRL 2021), Online (Virtual Mode), pages 31--39, INCOMA Ltd., 2021 |
[URL] |
Multitask adaptation with Lattice-Free MMI for multi-genre speech recognition of low resource languages, , and , in: Proceedings of Interspeech 2021, 2021 |
|
Natural Language Inference over Tables: Enabling Explainable Data Exploration on Data Lakes, , , and , in: 18th Extended Semantic Web Conference (ESWC), 2021 |
[URL] |
NLPHut's Participation at WAT2021, , , , , , , and , in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 146--154, Association for Computational Linguistics, 2021 |
[URL] |
On Modeling Glottal Source Information for Phonation Assessment in Parkinson’s Disease, , , , and , in: Proceedings of Interspeech, 2021 |
|
On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, and , in: International Joint Conference on Biometrics (IJCB 2021), 2021 |
|
On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning, , , and , in: Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021 |
On the Recognition Performance of BioHashing on state-of-the-art Face Recognition models, , and , in: Proceedings of the 13th IEEE International Workshop on Information Forensics and Security (WIFS), Montpellier, France, IEEE, 2021 |
[DOI] [URL] |
On The Relationship Between Speech-based Breathing Signal Prediction Evaluation Measures And Breathing Parameters Estimation, , , , and , in: Proc. of ICASSP, 2021 |
|
On the use of automatically generated synthetic image datasets for benchmarking face recognition, , and , in: International Joint Conference on Biometrics (IJCB 2021), 2021 |
|
Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), , , , , , , , and , in: Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, pages 218–223, Association for Computational Linguistics, 2021 |
[DOI] [URL] |
Open-Set Speaker Identification pipeline in live criminal investigations, and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021 |
|
Optics Versus Computation: Influence of Illumination and Reconstruction Model Accuracy in Focal-Plane-Scanning Optical Projection Tomography, and , in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), Nice, France, pages 567-570, IEEE, 2021 |
[DOI] |
Optimal Control Combining Emulation and Imitation to Acquire Physical Assistance Skills, , and , in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021 |
|
Optimization of robot configurations for motion planning in industrial riveting, , , and , in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021 |
|
Overview of the 8th Workshop on Asian Translation, , , , , , , , , , , , , , , and , in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 1--45, Association for Computational Linguistics, 2021 |
[URL] |
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks, , , and , in: ACL, 2021 |
|
Perspectives and limitations of visible-thermal image pair synthesis via generative adversarial networks, , , , and , in: Security + Defence, Target and Background Signatures VII, Proc. of SPIE, online only, pages 1186509-1--1186509-8, SPIE, 2021 |
[DOI] [URL] |
Phoneme based Respiratory Analysis of Read Speech, , , and , in: Proceedings of European Signal Processing Conference (EUSIPCO), 2021 |
|
Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers, , and , in: International Conference in Computer Vision - Workshops, 2021 |
|
Probabilistic Iterative LQR for Short Time Horizon MPC, and , in: International Conference on Intelligent Robots and Systems, pages 579-585, 2021 |
[DOI] |
PROMPT: Probabilistic Motion Primitives based Trajectory Planning, , , and , in: Proceedings of Robotics: Science and Systems, 2021 |
[DOI] [URL] |
Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety, , , , , , , , , , , , , and , in: Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021), The United States Federal Aviation Administration (FAA), EUROCONTROL, pages 10, 2021 |
[URL] |
Rethinking the Role of Gradient-based Attribution Methods for Model Interpretability, and , in: International Conference on Learning Representations, 2021 |
|
Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances, , , , , , , and , in: Interspeech, 2021 |
|
ROXANNE Research Platform: Automate criminal investigations, , , , , and , in: Interspeech Show and Tell 2021, 2021 |
|
ROXSD: a Simulated Dataset of Communication in Organized Crime, , , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021 |
|
Scholarly AI system diagrams as an access point to mental models, , and , in: Diagrams, 2021 |
Sentence-level Planning for Especially Abstractive Summarization, and , in: Proceedings of the Third Workshop on New Frontiers in Summarization, pages 1--14, Association for Computational Linguistics, 2021 |
[URL] |
Speech Activity Detection Based on Multilingual Speech Recognition System, , and , in: Interspeech, 2021 |
|
STAR: Cross-modal Statement Representation for Selecting Relevant Mathematical Premises, and , in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |
Structuralist analysis for neural network system diagrams, , and , in: Diagrams, 2021 |
Subjective and objective evaluation of deepfake videos, and , in: The international Conference on Acoustics, Speech, and Signal Processing, 2021 |
|
Supervised Speech Representation Learning for Parkinson's Disease Classification, and , in: ITG Conference on Speech Communication, 2021 |
|
Supporting Context Monotonicity Abstractions in Neural NLI Models, , , , and , in: Natural Logic Meets Machine Learning Workshop, 2021 |
[URL] |
Switching Contexts: Transportability Measures for NLP, , , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling, and , in: Arxiv, 2021 |
|
Test time Adaptation through Perturbation Robustness, and , in: Workshop on Distribution Shifts, 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021 |
|
The DCU-EPFL Enhanced Dependency Parser at the IWPT 2021 Shared Task, , , , and , in: Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies, Online, pages 204-212, Association for Computational Linguistics, 2021 |
|
The Theory, Practice, and Ethical Challenges of Designing a Diversity-Aware Platform for Social Relations, , , , , , , and , in: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, pages 11, ACM, 2021 |
[DOI] |
Trajectory Prediction with Compressed 3D Environment Representation using Tensor Train Decomposition, , , and , in: International Conference on Advanced Robotics, 2021 |
|
Trust indicators and explainable AI: A study on user perceptions, , , , , , and , in: Proc. Int. Conf. on Human-Computer Interaction, Bari, Italy, 2021 |
|
Uncertainty Reduction for Model Adaptation in Semantic Segmentation, and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 |
|
Unequivocal cardiac phase sorting from alternating ramp- and pulse-illuminated microscopy image sequences, , , , and , in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), pages 868-872, 2021 |
[DOI] [URL] |
Unification-based Reconstruction of Multi-hop Explanations for Science Questions, , and , in: 16th conference of the European Chapter of the Association for Computational Linguistics, 2021 |
[URL] |
Unshuffling data for improved generalization in visual question answering, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Variational Information Bottleneck for Effective Low-Resource Fine-Tuning, , and , in: ICLR, 2021 |
|
Vein Enhancement with Deep Auto-Encoders to improve Finger Vein Recognition, , and , in: Biometrics Special Interest Group (BIOSIG 2021), 2021 |
|
Visual Focus of Attention Estimation in 3D Scene with an Arbitrary Number of Targets, and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 9, IEEE, 2021 |
|
What is SemEval evaluating? A Systematic Analysis of Evaluation Campaigns in NLP, , , and , in: EMNLP, 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021 |
Whole Body Model Predictive Control with a Memory of Motion:Experiments on a Torque-Controlled Talos, , , , , , , , , , , and , in: IEEE International Conference on Robotics and Automation, 2021 |
|
Zurich Like New: Analyzing Open Urban Multimodal Data, , and , in: Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data, 2021 |
|
2020
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition, , , , , , , , , , and , in: Proceedings of Interspeech, pages 2182-2186, 2020 |
|
A memory of motion for visual predictive control tasks, , and , in: International Conference on Robotics and Automation, 2020 |
|
A Phonology-based Approach for Isolated Sign Production Assessment in Sign Language, , , and , in: Companion Publication of the 2020 International Conference on Multimodal Interaction (ICMI '20 Companion), 2020 |
|
Active Improvement of Control Policies with Bayesian Gaussian Mixture Model, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020 |
Alone or With Others? Understanding Eating Episodes of College Students with Mobile Sensing, , and , in: 19th International Conference on Mobile and Ubiquitous Multimedia, ACM, Essen, Germany, pages 162–166, Association for Computing Machinery, 2020 |
[DOI] [URL] |
An HMM Approach with Inherent Model Selection for Sign Language and Gesture Recognition, , and , in: Proceedings of the International Conference on Language Resources and Evaluation LREC 2020, 2020 |
|
An Integrated and strategic evaluation of automatic blind controls to achieve energy and occupant's comfort objectives, and , in: Proceedings of the 5th IBPSA-England Conference on Building Simulation and Optimization (Virtual), Loughborough, UK, 2020 |
[URL] |
Analysis and Transfer of Human Movement Manipulability in Industry-like Activities, , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020 |
|
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, , , , , , , , , , , , , , , and , in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020 |
[DOI] [URL] |
Automatic Discrimination of Apraxia of Speech and Dysarthria using a Minimalistic Set of Handcrafted Features, , , and , in: Interspeech, 2020 |
|
Automatic Speech Recognition Benchmark for Air-Traffic Communications, , , , and , in: Proc. Interspeech 2020, pages 2297-2301, 2020 |
[DOI] |
BertAA: BERT fine-tuning for Authorship Attribution, , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR, and , in: IEEE International Conference on Image Processing, 2020 |
|
DeepFocus: a Few-shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function, and , in: International Symposium on Biomedical Imaging, 2020 |
|
Detection of S1 and S2 locations in phonocardiogram signals using zero frequency filter, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |
|
Detection of Similar Languages and Dialects Using Deep Supervised Autoencoders, , , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
DOMAIN ADAPTATION FOR GENERALIZATION OF FACE PRESENTATION ATTACK DETECTION IN MOBILE SETTINGS WITH MINIMAL INFORMATION, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, IEEE, 2020 |
[URL] |
Dysarthric Speech Recognition with Lattice-Free MMI, and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020 |
[DOI] [URL] |
End-to-End Bias Mitigation by Modelling Biases in Corpora, , and , in: ACL, 2020 |
|
Estimating The Degree of Sleepiness by Integrating Articulatory Feature Knowledge In Raw Waveform Based CNNs, , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020 |
|
Exact Preimages of Neural Network Aircraft Collision Avoidance Systems, and , in: Machine Learning for Engineering Modeling, Simulation, and Design Workshop at Neural Information Processing Systems 2020, 2020 |
|
Fast Transformers with Clustered Attention, , and , in: Proceedings of the International Conference on Neural Information Processing Systems, 2020 |
Fourier movement primitives: an approach for learning rhythmic robot skills from demonstrations, , and , in: Robotics: Science and Systems, 2020 |
|
Generating Master Faces for Use in PerformingWolf Attacks on Face Recognition Systems, , , and , in: International Join Conference on Biometrics, 2020 |
|
Generative adversarial training of product of policies for robust and adaptive movement primitives, , and , in: In Proc. Conference on Robot Learning (CoRL), 2020 |
|
Graph-to-Graph Transformer for Transition-based Dependency Parsing, and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2020 |
[URL] |
Graph-to-Graph Transformer for Transition-based Dependency Parsing, and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, ACL, Online, pages 3278–3289, Association for Computational Linguistics, 2020 |
[URL] |
Idiap & UAM participation at GermEval 2020: Classification and Regression of Cognitive and Motivational Style from Text, , , , and , in: Proceedings of the GermEval 2020 Shared Task on the Classification and Regression of Cognitive and Motivational style from Text, 2020 |
[URL] |
Idiap and UAM Participation at MEX-A3T Evaluation Campaign, , , , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020) co-located with 36th Conference of the Spanish Society for Natural Language Processing (SEPLN 2020), pages 6, CEUR Workshop Proceedings, 2020 |
[URL] |
Idiap Submission to Swiss-German Language Detection Shared Task, , , , and , in: Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), CEUR Workshop Proceedings, 2020 |
[URL] |
IMPROVING CROSS-DATASET PERFORMANCE OF FACE PRESENTATION ATTACK DETECTION SYSTEMS USING FACE RECOGNITION DATASETS, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, IEEE, 2020 |
[URL] |
INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION, , , , , and , in: Proceedings of ICASSP 2020, 2020 |
|
Iris Liveness Detection Competition (LivDet-Iris) – The 2020 Edition, , , , , , , , , , , , , and , in: INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2020), 2020 |
[URL] |
Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System, , , , , and , in: In Proceedings of Interspeech 2020, pages 4746--4750, ISCA, 2020 |
|
Learning How to Walk: Warm-starting Optimal Control Solver with Memory of Motion, , , , and , in: International Conference on Robotics and Automation, 2020 |
|
Learning Urban Nightlife Routines from Mobile Data, , and , in: Proc. Int. Conf. on Mobile and Ubiquitous Multimedia, Essen, Germany, 2020 |
|
Learning, Generating and Adapting Wave Gestures for Expressive Human-Robot Interaction, , and , in: Proc. ACM/IEEE Intl Conf. on Human-Robot Interaction (HRI), pages 386-388, 2020 |
[DOI] [URL] |
ManiGaze: a Dataset for Evaluating Remote Gaze Estimator in Object Manipulation Situations, , and , in: Symposium on Eye Tracking Research and Applications, Stuttgart, Germany, pages 5, ACM, 2020 |
[DOI] |
ODIANLP's Participation in WAT2020, , , , , , , , and , in: Proceedings of the 7th Workshop on Asian Translation, ACL Anthology, 2020 |
|
Optimizer Benchmarking Needs to Account for Hyperparameter Tuning, , , , and , in: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, 2020 |
[URL] |
Overview of the 7th Workshop on Asian Translation, , , , , , , , , , , and , in: Proceedings of the 7th Workshop on Asian Translation, Association for Computational Linguistics, 2020 |
[URL] |
Partially-supervised Mention Detection, and , in: Proceedings of the Third Workshop on Computational Models of Reference, Anaphora and Coreference, 2020 |
|
Plucking Motions for Tea Harvesting Robots Using Probabilistic Movement Primitives, , , and , in: IEEE International Conference on Robotics and Automation, 2020 |
Plug and Play Autoencoders for Conditional Text Generation, , , , and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Online, 2020 |
|
Protecting Mobile Food Diaries from Getting too Personal, , and , in: 19th International Conference on Mobile and Ubiquitous Multimedia, Essen, Germany, pages 212–222, Association for Computing Machinery, 2020 |
[DOI] [URL] |
pyannote.audio: neural building blocks for speaker diarization, , , , , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020 |
[URL] |
Real-Time Segmentation Networks should be Latency Aware, and , in: Asian Conference on Computer Vision, 2020 |
|
Residual Pose: A Decoupled Approach for Depth-based 3D Human Pose Estimation, , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020 |
|
Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks, , , , , and , in: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, pages 7499-7503, 2020 |
[DOI] |
Supervised domain adaptation for text-independent speaker verification using limited data, , , and , in: Interspeech, pages 3815-3819, 2020 |
[URL] |
SYNTHETIC SPEECH REFERENCES FOR AUTOMATIC PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT, , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020 |
|
The MuMMER data set for Robot Perception in multi-party HRI Scenarios, , , and , in: Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, 2020 |
|
The societal and ethical relevance of computational Creativity, , and , in: Proceedings of the International Conference on Computational Creativity, 2020 |
The Unstoppable Rise of Computational Linguistics in Deep Learning, , in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Online, pages 6294-6306, Association for Computational Linguistics, 2020 |
[DOI] [URL] |
Towards Multilingual Sign Language Recognition, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |
|
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention, , , and , in: Proceedings of International Conference on Machine Learning, 2020 |
Understanding Applicants' Reactions to Asynchronous Video Interviews through Self-Reports and Nonverbal Cues, , , , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction (ICMI), Utrecht, 2020 |
|
Understanding Heavy Drinking at Night through Smartphone Sensing and Active Human Engagement, , , and , in: Proceedings of the 14th EAI International Conference on Pervasive Computing Technologies for Healthcare, 2020 |
|
Unsupervised Representation Learning for Gaze Estimation, and , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020 |
|
Variational Inference with Mixture Model Approximation for Applications in Robotics, , and , in: International Conference on Robotics and Automation, 2020 |
|
2019
#Drink Or #Drunk: Multimodal Signals and Drinking Practices on Instagram, , and , in: Proceedings of the 13th EAI International Conference on Pervasive Computing Technologies for Healthcare, Trento, Italy, 2019 |
|
A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION, , and , in: In Proceedings of ICASSP 2019, Brighton, ENGLAND, pages 5786-5790, 2019 |
|
A Deep Learning Approach for Robust Head Pose Independent Eye Movements Recognition from Videos, , and , in: 2019 ACM Symposium on Eye Tracking Research and Applications, pages 5, ACM, 2019 |
[DOI] |
A Learning-Based Framework for Quantized Compressed Sensing, , and , in: A Learning-Based Framework for Quantized Compressed Sensing, 2019 |
|
A morphological based PV generation and energy consumption predictive model for Singapore neighbourhood, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
A smart luminaire in an office environment: impact on light distribution, user interactions and comfort, , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Abstract Text Summarization: A Low Resource Challenge, and , in: In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), HongKong, China, pages 5, Association for Computational Linguistics (ACL), 2019 |
|
Adaptation of Assistant Based Speech Recognition to New Domains and Its Acceptance by Air Traffic Controllers, , , , , , , , , and , in: Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems, San Diego, California, USA, pages 820 - 826, 2019 |
[DOI] |
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019 |
[DOI] |
Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task, , and , in: Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), Hong Kong, pages 27-33, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019 |
[URL] |
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, , , , and , in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pages 7040-7044, IEEE, 2019 |
[DOI] [URL] |
AN INVESTIGATION OF MULTILINGUAL ASR USING END-TO-END LF-MMI, , and , in: International Conference on Acoustics, Speech and Signal Processing, 2019 |
|
ANALYZING UNCERTAINTIES IN SPEECH RECOGNITION USING DROPOUT, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
Automatic Diagnosis of Alzheimer's Disease Using Neural Network Language Models, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Bayesian Optimization Meets Riemannian Manifolds in Robot Learning, , , and , in: Conference on Robot Learning, 2019 |
|
BookTubing Across Regions: Examining Differences based on Nonverbal and Verbal Cues, , and , in: Proc. ACM Int. Conf. on Interactive Experiences for Television and Online Video (TVX), Salford, ENGLAND, 2019 |
[DOI] |
Building energy models with Morphological urban-scale parameters: a case study in Turin, , , , , and , in: Proceedings of 4th Building Simulation Applications Conference - BSA 2019, 2019 |
[URL] |
CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, , and , in: International Conference on Learning Representations, New Orleans, Louisiana, USA, 2019 |
[URL] |
CityLearn v1.0: An OpenAI Gym Environment for Demand Response with Deep Reinforcement Learning, , , and , in: Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, New-York, USA, pages 356-357, ACM, 2019 |
[DOI] |
CO2 experimental measurements towards the development of a predictive framework using user actions in smart buildings, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, , , , , and , in: Proceedings of APSIPA ASC 2019, 2019 |
Custom Silicone Face Masks - Vulnerability of Commercial Face Recognition Systems & Presentation Attack Detection, , , , , , and , in: Proceedings of 7th IAPR/IEEE International Workshop on Biometrics and Forensics, 2019 |
|
Daylight regulated by automated external Venetian blinds based on HDR sky luminance mapping in winter, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Deep Pixel-wise Binary Supervision for Face Presentation Attack Detection, and , in: International Conference on Biometrics, 2019 |
|
Deep Residual Output Layers for Neural Language Generation, and , in: Proceedings of the 36th International Conference on Machine Learning (ICML), 2019 |
|
Discovering Eating Routines in Context with a Smartphone App, , , and , in: Ubicomp/Iswc'19 Adjunct: Proceedings Of The 2019 Acm International Joint Conference On Pervasive And Ubiquitous Computing And Proceedings Of The 2019 Acm International Symposium On Wearable Computers, London, pages 422-429, 2019 |
[DOI] |
Domain Adaptation in Multi-Channel Autoencoder based Features for Robust Face Anti-Spoofing, , and , in: International Conference on Biometrics 2019, IEEE, 2019 |
|
EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, and , in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019 |
|
End-to-End Accented Speech Recognition, , and , in: International Conference on Speech and Language Processing, Interspeech, ISCA, Graz, Austria, pages 2140-2144, 2019 |
[DOI] |
Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition, , , and , in: Proc. of Interspeech 2019, 2019 |
Full-Gradient Representation for Neural Network Visualization, and , in: Advances in Neural Information Processing Systems, 2019 |
[URL] |
Generalized temporal sampling with active illumination in optical microscopy, and , in: Proceeding of the SPIE Conference Optics and Photonics, Wavelets and Sparsity XVIII, SPIE, San Diego, California, United States, SPIE, 2019 |
|
HMM-based Approaches to Model Multichannel Information in Sign Language inspired from Articulatory Features-based Speech Processing, , , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Idiap Abstract Text Summarization System for German Text Summarization Task, and , in: Proceedings of the 4th edition of the Swiss Text Analytics Conference, 2019 |
[URL] |
Idiap NMT System for WAT 2019 Multimodal Translation Task, and , in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 175–180, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
Implicit discourse relation classification with syntax-aware contextualized word representations, , , and , in: Proceedings of the 32nd International Florida Artificial Intelligence Research Society Conference, 2019 |
Improving Children Speech Recognition through Feature Learning from Raw Speech Signal, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Improving dual-arm assembly by master-slave compliance, , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation, pages 8676-8682, 2019 |
|
Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis, , and , in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019 |
INCREMENTAL TRANSFER LEARNING IN TWO-PASS INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, , , and , in: Proceedings of ICASSP 2019, pages 6291-6295, 2019 |
Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, pages 795--799, 2019 |
Learning an event sequence embedding for event-based deep stereo, , , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2019 |
Learning from demonstration with model-based Gaussian process, , and , in: Conference on Robot Learning, 2019 |
|
Learning voice source related information for depression detection, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Multi-agent reinforcement learning for adaptive demand response in smart cities, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Multi-Spectral Widefield Microscopy of the Beating Heart through Post-Acquisition Synchronization and Unmixing, , , and , in: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), Venice, Italy, pages 1382-1385, 2019 |
[DOI] |
Multilingual Bottleneck Features for Query by Example Spoken Term Detection, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2019 |
|
Neural VTLN for Speaker Adaptation in TTS, and , in: Proc. 10th ISCA Speech Synthesis Workshop, ISCA, Vienna, Austria, pages 6, 2019 |
[DOI] |
Open-Vocabulary Keyword Spotting With Audio And Text Embeddings, , , and , in: Proceedings of Interspeech 2019, 2019 |
[DOI] |
Overview of the 6th Workshop on Asian Translation, , in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 1–35, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT BASED ON THE SHORT-TIME OBJECTIVE INTELLIGIBILITY MEASURE, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, UK, pages 6405--6409, 2019 |
|
Processing Megapixel Images with Deep Attention-Sampling Models, and , in: Proceedings of International Conference on Machine Learning, 2019 |
[URL] |
Reducing Noise in GAN Training with Variance Reduced Extragradient, , , and , in: Proceedings of the international conference on Neural Information Processing Systems, 2019 |
Reinforcement learning of trajectory distributions: Applications in assisted teleoperation and motion planning, , , , , and , in: IEEE International Conference on Intelligent Robots and Systems, 2019 |
Retrofitting, district heating and energy storage: neighborhood energy planning, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
SARAL: A Low-Resource Cross-Lingual Domain-Focused Information Retrieval System for Effective Rapid Document Triage, , , , , , , , , , , and , in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. 2019, pages 19-24, 2019 |
SATokE: How can Syntax-Aware Contextualized Word Representations Benefit Implicit Discourse Relation Classification?, , , and , in: Ptroc. 2019 Conference sur l'Apprentissage automatique, 2019 |
Segment-level training of ANNs based on acoustic confidence measures for hybrid HMM/ANN Speech Recognition, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Selecting, Planning, and Rewriting: A Modular Approach for Data-to-Document Generation and Translation, , and , in: WNGT EMNLP, 2019 |
|
Self-attention for Speech Emotion Recognition, , and , in: Proc. Interspeech 2019, 2019 |
[DOI] |
Social Multimedia, Diversity, and Global South Cities: A Double Blind Side, , , and , in: Proc. ACM Workshop on Fairness, Accountability, and Transparency in Multimedia (FAT/MM), Nice, 2019 |
|
Spectral Subspace Analysis for Automatic Assessment of Pathological Speech Intelligibility, , and , in: Proceedings of Interspeech, Graz, Austria, pages 3038--3042, 2019 |
|
Spoken language identification using language bottleneck features, , , , , and , in: Proceedings of TSD, 2019 |
|
Super-gaussianity of Speech Spectral Coefficients as a Potential Biomarker for Dysarthric Speech Detection, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2019 |
Tampered Speaker Inconsistency Detection with Phonetically Aware Audio-visual Features, , , , , , , and , in: International Conference on Machine Learning, 2019 |
|
The Simulation of Mean Radiant Temperature in Outdoor Conditions: A Review of Architectural Tools Calculation Assumptions, , , and , in: Proceedings of Building Simulation 2019: 16th Conference of IBPSA, 2019 |
Unbiased semi-supervised LF-MMI training using dropout, , , and , in: Proceedings of Interspeech 2019, 2019 |
[DOI] |
Uncertainty-aware imitation learning using kernelized movement primitives, , , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019 |
|
Understanding and Visualizing Raw Waveform-based CNNs, , , and , in: Proceedings of Interspeech, 2019 |
|
Understanding the performance gap: a machine learning approach on residential buildings in Turin, Italy, , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Using Speech Production Knowledge for Raw Waveform Modelling based Styrian Dialect Identification, and , in: Proceedings of Interspeech, 2019 |
|
Virtual High-Framerate Microscopy of the Beating Heart via Sorting of Still Images, , , , and , in: 2019 IEEE 16th International Symposium on Biomedical Imaging, pages 312--315, 2019 |
|
Vulnerability assessment and detection of Deepfake videos, and , in: IAPR International Conference on Biometrics, 2019 |
|
Vulnerability of Face Recognition to Deep Morphing, and , in: International Conference on Biometrics for Borders, 2019 |
|
Weakly-Supervised Concept-based Adversarial Learning for Cross-lingual Word Embeddings, , and , in: Proc. 2019 Conference on Empirical Methods in Natural Language Processing, 2019 |
2018
A Differential Approach for Gaze Estimation with Calibration, , , and , in: 29TH BRITISH MACHINE VISION CONFERENCE, 2018 |
|
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: Proc. Interspeech 2018, pages 3147-3151, 2018 |
[DOI] |
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: MLSLP-18 Proceedings, Hyderabad, 2018 |
[URL] |
Ambiance in Social Media Venues: Visual Cue Interpretation by Machines and Crowds, , and , in: IEEE CVPR Workshop on Visual Understanding of Subjective Attributes, 2018 |
|
Analysis of Language Dependent Front-End for Speaker Recognition, , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 1101-1105, 2018 |
[DOI] |
Beyond Weight Tying: Learning Joint Input-Output Embeddings for Neural Machine Translation, , and , in: Proceedings of the Third Conference on Machine Translation (WMT), 2018 |
|
Bimanual Skill Learning with Pose and Joint Space Constraints, , , and , in: Proc. of the IEEE-RAS Intl Conf. on Humanoid Robots (Humanoids), 2018 |
|
Building Blocks of Assistant Based Speech Recognition for Air Traffic Management Applications, , , , , , , and , in: Conference: SESAR Innovation Days 2018, European Union, Eurocontrol, Salzburg, Austria, SESARJU, 2018 |
[URL] |
CNN based Query by Example Spoken Term Detection, , and , in: Proceedings of Interspeech, 2018 |
|
Complexity reduction of eigenvalue decomposition-based diffuse power spectral density estimators using the power method, , and , in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 451-455, 2018 |
|
CONTEXT-AWARE ATTENTION MECHANISM FOR SPEECH EMOTION RECOGNITION, , , and , in: IEEE Workshop on Spoken Language Technology, Athens, Greece, pages 126-131, 2018 |
[URL] |
Deep Multitask Gaze Estimation with a Constrained Landmark-Gaze Model, , and , in: European Conference on Computer Vision Workshop, 2018 |
|
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, AUSTRALIA, pages 74-79, 2018 |
[DOI] |
Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech, , , , , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 292-296, 2018 |
[DOI] |
DNN based speaker embedding using content information for text-dependent speaker verification, , , and , in: Proceedings of 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2018 |
|
Document-Level Neural Machine Translation with Hierarchical Attention Networks, , , and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018 |
|
Embedding Context-Dependent Variations of Prosodic Contours using Variational Encoding for Decomposing the Structure of Speech Prosody, , , , and , in: Workshop on Prosody and Meaning: Information Structure and Beyond, Aix-en-Provence, France, 2018 |
[URL] |
End-to-end text-dependent speaker verification using novel distance measures, , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, Aug 02-Sep 06, 2018, pages 3598-3602, 2018 |
[DOI] |
Enhancing Trust in eAssessment - the TeSLA System Solution, , , , and , in: Technology Enhanced Assessment Conference., 2018 |
|
Experimental evaluation of speech enhancement methods in remote microphone systems for hearing aids, , , , and , in: Proc. EuroNoise 2018, Crete, Greece, pages 351-358, 2018 |
|
Facing Employers and Customers: What Do Gaze and Expressions Tell About Soft Skills?, , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 121-126, ASSOC COMPUTING MACHINERY, 2018 |
[DOI] |
Far-field ASR Using Low-rank and Sparse Soft Targets from Parallel Data, , and , in: IEEE Workshop on Spoken Language Technology, Athens, GREECE, pages 581-587, IEEE, 2018 |
|
Fast cross-correlation based wrist vein recognition algorithm with rotation and translation compensation, , , and , in: Sixth International Workshop on Biometrics and Forensics, 2018 |
|
Fast Language Adaptation Using Phonological Information, , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 2459-2463, 2018 |
[DOI] |
Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models, , , , , , , and , in: 13th Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018 |
|
Geodesic Convolutional Shape Optimization, , , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Geometry-aware Control and Learning in Robotics, and , in: R:SS Pioneers Workshop, 2018 |
Geometry-aware Robot Manipulability Transfer, , and , in: R:SS Workshop on Learning and Inference in Robotics: Integrating Structure, Priors and Models, 2018 |
|
Geometry-aware Tracking of Manipulability Ellipsoids, , , and , in: Robotics: Science and Systems, Pittsburgh, USA, 2018 |
|
Implementing Fusion Techniques for the Classification of Paralinguistic Information, , , and , in: Proceedings of Interspeech 2018, pages 526-530, 2018 |
|
Investigating Depth Domain Adaptation for Efficient Human Pose Estimation, , , and , in: European Conference on Computer Vision - Workshops, 2018 |
|
Iterative alternating least-aquares approach to jointly estimate the RETFs and the diffuse PSD, , and , in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018 |
|
Iterative Learning of Speech Recognition Models for Air Traffic Control, , , , , , and , in: Proceedings of Interspeech 2018, ISCA, Hyderabad, India, pages 3519-3523, 2018 |
[DOI] |
Joining high-level symbolic planning with low-level motion primitives in adaptive HRI: application to dressing assistance, , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2018 |
Joint late reverberation and noise power spectral density estimation in a spatially homogeneous noise field, and , in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 441-445, 2018 |
|
Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, , and , in: Proceedings of Interspeech, pages 312--316, 2018 |
[DOI] |
Knowledge Transfer with Jacobian Matching, and , in: Proceedings of the International Conference on Machine Learning, 2018 |
[URL] |
Kronecker Recurrent Units, , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Leveraging Convolutional Pose Machines for Fast and Accurate Head Pose Estimation, , and , in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, SPAIN, pages 1089-1094, IEEE, 2018 |
|
Low-latency speaker spotting with online diarization and detection, , , , , , , , and , in: The Speaker and Language Recognition Workshop (Odyssey), 2018 |
|
Multilingual bottleneck features for subword modeling in zero-resource languages, and , in: Proc. Interspeech, pages 2668-2672, 2018 |
[DOI] |
NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2018 |
|
Not All Samples Are Created Equal: Deep Learning with Importance Sampling, and , in: Proceedings of International Conference on Machine Learning, 2018 |
|
On Effectiveness of Anomaly Detection Approaches against Unseen Presentation Attacks in Face Anti-Spoofing, , , and , in: The 11th IAPR International Conference on Biometrics (ICB 2018), 2018 |
|
On Learning to Identify Genders from Raw Speech Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 287-291, 2018 |
[DOI] |
On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 1116-1120, 2018 |
|
On the Use of Convolutional Neural Networks for Speech Presentation Attack Detection, , , , and , in: International Conference on Identity, Security and Behavior Analysis, 2018 |
|
Phonological Posterior Hashing for Query by Example Spoken Term Detection, , and , in: Proceedings of Interspeech, 2018 |
|
Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching, , and , in: Proceedings of the international conference on Neural Information Processing Systems, 2018 |
Probabilistic Learning of Torque Controllers from Kinematic and Force Constraints, , , , and , in: Proc. of the IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 6552-6559, 2018 |
|
Pulse-based Features for Face Presentation Attack Detection, and , in: Proceedings of BTAS 2018, special session on Image and Video Forensics in Biometrics, 2018 |
|
Real-time Convolutional Networks for Depth-based Human Pose Estimation, , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018 |
|
Real-Time DCT Learning-based Reconstruction of Neural Signals, , and , in: EUSIPCO, 2018 |
|
Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization, and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 2257-2261, 2018 |
[DOI] |
SCALAR - Simultaneous Calibration of 2D Laser And Robot's Kinematic Parameters Using Three Planar Constraints, , and , in: International Conference on Intelligent Robots, 2018 |
|
Self-Attentive Residual Decoder for Neural Machine Translation, , , and , in: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018 |
|
Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation by Use of Convolutional Neural Networks, and , in: 2018 25th IEEE International Conference on Image Processing (ICIP), pages 3818-3822, IEEE, 2018 |
[DOI] |
Semi-supervised Adaptation of Assistant Based Speech Recognition Models for different Approach Areas, , , , , , , , , , and , in: 37th AIAA/IEEE Digital Avionics Systems Conference, AIAA/IEEE, London, 2018 |
[URL] |
SGAN: An Alternative Training of Generative Adversarial Networks, and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 9407-9415, IEEE, 2018 |
[DOI] |
SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, , , , , , , , , and , in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018 |
[DOI] [URL] |
Single-channel late reverberation power spectral density estimation using denoising autoencoders, and , in: Proc. Annual Conference of the International Speech Communication Association, Hyderabad, India, 2018 |
|
SMILE Swiss German Sign Language Dataset, , , , , , , , , , , and , in: Language Resources and Evaluation Conference, 2018 |
Speaker Inconsistency Detection in Tampered Video, and , in: European Signal Processing Conference, 2018 |
|
Spoofing Deep Face Recognition With Custom Silicone Masks, , and , in: Proceedings of BTAS2018, 2018 |
|
Statistical modeling of speech spectral coefficients in patients with Parkinson's disease, and , in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018 |
|
Stochastic Variance Reduced Gradient Optimization of Generative Adversarial Networks, , , and , in: International Conference on Machine Learning (ICML) workshop on Theoretical Foundations and Applications of Deep Generative Models, 2018 |
Towards directly modeling raw speech signal for speaker verification using CNNs, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, CANADA, pages 4884-4888, 2018 |
|
Towards Quantifying the Entropy of Fingervein Patterns across Different Feature Extractors, and , in: 2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis (ISBA), 2018 |
|
UNICITY: A depth maps database for people detection in security airlocks, , , , , , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance Workshop, 2018 |
|
Vlogging Over Time: Longitudinal Impressions and Behavior in YouTube, , , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, EGYPT, pages 37-46, 2018 |
[DOI] |
WatchNet: Efficient and Depth-based Network for People Detection in Video Surveillance Systems, , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance, Auckland, NEW ZEALAND, pages 109-114, 2018 |
|
WILDTRACK: A Multi-camera HD Dataset for Dense Unscripted Pedestrian Detection, , , , , , , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 5030-5039, 2018 |
[DOI] |
Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes, , and , in: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2018 |
|
2017
#Healthy #Fondue #Dinner: Analysis and Inference of Food and Drink Consumption Patterns on Instagram, and , in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017 |
|
A Competition on Generalized Software-based Face Presentation Attack Detection in Mobile Scenarios, , , , , , , , , and , in: Proceedings of the International Joint Conference on Biometrics, 2017, 2017 |
|
A Context-Aware Speech recognition and Understanding System for Air Traffic Control Domain, , , , , and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, Okinawa, Japan, 2017 |
|
A Domain Adaptation Approach to Improve Speaker Turn Embedding Using Face Representation, and , in: ACM International Conference on Multimodal Interaction, Glasgow, Scotland, ACM, 2017 |
|
A Generative Model for Intention Recognition and Manipulation Assistance in Teleoperation, and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017 |
|
A Sub-Quadratic Exact Medoid Algorithm, and , in: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017 |
Active Online Anomaly Detection using Dirichlet Process Mixture Model and Gaussian Process Classification, , , , and , in: IEEE Winter Conference on Applications of Computer Vision (WACV), Washington, 2017 |
|
An Investigation of Deep Neural Networks for Multilingual Speech Recognition Training and Adaptation, , and , in: Proc. of Interspeech, 2017 |
|
BEAT: An Open-Science Web Platform, , and , in: Thirty-fourth International Conference on Machine Learning, Sydney, Australia, 2017 |
[URL] |
Bob Speaks Kaldi, , , , and , in: Proc. of Interspeech, 2017 |
|
Boosted Exudate Segmentation in Retinal Images using Residual Nets, , , and , in: Proceedings of the MICCAI Workshop on Ophthalmic Medical Image Analysis, 2017 |
Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, , and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
|
Content Normalization for Text-dependent Speaker Verification, , , and , in: Proc. of Interspeech, 2017 |
|
Continuously Reproducing Toolchains in Pattern Recognition and Machine Learning Experiments, , , , , and , in: Thirty-fourth International Conference on Machine Learning, Sidney, Australia, 2017 |
[URL] |
Cross-Eyed 2017: Cross-Spectral Iris/Periocular Recognition Competition., , , , , , , , , , , , , and , in: IEEE/IAPR International Joint Conference on Biometrics, Denver, Colorado, USA, IEEE, 2017 |
Deep Multi-Camera People Detection, and , in: Proceedings of the IEEE International Conference on Machine Learning and Applications, 2017 |
Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection, , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Dynamic Graffiti Stylisation with Stochastic Optimal Control, , and , in: Intl Workshop on movement and computing (MOCO), London, UK, pages 1-8, ACM, 2017 |
[DOI] [URL] |
Elderly People Living Alone: Detecting Home Visits with Ambient and Wearable Sensing, , , and , in: In Proceedings of MMHealth, 2017 |
|
End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, , and , in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017 |
|
Examining Linguistic Content and Skill Impression Structure for Job Interview Analytics in Hospitality, and , in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017 |
|
Exploiting Eigenposteriors for Semi-supervised Training of DNN Acoustic Models with Sequence Discrimination, , and , in: Proceedings of Interspeech, 2017 |
|
EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, pages 5370-5374, 2017 |
|
Exploratory Study on Direct Prediction of Diabetes using Deep Residual Networks, , , and , in: Proceedings of the thematic conference on computational vision and medical image processing, 2017 |
Gaussian Mixture Regression on Symmetric Positive Definite Matrices Manifolds: Application to Wrist Motion Estimation with sEMG, and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 59-64, 2017 |
[URL] |
Generating Calligraphic Trajectories with Model Predictive Control, , and , in: Proc. 43rd Conf. on Graphics Interface, Edmonton, AL, Canada, pages 132-139, 2017 |
[DOI] |
How May I Help You? Behavior and Impressions in Hospitality Service Encounters, , and , in: Proceddings of 19th ACM International Conference on Multimodal Interaction, 2017 |
|
Implementing gender-dependent vowel-level analysis for boosting speech-based depression recognition, , , and , in: Proceedings of Interspeech 2017, 2017 |
|
Improving hand and wrist activity detection using tactile sensors and tensor regression methods on Riemannian manifolds, , and , in: Proc. of the Myoelectric Control Symposium, 2017 |
[URL] |
Improving speaker turn embedding by crossmodal transfer learning from face embedding, and , in: ICCV Workshop on Computer Vision for Audio-Visual Media, 2017 |
|
Insiders and Outsiders: Comparing Urban Impressions between Population Groups, , and , in: International Conference on Multimedia Retrieval, ACM, 2017 |
[DOI] |
INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION, , , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, pages 5365-5369, 2017 |
[DOI] |
K-Medoids For K-Means Seeding, and , in: Proceedings of the international conference on Neural Information Processing Systems, 2017 |
Learning Manipulability Ellipsoids for Task Compatibility in Robot Manipulation, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3183-3189, 2017 |
[URL] |
Learning Task-Space Synergies using Riemannian Geometry, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Vancouver, Canada, pages 73-78, IEEE, 2017 |
[URL] |
Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models, , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Machine Learning of Controller Command Prediction Models from Recorded Radar Data and Controller Speech Utterances, , , , , , and , in: Proceedings of the 7th SESAR Innovation Days (SID), University of Belgrade, Belgrade, Serbia, 2017 |
|
Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
|
Multi-Modal Mean-Fields via Cardinality-Based Clamping, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017 |
Multi-view Representation Learning Via GCCA for Multimodal Analysis of Parkinson's Disease, , , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Multilingual Hierarchical Attention Networks for Document Classification, and , in: Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP), pages 1015-1025, 2017 |
|
Non-Markovian Globally Consistent Multi-Object Tracking, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Non-parametric warping via local scale estimation for non-stationary Gaussian process modelling, , , and , in: Wavelets and Sparsity XVII, pages 1039421, International Society for Optics and Photonics, 2017 |
[DOI] [URL] |
On Job Training: Automated Interpersonal Behavior Assessment & Real-Time Feedback, , in: Proceedings of the 25th ACM International Conference on Multimedia, 2017, 2017 |
|
On the Generalization of Fused Systems in Voice Presentation Attack Detection, , , , and , in: 16th International Conference of the Biometrics Special Interest Group, 2017 |
|
On the Impact of Non-modal Phonation On Phonological Features, , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Planification adaptative d'expériences numériques par paquets en contexte non stationnaire pour une étude de fissuration mécanique, , , , and , in: 23eme Congres Francais de Mecanique, 28 aout - 1er septembre 2017, Lille, France (FR), AFM, 2017 |
[URL] |
Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction, , and , in: Proceedings of the 12th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2017), 2017 |
|
Semi-supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control, , , , , and , in: Proceedings of Interspeech 2017, Stockholm, Sweden, pages 2406-2410, 2017 |
[DOI] |
Sense-Aware Statistical Machine Translation using Adaptive Context-Dependent Clustering, , and , in: Proceedings of Second Conference on Machine Translation (WMT17), 2017 |
|
Shape Representations for Maya Codical Glyphs: Knowledge-driven or Deep?, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition, , , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017 |
Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment, , , and , in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017 |
|
Subspace Regularized Dynamic Time Warping for Spoken Query Detection, , and , in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017 |
|
Supervised Gaze Bias Correction for Gaze Coding in Interactions, and , in: ECEM COGAIN Symposium, pages 3, 2017 |
|
Supervisory teleoperation with online learning and optimal control, and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1534-1540, IEEE, 2017 |
[URL] |
The SUMMA Platform Prototype, and , in: Proceedings of the EACL 2017 Software Demonstrations, Valencia, Spain, pages 116--119, 2017 |
[URL] |
Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, , , , , , , , , and , in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athenes, Greece, pages 32-39, IEEE Computer Society, 2017 |
[DOI] [URL] |
Towards large scale multimedia indexing: A case study on person discovery in broadcast news, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
Towards the Use of Social Interaction Conventions as Prior for Gaze Model Adaptation, , and , in: Proceedings of 19th ACM International Conference on Multimodal Interaction, pages 9, ACM, 2017 |
[DOI] |
Trajectory and Foothold Optimization using Low-Dimensional Models for Rough Terrain Locomotion, , , , , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1096-1103, IEEE, 2017 |
[URL] |
Using Coreference Links to Improve Spanish-to-English Machine Translation, and , in: Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), Valencia, Spain, pages 30-40, Association for Computational Linguistics (ACL), 2017 |
|
Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), and , in: Proceedings of the Third Workshop on Discourse in Machine Translation (DiscoMT), Denmark, Copenhagen, Association for Computational Linguistics (ACL), 2017 |
|
Venues in Social Media: Examining Ambiance Perception Through Scene Semantics, , and , in: Proceedings of the 25th ACM International Conference on Multimedia, ACM, 2017, 2017 |
|
Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction, , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
What you can't see can help you -- extended-range imaging for 3D-mask presentation attack detection, and , in: Proceedings of the 16th International Conference on Biometrics Special Interest Group., Darmstadt (Germany), Gesellschaft fuer Informatik e.V. (GI), 2017 |
|
2016
A Contextual Language Model to Improve Machine Translation of Pronouns by Re-ranking Translation Hypotheses, and , in: European Association for Machine Translation, 2016 |
A MultiPath Network for Object Detection, , , , , , and , in: Proceedings of the British Machine Vision Conference, BMVA Press, 2016 |
[URL] |
A Point-Spread-Function-Aware Filtered Backprojection Algorithm for Focal-Plane-Scanning Optical Projection Tomography, and , in: 2016 IEEE International Symposium on Biomedical Imaging, 2016 |
An agonist-antagonist pitch production model, and , in: Lecture Notes in Artificial Intelligence: 18th International Conference, SPECOM 2016, Budapest, Hungary, pages 84--91, 2016 |
|
Ancient Maya Writings as High-Dimensional Data: a Visualization Approach, , , and , in: Digital Humanities (DH), Krakow, 2016 |
|
Anomaly detection in elderly daily behavior in ambient sensing environments, , , and , in: Proceedings of the 7th Int. Workshop on Human Behavior Understanding, ACM Multimedia, 2016, Amsterdam, Netherlands, 2016 |
|
Assessing a Shape Descriptor for Analysis of Mesoamerican Hieroglyphics: A View Towards Practice in Digital Humanities, , and , in: Digital Humanities Conference (DH), Krakow, 2016 |
|
Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph, , and , in: Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), ACM, New York, NY, ACM Press, 2016 |
Comparing Two Strategies for Query Expansion in a News Monitoring System, and , in: Natural Language Processing and Information Systems: 21st International Conference on Applications of Natural Language to Information Systems, NLDB 2016, pages 267-275, Springer-Verlag, 2016 |
[DOI] |
Cross-database evaluation of audio-based spoofing detection systems, and , in: Interspeech, San Francisco, USA, 2016 |
[URL] |
DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5050-5054, IEEE, 2016 |
|
Deep Neural Networks for Syntactic Parsing of Morphologically Rich Languages, and , in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016 |
|
Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer, , , , , , , , , , , and , in: Lecture Notes in Artificial Intelligence: 18th International Conference, SPECOM 2016, Budapest, Hungary, pages 199--206, 2016 |
|
Dexterous Undersea Interventions with Far Distance Onshore Supervision: the DexROV Project, , , , , , , , , , , , , , , , , , , , , , and , in: IFAC Conference on Control Applications in Marine Systems (CAMS), Trondheim, Norway, pages 414-419, 2016 |
[DOI] [URL] |
Dites-Moi: Wearable Feedback on Conversational Behavior, , , and , in: Proceedings of the 15th International Conference on Mobile and Ubiquitous Multimedia, 2016 |
|
Efficient Posterior Exemplar Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , in: Interspeech, San Francisco, CA, 2016 |
|
Emphasis Recreation for TTS using Intonation Atoms, and , in: 9th ISCA Speech Synthesis Workshop, pages 14--20, 2016 |
[DOI] |
EUMSSI team at the MediaEval Person Discovery Challenge 2016, , and , in: MediaEval Benchmarking Initiative for Multimedia Evaluation, Hilversum, Netherlands, 2016 |
|
Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5690-5694, IEEE, 2016 |
|
Fast K-Means with Accurate Bounds, and , in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016 |
Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction, , , , , , , , and , in: Proceedings of WMT 2016 (First Conference on Machine Translation), Association for Computational Linguistics, Berlin, Germany, pages 525–542, 2016 |
[URL] |
Heterogeneous Face Recognition using Inter-Session Variability Modelling, and , in: IEEE Computer Society Workshop on Biometrics, Las Vegas - USA, IEEE, 2016 |
|
Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences, , , , and , in: Proceedings of the IEEE International Conference of Robotics and Automation, 2016 |
HMM-based Non-native Accent Assessment using Posterior Features, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
|
Human versus Machine Attention in Document Classification: A Dataset with Crowdsourced Annotations, and , in: Proceedings of the EMNLP 2016 Workshop on Natural Language Processing for Social Media, Austin, USA, 2016 |
|
Importance Sampling Tree for Large-scale Empirical Expectation, , and , in: Proceedings of the International Conference on Machine Learning (ICML), New-York, 2016 |
Improving Pronoun Translation by Modeling Coreference Uncertainty, and , in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, 2016 |
|
Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery, and , in: Proceedings of Interspeech, 2016 |
|
INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5580-5584, IEEE, 2016 |
|
InnerView: Learning Place Ambiance from Social Media Images, , and , in: Proceedings of the 24th ACM International Conference on Multimedia, ACM, 2016 |
[DOI] |
Inter-task System Fusion for Speaker Recognition, , , , and , in: Proceeedings of the INTERSPEECH, 2016 |
|
Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages, , , , and , in: Proceedings of the International Workshop on Spoken Language Translation, Seattle, WA, USA, 2016 |
|
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , in: 9th ISCA Speech Synthesis Workshop, 2016 |
|
Joint Operation of Voice Biometrics and Presentation Attack Detection, and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016 |
[URL] |
Large Scale Hard Sample Mining with Monte Carlo Tree Search, and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016 |
|
Learning assistive teleoperation behaviors from demonstration, and , in: Proc. IEEE International Symposium on Safety, Security and Rescue Robotics, pages 258-263, 2016 |
|
Learning dynamic graffiti strokes with a compliant robot, , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3981-3986, 2016 |
[URL] |
Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media, and , in: ACM Multimedia, Amsterdam, ACM, 2016 |
|
Learning to Refine Object Segments, , , and , in: Computer Vision - ECCV 2016, Amsterdam, pages 75-91, Springer, 2016 |
[DOI] [URL] |
Long-Term Time-Sensitive Costs for CRF-Based Tracking by Detection, , and , in: 2nd Workshop on Benchmarking Multi-target Tracking: MOTChallenge 2016, Amsterdam, 2016 |
|
Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, , , and , in: Interspeech, 2016 |
|
Manual and automatic labeling of discourse connectives for machine translation (Keynote paper), , in: TextLink: Structuring Discourse in Multilingual Europe (Handbook of the Second Action Conference), Budapest, Hungary, pages 16-20, 2016 |
[URL] |
Modeling Unvoiced Sounds In Statistical Parametric Speech Synthesis with a Continuous Vocoder, , , and , in: Proc. of EUSIPCO, Budapest, Hungary, 2016 |
|
Multilingual Visual Sentiment Concept Matching, , , , , , and , in: Proceedings of the International Conference on Multimedia Retrieval (ICMR), 2016 |
|
Nested Mini-Batch K-Means, and , in: Proceedings of NIPS, 2016 |
Neural Network-based Word Alignment through Score Aggregation, , and , in: Proceedings of the ACL 1st Conference on Machine Translation, 2016 |
|
Online Inference in Bayesian Non-Parametric Mixture Models under Small Variance Asymptotics, and , in: NIPS workshop on Advances in Approximate Bayesian Inference, Barcelona, Spain, pages 1-5, 2016 |
[URL] |
Online motion synthesis with minimal intervention control and formal safety guarantees, , , and , in: Proc. IEEE Intl Conf. on Systems, Man, and Cybernetics, Budapest, Hungary, 2016 |
|
Overview of BTAS 2016 Speaker Anti-spoofing Competition, , , , , , , , , , , , , , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016 |
[URL] |
PAoS Markers: Trajectory Analysis of Selective Phonological Posteriors for Assessment of Progressive Apraxia of Speech, , and , in: Proceeding on the 7th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), 2016 |
|
PhonVoc: A Phonetic and Phonological Vocoding Toolkit, and , in: Interspeech, San Francisco, USA, 2016 |
|