All conference papers sorted by author
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 |
A
Exploratory Study on Direct Prediction of Diabetes using Deep Residual Networks, , , and , in: Proceedings of the thematic conference on computational vision and medical image processing, 2017 |
Boosted Exudate Segmentation in Retinal Images using Residual Nets, , , and , in: Proceedings of the MICCAI Workshop on Ophthalmic Medical Image Analysis, 2017 |
A BSS-based Approach for Localization of Simultaneous Speakers in Reverberant Conditions, , , and , in: Proceedings of the 19th European Signal Processing Conference (EUSIPCO), 2011 |
|
Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition, , and , in: Proceedings of IEEE TENCON, 2013 |
|
GLoFool: global enhancements and local perturbations to craft adversarial images, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Findings of the IWSLT 2023 evaluation campaign, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the IWSLT conference, 2023 |
Vision-Language Pretraining: Current Trends and the Future, , and , in: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2022 |
[URL] |
Entity Matching Across Small Networks Using Node Attributes, , , , , , , , , and , in: ECAI 2024 - 27th European Conference on Artificial Intelligence, October 19-24, 2024, Santiago de Compostela, Spain - Including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings, 2024 |
|
Unknown-Multiple Speaker clustering using HMM, , , and , in: ICSLP, 2002 |
|
Clustering And Segmenting Speakers And Their Locations In Meetings, , and , in: ICASSP, 2004 |
|
An Online Audio Indexing System, , and , 2004 |
|
Robust HMM-Based Speech/Music Segmentation, , and , in: ICASSP, 2002 |
|
A Robust Speaker Clustering Algorithm, and , in: IEEE Automatic Speech Recognition Understanding Workshop, 2003 |
|
Multimodal Integration for Meeting Group Action Segmentation and Recognition, , , , , and , in: MLMI, 2005 |
|
Finding Audio-Visual Events in Informal Social Gatherings, , , and , in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011 |
|
Joint Pose Estimator and Feature Learning for Object Detection, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2009 |
FlowBoost - Appearance Learning from Sparsely Annotated Video, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2011 |
Learning from demonstrations with partially observable task parameters, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3309 - 3314, IEEE, 2014 |
[DOI] |
Brain-Machine Interfaces through Control of Electroencephalographic Signals and Vibrotactile Feedback, , , , , , , , and , in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007 |
|
Framing the News: From Human Perception to Large Language Model Inferences, and , in: International Conference on Multimedia Retrieval (ICMR '23), June 12--15, 2023, Thessaloniki, Greece, 2023 |
|
Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework, and , in: 2nd ACM International Workshop on Multimedia AI against Disinformation (MAD '23), June 12, 2023, Thessaloniki, Greece, 2023 |
|
How Did Europe’s Press Cover Covid-19 Vaccination News? A Five-Country Analysis, and , in: MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, 2022 |
[DOI] [URL] |
Human Interest or Conflict? Leveraging LLMs for Automated Framing Analysis in TV Shows, , and , in: ACM International Conference on Interactive Media Experiences, 2024 |
|
Fully Automatic Grading of Retinal Vasculitis on Fluorescein Angiography Time-lapse from Real-world Data in Clinical Settings, , , , , , , and , in: 2023 IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS), L'Aquila, Italy, 2023, pages 689-693, 2023 |
[DOI] [URL] |
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , in: Proceedings of the International Conference on Multimodal Interfaces, 2008 |
|
Biologically Motivated Audio-Visual Cue Integration for Object, , , , , , , , and , in: Proceedings of the first Internatinal Conference on Cognitive Systems, 2008 |
|
BEAT: An Open-Science Web Platform, , and , in: Thirty-fourth International Conference on Machine Learning, Sydney, Australia, 2017 |
[URL] |
Bob: a free signal processing and machine learning toolbox for researchers, , , , , and , in: Proceedings of the ACM Multimedia Conference, 2012 |
[URL] |
Continuously Reproducing Toolchains in Pattern Recognition and Machine Learning Experiments, , , , , and , in: Thirty-fourth International Conference on Machine Learning, Sidney, Australia, 2017 |
[URL] |
Counter-Measures to Photo Attacks in Face Recognition: a public database and a baseline, and , in: International Joint Conference on Biometrics 2011, 2011 |
[URL] |
Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers, , , , and , in: Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, Seattle, USA, pages 89–100, 2022 |
|
Segmenting Object Affordances: Reproducibility and Sensitivity to Scale, , , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Affordance segmentation of hand-occluded containers from exocentric images, , , , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
Detection and Recognition of Number Sequences in Spoken Utterances, and , in: 2nd Workshop on Speech in Mobile and Pervasive Environments (SiMPE), 2007 |
|
Posterior-Based Features and Distances in Template Matching for Speech Recognition, and , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007 |
|
Posterior features applied to speech recognition tasks with user-defined vocabulary, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009 |
|
Using RASTA in task independent TANDEM feature extraction, , and , in: Proceedings of ICSLP, 2004, 2004 |
|
An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Using Posterior-Based Features in Template Matching for Speech Recognition, , and , in: International Conference on Spoken Language Processing, 2006 |
|
Using Pitch as Prior Knowledge in Template-Based Speech Recognition, , and , in: Proceedings of ICASSP, 2006, 2006 |
|
Improving Speech Recognition Using a Data-Driven Approach, , and , in: Proceedings of Interspeech, 2005, 2005 |
|
Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
One of a Kind: Inferring Personality Impressions in Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010, Istanbul, Turkey, 2010 |
|
A Multimodal Corpus for Studying Dominance in Small Group Conversations, , and , in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010 |
|
Anomaly detection in elderly daily behavior in ambient sensing environments, , , and , in: Proceedings of the 7th Int. Workshop on Human Behavior Understanding, ACM Multimedia, 2016, Amsterdam, Netherlands, 2016 |
|
Model-based Compressive Sensing for Multi-party Distant Speech Recognition, , and , in: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 2011 |
|
Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Model-based Sparse Component Analysis for Reverberant Speech Localization, , , and , in: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1439 - 1443, IEEE, 2014 |
[DOI] |
On Compressibility of Neural Network phonological Features for Low Bit Rate Speech Coding, , and , in: Proceeding of Interspeech, pages 418-422, ISCA, 2015 |
|
Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment, , , and , in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017 |
|
PAoS Markers: Trajectory Analysis of Selective Phonological Posteriors for Assessment of Progressive Apraxia of Speech, , and , in: Proceeding on the 7th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), 2016 |
|
Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
|
Structured Sparse Acoustic Modeling for Speech Separation, , , and , in: Signal Processing with Adaptive Sparse Structured Representations SPARS, SPARS, 2013 |
|
Efficient Posterior Exemplar Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , in: Interspeech, San Francisco, CA, 2016 |
|
On Application Of Non-Negative Matrix Factorization for Ad Hoc Microphone Array Calibration from Incomplete Noisy Distances, , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2694-2698, IEEE, 2015 |
[DOI] |
Analysis of Phone Posterior Feature Space Exploiting Class Specific Sparsity and MLP-based Similarity Measure, , and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
A Multipath Sparse Beamfroming Method, , , and , in: Signal Processing with Adaptive Sparse Structured Representations SPARS, 2013 |
|
Structured Sparse Coding for Microphone Array Location Calibration, , , and , in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012 |
|
Phonological Posterior Hashing for Query by Example Spoken Term Detection, , and , in: Proceedings of Interspeech, 2018 |
|
Multi-party Speech Recovery Exploiting Structured Sparsity Models, , , and , in: Proceedings of Interspeech, 2011 |
|
Complex Daily Activities, Country-Level Diversity, and Smartphone Sensing: A Study in Denmark, Italy, Mongolia, Paraguay, and UK, , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI), Association for Computing Machinery, 2023 |
[DOI] |
LP-TRAP: Linear predictive temporal patterns, , and , 2004 |
|
Experiences in the Creation of an Electromyography Database to Help Hand Amputated Persons, , , , , , and , in: Proceedings of the 24th European Medical Informatics Conference, 2012 |
|
Building the NinaPro Database: a Resource for the Biorobotics Community, , , , , , , , and , in: Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2012 |
|
Discovering Temporal Patterns in Water Quality Time Series, Focusing on Floods with the LDA method, , , , , , , and , in: European Geosciences Union, 2013 |
|
Effect of nonverbal behavioral patterns on the performance of small groups, and , in: ICMI Workshop on Understanding and Modeling Multiparty Multimodal Interactions, Istanbul, Turkey, 2014 |
|
B
Visual Activity Context For Focus of Attention Estimation in Dynamic Meetings, , and , in: International Conference on Multimedia & Expo, 2009 |
|
Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, and , in: International Conference on Multi-media & Expo, 2008 |
|
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, and , in: Classification of Events, Activities and Relationship Evaluation and Workshop, 2007 |
|
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, and , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006 |
|
A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, and , in: ACM ICMI Workshop on Multimodal Multiparty Meeting Processing (MMMP), 2005 |
|
Evaluation of Multiple Cues Head Pose Tracking Algorithm in Natural Environments, and , in: International Conference on Multimedia & Expo ICME 2005, 2005 |
|
A probabilistic framework for joint head tracking and pose estimation, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition, , , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017 |
Probability Occupancy Maps for Occluded Depth Images, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2829-2837, 2015 |
Towards Improved Replicability of Human Studies in Human-Robot Interaction: Recommendations for Formalized Reporting, , , , , , , , , , and , in: Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, pages 629-633, 2023 |
|
Posterior-based Sparse Representation for Automatic Speech Recognition, , , and , in: Proceeding of Interspeech, 2014 |
|
Image-guided topic modeling for interpretable privacy classification, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Black-box Attacks on Image Activity Prediction and its Natural Language Explanations, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
The BANCA Database and Evaluation Protocol, , , , , , , , , , , and , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
A Compressive Sensing Based Compressed Neural Network for Sound Source Localization, , and , in: Proceedings of International Symposium on Artificial Intelligence and Signal Processing, 2011 |
|
Principled Parallel Mean-Field Inference for Discrete Random Fields, , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016 |
Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection, , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Multi-Modal Mean-Fields via Cardinality-Based Clamping, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017 |
Geodesic Convolutional Shape Optimization, , , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, and , in: NIPS, 2006 |
|
Robust Playfield Segmentation using MAP Adaptation, and , in: Proc. 17th International Conference on Pattern Recognition (ICPR 2004), 2004 |
|
Multi-Modal Audio-Visual Event Recognition for Football Analysis, , and , in: Proc. IEEE Workshop on Neural Networks for Signal Processing (NNSP), 2003 |
|
The DCU-EPFL Enhanced Dependency Parser at the IWPT 2021 Shared Task, , , , and , in: Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies, Online, pages 204-212, Association for Computational Linguistics, 2021 |
|
Unsupervised Interpretable Pattern Discovery in Time Series Using Autoencoders, , , and , in: IAPR Int. Workshops on Structural and Syntactic Pattern Recognition (SSPR), 2016 |
|
Implementation of machine learning techniques for the quasi real-time blind and electric lighting optimization in a controlled experimental facility, , , , , and , in: Journal of Physics: Conference Series, IOP Publishing, 2021 |
[DOI] [URL] |
An Integrated and strategic evaluation of automatic blind controls to achieve energy and occupant's comfort objectives, and , in: Proceedings of the 5th IBPSA-England Conference on Building Simulation and Optimization (Virtual), Loughborough, UK, 2020 |
[URL] |
An exploratory interplay between daylight, general and task lighting for visual comfort and electricity savings in a personal office space, , , , , , and , in: Proceedings of ISES and IEA SHC International Conference on Solar Energy for Buildings and Industry, Kassel, Germany, 2022 |
Machine learning techniques for the daylight and electric lighting performance predictions, , and , in: Proceedings of Building Simulation 2021, 2021 |
Lexical filtrering by means of prosodic information, , and , in: International Congress of Phonetic Sciences, 1995 |
Learning to Abstract with Nonparametric Variational Information Bottleneck, , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing, 2023 |
[URL] |
Dialect Levelling in Finnish: A Universal Speech Attribute Approach, , , , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
Do Backpropagation trained neural networks have normal weight distributions?, and , in: International Conference on Artificial neural Networks, 1993 |
|
Feature Representations for Automatic Meerkat Vocalization Classification, , , and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
Tracking Multiple Objects under Global Appearance Constraints, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2011 |
Multi-Modal Data Fusion for Person Authentication using SVM, , in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
|
Fast Face Detection using MLP and FFT, , and , in: Proc. Second International Conference on Audio and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
|
Audio-Visual Person Verification, , , , and , in: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 1999, Fort Collins, USA, 1999 |
|
Multimodal Authentication using Asynchronous HMMs, , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, , in: Advances in Neural Information Processing Systems, NIPS 15, MIT Press, 2003 |
|
Machine Learning for Multimodal Interaction: First International Workshop, MLMI'2004, Springer-Verlag Heidelberg, 2005 |
Biometric Person Authentication IS A Multiple Classifier Problem, and , in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007 |
|
The Expected Performance Curve: a New Assessment Measure for Person Authentication, and , in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004 |
|
A Statistical Significance Test for Person Authentication, and , in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004 |
|
Learning the Decision Function for Speaker Verification, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2001 |
|
The Expected Performance Curve, , and , in: International Conference on Machine Learning, ICML, Workshop on ROC Analysis in Machine Learning, 2005 |
|
Venues in Social Media: Examining Ambiance Perception Through Scene Semantics, , and , in: Proceedings of the 25th ACM International Conference on Multimedia, ACM, 2017, 2017 |
|
Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, and , in: International Conference on Spoken Language Processing (ICSLP~2004), 2004 |
|
Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
On the Combination of Speech and Speaker Recognition, and , in: European Conference On Speech, Communication and Technology (EUROSPEECH'03), 2003 |
|
User-Customized Password HMM Based Speaker Verification, and , in: Proceedings of the COST275 Workshop on the Advent of Biometrics on the Internet, 2002 |
|
User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, and , in: International Conference on Spoken Language Processing (ICSLP~2002), 2002 |
|
Multi-Camera Tracking and Atypical Motion Detection with Behavioral Maps, , and , in: proceedings of the European Conference on Computer Vision, 2008 |
Principled Detection-by-classification from Multiple Views, , and , in: proceedings of the International Conference on Computer Vision Theory and Applications, 2008 |
Evaluation of Probabilistic Occupancy Map People Detection for Surveillance Systems, , , , and , in: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009 |
Dynamic Graffiti Stylisation with Stochastic Optimal Control, , and , in: Intl Workshop on movement and computing (MOCO), London, UK, pages 1-8, ACM, 2017 |
[DOI] [URL] |
Generating Calligraphic Trajectories with Model Predictive Control, , and , in: Proc. 43rd Conf. on Graphics Interface, Edmonton, AL, Canada, pages 132-139, 2017 |
[DOI] |
Learning dynamic graffiti strokes with a compliant robot, , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3981-3986, 2016 |
[URL] |
Confidence Measures in Hybrid HMM/ANN Speech Recognition, and , in: Proceedings of Workshop on Text, Speech and Dialog (TSD'98) Brno, Czech Republic, 1998 |
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP'98) Sydney, Australia, 1998 |
|
Reconnaissance de la parole dans le bruit après renforcement fondé sur l'harmonicité, and , in: Proceedings of JEP'2000, no IDIAP RR, see RESPITE www, 2000 |
A measure of speech and pitch reliability from voicing, and , in: Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI), Scandinavian AI Society, 1999 |
A front-end using the harmonicity cue for speech enhancement in loud noise, , and , in: Int. Conf. on Spoken Language Processing (ICSLP), 2000 |
Interfacing of CASA and partial recognition based on a multistream technique, , , and , in: ICSLP'98, Sidney, 1998 |
|
A new SNR-feature mapping for robust multistream speech recognition, and , in: Proc. Int. Congress on Phonetic Sciences (ICPhS), 1999 |
Experimental evaluation of text-dependent speaker verification on laboratory and field test databases in the M2VTS project, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
Hierarchical Multi-task learning framework for Isometric-Speech Language Translation, , , and , in: ACL, 2022 |
|
DeepCon: An End-to-End Multilingual Toolkit for Automatic Minuting of Multi-Party Dialogues, , , and , in: Special Interest Group on Discourse and Dialogue (SIGDIAL 2022), 2022 |
|
An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36) , In proceedings of ACL Anthology, 2022 |
|
Multimodal Reranking of Content-based Recommendations for Hyperlinking Video Snippets, , , and , in: ACM International Conference on Multimedia Retrieval, 2014 |
|
Idiap at MediaEval 2013: Search and Hyperlinking Task, , , and , in: MediaEval 2013 Workshop, Barcelona, Spain, CEUR-WS.org, 2013 |
|
Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph, , and , in: Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), ACM, New York, NY, ACM Press, 2016 |
Multi-factor Segmentation for Topic Visualization and Recommendation: the MUST-VIS System, , , , , , , and , in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 365-368, ACM, 2013 |
[DOI] [URL] |
CONTEXTUAL BIASING METHODS FOR IMPROVING RARE WORD DETECTION IN AUTOMATIC SPEECH RECOGNITION, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024 |
|
Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training, , , , , , and , in: Proc. 13th SESAR Innovation Days, Seville, Spain, 2023 |
[DOI] [URL] |
Vascular Biometrics Experiments on Candy -- A New Contactless Finger-Vein Dataset, , , , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR), Calcutta (India), 2024 |
|
What you can't see can help you -- extended-range imaging for 3D-mask presentation attack detection, and , in: Proceedings of the 16th International Conference on Biometrics Special Interest Group., Darmstadt (Germany), Gesellschaft fuer Informatik e.V. (GI), 2017 |
|
Spoofing Deep Face Recognition With Custom Silicone Masks, , and , in: Proceedings of BTAS2018, 2018 |
|
HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing, , , and , in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
Team Innovators at SemEval-2022 for Task 8: Multi-Task Training with Hyperpartisan and Semantic Relation for Multi-Lingual News Article Similarity, , , , and , in: Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), NAACL 2022, 2022 |
|
You Are Known by How You Vlog: Personality Impressions and Nonverbal Behavior in YouTube, , and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Barcelona, 2011 |
|
The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012 |
|
Voices of Vlogging, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010 |
|
Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010 |
|
Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior, and , in: Proceedings of the 17th ACM International Conference on Multimedia, ACM, 2009 |
|
FaceTube: predicting personality from facial expressions of emotion in online conversational video, , and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2012 |
|
Energy assessment of a district by integrating solar thermal in district heating network: a dynamic analysis approach, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, , , , , , , , , , , and , in: 6th european conference on speech communication and technology --- eurospeech'99, 1999 |
Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, and , in: Eurospeech 97, 1997 |
|
An overview of the cave project research activities in speaker verification, , , , , and , in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998 |
Speaker Verification in the Telephone Network : Research Activities in the CAVE Project, , , , , and , in: Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'97), 1997 |
|
Bayesian Recurrent Units and the Forward Backward Algorithm, and , in: Proc. Interspeech 2022, pages 4137-4141, 2022 |
[DOI] |
A Bayesian Interpretation of the Light Gated Recurrent Unit, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
[DOI] |
People-Centric Mobile Sensing with a Pragmatic Twist: from Behavioral Data Points to Active User Involvement, , and , in: International Conference on Human-Computer Interaction with Mobile Devices and Services, 2011 |
|
Cost–effective Variational Active Entity Resolution, , , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2021 |
[URL] |
Voyager: Data Discovery for Onboarding in Data Science, , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2022 |
YOU ARE FIRED! NONVERBAL ROLE ANALYSIS IN COMPETITIVE MEETINGS, , and , in: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP,',','), Taiwan., 2009 |
|
Building energy models with Morphological urban-scale parameters: a case study in Turin, , , , , and , in: Proceedings of 4th Building Simulation Applications Conference - BSA 2019, 2019 |
[URL] |
Understanding the performance gap: a machine learning approach on residential buildings in Turin, Italy, , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
A benchmark for the simulation of meshed district heating networks based on anonymised monitoring data, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Verification of PyDHN - a Python library for the thermo-hydraulic simulation of district heating networks - through the DESTEST, , and , in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, IBPSA, IBPSA, 2023 |
[DOI] [URL] |
Learning Structured Embeddings of Knowledge Bases, , , and , in: Conference on Artificial Intelligence, 2011 |
|
Secured vocal access to telephone servers, , , , and , in: Proceedings of IVTTA 1996 IEEE Third Workshop Interactive Voice Technology for Telecommunications Applications, 1996 |
SARAL: A Low-Resource Cross-Lingual Domain-Focused Information Retrieval System for Effective Rapid Document Triage, , , , , , , , , , , and , in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. 2019, pages 19-24, 2019 |
A Competition on Generalized Software-based Face Presentation Attack Detection in Mobile Scenarios, , , , , , , , , and , in: Proceedings of the International Joint Conference on Biometrics, 2017, 2017 |
|
Implicit Control of Noise Canceller for Speech Enhancement, , and , in: Proceedings of INTERSPEECH 2005, 2005 |
|
Non-Stationary Multi-Channel (Multi-Stream) Processing Towards Robust and Adaptive ASR, , in: Proc. of the ESCA Workshop on Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
Connectionist speech recognition, , in: Proceedings of IK'98, Interdisziplinares Kolleg, Spring Scholl, Gunne am Mohnessee, Germany, March 7--14, 1998 |
New Approaches Towards Robust and Adaptive Speech Recognition, , and , in: Advances in Neural Information Processing Systems 13, MIT Press, 2001 |
|
Subband-Based Speech Recognition, and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project, , , , , , , , and , in: Workshop on Speech, Language and Audio in Multimedia, 2013 |
|
State-of-the-Art and Recent Progress in Hybrid HMM/ANN Speech Recognition, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, and , in: International School on Neural Nets: Adaptive Processing of Temporal Information, Springer Verlag, 1997 |
|
Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, and , in: LangTech 2008, 2008 |
|
Your Day in Your Pocket: Complex Activity Recognition from Smartphone Accelerometers, , and , in: EAI Pervasive Health, 2022 |
|
A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET, , and , in: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Toronto, Ontario, Canada, 2021 |
|
pyannote.audio: neural building blocks for speaker diarization, , , , , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020 |
[URL] |
Handwriting Recognition, , in: Second Asian Conference on Computer Vision (ACCV'95,',','), Singapore, 1995 |
Design and Implementation of a System for the Recognition of Handwritten Responses on US Census Forms, , in: IAPR Workshop on Document Analysis Systems, 1994 |
A system for the off-line recognition of handwritten text, , in: International Conference on Pattern Recognition (ICPR,',','), Jerusalem, 1994 |
Higher-Order Statistics in Visual Object Recognition, , in: Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 1993 |
|
Recognition of Handprinted Digits using Optimal Bounded Error Matching, , in: International Conference on Document Analysis and Retrieval (ICDAR,',','), Tsukuba Science City, Japan, 1993 |
Vein Enhancement with Deep Auto-Encoders to improve Finger Vein Recognition, , and , in: Biometrics Special Interest Group (BIOSIG 2021), 2021 |
|
Augmenting Astronaut's Capabilities through Brain-Machine Interfaces, , , and , in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007 |
|
Trajectory Prediction with Compressed 3D Environment Representation using Tensor Train Decomposition, , , and , in: International Conference on Advanced Robotics, 2021 |
|
Null space redundancy learning for a flexible surgical robot, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 2443 - 2448, IEEE, 2014 |
[DOI] |
Learning adaptive movements from demonstration and self-guided exploration, , and , in: Proc. IEEE Intl Conf. on Development and Learning and on Epigenetic Robotics (ICDL-EpiRob), Genoa, Italy, pages 160-165, 2014 |
|
Towards a standard for dialogue act annotation, , , , , , , , , , , and , in: 7th International Conference on Language Resources and Evaluation, Malta, 2010 |
[URL] |
IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews, , , , , and , in: Proceedings of the 6th Clinical Natural Language Processing Workshop, Association for Computational Linguistics, 2024 |
|
Reliability Estimation of News Media Sources: Birds of a Feather Flock Together, , , and , in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics, 2024 |
|
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, , , and , in: Proceedings of Interspeech, 2023 |
|
Online Classifier Adaptation in High Frequency EEG, , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
C
Stochastic learning and control in multiple coordinate systems, , in: Intl Workshop on Human-Friendly Robotics, Genoa, Italy, pages 1-5, 2016 |
|
Robot Learning with Task-Parameterized Generative Models, , in: Proc. Intl Symp. on Robotics Research, 2015 |
|
Skills Learning in Robots by Interaction with Users and Environment, , in: In Proc. of the Intl Conf. on Ubiquitous Robots and Ambient Intelligence (URAI), Kuala Lumpur, Malaysia, pages 161-162, 2014 |
[URL] |
A task-parameterized probabilistic model with minimal intervention control, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3339 - 3344, IEEE, 2014 |
[DOI] |
The Winning Approach for the Recommendation Systems Shared Task @REST_MEX 2022, , , , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022 |
[URL] |
Voice-B System, , , , and , in: IEEE 4th Workshop on Intercative Voice Technology for Telecommunications Applications (IVTTA'98) September 29--30, Torino, Italy, 1998 |
Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, , and , in: Proceedings of International Conference on Pattern Recognition (ICPR), 2006 |
|
Ambiance in Social Media Venues: Visual Cue Interpretation by Machines and Crowds, , and , in: IEEE CVPR Workshop on Visual Understanding of Subjective Attributes, 2018 |
|
Shape Representations for Maya Codical Glyphs: Knowledge-driven or Deep?, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
Is That a Jaguar? Segmenting Ancient Maya Glyphs via Crowdsourcing, , and , in: Proc. ACM Int. Workshop on Crowdsourcing for Multimedia, Orlando, pages 37-40, ACM New York, 2014 |
[DOI] |
Ancient Maya Writings as High-Dimensional Data: a Visualization Approach, , , and , in: Digital Humanities (DH), Krakow, 2016 |
|
Joining high-level symbolic planning with low-level motion primitives in adaptive HRI: application to dressing assistance, , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2018 |
Large Scale Hard Sample Mining with Monte Carlo Tree Search, and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016 |
|
Efficient Sample Mining for Object Detection, and , in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014 |
|
The MuMMER data set for Robot Perception in multi-party HRI Scenarios, , , and , in: Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, 2020 |
|
Importance Sampling Tree for Large-scale Empirical Expectation, , and , in: Proceedings of the International Conference on Machine Learning (ICML), New-York, 2016 |
Sample Distillation for Object Detection and Image Classification, , and , in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014 |
|
Leveraging Convolutional Pose Machines for Fast and Accurate Head Pose Estimation, , and , in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, SPAIN, pages 1089-1094, IEEE, 2018 |
|
Overview of the ImageCLEF 2014 Domain Adaptation Task, and , in: ImageCLEF 2014: Overview and analysis of the results, 2014 |
|
Face Verification using MLP and SVM, and , in: XI Journees NeuroSciences et Sciences pour l'Ingenieur (NSI 2002), 2002 |
|
Face Verification Using Adapted Generative Models, , and , in: The 6th International Conference on Automatic Face and Gesture Recognition, FG2004, IEEE, 2004 |
|
Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, , and , in: 4th International Conference on AUDIO- and VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 2003 |
|
Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis, , , , , , and , in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Bejing, 2008 |
|
The AMI Meeting Corpus: a Pre-Announcement, , , , , , , , , , , , , , , , and , in: Machine Learning for Multimodal Interaction: Second International Workshop, MLMI'2005, 2005 |
|
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies, and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), Istanbul, TR, pages 6, 2012 |
|
Building 'directional corpora' for unbiased contrastive analysis, and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 29-30, 2011 |
|
How Comparable are Parallel Corpora? Measuring the Distribution of General Vocabulary and Connectives, , , and , in: Proceedings of 4th Workshop on Building and Using Comparable Corpora, ACL, Portland, OR, pages 78--86, 2011 |
|
Learning Disentangled Representations for Natural Language Definitions, , , and , in: In Findings of the European chapter of Association for Computational Linguistics, 2023 |
|
The Workshop on Computational Personality Recognition 2014, , , , , and , in: Proceedings of the ACM International Conference on Multimedia, 2014 |
|
How Do You Like Your Virtual Agent?: Human-Agent Interaction Experience through Nonverbal Features and Personality Traits, , and , in: Human Behavior Understanding, pages 1-15, Springer, 2014 |
|
Sound Pattern Matching for Automatic Prosodic Event Detection, , , , and , in: Interspeech, San Francisco, USA, 2016 |
|
PhonVoc: A Phonetic and Phonological Vocoding Toolkit, and , in: Interspeech, San Francisco, USA, 2016 |
|
An Empirical Model of Emphatic Word Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 573-577, ISCA, 2015 |
|
Robust triphone mapping for acoustic modeling, , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
Bob Speaks Kaldi, , , , and , in: Proc. of Interspeech, 2017 |
|
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, , , and , in: Interspeech, 2014 |
|
On the (Un)importance of the Contextual Factors In HMM-Based Speech Synthesis, , and , in: Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, pages 8140 - 8143, 2013 |
|
Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, , and , in: Proc. of Interspeech 2013, Lyon, France, 2013 |
|
On the Impact of Non-modal Phonation On Phonological Features, , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Phonological Vocoding Using Artificial Neural Networks, , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4844-4848, IEEE, 2015 |
[DOI] |
NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2018 |
|
Intensity-Based Point-Spread-Function-Aware Registration for Multi-View Applications in Optical Microscopy, , and , in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, pages 306-309, IEEE, 2015 |
[DOI] |
Competition on Counter Measures to 2-D Facial Spoofing Attacks, , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011 |
|
A Point-Spread-Function-Aware Filtered Backprojection Algorithm for Focal-Plane-Scanning Optical Projection Tomography, and , in: 2016 IEEE International Symposium on Biomedical Imaging, 2016 |
Estimation of Divergence-Free 3D Cardiac Blood Flow in a Zebrafish Larva Using Multi-View Microscopy, and , in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, IEEE, Brooklyn, NY, USA, pages 385-388, 2015 |
[DOI] |
A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence, , , and , in: Interspeech, Dublin, Ireland, ISCA, 2023 |
|
To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, , and , in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007 |
|
Asynchronous detection and classification of oscillatory brain activity, , and , in: 16 European Signal Processing Conference, 2008 |
|
WILDTRACK: A Multi-camera HD Dataset for Dense Unscripted Pedestrian Detection, , , , , , , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 5030-5039, 2018 |
[DOI] |
SGAN: An Alternative Training of Generative Adversarial Networks, and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 9407-9415, IEEE, 2018 |
[DOI] |
Deep Multi-Camera People Detection, and , in: Proceedings of the IEEE International Conference on Machine Learning and Applications, 2017 |
Reducing Noise in GAN Training with Variance Reduced Extragradient, , , and , in: Proceedings of the international conference on Neural Information Processing Systems, 2019 |
Stochastic Variance Reduced Gradient Optimization of Generative Adversarial Networks, , , and , in: International Conference on Machine Learning (ICML) workshop on Theoretical Foundations and Applications of Deep Generative Models, 2018 |
International Conference on Mobile and Ubiquitous Multimedia, , and , in: Happy and Agreeable? Multi-Label Classification of Impressions in Social Video, Linz, Austria, pages 109-120, ACM, 2015 |
[DOI] [URL] |
Combined Estimation of Location and Body Pose in Surveillance Video, , and , in: AVSS, 2011 |
|
A Joint Estimation of Head and Body Orientation Cues in Surveillance Video, , and , in: IEEE International Workshop on Socially Intelligent Surveillance and Monitoring, 2011 |
|
We are not Contortionists: Coupled Adaptive Learning for Head and Body Orientation Estimation in Surveillance Video, and , in: IEEE International Conference on Computer Vision and Pattern Recognition, 2012 |
|
Text Identification in Complex Background using SVM, , and , in: Proceedings of the Int. Conf. on computer vision and pattern recognition, 2001 |
Multiple Hypotheses Video OCR, and , in: Proceedings of the 4th International Workshop on Document Analysis System, 2000 |
|
Sequential Monte Carlo Video Text Segmentation, and , in: ICIP, 2003 |
|
Text Segmentation and Recognition in Complex Background Based on Markov Random Field, , and , in: Int. Conf. Pattern Recognition 2002, 2002 |
|
Text Enhancement with Asymmetric Filter for Video OCR, , and , in: Proceedings of the 11th International Conference on Image Analysis and Processing, 2001 |
|
Video OCR for Sport Video Annotation and Retrieval, , and , in: Proceedings of the 8th IEEE International Conference on Mechatronics and Machine Vision in Practice, 2001 |
|
Diffusion Transformer for Adaptive Text-to-Speech, and , in: Proc. 12th ISCA Speech Synthesis Workshop (SSW 12), 2023 |
[DOI] |
The Idiap Speech Synthesis System for the Blizzard Challenge 2023, , , and , in: Proc. 18th Blizzard Challenge Workshop, 2023 |
[DOI] |
Head Nod Detection from a Full 3D Model, , and , in: Proceedings of the ICCV 2015, pages 528-536, 2015 |
|
Generative Independent Component Analysis for EEG Classification, and , in: European Symposium on Artificial Neural Networks ESANN, 2005 |
|
Generative Temporal ICA for Classification in Asynchronous BCI Systems, and , in: The 2nd International IEEE EMBS Conference On Neural Engineering, 2005 |
|
HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, and , in: European Symposium on Artificial Neural Networks ESANN, 2004 |
|
Anti-spoofing in action: joint operation with a verification system, , and , in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Workshop on Biometrics, Portland, Oregon, 2013 |
|
On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, , and , in: Proceedings of the 11th International Conference of the Biometrics Special Interes Group, 2012 |
|
The 2nd competition on counter measures to 2D face spoofing attacks, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: International Conference of Biometrics 2013, Madrid, Spain, 2013 |
|
Exploiting observers' judgements for nonverbal group interaction analysis, , and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 6, IEEE, 2011 |
|
Inferring truth from multiple annotators for social interaction analysis, , and , in: Neural Information Processing Systems (NIPS) Workshop on Modeling Human Communication Dynamics (HCD), pages 4, 2011 |
|
Who's Who with Big-Five: Analyzing and Classifying Personality Traits with Smartphones, , and , in: International Symposium on Wearable Computing, pages 8, 2011 |
|
Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Multichannel signal separation for cocktail party speech recognition: a dynamic recurrent network, , , and , in: Int. Conf. on Spoken Language Processing (ICSLP), no IDIAP RR, see RESPITE www, 2000 |
Blind separation of delayed and superimposed acoustic sources : learning algorithms an experimental study, , , , and , in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999 |
Swiss-French Polyphone: a Telephone Speech Database to develop Interactive Voice Servers, , , and , in: Linguistic Databases, 1995 |
Discrimination of the voices of twins and siblings for speaker verification, and , in: 4th European Conference on Speech Communication and Technology, 1995 |
Neural nets approaches to Speaker Verification: comparison with Second Order Statistical Measure, and , in: ICASSP, 1995 |
A study of Intra- and Inter-Speaker Variability in the Voices of Twins for Speaker Verification, and , in: International Congress of Phonetic Sciences, 1995 |
Towards energy hubs: an innovative Geographic Information System based approach for cluster definition, , , and , in: ICREC 2022 Conference Proceedings, 2022 |
Vibrotactile Feedback in the Context of Mu-Rhythm based BCI, , , , , , , , , , , and , in: Proceedings of the 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2007 |
|
Comparison of different feature classifiers for brain computer interfaces, , , , , , , , and , in: Proceedings of the 1st International IEEE EMBS Conference on Neural Engineering, 2003 |
Environnement multi-agents de reconnaissance automatique de la parole en continu, and , in: Actes des 3emes Journees Francophones sur l'Intelligence Artificielle Distribuee et les Systemes Multi-agents, 1995 |
A graphical tool for monitoring Oz objects activity, and , in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995 |
Reliability in a Multi-agent Spoken Language Recognition System, and , in: 4th European Conference on Speech Communication and Technology, 1995 |
ETC\_vérif : un environnement multi-agents de reconnaissance automatique de la parole en continu, and , in: Proceedings of JEP'96: XXIemes Journees d'etude sur la Parole, 1996 |
|
ETC\_vérif, a Prototype of a Cooperative Automatic Speech Recognition System, and , in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995 |
On the use of automatically generated synthetic image datasets for benchmarking face recognition, , and , in: International Joint Conference on Biometrics (IJCB 2021), 2021 |
|
Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection, and , in: International Joint Conference on Biometrics, 2024 |
|
On the detection of morphing attacks generated by GANs, and , in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), 2022 |
|
Approximating Optimal Morphing Attacks using Template Inversion, , and , in: IEEE International Joint Conference on Biometric, 2023 |
[DOI] |
Deep Learning for Efficient Discriminative Parsing, , in: International Conference on Artificial Intelligence and Statistics, 2011 |
|
A Gentle Hessian for Efficient Gradient Descent, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004 |
|
Links Between Perceptrons, MLPs and SVMs, and , in: International Conference on Machine Learning, ICML, 2004 |
|
A Parallel Mixture of SVMs for Very Large Scale Problems, , and , in: Advances in Neural Information Processing Systems, NIPS 14, MIT Press, 2002 |
|
Scaling Large Learning Problems with Hard Parallel Mixtures, , and , in: International Workshop on Pattern Recognition with Support Vector Machines, SVM'2002, 2002 |
|
Torch7: A Matlab-like Environment for Machine Learning, , and , in: BigLearn, NIPS Workshop, 2011 |
|
Validating Different Flexible Vocabulary Approaches on the Swiss French PolyPhone and PolyVar databases, , , and , in: Proceedings of ICSLP 96, 1996 |
Swiss PolyPhone and PolyVar: Building Databases for Speech Recognition and Speaker Verification, and , in: Proceedings of The 3rd Slovenian-German and 2nd SDRV Workshop, Speech and Image Understanding, 1996 |
Low-Level Physiological Implications of End-to-End Learning for Speech Recognition, and , in: Proc. Interspeech 2022, pages 749--753, 2022 |
[DOI] |
Open-Vocabulary Object 6D Pose Estimation, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 |
[URL] |
The REPLAY-MOBILE Face Presentation-Attack Database, , , and , in: Proceedings of the International Conference on Biometrics Special Interests Group, 2016 |
|
Borrowing from yourself: Faster future video segmentation with partial channel update, and , in: International Conference on Pattern Recognition, 2022 |
|
Real-Time Segmentation Networks should be Latency Aware, and , in: Asian Conference on Computer Vision, 2020 |
|
Paumer: Patch Pausing Transformer for Semantic Segmentation, , and , in: 33th British Machine Vision Conference 2022, London, UK, 21 - 24 November 2022, 2022 |
|
Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, , , , , and , in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011 |
Look at who's talking, , , , and , in: Proceedings of International Conference on Ambient Intelligence, pages 68-76, 2011 |
Extended Cauchy Machines, and , in: Proceedings of the International Conference on Neural Information Processing, 1996 |
Ontogenic High Order Cauchy Machines, and , in: Proceedings of the SIPAR Workshop '95: Parallel and Distributed Systems, Biel School of Engineering, 1995 |
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition, , , , , , , , , , and , in: Proceedings of Interspeech, pages 2182-2186, 2020 |
|
Experimental evaluation of speech enhancement methods in remote microphone systems for hearing aids, , , , and , in: Proc. EuroNoise 2018, Crete, Greece, pages 351-358, 2018 |
|
Scalability Analysis of Audio-Visual Person Identity Verification, , , and , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
D
Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines, , , , , , and , in: Under review, 2023 |
[URL] |
Beyond question-based biases: Assessing multimodal shortcut learning in visual question answering, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Whole Body Model Predictive Control with a Memory of Motion:Experiments on a Torque-Controlled Talos, , , , , , , , , , , and , in: IEEE International Conference on Robotics and Automation, 2021 |
|
Iris Liveness Detection Competition (LivDet-Iris) – The 2020 Edition, , , , , , , , , , , , , and , in: INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2020), 2020 |
[URL] |
INCREMENTAL TRANSFER LEARNING IN TWO-PASS INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, , , and , in: Proceedings of ICASSP 2019, pages 6291-6295, 2019 |
Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, , , and , in: Proceedings of Interspeech 2016, pages 2199-2203, 2016 |
D
Can face anti-spoofing countermeasures work in a real world scenario?, , , and , in: International Conference on Biometrics, Madrid, Spain, 2013 |
[URL] |
LBP-TOP based countermeasure against face spoofing attacks, , , and , in: International Workshop on Computer Vision With Local Binary Pattern Variants - ACCV, pages 12, 2012 |
|
Heterogeneous Face Recognition using Inter-Session Variability Modelling, and , in: IEEE Computer Society Workshop on Biometrics, Las Vegas - USA, IEEE, 2016 |
|
Periocular Biometrics in Mobile Environment, and , in: IEEE Seventh International Conference on Biometrics: Theory, Applications and Systems, Arlington, USA, pages 1-7, IEEE, 2015 |
[DOI] |
D
Decision-Oriented Environmental Mapping with Radial Basis Function Neural Networks, , , , and , in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999 |
Indoor Radon Risk Assessment with Geostatistics and Artificial Neural Networks, , , , , , and , in: Geostatistical congress 2000, 2000 |
Neural Network Residual Stochastic Co-simulation for Environmental Data Analysis, , , , and , in: Neural Computation 2000, 2000 |
DNN based speaker embedding using content information for text-dependent speaker verification, , , and , in: Proceedings of 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2018 |
|
DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5050-5054, IEEE, 2016 |
|
End-to-end text-dependent speaker verification using novel distance measures, , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, Aug 02-Sep 06, 2018, pages 3598-3602, 2018 |
[DOI] |
INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5580-5584, IEEE, 2016 |
|
Content Normalization for Text-dependent Speaker Verification, , , and , in: Proc. of Interspeech, 2017 |
|
Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition, , , and , in: Proc. of Interspeech 2019, 2019 |
EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, pages 5370-5374, 2017 |
|
Floor Holder Detection and End of Speaker Turn Prediction in Meetings, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010 |
|
Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models, , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Exploiting Eigenposteriors for Semi-supervised Training of DNN Acoustic Models with Sequence Discrimination, , and , in: Proceedings of Interspeech, 2017 |
|
Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-Based Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015 |
|
Dictionary Learning for Sparse Representation of Neural Network Exemplars in Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015, pages 1093, 2015 |
|
Far-field ASR Using Low-rank and Sparse Soft Targets from Parallel Data, , and , in: IEEE Workshop on Spoken Language Technology, Athens, GREECE, pages 581-587, IEEE, 2018 |
|
Modeling Overlapping Speech using Vector Taylor Series, , and , in: Odyssey: The Speaker and Language Recognition Workshop, Joensuu, Finland, 2014 |
|
Detecting and Labeling Speakers on Overlapping Speech using Vector Taylor Series, , and , in: INTERSPEECH, 2014 |
|
Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5690-5694, IEEE, 2016 |
|
Nearly optimal exploration-exploitation decision thresholds, , in: Int. Conf. on Artificial Neural Networks (ICANN), 2006 |
|
Gradient estimates of return distributions, and , in: PASCAL Workshop on Principled Methods of Trading Exploration and Exploitation, 2005 |
|
Boosting word error rates, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2005 |
Boosting HMMs with an application to speech recognition, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004 |
|
Online Policy Adaptation for Ensemble Classifiers, and , in: 12th European Symposium on Artificial Neural Networks, ESANN 04, 2004 |
Speech recognition with speech synthesis models by marginalising over decision tree leaves, , and , in: Proceedings of Interspeech, Brighton, U.K., 2009 |
|
The segmentation of multi-channel meeting recordings for automatic speech recognition, , and , in: Int. Conf. on Spoken Language Processing (Interspeech ICSLP), 2006 |
|
Measuring the gap between HMM-based ASR and TTS, , and , in: Proceedings of Interspeech, Brighton, U.K., 2009 |
|
COMBINING CEPSTRAL NORMALIZATION AND COCHLEAR IMPLANT-LIKE SPEECH PROCESSING FOR MICROPHONE ARRAY-BASED SPEECH RECOGNITION, , and , in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012 |
|
Neural conditional random fields, and , in: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna, Sardinia, Italy, JMLR: W&CP, 2010 |
|
Smartphone usage in the wild: a large-scale analysis of applications and context, , and , in: 13th International Conference on Multimodal Interaction, 2011 |
|
Contextual Conditional Models for Smartphone-based Human Mobility Prediction, and , in: Proceedings of the 14th ACM International Conference on Ubiquitous Computing, 2012 |
|
Contextual grouping: discovering real-life interaction types from longitudinal Bluetooth data, and , in: 12th International Conference on Mobile Data Management, 2011 |
|
GroupUs: Smartphone Proximity Data and Human Interaction Type Mining, and , in: 15th annual International Symposium on Wearable Computers, San Francisco, USA, 2011 |
|
By their apps you shall understand them: mining large-scale patterns of mobile phone usage, and , in: The 9th International Conference on Mobile and Ubiquitous Multimedia, 2010 |
|
Inferring social activities with mobile sensor networks, , , , and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, , , , and , in: Int Conf Spatial Cognition 2008, 2008 |
|
District heating network modelling for future integration of solar thermal energy, , , and , in: Journal of Physics: Conference Series, pages 012089, IOP Publishing, 2021 |
[DOI] |
Sun Workstation and SwissNet Platform for Speech Recognition and Speaker Verification over the Telephone, , , , and , in: Proceedings of Workstations und ihre Anwendungen, SIWORK'96, 1996 |
|
Improving Children Speech Recognition through Feature Learning from Raw Speech Signal, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Segment-level training of ANNs based on acoustic confidence measures for hybrid HMM/ANN Speech Recognition, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Using Speech Production Knowledge for Raw Waveform Modelling based Styrian Dialect Identification, and , in: Proceedings of Interspeech, 2019 |
|
Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, , , and , in: ACM International Conference on Multimodal Interaction (ICMI Companion), 2022 |
[DOI] |
Learning voice source related information for depression detection, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Accelerated Training of Linear Object Detectors, and , in: CVPR 2013 Workshop on Structured Prediction, 2013 |
[URL] |
Deformable Part Models with Individual Part Scaling, and , in: British Machine Vision Conference, 2013 |
|
Exact Acceleration of Linear Object Detectors, and , in: Proceedings of the European Conference on Computer Vision, 2012 |
|
Tasting Families of Features for Image Classification, and , in: International Conference on Computer Vision, 2011 |
|
Boosting with Maximum Adaptive Sampling, and , in: Proceedings of the Neural Information Processing Systems Conference, 2011 |
Person Authentication by Fusing Face and Speech Information, , , and , in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997 |
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , in: International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 525-530, IEEE, 2011 |
|
Dynamic Partitioned Sampling For Tracking With Discriminative Features, , and , in: Proceedings of the British Maschine Vision Conference, London, 2009 |
|
UNICITY: A depth maps database for people detection in security airlocks, , , , , , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance Workshop, 2018 |
|
Using Multiple Time Scales in a Multi-Stream Speech Recognition System, and , in: EUROSPEECH'97, 1997 |
|
Hybrid HMM/ANN Systems for Training Independent Tasks: Experiments on 'Phonebook' and Related Improvements, , , , and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
Robust Speech Recognition based on Multi-Stream Features, , and , in: Proc. of the ESCA-NATO Workshop on Robust Speech Recognition for Unknown Communication Channels, 1997 |
|
Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database, and , in: Proc. 5th Int. Conf. on Spoken Language Processing, 1998 |
|
E
SMILE Swiss German Sign Language Dataset, , , , , , , , , , , and , in: Language Resources and Evaluation Conference, 2018 |
Audio-Visual Gender Recognition in Uncontrolled Environment Using Variability Modeling Techniques, , and , in: International Joint Conference on Biometrics, Clearwater, Florida, USA, pages 1 - 8, IEEE, 2014 |
[DOI] [URL] |
Scalable Probabilistic Models: Applied to Face Identification in the Wild, and , in: 8th European Biometrics Research and Industry Awards, European Association for Biometrics, Darmstadt, Germany, 2014 |
[URL] |
Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, , and , in: Proceedings of the 11th International Conference of the Biometrics Special Interest Group, Darmstadt, Germany, pages 397-408, GI-Edition, 2012 |
|
Predicting Heart Activity from Speech using Data-driven and Knowledge-based features, , and , in: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2024 |
|
Non-Ontogenic Sparse Neural Networks, , and , in: Proceedings of the International Conference on Neural Networks, IEEE, IEEE, 1995 |
Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, , in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011 |
What to Show? Automatic Stream Selection Among Multiple Sensors, , and , in: International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2014 |
|
Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2011 |
|
Multi-camera Open Space Human Activity Discovery for Anomaly Detection, , and , in: 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011 |
|
Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, and , in: Biometrics: Theory, Applications and Systems, Washington DC, USA, 2013 |
|
Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, and , in: International Conference of the Biometrics Special Interes Group, Darmstadt, Germany, 2013 |
|
Within- and Cross- Database Evaluations for Gender Classification via BeFIT Protocols, , and , in: International Workshop on Multimedia Signal Processing, pages 1-6, 2014 |
[DOI] [URL] |
Normalizing Flows for Speaker and Language Recognition Backend, , , , and , in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024 |
Improving Contextual Quality Models for MT Evaluation Based on Evaluators' Feedback, , and , in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008 |
E
Multi-modal person verification tools using speech and images, , in: European Conference on Multimedia Applications, Services and Techniques, 1996 |
E
An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, , and , in: Proc. of Workshop on Emerging paradigms for robotic manipulation: from the lab to the productive world, ICRA, 2021 |
Reinforcement learning of trajectory distributions: Applications in assisted teleoperation and motion planning, , , , , and , in: IEEE International Conference on Intelligent Robots and Systems, 2019 |
An Efficient Image-to-Image Translation HourGlass-based Architecture for Object Pushing Policy Learning, , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021 |
|
A Multitask and Kernel approach for Learning to Push Objects with a Task-Parameterized Deep Q-Network, , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023 |
|
F
Open-Set Speaker Identification pipeline in live criminal investigations, and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021 |
|
ROXANNE Research Platform: Automate criminal investigations, , , , , and , in: Interspeech Show and Tell 2021, 2021 |
|
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10--13, 2021 |
[DOI] |
BertAA: BERT fine-tuning for Authorship Attribution, , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
Socio-Technical Network Analysis from Wearable Interactions, , and , in: International Symposium on Wearable Computers, 2012 |
|
Extracting Mobile Behavioral Patterns with the Distant N-Gram Topic Model, and , in: Proceedings of the IEEE International Symposium on Wearable Computers, Newcastle, 2012 |
|
Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, and , in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010 |
|
Learning and Predicting Multimodal Daily Life Patterns from Cell Phones, and , in: ICMI-MLMI, 2009 |
|
Daily Routine Classification from Mobile Phone Data, and , in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), 2008 |
|
Discovering Human Routines from Cell Phone Data with Topic Models, and , in: IEEE International Symposium on Wearable Computers (ISWC), 2008 |
|
What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, and , in: ACM International Conference on Multimedia (ACMMM), 2008 |
|
Demystifying the Scribes behind the Voynich Manuscript using Computational Linguistic Techniques, , and , in: Proceedings of the 1st International Conference on the Voynich Manuscript, 2022 |
Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, , in: International IEEE Conference on Multimodal Interfaces (ICMI 02), 2002 |
|
Mutliscale Facial Expression Recognition using Convolutional Neural Networks, , in: Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 02), 2002 |
|
Robust Face Analysis using Convolutional Neural Networks, , in: Proceedings of the International Conference on Pattern Recognition (ICPR 02), 2002 |
|
Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, , in: International IEEE Workshop on Neural Networks for Signal Processing (NNSP 02), 2002 |
|
Recognition of Asymmetric Facial Action Unit Activities and Intensities, and , in: Proceedings of the International Conference on Pattern Recognition (ICPR 2000), 2000 |
|
Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings, , in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), 2009 |
|
Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models, , and , in: ACM International Conference on Multimedia, 2009 |
|
Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, , , and , in: International Conference on Multimodal Interfaces, Chania, Greece, 2008 |
|
Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, , , , and , in: ACM International Conference on Multimedia, Vancouver, Canada, 2008 |
|
Multi-source Posteriors for Speech Activity Detection on Public Talks, and , in: INTERSPEECH, 2014 |
|
MLP-based Factor Analysis for Tandem Speech Recognition, and , in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
Speaker Diarization and Linking of Large Corpora, and , in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012 |
|
Inter-task System Fusion for Speaker Recognition, , , , and , in: Proceeedings of the INTERSPEECH, 2016 |
|
SYSTEM FUSION AND SPEAKER LINKING FOR LONGITUDINAL DIARIZATION OF TV SHOWS, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5495-5499, IEEE, 2016 |
|
Diarizing Large Corpora using Multi-modal Speaker Linking, , , and , in: INTERSPEECH 2014, 2014 |
|
High Frequency Bands and Estimated Local Field Potentials to Improve Single-Trial Classification of Electroencephalographic Signals, , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Simultaneous Real-Time Detection of Motor Imagery and Error-Related Potentials for Improved BCI Accuracy, and , in: Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008 |
|
EEG-Based Brain-Computer Interaction: Improved Accuracy by Automatic Single-Trial Error Detection, and , in: Advances in Neural Information Processing Systems 21, 2007 |
|
You Are Wrong!---Automatic Detection of Interaction Errors from Brain Waves, and , in: Proceedings of the 19th International Joint Conference on Artificial Intelligence, 2005 |
|
A Connectionist System for Two-Dimensional Representation of Multivariate Location Data, and , in: Proceedings of the Fifth International Workshop on Artificial Intelligence for High Energy Physics, AIHENP, Lausanne, Switzerland, Elsevier Science, 1997 |
Stressful First Impressions in Job Interviews, , , , and , in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 325-332, 2016 |
|
The MASH Project, , , and , in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011 |
|
Speaker-Dependent Speech Recognition Based on Phone-Like Unit Model -- Application to Voice Dialing, and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
Scene Recognition with Naive Bayes Non-linear Learning, and , in: Proceedings of the 22nd International Conference on Pattern Recognition, Stockholm, pages 3404 - 3409, IEEE, 2014 |
[DOI] |
Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, and , in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012 |
|
Multiclass Latent Locally Linear Support Vector Machines, , and , in: JMLR W&CP, Volume 29: Asian Conference on Machine Learning, Canberra, Australia, pages 229-244, 2013 |
[URL] |
A Multi Cue Discriminative Approach to Semantic Place Classification, , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
|
Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech, , , , , , , , , and , in: Annual Conference of the International Speech Communication Association, pages 2188-2192, 2022 |
[DOI] |
New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, , , and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP), 2004 |
|
Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, , , , , and , in: ICASSP2000 - IEEE International Conference on Acoustics, Speech, and Signal Processing, 2000 |
|
MULTI-MODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING COMPRESSED-DOMAIN VIDEO FEATURES, , and , in: International Conference on Audio, Speech and Signal Processing, 2009 |
|
Visual Speaker Localization Aided by Acoustic Models, , and , in: ACM Multimedia, 2009 |
Estimating The Degree of Sleepiness by Integrating Articulatory Feature Knowledge In Raw Waveform Based CNNs, , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020 |
|
Automatic Diagnosis of Alzheimer's Disease Using Neural Network Language Models, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, , in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013 |
[DOI] |
EYEDIAP: A Database for the Development and Evaluation of Gaze Estimation Algorithms from RGB and RGB-D Cameras, , and , in: Proceedings of the ACM Symposium on Eye Tracking Research and Applications, Safety Harbor, Florida, United States of America, ACM, 2014 |
[DOI] |
A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, , , and , in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013 |
[DOI] |
Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras, and , in: IEEE Computer Vision and Pattern Recognition Conference, Columbus, Ohio,USA, pages 1773-1780, IEEE, 2014 |
[DOI] |
3D Gaze Tracking and Automatic Gaze Coding from RGB-D Cameras, and , in: IEEE Conference in Computer Vision and Pattern Recognition, Vision Meets Cognition Workshop, Columbus, Ohio, USA, 2014 |
|
Person Independent 3D Gaze Estimation From Remote RGB-D Cameras, and , in: International Conference on Image Processing, Melbourne, Australia, IEEE, 2013 |
[DOI] |
Gaze Estimation From Multimodal Kinect Data, and , in: IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, Providence, RI, USA, 2012 |
[DOI] |
Morphodynamic profiling to explore spatio-temporal signaling networks regulating neurite outgrowth, , , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
G
Modeling Unvoiced Sounds In Statistical Parametric Speech Synthesis with a Continuous Vocoder, , , and , in: Proc. of EUSIPCO, Budapest, Hungary, 2016 |
|
Feature Extraction for Multi-Class BCI using Canonical Variates Analysis, , , , and , in: Proceedings of the IEEE International Symposium on Intelligent Signal Processing, 2007 |
|
An Asynchronous and Non-Invasive Brain-Actuated Wheelchair, , , , , , , and , in: Proceedings of the 13th International Symposium on Robotics Research, 2007 |
|
Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, , , , , , and , in: In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008 |
|
Visuo-Spatial Attention Frame Recognition for Brain-Computer Interfaces, , , , , , and , in: Proceedings of the 1st International Conference on Cognitive Neurodynamics, 2007 |
|
Hill-Climbing Attack to an Eigenface-Based Face Verification System, , , , and , in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009 |
|
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , in: Audio Engineering Society (AES,',','), 127th Convention, Audio Engineering Society (AES), Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA;, 2009 |
[URL] |
Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , in: INTERSPEECH 2008, 2008 |
|
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , in: AES 124th Convention, Audio Engineering Society, 2008 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , in: Interspeech 2008, 2008 |
|
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009 |
[URL] |
DexROV: Dexterous Undersea Inspection and Maintenance in Presence of Communication Latencies, , , , , , , , , , , , , , , and , in: IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV), pages 218-223, 2015 |
Dexterous Undersea Interventions with Far Distance Onshore Supervision: the DexROV Project, , , , , , , , , , , , , , , , , , , , , , and , in: IFAC Conference on Control Applications in Marine Systems (CAMS), Trondheim, Norway, pages 414-419, 2016 |
[DOI] [URL] |
Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, , , and , in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009 |
|
Using Audio and Visual Cues for Speaker Diarisation Initialisation, and , in: International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Audio–Visual Synchronisation for Speaker Diarisation, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010 |
|
Recognition of Anticipatory Behavior from Human EEG, , and , in: In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008 |
|
The use of brain-computer interfacing for ambient intelligence, , , , , and , in: In the book, Constructing Ambient Intelligence: AmI-07 Workshops Proceedings, Max M\:uhlh\:auser, Alois Ferscha, and Erwin Aitenbichler (Eds.,',','), LNCS, Springer Verlag, 2008., 2007 |
|
SNR Features for Automatic Speech Recognition, , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009 |
|
Silence Models in Weighted Finite-State Transducers, , in: Interspeech, 2008 |
|
Translation and Prosody in Swiss Languages, , , , , , , , , , and , in: Nouveaux cahiers de linguistique francaise, 2014 |
|
Tracter: A Lightweight Dataflow Framework, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Real-Time ASR from Meetings, , , , , , , , and , in: Proceedings of Interspeech, Brighton, UK., 2009 |
|
Automatic Speech Recognition and Translation of a Swiss German Dialect: Walliserdeutsch, , and , in: Proceedings of Interspeech, 2014 |
|
Analyzing Group Interactions in Conversations: a Review, , in: IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI), 2006 |
|
Discovering Eating Routines in Context with a Smartphone App, , , and , in: Ubicomp/Iswc'19 Adjunct: Proceedings Of The 2019 Acm International Joint Conference On Pervasive And Ubiquitous Computing And Proceedings Of The 2019 Acm International Symposium On Wearable Computers, London, pages 422-429, 2019 |
[DOI] |
A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, , , and , in: IEEE Int. Conf. on Computer Vision Workshop on Multimedia Technologies for E-Learning and Collaboration (ICCV-WOMTEC), 2003 |
|
Audio-Visual Speaker Tracking with Importance Particle Filters, , , , and , in: IEEE International Conference on Image Processing (ICIP), 2003 |
|
Multimodal Multispeaker Probabilistic Tracking in Meetings, , , and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2005 |
|
On automatic annotation of meeting databases, , , , and , in: IEEE International Conference on Image Processing (ICIP), 2003 |
|
Detecting Group Interest-level in Meetings, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005 |
|
Tracking People in Meetings with Particles, , , , and , in: Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS,',','), invited paper, 2005 |
|
The MAAYA Project: Multimedia Analysis and Access for Documentation and Decipherment of Maya Epigraphy, , , , , , and , in: Proc. Digital Humanities Conference, Lausanne, 2014 |
|
New world, New Worlds: Visual Analysis of Pre-Columbian Pictorial Collections., , , and , in: Proceedings of the International Workshop on Multimedia for Cultural Heritage, Modena, Italy., Springer CCIS series book, 2011 |
|
Vlogging Over Time: Longitudinal Impressions and Behavior in YouTube, , , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, EGYPT, pages 37-46, 2018 |
[DOI] |
Social Multimedia, Diversity, and Global South Cities: A Double Blind Side, , , and , in: Proc. ACM Workshop on Fairness, Accountability, and Transparency in Multimedia (FAT/MM), Nice, 2019 |
|
Linking Objects in Videos by Importance Sampling, and , in: IEEE International Conference on Multimedia and Expo, 2002 |
|
Object Localization in Metric Spaces for Video Linking, and , in: IEEE Workshop on Motion and Video Computing, 2002 |
|
Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, , and , in: IEEE International Conference on Image Processing, 2002 |
|
Assessing Scene Structuring in Consumer Videos, , , , and , in: Int. Conf. on Image and Video Retrieval (CIVR), 2004 |
|
Modeling Interactions from Email Communication, , and , in: Proc. IEEE International Conference on Multimedia & Expo (ICME,',','), 2006, 2006 |
|
Extracting Information from Multimedia Meeting Collections, , and , in: 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005 |
|
Comparison of Two Methods for Unsupervised Person Identification in TV Shows, , , , and , in: 12th International Workshop on Content-Based Multimedia Indexing, 2014 |
|
A Conditional Random field approach for audio-visual people diarization, , , , and , in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 116 - 120, IEEE, 2014 |
[DOI] |
Face identification from overlaid texts using Local Face Recurrent Patterns and CRF models, , , , and , in: IEEE International Conference on Image Processing 2014, Paris, IEEE, 2014 |
|
Combining methods to improve speaker verification decision, , , and , in: Proceedings of The Fourth International Conference on Spoken Language Processing, ICSLP, ICSLP, 1996 |
|
Deliberate Imposture: a challenge for automatic speaker verification systems, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
Speech pre-processing against intentional imposture in speaker recognition, and , in: Proceedings of ICSLP, Sidney, 1998 |
Voice transformation, a tool for imposture of speaker verification, and , in: Proceedings of International Phonetic Science conference IPS98, Washington, 1998 |
Amelioration des performances de verification du locuteur par combinaison de methodes, , , and , in: Journees d'etudes sur la parole, JEP, 1996 |
Polycost Database, , and , 1996 |
Text dependent speaker verification using binary classifiers, , and , in: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing --- ICASSP'98, IEEE, IEEE, 1998 |
|
Efficient Grapevine Structure Estimation in Vineyards Conditions, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, pages 712--720, 2023 |
[URL] |
EFaR 2023: Efficient Face Recognition Competition, , , , and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
Heterogeneous Face Recognition Using Domain Invariant Units, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Modality Agnostic Heterogeneous Face Recognition with Switch Style Modulators, and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, and , in: IJCB, 2023 |
|
The Unconstrained Ear Recognition Challenge 2023: Maximizing Performance and Minimizing Bias, and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
Cross Modal Focal Loss for RGBD Face Anti-Spoofing, and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 |
|
On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, and , in: International Joint Conference on Biometrics (IJCB 2021), 2021 |
|
Deep Pixel-wise Binary Supervision for Face Presentation Attack Detection, and , in: International Conference on Biometrics, 2019 |
|
Embedding Context-Dependent Variations of Prosodic Contours using Variational Encoding for Decomposing the Structure of Speech Prosody, , , , and , in: Workshop on Prosody and Meaning: Information Structure and Beyond, Aix-en-Provence, France, 2018 |
[URL] |
An agonist-antagonist pitch production model, and , in: Lecture Notes in Artificial Intelligence: 18th International Conference, SPECOM 2016, Budapest, Hungary, pages 84--91, 2016 |
|
An Investigation of Muscle Models for Physiologically Based Intonation Modelling, and , in: Proceedings of the 23rd Telecommunications Forum, pages 468--471, 2015 |
[DOI] |
Unified Prosody Model based on Atom Decomposition for Emphasis Detection, , , , , and , in: Proceedings of ETAI, 2016 |
|
Weighted Correlation based Atom Decomposition Intonation Modelling, , , and , in: Proceedings of Interspeech, Dresden, Germany, pages 1601--1605, 2015 |
|
Exploiting Accelerometers to Improve Movement Classification for Prosthetics, and , in: International Conference on Rehabilitation Robotics, 2013 |
|
Object Recognition using Visuo-Affordance Maps, , , and , in: International Conference on Intelligent Robots and Systems, Taipei, pages 1572-1578, IEEE, 2010 |
[DOI] |
Conditional Gaussian Mixture Models for Environmental Risk Mapping, , and , in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002 |
|
Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, , , and , in: Geostatistical congress 2000, 2000 |
Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, , , and , in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999 |
Confidence Evaluation for Risk Prediction, , and , in: 2001 Annual Conference of the IAMG, 2001 |
|
Reactive Anticipatory Robot Skills with Memory, , and , in: The International Symposium on Robotics Research, 2022 |
|
Optimization of robot configurations for motion planning in industrial riveting, , , and , in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021 |
|
Active Improvement of Control Policies with Bayesian Gaussian Mixture Model, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020 |
Investigating Lexical Substitution Scoring for Subtitle Generation, , , , and , in: Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL)., 2006 |
|
Test of several external posterior weighting functions for multiband Full Combination ASR, and , in: Int. Conf. on Spoken Language Processing (ICSLP), 2000 |
A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition, , and , in: Proc.\ European Conf.\ on Speech Communication and Technology (EUROSPEECH), 1999 |
Interfacing of CASA and Multistream recognition, , , and , in: TSD'98-Text, Speech and Dialog International Workshop, BRNO-Czech Republic, 1998 |
|
Reconnaissance multi-bandes de la parole bruitée par couplage entre les niveaux primitifs et d'identification, , , and , in: Journees Etude Parole - Martigny, 1998 |
|
Reconnaissance robuste de la parole par segmentation signal/bruit en sous-bandes, , , and , in: Neurosciences et Sciences de l'Ingenieur'98 - Munster, CNRS, 1998 |
|
The SIWIS database: a multilingual speech database with acted emphasis, , , , , , , , , , , and , in: Proceedings of Interspeech, San Francisco, USA, pages 1532--1535, 2016 |
[DOI] |
On the Generalization of Fused Systems in Voice Presentation Attack Detection, , , , and , in: 16th International Conference of the Biometrics Special Interest Group, 2017 |
|
Steerable Features for Statistical 3D Dendrite Detection, , , , and , in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention, 2009 |
Learning Rotational Features for Filament Detection, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2009 |
Automated Delineation of Dendritic Networks in Noisy Image Stacks, , and , in: proceedings of the European Conference on Computer Vision, 2008 |
Automated Quantification of Morphodynamics for High-Throughput Live Cell Imaging Datasets, , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
Delineating Trees in Noisy 2D Images and 3D Image Stacks, , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2799–2806, 2010 |
Building context-dependent DNN acoustic models using Kullback-Leibler divergence-based state tying, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2015 |
|
Manifold Sparse Beamforming, , and , in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013 |
[DOI] |
A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, , and , in: Advances in Neural Information Processing Systems, NIPS 15, 2005 |
|
Support Vector Machines with a Reject Option, , , and , in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008 |
|
Zurich Like New: Analyzing Open Urban Multimodal Data, , and , in: Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data, 2021 |
|
Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, and , in: International Conference on Speech Communication and Technology (INTERSPEECH), 2007 |
|
A Neural Network to Retrieve Images from Text Queries, and , in: International Conference on Artificial Neural Networks (ICANN), 2006 |
|
Exploiting Hyperlinks to Learn a Retrieval Model, and , in: NIPS Workshop on Learning to Rank, 2005 |
|
Inferring Document Similarity from Hyperlinks, and , in: ACM Conference on Information and Knowledge Management, 2005 |
|
A Discriminative Approach for the Retrieval of Images from Text Queries, , and , in: European Conference on Machine Learning (ECML), 2006 |
|
Learning to Retrieve Images from Text Queries with a Discriminative Model, , and , in: International Workshop on Adaptive Multimedia Retrieval (AMR), 2006 |
|
Effect of Segmentation Method on Video Retrieval Performance, and , in: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo (ICME-05), 2005 |
|
Spoken language identification using language bottleneck features, , , , , and , in: Proceedings of TSD, 2019 |
|
Cross-linguistic annotation of narrativity for English/French verb tense disambiguation, and , in: 9th Edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
Calibration from statistical properties of the visual world, , and , in: European Conf. on Computer Vision, 2008 |
|
A Hierarchical Keyframe User Interface for Browsing Video over the Internet, , , and , in: Proceedings of the 9th International Conference on Human-Computer Interaction (INTERACT-2003), IOS Press, 2003 |
|
Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction, , , , , , , , and , in: Proceedings of WMT 2016 (First Conference on Machine Translation), Association for Computational Linguistics, Berlin, Germany, pages 525–542, 2016 |
[URL] |
Refining Tuberculosis Detection in CXR Imaging: Addressing Bias in Deep Neural Networks via Interpretability, , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
The 2013 Face Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: The 6th IAPR International Conference on Biometrics, 2013 |
|
Face Recognition with Disparity Corrected Gabor Phase Differences, , and , in: Artificial Neural Networks and Machine Learning, Heidelberg, pages 411-418, Springer Berlin, 2012 |
[DOI] |
An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, , and , in: Computer Vision - ECCV 2012. Workshops and Demonstrations, Idiap Research Institute, Heidelberg, pages 547-556, Springer Berlin, 2012 |
[DOI] [URL] |
A Unified Model for Gaze Following and Social Gaze Prediction, , , and , in: The 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024 |
|
A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022 |
|
Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following, , , and , in: Int. Conf. Computer Vision and Pattern Recognition (CVPR), Workshop on Gaze Estimation and Prediction in the Wild, 2024 |
|
Studying Phase Synchrony for Classification of Mental Tasks in Brain Machine Interfaces, , , and , in: Proceedings of the Conference of the International Society for Brain Electromagnetic Topography, 2003 |
H
Query Refinement Using Conversational Context: a Method and an Evaluation Resource, and , in: Proceedings of NLDB 2015 (20th International Conference on Applications of Natural Language to Information Systems), Passau, Germany, pages 89-102, Springer-Verlag Berlin, 2015 |
[DOI] |
Enforcing Topic Diversity in a Document Recommender for Conversations, and , in: Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics), Dublin, Ireland, pages 746-759, IEEE, 2014 |
|
Diverse Keyword Extraction from Conversations, and , in: Proceedings of the ACL 2013 (51th Annual Meeting of the Association for Computational Linguistics ), Short Papers, Sofia, Bulgaria, pages 651-657, ACL, 2013 |
|
Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, and , in: RecSys, Recommendation Utility Evaluation (RUE 2012), Dublin, Ireland, pages 15-20, 2012 |
|
Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, and , in: EUROSPEECH, 2001 |
|
Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, and , in: ICSLP, 2000 |
|
Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, , and , in: ICASSP, 2001 |
|
Etudes comparatives des robustesses au bruit de l'approche 'Full Combination' et de son approximation, and , in: Journee d'Etudes sur la Parole, Aussois, 2000 |
|
Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, and , in: ICSLP, 2000 |
|
From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, , and , in: ISCA ITRW ASR2000, 2000 |
|
Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR, , and , in: Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
|
A System for Human-Robot Teaming through End-User Programming and Shared Autonomy, , , , , and , in: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pages 231-239, 2024 |
[DOI] [URL] |
The AMIDA 2009 Meeting Transcription System, , , , , , , , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Are ACT's scores increasing with better translation quality?, , in: Are ACT's scores increasing with better translation quality?, pages 6, 2013 |
|
Assessing the Accuracy of Discourse Connective Translations: Validation of an Automatic Metric, and , in: 14th International Conference on Intelligent Text Processing and Computational Linguistics, University of the Aegean, Samos, Greece, pages 236-247, Springer, 2013 |
[DOI] |
Translating English Discourse Connectives into Arabic: a Corpus-based Analysis and an Evaluation Metric, and , in: Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), 2012 |
|
An Objective Evaluation Framework for Pathological Speech Synthesis, , , , , and , in: Proceedings of ITG Conference on Speech Communication, 2021 |
|
ChatGPT and biometrics: an assessment of face recognition, gender detection, and age estimation capabilities, , , , and , in: 2024 IEEE International Conference on Image Processing (ICIP), 2024 |
|
Supervisory teleoperation with online learning and optimal control, and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1534-1540, IEEE, 2017 |
[URL] |
Learning assistive teleoperation behaviors from demonstration, and , in: Proc. IEEE International Symposium on Safety, Security and Rescue Robotics, pages 258-263, 2016 |
|
Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding, and , in: Proc. INTERSPEECH 2023, pages 1109-1113, 2023 |
[DOI] |
The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation, and , in: Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 4408-4423, Association for Computational Linguistics, 2023 |
[DOI] |
Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks, , , , , and , in: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, pages 7499-7503, 2020 |
[DOI] |
Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction, , and , in: Proceedings of Interspeech 2021, 2021 |
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019 |
[DOI] |
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, AUSTRALIA, pages 74-79, 2018 |
[DOI] |
Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, , and , in: Proceedings of Interspeech, pages 312--316, 2018 |
[DOI] |
Detection-Based Multi-Human Tracking Using a CRF Model, , and , in: The Eleventh IEEE International Workshop on Visual Surveillance, 2011 |
|
Parameter Estimation and Contextual Adaptation for a Multi-Object Tracking CRF Model, and , in: IEEE Workshop on Performance Evaluation of Tracking and Surveillance, 2013 |
|
Improving Head and Body Pose Estimation through Semi-supervised Manifold Alignment, , , , and , in: International Conference on Image Processing, 2014 |
|
Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers’ Workload, , , , , , , , , , , and , in: Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023), Eurocontrol (Europe), FAA (U.S.), Savannah, Georgia, USA, 2023 |
[URL] |
Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety, , , , , , , , , , , , , and , in: Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021), The United States Federal Aviation Administration (FAA), EUROCONTROL, pages 10, 2021 |
[URL] |
Readback Error Detection by Automatic Speech Recognition and Understanding -- Results of HAAWAII Project for Isavia’s Enroute Airspace, , , , , , , , , and , in: 11th SESAR Innovation Days, SESAR, pages 9, 2022 |
|
Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates, , , , , , , , and , in: 11th SESAR Innovation Days, 2021 |
|
The Unstoppable Rise of Computational Linguistics in Deep Learning, , in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Online, pages 6294-6306, Association for Computational Linguistics, 2020 |
[DOI] [URL] |
A VAE for Transformers with Nonparametric Variational Information Bottleneck, and , in: The Eleventh International Conference on Learning Representations, 2023 |
[URL] |
Transformers as Graph-to-Graph Models, , , and , in: Big Picture Workshop at EMNLP 2023, 2023 |
Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems, , , and , in: EUROSPEECH'97, 1997 |
|
Sparse Probabilistic Classifiers, and , in: International Conference on Machine Learning (ICML), 2007 |
|
Multilingual bottleneck features for subword modeling in zero-resource languages, and , in: Proc. Interspeech, pages 2668-2672, 2018 |
[DOI] |
Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, and , in: Proceedings of Interspeech, pages 156-160, 2023 |
[DOI] [URL] |
Handling acoustic variation in dysarthric speech recognition systems through model combination, and , in: Proceedings of Interspeech, 2021 |
|
Dysarthric Speech Recognition with Lattice-Free MMI, and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020 |
[DOI] [URL] |
TRAP-TANDEM: Data-driven extraction of temporal features from speech, , in: large part published in Proceedings of ASRU-2003, 2003 |
|
Multi-resolution RASTA filtering for TANDEM-based ASR, and , in: Proceedings of Interspeech 2005, 2005 |
|
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), , and , in: Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005, 2005 |
|
Pulse-based Features for Face Presentation Attack Detection, and , in: Proceedings of BTAS 2018, special session on Image and Video Forensics in Biometrics, 2018 |
|
Bayesian Networks to Combine Intensity and Color Information in Face Recognition, and , in: International Conference on Biometrics, Springer, 2009 |
|
Face Authentication with Salient Local Features and Static Bayesian Network, and , in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007 |
|
Local Binary Patterns as an Image Preprocessing for Face Authentication, , and , in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006 |
|
Towards utterance-based neural network adaptation in acoustic modeling, , , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015 |
|
Learning Feature Mapping using Deep Neural Network Bottleneck Features for Distant Large Vocabulary Speech Recognition, , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4540-4544, 2015 |
[DOI] |
Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition, , , , and , in: Proceedings of Interspeech, pages 741-745, 2015 |
|
Blind acoustic source separation for cocktail party speech recognition, , , and , in: ICONIP, 7th IEEE Int. Conf. on Neural Information Processing, 2000 |
Emphasis Recreation for TTS using Intonation Atoms, and , in: 9th ISCA Speech Synthesis Workshop, pages 14--20, 2016 |
[DOI] |
Importance of Prosody in Swiss French Accent for Speech Synthesis, and , in: Nouveaux cahiers de linguistique francaise, 2014 |
|
Atom Decomposition-based Intonation Modelling, , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4744--4748, IEEE, 2015 |
[DOI] |
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, , , and , in: Speech Prosody, 2014 |
|
SYLLABLE LEVEL FEATURES FOR PARKINSON'S DISEASE DETECTION FROM SPEECH, and , in: ICASSP, 2024 |
Neurocomputational model of speech recognition for pathological speech detection: a case study on Parkinson’s disease speech detection, and , in: Interspeech, Kos Island, Greece, 2024 |
|
Assessing a Shape Descriptor for Analysis of Mesoamerican Hieroglyphics: A View Towards Practice in Digital Humanities, , and , in: Digital Humanities Conference (DH), Krakow, 2016 |
|
Automatic Maya Hieroglyph Retrieval Using Shape and Context Information, , , , and , in: ACM MM, pages 4, 2014 |
[URL] |
Elderly People Living Alone: Detecting Home Visits with Ambient and Wearable Sensing, , , and , in: In Proceedings of MMHealth, 2017 |
|
The Wolf Corpus: Exploring group behaviour in a competitive role-playing game, and , in: ACM Multimedia, 2010 |
|
Towards Audio-Visual On-line Diarization Of Participants In Group Meetings, and , in: European Conference on Computer Vision Workshop on Multi-camera and Multi-modal Sensor Fusion, 2008 |
|
Identifying Dominant People in Meetings from Audio-Visual Sensors, and , in: International Conference on Automatic Face and Gesture Recognition, Amsterdam, The Netherlands, 2008 |
|
ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007 |
Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, , , and , in: First IEEE Workshop on CVPR for Human Communicative Behavior Analysis, 2008 |
|
Investigating Automatic Dominance Estimation in Groups From Visual Attention and Speaking Activity, , , , and , in: International Conference on Multi-modal Interfaces, 2008 |
|
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, , , , , , , , and , in: "", 2007 |
|
SelecMix: Debiased Learning by Contradicting-pair Sampling, , , , , , and , in: Advances in Neural Information Processing Systems, 2022 |
SelecMix: Debiased Learning by Mixing up Contradicting Pairs, , , , , , and , in: ICML Workshop on Spurious Correlations, Invariance and Stability, 2022 |
Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 1191-1195, ISCA, 2015 |
|
I
Face Liveness Detection Competition (LivDet-Face) - 2024, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
Nonlinear Spectral Transformations for Robust Speech Recognition, , and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop 2003, 2003 |
|
Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, , , and , in: Proceedings of the INTERSPEECH-ICSLP-04, 2004 |
|
Phase AutoCorrelation (PAC) derived Robust Speech Features, , and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, , , and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , in: Proceedings of the INTERSPEECH-ICSLP-04, 2004 |
|
Speaker Normalization using HMM2, , and , in: Proceedings of the 2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP-02), 2002 |
|
Speaker adaptive Kullback-Leibler divergence based hidden Markov models, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
MediaParl: Bilingual mixed language accented speech database, , , , , and , in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263--268, 2012 |
|
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , in: Proceedings of Interspeech, Florence, Italy, pages 537-540, 2011 |
|
Using KL-divergence and multilingual information to improve ASR for under-resourced languages, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012 |
|
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, , and , in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012 |
|
Towards mixed language speech recognition systems, , and , in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010 |
|
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prag, CZ, pages 5012-5015, 2011 |
|
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010 |
|
Robust Speaker Diarization for Short Speech Recordings, and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, pages 432-437, 2009 |
|
Hierarchical Multilayer Perceptron based Language Identification, , and , in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010 |
|
Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition, , , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013 |
|
Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322 - 2326, IEEE, 2014 |
[DOI] |
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011 |
|
A two-step approach to leverage contextual data: speech recognition in air-traffic communications, , , , and , in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 |
|
Implementing contextual biasing in GPU decoder for online ASR, , , , , , and , in: Proc. Interspeech 2023, 2023 |
|
Enhancing Trust in eAssessment - the TeSLA System Solution, , , , and , in: Technology Enhanced Assessment Conference., 2018 |
|
J
Improving Generalization of Deepfake Detection by Training for Attribution, , and , in: International Workshop on Multimedia Signal Processing, 2021 |
|
Experimental investigation on STFT phase Representations for deep learning-based dysarthric speech detection, and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
Adversarial-free speaker identity-invariant representation learning for automatic dysarthric speech classification, and , in: Annual Conference of the International Speech Communication Association, 2022 |
|
Supervised Speech Representation Learning for Parkinson's Disease Classification, and , in: ITG Conference on Speech Communication, 2021 |
|
AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, Toronto, Canada, pages 7328–7332, 2021 |
|
SYNTHETIC SPEECH REFERENCES FOR AUTOMATIC PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT, , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020 |
|
PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT BASED ON THE SHORT-TIME OBJECTIVE INTELLIGIBILITY MEASURE, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, UK, pages 6405--6409, 2019 |
|
Spectral Subspace Analysis for Automatic Assessment of Pathological Speech Intelligibility, , and , in: Proceedings of Interspeech, Graz, Austria, pages 3038--3042, 2019 |
|
VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, , , and , in: Proc. IEEE International Conference on Robotics and Automation (ICRA), 2023 |
|
Multi-Spectral Widefield Microscopy of the Beating Heart through Post-Acquisition Synchronization and Unmixing, , , and , in: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), Venice, Italy, pages 1382-1385, 2019 |
[DOI] |
Generalized temporal sampling with active illumination in optical microscopy, and , in: Proceeding of the SPIE Conference Optics and Photonics, Wavelets and Sparsity XVIII, SPIE, San Diego, California, United States, SPIE, 2019 |
|
Geometry-aware Control and Learning in Robotics, and , in: R:SS Pioneers Workshop, 2018 |
Gaussian Mixture Regression on Symmetric Positive Definite Matrices Manifolds: Application to Wrist Motion Estimation with sEMG, and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 59-64, 2017 |
[URL] |
Improving hand and wrist activity detection using tactile sensors and tensor regression methods on Riemannian manifolds, , and , in: Proc. of the Myoelectric Control Symposium, 2017 |
[URL] |
Learning from demonstration with model-based Gaussian process, , and , in: Conference on Robot Learning, 2019 |
|
Geometry-aware Tracking of Manipulability Ellipsoids, , , and , in: Robotics: Science and Systems, Pittsburgh, USA, 2018 |
|
Analysis and Transfer of Human Movement Manipulability in Industry-like Activities, , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020 |
|
Geometry-aware Robot Manipulability Transfer, , and , in: R:SS Workshop on Learning and Inference in Robotics: Integrating Structure, Priors and Models, 2018 |
|
Bayesian Optimization Meets Riemannian Manifolds in Robot Learning, , , and , in: Conference on Robot Learning, 2019 |
|
Predicting Two Facets of Social Verticality in Meetings from Five-Minute Time Slices and Nonverbal Cues, , , and , in: Proceedings - ICMI 2008, 2008 |
|
Characterising Conversationsal Group Dynamics Using Nonverbal Behaviour, , and , in: Proceedings ICME 2009, 2009 |
|
Discovering Group Nonverbal Conversational Patterns with Topics, and , in: Proceedings ICMI-MLMI, 2009 |
|
Predicting the Dominant Clique in Meetings through Fusion of Nonverbal Cues, , , and , in: ACM MM 2008, 2008 |
|
Recognizing conversational context in group interaction using privacy-sensitive mobile sensors, , , and , in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010 |
|
Given that, Should I Respond? Contextual Addressee Estimation in Multi-Party Human-Robot Interactions, and , in: Proceedings of Human Robot Interaction (HRI) Conference, 2013 |
|
Linking Speaking and Looking Behavior Patterns with Group Composition, Perception, and Performance, , , , and , in: Proceedings of the International Conference on Multimodal Interaction (ICMI), Santa Monica, USA, 2012 |
|
The vernissage corpus: a conversational human-robot-interaction dataset, , , , , , , , , and , in: Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction, 2013 |
|
Comparing Stability and Discriminatory Power of Hand-crafted Versus Deep Radiomics: A 3D-Printed Anthropomorphic Phantom Study, , , , , , , , , , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), pages 17408-17419, 2023 |
[DOI] |
DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6039-6048, 2021 |
[URL] |
GeoNeRF: Generalizing NeRF with Geometry Priors, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2022 |
[URL] |
Kronecker Recurrent Units, , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Scalable Metric Learning via Weighted Approximate Rank Component Analysis, and , in: ECCV 2016, 2016 |
|
Finding groups of people in Google news, and , in: ACM Int. Conf. on Human-Centered Multimedia (HCM), 2006 |
|
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology, , , , , and , in: Proceedings of the ACM International Conference on Multimedia, pages 159--168, 2015 |
|
Integrating Acoustic and Labial Information for Speaker Identification and Verification, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1997 |
Acoustic-Labial Speaker Verification, , , and , in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997 |
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation, , , , , , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, 2023 |
[URL] |
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , in: Interspeech 2021, 2021 |
[URL] |
Automatic Speech Recognition Benchmark for Air-Traffic Communications, , , , and , in: Proc. Interspeech 2020, pages 2297-2301, 2020 |
[DOI] |
How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, , , , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice, , , and , in: Proc. Interspeech 2023, 2023 |
[URL] |
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, , , , , , , , , , , , , , , and , in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020 |
[DOI] [URL] |
SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data, , , , , and , in: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Association for Computational Linguistics, 2023 |
[DOI] [URL] |
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports, , , , , and , in: Proceedings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023 |
Understanding the effects of language-specific class imbalance in multilingual fine-tuning, and , in: Findings of the European chapter of Association for Computational Linguistics, 2024, 2024 |
|
Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, , and , in: Proc. of the sixth International Conference on Automatic Face and Gesture Recognition, 2004 |
|
Hand Posture Classification and Recognition using the Modified Census Transform, , and , in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006 |
|
K
From Undercomplete to Sparse Overcomplete Autoencoders to Improve LF-MMI Speech Recognition, and , in: Proceedings of Interspeech Conference, 2022 |
|
On Learning to Identify Genders from Raw Speech Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 287-291, 2018 |
[DOI] |
Modeling dominance effects on nonverbal behaviors using granger causality, , , , , and , in: Proceedings of International Conference on Multimodal Interaction, ICMI 2012, Santa Monica, CA, 2012 |
|
Understanding the Social Context of Eating with Multimodal Smartphone Sensing: The Role of Country Diversity, , and , in: 25th ACM International Conference on Multimodal Interaction, 2023 |
[DOI] [URL] |
Reading Companion: The Technical and Social Design of an Automated Reading Tutor, , , , , and , in: Workshop on Child, Computer and Interaction, Portland, Oregon, U.S.A., 2012 |
|
Processing Megapixel Images with Deep Attention-Sampling Models, and , in: Proceedings of International Conference on Machine Learning, 2019 |
[URL] |
Not All Samples Are Created Equal: Deep Learning with Importance Sampling, and , in: Proceedings of International Conference on Machine Learning, 2018 |
|
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention, , , and , in: Proceedings of International Conference on Machine Learning, 2020 |
Haptic Feedback Compared with Visual Feedback for BCI, , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
A Neural Network for Text Representation, and , in: International Conference on Artificial Neural Networks, ICANN, 2005 |
|
Theme Topic Mixture Model: A Graphical Model for Document Representation, and , in: Pascal Workshop on Text Mining and Understanding, 2004 |
|
Benchmarking Non-Parametric Statistical Tests, , and , in: Advances in Neural Information Processing Systems, NIPS 18. MIT Press, 2005 |
|
Towards introducing long-term statistics in MUSE for robust speech recognition, and , in: Automatic Speech Recognition and Understanding (ASRU) workshop, 1999 |
|
A comparison of two strategies for ASR in additive noise : Missing Data and Spectral Subtraction, and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
Discriminative Keyword Spotting, , and , in: Workshop on Non-Linear Speech Processing, Paris, France, 2007 |
|
Discriminative Kernel-Based Phoneme Sequence Recognition, , , , and , in: The 9th International Conference on Spoken Language Processing (INTERSPEECH), Pittsburgh, PA, 2006 |
|
Hierarchical Integration of Phonetic and Lexical Knowledge in Phone Posterior Estimation, and , in: ICASSP'08, 2008 |
|
In-Context Phone Posteriors as Complementary Features for Tandem ASR, and , in: ICSLP'08, 2008 |
|
Hierarchical Multi-Stream Posterior Based Speech Recognition System, , and , in: Proceedings MLMI workshop, 2005 |
|
Posterior Based Keyword Spotting with A Priori Thresholds, , , and , in: International Conference on Spoken Language Processing (ICSLP), 2006 |
|
Using more informative posterior probabilities for speech recognition, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006 |
|
Developing and Enhancing Posterior Based Speech Recognition Systems, , , and , in: Proceedings of Interspeech, 2005 |
|
BLESS: Benchmarking Large Language Models on Sentence Simplification, , , , , , and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023 |
|
Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, , and , in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013 |
|
Kullback-Leibler Proximal Variational Inference, , , and , in: Proceedings of the international conference on Neural Information Processing Systems, pages 3402-3410, 2015 |
|
Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, , , , and , in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
IDIAP_TIET@LT-EDI-ACL2022 : Hope Speech Detection in Social Media using Contextualized BERT with Attention Mechanism, , and , in: ACL, 2022 |
|
Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, , , , , , , , , and , in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athenes, Greece, pages 32-39, IEEE Computer Society, 2017 |
[DOI] [URL] |
SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, , , , , , , , , and , in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018 |
[DOI] [URL] |
INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION, , , , , and , in: Proceedings of ICASSP 2020, 2020 |
|
Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, , and , in: Proceedings of Interspeech, 2021 |
[DOI] |
Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, Colombia, Cartagena, IEEE, 2021 |
|
An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021 |
|
Hierarchical speaker clustering methods for the NIST i-vector Challenge, , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
SPEAR: An open source toolbox for speaker recognition based on Bob, , and , in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 1655 - 1659, 2014 |
[DOI] [URL] |
The Idiap Speaker Recognition Evaluation System at NIST SRE 2012, , and , in: NIST Speaker Recognition Conference, NIST, Orlando, USA, 2012 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , in: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, Dallas, Texas, USA, pages 97-104, ACM, 2013 |