All conference papers sorted by title
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 |
#
#Drink Or #Drunk: Multimodal Signals and Drinking Practices on Instagram, , and , in: Proceedings of the 13th EAI International Conference on Pervasive Computing Technologies for Healthcare, Trento, Italy, 2019 |
|
#Healthy #Fondue #Dinner: Analysis and Inference of Food and Drink Consumption Patterns on Instagram, and , in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017 |
|
2
2D Multi-Person Tracking: A Comparative Study in AMI Meetings, , , , , and , in: Classification of Events, Activities, and Relationships (CLEAR) 2006, 2006 |
|
3
3D Gaze Tracking and Automatic Gaze Coding from RGB-D Cameras, and , in: IEEE Conference in Computer Vision and Pattern Recognition, Vision Meets Cognition Workshop, Columbus, Ohio, USA, 2014 |
|
3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, , in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013 |
[DOI] |
A
A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION, , and , in: In Proceedings of ICASSP 2019, Brighton, ENGLAND, pages 5786-5790, 2019 |
|
A Bayesian Interpretation of the Light Gated Recurrent Unit, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
[DOI] |
A benchmark for the simulation of meshed district heating networks based on anonymised monitoring data, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
A Bimodal Sound Source Model for Vehicle Tracking in Traffic Monitoring, , , and , in: European Signal Processing Conference, 2011 |
|
A Boolean Approach to Construct Neural Networks for Non-Boolean Problems, and , in: Proceedings of the 8th IEEE International Conference on Tools with Artificial Intelligence, IEEE, 1996 |
A BSS-based Approach for Localization of Simultaneous Speakers in Reverberant Conditions, , , and , in: Proceedings of the 19th European Signal Processing Conference (EUSIPCO), 2011 |
|
A CASA front-end using the localisation cue for segregation and then cocktail-party speech recognition, , , and , in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999 |
A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition, , and , in: Proc.\ European Conf.\ on Speech Communication and Technology (EUROSPEECH), 1999 |
A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, and , in: International Conference on Multi-Media & Expo (ICME07), 2007 |
|
A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, , , , and , in: 3rd ACM/IEEE Conf on Human-Robot Interaction (HRI08), 2008 |
|
A Comparative Study of Adaptation Methods for Speaker Verification, and , in: International Conference on Spoken Language Processing ICSLP, 2002 |
|
A Comparative Study of MLP Front-ends for Mandarin ASR, , , , and , in: Proceedings of Interspeech, Japan, 2010 |
|
A Comparative Study Of Simulation Tools To Model The Solar Irradiation On Building Façades, , , , , , , , , , , , , , , and , in: Proceedings of SWC 2021: ISES Solar World Congress, ISES, 2021 |
[DOI] [URL] |
A comparison of a priori threshold setting procedures for speaker verification in the CAVE project, , , , , , and , in: ICASSP 98, 1998 |
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition, , , , , , , , , , and , in: Proceedings of Interspeech, pages 2182-2186, 2020 |
|
A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET, , and , in: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Toronto, Ontario, Canada, 2021 |
|
A comparison of mixture models for density estimation, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999 |
|
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010 |
|
A comparison of two strategies for ASR in additive noise : Missing Data and Spectral Subtraction, and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
A Competition on Generalized Software-based Face Presentation Attack Detection in Mobile Scenarios, , , , , , , , , and , in: Proceedings of the International Joint Conference on Biometrics, 2017, 2017 |
|
A Compressive Sensing Based Compressed Neural Network for Sound Source Localization, , and , in: Proceedings of International Symposium on Artificial Intelligence and Signal Processing, 2011 |
|
A Conditional Random field approach for audio-visual people diarization, , , , and , in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 116 - 120, IEEE, 2014 |
[DOI] |
A Connectionist System for Two-Dimensional Representation of Multivariate Location Data, and , in: Proceedings of the Fifth International Workshop on Artificial Intelligence for High Energy Physics, AIHENP, Lausanne, Switzerland, Elsevier Science, 1997 |
A Context-Aware Speech recognition and Understanding System for Air Traffic Control Domain, , , , , and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, Okinawa, Japan, 2017 |
|
A Contextual Language Model to Improve Machine Translation of Pronouns by Re-ranking Translation Hypotheses, and , in: European Association for Machine Translation, 2016 |
A Corpus and Evaluation for Predicting Semi-Structured Human Annotations, , , , and , in: Workshop on Generation, Evaluation and Metrics (GEM), 2022 |
|
A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation, , , and , in: Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?", Hamburg, Germany, pages 5, 2011 |
|
A Deep Learning Approach for Robust Head Pose Independent Eye Movements Recognition from Videos, , and , in: 2019 ACM Symposium on Eye Tracking Research and Applications, pages 5, ACM, 2019 |
[DOI] |
A Deeper Look at Dataset Bias, , , and , in: German Conference on Pattern Recognition, Aachen, Germany, pages 504–516, Springer International Publishing, 2015 |
[DOI] |
A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference, , and , in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024 |
A Differential Approach for Gaze Estimation with Calibration, , , and , in: 29TH BRITISH MACHINE VISION CONFERENCE, 2018 |
|
A Discriminative Approach for the Retrieval of Images from Text Queries, , and , in: European Conference on Machine Learning (ECML), 2006 |
|
A Discriminative Approach to Robust Visual Place Recognition, , , and , in: IEEE International Conference on Intelligent RObot Systems (IROS), 2006 |
|
A Distance Model for Rhythms, , , and , in: 25th International Conference on Machine Learning (ICML), 2008 |
|
A Domain Adaptation Approach to Improve Speaker Turn Embedding Using Face Representation, and , in: ACM International Conference on Multimodal Interaction, Glasgow, Scotland, ACM, 2017 |
|
A front-end using the harmonicity cue for speech enhancement in loud noise, , and , in: Int. Conf. on Spoken Language Processing (ICSLP), 2000 |
A Generative Model for Intention Recognition and Manipulation Assistance in Teleoperation, and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017 |
|
A Generative Model for Rhythms, , , and , in: NIPS Workshop on Brain, Music and Cognition, 2007 |
|
A Gentle Hessian for Efficient Gradient Descent, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004 |
|
A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, , , and , in: Proceedings of the 22nd International Conference on Machine Learning, 2005 |
|
A graphical tool for monitoring Oz objects activity, and , in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995 |
A Hierarchical Keyframe User Interface for Browsing Video over the Internet, , , and , in: Proceedings of the 9th International Conference on Human-Computer Interaction (INTERACT-2003), IOS Press, 2003 |
|
A Joint Estimation of Head and Body Orientation Cues in Surveillance Video, , and , in: IEEE International Workshop on Socially Intelligent Surveillance and Monitoring, 2011 |
|
A Just-in-Time Document Retrieval System for Dialogues or Monologues, , , and , in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, Portland, OR, pages 350-352, 2011 |
|
A Large-Scale Database of Images and Captions for Automatic Face Naming, , , and , in: Proceedings of the 22nd British Machine Vision Conference, 2011 |
|
A Laser-based Dual-arm System for Precise Control of Collaborative Robots, , and , in: IEEE International Conference on Robotics and Automation, 2021 |
|
A Learning-Based Framework for Quantized Compressed Sensing, , and , in: A Learning-Based Framework for Quantized Compressed Sensing, 2019 |
|
A Machine Learning Model for the Prediction of Building Hourly Heating Demand from CityGML Files: Training Workflow and Deployment as an API, , and , in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, pages 2932 - 2939, 2023 |
[DOI] [URL] |
A machine-learning model for the prediction of aggregated building heating demand from pan-European land-use maps, , and , in: Journal of Physics: Conference Series, 2021 |
[DOI] |
A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, , and , in: ICSLP, 2000 |
|
A Max Kernel For Text-Independent Speaker Verification Systems, and , in: Second Workshop on Multimodal User Authentication, MMUA, 2006 |
|
A measure of speech and pitch reliability from voicing, and , in: Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI), Scandinavian AI Society, 1999 |
A Meeting Browser Evaluation Test, , , and , in: CHI '92: Proceedings of the SIGCHI conference on Human factors in computing systems, Portland, OR, USA, ACM Press, 2005 |
|
A memory of motion for visual predictive control tasks, , and , in: International Conference on Robotics and Automation, 2020 |
|
A Method for All-Positive Optical Multilayer Perceptrons, , and , in: Proceedings of the Third IEEE International Conference on Electronics, Circuits, and Systems, University of Patras, Rhodos, Greece, IEEE, 1996 |
|
A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, , , and , in: IEEE Int. Conf. on Computer Vision Workshop on Multimedia Technologies for E-Learning and Collaboration (ICCV-WOMTEC), 2003 |
|
A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022 |
|
A morphological based PV generation and energy consumption predictive model for Singapore neighbourhood, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
A Multi Cue Discriminative Approach to Semantic Place Classification, , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
|
A Multi-sample Multi-source Model for Biometric Authentication, , and , in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002 |
|
A Multimedia Retrieval System Using Speech Input, , , , , , , , , , and , in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009 |
|
A Multimodal Corpus for Studying Dominance in Small Group Conversations, , and , in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010 |
|
A MultiPath Network for Object Detection, , , , , , and , in: Proceedings of the British Machine Vision Conference, BMVA Press, 2016 |
[URL] |
A Multipath Sparse Beamfroming Method, , , and , in: Signal Processing with Adaptive Sparse Structured Representations SPARS, 2013 |
|
A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, , and , in: 13th International Workshop on Acoustic Signal Enhancement, pages 233-236, 2012 |
|
A Multitask and Kernel approach for Learning to Push Objects with a Task-Parameterized Deep Q-Network, , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023 |
|
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: Proc. Interspeech 2018, pages 3147-3151, 2018 |
[DOI] |
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: MLSLP-18 Proceedings, Hyderabad, 2018 |
[URL] |
A neural network for classification with incomplete data: application to robust ASR, , , , and , in: Proc. ICSLP, 2000 |
|
A Neural Network for Text Representation, and , in: International Conference on Artificial Neural Networks, ICANN, 2005 |
|
A Neural Network to Retrieve Images from Text Queries, and , in: International Conference on Artificial Neural Networks (ICANN), 2006 |
|
A new SNR-feature mapping for robust multistream speech recognition, and , in: Proc. Int. Congress on Phonetic Sciences (ICPhS), 1999 |
A Novel and Responsible Dataset for Face Presentation Attack Detection on Mobile Devices, , , , , and , in: The IEEE International Joint Conference on Biometrics, Buffalo, New York, pages 8, 2024 |
|
A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
A Parallel Mixture of SVMs for Very Large Scale Problems, , and , in: Advances in Neural Information Processing Systems, NIPS 14, MIT Press, 2002 |
|
A Phonology-based Approach for Isolated Sign Production Assessment in Sign Language, , , and , in: Companion Publication of the 2020 International Conference on Multimodal Interaction (ICMI '20 Companion), 2020 |
|
A Point-Spread-Function-Aware Filtered Backprojection Algorithm for Focal-Plane-Scanning Optical Projection Tomography, and , in: 2016 IEEE International Symposium on Biomedical Imaging, 2016 |
A probabilistic framework for joint head tracking and pose estimation, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
A Probabilistic Framework for Multiple Speaker Localization, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013 |
|
A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, , and , in: Advances in Neural Information Processing Systems, NIPS 15, 2005 |
|
A Probabilistic Model for Chord Progressions, , and , in: Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR), 2005 |
|
A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, and , in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010 ), Carnegie Mellon University, Pittsburgh, PA, USA, 2010 |
|
A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, and , in: ACM ICMI Workshop on Multimodal Multiparty Meeting Processing (MMMP), 2005 |
|
A Robust Speaker Clustering Algorithm, and , in: IEEE Automatic Speech Recognition Understanding Workshop, 2003 |
|
A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, and , in: Proceedings of the 2004 SAPA Workshop, 2004 |
|
A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, and , in: Proceedings of ICASSP 2005, 2005 |
|
A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, , , and , in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013 |
[DOI] |
A Skill Transfer Approach for Continuum Robots - Imitation of Octopus Reaching Motion with the STIFF-FLOP Robot, , , and , in: In Proc. of the AAAI Symp. on Knowledge, Skill, and Behavior Transfer in Autonomous Robots, Arlington, VA, USA, pages 49-52, 2014 |
[URL] |
A smart luminaire in an office environment: impact on light distribution, user interactions and comfort, , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, , and , in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010 |
|
A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR, , and , in: Proceedings of INTERSPEECH 2005, 2005 |
|
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), Portland, OR, pages 80-86, 2011 |
[URL] |
A State-of-the-art Neural Network for Robust Face Verification, , and , in: Proceedings of the COST275 Workshop on The Advent of Biometrics on the Internet, 2002 |
|
A Statistical Significance Test for Person Authentication, and , in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004 |
|
A study of Intra- and Inter-Speaker Variability in the Voices of Twins for Speaker Verification, and , in: International Congress of Phonetic Sciences, 1995 |
A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence, , , and , in: Interspeech, Dublin, Ireland, ISCA, 2023 |
|
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, and , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006 |
|
A Sub-Quadratic Exact Medoid Algorithm, and , in: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017 |
A supervised learning approach based on STDP and polychronization in spiking neuron networks, , and , in: European Symposium on Artificial Neural Networks, ESANN, 2007 |
|
A Symbolic Framework for Systematic Evaluation of Mathematical Reasoning with Transformers, , , and , in: Under review, 2023 |
[URL] |
A Symmetric Transformation for LDA-based Face Verification, , in: Proceedings of the 6th International Conference on Automatic Face and Gesture Recognition, IEEE Computer Society Press, 2004 |
|
A System for Human-Robot Teaming through End-User Programming and Shared Autonomy, , , , , and , in: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pages 231-239, 2024 |
[DOI] [URL] |
A system for the off-line recognition of handwritten text, , in: International Conference on Pattern Recognition (ICPR,',','), Jerusalem, 1994 |
A task-parameterized probabilistic model with minimal intervention control, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3339 - 3344, IEEE, 2014 |
[DOI] |
A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, , , and , in: 20th European Signal Processing Conference, 2012 |
|
A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems, , , and , in: International Conference on Developmental Learning, 2009 |
|
A tree-based distance between distributions: application to classification of neurons, and , in: ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
A two-step approach to leverage contextual data: speech recognition in air-traffic communications, , , , and , in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 |
|
A Unified Model for Gaze Following and Social Gaze Prediction, , , and , in: The 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024 |
|
A VAE for Transformers with Nonparametric Variational Information Bottleneck, and , in: The Eleventh International Conference on Learning Representations, 2023 |
[URL] |
Abstract Text Summarization: A Low Resource Challenge, and , in: In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), HongKong, China, pages 5, Association for Computational Linguistics (ACL), 2019 |
|
Accelerated Training of Linear Object Detectors, and , in: CVPR 2013 Workshop on Structured Prediction, 2013 |
[URL] |
ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, , , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013 |
[DOI] |
Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
|
Acoustic-Labial Speaker Verification, , , and , in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997 |
Active Improvement of Control Policies with Bayesian Gaussian Mixture Model, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020 |
Active Learning by Feature Mixing, , , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022 |
Active Online Anomaly Detection using Dirichlet Process Mixture Model and Gaussian Process Classification, , , , and , in: IEEE Winter Conference on Applications of Computer Vision (WACV), Washington, 2017 |
|
Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, , , and , in: Proceeding of 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014 |
Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, , , and , in: Proceedings of the 4th Joint Workshop on Hands-free speech communication and Microphone Arrays, Villers-les-Nancy, pages 1 - 5, IEEE, 2014 |
[DOI] |
Adaptation of Assistant Based Speech Recognition to New Domains and Its Acceptance by Air Traffic Controllers, , , , , , , , , and , in: Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems, San Diego, California, USA, pages 820 - 826, 2019 |
[DOI] |
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019 |
[DOI] |
Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, and , in: Journee d'Etudes sur la Parole, Aussois, 2000 |
|
Adapting the 2-Class Recursive Deterministic Perceptron Neural Network to m Classes, , , and , in: Proceedings of the International Conference on Neural Networks, IEEE, IEEE, 1997 |
Adaptive Beamforming with a Maximum Negentropy Criterion, , , , and , in: Proceedings of the Joint Workshop on Hands-free Speech Communication and Microphone Arrays, Italy, 2008 |
|
Adaptive Brain Interfaces for Communication and Control, , in: Proceedings of the 10th International Conference on Human-Computer Interaction, 2003 |
|
Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, , and , in: ICASSP, 2001 |
|
Adaptive Shared Control of a Brain-Actuated Simulated Wheelchair, , , , , , , and , in: Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, 2007 |
|
Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , in: Proceedings of Interspeech, 2010 |
Adversarial-free speaker identity-invariant representation learning for automatic dysarthric speech classification, and , in: Annual Conference of the International Speech Communication Association, 2022 |
|
Affordance segmentation of hand-occluded containers from exocentric images, , , , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2007 |
|
Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task, , and , in: Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), Hong Kong, pages 27-33, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, , and , in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013 |
|
Alone or With Others? Understanding Eating Episodes of College Students with Mobile Sensing, , and , in: 19th International Conference on Mobile and Ubiquitous Multimedia, ACM, Essen, Germany, pages 162–166, Association for Computing Machinery, 2020 |
[DOI] [URL] |
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019 |
[URL] |
Ambiance in Social Media Venues: Visual Cue Interpretation by Machines and Crowds, , and , in: IEEE CVPR Workshop on Visual Understanding of Subjective Attributes, 2018 |
|
Amelioration des performances de verification du locuteur par combinaison de methodes, , , and , in: Journees d'etudes sur la parole, JEP, 1996 |
An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010 |
|
An Agent-Based Focused Crawling Framework for Topic- and Genre-Related Web Document Discovery, , and , in: 24th IEEE International Conference on Tools with Artificial Intelligence, Athens, Greece, IEEE, 2012 |
[URL] |
An agonist-antagonist pitch production model, and , in: Lecture Notes in Artificial Intelligence: 18th International Conference, SPECOM 2016, Budapest, Hungary, pages 84--91, 2016 |
|
An Alternative Scanning Strategy to Detect Faces, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
An anomaly detection approach for backdoored neural networks: face recognition as a case study, and , in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), Darmstadt, Germany, 2022 |
|
An Asynchronous and Non-Invasive Brain-Actuated Wheelchair, , , , , , , and , in: Proceedings of the 13th International Symposium on Robotics Research, 2007 |
|
An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, , in: Advances in Neural Information Processing Systems, NIPS 15, MIT Press, 2003 |
|
An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, , and , in: Proc. of Workshop on Emerging paradigms for robotic manipulation: from the lab to the productive world, ICRA, 2021 |
An Audio Visual Corpus for Emergent Leader Analysis, , and , in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011 |
An Efficient Image-to-Image Translation HourGlass-based Architecture for Object Pushing Policy Learning, , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021 |
|
An Empirical Comparison of Semantic Similarity Methods for Analyzing down-streaming Automatic Minuting task, , , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In ACL Anthology Proceedings, 2022 |
|
An Empirical Model of Emphatic Word Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 573-577, ISCA, 2015 |
|
An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36) , In proceedings of ACL Anthology, 2022 |
|
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, , , , and , in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pages 7040-7044, IEEE, 2019 |
[DOI] [URL] |
An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021 |
|
An exploratory interplay between daylight, general and task lighting for visual comfort and electricity savings in a personal office space, , , , , , and , in: Proceedings of ISES and IEA SHC International Conference on Solar Energy for Buildings and Industry, Kassel, Germany, 2022 |
An HMM Approach with Inherent Model Selection for Sign Language and Gesture Recognition, , and , in: Proceedings of the International Conference on Language Resources and Evaluation LREC 2020, 2020 |
|
An HMM-Based Formalism for Automatic Subword Unit Derivation and Pronunciation Generation, and , in: International Conference on Acoustics, Speech and Signal Processing, pages 4639-4643, IEEE, 2015 |
[DOI] |
An Implicit Motion Likelihood for Tracking with Particle Filters, , and , in: British Machine Vision Conference (BMVC), Springer Verlag, 2003 |
|
An Integrated and strategic evaluation of automatic blind controls to achieve energy and occupant's comfort objectives, and , in: Proceedings of the 5th IBPSA-England Conference on Building Simulation and Optimization (Virtual), Loughborough, UK, 2020 |
[URL] |
An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, , , , and , in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011 |
|
An Investigation of Deep Neural Networks for Multilingual Speech Recognition Training and Adaptation, , and , in: Proc. of Interspeech, 2017 |
|
AN INVESTIGATION OF MULTILINGUAL ASR USING END-TO-END LF-MMI, , and , in: International Conference on Acoustics, Speech and Signal Processing, 2019 |
|
An Investigation of Muscle Models for Physiologically Based Intonation Modelling, and , in: Proceedings of the 23rd Telecommunications Forum, pages 468--471, 2015 |
[DOI] |
An Investigation of Spectral Subband Centroids for Speaker Authentication, , and , in: Int'l Conf. on Biometric Authentication, 2004 |
|
An Objective Evaluation Framework for Pathological Speech Synthesis, , , , , and , in: Proceedings of ITG Conference on Speech Communication, 2021 |
|
An Online Audio Indexing System, , and , 2004 |
|
An online framework for learning novel concepts over multiple cues, , and , in: Proceeding of The 9th Asian Conference on Computer Vision, Xi'an, China, 2009 |
|
An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, , and , in: Computer Vision - ECCV 2012. Workshops and Demonstrations, Idiap Research Institute, Heidelberg, pages 547-556, Springer Berlin, 2012 |
[DOI] [URL] |
An Open-source State-of-the-art Toolbox for Broadcast News Diarization, , , , , and , in: INTERSPEECH, 2013 |
|
An Optical Thresholding Perceptron, , , , and , in: Proceedings of the Workshop on Optics and Computer Science, Geneva, Switzerland, 1997 |
|
An overview of the cave project research activities in speaker verification, , , , , and , in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998 |
An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, , , , , , , , , , , and , in: 6th european conference on speech communication and technology --- eurospeech'99, 1999 |
An SVM Confidence-Based Approach to Medical Image Annotation, , and , in: Workshop of the Cross-Language Evaluation Forum, 2008 |
|
Analysis and Comparison of Recent MLP Features for LVCSR Systems, , and , in: Proceedings of Interspeech 2011, 2011 |
|
Analysis and Transfer of Human Movement Manipulability in Industry-like Activities, , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020 |
|
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , in: Proceedings of Interspeech, ISCA, Dresden, pages 11-15, ISCA, 2015 |
|
Analysis of Language Dependent Front-End for Speaker Recognition, , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 1101-1105, 2018 |
[DOI] |
Analysis of Phone Posterior Feature Space Exploiting Class Specific Sparsity and MLP-based Similarity Measure, , and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Analyzing Flickr Groups, and , in: Proc. of the Intl. Conf. on Image and Video Retrieval, ACM, 2008 |
Analyzing Group Interactions in Conversations: a Review, , in: IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI), 2006 |
|
Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, , , , and , in: Int Conf Spatial Cognition 2008, 2008 |
|
ANALYZING UNCERTAINTIES IN SPEECH RECOGNITION USING DROPOUT, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
Ancient Maya Writings as High-Dimensional Data: a Visualization Approach, , , and , in: Digital Humanities (DH), Krakow, 2016 |
|
Annotation and Recognition of Personality Traits in Spoken Conversations from the AMI Meetings Corpus, , and , in: Proceedings of Interspeech 2012, 2012 |
|
Annotators' agreement and spontaneous emotion classification performance, and , in: Proceedings of Interspeech, pages 1546-1550, 2015 |
|
Anomaly detection in elderly daily behavior in ambient sensing environments, , , and , in: Proceedings of the 7th Int. Workshop on Human Behavior Understanding, ACM Multimedia, 2016, Amsterdam, Netherlands, 2016 |
|
Anti-spoofing in action: joint operation with a verification system, , and , in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Workshop on Biometrics, Portland, Oregon, 2013 |
|
Application of Out-Of-Language Detection To Spoken-Term Detection, and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009 |
[URL] |
Approximating Optimal Morphing Attacks using Template Inversion, , and , in: IEEE International Joint Conference on Biometric, 2023 |
[DOI] |
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, , , , and , in: Proceedings of the 2021 International Conference on Multimodal Interaction, ACM, 2021 |
[DOI] |
Are ACT's scores increasing with better translation quality?, , in: Are ACT's scores increasing with better translation quality?, pages 6, 2013 |
|
Are GAN-based Morphs Threatening Face Recognition?, , , and , in: International Conference on Acoustics, Speech and Signal Processing, 2022 |
|
Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, , and , in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, ISCA 2009, 2009 |
|
Artificial neural network features for speaker diarization, , and , in: IEEE Spoken Language Technology workshop, South Lake Tahoe, USA, 2014 |
|
Assessing a Shape Descriptor for Analysis of Mesoamerican Hieroglyphics: A View Towards Practice in Digital Humanities, , and , in: Digital Humanities Conference (DH), Krakow, 2016 |
|
Assessing Scene Structuring in Consumer Videos, , , , and , in: Int. Conf. on Image and Video Retrieval (CIVR), 2004 |
|
Assessing the Accuracy of Discourse Connective Translations: Validation of an Automatic Metric, and , in: 14th International Conference on Intelligent Text Processing and Computational Linguistics, University of the Aegean, Samos, Greece, pages 236-247, Springer, 2013 |
[DOI] |
Assessing the Impact of Language Style on Emergent Leadership Perception from Ubiquitous Audio, , and , in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012 |
|
Assessing the Reliability of Biometric Authentication on Virtual Reality Devices, , and , in: Proceedings of IEEE International Joint Conference on Biometrics, 2024 |
|
Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, , , and , in: First IEEE Workshop on CVPR for Human Communicative Behavior Analysis, 2008 |
|
Asynchronous detection and classification of oscillatory brain activity, , and , in: 16 European Signal Processing Conference, 2008 |
|
Atom Decomposition-based Intonation Modelling, , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4744--4748, IEEE, 2015 |
[DOI] |
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , in: Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, Edinburgh, UK, 2011 |
|
Audio visual speech recognition, , , , , , , and , Johns Hopkins University-CLSP, 2000 |
Audio-Visual Gender Recognition in Uncontrolled Environment Using Variability Modeling Techniques, , and , in: International Joint Conference on Biometrics, Clearwater, Florida, USA, pages 1 - 8, IEEE, 2014 |
[DOI] [URL] |
Audio-Visual Person Verification, , , , and , in: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 1999, Fort Collins, USA, 1999 |
|
Audio-Visual Speaker Tracking with Importance Particle Filters, , , , and , in: IEEE International Conference on Image Processing (ICIP), 2003 |
|
Audio–Visual Synchronisation for Speaker Diarisation, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010 |
|
Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph, , and , in: Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), ACM, New York, NY, ACM Press, 2016 |
Augmenting Astronaut's Capabilities through Brain-Machine Interfaces, , , and , in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007 |
|
Augmenting Frontal Face Models for Non-Frontal Verification, and , in: Proceedings of the 2003 Workshop on Multimodal User Authentication (MMUA'03), 2003 |
Automated Bobbing and Phase Analysis to Measure Walking Entrainment, , , , , , and , in: IEEE International Conference on Image Processing (ICIP), Paris, 2014 |
|
Automated Delineation of Dendritic Networks in Noisy Image Stacks, , and , in: proceedings of the European Conference on Computer Vision, 2008 |
Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning, , , , , , , and , in: 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, pages 1-9, IEEE, 2021 |
[DOI] |
Automated Quantification of Morphodynamics for High-Throughput Live Cell Imaging Datasets, , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , in: Proceedings of Interspeech, 2015 |
|
Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech, , , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
|
Automatic Blinking Detection towards Stress Discovery, , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 307-310, ACM New York, 2014 |
[DOI] |
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, , , , , , , , , , , , , , , and , in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020 |
[DOI] [URL] |
Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives, , , , , and , in: Proceedings of the 33rd Annual ACM SIGIR Conference, Geneva, Switzerland, pages 703, 2010 |
[DOI] |
Automatic detection of conflict escalation in spoken conversations, , and , in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012 |
|
Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012 |
|
Automatic Diagnosis of Alzheimer's Disease Using Neural Network Language Models, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Automatic Dialect Detection for Low Resource Santali Language, , , , , and , in: Proceeding of International Conference on Information Technology (OCIT), 2021 |
|
Automatic Discrimination of Apraxia of Speech and Dysarthria using a Minimalistic Set of Handcrafted Features, , , and , in: Interspeech, 2020 |
|
AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, Toronto, Canada, pages 7328–7332, 2021 |
|
Automatic Maya Hieroglyph Retrieval Using Shape and Context Information, , , , and , in: ACM MM, pages 4, 2014 |
[URL] |
Automatic Minuting: A Pipeline Method for Generating Minutes, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology, 2022 |
|
Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, , in: 10thAnnual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009 |
|
Automatic processing pipeline for collecting and annotating air-traffic voice communication data, , , , , , , , , and , in: Proceedings of 9th OpenSky Symposium 2020, OpenSky Network, Brussels, Belgium, pages 1-9, MDPI, 2021 |
|
Automatic Role Recognition Based on Conversational and Prosodic Behaviour, , , and , in: Proceedings of the ACM International Conference on Multimedia, 2010 |
|
Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models, , and , in: ACM International Conference on Multimedia, 2009 |
|
Automatic Social Role Recognition In Professional Meetings Using Conditional Random Fields, and , in: Proceedings of Interspeech, 2013 |
|
Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012, 2012 |
|
Automatic Speech Analysis Framework for ATC Communication in HAAWAII, , , , , and , in: 13th SESAR Innovation Days, 2023 |
|
Automatic Speech Recognition and Translation of a Swiss German Dialect: Walliserdeutsch, , and , in: Proceedings of Interspeech, 2014 |
|
Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers’ Workload, , , , , , , , , , , and , in: Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023), Eurocontrol (Europe), FAA (U.S.), Savannah, Georgia, USA, 2023 |
[URL] |
Automatic Speech Recognition Benchmark for Air-Traffic Communications, , , , and , in: Proc. Interspeech 2020, pages 2297-2301, 2020 |
[DOI] |
Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, , , and , in: 6th International Conference on Spoken Language Processing: ICSLP~2000 (Interspeech~2000), 2000 |
|
Automatic Staging of Audio with Emotions, and , in: International Conference on Affective Computing and Intelligent Interaction, 2013 |
Automatic Summarization for Creative Writing: Denoising Auto-Encoder based Pipeline Method for Generating Summary of Movie Scripts, , , , and , in: Automatic Summarization for Creative Writing, International Conference on Computational Linguistics (COLING 2022), 2022 |
|
Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
Automatic Time Skew Detection and Correction, , in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Automatic vs. human question answering over multimedia meeting recordings, and , in: 10th Annual Conference of the International Speech Communication Association, 2009 |
|
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , in: AES 124th Convention, Audio Engineering Society, 2008 |
|
Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, , and , in: Seventh International Conference on Spoken Language Processing (ICSLP~2002), 2002 |
|
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , in: Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag, 2005 |
|
B
Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation, and , in: Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Association for Computational Linguistics, Online, pages 468–488, 2022 |
[URL] |
Baseline Multimodal Place Classifier for the 2012 Robot Vision Task, , and , in: Working Notes of the ImageCLEF 2012 Laboratory, 2012 |
|
Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, , , and , in: 3rd European Conference on Mobile Robots (ECMR 2007), 2007 |
|
Bayesian Networks to Combine Intensity and Color Information in Face Recognition, and , in: International Conference on Biometrics, Springer, 2009 |
|
Bayesian Optimization Meets Riemannian Manifolds in Robot Learning, , , and , in: Conference on Robot Learning, 2019 |
|
Bayesian Recurrent Units and the Forward Backward Algorithm, and , in: Proc. Interspeech 2022, pages 4137-4141, 2022 |
[DOI] |
BEAT: An Open-Science Web Platform, , and , in: Thirty-fourth International Conference on Machine Learning, Sydney, Australia, 2017 |
[URL] |
Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, , , , , and , in: ICASSP2000 - IEEE International Conference on Acoustics, Speech, and Signal Processing, 2000 |
|
Benchmarking Non-Parametric Statistical Tests, , and , in: Advances in Neural Information Processing Systems, NIPS 18. MIT Press, 2005 |
|
BertAA: BERT fine-tuning for Authorship Attribution, , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Beyond Dataset Bias: Multi-task Unaligned Shared Knowledge Transfer, , , and , in: Asian Conference on Computer Vision, 2012 |
|
Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree, , , , , , and , in: Advances in Neural Information Processing Systems 21, 2008 |
|
Beyond question-based biases: Assessing multimodal shortcut learning in visual question answering, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Beyond Weight Tying: Learning Joint Input-Output Embeddings for Neural Machine Translation, , and , in: Proceedings of the Third Conference on Machine Translation (WMT), 2018 |
|
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning, , , , and , in: Under review, 2023 |
[URL] |
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , in: Proceedings of the 21st International Conference on Pattern Recognition, 2012 |
|
Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, , , and , in: Workshop on Multimodal User Authentication (MMUA), 2006 |
|
Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, , , , , , , , , , , , , and , in: IEEE ICME Workshop on Hot Topics in Mobile Multimedia, 2012 |
|
Bimanual Skill Learning with Pose and Joint Space Constraints, , , and , in: Proc. of the IEEE-RAS Intl Conf. on Humanoid Robots (Humanoids), 2018 |
|
Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, , , , and , in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
Biologically Motivated Audio-Visual Cue Integration for Object, , , , , , , , and , in: Proceedings of the first Internatinal Conference on Cognitive Systems, 2008 |
|
Biometric Person Authentication IS A Multiple Classifier Problem, and , in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007 |
|
Black-box Attacks on Image Activity Prediction and its Natural Language Explanations, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
Blackbox Face Reconstruction from Deep Facial Embeddings Using A Different Face Recognition Model, and , in: Proceedings of the IEEE International Conference on Image Processing (ICIP), Kuala Lumpur, Malaysia, pages 2435-2439, 2023 |
[DOI] [URL] |
BLESS: Benchmarking Large Language Models on Sentence Simplification, , , , , , and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023 |
|
Blind acoustic source separation for cocktail party speech recognition, , , and , in: ICONIP, 7th IEEE Int. Conf. on Neural Information Processing, 2000 |
Blind separation of delayed and superimposed acoustic sources : learning algorithms an experimental study, , , , and , in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999 |
Bob Speaks Kaldi, , , , and , in: Proc. of Interspeech, 2017 |
|
Bob: a free signal processing and machine learning toolbox for researchers, , , , , and , in: Proceedings of the ACM Multimedia Conference, 2012 |
[URL] |
Body communicative cue extraction for conversational analysis, , , , and , in: Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, 2013 |
|
BookTubing Across Regions: Examining Differences based on Nonverbal and Verbal Cues, , and , in: Proc. ACM Int. Conf. on Interactive Experiences for Television and Online Video (TVX), Salford, ENGLAND, 2019 |
[DOI] |
Boolean Logic Inspired High Order Perceptron Construction, , and , in: SIPAR Workshop'95 Parallel and Distributed Systems, SIPAR SI Group for Parallel Systems, Biel School of Engineering, Computer Science Department, 1995 |
|
BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, , and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010 |
|
Boosted Exudate Segmentation in Retinal Images using Residual Nets, , , and , in: Proceedings of the MICCAI Workshop on Ophthalmic Medical Image Analysis, 2017 |
Boosting HMMs with an application to speech recognition, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004 |
|
Boosting localized binary features for speech recognition, , and , in: Symposium on Machine Learning in Speech and Language Processing (MLSLP), 2012 |
|
Boosting of contextual information in ASR for air-traffic call-sign recognition, , , , , , , and , in: Interspeech 2021, 2021 |
|
Boosting Pixel-based Classifiers for Face Verification, and , in: Biometric Authentication Workshop of the 8th European Conference on Computer Vision, BIOAW2004, Springer-Verlag, 2004 |
|
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, , and , in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012 |
|
Boosting with Maximum Adaptive Sampling, and , in: Proceedings of the Neural Information Processing Systems Conference, 2011 |
Boosting word error rates, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2005 |
Borrowing from yourself: Faster future video segmentation with partial channel update, and , in: International Conference on Pattern Recognition, 2022 |
|
Bounds on the Degree of High Order Binary Perceptrons, , in: Proceedings of ESANN'96, D facto, 1996 |
|
Brain-Computer Interfaces for HCI and Games, , , , , and , in: Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, 2008 |
|
Brain-Machine Interfaces through Control of Electroencephalographic Signals and Vibrotactile Feedback, , , , , , , , and , in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007 |
|
Breaking Template Protection: Reconstruction of Face Images from Protected Facial Templates, and , in: 18th International Conference on Automatic Face and Gesture Recognition (FG), 2024 |
|
Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, and , in: IJCB, 2023 |
|
Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA, 2012 |
Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, and , in: ACM International Conference on Multimedia, 2007 |
|
Building 'directional corpora' for unbiased contrastive analysis, and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 29-30, 2011 |
|
Building Blocks of Assistant Based Speech Recognition for Air Traffic Management Applications, , , , , , , and , in: Conference: SESAR Innovation Days 2018, European Union, Eurocontrol, Salzburg, Austria, SESARJU, 2018 |
[URL] |
Building context-dependent DNN acoustic models using Kullback-Leibler divergence-based state tying, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2015 |
|
Building energy models with Morphological urban-scale parameters: a case study in Turin, , , , , and , in: Proceedings of 4th Building Simulation Applications Conference - BSA 2019, 2019 |
[URL] |
Building the NinaPro Database: a Resource for the Biorobotics Community, , , , , , , , and , in: Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2012 |
|
By their apps you shall understand them: mining large-scale patterns of mobile phone usage, and , in: The 9th International Conference on Mobile and Ubiquitous Multimedia, 2010 |
|
C
Calibration from statistical properties of the visual world, , and , in: European Conf. on Computer Vision, 2008 |
|
Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding, and , in: Proc. INTERSPEECH 2023, pages 1109-1113, 2023 |
[DOI] |
Can face anti-spoofing countermeasures work in a real world scenario?, , , and , in: International Conference on Biometrics, Madrid, Spain, 2013 |
[URL] |
Can Language Models Learn Analogical Reasoning? Investigating Training Objectives and Comparisons to Human Performance, and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, Association for Computational Linguistics, 2023 |
|
Can personalised hygienic masks be used to attack face recognition systems?, , , and , in: Proceedings of IEEE International Joint Conference on Biometrics (IJCB2023), 2023 |
|
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?, and , in: Proceedings of Interspeech, 2023 |
|
Canal9: A database of political debates for analysis of social interactions, , , and , in: Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing), Amsterdam, Netherlands, 2009 |
[DOI] |
Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder, , , and , in: Under review, 2023 |
[URL] |
Capturing Upper Body Motion in Conversation: an Appearance Quasi-Invariant Approach, , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 327-334, ACM New York, 2014 |
[DOI] |
Case-Based Abductive Natural Language Inference, , and , in: Proceedings of the 29th International Conference on Computational Linguistics, 2022 |
[URL] |
CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, , and , in: International Conference on Learning Representations, New Orleans, Louisiana, USA, 2019 |
[URL] |
CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition, , , and , in: 18th IEEE Int. Conference on Automatic Face and Gesture Recognition (FG), Istanbul,, 2024 |
|
Challenges for Using Impact Regularizers to Avoid Negative Side Effects, , and , in: SafeAI 2021 - AAAI's Workshop on Artificial Intelligence Safety, 2021 |
|
Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition, , , , and , in: Proceedings of Interspeech, pages 741-745, 2015 |
|
Characterising Conversationsal Group Dynamics Using Nonverbal Behaviour, , and , in: Proceedings ICME 2009, 2009 |
|
ChatGPT and biometrics: an assessment of face recognition, gender detection, and age estimation capabilities, , , , and , in: 2024 IEEE International Conference on Image Processing (ICIP), 2024 |
|
Checking In or Checked In: Comparing Large-Scale Manual and Automatic Location Disclosure Patterns, , and , in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012 |
ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023 |
|
CityLearn v1.0: An OpenAI Gym Environment for Demand Response with Deep Reinforcement Learning, , , and , in: Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, New-York, USA, pages 356-357, ACM, 2019 |
[DOI] |
Classification using localized mixtures of experts, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999 |
|
CLEF2007 Image Annotation Task: an SVM-based Cue Integration Approach, , and , in: Proceedings of ImageCLEF 2007 -LNCS, 2007 |
|
CLIENT / WORLD MODEL SYNCHRONOUS ALIGNEMENT FOR SPEAKER VERIFICATION, , , and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
Client Dependent GMM-SVM Models for Speaker Verification, and , in: International Conference on Artificial Neural Networks, ICANN/ICONIP 2003, Springer Verlag, 2003 |
|
Clustering And Segmenting Speakers And Their Locations In Meetings, , and , in: ICASSP, 2004 |
|
CNN based Query by Example Spoken Term Detection, , and , in: Proceedings of Interspeech, 2018 |
|
CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR, and , in: IEEE International Conference on Image Processing, 2020 |
|
CO2 experimental measurements towards the development of a predictive framework using user actions in smart buildings, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Collecting data for socially intelligent surveillance and monitoring approaches: the case of conflict in competitive conversations, , , and , in: International Symposium on Communications, Control, and Signal Processing, 2012 |
|
Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR, , , , , and , in: Proceedings of Interspeech, 2012 |
|
Combinatorial Approach for Data Binarization, and , in: Principles of Data Mining and Knowledge Discovery: third european conference; proceedings / PKDD'99, Springer, 1999 |
|
Combined Estimation of Location and Body Pose in Surveillance Video, , and , in: AVSS, 2011 |
|
Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation, and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
COMBINING CEPSTRAL NORMALIZATION AND COCHLEAR IMPLANT-LIKE SPEECH PROCESSING FOR MICROPHONE ARRAY-BASED SPEECH RECOGNITION, , and , in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012 |
|
Combining Content with User Preferences for TED Lecture Recommendation, and , in: Proceedings of the 11th International Workshop on Content Based Multimedia Indexing, Veszprém, Hungary, IEEE, 2013 |
|
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , in: Proceedings of Interspeech, 2008 |
|
Combining methods to improve speaker verification decision, , , and , in: Proceedings of The Fourth International Conference on Spoken Language Processing, ICSLP, ICSLP, 1996 |
|
COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, , and , in: Proceedings of ICASSP 2015, pages 4834-4837, 2015 |
|
Combining transcription-based and acoustic-based speaker identifications for broadcast news, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012 |
|
COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH HIERARCHIAL LINEAR TRANSFORMATIONS, , , and , in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012 |
|
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice, , , and , in: Proc. Interspeech 2023, 2023 |
[URL] |
CommuniSense: Crowdsourcing Road Hazards in Nairobi, , , , , , and , in: Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services, Copenhagen, Denmark, pages 445-456, ACM, 2015 |
[DOI] [URL] |
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers, , and , in: NeurIPS, 2021 |
|
Comparing Biosignal and Acoustic feature Representation for Continuous Emotion Recognition, , , , and , in: International Multimodal Sentiment Analysis Workshop and Challenge, 2022 |
|
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, , and , in: Proceedings of Interspeech, 2021 |
[URL] |
COMPARING DATA-DRIVEN AND HANDCRAFTED FEATURES FOR DIMENSIONAL EMOTION RECOGNITION, , and , in: International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
Comparing Stability and Discriminatory Power of Hand-crafted Versus Deep Radiomics: A 3D-Printed Anthropomorphic Phantom Study, , , , , , , , , , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track, , , and , in: Proceedings of the ICML Expressive Vocalizations Workshop held in conjunction with the 39th International Conference on Machine Learning, Maryland, USA, 2022 |
|
Comparing Two Strategies for Query Expansion in a News Monitoring System, and , in: Natural Language Processing and Information Systems: 21st International Conference on Applications of Natural Language to Information Systems, NLDB 2016, pages 267-275, Springer-Verlag, 2016 |
[DOI] |
Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech, , , , , , , , , and , in: Annual Conference of the International Speech Communication Association, pages 2188-2192, 2022 |
[DOI] |
Comparison of different feature classifiers for brain computer interfaces, , , , , , , , and , in: Proceedings of the 1st International IEEE EMBS Conference on Neural Engineering, 2003 |
Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, and , in: ICSLP, 2000 |
|
Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, , and , in: 4th International Conference on AUDIO- and VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 2003 |
|
Comparison of Two Methods for Unsupervised Person Identification in TV Shows, , , , and , in: 12th International Workshop on Content-Based Multimedia Indexing, 2014 |
|
Comparison of Unsupervised and Supervised Training of RBF Neural Networks. Case Study: Mapping of Contamination Data, and , in: Neural Computation 2000, 2000 |
Competition on Counter Measures to 2-D Facial Spoofing Attacks, , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011 |
|
Complementary Countermeasures for Detecting Scenic Face Spoofing Attacks, , , , and , in: International Conference on Biometrics, Madrid, Spain, 2013 |
[URL] |
Complex Daily Activities, Country-Level Diversity, and Smartphone Sensing: A Study in Denmark, Italy, Mongolia, Paraguay, and UK, , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI), Association for Computing Machinery, 2023 |
[DOI] |
Complexity reduction of eigenvalue decomposition-based diffuse power spectral density estimators using the power method, , and , in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 451-455, 2018 |
|
Composite Kernel Learning, , and , in: Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), Omnipress, 2008 |
|
Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
|
Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, and , in: International Joint Conference on artificial intelligence, 2013 |
|
Conditional Gaussian Mixture Models for Environmental Risk Mapping, , and , in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002 |
|
Confidence Evaluation for Risk Prediction, , and , in: 2001 Annual Conference of the IAMG, 2001 |
|
Confidence Measures in Hybrid HMM/ANN Speech Recognition, and , in: Proceedings of Workshop on Text, Speech and Dialog (TSD'98) Brno, Czech Republic, 1998 |
Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
Confidence-based Cue Integration for Visual Place Recognition, and , in: IEEE International Conference on Intelligent RObot Systems (IROS), 2007 |
|
Configuration Space Distance Fields for Manipulation Planning, , , and , in: Robotics: Science and Systems (RSS), 2024, 2024 |
|
Confusion Matrix Based Entropy Correction in Multi-stream Combination, and , in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2003 |
|
Connectionist Quantization Functions, , and , in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996 |
|
Connectionist speech recognition, , in: Proceedings of IK'98, Interdisziplinares Kolleg, Spring Scholl, Gunne am Mohnessee, Germany, March 7--14, 1998 |
Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, , and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
|
Constructing visual models with a latent space approach, , , and , in: the Springer series of Lecture Notes in Computer Science, 2006 |
|
Content Normalization for Text-dependent Speaker Verification, , , and , in: Proc. of Interspeech, 2017 |
|
CONTENT-BASED OBJECTIVE EVALUATION OF ARTIFICIALLY GENERATED SIGN LANGUAGE VIDEOS, , , , , and , in: ICASSP, 2024 |
|
Context Aware Addressee Estimation for Human Robot Interaction, , , and , in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013 |
CONTEXT-AWARE ATTENTION MECHANISM FOR SPEECH EMOTION RECOGNITION, , , and , in: IEEE Workshop on Spoken Language Technology, Athens, Greece, pages 126-131, 2018 |
[URL] |
CONTEXTUAL BIASING METHODS FOR IMPROVING RARE WORD DETECTION IN AUTOMATIC SPEECH RECOGNITION, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024 |
|
Contextual Conditional Models for Smartphone-based Human Mobility Prediction, and , in: Proceedings of the 14th ACM International Conference on Ubiquitous Computing, 2012 |
|
Contextual grouping: discovering real-life interaction types from longitudinal Bluetooth data, and , in: 12th International Conference on Mobile Data Management, 2011 |
|
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , in: Interspeech 2021, 2021 |
[URL] |
Continuous Audio-Visual Speech Recognition, and , in: Proc. 5th European Conference on Computer Vision, Springer Verlag, 1998 |
|
Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, , , , , , and , in: In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008 |
|
Continuously Reproducing Toolchains in Pattern Recognition and Machine Learning Experiments, , , , , and , in: Thirty-fourth International Conference on Machine Learning, Sidney, Australia, 2017 |
[URL] |
Conversational Speech Recognition Needs Data? Experiments with Austrian German, , , and , in: Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, pages 4684--4691, 2022 |
[URL] |
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
|
Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, , and , in: International Conference on Acoustics, Speech and Signal Procecssing, IEEE, South Brisbane, QLD, pages 4295 - 4299, IEEE, 2015 |
|
Cost–effective Variational Active Entity Resolution, , , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2021 |
[URL] |
Counter-Measures to Photo Attacks in Face Recognition: a public database and a baseline, and , in: International Joint Conference on Biometrics 2011, 2011 |
[URL] |
Creative Applications of Human Behavior Understanding. HBU 2013: 1-14, , , and , in: Human Behavior Understanding, pages 1-14, 2013 |
Cross Modal Focal Loss for RGBD Face Anti-Spoofing, and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 |
|
Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies (Extended Abstract), , , , , , and , in: Proceedings of ACII, Xi'an, pages 470-476, IEEE, 2015 |
[DOI] |
Cross-database evaluation of audio-based spoofing detection systems, and , in: Interspeech, San Francisco, USA, 2016 |
[URL] |
Cross-Database Evaluation With an Open Finger Vein Sensor, , , and , in: IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BioMS), Rome, Italy, pages 30-35, IEEE, 2014 |
[DOI] |
Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Cross-Eyed 2017: Cross-Spectral Iris/Periocular Recognition Competition., , , , , , , , , , , , , and , in: IEEE/IAPR International Joint Conference on Biometrics, Denver, Colorado, USA, IEEE, 2017 |
Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, , , , , and , in: Proceedings of APSIPA ASC 2019, 2019 |
Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech, and , in: Proceedings of Interspeech, Florence, Italy, 2011 |
|
Cross-linguistic annotation of narrativity for English/French verb tense disambiguation, and , in: 9th Edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
Cross-transfer Knowledge between Speech and Text Encoders to Evaluate Customer Satisfaction, , , , and , in: Interspeech, Kos Island, Greece, ISCA, 2024 |
|
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013 |
|
Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010 |
|
Crowdsourcing Micro-Level Multimedia Annotations: The Challenges of Evaluation and Interface, , , and , in: Proceedings of International ACM Workshop on Crowdsourcing for Multimedia, 2012 |
Cue Integration for Medical Image Annotation, , and , in: Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Springer-Verlag, 2008 |
|
Cue integration through discriminative accumulation, and , in: International Conference on Computer Vision and Pattern Recognition, 2004 |
|
Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, , and , in: Proceedings of International Conference on Pattern Recognition (ICPR), 2006 |
|
Custom attribution loss for improving generalization and interpretability of deepfake detection, , and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
Custom Silicone Face Masks - Vulnerability of Commercial Face Recognition Systems & Presentation Attack Detection, , , , , , and , in: Proceedings of 7th IAPR/IEEE International Workshop on Biometrics and Forensics, 2019 |
|
Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training, , , , , , and , in: Proc. 13th SESAR Innovation Days, Seville, Spain, 2023 |
[DOI] [URL] |
D
D-LGP: Dynamic Logic-Geometric Program for Reactive Task and Motion Planning, , and , in: IEEE International Conference on Robotics and Automation, 2024 |
|
DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews, , , , , and , in: Proceedings of the 6th Clinical Natural Language Processing Workshop, Association for Computational Linguistics, 2024 |
|
Daily Routine Classification from Mobile Phone Data, and , in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), 2008 |
|
Data binarization by discriminant elimination, , and , in: Proceedings of the ICML-99 Workshop: From Machine Learning to Knowledge Discovery in Databases, 1999 |
|
Data utility modelling for mismatch reduction, , in: Proc. CRAC (workshop on Consistent & Reliable Acoustic Cues for sound analysis), 2001 |
|
Data-driven Urban Building Energy Modeling with Machine Learning in Satom (CH), , and , in: 6th International IEEE Conference AND Workshop in Obuda on Electrical and Power Engineering, 2023 |
Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
Daylight regulated by automated external Venetian blinds based on HDR sky luminance mapping in winter, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Deciphering the Silent Participant. On the Use of Audio-Visual Cues for the Classification of Listener Categories in Group Discussions, , , and , in: Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, ACM, Seattle, Washington, USA, pages 107--114, ACM, 2015 |
[DOI] |
Decision fusion using a multi-linear classifier, , and , in: 1st International Conference on Multisource-Multisensor Data Fusion, 1998 |
Decision-Oriented Environmental Mapping with Radial Basis Function Neural Networks, , , , and , in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999 |
Decomposing Natural Logic Inferences for Neural NLI, , , , and , in: BlackBoxNLP: Workshop on analyzing and interpreting neural networks for NLP, 2022 |
Deep Auto-Encoding and Biohashing for Secure Finger Vein Recognition, and , in: Proceedings of the 2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Toronto, Canada, 2021 |
[DOI] [URL] |
Deep Learning for Efficient Discriminative Parsing, , in: International Conference on Artificial Intelligence and Statistics, 2011 |
|
Deep Multi-Camera People Detection, and , in: Proceedings of the IEEE International Conference on Machine Learning and Applications, 2017 |
Deep Multitask Gaze Estimation with a Constrained Landmark-Gaze Model, , and , in: European Conference on Computer Vision Workshop, 2018 |
|
DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5050-5054, IEEE, 2016 |
|
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, AUSTRALIA, pages 74-79, 2018 |
[DOI] |
Deep Neural Networks for Syntactic Parsing of Morphologically Rich Languages, and , in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016 |
|
Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection, , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Deep Pixel-wise Binary Supervision for Face Presentation Attack Detection, and , in: International Conference on Biometrics, 2019 |
|
Deep Residual Output Layers for Neural Language Generation, and , in: Proceedings of the 36th International Conference on Machine Learning (ICML), 2019 |
|
Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition, , and , in: 49th IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2024 |
[DOI] [URL] |
DeepCon: An End-to-End Multilingual Toolkit for Automatic Minuting of Multi-Party Dialogues, , , and , in: Special Interest Group on Discourse and Dialogue (SIGDIAL 2022), 2022 |
|
DeepFocus: a Few-shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function, and , in: International Symposium on Biomedical Imaging, 2020 |
|
Deformable Part Models with Individual Part Scaling, and , in: British Machine Vision Conference, 2013 |
|
Deliberate Imposture: a challenge for automatic speaker verification systems, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
Delineating Trees in Noisy 2D Images and 3D Image Stacks, , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2799–2806, 2010 |
Demographic Fairness Transformer for Bias Mitigation in Face Recognition, and , in: Proceedings of IEEE International Joint Conference on Biometrics (IJCB2024), 2024 |
|
Demonstration-guided Optimal Control for Long-term Non-prehensile Planar Manipulation, , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 4999-5005, 2023 |
[DOI] |
Demystifying the Scribes behind the Voynich Manuscript using Computational Linguistic Techniques, , and , in: Proceedings of the 1st International Conference on the Voynich Manuscript, 2022 |
Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech, , , , , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 292-296, 2018 |
[DOI] |
DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6039-6048, 2021 |
[URL] |
Design and Implementation of a System for the Recognition of Handwritten Responses on US Census Forms, , in: IAPR Workshop on Document Analysis Systems, 1994 |
Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer, , , , , , , , , , , and , in: Lecture Notes in Artificial Intelligence: 18th International Conference, SPECOM 2016, Budapest, Hungary, pages 199--206, 2016 |
|
Detecting Abandoned Luggage Items in a Public Space, , and , in: IEEE Performance Evaluation of Tracking and Surveillance Workshop (PETS), 2006 |
|
Detecting and Labeling Folk Literature in Spoken Cultural Heritage Archives using Structural and Prosodic Features, and , in: IEEE Content Based Multimedia Indexing, 2012 |
|
Detecting and Labeling Speakers on Overlapping Speech using Vector Taylor Series, , and , in: INTERSPEECH, 2014 |
|
Detecting Group Interest-level in Meetings, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005 |
|
Detecting Narrativity to Improve English to French Translation of Simple Past Verbs, , and , in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 33-42, 2013 |
|
Detecting queues at vending machines: a statistical layered approach, and , in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008 |
|
Detecting speaker roles and topic changes in multiparty conversations using latent topic models, and , in: Proceedings of Interspeech, 2014 |
|
Detection and Application of Influence Rankings in Small Group Meetings, , , and , in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006 |
|
Detection and Recognition of Number Sequences in Spoken Utterances, and , in: 2nd Workshop on Speech in Mobile and Pervasive Environments (SiMPE), 2007 |
|
Detection of S1 and S2 locations in phonocardiogram signals using zero frequency filter, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |
|
Detection of Similar Languages and Dialects Using Deep Supervised Autoencoders, , , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
Detection-Based Multi-Human Tracking Using a CRF Model, , and , in: The Eleventh IEEE International Workshop on Visual Surveillance, 2011 |
|
Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain, , , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
|
Developing and Enhancing Posterior Based Speech Recognition Systems, , , and , in: Proceedings of Interspeech, 2005 |
|
Development of a lung segmentation algorithm for analog imaged chest X-Ray: preliminary results, , , , and , in: XV Brazilian Congress on Computational Intelligence, Joinville, Brazil, 2021 |
[URL] |
Development of Bilingual ASR System for MediaParl Corpus, , , and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014 |
|
DexROV: Dexterous Undersea Inspection and Maintenance in Presence of Communication Latencies, , , , , , , , , , , , , , , and , in: IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV), pages 218-223, 2015 |
Dexterous Undersea Interventions with Far Distance Onshore Supervision: the DexROV Project, , , , , , , , , , , , , , , , , , , , , , and , in: IFAC Conference on Control Applications in Marine Systems (CAMS), Trondheim, Norway, pages 414-419, 2016 |
[DOI] [URL] |
DHgeN: Automated Generation of District Heating Network Layouts for Feasibility Studies, and , in: -, 2022 |
|
Dialect Levelling in Finnish: A Universal Speech Attribute Approach, , , , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
Diarizing Large Corpora using Multi-modal Speaker Linking, , , and , in: INTERSPEECH 2014, 2014 |
|
DiarTk : An Open Source Toolkit for Research in Multistream Speaker Diarization and its Application to Meetings Recordings, and , in: Proceedings of Interspeech, 2012 |
|
Dictionary Learning for Sparse Representation of Neural Network Exemplars in Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015, pages 1093, 2015 |
|
Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR, , and , in: Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
|
Diffusion Transformer for Adaptive Text-to-Speech, and , in: Proc. 12th ISCA Speech Synthesis Workshop (SSW 12), 2023 |
[DOI] |
Direct Non-Invasive Brain Computer Interfaces, , , , and , in: Proceedings of the 9th International Conference on Functional Mapping of the Human Brain, 2003 |
Disambiguating discourse connectives using parallel corpora: senses vs. translations, , , , , and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 104-105, 2011 |
|
Disambiguating Temporal-Contrastive Discourse Connectives for Machine Translation, , in: Proceedings of ACL-HLT 2011 Student Session, Association for Computational Linguistics, Portland, OR, pages 46--51, 2011 |
|
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns, , , , and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), pages 5, 2012 |
|
Discovering Eating Routines in Context with a Smartphone App, , , and , in: Ubicomp/Iswc'19 Adjunct: Proceedings Of The 2019 Acm International Joint Conference On Pervasive And Ubiquitous Computing And Proceedings Of The 2019 Acm International Symposium On Wearable Computers, London, pages 422-429, 2019 |
[DOI] |
Discovering Group Nonverbal Conversational Patterns with Topics, and , in: Proceedings ICMI-MLMI, 2009 |
|
Discovering Human Places of Interest from Multimodal Mobile Phone Data, and , in: Proceedings of 9th International Conference on on Mobile and Ubiquitous Multimedia (MUM,',','), Limassol, Cyprus, 2010 |
|
Discovering Human Routines from Cell Phone Data with Topic Models, and , in: IEEE International Symposium on Wearable Computers (ISWC), 2008 |
|
Discovering Temporal Patterns in Water Quality Time Series, Focusing on Floods with the LDA method, , , , , , , and , in: European Geosciences Union, 2013 |
|
Discriminant linear processing of time-frequency plane, and , in: International Conference on Spoken Language Processing, 2006 |
|
Discrimination of the voices of twins and siblings for speaker verification, and , in: 4th European Conference on Speech Communication and Technology, 1995 |
Discriminative Kernel-Based Phoneme Sequence Recognition, , , , and , in: The 9th International Conference on Spoken Language Processing (INTERSPEECH), Pittsburgh, PA, 2006 |
|
Discriminative Keyword Spotting, , and , in: Workshop on Non-Linear Speech Processing, Paris, France, 2007 |
|
Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders, and , in: The 2021 Conference on Empirical Methods in Natural Language Processing, 2021 |
Distinguishing the Popularity Between Topics: A System for Up-to-date Opinion Retrieval and Mining in the Web, , and , in: Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics, LNCS, Samos, Greece, ACM, 2013 |
[URL] |
District heating network modelling for future integration of solar thermal energy, , , and , in: Journal of Physics: Conference Series, pages 012089, IOP Publishing, 2021 |
[DOI] |
Dites-Moi: Wearable Feedback on Conversational Behavior, , , and , in: Proceedings of the 15th International Conference on Mobile and Ubiquitous Multimedia, 2016 |
|
Diverse Keyword Extraction from Conversations, and , in: Proceedings of the ACL 2013 (51th Annual Meeting of the Association for Computational Linguistics ), Short Papers, Sofia, Bulgaria, pages 651-657, ACL, 2013 |
|
DNN based speaker embedding using content information for text-dependent speaker verification, , , and , in: Proceedings of 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2018 |
|
Do Backpropagation trained neural networks have normal weight distributions?, and , in: International Conference on Artificial neural Networks, 1993 |
|
Do Natural Language Explanations Represent Valid Logical Arguments? Verifying Entailment in Explainable NLI Gold Standards, , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
Document-Level Neural Machine Translation with Hierarchical Attention Networks, , , and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018 |
|
Document-level Text Simplification with Coherence Evaluation, , , and , in: Proceedings of the Second Workshop on Text Simplification, Accessibility and Readability, 2023 |
|
Does Data-Efficient Generalization Exacerbate Bias in Foundation Models?, , , , , and , in: Proceedings of the 18th European Conference on Computer Vision, 2024 |
|
Does My Representation Capture X? Probe-Ably, , , , and , in: 59th Annual Meeting of the Association for Computational Linguistics (Demonstration track), 2021 |
[URL] |
DOMAIN ADAPTATION FOR GENERALIZATION OF FACE PRESENTATION ATTACK DETECTION IN MOBILE SETTINGS WITH MINIMAL INFORMATION, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, IEEE, 2020 |
[URL] |
Domain Adaptation in Multi-Channel Autoencoder based Features for Robust Face Anti-Spoofing, , and , in: International Conference on Biometrics 2019, IEEE, 2019 |
|
Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, , , and , in: 2002 IEEE International Workshop on Neural Networks for for Signal Processing (NNSP~2002), 2002 |
|
Dynamic Graffiti Stylisation with Stochastic Optimal Control, , and , in: Intl Workshop on movement and computing (MOCO), London, UK, pages 1-8, ACM, 2017 |
[DOI] [URL] |
Dynamic Partitioned Sampling For Tracking With Discriminative Features, , and , in: Proceedings of the British Maschine Vision Conference, London, 2009 |
|
Dynamic Programming Boosting for Discriminative Macro-Action Discovery, and , in: International Conference on Machine Learning, 2014 |
|
Dysarthric Speech Recognition with Lattice-Free MMI, and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020 |
[DOI] [URL] |
E
EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering, , , , and , in: arXiv, 2022 |
EEG pattern recognition through multi-stream evidence combination, , and , in: Proc. World Congress on Neuroinformatics, 2001 |
|
EEG-Based Brain-Computer Interaction: Improved Accuracy by Automatic Single-Trial Error Detection, and , in: Advances in Neural Information Processing Systems 21, 2007 |
|
EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, and , in: Sixth International Workshop on Multiple Classifier System (MCS2005), 2005 |
|
EFaR 2023: Efficient Face Recognition Competition, , , , and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
Effect of nonverbal behavioral patterns on the performance of small groups, and , in: ICMI Workshop on Understanding and Modeling Multiparty Multimodal Interactions, Istanbul, Turkey, 2014 |
|
Effect of Recognition Errors on Information Retrieval Performance, , in: Proceedings of International Workshop on Frontiers in Handwriting Recognition, 2004 |
|
Effect of Segmentation Method on Video Retrieval Performance, and , in: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo (ICME-05), 2005 |
|
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, , , , , , , , and , in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023 |
|
Efficient Grapevine Structure Estimation in Vineyards Conditions, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, pages 712--720, 2023 |
[URL] |
Efficient Posterior Exemplar Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , in: Interspeech, San Francisco, CA, 2016 |
|
Efficient Sample Mining for Object Detection, and , in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014 |
|
Efficient Training of Low-Curvature Neural Networks, , , and , in: NeurIPS 2022, 2022 |
[URL] |
Efficient Wind Speed Nowcasting with GPU-Accelerated Nearest Neighbors Algorithm, , and , in: Proceedings of SIAM Data Mining, Virginia US and Virtual, 2022 |
Elderly People Living Alone: Detecting Home Visits with Ambient and Wearable Sensing, , , and , in: In Proceedings of MMHealth, 2017 |
|
Embedding Context-Dependent Variations of Prosodic Contours using Variational Encoding for Decomposing the Structure of Speech Prosody, , , , and , in: Workshop on Prosody and Meaning: Information Structure and Beyond, Aix-en-Provence, France, 2018 |
[URL] |
Embedding motion in model-based stochastic tracking, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
Emphasis Recreation for TTS using Intonation Atoms, and , in: 9th ISCA Speech Synthesis Workshop, pages 14--20, 2016 |
[DOI] |
EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, and , in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019 |
|
Empirical validations of multilingual annotation schemes for discourse relations, , , and , in: 8th Joint ACL-ISO Workshop on Interoperable Semantic Annotation, 2012 |
|
EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, , , and , in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015 |
[URL] |
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , in: Proc. 16th European Signal Processing Conference (EUSIPCO), 2008 |
|
Encoding Explanatory Knowledge for Zero-shot Science Question Answering, , , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
End-to-End Accented Speech Recognition, , and , in: International Conference on Speech and Language Processing, Interspeech, ISCA, Graz, Austria, pages 2140-2144, 2019 |
[DOI] |
End-to-End Bias Mitigation by Modelling Biases in Corpora, , and , in: ACL, 2020 |
|
End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, , and , in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017 |
|
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation, , , , , , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, 2023 |
[URL] |
End-to-end text-dependent speaker verification using novel distance measures, , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, Aug 02-Sep 06, 2018, pages 3598-3602, 2018 |
[DOI] |
Energy assessment of a district by integrating solar thermal in district heating network: a dynamic analysis approach, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Enforcing Topic Diversity in a Document Recommender for Conversations, and , in: Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics), Dublin, Ireland, pages 746-759, IEEE, 2014 |
|
Engagement-based Multi-party Dialog with a Humanoid Robot, , , , , , and , in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011 |
|
English Spoken Term Detection in Multilingual Recordings, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010 |
|
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling, , and , in: The Ninth Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement, , , and , in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Enhancing Multi-modal Classification of Violent Events using Image Captioning, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), CEUR Workshop Proceedings, Jaén, Spain, 2023 |
[URL] |
Enhancing Trust in eAssessment - the TeSLA System Solution, , , , and , in: Technology Enhanced Assessment Conference., 2018 |
|
Enhancing user acceptance in automated systems with human-centric lighting: the role of visual comfort, personality, and preference, , , , , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Entity Matching Across Small Networks Using Node Attributes, , , , , , , , , and , in: ECAI 2024 - 27th European Conference on Artificial Intelligence, October 19-24, 2024, Santiago de Compostela, Spain - Including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings, 2024 |
|
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , in: Proceedings of the INTERSPEECH-ICSLP-04, 2004 |
|
Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, , in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011 |
Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, , , and , in: Geostatistical congress 2000, 2000 |
Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, , , and , in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999 |
Environnement multi-agents de reconnaissance automatique de la parole en continu, and , in: Actes des 3emes Journees Francophones sur l'Intelligence Artificielle Distribuee et les Systemes Multi-agents, 1995 |
Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, and , in: EUROSPEECH, 2001 |
|
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), pages 17408-17419, 2023 |
[DOI] |
Estimating Nonplanar Flow from 2D Motion-blurred Widefield Microscopy Images via Deep Learning, and , in: International Symposium on Biomedical Imaging, 2021, 2021 |
|
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, , and , in: Proceedings of Interspeech, 2013 |
|
Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models, , and , in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024 |
Estimating The Degree of Sleepiness by Integrating Articulatory Feature Knowledge In Raw Waveform Based CNNs, , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020 |
|
ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007 |
Estimating the Quality of Face Localization for Face Verification, , , and , in: IEEE International Conference on Image Processing, ICIP, 2004 |
|
Estimation of Divergence-Free 3D Cardiac Blood Flow in a Zebrafish Larva Using Multi-View Microscopy, and , in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, IEEE, Brooklyn, NY, USA, pages 385-388, 2015 |
[DOI] |
Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems, , , and , in: EUROSPEECH'97, 1997 |
|
ETC\_vérif : un environnement multi-agents de reconnaissance automatique de la parole en continu, and , in: Proceedings of JEP'96: XXIemes Journees d'etude sur la Parole, 1996 |
|
ETC\_vérif, a Prototype of a Cooperative Automatic Speech Recognition System, and , in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995 |
Etudes comparatives des robustesses au bruit de l'approche 'Full Combination' et de son approximation, and , in: Journee d'Etudes sur la Parole, Aussois, 2000 |
|
Euclidean Distance Matrix Completion for Ad-hoc Microphone Array Calibration, , , and , in: Proceedings IEEE International Conference On Digital Signal Processing, 2013 |
|
EUMSSI team at the MediaEval Person Discovery Challenge, , , and , in: Working Notes Proceedings of the MediaEval 2015 Workshop, Wurzen, Germany, 2015 |
[URL] |
EUMSSI team at the MediaEval Person Discovery Challenge 2016, , and , in: MediaEval Benchmarking Initiative for Multimedia Evaluation, Hilversum, Netherlands, 2016 |
|
Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization, , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022 |
|
Evaluating Intra- and Crosslingual Adaptation for Non-native Speech Recognition in a Bilingual Environment, and , in: Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications, IEEE, Budapest, Hungary, pages 357-361, 2013 |
|
Evaluating pruning methods, and , in: 1995 International Symposium on Artificial Neural Networks (ISANN'95), 1995 |
|
Evaluating Shape Descriptors for Detection of Maya Hieroglyphs, , and , in: in Proc. Mexican Conf. on Pattern Recognition, Queretaro, 2013 |
|
Evaluating the Complexity of Databases for Person Identification and Verification, , and , in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999 |
|
Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection, and , in: International Joint Conference on Biometrics, 2024 |
|
Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, , , and , in: ICASSP 2010, 2010 |
|
Evaluation of Formant-Like Features for ASR, , , , , and , in: International Conference on Spoken Language Processing (ICSLP 2002), 2002 |
|
Evaluation of Multiple Cues Head Pose Tracking Algorithm in Natural Environments, and , in: International Conference on Multimedia & Expo ICME 2005, 2005 |
|
Evaluation of Probabilistic Occupancy Map People Detection for Surveillance Systems, , , , and , in: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009 |
Evolution of the Mental States Operating a Brain-Computer Interface, , , and , in: Proceedings of the International Federation for Medical and Biological Engineering, 2002 |
|
Exact Acceleration of Linear Object Detectors, and , in: Proceedings of the European Conference on Computer Vision, 2012 |
|
Exact Preimages of Neural Network Aircraft Collision Avoidance Systems, and , in: Machine Learning for Engineering Modeling, Simulation, and Design Workshop at Neural Information Processing Systems 2020, 2020 |
|
Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework, and , in: 2nd ACM International Workshop on Multimedia AI against Disinformation (MAD '23), June 12, 2023, Thessaloniki, Greece, 2023 |
|
Examining Linguistic Content and Skill Impression Structure for Job Interview Analytics in Hospitality, and , in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017 |
|
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, , , , and , in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022 |
[DOI] |
Experiences in the Creation of an Electromyography Database to Help Hand Amputated Persons, , , , , , and , in: Proceedings of the 24th European Medical Informatics Conference, 2012 |
|
Experimental evaluation of speech enhancement methods in remote microphone systems for hearing aids, , , , and , in: Proc. EuroNoise 2018, Crete, Greece, pages 351-358, 2018 |
|
Experimental evaluation of text-dependent speaker verification on laboratory and field test databases in the M2VTS project, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
Experimental investigation on STFT phase Representations for deep learning-based dysarthric speech detection, and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
Explainable Inference Over Grounding-Abstract Chains for Science Questions, , and , in: 59th Annual Meeting of the Association for Computational Linguistics (ACL Findings), 2021 |
|
Explainable Natural Language Reasoning via Conceptual Unification, , and , in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |
[URL] |
Explaining models relating objects and privacy, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024 |
[URL] |
Explaining the Stars: Weighted Multiple-Instance Learning for Aspect-Based Sentiment Analysis, and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 2014 |
|
Exploiting Accelerometers to Improve Movement Classification for Prosthetics, and , in: International Conference on Rehabilitation Robotics, 2013 |
|
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , in: "IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)", 2008 |
|
Exploiting Contextual Information for Speech/Non-Speech Detection, , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Exploiting Eigenposteriors for Semi-supervised Training of DNN Acoustic Models with Sequence Discrimination, , and , in: Proceedings of Interspeech, 2017 |
|
Exploiting Hyperlinks to Learn a Retrieval Model, and , in: NIPS Workshop on Learning to Rank, 2005 |
|
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 525-530, IEEE, 2011 |
|
Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5690-5694, IEEE, 2016 |
|
Exploiting observers' judgements for nonverbal group interaction analysis, , and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 6, IEEE, 2011 |
|
Exploiting Scene Cues for Dropped Object Detection, , and , in: 9th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications., 2014 |
|
Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition, , , and , in: Proc. of Interspeech 2019, 2019 |
EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, pages 5370-5374, 2017 |
|
Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322 - 2326, IEEE, 2014 |
[DOI] |
Exploratory Study on Direct Prediction of Diabetes using Deep Residual Networks, , , and , in: Proceedings of the thematic conference on computational vision and medical image processing, 2017 |
Exploring Contextual Information in a Layered Framework for Group Action Recognition, , and , in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006 |
|
Exploring Dataset Similarities using PCA-based Feature Selection, , , and , in: Proceedings of ACII, Xi'an, pages 387-393, IEEE, 2015 |
[DOI] |
Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following, , , and , in: Int. Conf. Computer Vision and Pattern Recognition (CVPR), Workshop on Gaze Estimation and Prediction in the Wild, 2024 |
|
Extended Cauchy Machines, and , in: Proceedings of the International Conference on Neural Information Processing, 1996 |
Extending the Cooperative Dual-Task Space in Conformal Geometric Algebra, and , in: Proc. IEEE Intl Conf. on Robotics and Automation, 2024 |
|
Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2011 |
|
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies, and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), Istanbul, TR, pages 6, 2012 |
|
Extracting Information from Multimedia Meeting Collections, , and , in: 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005 |
|
Extracting Informative Textual Parts from Web Pages Containing User-Generated Content, , and , in: 12th International Conference on Knowledge Management and Knowledge Technologies, ACM ICPS, Graz, Austria, pages 4:1--4:8, ACM, 2012 |
[URL] |
Extracting Mobile Behavioral Patterns with the Distant N-Gram Topic Model, and , in: Proceedings of the IEEE International Symposium on Wearable Computers, Newcastle, 2012 |
|
Extracting Motifs from Time Series Generated by Concurrent Activities., , and , in: NIPS workshop on Learning and Planning from Batch Time Series Data, 2010 |
|
Extraction of Articulators in X-Ray Image Sequences, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
EYEDIAP: A Database for the Development and Evaluation of Gaze Estimation Algorithms from RGB and RGB-D Cameras, , and , in: Proceedings of the ACM Symposium on Eye Tracking Research and Applications, Safety Harbor, Florida, United States of America, ACM, 2014 |
[DOI] |
F
F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, and , in: Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-05), 2005 |
|
Face Anthropometry Aware Audio-visual Age Verification, and , in: ACM Multimedia, 2022 |
|
Face Authentication Using Adapted Local Binary Pattern Histograms, and , in: 9th European Conference on Computer Vision (ECCV), 2006 |
|
Face Authentication with Salient Local Features and Static Bayesian Network, and , in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007 |
|
Face identification from overlaid texts using Local Face Recurrent Patterns and CRF models, , , , and , in: IEEE International Conference on Image Processing 2014, Paris, IEEE, 2014 |
|
Face Liveness Detection Competition (LivDet-Face) - 2021, , , , , , , , , and , in: International Joint Conference on Biometrics, 2021 |
Face Liveness Detection Competition (LivDet-Face) - 2024, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
Face Recognition Using Lensless Camera, , and , in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024 |
[DOI] [URL] |
Face Recognition with Disparity Corrected Gabor Phase Differences, , and , in: Artificial Neural Networks and Machine Learning, Heidelberg, pages 411-418, Springer Berlin, 2012 |
[DOI] |
Face Reconstruction from Deep Facial Embeddings using a Convolutional Neural Network, , and , in: Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, IEEE, 2022 |
[DOI] [URL] |
Face Reconstruction from Facial Templates by Learning Latent Space of a Generator Network, and , in: Thirty-seventh Conference on Neural Information Processing Systems, 2023 |
[URL] |
Face Reconstruction from Partially Leaked Facial Embeddings, and , in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024 |
[DOI] [URL] |
Face Verification Using Adapted Generative Models, , and , in: The 6th International Conference on Automatic Face and Gesture Recognition, FG2004, IEEE, 2004 |
|
Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, , and , in: Proceedings of the 11th International Conference of the Biometrics Special Interest Group, Darmstadt, Germany, pages 397-408, GI-Edition, 2012 |
|
Face Verification using MLP and SVM, and , in: XI Journees NeuroSciences et Sciences pour l'Ingenieur (NSI 2002), 2002 |
|
FaceTube: predicting personality from facial expressions of emotion in online conversational video, , and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2012 |
|
Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, , in: International IEEE Workshop on Neural Networks for Signal Processing (NNSP 02), 2002 |
|
Facing Employers and Customers: What Do Gaze and Expressions Tell About Soft Skills?, , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 121-126, ASSOC COMPUTING MACHINERY, 2018 |
[DOI] |
Factors that Affect Personalization of Robots for Older Adults, , and , in: CONCATENATE Workshop at HRI 2023 in Stockholm, Sweden, 2023 |
[URL] |
Fairness Index Measures to Evaluate Bias in Biometric Recognition, and , in: International Conference on Pattern Recognition Workshops, 2022 |
|
Far-field ASR Using Low-rank and Sparse Soft Targets from Parallel Data, , and , in: IEEE Workshop on Spoken Language Technology, Athens, GREECE, pages 581-587, IEEE, 2018 |
|
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011 |
|
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , in: Workshop on Searching Spontaneous Conversational Speech at SIGIR, 2008 |
|
Fast Bounding Box Estimation based Face Detection, and , in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010 |
[URL] |
Fast cross-correlation based wrist vein recognition algorithm with rotation and translation compensation, , , and , in: Sixth International Workshop on Biometrics and Forensics, 2018 |
|
Fast Face Detection using MLP and FFT, , and , in: Proc. Second International Conference on Audio and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
|
Fast human detection from videos using covariance features, and , in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008 |
|
Fast K-Means with Accurate Bounds, and , in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016 |
Fast Language Adaptation Using Phonological Information, , and , in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 2459-2463, 2018 |
[DOI] |
Fast latent semantic indexing of spoken documents by using self-organizing maps, , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'2000, 2000 |
|
Fast Object Detection with Entropy-Driven Evaluation, , , and , in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013 |
Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, , and , in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011 |
|
Fast Transformers with Clustered Attention, , and , in: Proceedings of the International Conference on Neural Information Processing Systems, 2020 |
FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013 |
[DOI] |
Feature Extraction for Multi-Class BCI using Canonical Variates Analysis, , , , and , in: Proceedings of the IEEE International Symposium on Intelligent Signal Processing, 2007 |
|
Feature Representations for Automatic Meerkat Vocalization Classification, , , and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
Feature Switching in the i-vector Framework for Speaker Verification, , , , and , in: Proc. of Interspeech 2014, pages 5, 2014 |
Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, and , in: Proceedings of Interspeech, pages 156-160, 2023 |
[DOI] [URL] |
Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, , , , , and , in: Proceedings of ICASSP 2008, Las Vegas, USA, 2008 |
|
Finding Audio-Visual Events in Informal Social Gatherings, , , and , in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011 |
|
Finding groups of people in Google news, and , in: ACM Int. Conf. on Human-Centered Multimedia (HCM), 2006 |
|
Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction, , , , , , , , and , in: Proceedings of WMT 2016 (First Conference on Machine Translation), Association for Computational Linguistics, Berlin, Germany, pages 525–542, 2016 |
[URL] |
Findings of the IWSLT 2023 evaluation campaign, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the IWSLT conference, 2023 |
Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2024 |
|
Finger vein Liveness Detection Using Motion Magnification, , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-7, 2015 |
[DOI] |
Flickr Hypergroups, , , , and , in: Proceedings of the 17th ACM International Conference on Multimedia, 2009 |
|
Floor Holder Detection and End of Speaker Turn Prediction in Meetings, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010 |
|
FlowBoost - Appearance Learning from Sparsely Annotated Video, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2011 |
Fourier movement primitives: an approach for learning rhythmic robot skills from demonstrations, , and , in: Robotics: Science and Systems, 2020 |
|
Framing the News: From Human Perception to Large Language Model Inferences, and , in: International Conference on Multimedia Retrieval (ICMR '23), June 12--15, 2023, Thessaloniki, Greece, 2023 |
|
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding, , , and , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007 |
|
From Foursquare to my Square: Learning Check-in Behavior from Multiple Sources, , and , in: The 7th International AAAI Conference on Weblogs and Social Media, 2013 |
|
From Image-level to Pixel-level Labeling with Convolutional Networks, and , in: Computer Vision and Patter Recognition (CVPR), Boston, MA, pages 1713-1721, IEEE, 2015 |
[DOI] [URL] |
From missing data to maybe useful data: soft data modelling for noise robust ASR, , and , in: Proc. WISP, 2001 |
|
From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, , and , in: ISCA ITRW ASR2000, 2000 |
|
From N to N+1: Multiclass Transfer Incremental Learning, , and , in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013 |
|
From Speech to Personality: Mapping Voice Quality and Intonation into Personality Differences, , , and , in: in Proceedings of ACM Multimedia 2012, 2012 |
|
From Undercomplete to Sparse Overcomplete Autoencoders to Improve LF-MMI Speech Recognition, and , in: Proceedings of Interspeech Conference, 2022 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , in: Interspeech 2008, 2008 |
|
Full-Gradient Representation for Neural Network Visualization, and , in: Advances in Neural Information Processing Systems, 2019 |
[URL] |
Fully Automatic Grading of Retinal Vasculitis on Fluorescein Angiography Time-lapse from Real-world Data in Clinical Settings, , , , , , , and , in: 2023 IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS), L'Aquila, Italy, 2023, pages 689-693, 2023 |
[DOI] [URL] |
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010, Istanbul, Turkey, 2010 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , in: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, Dallas, Texas, USA, pages 97-104, ACM, 2013 |
|
Fusion of Acoustic and Linguistic Information Using Supervised Autoencoder for Improved Emotion Recognition, , and , in: 2nd Multimodal Sentiment Analysis Challenge (MuSe '21), October 24, 2021, Virtual Event, China, 2021 |
[DOI] |
Fusion of Structural and Color Local Descriptors for Enhanced Object Recognition, and , in: Proceedings IEEE WIAMIS 2004(5th International Workshop on Image Analysis for Multimedia Interactive Services,',','), 21-23 April, 2004, Lisboa, Portugal, 2004 |
|
G
Gain Elimination form Backpropagation Neural Networks, , and , in: Proceedings of the International Conference on Neural Networks, IEEE, Perth, IEEE, 1995 |
Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition, , and , in: Proceedings of IEEE TENCON, 2013 |
|
GAP: Differentially Private Graph Neural Networks with Aggregation Perturbation, , , and , in: 32nd USENIX Security Symposium (USENIX Security 23), 2023 |
|
Gaussian Mixture Regression on Symmetric Positive Definite Matrices Manifolds: Application to Wrist Motion Estimation with sEMG, and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 59-64, 2017 |
[URL] |
Gaze Estimation From Multimodal Kinect Data, and , in: IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, Providence, RI, USA, 2012 |
[DOI] |
Gender Classification by LUT based boosting of Overlapping Block Patterns, , and , in: Scandinavian Conference on Image Analysis, pages 530-542, Springer International Publishing, 2015 |
[DOI] [URL] |
Generalized Policy Iteration using Tensor Approximation for Hybrid Control, , and , in: International Conference on Learning Representations (ICLR), 2024 |
|
Generalized temporal sampling with active illumination in optical microscopy, and , in: Proceeding of the SPIE Conference Optics and Photonics, Wavelets and Sparsity XVIII, SPIE, San Diego, California, United States, SPIE, 2019 |
|
Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models, , , , , , , and , in: 13th Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018 |
|
Generating Calligraphic Trajectories with Model Predictive Control, , and , in: Proc. 43rd Conf. on Graphics Interface, Edmonton, AL, Canada, pages 132-139, 2017 |
[DOI] |
Generating Exact Lattices in The WFST Framework, , , , , , , , , , , , and , in: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing., The 37th International Conference on Acoustics, Speech, and Signal Processing, Kyoto, JP, Kyoto, Japan, pages 4213-4216, IEEE Signal Processing Societ, 2012 |
[DOI] |
Generating Master Faces for Use in PerformingWolf Attacks on Face Recognition Systems, , , and , in: International Join Conference on Biometrics, 2020 |
|
Generative adversarial training of product of policies for robust and adaptive movement primitives, , and , in: In Proc. Conference on Robot Learning (CoRL), 2020 |
|
Generative Independent Component Analysis for EEG Classification, and , in: European Symposium on Artificial Neural Networks ESANN, 2005 |
|
Generative Temporal ICA for Classification in Asynchronous BCI Systems, and , in: The 2nd International IEEE EMBS Conference On Neural Engineering, 2005 |
|
Geodesic Convolutional Shape Optimization, , , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras, and , in: IEEE Computer Vision and Pattern Recognition Conference, Columbus, Ohio,USA, pages 1773-1780, IEEE, 2014 |
[DOI] |
Geometry-aware Control and Learning in Robotics, and , in: R:SS Pioneers Workshop, 2018 |
Geometry-aware Robot Manipulability Transfer, , and , in: R:SS Workshop on Learning and Inference in Robotics: Integrating Structure, Priors and Models, 2018 |
|
Geometry-aware Tracking of Manipulability Ellipsoids, , , and , in: Robotics: Science and Systems, Pittsburgh, USA, 2018 |
|
GeoNeRF: Generalizing NeRF with Geometry Priors, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2022 |
[URL] |
Given that, Should I Respond? Contextual Addressee Estimation in Multi-Party Human-Robot Interactions, and , in: Proceedings of Human Robot Interaction (HRI) Conference, 2013 |
|
GLoFool: global enhancements and local perturbations to craft adversarial images, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Gradient estimates of return distributions, and , in: PASCAL Workshop on Principled Methods of Trading Exploration and Exploitation, 2005 |
|
Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition, , , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
Graph Refinement for Coreference Resolution, and , in: Findings of Association for >Computational Linguistics: ACL 2022, 2022 |
Graph-to-Graph Transformer for Transition-based Dependency Parsing, and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2020 |
[URL] |
Graph-to-Graph Transformer for Transition-based Dependency Parsing, and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, ACL, Online, pages 3278–3289, Association for Computational Linguistics, 2020 |
[URL] |
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10--13, 2021 |
[DOI] |
Grapheme and Multilingual Posterior Features for Under-Resourced Speech Recognition: A Study on Scottish Gaelic, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
Grapheme-based Automatic Speech Recognition using KL-HMM, , , and , in: Proceedings of Interspeech, 2011 |
|
Graphical representation of meetings on mobile devices, , and , in: MobileHCI 2008 (10th International Conference on Human-Computer Interaction with Mobile Devices and Services, Demonstrations Session), Amsterdam, 2008 |
|
GroupUs: Smartphone Proximity Data and Human Interaction Type Mining, and , in: 15th annual International Symposium on Wearable Computers, San Francisco, USA, 2011 |
|
H
Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, and , in: British Machine Vision Conference 2009, 2009 |
|
Hand Posture Classification and Recognition using the Modified Census Transform, , and , in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006 |
|
Handling acoustic variation in dysarthric speech recognition systems through model combination, and , in: Proceedings of Interspeech, 2021 |
|
Hands Free Audio Analysis from Home Entertainment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Handwriting Recognition, , in: Second Asian Conference on Computer Vision (ACCV'95,',','), Singapore, 1995 |
Handwritten Digit Recognition with Binary Optical Perceptron, , , and , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
|
Haptic Feedback Compared with Visual Feedback for BCI, , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Hardware-Friendly Learning Algorithms for Neural Networks: An Overview, and , in: Proceedings of the Fifth International Conference on Microelectronics for Neural Networks and Fuzzy Systems: MicroNeuro'96, EPFL and CSEM, Lausanne, Switzerland, IEEE Computer Society Press, 1996 |
|
Head Nod Detection from a Full 3D Model, , and , in: Proceedings of the ICCV 2015, pages 528-536, 2015 |
|
Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, , in: International IEEE Conference on Multimodal Interfaces (ICMI 02), 2002 |
|
Health Talk: Understanding Practices of Popular Professional YouTubers, , , , and , in: Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia, 2022 |
HEAT: Iterative Relevance Feedback with One Million Images, and , in: Proceedings of the IEEE International Conference on Computer Vision, pages 2118-2125, 2011 |
Heterogeneous Face Recognition Using Domain Invariant Units, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Heterogeneous Face Recognition using Inter-Session Variability Modelling, and , in: IEEE Computer Society Workshop on Biometrics, Las Vegas - USA, IEEE, 2016 |
|
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Hierarchical Integration of Phonetic and Lexical Knowledge in Phone Posterior Estimation, and , in: ICASSP'08, 2008 |
|
Hierarchical Multi-Stream Posterior Based Speech Recognition System, , and , in: Proceedings MLMI workshop, 2005 |
|
Hierarchical Multi-task learning framework for Isometric-Speech Language Translation, , , and , in: ACL, 2022 |
|
Hierarchical Multilayer Perceptron based Language Identification, , and , in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010 |
|
Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , in: Interspeech 2007, 2007 |
|
Hierarchical Penalization, , and , in: Advances in Neural Information Processing Systems 21, 2007 |
|
Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences, , , , and , in: Proceedings of the IEEE International Conference of Robotics and Automation, 2016 |
Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, , , and , in: Proceedings of the 10thAnnual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009 |
|
Hierarchical speaker clustering methods for the NIST i-vector Challenge, , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
Hierarchical Tandem Features for ASR in Mandarin, , and , in: Proceedings of Interspeech, 2011 |
High Frequency Bands and Estimated Local Field Potentials to Improve Single-Trial Classification of Electroencephalographic Signals, , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Higher-Order Statistics in Visual Object Recognition, , in: Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 1993 |
|
Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , in: MLMI 2008, 2008 |
|
Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, , and , in: Interspeech 2008, 2008 |
|
Hill-Climbing Attack to an Eigenface-Based Face Verification System, , , , and , in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009 |
|
HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing, , , and , in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, and , in: European Symposium on Artificial Neural Networks ESANN, 2004 |
|
HMM-based Approaches to Model Multichannel Information in Sign Language inspired from Articulatory Features-based Speech Processing, , , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
HMM-based Non-native Accent Assessment using Posterior Features, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
|
HMM2- A Novel Approach to HMM Emission Probability Estimation, , and , in: International Conference on Spoken Langugae Processing (ICSLP 2000), 2000 |
|
HMM2- Extraction of Formant Features and their Use for Robust ASR, , and , in: European Conference on Speech Communication and Technology (Eurospeech 2001), 2001 |
|
How Comparable are Parallel Corpora? Measuring the Distribution of General Vocabulary and Connectives, , , and , in: Proceedings of 4th Workshop on Building and Using Comparable Corpora, ACL, Portland, OR, pages 78--86, 2011 |
|
How Did Europe’s Press Cover Covid-19 Vaccination News? A Five-Country Analysis, and , in: MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, 2022 |
[DOI] [URL] |
How Do You Like Your Virtual Agent?: Human-Agent Interaction Experience through Nonverbal Features and Personality Traits, , and , in: Human Behavior Understanding, pages 1-15, Springer, 2014 |
|
<