Publications of Idiap sorted by journal and type
Publications of type Idiap-Internal-RR
2021
Modeling Source and System characteristics using Zero Frequency Filtering for Voice Activity Detection, , and , Idiap-Internal-RR-80-2021 |
Publications of type Idiap-RR
2024
EdgeFace: Efficient Face Recognition Model for Edge Devices, , , , and , Idiap-RR-01-2024 |
|
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , Idiap-RR-10-2024 |
|
Feature Representations for Automatic Meerkat Vocalization Classification, , , and , Idiap-RR-06-2024 |
|
Posterior-based analysis of spatio-temporal features for Sign Language Assessment, , , , and , Idiap-RR-11-2024 |
|
Sentiment Analysis using pretrained LLMs, , and , Idiap-RR-05-2024 |
|
Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, and , Idiap-RR-09-2024 |
|
2023
Approximating Optimal Morphing Attacks using Template Inversion, , and , Idiap-RR-07-2023 |
|
Attacking Face Recognition with T-shirts: Database, Vulnerability Assessment and Detection, and , Idiap-RR-08-2023 |
|
Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, and , Idiap-RR-09-2023 |
|
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , Idiap-RR-01-2023 |
[URL] |
Idiap Scientific Report 2022, , , , , , , , , , , , , , , , , and , Idiap-RR-05-2023 |
|
Implementing contextual biasing in GPU decoder for online ASR, , , , , , and , Idiap-RR-02-2023 |
|
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, , , and , Idiap-RR-03-2023 |
|
VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, , , and , Idiap-RR-04-2023 |
|
2022
A Comprehensive Evaluation on Multi-channel Biometric Face Presentation Attack Detection, , and , Idiap-RR-02-2022 |
|
An anomaly detection approach for backdoored neural networks: face recognition as a case study, and , Idiap-RR-08-2022 |
[URL] |
Applying Attention Based Models for Detecting Cognitive Processes and Mental Health Conditions, , , and , Idiap-RR-01-2022 |
|
End-to-end Accented Speech Recognition, , and , Idiap-RR-04-2022 |
|
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, , , , and , Idiap-RR-06-2022 |
|
IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, , , , , , and , Idiap-RR-13-2022 |
|
IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, , , , , , and , Idiap-RR-12-2022 |
|
On the detection of morphing attacks generated by GANs, and , Idiap-RR-07-2022 |
|
Robust Face Presentation Attack Detection with Multi-channel Neural Networks, and , Idiap-RR-03-2022 |
|
SPARSE AUTOENCODERS TO ENHANCE SPEECH RECOGNITION, and , Idiap-RR-10-2022 |
|
SPEECH MODELING USING SPARSE AUTOENCODERS, and , Idiap-RR-11-2022 |
|
2021
Adjustable Deterministic Pseudonymization of Speech, , and , Idiap-RR-12-2021 |
|
An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, , and , Idiap-RR-03-2021 |
|
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, , , , and , Idiap-RR-19-2021 |
Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation, and , Idiap-RR-21-2021 |
BertOdia: BERT pre-training for low resource Odia language, , , , , and , Idiap-RR-16-2021 |
|
BERTraffic: A Robust BERT-Based Approach for Speaker Change Detection and Role Identification of Air-Traffic Communications, , , , , , and , Idiap-RR-15-2021 |
Broadcast Media Content Categorization Using Low-Resolution Concepts, , , , and , Idiap-RR-06-2021 |
|
CHALLENGES IN BROADCAST MEDIA CONTENT CATEGORIZATION, , and , Idiap-RR-02-2021 |
|
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, , and , Idiap-RR-04-2021 |
|
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , Idiap-RR-14-2021 |
[URL] |
Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, , , , and , Idiap-RR-05-2021 |
[URL] |
Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR, , , , , , and , Idiap-RR-22-2021 |
|
Improving callsign recognition with air-surveillance data in air-traffic communication, , , and , Idiap-RR-20-2021 |
[URL] |
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, , , , , and , Idiap-RR-09-2021 |
|
Multimodal Neural Machine Translation System for English to Bengali, , , , , , and , Idiap-RR-13-2021 |
|
NLPHut’s Participation at WAT2021, , , , , , , and , Idiap-RR-10-2021 |
|
Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), , , , , , , , and , Idiap-RR-07-2021 |
|
Supervised Speech Representation Learning for Parkinson's Disease Classification, and , Idiap-RR-08-2021 |
|
Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, , , and , Idiap-RR-11-2021 |
|
2020
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , Idiap-RR-01-2020 |
|
AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, , and , Idiap-RR-32-2020 |
|
Can Your Face Detector Do Anti-spoofing? Face Presentation Attack Detection with a Multi-Channel Face Detector, and , Idiap-RR-12-2020 |
|
Comparison of Subword Segmentation Methods for Open-vocabulary ASR using a Difficulty Metric, , , and |
|
COMPARISON OF SUBWORD SEGMENTATION METHODS FOR OPEN-VOCABULARY END-TO-END SPEECH RECOGNITION, , , and , Idiap-RR-34-2020 |
|
Deepfake detection: humans vs. machines, and , Idiap-RR-36-2020 |
|
Extractive Odia Text Summarization System: An OCR based Approach, , Idiap-RR-02-2020 |
|
Face Recognition Systems Under Spoofing Attacks, , , and , Idiap-RR-18-2020 |
|
German News Article Classification : A Multichannel CNN Approach, , and , Idiap-RR-09-2020 |
|
Gradient Alignment in Deep Neural Networks, and , Idiap-RR-14-2020 |
|
Idiap Abstract Text Summarization System for German Text Summarization Task, and , Idiap-RR-03-2020 |
|
Idiap NMT System for WAT 2019 Multimodal Translation Task, and , Idiap-RR-04-2020 |
|
Idiap Submission to Swiss-German Language Detection Shared Task, , , , and , Idiap-RR-11-2020 |
|
Language model domain adaptation for automatic speech recognition, , and , Idiap-RR-05-2020 |
|
LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, , and , Idiap-RR-40-2020 |
[URL] |
Learning One Class Representations for Presentation Attack Detection using Multi-channel Convolutional Neural Networks, and , Idiap-RR-15-2020 |
|
Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, , , and , Idiap-RR-26-2020 |
|
OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation, , , , , and , Idiap-RR-08-2020 |
|
On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, and , Idiap-RR-30-2020 |
|
Plug and Play Autoencoders for Conditional Text Generation, , , , and , Idiap-RR-24-2020 |
|
Smartphone Multi-modal Biometric Authentication: Database and Evaluation, , , , , , , , and , Idiap-RR-17-2020 |
[URL] |
Taming GANs with Lookahead, , , and , Idiap-RR-20-2020 |
[URL] |
The High-Quality Wide Multi-Channel Attack (HQ-WMCA) database, , , , and , Idiap-RR-22-2020 |
|
Vulnerability Analysis of Face Morphing Attacks from Landmarks and Generative Adversarial Networks, , , and , Idiap-RR-38-2020 |
|
2019
AN END-TO-END NETWORK TO SYNTHESIZE INTONATION USING A GENERALIZED COMMAND RESPONSE MODEL, , , , and , Idiap-RR-05-2019 |
|
CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, , and , Idiap-RR-06-2019 |
[URL] |
Data-Driven Movement Subunit Extraction from Skeleton Information for Modeling Signs and Gestures, , and , Idiap-RR-02-2019 |
|
EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, and , Idiap-RR-01-2019 |
|
Idiap submission to the NIST SRE 2019 Speaker Recognition Evaluation, , , , and , Idiap-RR-15-2019 |
|
INVESTIGATING TIME DELAY NEURAL NETWORK (TDNN) FOR LANGUAGE MODELING IN LOW RESOURCE AUTOMATIC SPEECH RECOGNITION, , , and , Idiap-RR-13-2019 |
|
Learning Entailment-Based Sentence Embeddings from Natural Language Inference, , and , Idiap-RR-20-2019 |
[URL] |
On the Tunability of Optimizers in Deep Learning, , , , and , Idiap-RR-19-2019 |
[URL] |
Processing Megapixel Images with Deep Attention-Sampling Models, and , Idiap-RR-07-2019 |
[URL] |
SPOKEN LANGUAGE IDENTIFICATION USING LANGUAGE BOTTLENECK FEATURES, , , , , and , Idiap-RR-08-2019 |
|
STACKED NEURAL NETWORKS WITH PARAMETER SHARING FOR MULTILINGUAL LANGUAGE MODELING, , , , , and , Idiap-RR-12-2019 |
|
The Speed Submission to DIHARD II: Contributions & Lessons Learned, , , , , , , , , , , , , and , Idiap-RR-14-2019 |
|
TOWARDS MULTILINGUAL SIGN LANGUAGE RECOGNITION, , and , Idiap-RR-16-2019 |
|
Understanding Raw Waveform based CNN through Low-rank Spectro-Temporal Decoupling, , and , Idiap-RR-11-2019 |
|
2018
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , Idiap-RR-10-2018 |
|
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , Idiap-RR-02-2018 |
|
DeepFakes: a New Threat to Face Recognition? Assessment and Detection, and , Idiap-RR-18-2018 |
|
Designing second order recurrent neural networks for prosody modelling, , Idiap-RR-16-2018 |
|
DNN based speaker embedding using content information for text-dependent speaker verification, , , and , Idiap-RR-06-2018 |
|
Gradient-based spectral visualization of CNNs using raw waveforms, , , and , Idiap-RR-11-2018 |
|
Implémentation d'un algorithme de réduction de taille des réseaux de neurones, , Idiap-RR-03-2018 |
|
Knowledge Transfer with Jacobian Matching, and , Idiap-RR-04-2018 |
[URL] |
Modelling glottal source information for depression detection, , and , Idiap-RR-13-2018 |
|
Not All Samples Are Created Equal: Deep Learning with Importance Sampling, and , Idiap-RR-12-2018 |
|
Semi-blind spatially-variant deconvolution in optical microscopy with local Point Spread Function estimation by use of Convolutional Neural Networks, and , Idiap-RR-07-2018 |
|
Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, , , and , Idiap-RR-09-2018 |
|
2017
A Sub-Quadratic Exact Medoid Algorithm, and , Idiap-RR-19-2017 |
|
Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, , , , , and , Idiap-RR-16-2017 |
|
Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, , and , Idiap-RR-08-2017 |
|
CONTENT NORMALIZATION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION, , , and , Idiap-RR-31-2017 |
|
Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models, , and , Idiap-RR-26-2017 |
|
Evaluating Attention Networks for Anaphora Resolution, , , and , Idiap-RR-27-2017 |
|
EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , Idiap-RR-04-2017 |
|
From Research to Reality: Evaluation of a Single-Computer Real-Time LVCSR System for Speech-Based Retrieval, , , and , Idiap-RR-12-2017 |
|
Long Term Spectral Statistics for Voice Presentation Attack Detection, , , and , Idiap-RR-11-2017 |
|
Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, and , Idiap-RR-06-2017 |
|
Maya Codical Glyph Segmentation: A Crowdsourcing Approach, , and , Idiap-RR-01-2017 |
|
Multilingual Hierarchical Attention Networks for Document Classification, and , Idiap-RR-17-2017 |
[URL] |
NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL CLASSIFICATION, and , Idiap-RR-28-2017 |
|
Perceptual Information Loss due to Impaired Speech Production, , and , Idiap-RR-20-2017 |
|
Real-time Multiple Head Tracking Using Texture and Colour Cues, and , Idiap-RR-02-2017 |
|
Supervised Gaze Bias Correction for Gaze Coding in Interactions, and , Idiap-RR-23-2017 |
|
Template-matching for Text-dependent Speaker Verification, , , and , Idiap-RR-32-2017 |
|
The SIWIS French Speech Synthesis Database – Design and recording of a high quality French database for speech synthesis, , , and , Idiap-RR-03-2017 |
|
Topic and Sentiment in Phrase-Based Statistical Machine Translation, , and , Idiap-RR-10-2017 |
|
Towards directly modeling raw speech signal for speaker verification using CNNs, , and , Idiap-RR-30-2017 |
|
Towards Document-Level Neural Machine Translation, , Idiap-RR-25-2017 |
|
Towards Weakly Supervised Acoustic Subword Unit Discovery and Lexicon Development Using Hidden Markov Models, , and , Idiap-RR-15-2017 |
|
Using Coreference Links to Improve Spanish-to-English Machine Translation, and , Idiap-RR-07-2017 |
|
2016
An Analysis of Rhythmic Staccato-Vocalization Based on Frequency Demodulation for Laughter Detection in Conversational Meetings, , , and , Idiap-RR-02-2016 |
|
Cognitive speech coding, and , Idiap-RR-27-2016 |
|
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, , , and , Idiap-RR-11-2016 |
|
DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , Idiap-RR-08-2016 |
|
End-to-End Acoustic Modeling using Convolutional Neural Networks for Automatic Speech Recognition, , and , Idiap-RR-18-2016 |
|
Explicit Suggestion of Query Terms for News Search using Topic Models and Word Embeddings, and , Idiap-RR-21-2016 |
|
Fast K-Means with Accurate Bounds, and , Idiap-RR-17-2016 |
|
Feature mapping using far-field microphones for distant speech recognition, , , and , Idiap-RR-20-2016 |
|
Implementation of the Standard I-vector System for the Kaldi Speech Recognition Toolkit, , , and , Idiap-RR-26-2016 |
|
Information Theoretic Analysis of Production-Perception Efficiency: Case Study of Speech Pathology, , and , Idiap-RR-30-2016 |
|
INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, , and , Idiap-RR-09-2016 |
|
Intonation atom based emphasis transfer, and , Idiap-RR-14-2016 |
|
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , Idiap-RR-22-2016 |
|
Joint Operation of Voice Biometrics and Presentation Attack Detection, and , Idiap-RR-25-2016 |
[URL] |
Low-Rank Representation For Enhanced Deep Neural Network Acoustic Models, , Idiap-RR-05-2016 |
|
Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, , , and , Idiap-RR-04-2016 |
|
On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, , and , Idiap-RR-07-2016 |
[URL] |
On the impact of non-modal phonation on phonological features, , , , , , , , , , , , , and , Idiap-RR-28-2016 |
|
Overview of BTAS 2016 Speaker Anti-spoofing Competition, , , , , , , , , , , , , , , and , Idiap-RR-24-2016 |
[URL] |
Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , Idiap-RR-10-2016 |
|
Probabilistic Amplitude Demodulation features in Speech Synthesis for Improving Prosody, , and , Idiap-RR-12-2016 |
|
Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, , and , Idiap-RR-16-2016 |
|
Redundant Hash Addressing for Large-Scale Query by Example Spoken Query Detection, , and , Idiap-RR-31-2016 |
|
Sound Pattern Matching for Automatic Prosodic Event Detection, , , , and , Idiap-RR-03-2016 |
|
Sparse Hidden Markov Models for Exemplar-based Speech Recognition Using Deep Neural Network Posterior Features, , and , Idiap-RR-19-2016 |
|
Sparse Subspace Modeling for Query by Example Spoken Term Detection, , and , Idiap-RR-01-2016 |
|
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, , and , Idiap-RR-06-2016 |
|
The SIWIS database: a multilingual speech database with acted emphasis, , , , , , , , , , , and , Idiap-RR-13-2016 |
|
Twitter Sentiment Analysis (Almost) from Scratch, , and , Idiap-RR-15-2016 |
|
Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), and , Idiap-RR-29-2016 |
|
2015
"The Sum of Its Parts": Joint Learning of Word and Phrase Representations with Autoencoders, and , Idiap-RR-21-2015 |
|
A New Identity for the Least-square Solution of Overdetermined Set of Linear Equations, , and , Idiap-RR-35-2015 |
|
Acoustic Data-Driven Grapheme-to-Phoneme Conversion in the Probabilistic Lexical Modeling Framework, , and , Idiap-RR-10-2015 |
|
An Empirical Model of Emphatic Word Detection, and , Idiap-RR-11-2015 |
|
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , Idiap-RR-23-2015 |
|
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , Idiap-RR-12-2015 |
|
Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition; Comparison with the Envelope-Variance Measure, , , , and , Idiap-RR-30-2015 |
|
COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, , and , Idiap-RR-17-2015 |
|
EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, , , and , Idiap-RR-16-2015 |
|
Exploiting foreign resources for DNN-based ASR, , , , and , Idiap-RR-27-2015 |
|
HMM-based Non-native Accent Assessment using Posterior Features, , and , Idiap-RR-32-2015 |
|
Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, , and , Idiap-RR-18-2015 |
|
Incremental Syllable-Context Phonetic Vocoding, , , , and , Idiap-RR-05-2015 |
|
Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, , , and , Idiap-RR-20-2015 |
|
Joint Similarity Learning for Predicting Links in Networks with Multiple-type Links, and , Idiap-RR-29-2015 |
|
KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, and , Idiap-RR-19-2015 |
|
Learning linearly separable features for speech recognition using convolutional neural networks, , and , Idiap-RR-24-2015 |
[URL] |
Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, , , , , and , Idiap-RR-09-2015 |
|
Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, and , Idiap-RR-14-2015 |
|
On the Application of Automatic Subword Unit Derivation and Pronunciation Generation for Under-Resourced Language ASR: A Study on Scottish Gaelic, , and , Idiap-RR-13-2015 |
|
Phonological vocoding using artificial neural networks, , and , Idiap-RR-04-2015 |
|
Phrase-based Image Captioning, , and , Idiap-RR-08-2015 |
|
Posterior-Based Multi-Stream Formulation To Combine Multiple Grapheme-to-Phoneme Conversion Techniques, and , Idiap-RR-33-2015 |
|
Preliminary Work on Speaker Adaptation for DNN-Based Speech Synthesis, , and , Idiap-RR-02-2015 |
|
Simple Image Description Generator via a Linear Phrase-based Model, , and , Idiap-RR-22-2015 |
|
Speech vocoding for laboratory phonology, , and , Idiap-RR-07-2015 |
|
Syllabic Pitch Tuning for Neutral-to-Emotional Voice Conversion, , and , Idiap-RR-31-2015 |
|
Syntactic Parsing of Morphologically Rich Languages Using Deep Neural Networks, and , Idiap-RR-25-2015 |
|
Towards Multiple Pronunciation Generation in Acoustic G2P Conversion Framework, , and , Idiap-RR-34-2015 |
|
Transfer Learning through Greedy Subset Selection, , and , Idiap-RR-26-2015 |
|
2014
Acoustic and Lexical Resource Constrained ASR using Language-Independent Acoustic Model and Language-Dependent Probabilistic Lexical Model, and , Idiap-RR-02-2014 |
|
Articulatory Feature based Continuous Speech Recognition using Probabilistic Lexical Modeling, and , Idiap-RR-19-2014 |
|
Biometrics Evaluation under Spoofing Attacks, , and , Idiap-RR-12-2014 |
|
Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, , and , Idiap-RR-18-2014 |
|
Development of Bilingual ASR System for MediaParl Corpus, , , and , Idiap-RR-21-2014 |
|
Exemplar-based Sparse Representation for Posterior Features, , and , Idiap-RR-11-2014 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , Idiap-RR-06-2014 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , Idiap-RR-05-2014 |
|
EYEDIAP Database: Data Description and Gaze Tracking Evaluation Benchmarks, , and , Idiap-RR-08-2014 |
|
Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array, , , , , , and , Idiap-RR-17-2014 |
|
LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images., , , and , Idiap-RR-22-2014 |
|
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, , , and , Idiap-RR-04-2014 |
|
Raw Speech Signal-based Continuous Speech Recognition using Convolutional Neural Networks, , and , Idiap-RR-15-2014 |
|
Sparse Gammatone Signal Model Predicts Perceived Noise Intrusiveness, and , Idiap-RR-07-2014 |
|
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, , , and , Idiap-RR-10-2014 |
|
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , Idiap-RR-03-2014 |
|
Theoretical Analysis of Euclidean Distance Matrix Completion for Ad hoc Microphone Array Calibration, , Idiap-RR-20-2014 |
|
Topic-Level Extractive Summarization of Lectures and Meetings Using a Snippet Similarity Graph, and , Idiap-RR-09-2014 |
|
Weakly Supervised Object Segmentation with Convolutional Neural Networks, and , Idiap-RR-13-2014 |
|
2013
A Scalable Formulation of Probabilistic Linear Discriminant Analysis: Applied to Face Recognition, , , and , Idiap-RR-07-2013 |
[URL] |
ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, , , and , Idiap-RR-38-2013 |
|
Adaptation Experiments on French MediaParl ASR, , Idiap-RR-10-2013 |
|
An Open-source State-of-the-art Toolbox for Broadcast News Diarization, , , , , and , Idiap-RR-33-2013 |
|
Analyse non supervisée d'activités en vidéo surveillance pour l'analyse de scène et la détection d'événements anormaux, and , Idiap-RR-20-2013 |
[URL] |
Anti-spoofing in action: joint operation with a verification system, , and , Idiap-RR-19-2013 |
|
Automatic Speech Indexing System of Bilingual Video Parliament Interventions, , , , , and , Idiap-RR-25-2013 |
|
Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, , , , and , Idiap-RR-30-2013 |
|
Bias Adaptation for Vocal Tract Length Normalization, , , and , Idiap-RR-12-2013 |
|
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , Idiap-RR-01-2013 |
|
Convolutional Pitch Target Approximation Model for Speech Synthesis, and , Idiap-RR-05-2013 |
|
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , Idiap-RR-39-2013 |
|
End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks, , and , Idiap-RR-40-2013 |
|
Enhancing State Mapping-Based Cross-Lingual Speaker Adaptation using Phonological Knowledge in a Data-Driven Manner, and , Idiap-RR-08-2013 |
|
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, , and , Idiap-RR-13-2013 |
|
FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, , and , Idiap-RR-37-2013 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , Idiap-RR-31-2013 |
|
I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , Idiap-RR-34-2013 |
|
Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, and , Idiap-RR-14-2013 |
|
Investigating time-sensitive topic model approaches for action recognition, , and , Idiap-RR-26-2013 |
|
Is Deep Learning Really Necessary for Word Embeddings?, , and , Idiap-RR-44-2013 |
|
KL-HMM and Probabilistic Lexical Modeling, and , Idiap-RR-04-2013 |
|
MediaParl: Bilingual mixed language accented speech database, , , , , and , Idiap-RR-03-2013 |
|
On Modeling Context-dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, , and , Idiap-RR-43-2013 |
|
ON THE (UN)IMPORTANCE OF THE CONTEXTUAL FACTORS IN HMM-BASED SPEECH SYNTHESIS AND CODING, , and , Idiap-RR-06-2013 |
|
On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, , , and , Idiap-RR-35-2013 |
|
Probabilistic Lexical Modeling and Grapheme-based Automatic Speech Recognition, and , Idiap-RR-15-2013 |
|
Recurrent Convolutional Neural Networks for Scene Labeling, and , Idiap-RR-41-2013 |
|
Recurrent Convolutional Neural Networks for Scene Parsing, and , Idiap-RR-22-2013 |
|
Robust triphone mapping for acoustic modeling, , and , Idiap-RR-02-2013 |
|
Session Variability Modelling for Face Authentication, , , , and , Idiap-RR-17-2013 |
|
Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, and , Idiap-RR-42-2013 |
|
Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, and , Idiap-RR-27-2013 |
|
Statistical models for HMM/ANN hybrids, and , Idiap-RR-11-2013 |
|
Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, , and , Idiap-RR-24-2013 |
|
The 2013 Face Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , and , Idiap-RR-36-2013 |
|
The 2013 Speaker Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , Idiap-RR-32-2013 |
|
The 2nd Competition on Counter Measures to 2D Face Spoofing Attacks, , and , Idiap-RR-18-2013 |
|
Understanding Factors in Emotion Perception, and , Idiap-RR-28-2013 |
|
Unsupervised Methods for Activity Analysis and Detection of Abnormal Events, and , Idiap-RR-21-2013 |
|
Using out-of-language data to improve an under-resourced speech recognizer, , , and , Idiap-RR-09-2013 |
|
Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, and , Idiap-RR-23-2013 |
|
Word Embeddings through Hellinger PCA, and , Idiap-RR-29-2013 |
|
Learning Categories from Few Examples with Multi Model Knowledge Transfer, , and , Idiap-RR-16-2013 |
|
2012
A Survey on Language Modeling using Neural Networks, and , Idiap-RR-32-2012 |
|
An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, , and , Idiap-RR-29-2012 |
|
Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, , , and , Idiap-RR-20-2012 |
|
Automatic Social Role Recognition In Professional Meetings, and , Idiap-RR-35-2012 |
|
Baseline System for Automatic Speech Recognition with French GlobalPhone Database, and , Idiap-RR-26-2012 |
|
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , Idiap-RR-18-2012 |
|
Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, , , , , , , , , , , , , and , Idiap-RR-13-2012 |
|
Bob: a free signal processing and machine learning toolbox for researchers, , , , , and , Idiap-RR-25-2012 |
|
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, , and , Idiap-RR-15-2012 |
|
Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, , , and , Idiap-RR-11-2012 |
|
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , Idiap-RR-21-2012 |
|
Data-Driven Enhancement of State Mapping-Based Cross-Lingual Speaker Adaptation, , Idiap-RR-38-2012 |
|
Face detection using boosted Jaccard distance-based regression, , and , Idiap-RR-02-2012 |
|
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , Idiap-RR-01-2012 |
|
Grapheme and Multilingual Posterior Features For Under-Resource Speech Recognition: A Study on Scottish Gaelic, , and , Idiap-RR-34-2012 |
|
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , Idiap-RR-23-2012 |
|
IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, , and , Idiap-RR-36-2012 |
|
Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, , , and , Idiap-RR-07-2012 |
|
Improving Object Classification using Pose Information, , , and , Idiap-RR-30-2012 |
|
Integrating Language Identification to improve Multilingual Speech Recognition, , Idiap-RR-24-2012 |
|
Integrating Posterior Features and Self-Organizing Maps for Isolated Word Recognition without Dynamic Programming, and , Idiap-RR-17-2012 |
|
Progress report of a project in very low bit-rate speech coding, , and , Idiap-RR-08-2012 |
|
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , Idiap-RR-16-2012 |
|
Supervised and unsupervised Web-based language model domain adaptation, , , and , Idiap-RR-22-2012 |
|
The Kaldi Speech Recognition Toolkit, , , , , , , , , , , , and , Idiap-RR-04-2012 |
|
The Vernissage Corpus: A Multimodal Human-Robot-Interaction Dataset, , , , , , , , , and , Idiap-RR-33-2012 |
|
Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, and , Idiap-RR-05-2012 |
|
Transfer Learning of Visual Concepts across Robots: a Discriminative Approach, , and , Idiap-RR-06-2012 |
|
Translation Error Spotting from a User's Point of View, , Idiap-RR-31-2012 |
|
Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, and , Idiap-RR-14-2012 |
|
VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, , , and , Idiap-RR-12-2012 |
|
Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, , and , Idiap-RR-28-2012 |
|
Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, , , and , Idiap-RR-03-2012 |
|
2011
A Large-Scale Database of Images and Captions for Automatic Face Naming, , , and , Idiap-RR-26-2011 |
|
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , Idiap-RR-31-2011 |
|
Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, and , Idiap-RR-38-2011 |
|
AN INTEGRATED FRAMEWORK FOR MULTI-CHANNEL MULTI-SOURCE LOCALIZATION AND VOICE ACTIVITY DETECTION, , , , and , Idiap-RR-16-2011 |
|
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , Idiap-RR-08-2011 |
|
BROADBAND BEAMPATTERN FOR MULTI-CHANNEL SPEECH ACQUISITION AND DISTANT SPEECH RECOGNITION, , and , Idiap-RR-39-2011 |
|
Continuous Speech Recognition using Boosted Binary Features, , and , Idiap-RR-35-2011 |
|
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , Idiap-RR-01-2011 |
|
Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, , and , Idiap-RR-37-2011 |
|
Finding Information in Multimedia Records of Meetings, , and , Idiap-RR-32-2011 |
|
HEAT: Iterative Relevance Feedback with One Million Images, and , Idiap-RR-33-2011 |
|
Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, , Idiap-RR-20-2011 |
|
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , Idiap-RR-19-2011 |
|
Integrating Articulatory Features using Kullback-Leibler Divergence based Acoustic Model for Phoneme Recognition, and , Idiap-RR-02-2011 |
|
Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, , , and , Idiap-RR-28-2011 |
|
Intuitive Recipes for Uncertainty Decoding with SNR Features for Noise Robust ASR, and , Idiap-RR-23-2011 |
|
Just-in-Time Multimodal Association and Fusion from Home Entertainment, , , and , Idiap-RR-10-2011 |
|
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , Idiap-RR-13-2011 |
|
Learning from Candidate Labeling Sets, and , Idiap-RR-27-2011 |
|
Learning from Images with Captions Using the Maximum Margin Set Algorithm, , , and , Idiap-RR-30-2011 |
|
LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, , and , Idiap-RR-14-2011 |
|
Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition, , and , Idiap-RR-04-2011 |
|
Multi-party Speech Recovery Exploiting Structured Sparsity Models, , , and , Idiap-RR-22-2011 |
|
Multiclass Transfer Learning from Unconstrained Priors, , and , Idiap-RR-25-2011 |
|
Multimodal Cue Detection Engine for Orchestrated Entertainment, , , and , Idiap-RR-34-2011 |
|
Multitask Learning to Improve Articulatory Feature Estimation and Phoneme Recognition, and , Idiap-RR-21-2011 |
|
On-line unsupervised adaptation for face verification using Gaussian Mixture Models with multiple user models, , and , Idiap-RR-07-2011 |
|
Parts-Based Face Verification using Local Frequency Bands, and , Idiap-RR-06-2011 |
|
Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, , , and , Idiap-RR-12-2011 |
|
Robustness of Group Delay Representations for Noisy Speech Signals, , and , Idiap-RR-36-2011 |
|
Social Focus of Attention as a Time Function Derived from Multimodal Signals, and , Idiap-RR-09-2011 |
|
Speech Enhancement using Beta-order MMSE Spectral Amplitude Estimator with Laplacian Prior, , , and , Idiap-RR-24-2011 |
|
Towards semi-supervised learning of semantic spatial concepts, and , Idiap-RR-03-2011 |
|
Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, and , Idiap-RR-11-2011 |
|
When Users Meet Technology: The Meeting Browser Development Helix, , and , Idiap-RR-05-2011 |
|
2010
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , Idiap-RR-05-2010 |
|
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, , and , Idiap-RR-36-2010 |
|
Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , Idiap-RR-23-2010 |
|
AMIDA/Klewel Mini-Project, , , and , Idiap-RR-03-2010 |
|
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, and , Idiap-RR-02-2010 |
|
An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , Idiap-RR-16-2010 |
|
An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, , and , Idiap-RR-22-2010 |
|
Application of Out-Of-Language Detection To Spoken-Term Detection, and , Idiap-RR-04-2010 |
|
Automatic Time Skew Detection and Correction, , Idiap-RR-42-2010 |
|
Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, and , Idiap-RR-13-2010 |
|
Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior, and , Idiap-RR-12-2010 |
|
Fast Bounding Box Estimation based Face Detection, and , Idiap-RR-38-2010 |
|
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , Idiap-RR-17-2010 |
|
Hands Free Audio Analysis from Home Entertainment, , and , Idiap-RR-27-2010 |
|
Hierarchical Multilayer Perceptron based Language Identification, , and , Idiap-RR-14-2010 |
|
Hierarchical Tandem Features for ASR in Mandarin, , and , Idiap-RR-39-2010 |
|
Implementation of VTLN for Statistical Speech Synthesis, , , and , Idiap-RR-32-2010 |
|
Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, and , Idiap-RR-29-2010 |
|
Investigation of kNN Classifier on Posterior Features Towards Application in Automatic Speech Recognition, , and , Idiap-RR-11-2010 |
|
KL Realignment for Speaker Diarization with Multiple Feature Streams, , and , Idiap-RR-24-2010 |
|
Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, , and , Idiap-RR-20-2010 |
|
Measuring the gap between HMM-based ASR and TTS, , and , Idiap-RR-34-2010 |
|
Mining Human Location-Routines using a Multi-Level Topic Model, and , Idiap-RR-28-2010 |
|
Modeling and Understanding Flickr Communities through Topic-based Analysis, and , Idiap-RR-19-2010 |
|
OM-2: An Online Multi-class Multi-kernel Learning Algorithm, , , , and , Idiap-RR-06-2010 |
|
On Improving Face Detection Performance by Modelling Contextual Information, , and , Idiap-RR-43-2010 |
|
Online-Batch Strongly Convex Multi Kernel Learning, , and , Idiap-RR-07-2010 |
|
Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, , and , Idiap-RR-33-2010 |
|
Study of Jacobian Normalization for VTLN, , and , Idiap-RR-25-2010 |
|
The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, , , and , Idiap-RR-26-2010 |
|
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , Idiap-RR-41-2010 |
|
The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, , and , Idiap-RR-08-2010 |
|
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , Idiap-RR-37-2010 |
|
Towards mixed language speech recognition systems, , and , Idiap-RR-15-2010 |
|
Towards Robust Place Recognition for Robot Localization, , , , , and , Idiap-RR-40-2010 |
|
Tuning-Robust Initialization Methods for Speaker Diarization, and , Idiap-RR-35-2010 |
|
2009
Analysis of F0 and Cepstral Features for Robust Automatic Gender Recognition, and , Idiap-RR-30-2009 |
|
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , Idiap-RR-35-2009 |
|
Automatic Out-of-Language Detection based on Confidence Measures derived from LVCSR Word and Phone Lattices, , Idiap-RR-06-2009 |
|
Automatic Temporal Alignment of AV Data, , and , Idiap-RR-39-2009 |
|
Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , Idiap-RR-40-2009 |
|
Automatic vs. human question answering over multimedia meeting recordings, and , Idiap-RR-13-2009 |
|
Autoregressive Models of Amplitude Modulations in Audio Compression, , and , Idiap-RR-33-2009 |
|
Bayesian Networks to Combine Intensity and Color Information in Face Recognition, and , Idiap-RR-27-2009 |
|
Co-occurrence Models for Image Annotation and Retrieval, , Idiap-RR-22-2009 |
|
Comparing meeting browsers using a task-based evaluation method, , Idiap-RR-11-2009 |
|
Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets, and , Idiap-RR-19-2009 |
|
Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, and , Idiap-RR-28-2009 |
|
Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity, , Idiap-RR-18-2009 |
|
Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, , , and , Idiap-RR-12-2009 |
|
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , Idiap-RR-34-2009 |
|
Memoirs of Togetherness from Audio Logs, , Idiap-RR-36-2009 |
|
Model Adaptation with Least-Squares SVM for Adaptive Hand Prosthetics, , , , and , Idiap-RR-05-2009 |
|
Multiple Object Tracking using Flow Linear Programming, , and , Idiap-RR-10-2009 |
|
Novel initialization methods for Speaker Diarization, , Idiap-RR-07-2009 |
|
On MLP-based Posterior Features for Template-based ASR, , , and , Idiap-RR-37-2009 |
|
Out-of-Scene AV Data Detection, , Idiap-RR-31-2009 |
|
Robust Speaker Diarization for Short Speech Recordings, and , Idiap-RR-26-2009 |
|
Robustness of Phase based Features for Speaker Recognition, , and , Idiap-RR-14-2009 |
|
Speaker Change Detection with Privacy-Preserving Audio Cues, , , and , Idiap-RR-23-2009 |
|
Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features, and , Idiap-RR-20-2009 |
|
Support Vector Machines with a Reject Option, , , and , Idiap-RR-01-2009 |
|
Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr, and , Idiap-RR-21-2009 |
|
User Interface Design in a Just-in-time Retrieval System for Meetings, , , , , , and , Idiap-RR-38-2009 |
|
Visual activity context for focus of attention estimation in dynamic meetings, , and , Idiap-RR-02-2009 |
|
Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, and , Idiap-RR-29-2009 |
|
VTLN Adaptation for Statistical Speech Synthesis, , , and , Idiap-RR-41-2009 |
|
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , and , Idiap-RR-32-2009 |
|
Parts-Based Face Verification using Local Frequency Bands, and , Idiap-RR-03-2009 |
|
2008
A Data-driven Approach to Speech/Non-speech Detection, and , Idiap-RR-23-2008 |
|
A Distance Model for Rhythms, , , and , Idiap-RR-33-2008 |
|
A Neural Network based Regression Approach for Recognizing Simultaneous Speech, , , , and , Idiap-RR-10-2008 |
|
Acoustic Models for Posterior Features in Speech Recognition, , Idiap-RR-67-2008 |
|
Adaptive Beamforming with a Maximum Negentropy Criterion, , , , and , Idiap-RR-06-2008 |
|
Adaptive Beamforming with a Maximum Negentropy Criterion, , , , , and , Idiap-RR-29-2008 |
|
An Information Theoretic Approach to Speaker Diarization of Meeting Data, , and , Idiap-RR-58-2008 |
|
Analyzing Flickr Groups, and , Idiap-RR-03-2008 |
|
Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, , , , and , Idiap-RR-48-2008 |
|
Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, , , and , Idiap-RR-66-2008 |
|
Asynchronous detection and classification of oscillatory brain activity, , and , Idiap-RR-36-2008 |
|
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , Idiap-RR-40-2008 |
|
Calibration from statistical properties of the visual world, , and , Idiap-RR-63-2008 |
|
Characterizing the EEG Correlates of Exploratory Behavior, , , and , Idiap-RR-28-2008 |
|
CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach, , and , Idiap-RR-77-2008 |
|
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , Idiap-RR-20-2008 |
|
Composite Kernel Learning, , and , Idiap-RR-59-2008 |
|
Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, , , , , , and , Idiap-RR-53-2008 |
|
Detecting queues at vending machines: a statistical layered approach, and , Idiap-RR-04-2008 |
|
Discovering Human Routines from Cell Phone Data with Topic Models, and , Idiap-RR-32-2008 |
|
Discriminative Keyword Spotting, , and , Idiap-RR-31-2008 |
|
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , Idiap-RR-24-2008 |
|
Enhanced Phone Posteriors for Improving Speech Recognition Systems, and , Idiap-RR-39-2008 |
|
Entropy coding of Quantized Spectral Components in FDLP audio codec, , and , Idiap-RR-71-2008 |
|
Exploiting contextual information for speech/non-speech detection, and , Idiap-RR-22-2008 |
|
Exploiting temporal context for speech/non-speech detection, , and , Idiap-RR-21-2008 |
|
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , Idiap-RR-45-2008 |
|
Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, , , , , and , Idiap-RR-02-2008 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , Idiap-RR-17-2008 |
|
Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , Idiap-RR-42-2008 |
|
Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, , and , Idiap-RR-18-2008 |
|
How does a dictation machine recognize speech?, , and , Idiap-RR-72-2008 |
|
Identifying Dominant People in Meetings from Audio-Visual Sensors, and , Idiap-RR-65-2008 |
|
Inference in Switching Linear Dynamical Systems Applied to Noise Robust Speech Recognition of Isolated Digits, , Idiap-RR-35-2008 |
|
Integrating audio and vision for robust automatic gender recognition, and , Idiap-RR-73-2008 |
|
Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, , and , Idiap-RR-26-2008 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , Idiap-RR-25-2008 |
|
Kernel Based Text-Independent Speaker Verification, , and , Idiap-RR-68-2008 |
|
Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , Idiap-RR-75-2008 |
|
Machine Learning for Information Retrieval, , Idiap-RR-34-2008 |
|
Maximum Negentropy Beamforming, , , , and , Idiap-RR-07-2008 |
|
MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, , and , Idiap-RR-74-2008 |
|
Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , Idiap-RR-70-2008 |
|
Multi-layer Boosting for Pattern Recognition, , Idiap-RR-76-2008 |
|
Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, and , Idiap-RR-47-2008 |
|
Neural Network based Regression for Robust Overlapping Speech Recognition using Microphone Arrays, , , and , Idiap-RR-09-2008 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , Idiap-RR-12-2008 |
|
Posterior Features Applied to Speech Recognition Tasks with Limited Training Data, , and , Idiap-RR-15-2008 |
|
Predicting the dominant clique in meetings through fusion of nonverbal cues, , , and , Idiap-RR-08-2008 |
|
Predictive Models for Music, , and , Idiap-RR-51-2008 |
|
Probabilistic Models for Melodic Prediction, , and , Idiap-RR-50-2008 |
|
Recognition and Understanding of Meetings: Overview of the European AMI and AMIDA Projects, and , Idiap-RR-27-2008 |
|
Recognition of Anticipatory Behavior from Human EEG, , and , Idiap-RR-52-2008 |
|
Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, , and , Idiap-RR-41-2008 |
|
Reverse Correlation for analyzing MLP Posterior Features in ASR, , and , Idiap-RR-13-2008 |
|
Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, , , , and , Idiap-RR-57-2008 |
|
Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, , and , Idiap-RR-64-2008 |
|
Silence Models in Weighted Finite-State Transducers, , Idiap-RR-19-2008 |
|
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , Idiap-RR-16-2008 |
|
Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, , and , Idiap-RR-05-2008 |
|
The Projectron: a Bounded Kernel-Based Perceptron, , and , Idiap-RR-30-2008 |
|
Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, , Idiap-RR-46-2008 |
|
Topickr: Flickr Groups and Users Reloaded, and , Idiap-RR-61-2008 |
|
understanding metro station usage using closed circuit television cameras analysis, , , , , , and , Idiap-RR-38-2008 |
|
Using KL-based Acoustic Models in a Large Vocabulary Recognition Task, , and , Idiap-RR-14-2008 |
|
What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, and , Idiap-RR-49-2008 |
|
2007
A Bayesian Switching Linear Dynamical System for Scale-Invariant robust speech extraction, and , Idiap-RR-52-2007 |
|
A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, and , Idiap-RR-20-2007 |
|
A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, , , , and , Idiap-RR-78-2007 |
|
A Discriminative Kernel-based Model to Rank Images from Text Queries, and , Idiap-RR-38-2007 |
|
A Generative Model for Rhythms, , , and , Idiap-RR-70-2007 |
|
A Novel Statistical Generative Model Dedicated To Face Recognition, and , Idiap-RR-39-2007 |
|
A study of phoneme and grapheme based context-dependent ASR systems, and , Idiap-RR-12-2007 |
|
Adaptive Beamforming with a Minimum Mutual Information Criterion, , , , , and , Idiap-RR-74-2007 |
|
AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , Idiap-RR-31-2007 |
|
Analysis of Confusion Matrix to Combine Evidence for Phoneme Recognition, , , and , Idiap-RR-27-2007 |
|
Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, , , and , Idiap-RR-26-2007 |
|
Biometric Person Authentication IS A Multiple Classifier Problem, and , Idiap-RR-03-2007 |
|
Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, and , Idiap-RR-30-2007 |
|
Classifying Materials in the Real World, , , and , Idiap-RR-69-2007 |
|
COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, , and , Idiap-RR-51-2007 |
|
Comparing Different Word Lattice Rescoring Approaches Towards Keyword Spotting, , , and , Idiap-RR-32-2007 |
|
Confidence-based Cue Integration for Visual Place Recognition, and , Idiap-RR-17-2007 |
|
Daily Routine Classification from Mobile Phone Data, and , Idiap-RR-62-2007 |
|
Detection and Recognition of Number Sequences in Spoken Utterances, and , Idiap-RR-42-2007 |
|
Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics, and , Idiap-RR-13-2007 |
|
Discriminative Cue Integration for Medical Image Annotation, , and , Idiap-RR-64-2007 |
|
Dynamical Dirichlet Mixture Model, , and , Idiap-RR-02-2007 |
|
Effective post-processing for single-channel frequency-domain speech enhancement, , Idiap-RR-71-2007 |
|
ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, , , and , Idiap-RR-60-2007 |
|
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , Idiap-RR-65-2007 |
|
Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting, , and , Idiap-RR-11-2007 |
|
Face Authentication with Salient Local Features and Static Bayesian Network, and , Idiap-RR-04-2007 |
|
Fast Human Detection from Videos Using Covariance Features, and , Idiap-RR-68-2007 |
|
Feature Extraction for Multi-class BCI using Canonical Variates Analysis, , , , and , Idiap-RR-23-2007 |
|
Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, , , , , and , Idiap-RR-77-2007 |
|
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , Idiap-RR-45-2007 |
|
Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , Idiap-RR-08-2007 |
|
Hierarchical Penalization, , and , Idiap-RR-76-2007 |
|
Human-Centered Computing: Toward a Human Revolution, , , and , Idiap-RR-57-2007 |
|
Joint Bi-Modal Face and Speaker Authentication using Explicit Polynomial Expansion, , Idiap-RR-14-2007 |
|
Keyword Spotting on Word Lattices, and , Idiap-RR-22-2007 |
|
Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, and , Idiap-RR-15-2007 |
|
Learning the structure of image collections with latent aspect models, , Idiap-RR-06-2007 |
|
LP-TRAPs in all senses, , Idiap-RR-66-2007 |
|
Mapping Nonverbal Communication into Social Status: Automatic Recognition of Journalists and Non-journalists in Radio News, , Idiap-RR-33-2007 |
|
Minimum Mutual Information Beamforming for Simultaneous Active Speakers, , , , , and , Idiap-RR-73-2007 |
|
MLP-based Log Spectral Energy Mapping for Robust Overlapping Speech Recognition, , , and , Idiap-RR-54-2007 |
|
More Efficiency in Multiple Kernel Learning, , , and , Idiap-RR-18-2007 |
|
Multi-Layer Background Subtraction Based on Color and Texture, and , Idiap-RR-67-2007 |
|
Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, and , Idiap-RR-50-2007 |
|
Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, , and , Idiap-RR-09-2007 |
|
Non-linear Spectral Contrast Stretching for In-car Speech Recognition, and , Idiap-RR-53-2007 |
|
Non-uniform QMF Decomposition for Wide-band Audio Coding based on Frequency Domain Linear Prediction, , , and , Idiap-RR-43-2007 |
|
Object Category Detection using Audio-visual Cues, , , , and , Idiap-RR-58-2007 |
|
On Confusions in a Phoneme Recognizer, , and , Idiap-RR-10-2007 |
|
On-line Independent Support Vector Machines for Cognitive Systems, , , , and , Idiap-RR-63-2007 |
|
Posterior-Based Features and Distances in Template Matching for Speech Recognition, and , Idiap-RR-41-2007 |
|
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, and , Idiap-RR-21-2007 |
|
Recognition and Understanding of Meetings: The AMI and AMIDA Projects, , and , Idiap-RR-46-2007 |
|
Robust overlapping speech recognition based on neural networks, , and , Idiap-RR-55-2007 |
|
Role Recognition in Radio Programs using Social Affiliation Networks and Mixtures of Discrete Distributions: an Approach Inspired by Social Cognition, and , Idiap-RR-40-2007 |
|
Scalable Wide-band Audio Codec based on Frequency Domain Linear Prediction, , , and , Idiap-RR-16-2007 |
|
Significance of Contextual Information in Phoneme Recognition, , , and , Idiap-RR-28-2007 |
|
Sparse Probabilistic Classifiers, and , Idiap-RR-19-2007 |
|
Stationary Features and Cat Detection, and , Idiap-RR-56-2007 |
|
Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , Idiap-RR-48-2007 |
|
The COLD Database, , , , and , Idiap-RR-49-2007 |
|
The use of brain-computer interfacing for ambient intelligence, , , , , and , Idiap-RR-61-2007 |
|
Theoretical Foundations for Large-Margin Kernel-Based Continuous Speech Recognition, , Idiap-RR-44-2007 |
|
To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, , and , Idiap-RR-37-2007 |
|
Truncation Confusion Patterns in Onset Consonants, , Idiap-RR-05-2007 |
|
Unsupervised Learning for Information Distillation, , Idiap-RR-47-2007 |
|
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, , , , , , , , and , Idiap-RR-29-2007 |
|
Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, and , Idiap-RR-75-2007 |
|
2006
2D Multi-Person Tracking: A Comparative Study in AMI Meetings, , , , , and , Idiap-RR-37-2006 |
|
A Bayesian Alternative to Gain Adaptation in Autoregressive Hidden Markov Models, and , Idiap-RR-55-2006 |
|
A Discriminative Approach for the Retrieval of Images from Text Queries, , and , Idiap-RR-15-2006 |
|
A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition, , and , Idiap-RR-62-2006 |
|
A Multitask Learning Approach to Document Representation using Unlabeled Data, and , Idiap-RR-44-2006 |
|
A Neural Network to Retrieve Images from Text Queries, and , Idiap-RR-33-2006 |
|
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, and , Idiap-RR-10-2006 |
|
A supervised learning approach based on STDP and polychronization in spiking neuron networks, , and , Idiap-RR-54-2006 |
|
Active Shape Models Using Local Binary Patterns, and , Idiap-RR-07-2006 |
|
An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, , and , Idiap-RR-60-2006 |
|
Analysis and Classification of EEG Signals using Probabilistic Models for Brain Computer Interfaces, , Idiap-RR-48-2006 |
[URL] |
Analyzing Group Interactions in Conversations: a Review, , Idiap-RR-63-2006 |
|
Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations, , and , Idiap-RR-56-2006 |
|
Audio Coding Based on Long Temporal Contexts, , , and , Idiap-RR-30-2006 |
|
Audio Coding Based on Long Temporal Segments: Experiments With Quantization of Excitation Signal, and , Idiap-RR-46-2006 |
|
Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, , , and , Idiap-RR-18-2006 |
|
Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , Idiap-RR-61-2006 |
|
Detecting Abandoned Luggage Items in a Public Space, , and , Idiap-RR-39-2006 |
|
Detecting Intentional Mental Transitions in an Asynchronous BCI, , , , and , Idiap-RR-43-2006 |
|
Detection and Application of Influence Rankings in Small Group Meetings, , , and , Idiap-RR-49-2006 |
|
Discriminant linear processing of time-frequency plane, and , Idiap-RR-20-2006 |
|
Discriminative Kernel-Based Phoneme Sequence Recognition, , , , and , Idiap-RR-14-2006 |
|
Discriminant Models for Text-independent Speaker Verification, , Idiap-RR-70-2006 |
|
Estimating the Confidence Interval of Expected Performance Curve in Biometric Authentication Using Joint Bootstrap, and , Idiap-RR-25-2006 |
|
Exploring Contextual Information in a Layered Framework for Group Action Recognition, , and , Idiap-RR-41-2006 |
|
Face Authentication Using Adapted Local Binary Pattern Histograms, and , Idiap-RR-06-2006 |
|
Face Detection and Verification using Local Binary Patterns, , Idiap-RR-79-2006 |
|
Further Applications of Sector-Based Detection and Short-Term Clustering, , Idiap-RR-26-2006 |
|
Hand Posture Classification and Recognition using the Modified Census Transform, , and , Idiap-RR-02-2006 |
|
Identifying unexpected words using in-context and out-of-context phoneme posteriors, and , Idiap-RR-68-2006 |
|
Incremental Learning for Place Recognition in Dynamic Environments, , , and , Idiap-RR-52-2006 |
|
Indexation de Documents Manuscrits, , Idiap-RR-31-2006 |
|
Infinite Models for Speaker Clustering, , Idiap-RR-19-2006 |
|
Investigating Lexical Substitution Scoring for Subtitle Generation, , , , and , Idiap-RR-36-2006 |
|
Juicer: A Weighted Finite-State Transducer speech decoder, , , , , and , Idiap-RR-21-2006 |
|
Learning to Retrieve Images from Text Queries with a Discriminative Model, , and , Idiap-RR-32-2006 |
|
Machine Learning Approaches to Text Representation using Unlabeled Data, , Idiap-RR-76-2006 |
|
Master Thesis: Integration of the Harmonic plus Noise Model (HNM) into the Hidden Markov Model-Based Speech Synthesis System (HTS), , Idiap-RR-69-2006 |
|
Melanoma Recognition using Kernel Classifiers, , and , Idiap-RR-53-2006 |
|
Model Adaptation for Sentence Unit Segmentation from Speech, , Idiap-RR-64-2006 |
|
Multi-Person Tracking in Meetings: A Comparative Study, , , , , and , Idiap-RR-38-2006 |
|
Multi-stream Processing for Noise Robust Speech Recognition, , Idiap-RR-28-2006 |
|
Natural Scene Image Modeling using Color and Texture Visterms, and , Idiap-RR-17-2006 |
|
Nearly optimal exploration-exploitation decision thresholds, , Idiap-RR-12-2006 |
|
Observations on Multi-Band Asynchrony in Distant Speech Recordings, , Idiap-RR-74-2006 |
|
On the Recent Use of Local Binary Patterns for Face Authentication, , and , Idiap-RR-34-2006 |
|
Online Classifier Adaptation in Brain-Computer Interfaces, and , Idiap-RR-16-2006 |
|
Online statistical estimation for vehicle control, , Idiap-RR-13-2006 |
|
Posterior Based Keyword Spotting with A Priori Thresholds, , , and , Idiap-RR-67-2006 |
|
Probabilistic Graphical Models for Human Interaction Analysis, , Idiap-RR-78-2006 |
|
Recognizing People's Focus of Attention from Head Poses: a Study, and , Idiap-RR-42-2006 |
|
Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, , and , Idiap-RR-04-2006 |
|
Robust-to-Illumination Face Localisation using Active Shape Models and Local Binary Patterns, , and , Idiap-RR-47-2006 |
|
Role Recognition in Broadcast News Using Social Network Analysis and Duration Distribution Modeling, , Idiap-RR-35-2006 |
|
Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, , and , Idiap-RR-75-2006 |
|
Sociometry Based Multiparty Audio Recordings Summarization, , Idiap-RR-27-2006 |
|
Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, , Idiap-RR-77-2006 |
|
Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, and , Idiap-RR-29-2006 |
|
Speech Coding based on Spectral Dynamics, , , and , Idiap-RR-05-2006 |
|
Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array, , and , Idiap-RR-24-2006 |
|
Spiking Neuron Networks: A survey, , Idiap-RR-11-2006 |
|
SVM-based Transfer of Visual Knowledge Across Robotic Platforms, , and , Idiap-RR-65-2006 |
|
Switching Linear Dynamical Systems for Noise Robust Speech Recognition, and , Idiap-RR-08-2006 |
|
The more you learn, the less you store: memory-controlled incremental SVM, and , Idiap-RR-51-2006 |
|
The segmentation of multi-channel meeting recordings for automatic speech recognition, , and , Idiap-RR-22-2006 |
|
Towards using slide information to enhance speech transcription of meetings, , and , Idiap-RR-01-2006 |
|
Tracking Attention for Multiple People: Wandering Visual Focus of Attention Estimation, , , and , Idiap-RR-40-2006 |
|
Two-Handed Gestures for Human-Computer Interaction, , Idiap-RR-73-2006 |
|
Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, and , Idiap-RR-50-2006 |
|
Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels, , and , Idiap-RR-09-2006 |
|
Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, , and , Idiap-RR-57-2006 |
|
Using Posterior-Based Features in Template Matching for Speech Recognition, , and , Idiap-RR-23-2006 |
|
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , Idiap-RR-58-2006 |
|
2005
A Discriminative Decoder for the Recognition of Phoneme Sequences, and , Idiap-RR-67-2005 |
|
A Frequency-Domain Silence Noise Model, , and , Idiap-RR-13-2005 |
|
A Generative Model for Music Transcription, , and , Idiap-RR-89-2005 |
|
A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, , , and , Idiap-RR-33-2005 |
|
A Kernel Classifier for Distributions, and , Idiap-RR-32-2005 |
|
A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems, and , Idiap-RR-77-2005 |
|
A Meeting Browser Evaluation Test, , , and , Idiap-RR-02-2005 |
|
A Neural Network for Text Representation, and , Idiap-RR-12-2005 |
|
A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, , and , Idiap-RR-26-2005 |
|
A Probabilistic Model for Chord Progressions, , and , Idiap-RR-57-2005 |
|
A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, and , Idiap-RR-35-2005 |
|
A Thousand Words in a Scene, , , and , Idiap-RR-40-2005 |
|
Application of Information Retrieval Technologies to Presentation Slides, and , Idiap-RR-36-2005 |
|
Audio-visual probabilistic tracking of multiple speakers in meetings, , , and , Idiap-RR-27-2005 |
|
Bayesian Factorial Linear Gaussian State-Space Models for Biosignal Decomposition, and , Idiap-RR-84-2005 |
|
Benchmarking Non-Parametric Statistical Tests, , and , Idiap-RR-38-2005 |
|
Can a Professional Imitator Fool a GMM-Based Speaker Verification System?, and , Idiap-RR-61-2005 |
|
Can Chimeric Persons Be Used in Multimodal Biometric Authentication Experiments?, and , Idiap-RR-20-2005 |
|
Chord Representations for Probabilistic Models, , and , Idiap-RR-58-2005 |
|
Compensating User-Specific Information with User-Independent Information in Biometric Authentication Tasks, and , Idiap-RR-44-2005 |
|
Constructing visual models with a latent space approach, , , and , Idiap-RR-14-2005 |
|
Construction and comparison of approximations for switching linear Gaussian state space models, , Idiap-RR-71-2005 |
|
Construction and comparison of approximations for switching linear Gaussian state space models, and , Idiap-RR-06-2005 |
|
Continuous Microphone Array Speech Recognition on Wall Street Journal Corpus, , and , Idiap-RR-47-2005 |
|
Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, , and , Idiap-RR-79-2005 |
|
Developing and Enhancing Posterior Based Speech Recognition Systems, , , and , Idiap-RR-23-2005 |
|
EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, and , Idiap-RR-01-2005 |
|
Efficient Diffusion-based Illumination Normalization for Face Verification, , and , Idiap-RR-46-2005 |
|
Efficient Kalman Smoothing for Harmonic State-Space Models, , Idiap-RR-87-2005 |
|
Evaluation of Multiple Cues Head Pose Tracking Algorithm in Indoor Environments, and , Idiap-RR-05-2005 |
|
Extracting Information from Multimedia Meeting Collections, , and , Idiap-RR-50-2005 |
|
Face Authentication Based on Local Features and Generative Models, , Idiap-RR-85-2005 |
|
Finding groups of people in Google news, and , Idiap-RR-68-2005 |
|
Generative Temporal ICA for Classification in Asynchronous BCI Systems, and , Idiap-RR-08-2005 |
|
Gradient estimates of return, and , Idiap-RR-29-2005 |
|
Harmonic Plus Noise Model for Concatenative Speech Synthesis, , Idiap-RR-37-2005 |
|
Hierarchical approach for spotting keywords, , Idiap-RR-41-2005 |
|
Hierarchical Multi-Stream Posterior Based Speech Recognition System, , and , Idiap-RR-25-2005 |
|
Improving Continuous Speech Recognition System Performance with Grapheme Modelling, , , and , Idiap-RR-16-2005 |
|
Improving Speech Recognition Using a Data-Driven Approach, , and , Idiap-RR-66-2005 |
|
Inferring Document Similarity from Hyper-links, and , Idiap-RR-21-2005 |
|
Integrating co-occurrence and spatial contexts on patch-based scene segmentation, , , and , Idiap-RR-30-2005 |
|
Joint Speech and Speaker Recognition, , Idiap-RR-28-2005 |
|
Joint Training of Multi-Stream HMMs, , Idiap-RR-22-2005 |
|
Kernelized Infomax Clustering, and , Idiap-RR-73-2005 |
|
Learning influence among interacting Markov chains, , , and , Idiap-RR-48-2005 |
|
Local Binary Patterns as an Image Preprocessing for Face Authentication, , and , Idiap-RR-76-2005 |
|
Local Features and 1D-HMMs for Fast and Robust Face Authentication, , Idiap-RR-17-2005 |
|
Measuring the Performance of Face Localization Systems, , , and , Idiap-RR-53-2005 |
|
Modeling Interactions from Email Communication, , , and , Idiap-RR-51-2005 |
|
Modeling semantic aspects for cross-media image indexing, and , Idiap-RR-56-2005 |
|
Multi Channel Sequence Processing, and , Idiap-RR-04-2005 |
|
Multi-resolution RASTA filtering for TANDEM-based ASR, and , Idiap-RR-18-2005 |
|
Multi-stream ASR: Oracle Test and Embedded Training, , and , Idiap-RR-62-2005 |
|
Multimodal Integration for Meeting Group Action Segmentation and Recognition, , , , , and , Idiap-RR-31-2005 |
|
Multiview Face Detection, , and , Idiap-RR-49-2005 |
|
OCR Based Slide Retrieval, , and , Idiap-RR-11-2005 |
|
On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, , and , Idiap-RR-19-2005 |
|
On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, , and , Idiap-RR-09-2005 |
|
Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing in Supervised Learning, , , and , Idiap-RR-88-2005 |
|
Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps, , and , Idiap-RR-60-2005 |
|
Person Authentication using Brainwaves (EEG) and Maximum A Posteriori Model Adaptation, and , Idiap-RR-81-2005 |
|
Probabilistic Tagging of Unstructured Genealogical Records, and , Idiap-RR-86-2005 |
|
Semi-supervised Meeting Event Recognition with Adapted HMMs, , and , Idiap-RR-15-2005 |
|
Sociometry Based Multiparty Audio Recordings Segmentation, , Idiap-RR-78-2005 |
|
Spectral Entropy Feature in Full-Combination Multi-stream for Robust ASR, and , Idiap-RR-10-2005 |
|
Spectral Entropy Feature in Multi-stream for Robust ASR, and , Idiap-RR-45-2005 |
|
Speech Acquisition in Meetings with an Audio-Visual Sensor Array, , , , and , Idiap-RR-03-2005 |
|
Sports Event Recognition using Layered HMMs, and , Idiap-RR-07-2005 |
|
Stable Directed Belief Propagation in Gaussian DAGs using the auxiliary variable trick, and , Idiap-RR-72-2005 |
|
Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis, and , Idiap-RR-34-2005 |
|
The AMI meeting corpus: a pre-announcement, , , , , , , , , , , , , , , , and , Idiap-RR-82-2005 |
|
The Multi-Channel Wall Street Journal Audio Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments, , , and , Idiap-RR-69-2005 |
|
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), , and , Idiap-RR-63-2005 |
|
Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, , , and , Idiap-RR-52-2005 |
|
Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition, and , Idiap-RR-64-2005 |
|
Towards Explaining the Success (Or Failure) of Fusion in Biometric Authentication, and , Idiap-RR-43-2005 |
|
Tracking the Multi Person Wandering Visual Focus of Attention, , , and , Idiap-RR-80-2005 |
|
Two-Handed Gesture Recognition, and , Idiap-RR-24-2005 |
|
Unsupervised Spectral Subtraction for Noise-Robust ASR, , , and , Idiap-RR-42-2005 |
|
Using Auxiliary Sources of Knowledge for Automatic Speech Recognition, , Idiap-RR-90-2005 |
|
Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, and , Idiap-RR-59-2005 |
|
Using more informative posterior probabilities for speech recognition, , , and , Idiap-RR-91-2005 |
|
Using Pitch as Prior Knowledge in Template-Based Speech Recognition, , and , Idiap-RR-65-2005 |
|
Writer Identification for Smart Meeting Room Systems, , , , , and , Idiap-RR-70-2005 |
|
2004
A Meeting Browser Evaluation Test, , , and , Idiap-RR-53-2004 |
|
A New Speech Recognition Baseline System for Numbers 95 Version 1.3 Based on Torch, and , Idiap-RR-16-2004 |
|
A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, and , Idiap-RR-68-2004 |
|
A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, and , Idiap-RR-15-2004 |
|
A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, and , Idiap-RR-54-2004 |
|
A Stable Switching Kalman Smoother, , Idiap-RR-89-2004 |
|
A Study of the Effects of Score Normalisation Prior to Fusion in Biometric Authentication Tasks, and , Idiap-RR-69-2004 |
|
A Unified Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification, and , Idiap-RR-62-2004 |
|
An Auxiliary Variational Method, and , Idiap-RR-86-2004 |
|
An Investigation of F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, and , Idiap-RR-46-2004 |
|
Application of Information Retrieval Techniques to Single Writer Documents, , Idiap-RR-12-2004 |
|
Are two Classifiers performing equally? A treatment using Bayesian Hypothesis Testing, , Idiap-RR-57-2004 |
|
Assessing Scene Structuring in Consumer Videos, , , , and , Idiap-RR-11-2004 |
|
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , Idiap-RR-28-2004 |
|
Boosting word error rates, and , Idiap-RR-49-2004 |
|
Browsing Recorded Meetings with Ferret, , and , Idiap-RR-32-2004 |
|
Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, and , Idiap-RR-44-2004 |
|
Detecting Group Interest-level in Meetings, , , and , Idiap-RR-51-2004 |
|
EEG Classification using Generative Independent Component Analysis, and , Idiap-RR-77-2004 |
|
Effect of Recognition Errors on Information Retrieval Performance, , Idiap-RR-08-2004 |
|
Effect of Recognition Errors on Text Clustering, and , Idiap-RR-82-2004 |
|
Effect of Segmentation Method on Video Retrieval Performance, and , Idiap-RR-83-2004 |
|
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , Idiap-RR-19-2004 |
|
Estimates of Parameter Distributions for Optimal Action Selection, and , Idiap-RR-72-2004 |
|
Estimating the Quality of Face Localization for Face Verification, , , and , Idiap-RR-07-2004 |
|
Evidences of Equal Error Rate Reduction in Biometric Authentication Fusion, and , Idiap-RR-43-2004 |
|
Face Authentication using Client-specific Matching Pursuit, , , and , Idiap-RR-78-2004 |
|
HMM and IOHMM for the Recognition of Mono- and Bi-Manual 3D Hand Gestures, , and , Idiap-RR-39-2004 |
|
HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition, , and , Idiap-RR-50-2004 |
|
How Do Correlation and Variance of Base-Experts Affect Fusion in Biometric Authentication Tasks?, and , Idiap-RR-18-2004 |
|
Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, and , Idiap-RR-63-2004 |
|
Improving Single Modal and Multimodal Biometric Authentication Using F-ratio Client-Dependent Normalisation, and , Idiap-RR-52-2004 |
|
Invariances in Kernel Methods: From Samples to Objects, and , Idiap-RR-56-2004 |
|
Large Scale Machine Learning, , Idiap-RR-42-2004 |
|
Links between Perceptrons, MLPs and SVMs, and , Idiap-RR-06-2004 |
|
LP-TRAP: Linear predictive temporal patterns, , and , Idiap-RR-59-2004 |
|
Making Retrieval Faster Through Document Clustering, and , Idiap-RR-02-2004 |
|
Modeling Individual and Group Actions in Meetings With Layered HMMs, , , , and , Idiap-RR-33-2004 |
|
Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, , , , and , Idiap-RR-09-2004 |
|
Modeling Scenes with Local Descriptors and Latent Aspects, , , , , and , Idiap-RR-79-2004 |
|
Modelling Auxiliary Features in Tandem Systems, , , and , Idiap-RR-21-2004 |
|
Motion likelihood and proposal modeling in Model-Based Stochastic Tracking, and , Idiap-RR-61-2004 |
|
Multi-resolution Spectral Entropy Based Feature for Robust ASR, , , and , Idiap-RR-37-2004 |
|
Multimodal Group Action Clustering in Meetings, , , , and , Idiap-RR-24-2004 |
|
Multimodal Multispeaker Probabilistic Tracking in Meetings, , , and , Idiap-RR-66-2004 |
|
New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, , , and , Idiap-RR-29-2004 |
|
Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, and , Idiap-RR-01-2004 |
|
Noisy Text Categorization, , Idiap-RR-03-2004 |
|
Noisy Text Clustering, and , Idiap-RR-31-2004 |
|
Nonlinear Feature Transformations for Noise Robust Speech Recognition, , Idiap-RR-70-2004 |
|
On Local Features for Face Verification, and , Idiap-RR-36-2004 |
|
On Performance / Robustness / Complexity Trade-Offs in Face Verification, , and , Idiap-RR-74-2004 |
|
On the Adequacy of Baseform Pronunciations and Pronunciation Variants, and , Idiap-RR-27-2004 |
|
On the Use of Information Retrieval Measures for Speech Recognition Evaluation, , , , , , and , Idiap-RR-73-2004 |
|
On the Use of Speech and Face Information for Identity Verification, and , Idiap-RR-10-2004 |
|
Order Matters: A Distributed Sampling Method for Multi-Object Tracking, , Idiap-RR-25-2004 |
|
Phase AutoCorrelation (PAC) Features for Noise Robust ASR, , , and , Idiap-RR-40-2004 |
|
Phoneme vs Grapheme Based Automatic Speech Recognition, , , and , Idiap-RR-48-2004 |
|
PLP$^2$: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns, , and , Idiap-RR-60-2004 |
|
PLSA-based Image Auto-Annotation: Constraining the Latent Space, and , Idiap-RR-30-2004 |
|
Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, and , Idiap-RR-23-2004 |
|
Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, , Idiap-RR-55-2004 |
|
Robust Audio Segmentation, , and , Idiap-RR-35-2004 |
|
Sector-Based Detection for Hands-Free Speech Enhancement in Cars, , and , Idiap-RR-67-2004 |
|
Semi-supervised Adapted HMMs for Unusual Event Detection, , and , Idiap-RR-80-2004 |
|
Sequence Classification with Input-Output Hidden Markov Models, and , Idiap-RR-13-2004 |
|
Short-Term Spatio-Temporal Clustering of Sporadic and Concurrent Events, , and , Idiap-RR-14-2004 |
|
Significance Tests for Bizarre Measures in 2-Class Classification Tasks, , and , Idiap-RR-34-2004 |
|
Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, , , and , Idiap-RR-20-2004 |
|
Statistical Transformation Techniques for Face Verification Using Faces Rotated in Depth, and , Idiap-RR-04-2004 |
|
Stochastic techniques in deriving perceptual knowledge, , Idiap-RR-84-2004 |
|
Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, and , Idiap-RR-26-2004 |
|
The Auxiliary Variable Trick for deriving Kalman Smoothers, , Idiap-RR-87-2004 |
|
Theme Topic Mixture Model: A Graphical Model for Document Representation, and , Idiap-RR-05-2004 |
|
Towards Predicting Optimal Subsets of Base-Experts in Biometric Authentication Task, and , Idiap-RR-17-2004 |
|
Towards using hierarchical posteriors for flexible automatic speech recognition systems, , , , , and , Idiap-RR-58-2004 |
|
Tracking People in Meetings with Particles, , , , and , Idiap-RR-71-2004 |
|
User Authentication via Adapted Statistical Models of Face Images, , and , Idiap-RR-38-2004 |
|
User-Customized Password Speaker Verification Using Multiple Reference and Background Models, and , Idiap-RR-41-2004 |
|
Using RASTA in task independent TANDEM feature extraction, , and , Idiap-RR-22-2004 |
|
Variational Information Maximization for Population Coding, , Idiap-RR-85-2004 |
|
Variational Information Maximization in Gaussian Channels, and , Idiap-RR-88-2004 |
|
2003
A Color and Gradient Local Descriptor Fusion Scheme For Object Recognition, and , Idiap-RR-71-2003 |
|
A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Contrast Independent Features and Machine Learning Methods, and , Idiap-RR-42-2003 |
|
A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, , , and , Idiap-RR-25-2003 |
|
A New Margin-Based Criterion for Efficient Gradient Descent, and , Idiap-RR-16-2003 |
|
A Probabilistic Framework for Joint Head Tracking and Pose Estimation, and , Idiap-RR-78-2003 |
|
A Robust Speaker Clustering Algorithm, and , Idiap-RR-38-2003 |
|
A Statistical Significance Test for Person Authentication, and , Idiap-RR-83-2003 |
|
A Symmetric Transformation for LDA-based Face Verification, , Idiap-RR-67-2003 |
|
Adapted Generative Models For Face Verification, , and , Idiap-RR-76-2003 |
|
Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language Model, and , Idiap-RR-35-2003 |
|
An Alternative To Silence Removal For Text-Independent Speaker Verification, and , Idiap-RR-51-2003 |
|
An Implicit Motion Likelihood for Tracking with Particle Filters, , and , Idiap-RR-15-2003 |
|
An Investigation of Spectral Subband Centroids for Speaker Authentication, , and , Idiap-RR-62-2003 |
|
An Online Audio Indexing System, , and , Idiap-RR-39-2003 |
|
Audio-Video Person Clustering in Video Databases, and , Idiap-RR-46-2003 |
|
Automatic Analysis of Multimodal Group Actions in Meetings, , , , , and , Idiap-RR-27-2003 |
|
Automatic Speech Recognition using Dynamic Bayesian Networks with the Energy as an Auxiliary Variable, and , Idiap-RR-18-2003 |
|
Boosting HMMs with an application to speech recognition, and , Idiap-RR-41-2003 |
|
Boosting Pixel-based Classifiers for Face Verification, and , Idiap-RR-65-2003 |
|
Client Dependent GMM-SVM Models for Speaker Verification, and , Idiap-RR-03-2003 |
|
Clustering And Segmenting Speakers And Their Locations In Meetings, , and , Idiap-RR-55-2003 |
|
Comparison and Combination of Features in a Hybrid HMM/MLP and a HMM/GMM Speech Recognition System, , , , and , Idiap-RR-48-2003 |
|
Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, , and , Idiap-RR-10-2003 |
|
Conditional Gaussian Mixtures, , Idiap-RR-11-2003 |
|
Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, and , Idiap-RR-53-2003 |
|
EEG-based BCI Systems and IDIAP EEG Database, and , Idiap-RR-64-2003 |
|
Embedding Motion in Model-Based Stochastic Tracking, , and , Idiap-RR-72-2003 |
|
Evaluation of formant-like features for automatic speech recognition, , , , , and , Idiap-RR-08-2003 |
|
Face Processing & Frontal Face Verification, , Idiap-RR-20-2003 |
|
Face Verification using LDA and MLP on the BANCA database, , Idiap-RR-66-2003 |
|
Face Verification Using Synthesized Non-Frontal Models, and , Idiap-RR-60-2003 |
|
From Samples to Objects in Kernel Methods, and , Idiap-RR-29-2003 |
|
HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, and , Idiap-RR-49-2003 |
|
HMM Mixtures (HMM2) for Robust Speech Recognition, , Idiap-RR-34-2003 |
|
Improving Face Verification using Symmetric Transformation, , Idiap-RR-68-2003 |
|
Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, , and , Idiap-RR-52-2003 |
|
Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, , , and , Idiap-RR-47-2003 |
|
Monte Carlo Video Text Segmentation, and , Idiap-RR-07-2003 |
|
Multi-Modal Audio-Visual Event Recognition for Football Analysis, , and , Idiap-RR-12-2003 |
|
Multimodal Authentication using Asynchronous HMMs, , Idiap-RR-02-2003 |
|
Noise Robust Discriminative Models, and , Idiap-RR-40-2003 |
|
Noisy Text Categorization, , Idiap-RR-61-2003 |
|
Non-Linear Variance Reduction Techniques in Biometric Authentication, and , Idiap-RR-26-2003 |
|
Nonlinear Analysis of Cognitive and Motor-related EEG Signals, and , Idiap-RR-14-2003 |
|
Nonlinear Spectral Transformations for Robust Speech Recognition, , and , Idiap-RR-36-2003 |
|
Object Localization in Metric Spaces for Video Linking, and , Idiap-RR-09-2003 |
|
Offline Cursive Handwriting: From Word To Text Recognition, , Idiap-RR-24-2003 |
|
Offline Recognition of Large Vocabulary Cursive Handwritten Text, , and , Idiap-RR-01-2003 |
|
Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models, , and , Idiap-RR-22-2003 |
|
On Automatic Annotation of Images with Latent Space Models, and , Idiap-RR-31-2003 |
|
On automatic annotation of meeting databases, , , , and , Idiap-RR-06-2003 |
|
On Factorizing Spectral Dynamics for Robust Speech Recognition, , , and , Idiap-RR-32-2003 |
|
On Multi-scale Fourier Transform Analysis of Speech Signals, and , Idiap-RR-33-2003 |
|
On Performance Evaluation of Face Detection and Localization Algorithms, , , and , Idiap-RR-80-2003 |
|
On the Combination of Speech and Speaker Recognition, and , Idiap-RR-19-2003 |
|
On the Need for On-Line Learning in Brain-Computer Interfaces, , Idiap-RR-30-2003 |
|
On Use of Task Independent Training Data in Tandem Feature Extraction, and , Idiap-RR-57-2003 |
|
Online Policy Adaptation for Ensemble Classifiers, and , Idiap-RR-69-2003 |
|
Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, , , and , Idiap-RR-54-2003 |
|
Phoneme-Grapheme Based Speech Recognition System, , , and , Idiap-RR-37-2003 |
|
Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, , and , Idiap-RR-70-2003 |
|
Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, , and , Idiap-RR-63-2003 |
|
Reconnaissance de gestes 3D bi-manuels, , , and , Idiap-RR-79-2003 |
|
Robust Features for Frontal Face Authentication in Difficult Image Conditions, and , Idiap-RR-05-2003 |
|
Scalability Analysis of Audio-Visual Person Identity Verification, , , and , Idiap-RR-04-2003 |
|
Segmenting Multiple Concurrent Speakers Using Microphone Arrays, , and , Idiap-RR-21-2003 |
|
Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research, and , Idiap-RR-81-2003 |
|
Some Emerging Concepts in Speech Recognition, and , Idiap-RR-82-2003 |
|
Spectral Entropy Based Feature for Robust ASR, , , and , Idiap-RR-56-2003 |
|
Speech & Face Based Biometric Authentication at IDIAP, , , , , , , and , Idiap-RR-13-2003 |
|
Speech Recognition with Auxiliary Information, , Idiap-RR-28-2003 |
|
Tangent Vector Kernels for Invariant Image Classification with SVMs, and , Idiap-RR-75-2003 |
|
Text detection and recognition in images and video sequences, , Idiap-RR-44-2003 |
|
Textual Data Representation, and , Idiap-RR-74-2003 |
|
The Expected Performance Curve, , and , Idiap-RR-85-2003 |
|
The Expected Performance Curve: a New Assessment Measure for Person Authentication, and , Idiap-RR-84-2003 |
|
Towards Computer Understanding of Human Interactions, , , and , Idiap-RR-45-2003 |
|
TRAP-TANDEM: Data-driven extraction of temporal features from speech, , Idiap-RR-50-2003 |
|
Using pitch frequency information in speech recognition, , and , Idiap-RR-23-2003 |
|
Variance Reduction Techniques in Biometric Authentication, and , Idiap-RR-17-2003 |
|
Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, and , Idiap-RR-58-2003 |
|
Video Text Segmentation Using Particle Filters, and , Idiap-RR-43-2003 |
|
Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, and , Idiap-RR-59-2003 |
|
2002
A Multi-sample Multi-source Model for Biometric Authentication, , and , Idiap-RR-14-2002 |
|
A New Method of Contrast Normalization for Verification of Extracted Video Text Having Complex Backgrounds, and , Idiap-RR-16-2002 |
|
A State-of-the-art Neural Network for Robust Face Verification, , and , Idiap-RR-36-2002 |
|
An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, , Idiap-RR-26-2002 |
|
Audio-Visual Speaker Tracking with Importance Particle Filters, , , , and , Idiap-RR-37-2002 |
|
Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, , and , Idiap-RR-25-2002 |
|
Bagging Using the VMSE Cost Function, , Idiap-RR-27-2002 |
|
Comparison of Support Vector Machine and Neural Network for Text Texture Verification, and , Idiap-RR-19-2002 |
|
Conditional Gaussian Mixture Models for Environmental Risk Mapping, , and , Idiap-RR-12-2002 |
|
Confusion matrix based posterior probabilities correction, and , Idiap-RR-53-2002 |
|
Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering, and , Idiap-RR-48-2002 |
|
Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, , , and , Idiap-RR-24-2002 |
|
Entropy-based Multi-stream Combination, , and , Idiap-RR-31-2002 |
|
Estimating the Intrinsic Dimension of Data with a Fractal-Based Method, and , Idiap-RR-02-2002 |
|
Estimation of Conditional Distributions using Gaussian Mixture Models, , and , Idiap-RR-03-2002 |
|
Evaluation of Formant-Like Features for ASR, , , , , and , Idiap-RR-04-2002 |
|
Evaluation Protocols and Comparative Results for the Triesch Hand Posture Database, , Idiap-RR-50-2002 |
|
Experimental Protocol on the BANCA Database, , , , , , , and , Idiap-RR-05-2002 |
|
Extended BIC Criterion for Model Selection, and , Idiap-RR-42-2002 |
|
Face Verification using MLP and SVM, and , Idiap-RR-21-2002 |
|
Finding Structure in Consumer Videos by Probabilistic Hierarchical Clustering, , and , Idiap-RR-22-2002 |
|
Gestures for Multi-Modal Interfaces: A Review, , Idiap-RR-34-2002 |
|
Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, , Idiap-RR-51-2002 |
|
Hybrid generative-discriminative models for speech and speaker recognition, and , Idiap-RR-06-2002 |
|
Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, and , Idiap-RR-45-2002 |
|
Improved Unknown-Multiple Speaker clustering using HMM, , and , Idiap-RR-23-2002 |
|
Improving Face Authentication Using Virtual Samples, , and , Idiap-RR-40-2002 |
|
Information Fusion and Person Verification Using Speech & Face Information, and , Idiap-RR-33-2002 |
|
Linking Objects in Videos by Importance Sampling, and , Idiap-RR-20-2002 |
|
Location Based Speaker Segmentation, and , Idiap-RR-43-2002 |
|
Low cost duration modelling for noise robust speech recognition, , and , Idiap-RR-08-2002 |
|
Microphone Array Speech Recognition: Experiments on Overlapping Speech in Meetings, and , Idiap-RR-41-2002 |
|
Modeling Human Interaction in Meetings, , , , , , , and , Idiap-RR-59-2002 |
|
Modelling auxiliary information (pitch frequency) in hybrid HMM/ANN based ASR systems, , and , Idiap-RR-62-2002 |
|
Multiscale Facial Expression Recognition using Convolutional Neural Networks, , Idiap-RR-52-2002 |
|
Noise PDF transformation in secondary feature processing, , Idiap-RR-29-2002 |
|
On Spectral Methods and the Structuring of Home Videos, , and , Idiap-RR-55-2002 |
|
Online Policy Adaptation for Ensemble Algorithms, and , Idiap-RR-28-2002 |
|
Phase AutoCorrelation (PAC) derived Robust Speech Features, , and , Idiap-RR-38-2002 |
|
Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, , and , Idiap-RR-11-2002 |
|
Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR, and , Idiap-RR-57-2002 |
|
Robust Face Verification using Skin Color and Neural Networks, , Idiap-RR-49-2002 |
|
Robust Speaker Change Detection, , and , Idiap-RR-39-2002 |
|
Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, , and , Idiap-RR-09-2002 |
|
Self-Organizing-Maps With BIC For Speaker Clustering, , Idiap-RR-60-2002 |
|
SOM-Based Clustering for On-Line Fraud Behavior Classification: a Case Study, and , Idiap-RR-30-2002 |
|
Speaker Normalization using HMM2, , and , Idiap-RR-15-2002 |
|
Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, , and , Idiap-RR-44-2002 |
|
Speech recognition with auxiliary information, , and , Idiap-RR-58-2002 |
|
Text Detection and Recognition in Images and Videos, , and , Idiap-RR-61-2002 |
|
Text Segmentation and Recognition in Complex Background Based on Markov Random Field, , and , Idiap-RR-17-2002 |
|
The analysis of kernel ridge regression learning algorithm, , Idiap-RR-54-2002 |
|
The BANCA Database and Experimental Protocol for Speaker Verification, , , and , Idiap-RR-13-2002 |
|
Torch: a modular machine learning software library, , and , Idiap-RR-46-2002 |
|
Towards Robust and Adaptive Speech Recognition Models, , and , Idiap-RR-47-2002 |
|
Towards Robust and Adaptive Speech Recognition Models, , and , Idiap-RR-01-2002 |
|
Transforming the feature vectors to improve HMM based cursive word recognition systems, and , Idiap-RR-32-2002 |
|
Unknown-Multiple Speaker clustering using HMM, , , and , Idiap-RR-07-2002 |
|
User-Customized Password HMM Based Speaker Verification, and , Idiap-RR-35-2002 |
|
User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, and , Idiap-RR-10-2002 |
|
Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, and , Idiap-RR-18-2002 |
|
What is Better: GMM of Two Gaussians or Two Clusters With One Gaussian?, , Idiap-RR-56-2002 |
|
2001
A Comparative Study of Adaptation Methods for Speaker Verification, and , Idiap-RR-34-2001 |
|
A Parallel Mixture of SVMs for Very Large Scale Problems, , and , Idiap-RR-12-2001 |
|
A Pragmatic View of the Application of HMM2 for ASR, , and , Idiap-RR-23-2001 |
|
Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, , and , Idiap-RR-05-2001 |
|
Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model, and , Idiap-RR-17-2001 |
|
Artifacts of the colour coherence vector and an alternative similarity measure, and , Idiap-RR-02-2001 |
|
Combining Neural Gas and Learning Vector Quantization for Cursive Character Recognition, and , Idiap-RR-18-2001 |
|
Comparison of Client Model Adaptation Schemes, and , Idiap-RR-25-2001 |
|
Confidence Evaluation for Risk Prediction, , and , Idiap-RR-22-2001 |
|
Confidence Measures for Multimodal Identity Verification, , , and , Idiap-RR-38-2001 |
|
Data utility modelling for mismatch reduction, , Idiap-RR-30-2001 |
|
Detection of Narrative Structure for Annotation of News Broadcasts, , and , Idiap-RR-03-2001 |
|
EEG pattern recognition through multi-stream evidence combination, , and , Idiap-RR-31-2001 |
|
Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, and , Idiap-RR-10-2001 |
|
Evaluation of Biometric Technology on XM2VTS, , and , Idiap-RR-21-2001 |
|
Evaluation of SVM Binary Classification with Nonparametric Stochastic Simulations, , Idiap-RR-07-2001 |
|
Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, , Idiap-RR-49-2001 |
|
From missing data to maybe useful data: soft data modelling for noise robust ASR, , and , Idiap-RR-06-2001 |
|
Hidden Markov Models and other Finite State Automata for Sequence Processing, and , Idiap-RR-37-2001 |
|
IDIAP HMM/HMM2 System: Theoretical Basis and Software Specifications, , , and , Idiap-RR-27-2001 |
|
Improving Face Verification using Skin Color Information, and , Idiap-RR-44-2001 |
|
Increasing Speech Recognition Noise Robustness with HMM2, , and , Idiap-RR-36-2001 |
|
MAP Combination of Multi-Stream HMM or HMM/ANN Experts, , and , Idiap-RR-14-2001 |
|
Microphone Array Post-filter based on Noise Field Coherence, and , Idiap-RR-40-2001 |
|
Microphone Array Post-filter for Diffuse Noise Field, and , Idiap-RR-39-2001 |
|
Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, , and , Idiap-RR-45-2001 |
|
Modeling Auxiliary Information in Bayesian Network Based ASR, , and , Idiap-RR-11-2001 |
|
Neural Networks in Automatic Speech Recognition, , , and , Idiap-RR-09-2001 |
|
New Approaches Towards Robust and Adaptive Speech Recognition, , and , Idiap-RR-01-2001 |
|
Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, and , Idiap-RR-46-2001 |
|
PhD Thesis: Speech Analysis with Production Constraints, , Idiap-RR-35-2001 |
|
Pronunciation models and their evaluation using confidence measures, and , Idiap-RR-29-2001 |
|
Robust Face Analysis using Convolutional Neural Networks, , Idiap-RR-48-2001 |
|
Robust HMM-Based Speech/Music Segmentation, , and , Idiap-RR-33-2001 |
|
Robust Speech Recognition and Feature Extraction Using HMM2, , , and , Idiap-RR-42-2001 |
|
Robust speech recognition based on multi-stream processing, , Idiap-RR-41-2001 |
|
Speaker Verification Based On User-Customized Password, , and , Idiap-RR-13-2001 |
|
Speech Recognition Using Advanced HMM2 Features, , and , Idiap-RR-24-2001 |
|
Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework, , and , Idiap-RR-26-2001 |
|
Support Vector Machines for Classification and Mapping of Reservoir Data, , , , , and , Idiap-RR-04-2001 |
|
Text Enhancement with Asymmetric Filter for Video OCR, , and , Idiap-RR-19-2001 |
|
Text Identification in Complex Background using SVM, , and , Idiap-RR-20-2001 |
|
User Customized HMM/ANN Based Speaker Verification, and , Idiap-RR-32-2001 |
|
Using posterior probabilities for speech/music discrimination, , Idiap-RR-08-2001 |
|
Video OCR for Sport Video Annotation and Retrieval, and , Idiap-RR-28-2001 |
|
Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, and , Idiap-RR-15-2001 |
|
2000
A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, , and , Idiap-RR-48-2000 |
|
A neural network for classification with incomplete data, , Idiap-RR-23-2000 |
|
A new normalization technique for cursive handwritten words, and , Idiap-RR-32-2000 |
|
A Survey of Text Detection and Recognition in Images and Videos, and , Idiap-RR-38-2000 |
|
A survey on Off-Line Cursive Word Recognition, , Idiap-RR-43-2000 |
|
Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, and , Idiap-RR-08-2000 |
|
Advanced Spatial Data Analysis and Modelling with Support Vector Machines, , , and , Idiap-RR-31-2000 |
|
An EM Algorithm for HMMs with Emission Distributions Represented by HMMs, , and , Idiap-RR-11-2000 |
|
An Introduction to Bayesian Network Theory and Usage, , Idiap-RR-03-2000 |
|
Approches génératives pour le traitement de séquences d'images: application à la reconnaissance dynamique des gestes de la main, , Idiap-RR-45-2000 |
|
ASYMMETRIC FILTER FOR TEXT RECOGNITION IN VIDEO, and , Idiap-RR-37-2000 |
|
Audio visual speech recognition, , , , , , , and , Idiap-RR-35-2000 |
|
Auto-Association by Multilayer Perceptrons and Singular Value Decomposition, , Idiap-RR-16-2000 |
|
Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, , , and , Idiap-RR-19-2000 |
|
Automatic Speech Recognition using Pitch Information in Dynamic Bayesian Networks, , and , Idiap-RR-41-2000 |
|
Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, , , , , and , Idiap-RR-02-2000 |
|
Combining multiple tracking algorithms for improved general performance, , and , Idiap-RR-13-2000 |
|
Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, and , Idiap-RR-21-2000 |
|
Cursive Character Recognition by Learning Vector Quantization, and , Idiap-RR-47-2000 |
|
Environmental Data Mapping with Support Vector Regression and Geostatistics, , and , Idiap-RR-10-2000 |
|
From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, , and , Idiap-RR-20-2000 |
|
Handwritten Digits Recognition, , Idiap-RR-07-2000 |
|
HMM2- A Novel Approach to HMM Emission Probability Estimation, , and , Idiap-RR-30-2000 |
|
HMM2- Extraction of Formant Features and their Use for Robust ASR, , and , Idiap-RR-42-2000 |
|
Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, , and , Idiap-RR-14-2000 |
|
Indexing spoken audio by LSA and SOMs, , Idiap-RR-06-2000 |
|
Intrinsic dimension estimation of data: an approach based on Grassberger-Procaccia's algorithm, and , Idiap-RR-33-2000 |
|
Learning the Decision Function for Speaker Verification, and , Idiap-RR-40-2000 |
|
Local Machine Learning Models for Spatial Data Analysis, and , Idiap-RR-34-2000 |
|
Mixture Models for Unsupervised and Supervised Learning, , Idiap-RR-18-2000 |
|
Mixtures of latent variable models for density estimation and classification, , Idiap-RR-25-2000 |
|
Multiple Hypotheses Video OCR, and , Idiap-RR-28-2000 |
|
Multiple Timescale Feature Combination towards Robust Speech Recognition, , Idiap-RR-29-2000 |
|
On the Convergence of SVMTorch, an Algorithm for Large-Scale Regression Problems, and , Idiap-RR-24-2000 |
|
Recent Developments in Speaker Verification at IDIAP, and , Idiap-RR-26-2000 |
|
Robust multi-stream speech recognition based on the combined reliabilities of the speech signal and phonemes estimates, , Idiap-RR-36-2000 |
Spatial Data Mapping with Support Vector Regression, and , Idiap-RR-09-2000 |
|
Support Vector Machines for Large-Scale Regression Problems, and , Idiap-RR-17-2000 |
|
Taking on the Curse of Dimensionality in Joint Distributions Using Neural Networks, and , Idiap-RR-01-2000 |
|
Test of several external posterior weighting functions for multiband Full Combination ASR, and , Idiap-RR-27-2000 |
|
The use of Boolean concepts in general classification contexts, , Idiap-RR-46-2000 |
|
Thematic Indexing of Spoken Documents by Using Self-Organizing Maps, , Idiap-RR-05-2000 |
|
Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, and , Idiap-RR-22-2000 |
|
Video Indexing and Similarity Retrieval by Largest Common Subgraph Detection using Decision Trees, , and , Idiap-RR-15-2000 |
|
Video sequence matching via decision tree path following, , and , Idiap-RR-12-2000 |
|
Weighting schemes for audio-visual fusion in speech recognition, , , , and , Idiap-RR-44-2000 |
|
1999
A comparison of noise reduction techniques for robust speech recognition, , Idiap-RR-10-1999 |
|
A comparison of two strategies for ASR in additive noise : Missing Data and Spectral Subtraction, and , Idiap-RR-17-1999 |
|
An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, , , , , , , , , , , and , Idiap-RR-24-1999 |
Automatic Facial Expression Analysis: A Survey, and , Idiap-RR-19-1999 |
|
CLIENT / WORLD MODEL SYNCHRONOUS ALIGNMENT FOR SPEAKER VERIFICATION, , , and , Idiap-RR-23-1999 |
|
Combinatorial Approach for Data Binarization, and , Idiap-RR-08-1999 |
|
Combining Wavelet-domain Hidden Markov Trees with Hidden Markov Models, , and , Idiap-RR-14-1999 |
|
Data binarization by discriminant elimination, , and , Idiap-RR-04-1999 |
|
DynaBoost: Combining Boosted Hypotheses in a Dynamic Way, and , Idiap-RR-09-1999 |
|
Environmental spatial data classification with Support Vector Machines, , , and , Idiap-RR-07-1999 |
|
Fast latent semantic indexing of spoken documents by using self-organizing maps, , Idiap-RR-20-1999 |
|
Fusion of Face and Speech Data for Person Identity Verification, , and , Idiap-RR-03-1999 |
|
Indexing Audio Documents by using Latent Semantic Analysis and SOM, , Idiap-RR-13-1999 |
|
INtegrating SPEech acoustic and linguistic Constraints: Baseline System Development, , , and , Idiap-RR-21-1999 |
|
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , Idiap-RR-16-1999 |
Iterative Posterior-Based Keyword Spotting Without Filler Models: Iterative Viterbi Decoding and One-Pass Approach, and , Idiap-RR-27-1999 |
Latent Semantic Indexing by Self-Organizing Map, and , Idiap-RR-12-1999 |
|
Multi-stream adaptive evidence combination for noise robust ASR, , , and , Idiap-RR-26-1999 |
|
Numerical Experiments with Support Vector Machines, and , Idiap-RR-15-1999 |
|
Off-Line Cursive Script Recognition Based on Continuous Density HMM, and , Idiap-RR-25-1999 |
|
Recognition of Asymmetric Facial Action Unit Activities and Intensities, and , Idiap-RR-22-1999 |
|
Segmentation of X-ray Image Sequences Showing the Vocal Tract, , Idiap-RR-01-1999 |
|
Segmentation of X-ray Image Sequences Showing the Vocal Tract (with tool documentation), , Idiap-RR-01-1999 |
|
Speaker verification experiments on the XM2VTS database, , Idiap-RR-02-1999 |
|
Synchronous Alignment, and , Idiap-RR-06-1999 |
|
Towards introducing long-term statistics in MUSE for robust speech recognition, and , Idiap-RR-18-1999 |
|
1998
Acoustico-articulatory inversion of unequal-length tube models through lattice inverse filtering, , Idiap-RR-16-1998 |
|
Audio-Visual Person Verification, , , , and , Idiap-RR-18-1998 |
|
Automatic Speech Recognition: an Auditory Perspective, , and , Idiap-RR-17-1998 |
Combined 5x2cv F-Test for Comparing Supervised Classification Learning Algorithms, , Idiap-RR-04-1998 |
|
Combining Linear Dichotomizers to Construct Nonlinear Polychotomizers, and , Idiap-RR-05-1998 |
|
Continuous Audio-Visual Speech Recognition, and , Idiap-RR-02-1998 |
|
Evaluating the Complexity of Databases for Person Identification and Verification, , and , Idiap-RR-10-1998 |
|
Illumination-robust Pattern Matching Using Distorted Color Histograms, and , Idiap-RR-09-1998 |
|
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, and , Idiap-RR-11-1998 |
|
Introduction à la reconnaissance de la parole et du locuteur, , Idiap-RR-13-1998 |
Localized mixtures of experts, , Idiap-RR-14-1998 |
Multi-Modal Data Fusion for Person Authentication using SVM, , Idiap-RR-07-1998 |
|
On the Complexity of Recognizing Regions Computable by Two-Layered Perceptrons, , Idiap-RR-03-1998 |
|
Optimal Parameterization of Point Distribution Models, and , Idiap-RR-01-1998 |
|
Speaker Verification: A Quick Overview, and , Idiap-RR-12-1998 |
|
Subband-Based Speech Recognition in Noisy Conditions: The Full Combination Approach, , and , Idiap-RR-15-1998 |
|
Support Vector Machine for Multiclass Classification, and , Idiap-RR-06-1998 |
|
1997
Acoustic-Labial Speaker Verification, , , and , Idiap-RR-13-1997 |
|
An Optical Thresholding Perceptron, , , , and , Idiap-RR-16-1997 |
|
Decision fusion in a multi-modal identity verification system using a multi-linear classifier, , and , Idiap-RR-06-1997 |
Discrete All-Positive Multilayer Perceptrons for Optical Implementation, , and , Idiap-RR-02-1997 |
|
Fast Object Detection using MLP and FFT, , Idiap-RR-11-1997 |
|
Handwritten Digit Recognition with Binary Optical Perceptron, , , and , Idiap-RR-15-1997 |
|
Improved Pairwise Coupling Classification With Correcting Classifiers, and , Idiap-RR-09-1997 |
|
Investigation of a possible process identity between DRM and Linear Filtering, , Idiap-RR-19-1997 |
|
Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, and , Idiap-RR-05-1997 |
|
Mixtures of Experts Estimate A Posteriori Probabilities, , Idiap-RR-07-1997 |
|
Neural Network Adaptations to Hardware Implementations, and , Idiap-RR-17-1997 |
|
On the Complexity of Recognizing Iterated Differences of Polyhedra, , Idiap-RR-10-1997 |
|
Optimal Setting of Weights, Learning Rate, and Gain, and , Idiap-RR-04-1997 |
|
Pruning of Neural Networks, and , Idiap-RR-03-1997 |
|
Reconnaissance de caractères manuscrits à l'aide de réseaux neuromimétiques, , Idiap-RR-18-1997 |
|
Robust Speech Recognition based on Multi-Stream Features, , and , Idiap-RR-01-1997 |
|
Speechreading using Probabilistic Models, and , Idiap-RR-12-1997 |
|
Text dependent speaker verification using binary classifiers, , and , Idiap-RR-08-1997 |
|
Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition, and , Idiap-RR-14-1997 |
|
1996
An Implementation of Logical Analysis of Data, , , , , and , Idiap-RR-05-1996 |
|
Combining methods to improve speaker verification decision, , , and , Idiap-RR-02-1996 |
|
Image Classification by Neural Networks for the Quality Control of Watches, , and , Idiap-RR-10-1996 |
|
Multi-Stream Speech Recognition, , and , Idiap-RR-07-1996 |
|
On the Complexity of the Class of Regions Computable by a Two-Layered Perceptron, , Idiap-RR-03-1996 |
|
On the Decomposition of Polychotomies into Dichotomies, and , Idiap-RR-08-1996 |
|
On Variations of the Convex Hull Operator, , Idiap-RR-06-1996 |
|
Secured vocal access to telephone servers, , , , and , Idiap-RR-04-1996 |
|
Speaker-Dependent Speech Recognition Based on Phone-Like Units Models - Application to Voice Dialing, and , Idiap-RR-09-1996 |
|
Swiss French PolyPhone and PolyVar: telephone speech databases to model inter- and intra-speaker variability, , , , and , Idiap-RR-01-1996 |
|
1995
Apprentissage de prototypes de caractères à partir de l'image d'un texte manuscrit et avec l'aide d'un opérateur, , Idiap-RR-01-1995 |
|
Définition et évaluation d'un protocole de négociation dans un système multi-agents de reconnaissance de la parole, , Idiap-RR-02-1995 |
Experiments with robust similarity measures for OCR, , Idiap-RR-03-1995 |
Neural Networks with Adaptive Learning Rate and Momentum Terms, and , Idiap-RR-04-1995 |
|
1994
A System for the Off-Line Recognition of Handwritten Text, , Idiap-RR-02-1994 |
|
Adaptive Multilayer Optical Neural Network Design, and , Idiap-RR-04-1994 |
|
High Order and Multilayer Perceptron Initialization, and , Idiap-RR-07-1994 |
|
1993
An RBF Network that Learns Some Aspects of Perceptual Organization, , Idiap-RR-10-1993 |
|
Finding Lines under Bounded Error, , Idiap-RR-11-1993 |
|
Geometric Matching in Computer Vision--Algorithms and Open Problems, , Idiap-RR-07-1993 |
|
Higher-Order Statistics in Visual Object Recognition, , Idiap-RR-02-1993 |
|
Recognition of Handprinted Digits, , Idiap-RR-06-1993 |
|
The 3D Indexing Problem, , Idiap-RR-08-1993 |
|
Un interface d'indexation documentaire: I d'i, version 1.4, , Idiap-RR-01-1993 |
|
Un interface d'indexation documentaire: I d'i, version 2.0, , Idiap-RR-03-1993 |
|
Un interface de recherche documentaire: I de r, version 2.0, , Idiap-RR-04-1993 |
|
View-Based Recognition, , Idiap-RR-09-1993 |
|
1992
Neural Network Formalization, , Idiap-RR-01-1992 |
|
Un environnement d'analyse linguistique robuste: CPD, version 1.7, , Idiap-RR-03-1992 |
|
Une technique efficace de traitement en Prolog de la morphologie flexionnelle du français, , Idiap-RR-04-1992 |
|
Publications of type Idiap-Com
2024
Integrating large language models and ASR systems using confidence measures and prompting, , Idiap-Com-02-2024 |
|
On Learning to Classify Meerkat Calls, , Idiap-Com-01-2024 |
|
2023
A Bayesian approach to machine learning model comparison, , Idiap-Com-01-2023 |
|
Generalizable Automatic Classification of Sleep Stages, , Idiap-Com-02-2023 |
|
The Suisse Romande Local News Dataset, and , Idiap-Com-03-2023 |
|
2022
Claim-Dissector: An Interpretable Fact-Checking System with Joint Re-ranking and Veracity Prediction, , and , Idiap-Com-03-2022 |
[URL] |
Modeling and Optimal Control of the Open Torque-Controlled Quadruped Robot Solo-12, , Idiap-Com-02-2022 |
|
Planning and control of robot manipulation tasks, , Idiap-Com-01-2022 |
|
2021
Active tuberculosis detection from frontal chest X-ray images, , Idiap-Com-01-2021 |
[URL] |
2020
Automatic Speech Recognition Engines Adapted for Embedded Platforms, , Idiap-Com-01-2020 |
|
Deep Learning of Charisma, , Idiap-Com-03-2020 |
|
Face Recognition systems: performance evaluation and bias analysis, , Idiap-Com-04-2020 |
|
Machine Learning for Adverse Event Detection in Latent Tuberculosis Infection Treatment, , Idiap-Com-02-2020 |
|
2019
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, , and , Idiap-Com-01-2019 |
|
2018
Local Affine Approximations for Improving Knowledge Transfer, and , Idiap-Com-01-2018 |
[URL] |
2016
Integration of Real-Time Speech Processing Technologies for Online Gaming, , and , Idiap-Com-01-2016 |
|
2015
HAVC-II - Idiap Private Cloud (Technical Inside-Out), , Idiap-Com-01-2015 |
|
2013
Notes on Probabilistic Linear Discriminant Analysis, and , Idiap-Com-03-2013 |
|
Who Wants To Be A Millionaire? (II), , and , Idiap-Com-02-2013 |
|
2012
Decision tree clustering for KL-HMM, and , Idiap-Com-01-2012 |
|
ICB 2013 - Competition on speaker recognition in mobile environment using the MOBIO database: The Evaluation Plan, , and , Idiap-Com-04-2012 |
|
Who Wants To Be A Millionaire?, , , and , Idiap-Com-03-2012 |
|
2011
Domain-specific language model adaptation: a case study, , and , Idiap-Com-01-2013 |
|
Face Detection using Ferns, and , Idiap-Com-01-2011 |
|
2010
Finding without searching, , Idiap-Com-01-2010 |
|
2009
MOBIO Database for the ICPR 2010 Face and Speech Competition, and , Idiap-Com-02-2009 |
|
Multimodal Data Flow Controller, , Idiap-Com-01-2009 |
|
2008
A Weighted Finite State Transducer tutorial, , Idiap-Com-03-2008 |
|
The Anterior Cingulate Cortex, , Idiap-Com-02-2008 |
|
2007
Correcting Confusion Matrices for Phone Recognizers, , Idiap-Com-03-2007 |
|
Feature Selection Methods on Distributed Linear Inverse Solutions for a Non-Invasive Brain-Machine Interface, , and , Idiap-Com-04-2007 |
|
Google Portrait, , and , Idiap-Com-07-2007 |
|
Speech Recognition based on Template Matching and Phone Posterior Probabilities, , and , Idiap-Com-02-2007 |
|
2006
Activity Report 2005, , Idiap-Com-01-2006 |
|
Annotation of face detection: description of XML format and files, , , and , Idiap-Com-06-2006 |
|
Managing IDIAP Inventory (Computers, Components, Software and Licences), and , Idiap-Com-04-2006 |
|
ORGIDIAP : le couteau suisse pour la gestion d'une entreprise, and , Idiap-Com-05-2006 |
|
The Juicer LVCSR Decoder - User Manual for Juicer version 0.5.0, , Idiap-Com-03-2006 |
|
2005
A Video Database for Head Pose Tracking Evaluation, and , Idiap-Com-04-2005 |
|
Activity Report 2004, , Idiap-Com-01-2005 |
|
From Meeting Recordings to Web Distribution: Description of the Process, and , Idiap-Com-05-2005 |
|
Lighting Normalization Algorithms for Face Verification, , and , Idiap-Com-03-2005 |
|
2004
A video package for Torch, and , Idiap-Com-02-2004 |
|
Activity Report 2003, , Idiap-Com-01-2004 |
|
The IDIAP Multimedia File Server, and , Idiap-Com-05-2004 |
|
Une application de reconnaissance du locuteur : le User-Customized Password Speaker Verification, , Idiap-Com-04-2004 |
|
2003
A Hierarchical Keyframe User Interface for Browsing Video over the Internet, , , and , Idiap-Com-02-2003 |
|
Activity Report 2002, , Idiap-Com-01-2003 |
|
Enhanced Performance of Multimodal Biometric Systems by Confidence Estimation, , Idiap-Com-05-2003 |
|
HMM inference towards flexible speech recognition, , Idiap-Com-03-2003 |
IDIAP Demonstration Management, and , Idiap-Com-06-2003 |
|
In Search of a Good BET, and , Idiap-Com-11-2003 |
|
Information Retrieval on Noisy Text, , and , Idiap-Com-08-2003 |
|
Internship Report : Summer 2003, , Idiap-Com-09-2003 |
|
Meeting Data Collection Specifications, , and , Idiap-Com-10-2003 |
|
Multimodal Identity Verification at IDIAP, , Idiap-Com-04-2003 |
|
Small Microphone Array: Algorithms and Hardware, and , Idiap-Com-07-2003 |
|
2002
Activity Report 2001, , Idiap-Com-01-2002 |
|
Algorithms for Video Structuring, , and , Idiap-Com-05-2002 |
|
An information theoretic measure of sequence recognition performance, , Idiap-Com-03-2002 |
|
Handwriting Recognition Demo, , and , Idiap-Com-02-2002 |
|
Speech Processing & Text-Independent Automatic Person Verification, , Idiap-Com-08-2002 |
|
The IDIAP Smart Meeting Room, , Idiap-Com-07-2002 |
|
The MNIST Database of Handwritten upper-case letters, and , Idiap-Com-04-2002 |
|
The VidTIMIT Database, , Idiap-Com-06-2002 |
|
TODE: A Decoder for Continuous Speech Recognition, , Idiap-Com-09-2002 |
|
2001
Activity Report 2000, , Idiap-Com-01-2001 |
|
Développement d'un système de demande interactif via le téléphone (INFOVOX), , Idiap-Com-08-2001 |
|
Development of a DTW based Speech Recognition System over the telephone line, , and , Idiap-Com-05-2001 |
|
EPFL lab session 1/2: Introduction to Gaussian statistics and pattern recognition, , Idiap-Com-06-2001 |
|
EPFL lab session 2/2: Introduction to Hidden Markov Models, , Idiap-Com-07-2001 |
|
Rebuilding Speech Recognition on Windows, , Idiap-Com-09-2001 |
|
Speech Recognition Engine for Interactive Voice Response application on Windows, , Idiap-Com-10-2001 |
|
2000
Activity Report 1999, , Idiap-Com-01-2000 |
|
Language modeling based on neural clustering of words, , Idiap-Com-02-2000 |
|
Personal Voice Dialing over PC, and , Idiap-Com-05-2000 |
|
Support Vector Machines, Théorie et Application, , Idiap-Com-03-2000 |
|
Various adaptive weighting schemes for large vocabulary robust audio-visual ASR, with particular reference to the cocktail party effect, , Idiap-Com-04-2000 |
1999
Latent variable decomposition for posteriors or likelihood based subband ASR, , Idiap-Com-04-1999 |
|
1998
Baseline System for Hybrid Speech Recognition on French (Experiments on BREF), , Idiap-Com-07-1998 |
|
Evaluation Protocol for the extended M2VTS Database (XM2VTSDB), and , Idiap-Com-05-1998 |
|
Fast Multi-Scale Face Detection, , Idiap-Com-04-1998 |
|
1997
1997 NIST Evaluation: Text independent speaker detection (verification), and , Idiap-Com-03-1997 |
|
Activity Report 1996, , , , and , Idiap-Com-01-1997 |
|
Quantization and Pruning of Multilayer Perceptrons: Towards Compact Neural Networks, and , Idiap-Com-02-1997 |
|
Réalisation d'un Majordome vocal, , Idiap-Com-04-1997 |
|
Some Methods for Training Mixtures of Experts, , Idiap-Com-05-1997 |
|
Speaker Verification by Pairwise Coupling, , Idiap-Com-07-1997 |
SWISSCOM "AVIS" PROJECT (No. 392) Advanced Vocal Interfaces Services, , and , Idiap-Com-06-1997 |
|
1996
Annulation d'écho sur une ligne téléphonique, , , and , Idiap-Com-06-1996 |
|
Datapump Full-Duplex, , , and , Idiap-Com-02-1996 |
|
Présentation du Modèle DRM, , Idiap-Com-03-1996 |
|
Towards a Multi-agents Approach for Understanding Speech, and , Idiap-Com-05-1996 |
|
VoicePhone: An Interactive Vocal Server for Telephone Numbers, , Idiap-Com-04-1996 |
|
Proceedings on Privacy Enhancing Technologies
Identifying Privacy Personas, and , in: Proceedings on Privacy Enhancing Technologies, 2025 |
|
ACM Digital Government: Research and Practice
Generative AI Literacy: Twelve Defining Competencies, , and , in: ACM Digital Government: Research and Practice, 2024 |
[DOI] [URL] |
Acoustics
Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models, and , in: Acoustics, 6:470 - 488, 2024 |
[DOI] |
Advances in Neural Information Processing Systems (NeurIPS)
CulturePark: Boosting Cross-cultural Understanding in Large Language Models, , , , , and , in: Advances in Neural Information Processing Systems (NeurIPS), 2024 |
arXiv
SWEET - An Open Source Modular Platform for Contactless Hand Vascular Biometric Experiments, , , , and , in: arXiv, 2024 |
[DOI] [URL] |
Frontiers in Neuroscience
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks, and , in: Frontiers in Neuroscience, 18(1449181), 2024 |
[DOI] |
IEEE Access
Group Membership Verification via Nonlinear Sparsifying Transform Learning, , , , , , and , in: IEEE Access, 12:86739-86751, 2024 |
[DOI] [URL] |
IEEE Robotics and Automation Letters
Logic Learning from Demonstrations for Multi-step Manipulation Tasks in Dynamic Environments, , , and , in: IEEE Robotics and Automation Letters, 2024 |
|
IEEE Robotics and Automation Letters (RA-L)
A Minimum-Jerk Approach to Handle Singularities in Virtual Fixtures, , and , in: IEEE Robotics and Automation Letters (RA-L), 9(11):10256-10263, 2024 |
|
A Probabilistic Approach to Multi-Modal Adaptive Virtual Fixtures, , , , , , and , in: IEEE Robotics and Automation Letters (RA-L), 2024 |
|
Online Learning of Continuous Signed Distance Fields Using Piecewise Polynomials, , and , in: IEEE Robotics and Automation Letters (RA-L), 9(6):6020-6026, 2024 |
[DOI] [URL] |
IEEE Robotics and Automation Magazine
gafro: Geometric Algebra for Robotics, , and , in: IEEE Robotics and Automation Magazine, 2024 |
|
IEEE Transactions on Biometrics, Behavior, and Identity Science
EdgeFace : Efficient Face Recognition Model for Edge Devices, , , , and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2024 |
|
From Modalities to Styles: Rethinking the Domain Gap in Heterogeneous Face Recognition, and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2024 |
|
IEEE Transactions on Circuits and Systems for Video Technology
Mirror-based Full-View Finger Vein Authentication with Illumination Adaptation, , , , and , in: IEEE Transactions on Circuits and Systems for Video Technology, 2024 |
[DOI] |
IEEE Transactions on Robotics (T-RO)
An Optimal Control Formulation of Tool Affordance Applied to Impact Tasks, , , and , in: IEEE Transactions on Robotics (T-RO), 2024 |
|
Online Multi-Contact Receding Horizon Planning via Value Function Approximation, , , , , , , , , , and , in: IEEE Transactions on Robotics (T-RO), 2024 |
|
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting, and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 |
[DOI] |
International Journal of Robotics Research
Tensor Train for Global Optimization Problems in Robotics, , , and , in: International Journal of Robotics Research, 43(6):811-839, 2024 |
[DOI] |
Journal of Biomedical Informatics, Elsevier
PRIMIS: Privacy-Preserving Medical Image Sharing via Deep Sparsifying Transform Learning with Obfuscation, , , , , , , and , in: Journal of Biomedical Informatics, Elsevier, 150, 2024 |
[DOI] [URL] |
Journal of Sustainable Real Estate
Missed Opportunities in Building Energy Performance Assessment, , , and , in: Journal of Sustainable Real Estate, 16(1), 2024 |
[DOI] |
Journal of Urban Management
Why daylight should be a priority for urban planning, , , , , , , , , , , , , and , in: Journal of Urban Management, 2024 |
[DOI] [URL] |
Journal of Wrist Surgery
Developing 3D-Printed Wrist Splints for Distal Radius and Scaphoid Fractures, , , , , , and , in: Journal of Wrist Surgery, 2024 |
[DOI] [URL] |
Microvascular Research
Absolute retinal blood flow in healthy eyes and in eyes with retinal vein occlusion, , , , , , , and , in: Microvascular Research, 152, 2024 |
[DOI] |
PACM on Interactive, Mobile, Wearable, and Ubiquitous Technologies (IMWUT)
M3BAT: Unsupervised Domain Adaptation for Multimodal Mobile Sensing with Multi-Branch Adversarial Training, , and , in: PACM on Interactive, Mobile, Wearable, and Ubiquitous Technologies (IMWUT), 8(2):46, 2024 |
[DOI] |
Pattern Recognition
Test-time adaptation for 6D pose tracking, , and , in: Pattern Recognition, 152, 2024 |
[DOI] [URL] |
Transactions on Machine Learning Research (TMLR)
Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder, , , and , in: Transactions on Machine Learning Research (TMLR), 2024 |
[URL] |
ACM Digital Government: Research and Practice
Urban Crowdsourcing Platforms across the World: A Systematic Review, and , in: ACM Digital Government: Research and Practice, 2023 |
[DOI] [URL] |
ACM Journal on Computing and Sustainable Societies
Characterizing Swiss Alpine Lakes: from Wikipedia to Citizen Science, and , in: ACM Journal on Computing and Sustainable Societies, 2023 |
|
Aerospace
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers, , , , and , in: Aerospace, 10(5), 2023 |
[DOI] [URL] |
An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain, , , , , , and , in: Aerospace, 10(10):876, 2023 |
[DOI] [URL] |
Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding, , , , , , , , , , and , in: Aerospace, 10(10):898, 2023 |
[DOI] [URL] |
Validating Automatic Speech Recognition and Understanding for Pre-Filling Radar Labels-Increasing Safety While Reducing Air Traffic Controllers' Workload, , , , , , , and , in: Aerospace, 10(6):538, 2023 |
[DOI] |
arXiv
Nonparametric Variational Regularisation of Pretrained Transformers, and , in: arXiv, 2023 |
[DOI] [URL] |
Association for Computational Linguistics
Claim-Dissector: An Interpretable Fact-Checking System with Joint Re-ranking and Veracity Prediction, , and , in: Association for Computational Linguistics, Findings of the Association for Computational Linguistics: ACL 2023:10184–10205, 2023 |
[URL] |
Big Data & Society
Diversity and neocolonialism in Big Data research: Avoiding extractivism while struggling with paternalism, , , , , , and , in: Big Data & Society, 2023 |
[DOI] |
Biological Imaging
PAAQ: Paired Alternating AcQuisitions for Virtual High Frame Rate Multichannel Cardiac Fluorescence Microscopy, , , and , in: Biological Imaging, 3:e20, 2023 |
[DOI] |
BMC Bioinformatics
A systematic review of biologically-informed deep learning models for cancer: fundamental trends for encoding and interpreting oncology data, , , , and , in: BMC Bioinformatics, 24(198), 2023 |
[DOI] |
Clinical and Experimental Optometry
What do individuals with visual impairment need and want from a dialogue-based digital assistant?, , , , and , in: Clinical and Experimental Optometry, 2023 |
Cognitive Systems Research
A lexical-availability-based framework from short communications for automatic personality identification, , , , and , in: Cognitive Systems Research, 79:126-137, 2023 |
[DOI] [URL] |
Computers and Electronics in Agriculture
Towards Smart Pruning: ViNet, a Deep-Learning Approach for Grapevine Structure Estimation, , , and , in: Computers and Electronics in Agriculture, 207:107736, 2023 |
[DOI] [URL] |
Energies
Assessment of Subsidization Strategies for Multi-Objective Optimization of Energy Efficiency Measures for Building Renovation at District Scale, , , , and , in: Energies, 16(15), 2023 |
[DOI] |
Energy
Verification of an open-source Python library for the simulation of district heating networks with complex topologies, and , in: Energy, 2023 |
[DOI] [URL] |
ESMO Open
Learning Lessons from the COVID-19 pandemic for Real World Evidence research in Oncology–shared perspectives from an international consortia, , and , in: ESMO Open, 2023 |
European Journal of Cancer
Defining the role of real-world data in cancer clinical research: the position of the European Organisation for Research and Treatment of Cancer, , and , in: European Journal of Cancer, 2023 |
Frontiers in Molecular Neuroscience
RNA at a breaking point? Cytoplasmic cleavage and other post-transcriptional RNA processing in neurodevelopment and disease, , and , in: Frontiers in Molecular Neuroscience, 2023 |
Genome Research
The predicted RNA-binding protein regulome of axonal mRNAs, , , and , in: Genome Research, 2023 |
GigaScience
Suggesting disease associations for overlooked metabolites using literature from metabolic neighbors, , , , , , , , and , in: GigaScience, 12:13, 2023 |
[DOI] |
IEEE Access
Attacking Face Recognition with T-shirts: Database, Vulnerability Assessment and Detection, and , in: IEEE Access, 2023 |
|
IEEE Robotics and Automation Letters
Coordinated Multi-Robot Shared Autonomy Based on Scheduling and Demonstrations, , , , , , , and , in: IEEE Robotics and Automation Letters, 8(12):8335 - 8342, 2023 |
[DOI] [URL] |
Whole-Body Ergodic Exploration with a Manipulator Using Diffusion, , and , in: IEEE Robotics and Automation Letters, 8(12):8581-8587, 2023 |
[DOI] [URL] |
IEEE Signal Processing Magazine
From Nano to Macro: An overview of the IEEE Bio Image and Signal Processing Technical Committee, , , , , , , , and , in: IEEE Signal Processing Magazine, 40(4):61-71, 2023 |
[DOI] [URL] |
IEEE Transactions on Robotics
Geometric Algebra for Optimal Control with Applications in Manipulation Tasks, and , in: IEEE Transactions on Robotics, 2023 |
|
International Journal of Selection and Assessment
Automatic identification of storytelling responses to past-behavior interview questions via machine learning, , , , , and , in: International Journal of Selection and Assessment, 2023 |
|
Journal of Biomedical Informatics
Meta-analysis informed machine learning: Supporting cytokine storm detection during CAR-T cell Therapy, , , , , , , , , and , in: Journal of Biomedical Informatics, 142, 2023 |
[DOI] |
Journal of Wrist Surgery
Development of 3D-printed Patient-Specific Anatomical Braces (PSAB) for Distal Radius and Scaphoid Fractures, , , , , , and , in: Journal of Wrist Surgery, 2023 |
Knowledge-based Systems
A Canonical Context-preserving Representation for Open IE: Extracting Semantically Typed Relational Tuples from Complex Sentences, , , and , in: Knowledge-based Systems, 2023 |
Nature Communications
Integrated transcriptome landscape of ALS identifies genome instability linked to TDP-43 pathology, , , , , , , , , , , , , and , in: Nature Communications, 2023 |
Optics Express
Efficient compressed sensing reconstruction for 3D fluorescence microscopy using OptoMechanical Modulation Tomography (OMMT) with a 1+2D regularization, and , in: Optics Express, 31(20):31718-31733, 2023 |
[DOI] |
Proceedings of the ACM on Human Computer Interaction
Periscope: A Robotic Camera System to Support Remote Physical Collaboration, , , , and , in: Proceedings of the ACM on Human Computer Interaction, 2023 |
|
Robotics and Autonomous Systems
A Geometric Optimal Control Approach for Imitation and Generalization of Manipulation Skills, , , , and , in: Robotics and Autonomous Systems, 2023 |
Sustainable Cities and Society
From Zero Energy to Zero Power Buildings: a new paradigm for a sustainable transition of the building stock, , and , in: Sustainable Cities and Society, 2023 |
[DOI] [URL] |
The International Journal of Tuberculosis and Lung Disease
The rise of artificial intelligence reading of chest X-rays for enhanced TB diagnosis and elimination, , , , , , , , and , in: The International Journal of Tuberculosis and Lung Disease, 27(5):367-372, 2023 |
[DOI] [URL] |
The Journal of Creative Behavior
Loose and Tight: Creative Formation but Rigid Use of Nominal Compounds in Conspiracist Texts, , and , in: The Journal of Creative Behavior, 2023 |
The Leadership Quarterly
Combating COVID-19 with charisma: Evidence on governor speeches in the United States, , , , , , and , in: The Leadership Quarterly, 2023 |
[DOI] [URL] |
Total Environment Research Themes
Development and comparison of adaptive data-driven models for thermal comfort assessment and control, , , , , and , in: Total Environment Research Themes, 8, 2023 |
[DOI] [URL] |
Transactions of the ACL
Introduction to Mathematical Language Processing: Informal Proofs, Word Problems, and Supporting Tasks, and , in: Transactions of the ACL, 2023 |
Transactions on Machine Learning Research
Benefits of Max Pooling in Neural Networks: Theoretical and Experimental Evidence, , and , in: Transactions on Machine Learning Research, 2023 |
Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention, and , in: Transactions on Machine Learning Research, 2023 |
[URL] |
Travel Medicine and Infectious Disease
Epidemiological and clinical analysis of Polish short-term and long-term travelers returning from tropical countries, , and , in: Travel Medicine and Infectious Disease, 55, 2023 |
[DOI] |
ACM Transactions on Human-Robot Interaction
Social Robot Co-Design Canvases: A Participatory Design Framework, , , and , in: ACM Transactions on Human-Robot Interaction, 11(1), 2022 |
[DOI] [URL] |
ACM Transactions on Multimedia Computing, Communications, and Applications
Robust Unsupervised Gaze Calibration using Conversation and Manipulation Attention Priors, and , in: ACM Transactions on Multimedia Computing, Communications, and Applications, 18(1):26, 2022 |
[DOI] [URL] |
Artificial Intelligence
Assessing the communication gap between AI models and healthcare professionals: explainability, utility and trust in AI-driven clinical decision-making, , , , , , and , in: Artificial Intelligence, 2022 |
arXiv
A Variational AutoEncoder for Transformers with Nonparametric Variational Information Bottleneck, and , in: arXiv, 2022 |
[DOI] [URL] |
HyperMixer: An MLP-based Green AI Alternative to Transformers, , , , , , and , in: arXiv, 2022 |
bioRxiv
Meta-analysis of the amyotrophic lateral sclerosis spectrum uncovers genome instability, , , , , , , , , , , and , in: bioRxiv, 2022 |
The RNA Binding proteome of axonal mRNAs in sympathetic neurons, , and , in: bioRxiv, 2022 |
BMJ Open
Biomarker identification using dynamic time warping analysis: a longitudinal cohort study of COVID-19 patients in a UK tertiary hospital, , , , , and , in: BMJ Open, 2022 |
Brain Sciences
Differentiation of motor speech disorders through the seven deviance scores from MonPaGe-2.0.s, , and , in: Brain Sciences, 12(11):1471-1487, 2022 |
British Journal of Cancer
Patient Attrition in Molecular Tumour Boards: A Systematic Review, , , , , and , in: British Journal of Cancer, 2022 |
Building Simulation
Ranking parameters in urban energy models for various building forms and climates using sensitivity analysis, , , and , in: Building Simulation, 2022 |
[DOI] |
Clinical Cancer Informatics
Establishment of CORONET, COVID-19 Risk in Oncology Evaluation Tool, to Identify Cancer Patients at Low Versus High Risk of Severe Complications of COVID-19 Infection Upon Presentation to Hospital, , , and , in: Clinical Cancer Informatics, 2022 |
Computational Linguistics
Transformers and the representation of biomedical background knowledge, , , , , and , in: Computational Linguistics, 2022 |
Computer Speech & Language
Towards Lifelong Human Assisted Speaker Diarization, , , , , , , , , , , , and , in: Computer Speech & Language, 2022 |
[DOI] [URL] |
Computer Speech & Language
Adjustable Deterministic Pseudonymization of Speech, , and , in: Computer Speech & Language, 72, 2022 |
[DOI] |
Energy and Buildings
Saving energy by maximising daylight and minimising the impact on occupants: an automatic lighting system approach, , , , , , , and , in: Energy and Buildings, 2022 |
[DOI] |
Frontiers in Neuroscience
A surrogate gradient spiking baseline for speech command recognition, and , in: Frontiers in Neuroscience, 2022 |
[DOI] [URL] |
Genome Research
Physiological intron retaining transcripts in the cytoplasm abound during human motor neurogenesis, , , , , , , , and , in: Genome Research, 2022 |
IEEE Access
Sensing Eating Events in Context: A Smartphone-Only Approach, , , , , and , in: IEEE Access, 10, 2022 |
[DOI] [URL] |
IEEE Robotics and Automation Letters
drozBot: Using Ergodic Control to Draw Portraits, , and , in: IEEE Robotics and Automation Letters:7, 2022 |
[DOI] [URL] |
From Key Positions to Optimal Basis Functions for Probabilistic Adaptive Control, , and , in: IEEE Robotics and Automation Letters, 2022 |
|
IEEE Robotics and Automation Letters (RA-L)
Passive Bimanual Skills Learning from Demonstration with Motion Graph Attention Networks, , , , and , in: IEEE Robotics and Automation Letters (RA-L), 7(2):4917-4923, 2022 |
Robot Cooking with Stir-fry: Bimanual Non-prehensile Manipulation of Semi-fluid Objects, , , , , , and , in: IEEE Robotics and Automation Letters (RA-L), 7(2):5159-5166, 2022 |
|
IEEE Transactions on Biometrics, Behavior, and Identity Science
Domain-Specific Adaptation of CNN for Detecting Face Presentation Attacks in NIR, , , , , , , and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2022 |
|
IEEE Transactions on Information Forensics and Security
Prepended Domain Transformer: Heterogeneous Face Recognition without Bells and Whistles, , and , in: IEEE Transactions on Information Forensics and Security, 2022 |
|
IEEE Transactions on Robotics
Ergodic Exploration using Tensor Train: Applications in Insertion Tasks, , and , in: IEEE Transactions on Robotics, 38(2):906-921, 2022 |
[DOI] [URL] |
Int. Conf. on Mobile and Ubiquitous Multimedia
Health Talk: Understanding Practices of Popular Professional YouTubers, , , , and , in: Int. Conf. on Mobile and Ubiquitous Multimedia, 2022 |
|
International Journal of Robotics Research
Learning from Demonstration using Products of Experts: Applications to Manipulation and Task Prioritization, , and , in: International Journal of Robotics Research, 41(2):163-188, 2022 |
|
JCO Clinical Cancer Informatics
digital ECMT cancer trial matching tool, an open source research application to support oncologists in the identification of precision medicine clinical trials, , and , in: JCO Clinical Cancer Informatics, 2022 |
Journal of Clinical Virology
Wave comparisons of clinical characteristics and outcomes of COVID-19 admissions - Exploring the impact of treatment and strain dynamics, , , , , , and , in: Journal of Clinical Virology, 2022 |
Journal of Official Statistics
Response Burden and Dropout in a Probability-Based Online Panel Study – A Comparison between an App and Browser-Based Design, , , , and , in: Journal of Official Statistics, 2022 |
[DOI] [URL] |
Journal of Speech, Language, and Hearing Research
Perceptual classification of motor speech disorders: the role of severity, speech task, and listener's expertise, , , and , in: Journal of Speech, Language, and Hearing Research, 2022 |
Journal of Survey Statistics and Methodology
Data Privacy Concerns as a Source of Resistance to Complete Mobile Data Collection Tasks via a Smartphone App, , , , and , in: Journal of Survey Statistics and Methodology, 2022 |
|
Nature Scientific Reports
State-of-the-art retinal vessel segmentation with minimalistic models, , , , , and , in: Nature Scientific Reports, 12(6174), 2022 |
[DOI] |
npj Digital Medicine
A Systems Approach Towards Remote Health-Monitoring in Older Adults: Introducing a Zero-Interaction Digital Exhaust, , , , , , , , , , , , and , in: npj Digital Medicine, 5(Article 116), 2022 |
|
Opt. Continuum
Mechanical Artifacts in Optical Projection Tomography: Classification and Automatic Calibration, , , , and , in: Opt. Continuum, 1(12):2577-2589, 2022 |
[DOI] |
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT)
Generalization and Personalization of Mobile Sensing-Based Mood Inference Models: An Analysis of College Students in Eight Countries, , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 6(4), 2022 |
[DOI] |
Solar Energy Advances
Integrating daylight with general and task lighting: A longitudinal in-the-wild study in individual and open space working areas, , , , , , and , in: Solar Energy Advances, 2, 2022 |
[DOI] [URL] |
Springer Biological Cybernetics
Autoencoders Reloaded, and , in: Springer Biological Cybernetics, 2022 |
[DOI] [URL] |
Transactions of the Association for Computational Linguistics
Diff-Explainer: Differentiable Convex Optimization for Explainable Multi-Hop Inference, , , , and , in: Transactions of the Association for Computational Linguistics, 2022 |
[DOI] |
Brain
Aberrant cytoplasmic intron retention is a blueprint for RNA binding protein mislocalization in VCP-related amyotrophic lateral sclerosis, , , , , , , , and , in: Brain, 2021 |
Brain Pathology
Automated and unbiased discrimination of ALS from control tissue at single cell resolution, , , , , , , , and , in: Brain Pathology, 2021 |
Cell Reports
Cytoplasmic cleavage of IMPA1 3' UTR is necessary for maintaining axon integrity, , , , , , , , , , and , in: Cell Reports, 2021 |
Cognitive Computation
Applying Attention-Based Models for Detecting Cognitive Processes and Mental Health Conditions, , , and , in: Cognitive Computation:18, 2021 |
[DOI] [URL] |
Complex & Intelligent Systems
Bilateral Teleoperation with Object-Adaptive Mapping, , , , and , in: Complex & Intelligent Systems, 2021 |
|
Computación y Sistemas (CyS)
Classifier Implementation for Spontaneous EEG Activity during Schizophrenic Psychosis, , , , and , in: Computación y Sistemas (CyS), 25(3), 2021 |
[URL] |
Computer Speech and Language
Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages, , and , in: Computer Speech and Language, 65, 2021 |
[DOI] [URL] |
Diagrams
Number and quality of diagrams in scholarly publications is associated with number of citations, , and , in: Diagrams, 2021 |
Electronics
Domain-Adversarial Based Model with Phonological Knowledge for Cross-Lingual Speech Recognition, , , , , and , in: Electronics, 10(24):1-15, 2021 |
[DOI] [URL] |
ESMO Open
Longitudinal characterisation of haematological and biochemical parameters in cancer patients prior to and during COVID-19 reveals features associated with outcome, , , and , in: ESMO Open, 2021 |
Frontiers in Robotics and AI
Editorial: Artificial Intelligence and Human Movement in Industries and Creation, , , , and , in: Frontiers in Robotics and AI, 8:712521, 2021 |
|
Towards an Engagement-Aware Attentive Artificial Listener for Multi-Party Interactions, , , , , and , in: Frontiers in Robotics and AI, 8:189, 2021 |
[DOI] [URL] |
IEEE Access
Smartphone Sensing for the Well-being of Young Adults: A Review, and , in: IEEE Access, 2021 |
[DOI] [URL] |
IEEE Journal of Biomedical and Health Informatics
A Sensor-Driven Visit Detection System in Older Adults’ Homes: Towards Digital Late-Life Depression Marker Extraction, , , , , , , , , , and , in: IEEE Journal of Biomedical and Health Informatics, 26(4):1560-1569, 2021 |
[DOI] [URL] |
IEEE Robotics and Automation Letters
Learning Constrained Distributions of Robot Configurations with Generative Adversarial Network, , , and , in: IEEE Robotics and Automation Letters, 2021 |
|
Motion Mappings for Continuous Bilateral Teleoperation, , , , , and , in: IEEE Robotics and Automation Letters, 6(3):5048-5055, 2021 |
|
Probabilistic Adaptive Control for Robust Behavior Imitation, , and , in: IEEE Robotics and Automation Letters, 2021 |
|
IEEE Robotics and Automation Letters (RA-L)
Learning Optimal Impedance Control During Complex 3D Arm Movements, , , , and , in: IEEE Robotics and Automation Letters (RA-L), 6(2):1248-1255, 2021 |
[DOI] [URL] |
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Differential Approach for Gaze Estimation, , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(3):1092-1098, 2021 |
[DOI] [URL] |
IEEE Transactions on Biometrics, Behavior, and Identity Science
Fairness in Biometrics: a figure of merit to assess biometric verification systems, and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021 |
[DOI] |
Improving Generalization of Deepfake Detection with Data Farming and Few-Shot Learning, and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021 |
|
IEEE Transactions on Information Forensics and Security
On Joint Optimization of Automatic Speaker Verification and Anti-spoofing in the Embedding Space, , , , and , in: IEEE Transactions on Information Forensics and Security, 16:1579-1593, 2021 |
[DOI] |
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Bayesian Approach to Recurrence in Neural Networks, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(8):2527-2537, 2021 |
[DOI] |
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:1303-1317, 2021 |
[DOI] [URL] |
Informatica
Extreme Learning Machines with feature selection using GA for effective prediction of fetal heart disease: A Novel Approach, , , and , in: Informatica, 45(3), 2021 |
[DOI] [URL] |
International Journal of Robotics Research (IJRR)
Sequential Robot Imitation Learning from Observations, , , , and , in: International Journal of Robotics Research (IJRR), 2021 |
JMIR Mhealth Uhealth
Contactless Sleep Monitoring for Early Detection of Health Deteriorations in Community-Dwelling Older Adults: Exploratory Study, , , , , , , , , and , in: JMIR Mhealth Uhealth, 9(6), 2021 |
|
Journal of Research in Computing Science
Analysis of Vector Representations in Maintenance Logs in the Industry: Towards an Information Retrieval System, , , and , in: Journal of Research in Computing Science, 2021 |
Topic analysis and tracking from Mexico's President daily press briefing, , and , in: Journal of Research in Computing Science, 2021 |
|
Methods in Psychology, Special Issue on Innovations in Qualitative Research
Professional YouTubers’ health videos as research material: Formulating a multi-method design in health psychology, , , , and , in: Methods in Psychology, Special Issue on Innovations in Qualitative Research, 5, 2021 |
|
Neural Networks
Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings, , , , and , in: Neural Networks, 141:211-224, 2021 |
[DOI] |
Neuropathology and Applied Neurobiology
Image-based deep learning reveals the responses of human motor neurons to stress and VCP-related ALS, , , and , in: Neuropathology and Applied Neurobiology, 2021 |
Physical Review Research
Similarity-Based Equational Inference in Physics, and , in: Physical Review Research, 2021 |
PLoS Computational Biology
Signal-to-signal neural networks for improved spike estimation from calcium imaging data, , , and , in: PLoS Computational Biology, 17(3):1-19, 2021 |
[DOI] |
PLOS ONE
Ten seconds of my nights: exploring methods to measure brightness, loudness and attendance and their associations with alcohol use from video clips, , , , , and , in: PLOS ONE, 2021 |
[DOI] |
Proceedings of the ACM on Human-Computer Interaction
Declarative Variables in Online Dating: A Mixed-Method Analysis of a Mimetic-Distinctive Mechanism, , and , in: Proceedings of the ACM on Human-Computer Interaction, 5(CSCW1), 2021 |
|
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT)
Examining the Social Context of Alcohol Drinking in Young Adults with Smartphone Sensing, , , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 5(3):26, 2021 |
[DOI] |
One More Bite? Inferring Food Consumption Level of College Students Using Smartphone Sensing and Self-Reports, , , , , , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 5(1), 2021 |
|
Robotics and Autonomous Systems
Tensor-variate mixture of experts for proportional myographic control of a robotic hand, , and , in: Robotics and Autonomous Systems, 142:103812, 2021 |
|
Sustainability
Application of Urban Scale Energy Modelling and Multi-Objective Optimization Techniques for Building Energy Renovation at District Scale, , , and , in: Sustainability, 13(20), 2021 |
[DOI] [URL] |
Evaluation of Urban Scale Building Energy-Use Models and Tools – Application for the City of Fribourg, Switzerland, , , and , in: Sustainability, 13(7), 2021 |
[DOI] [URL] |
TACL
ParsiNLU: A Suite of Language Understanding Challenges for Persian, , , , , , , , , , , , , , , , , , , , , and , in: TACL, 2021 |
|
Transactions of the Association for Computational Linguistics (2021)
Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement, and , in: Transactions of the Association for Computational Linguistics (2021), 9:18, 2021 |
[DOI] [URL] |
Addiction Research and Theory
Capturing drinking and nightlife behaviours and their social and physical context with a smartphone application - investigation of users' experience and reactivity, , , , , , , and , in: Addiction Research and Theory, 28(1):62-75, 2020 |
[DOI] [URL] |
Addictive Behaviors
Do different drinks make you feel different emotions? Examination of young adolescents' beverage-specific alcohol expectancies using the Alcohol Expectancy Task, , , and , in: Addictive Behaviors, 2020 |
[DOI] [URL] |
Fun/intoxication pre-drinking motives lead indirectly to more alcohol-related consequences via increased alcohol consumption on a given night, , , and , in: Addictive Behaviors, 2020 |
[DOI] [URL] |
Advanced Robotics
Assisted teleoperation in changing environments with a mixture of virtual guides, , and , in: Advanced Robotics, 34(18):1157-1170, 2020 |
[DOI] [URL] |
Area
The emotional entanglements of smartphones in the field: On emotional discomfort, power relations, and research ethics, , , , , and , in: Area, 52(1), 2020 |
[DOI] [URL] |
arXiv
Fairness in Biometrics: a figure of merit to assess biometric verification systems, and , in: arXiv, 2020 |
|
Brain
Paraspeckle components NONO and PSPC1 are not mislocalized from motor neuron nuclei in sporadic ALS, , , , , and , in: Brain, 2020 |
[URL] |
Children's Geographies
Youth nightlife at home: towards a feminist conceptualisation of home, , , , , , and , in: Children's Geographies, 2020 |
[DOI] [URL] |
Computers in Biology and Medicine
Competitive Neural Layer-based Method to Identify People with High Risk for Diabetic Foot, , , , , and , in: Computers in Biology and Medicine, 120, 2020 |
[DOI] [URL] |
Cornell University Pre-print Server
The Little W-Net That Could: State-of-the-Art Retinal Vessel Segmentation with Minimalistic Models, , , , , and , in: Cornell University Pre-print Server, 2020 |
[URL] |
Drug and Alcohol Review
Shooting shots: Estimating alcoholic drink sizes in real life using event-level reports and annotations of close-up pictures, , , and , in: Drug and Alcohol Review, 2020 |
[DOI] [URL] |
Frontiers in Public Health
Evaluation of 1-Year in-Home Monitoring Technology by Home-Dwelling Older Adults, Family Caregivers, and Nurses, , , , , , , and , in: Frontiers in Public Health, 8:9, 2020 |
[DOI] [URL] |
IEEE Robotics and Automation Letters
Memory of Motion for Warm-starting Trajectory Optimization, , , and , in: IEEE Robotics and Automation Letters, 5(2):2594-2601, 2020 |
[DOI] |
IEEE Robotics and Automation Magazine (RAM)
Gaussians on Riemannian Manifolds for Robot Learning and Adaptive Control, , in: IEEE Robotics and Automation Magazine (RAM), 2020 |
|
IEEE Signal Processing Letters
A t-distribution based operator for enhancing out of distribution robustness of neural network classifiers, and , in: IEEE Signal Processing Letters, 27:1070-1074, 2020 |
[DOI] |
Subspace-based Learning for Automatic Dysarthric Speech Detection, , and , in: IEEE Signal Processing Letters, 2020 |
IEEE Transactions on Robotics
A Survey on Policy Search Algorithms for Learning Robot Controllers in a Handful of Trials, , , , and , in: IEEE Transactions on Robotics, 32(2):328-347, 2020 |
[DOI] [URL] |
IEEE Transactions on Biometrics, Behavior, and Identity Science
Deep Models and Shortwave Infrared Information to Detect Face Presentation Attacks, , , , and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2020 |
|
IEEE Transactions on Image Processing
Spatially-Variant CNN-Based Point Spread Function Estimation for Blind Deconvolution and Depth Estimation in Optical Microscopy, and , in: IEEE Transactions on Image Processing, 29:5848-5861, 2020 |
[DOI] |
IEEE Transactions on Information Forensics and Security
Learning One Class Representations for Face Presentation Attack Detection using Multi-channel Convolutional Neural Networks, and , in: IEEE Transactions on Information Forensics and Security, 2020 |
|
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, , , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020 |
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Automatic pathological speech intelligibility assessment exploiting subspace-based analyses, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28:1717-1728, 2020 |
[DOI] |
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Neural Network based End-to-End Query by Example Spoken Term Detection, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020 |
|
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Spectro-temporal sparsity characterization for dysarthric speech detection, and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28:1210-1222, 2020 |
|
Computación y Sistemas (CyS)
Author Profiling in Social Media with Multimodal Information, , , and , in: Computación y Sistemas (CyS), 24(3), 2020 |
[URL] |
Informatica
Predicting the Causal Effect Relationship Between COPD and Cardio Vascular Diseases, , , and , in: Informatica, 44(4), 2020 |
[DOI] [URL] |
Journal of Biomedical Optics
Aliasing mitigation in optical microscopy of dynamic biological samples by use of temporally modulated color illumination and a standard RGB camera, and , in: Journal of Biomedical Optics, 25(10):106505, 2020 |
[DOI] [URL] |
Journal of Integrative Neuroscience
Epileptic seizure detection: a comparative study between deep and traditional machine learning techniques, , , , and , in: Journal of Integrative Neuroscience, 19(1):1-9, 2020 |
[URL] |
Machine Vision and Applications
WatchNet++: Efficient and accurate depth-based network for detecting people attacks and intrusion, , , and , in: Machine Vision and Applications, 2020 |
|
Mathematical Geosciences
Adaptive Ensemble-based Optimisation for Petrophysical Inversion, and , in: Mathematical Geosciences, 2020 |
[DOI] [URL] |
OSA Continuum
Temporal resolution doubling in fluorescence light-sheet microscopy via a hue-encoded shutter and regularization, , , and , in: OSA Continuum, 3(8), 2020 |
|
Pattern Recognition Letters
Multi-scale sequential network for semantic text segmentation and localization, , and , in: Pattern Recognition Letters, 129:63-69, 2020 |
[DOI] [URL] |
Photoniques
Free annotated data for deep learning in microscopy? A hitchhiker's guide, and , in: Photoniques(104):30-33, 2020 |
[DOI] [URL] |
Scientific Reports
Mammary epithelial morphogenesis in 3D combinatorial microenvironments, , , and , in: Scientific Reports, 10(1), 2020 |
[URL] |
Speech Communication
On quantifying the quality of acoustic models in hybrid DNN-HMM ASR, , and , in: Speech Communication, 119:24-35, 2020 |
[DOI] |
The Prague Bulletin of Mathematical Linguistics
Inferring Highly-dense Representations for Clustering Broadcast Media Content, , , and , in: The Prague Bulletin of Mathematical Linguistics, 2020 |
[URL] |
Transactions of the Association for Computational Linguistics
Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement, and , in: Transactions of the Association for Computational Linguistics, 2020 |
[URL] |
Transactions of the Association for Computational Linguistics (under submission)
Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement, and , in: Transactions of the Association for Computational Linguistics (under submission), 2020 |
Urban Climate
Parametric study of urban morphology on building solar energy potential in Singapore context, , , , and , in: Urban Climate, 33(100624), 2020 |
[DOI] [URL] |
Word Structure
Compound or phrase or in between? Testing Linguistic Criteria for Compoundhood in English, and , in: Word Structure, 13(2):250-281, 2020 |
ACM Transactions on Multimedia Computing, Communications, and Applications
Modeling Dyadic and Group Impressions with Inter-Modal and Inter-Person Features, , , and , in: ACM Transactions on Multimedia Computing, Communications, and Applications, 15(1), 2019 |
|
ACM Transactions on Social Computing
Mi Casa es su Casa? Examining Airbnb Hospitality Exchange Practices in a Developing Economy, , , , , , and , in: ACM Transactions on Social Computing, 2(1), 2019 |
|
Alcohol and Alcoholism
The Role of Sex and Age on Pre-drinking: An Exploratory International Comparison of 27 Countries, , , , and , in: Alcohol and Alcoholism, 54(4):378–385, 2019 |
[DOI] |
Applied Energy
A solar-based sustainable urban design: The effects of city-scale street-canyon geometry on solar access in Geneva, Switzerland, , , , , and , in: Applied Energy, 240:173-190, 2019 |
[DOI] |
Automated Eye-sight Venetian blinds based on an embedded photometric device with real-time daylighting computing, , and , in: Applied Energy, 252, 2019 |
[DOI] [URL] |
Autonomous Robots
Learning from demonstration for semi-autonomous teleoperation, and , in: Autonomous Robots, 43(3):713-726, 2019 |
[DOI] [URL] |
Bernoulli
A supermartingale approach to Gaussian process based sequential design of experiments, , and , in: Bernoulli, 25(4A):2883-2919, 2019 |
Building and Environment
Split-pane electrochromic window control based on an embedded photometric device with real-time daylighting computing, , , , and , in: Building and Environment, 2019 |
[DOI] |
Electronic Communications in Probability
Conditions for the finiteness of the moments of the volume of level sets, , , and , in: Electronic Communications in Probability, 24(17), 2019 |
[DOI] [URL] |
Energy Procedia
Daylighting simulation for external Venetian blinds based on HDR sky luminance monitoring with matrix algebraic approach, , and , in: Energy Procedia, 158:2677-2682, 2019 |
[DOI] |
Frontiers in Robotics and AI
Learning Trajectory Distributions for Assisted Teleoperation and Path Planning, , , , , , and , in: Frontiers in Robotics and AI, 6:89, 2019 |
[DOI] [URL] |
Hydrology and Earth System Sciences
Contaminant source localization via Bayesian global optimization, , , and , in: Hydrology and Earth System Sciences, 23:351-369, 2019 |
[DOI] [URL] |
IEEE Robotics and Automation Letters
Bayesian Gaussian mixture model for robotic policy imitation, and , in: IEEE Robotics and Automation Letters, 4(4):4452-4458, 2019 |
[DOI] [URL] |
IEEE Transactions on Automation Science and Engineering
SCALAR: Simultaneous Calibration of 2-D Laser and Robot Kinematic Parameters Using Planarity and Distance Constraints, , and , in: IEEE Transactions on Automation Science and Engineering, 16(4):1971-1979, 2019 |
[DOI] |
IEEE Transactions on Biometrics, Behavior, and Identity Science
A Comprehensive Experimental and Reproducible Study on Selfie Biometrics in Multistream and Heterogeneous Settings, , and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2019 |
[DOI] [URL] |
IEEE Transactions on Biometrics, Behavior, and Identity Science
Multispectral Deep Embeddings As a Countermeasure To Custom Silicone Mask Presentation Attacks, , and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2019 |
|
IEEE Transactions on Information Forensics and Security
Biometric Face Presentation Attack Detection with Multi-Channel Convolutional Neural Network, , , , , and , in: IEEE Transactions on Information Forensics and Security, 2019 |
|
IEEE Transactions on Information Forensics and Security
Heterogeneous Face Recognition Using Domain Specific Units, , and , in: IEEE Transactions on Information Forensics and Security:13, 2019 |
[DOI] |
IEEE Transactions on Robotics
Learning Task Priorities from Demonstrations, , , and , in: IEEE Transactions on Robotics, 35(1):78-94, 2019 |
[DOI] [URL] |
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Joint acoustic localization and dereverberation through plane wave decomposition and sparse regularization, , , , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(12):1893-1905, 2019 |
[DOI] [URL] |
Information
Subunits Inference and Lexicon Development Based on Pairwise Comparison of Utterances and Signs, and , in: Information, 10:298, 2019 |
[DOI] [URL] |
International Journal of Robotics Research (IJRR)
Small Variance Asymptotics for Non-Parametric Online Robot Learning, and , in: International Journal of Robotics Research (IJRR), 38(1):3-22, 2019 |
|
Journal of Global Optimization
On the choice of the low-dimensional domain for global optimization via random embeddings, , and , in: Journal of Global Optimization, 2019 |
[DOI] [URL] |
Multimedia Tools and Applications
Improving speech embedding using crossmodal transfer learning with audio-visual data, and , in: Multimedia Tools and Applications, 78(11):15681-15704, 2019 |
[DOI] |
Optical Society of America Biomedical Optics Express
Temporal Super-Resolution Microscopy Using a Hue-Encoded Shutter, , , and , in: Optical Society of America Biomedical Optics Express, 10(09):4727-4741, 2019 |
[DOI] [URL] |
PLOS ONE
The contexts of heavy drinking: A systematic review of the combinations of context-related factors associated with heavy drinking occasions, , , , and , in: PLOS ONE, 14(7):29, 2019 |
[DOI] [URL] |
Scientific Reports
Validity of pervasive computing based continuous physical activity assessment in community-dwelling old and oldest-old, , , , , , , , , , , and , in: Scientific Reports, 9(9662), 2019 |
|
Solar Energy
Performance assessment of the BTDF data compression based on wavelet transforms in daylighting simulation, , and , in: Solar Energy, 2019 |
[DOI] |
Speech Communication
End-to-End Acoustic Modeling using Convolutional Neural Networks for HMM-based Automatic Speech Recognition, , and , in: Speech Communication, 108:15-32, 2019 |
[DOI] |
Technometrics
Adaptive Design of Experiments for Conservative Estimation of Excursion Sets, , , , and , in: Technometrics, 2019 |
[DOI] [URL] |
Profile extrema for visualizing and quantifying uncertainties on excursion regions. Application to coastal flooding, , , and , in: Technometrics, 61(4):474-493, 2019 |
[DOI] [URL] |
Transactions of the Association for Computational Linguistics (TACL)
GILE: A Generalized Input-Label Embedding for Text Classification, and , in: Transactions of the Association for Computational Linguistics (TACL), 2019 |
|
ACM Journal on Computing and Cultural Heritage (JOCCH)
How to Tell Ancient Signs Apart? Recognizing and Visualizing Maya Glyphs with CNNs, , and , in: ACM Journal on Computing and Cultural Heritage (JOCCH), 11(4):20, 2018 |
[DOI] |
ACM Transactions on Social Computing
Looking South: Learning Urban Perception in Developing Cities, , and , in: ACM Transactions on Social Computing, 2018 |
|
Advances in Space Research
Geometric calibration of Colour and Stereo Surface Imaging System of ESA's Trace Gas Orbiter, , , , , , , and , in: Advances in Space Research, 2018 |
Atmospheric Research
A Poisson regression approach to model monthly hail occurrence in Northern Switzerland using large-scale environmental variables, , and , in: Atmospheric Research, 203:261-274, 2018 |
[DOI] |
Autonomous Robots
Special issue on robot learning for human-robot collaboration, , , , and , in: Autonomous Robots, 42(5):953-956, 2018 |
[DOI] [URL] |
EURASIP Journal on Advances in Signal Processing
Improving the conditioning of the optimization criterion in acoustic multi-channel equalization using shorter reshaping filters, and , in: EURASIP Journal on Advances in Signal Processing(11), 2018 |
|
IEEE Robotics and Automation Letters (RA-L)
A Brief Survey on the Role of Dimensionality Reduction in Manipulation Learning and Control, , and , in: IEEE Robotics and Automation Letters (RA-L), 3(3):2608-2615, 2018 |
[DOI] [URL] |
Programming by Demonstration for Shared Control with an Application in Teleoperation, , and , in: IEEE Robotics and Automation Letters (RA-L), 3(3):1848-1855, 2018 |
[DOI] [URL] |
IEEE Robotics and Automation Magazine
Dexterous Underwater Manipulation from Distant Onshore Locations, , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE Robotics and Automation Magazine, 2018 |
|
IEEE Robotics and Automation Magazine (RAM)
Flexible Automation Driven by Demonstration: Leveraging Strategies that Simplify Robotics, , , , , , and , in: IEEE Robotics and Automation Magazine (RAM), 25(2):18-27, 2018 |
[DOI] [URL] |
IEEE Signal Processing Magazine
Cognitive Speech Coding: Examining the Impact of Cognitive Speech Processing on Speech Compression, , and , in: IEEE Signal Processing Magazine, 35(3):97-109, 2018 |
[DOI] |
IEEE Transaction on Acoustics, Speech and Language Processing
Analysis of eigenvalue decomposition-based late reverberation power spectral density estimation, and , in: IEEE Transaction on Acoustics, Speech and Language Processing, 26(6):1106-1118, 2018 |
|
IEEE Transactions on Medical Imaging
Learning-Based Compressive MRI, , , , , , and , in: IEEE Transactions on Medical Imaging, 2018 |
|
IEEE Transactions on Mobile Computing
DrinkSense: Characterizing Youth Drinking Behavior using Smartphones, , , , , and , in: IEEE Transactions on Mobile Computing, 2018 |
|
IEEE Transactions on Multimedia
Check Out This Place: Inferring Ambiance from Airbnb Photos, , , and , in: IEEE Transactions on Multimedia, 20(6):1499-1511, 2018 |
[DOI] [URL] |
Maya Codical Glyph Segmentation: A Crowdsourcing Approach, , and , in: IEEE Transactions on Multimedia, 20(3):711-725, 2018 |
[DOI] [URL] |
IEEE Transactions on Signal Processing
A Non-Euclidean Gradient Descent Framework for Non-Convex Matrix Factorization, , , , , and , in: IEEE Transactions on Signal Processing, 2018 |
|
Journal of Computational and Graphical Statistics
Estimating orthant probabilities of high dimensional Gaussian vectors with an application to set estimation, and , in: Journal of Computational and Graphical Statistics, 27(2):255-267, 2018 |
[DOI] [URL] |
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
A Tale of Two Interactions: Inferring Performance in Hospitality Encounters from Cross-Situation Social Sensing, , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2(129), 2018 |
|
SIAM/ASA Journal on Uncertainty Quantification
Warped Gaussian processes and derivative-based sequential design for functions with heterogeneous variations, , , and , in: SIAM/ASA Journal on Uncertainty Quantification, 6(3):991-1018, 2018 |
Speech Communication
Cross-lingual Adaptation of a CTC-based multilingual Acoustic Model, , and , in: Speech Communication, 104:39-46, 2018 |
[DOI] |
Phonetic Subspace Features for Improved Query by Example Spoken Term Detection, , and , in: Speech Communication, 103:27-36, 2018 |
[DOI] |
Towards Weakly Supervised Acoustic Subword Unit Discovery and Lexicon Development Using Hidden Markov Models, , and , in: Speech Communication, 96:168-183, 2018 |
[DOI] |
Sustainable Cities and Society
Fusing TensorFlow with building energy simulation for intelligent energy management in smart cities, , , and , in: Sustainable Cities and Society, 2018 |
[DOI] |
Transactions of the Association for Computational Linguistics (TACL)
Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation, , , and , in: Transactions of the Association for Computational Linguistics (TACL), 2018 |
|
Advances in Water Resources
On uncertainty quantification in hydrogeology and hydrogeophysics, , , , and , in: Advances in Water Resources, 110:166–181, 2017 |
[DOI] [URL] |
arXiv
A reproducible study on remote heart rate measurement, , and , in: arXiv, 2017 |
[URL] |
Autonomous Robots
Learning Autonomous Behaviours for the Body of a Flexible Surgical Robot, , and , in: Autonomous Robots, 41(2):333-347, 2017 |
[DOI] [URL] |
Biomedical Optics Express
Direct inversion algorithm for focal plane scanning optical projection tomography, and , in: Biomedical Optics Express, 2017 |
|
Computer Speech and Language
Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, , , , , and , in: Computer Speech and Language, 2017 |
|
Digital Signal Processing
NeuroSpeech: An open-source software for Parkinson's speech analysis, , , , , , , , , , , , , , and , in: Digital Signal Processing, 2017 |
[DOI] |
IEEE Journal of Selected Topics in Signal Processing
Impact of score fusion on voice biometrics and presentation attack detection in cross-database evaluations, and , in: IEEE Journal of Selected Topics in Signal Processing, 11(4):695-705, 2017 |
[DOI] |
IEEE MultiMedia
Biometrics: In Search of Identity and Security (Q & A), , , , , and , in: IEEE MultiMedia, PP, 2017 |
[DOI] |
IEEE Pervasive Computing, Special Issue on Smart Cities
SenseCityVity: Mobile Crowdsourcing, Urban Awareness, and Collective Action in Mexico, , , , , , , , , and , in: IEEE Pervasive Computing, Special Issue on Smart Cities, 16(2):44-53, 2017 |
|
IEEE Robotics and Automation Letters (RA-L)
An Approach for Imitation Learning on Riemannian Manifolds, , , , and , in: IEEE Robotics and Automation Letters (RA-L), 2(3):1240-1247, 2017 |
[DOI] [URL] |
IEEE Signal Processing Letters
A Posterior-Based Multi-Stream Formulation for G2P Conversion, and , in: IEEE Signal Processing Letters, 2017 |
|
IEEE Transactions on Affective Computing
Rapport with Virtual Agents: What do Human Social Cues and Personality Explain?, , and , in: IEEE Transactions on Affective Computing, 8(3):382-395, 2017 |
[DOI] |
IEEE/ACM Transactions on Audio, Speech and Language Processing
Long-Term Spectral Statistics for Voice Presentation Attack Detection, , , and , in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 25(11):2098-2111, 2017 |
|
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Perceptual Information Loss due to Impaired Speech Production, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017 |
|
IET (The Institution of Engineering and Technology) – Biometrics
Deeply Vulnerable – a study of the robustness of face recognition to presentation attacks, , and , in: IET (The Institution of Engineering and Technology) – Biometrics:1-13, 2017 |
[DOI] |
International Journal of Multimedia Information Retrieval
Multilingual Visual Sentiment Concept Clustering and Analysis, , , , , , and , in: International Journal of Multimedia Information Retrieval, 2017 |
|
International Journal of Social Research Methodology
Development of the Geographical Proportional-to-size Street-Intercept Sampling (GPSIS) method for recruiting urban nightlife-goers in an entire city, , , , , , , and , in: International Journal of Social Research Methodology, 20(6):721-736, 2017 |
[DOI] |
Journal of Artificial Intelligence Research (JAIR)
Explicit Document Modeling through Weighted Multiple-Instance Learning, and , in: Journal of Artificial Intelligence Research (JAIR), 58:591-626, 2017 |
|
Pattern Analysis and Applications
Machine learning-based tools to model and to remove the off-target effect, , , and , in: Pattern Analysis and Applications, 20(1):87-100, 2017 |
[DOI] |
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (PACM IMWUT)
Bites'n'Bits: Inferring Eating Behavior from Contextual Mobile Data, , , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (PACM IMWUT), 1(4):125-157, 2017 |
|
Robotics and Autonomous Systems
Learning adaptive dressing assistance from human demonstration, and , in: Robotics and Autonomous Systems, 93:61-75, 2017 |
[DOI] [URL] |
Small Group Research
Theories and Models of Teams and Group, , , , and , in: Small Group Research, 48(5):544-567, 2017 |
[DOI] |
Speech Communication
Template-matching for Text-dependent Speaker Verification, , , and , in: Speech Communication, 2017 |
|
Technologies
Combining Electromyography and Tactile Myography to Improve Hand and Wrist Activity Detection in Prostheses, , , and , in: Technologies, 5(4), 2017 |
|
ACM Journal on Computing and Cultural Heritage (JOCCH)
Evaluating Shape Representations for Maya Glyph Classification, , and , in: ACM Journal on Computing and Cultural Heritage (JOCCH), 9(3), 2016 |
Autonomous Robots
High-slope terrain locomotion for torque-controlled quadruped robots, , , , , and , in: Autonomous Robots, 2016 |
[DOI] [URL] |
Computer Speech and Language
Articulatory feature based continuous speech recognition using probabilistic lexical modeling, and , in: Computer Speech and Language, 36:233-259, 2016 |
[DOI] |
Speech vocoding for laboratory phonology, , and , in: Computer Speech and Language, 2016 |
|
Computer Vision and Image Understanding
Scalable greedy algorithms for transfer learning, , and , in: Computer Vision and Image Understanding, 2016 |
Data & Knowledge Engineering Journal
Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, , and , in: Data & Knowledge Engineering Journal, 2016 |
Expert Systems with Applications
Adaptive Sentiment-Aware One-Class Collaborative Filtering, and , in: Expert Systems with Applications, 43:23-41, 2016 |
[DOI] [URL] |
Frontiers in ICT: Computer Image Analysis
CRF-Based Context Modeling for Person Identification in Broadcast Videos, , , and , in: Frontiers in ICT: Computer Image Analysis, 3, 2016 |
|
Frontiers in Robotics and AI
Learning Controllers for Reactive and Proactive Behaviors in Human-Robot Collaboration, , , and , in: Frontiers in Robotics and AI, 3(30):1-11, 2016 |
[DOI] |
IEEE Robotics and Automation Letters
Learning Robot Manipulation Tasks with Task-Parameterized Semi-Tied Hidden Semi-Markov Model, and , in: IEEE Robotics and Automation Letters, 1(1):235-242, 2016 |
[DOI] [URL] |
IEEE Signal Processing Letters
A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition, , , , and , in: IEEE Signal Processing Letters, 23(4):527-531, 2016 |
|
IEEE Trans. on Robotics
Learning Physical Collaborative Robot Behaviors from Human Demonstrations, , , , and , in: IEEE Trans. on Robotics, 32(3):513-527, 2016 |
[DOI] [URL] |
IEEE Transactions on Computational Imaging
Simultaneous temporal superresolution and denoising for cardiac fluorescence microscopy, , , and , in: IEEE Transactions on Computational Imaging, 2016 |
[DOI] [URL] |
IEEE Transactions on Multimedia
Predicting the Performance in Decision-Making Tasks: From Individual Cues to Group Interaction, and , in: IEEE Transactions on Multimedia, 18(4):643-658, 2016 |
[DOI] [URL] |
IEEE Transactions on Pattern Analysis and Machine Intelligence
Deep Dynamic Neural Networks for Multimodal Gesture Segmentation and Recognition, , , , , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016 |
|
Tracking Interacting Objects Using Intertwined Flows, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016 |
IEEE Transactions on Signal Processing
Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization, , , , , and , in: IEEE Transactions on Signal Processing, 64(3):567-579, 2016 |
[DOI] |
TDOA Matrices: Algebraic Properties and their Application to Robust Denoising with Missing Data, , , and , in: IEEE Transactions on Signal Processing, 64(20):5242-5254, 2016 |
[DOI] [URL] |
IEEE/ACM Trans. on Audio, Speech and Language Processing
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, , , and , in: IEEE/ACM Trans. on Audio, Speech and Language Processing, 2016 |
|
Intelligent Service Robotics
A Tutorial on Task-Parameterized Movement Learning and Retrieval, , in: Intelligent Service Robotics, 9(1):1-29, 2016 |
[DOI] [URL] |
International Journal of Computer Vision
Gaze Estimation in the 3D Space Using RGB-D sensors. Towards Head-Pose And User Invariance, and , in: International Journal of Computer Vision, 118(2):194-216, 2016 |
[DOI] [URL] |
Journal of Cell Biology
Computer vision profiling of neurite outgrowth dynamics reveals spatio-temporal modularity of Rho GTPase signaling, , , , , , , , , and , in: Journal of Cell Biology, 212(1):91-111, 2016 |
[DOI] |
Journal of Machine Learning Research
Jointly Informative Feature Selection, and , in: Journal of Machine Learning Research, 2016 |
Journal of Statistical Planning and Inference
On degeneracy and invariances of random fields paths with applications in Gaussian process modelling, , and , in: Journal of Statistical Planning and Inference, 170:117-128, 2016 |
[DOI] |
Machine Learning
Fast Rates by Transferring from Auxiliary Hypotheses, and , in: Machine Learning, 2016 |
Multimedia Tools and Applications
Adaptive relevance feedback for large-scale image retrieval, and , in: Multimedia Tools and Applications, 75(12):6777-6807, 2016 |
[DOI] |
SIAM/ASA J. Uncertainty Quantification
Quantifying uncertainties on excursion sets under a Gaussian random field prior, , , and , in: SIAM/ASA J. Uncertainty Quantification, 4(1):850-874, 2016 |
[DOI] [URL] |
Speech Communication
Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework, , and , in: Speech Communication, 80, 2016 |
[DOI] |
Computational Methods for Underdetermined Convolutive Speech Localization and Separation via Model-based Sparse Component Analysis, , , and , in: Speech Communication, 76:201-217, 2016 |
|
Feature mapping using far-field microphones for distant speech recognition, , , and , in: Speech Communication, 83:1-9, 2016 |
[DOI] [URL] |
On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, , and , in: Speech Communication, 84:36-45, 2016 |
[DOI] [URL] |
Predicting the intrusiveness of noise through sparse coding with auditory kernels, and , in: Speech Communication, 76:186-200, 2016 |
[DOI] [URL] |
Speech Communication: Special Issue on Advances in Sparse Modeling and Low-rank Modeling for Speech Processing
Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-based Speech Recognition, , and , in: Speech Communication: Special Issue on Advances in Sparse Modeling and Low-rank Modeling for Speech Processing, 76:230–244, 2016 |
[DOI] |
ACM Transactions on Interactive Intelligent Systems
Brief Introduction to the Special Issue on Behavior Understanding for Arts and Entertainment, , , , and , in: ACM Transactions on Interactive Intelligent Systems, 5(2):6, 2015 |
[DOI] |
In the Mood for Vlog: Multimodal Inference in Conversational Social Video, , , and , in: ACM Transactions on Interactive Intelligent Systems, 5(2), 2015 |
[DOI] |
Biomedical Optics Express
Dynamic structure and protein expression of the live embryonic heart captured by 2-photon light sheet microscopy and retrospective registration, , , , , and , in: Biomedical Optics Express, 6(6):2056-2066, 2015 |
[DOI] [URL] |
Computer Speech and Language
A Survey on Perceived Speaker Traits: Personality, Likability, Pathology and the First Challenge, , , , , , , , , , and , in: Computer Speech and Language, 19(1):100-131, 2015 |
[DOI] |
EURASIP Journal on Audio, Speech, and Music Processing
Exploiting foreign resources for DNN-based ASR, , , , and , in: EURASIP Journal on Audio, Speech, and Music Processing(2015:17), 2015 |
[DOI] |
IEEE Journal of Selected Topics in Signal Processing
Spatial Sound Localization via Multipath Euclidean Distance Matrix Recovery, , , , and , in: IEEE Journal of Selected Topics in Signal Processing, 9(5):802-814, 2015 |
|
IEEE Multimedia
Klewel Webcast: from Research to Growing Company, , , and , in: IEEE Multimedia, 22(4):94-99, 2015 |
|
IEEE Signal Processing Magazine
Signal Processing in the Workplace, , in: IEEE Signal Processing Magazine, 32(1):121-125, 2015 |
|
IEEE Transactions on Information Forensics and Security
Joint Speaker Verification and Anti-Spoofing in the i-Vector Space, , , , and , in: IEEE Transactions on Information Forensics and Security, 10(4):821-832, 2015 |
[DOI] |
IEEE Transactions on Information Forensics and Security, Special Issue on Biometric Anti-spoofing
On the use of client identity information for face anti-spoofing, and , in: IEEE Transactions on Information Forensics and Security, Special Issue on Biometric Anti-spoofing, 10(4):787-796, 2015 |
|
IEEE/ACM Transactions on Audio Speech and Language Processing
Keyword Extraction and Clustering for Document Recommendation in Conversations, and , in: IEEE/ACM Transactions on Audio Speech and Language Processing, 23(4):746-759, 2015 |
[DOI] |
IEEE/ACM Transactions on Audio, Speech and Language Processing
Disambiguating Discourse Connectives for Statistical Machine Translation, , and , in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(7):1184-1197, 2015 |
[DOI] |
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Incremental Syllable-Context Phonetic Vocoding, , , , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(6), 2015 |
[URL] |
IET Biometrics
Impact of Eye Detection Error on Face Recognition Performance, , , , , and , in: IET Biometrics, 2015 |
[URL] |
International Journal of Psychology
Emergent Power Hierarchies and Group Performance, , , and , in: International Journal of Psychology, 50(5):392–396, 2015 |
[DOI] [URL] |
Journal of Wavelets, Multiresolution and Information Processing
Reconstruction of Images from Gabor Graphs with Applications in Facial Image Processing, , , and , in: Journal of Wavelets, Multiresolution and Information Processing, 13(4):25, 2015 |
[DOI] |
Multimedia Tools and Applications, Special Issue on Content Based Multimedia Indexing
Combining Content with User Preferences for Non-Fiction Multimedia Recommendation: A Study on TED Lectures, and , in: Multimedia Tools and Applications, Special Issue on Content Based Multimedia Indexing, 74(4):1175-1197, 2015 |
[DOI] |
Multimedia, IEEE Transactions
Automatic Recognition of Emergent Social Roles in Small Group Interactions, and , in: Multimedia, IEEE Transactions, 17(5):746-760, 2015 |
[DOI] |
Neurocomputing
Modeling Annotator Behaviors for Crowd Labeling, , , and , in: Neurocomputing, 160:141–156, 2015 |
[DOI] |
Pattern Recognition Letters
Combining dynamic head pose-gaze mapping with the robot conversational state for attention recognition in human-robot interactions, and , in: Pattern Recognition Letters, 66:81-90, 2015 |
|
Signal Processing
Ad Hoc Microphone Array Calibration: Euclidean Distance Matrix Completion Algorithm and Theoretical Guarantees, , , , and , in: Signal Processing, 107:123–140, 2015 |
[DOI] |
Speech Communication
Acoustic and Lexical Resource Constrained ASR using Language-Independent Acoustic Model and Language-Dependent Probabilistic Lexical Model, and , in: Speech Communication, 68:23–40, 2015 |
[DOI] [URL] |
Audio, Speech and Language processing, IEEE/ACM Transaction on
Overlapping speech detection using long-term conversational features for speaker diarization in meeting room conversations, and , in: Audio, Speech and Language processing, IEEE/ACM Transaction on, 22(12):1688-1700, 2014 |
|
Frontiers in Neurorobotics
Stable Myoelectric Control of a Hand Prosthesis using Non-Linear Incremental Learning, , , , , , , and , in: Frontiers in Neurorobotics, 8, 2014 |
[DOI] |
IEEE Journal of Selected Topics in Signal Processing - Special Issue on Statistical Parametric Speech Synthesis
Combining Vocal Tract Length Normalization with Hierarchical Linear Transformations, , , and , in: IEEE Journal of Selected Topics in Signal Processing - Special Issue on Statistical Parametric Speech Synthesis, 8(2):262-272, 2014 |
[DOI] |
IEEE Pervasive Computing
ISWC 2013--Wearables are Here to Stay, , , and , in: IEEE Pervasive Computing, 13(1):14-18, 2014 |
[DOI] |
IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI)
Temporal Analysis of Motif Mixtures using Dirichlet Processes, , and , in: IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), 36(1), 2014 |
|
IEEE Transaction on Affective Computing
A Survey of Personality Computing, and , in: IEEE Transaction on Affective Computing, 5(3):273-291, 2014 |
|
IEEE Transactions Affective Computing
What Your Face Vlogs About: Expressions of Emotion and Big-Five Traits Impressions in YouTube, , , and , in: IEEE Transactions Affective Computing, 2014 |
|
IEEE Transactions on Information Forensics and Security
Biometrics Evaluation Under Spoofing Attacks, , and , in: IEEE Transactions on Information Forensics and Security, 9(12):2264-2276, 2014 |
[DOI] |
Spoofing Face Recognition with 3D Masks, and , in: IEEE Transactions on Information Forensics and Security:1084-1097, 2014 |
[DOI] |
IEEE Transactions on Multimedia
Broadcasting oneself: Visual Discovery of Vlogging Styles, , and , in: IEEE Transactions on Multimedia, 16(1):201-215, 2014 |
[DOI] |
Mining Crowdsourced First Impressions in Online Social Video, and , in: IEEE Transactions on Multimedia, 16(7), 2014 |
|
Image and Vision Computing
Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, , , , and , in: Image and Vision Computing:1147-1160, 2014 |
[DOI] [URL] |
International Journal of Speech Technology
Modified Group Delay Feature Based Total Variability Space Modelling for Speaker Recognition, , and , in: International Journal of Speech Technology, 18(1):17-23, 2014 |
[DOI] |
Journal of Machine Learning Research
Adaptive Sampling for Large Scale Boosting, and , in: Journal of Machine Learning Research, 15:1431-1453, 2014 |
|
Pattern Recognition
Leveraging Colour Segmentation for Upper-Body Detection, and , in: Pattern Recognition, 47(6):2222-2230, 2014 |
|
Pervasive and Mobile Computing
A Probabilistic Kernel Method for Human Mobility Prediction with Smartphones, , , and , in: Pervasive and Mobile Computing, 2014 |
|
Robotics and Biomimetics
Learning by Imitation with the STIFF-FLOP Surgical Robot: A Biomimetic Approach Inspired by Octopus Movements, , , and , in: Robotics and Biomimetics, 1(13):1-15, 2014 |
[URL] |
Signal Processing
Enhanced Diffuse Field Model for Ad Hoc Microphone Array Calibration, , and , in: Signal Processing, 101:242-255, 2014 |
|
The Phonetician
On Learning Grapheme-to-Phoneme Relationships through the Acoustic Speech Signal, and , in: The Phonetician, 109–110:6-23, 2014 |
|
Transactions on Image Processing
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , in: Transactions on Image Processing, 2014 |
|
Transactions on Neural Systems and Rehabilitation Engineering
Characterization of a Benchmark Database for Myoelectric Movement Classification, , , , , , , , and , in: Transactions on Neural Systems and Rehabilitation Engineering, 23:73-83, 2014 |
[DOI] |
The Movement Error Rate for Evaluation of Machine Learning Methods for sEMG-based Hand Movement Classification, , , , and , in: Transactions on Neural Systems and Rehabilitation Engineering:735-744, 2014 |
[DOI] |
15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013
Hi YouTube! Personality Impressions and Verbal Content in Social Video, , , and , in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013 |
|
Advances in Multimedia
Real-Time Audio-Visual Analysis for Multiperson Videoconferencing, , , , , , , , and , in: Advances in Multimedia, 2013:21, 2013 |
[DOI] [URL] |
Artificial Intelligence Journal
Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, and , in: Artificial Intelligence Journal, 194:176–202, 2013 |
[DOI] |
Belgian Journal of Linguistics
Using the Europarl corpus for cross-linguistic research, , and , in: Belgian Journal of Linguistics(27):23-42, 2013 |
[URL] |
Dialogue & Discourse
Annotating the meaning of discourse connectives by looking at their translation: The translation-spotting technique, , and , in: Dialogue & Discourse, 4(2):65-86, 2013 |
[DOI] |
IEEE Signal Processing Letters
A Savitzky-Golay Filtering Perspective of Dynamic Feature Computation, , and , in: IEEE Signal Processing Letters, 20(3):281-284, 2013 |
[DOI] |
A Simple Continuous Pitch Estimation Algorithm, , and , in: IEEE Signal Processing Letters, 20(1):102-105, 2013 |
[URL] |
IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications
Convexity in source separation: Models, geometry, and algorithms, , , , and , in: IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications, 2013 |
|
IEEE Trans. on Intelligent Transportation Systems
Observation of Vehicle Axles Through Pass-by Noise: A Strategy of Microphone Array Design, , , , and , in: IEEE Trans. on Intelligent Transportation Systems, 2013 |
|
IEEE Transactions on Audio, Speech, and Language Processing
Applying multi- and cross-lingual stochastic phone space transformations to non-native speech recognition, , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 2013 |
[DOI] |
IEEE Transactions on Image Processing
A Track Creation and Deletion Framework for Long-Term Online Multi-Face Tracking, and , in: IEEE Transactions on Image Processing, 2013 |
|
IEEE Transactions on Mobile Computing
The Places of Our Lives: Visiting Patterns and Automatic Labeling from Longitudinal Smartphone Data, and , in: IEEE Transactions on Mobile Computing, 2013 |
|
IEEE Transactions on Pattern Analysis and Machine Intelligence
Multi-Commodity Network Flow for Tracking Multiple People, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013 |
International Journal of Computer Vision
A Sequential Topic Model for Mining Recurrent Activities from Long Term Video Logs, , and , in: International Journal of Computer Vision, 103(1):100-126, 2013 |
|
Machine Learning
Introduction to the Special Issue on Learning Semantics, , , , , and , in: Machine Learning, 2013 |
[DOI] |
Multimedia Tools and Applications
Gesture control interface for immersive panoramic displays, , , , , , and , in: Multimedia Tools and Applications, 1380-7501:1-27, 2013 |
[DOI] |
Neural Networks
Autonomous reinforcement learning with experience replay, and , in: Neural Networks, 41:156-167, 2013 |
[DOI] [URL] |
Pattern Recognition Letters
treeKL: A distance between high dimension empirical distributions, and , in: Pattern Recognition Letters, 34(2):140-145, 2013 |
|
Pervasive and Mobile Computing
From Big Smartphone Data to Worldwide Research: The Mobile Data Challenge, , , , , , , and , in: Pervasive and Mobile Computing, 9(6):752–771, 2013 |
|
Where and What: Using Smartphones to Predict Next Locations and Applications in Daily Life, and , in: Pervasive and Mobile Computing, 2013 |
|
Speech Communication
Using out-of-language data to improve an under-resourced speech recognizer, , , and , in: Speech Communication, 2013 |
[DOI] [URL] |
Water Resources Research
Clustering flood events from water quality time-series using Latent Dirichlet Allocation model, , , , , , , , , and , in: Water Resources Research, 2013 |
[DOI] |
2012US-13/654055
A Method, Apparatus and Computer Program for Determining the Location of a Plurality of Speech Source, , and , in: 2012US-13/654055, 2012 |
[URL] |
Cognitive Processing
Conversation Analysis at Work: Detection of Conflict in Competitive Discussions through Automatic Turn-Organization Analysis, , , and , in: Cognitive Processing, 2012 |
IEEE Multimedia
Finding Information in Multimedia Records of Meetings, , and , in: IEEE Multimedia, 19(2):48-57, 2012 |
[DOI] [URL] |
IEEE Transactions on Affective Computing
Automatic Attribution of Personality Traits Based on Prosodic Features, and , in: IEEE Transactions on Affective Computing, 2012 |
|
Bridging the Gap Between Social Animal and Unsocial Machine: A Survey of Social Signal Processing, , , , , , and , in: IEEE Transactions on Affective Computing, 2012 |
IEEE Transactions on Audio, Speech and Language Processing
Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis, , and , in: IEEE Transactions on Audio, Speech and Language Processing, 2012 |
|
IEEE Transactions on Audio, Speech, and Language Processing
The ICSI RT-09 Speaker Diarization System, , , , , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371-381, 2012 |
[DOI] |
Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 2012 |
|
IEEE Transactions on Information Forensics and Security
A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers, , and , in: IEEE Transactions on Information Forensics and Security, 7(1):241-254, 2012 |
|
Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, , , and , in: IEEE Transactions on Information Forensics and Security, 7(2):553-562, 2012 |
|
IEEE Transactions on Multimedia
A Nonverbal Behavior Approach to Identify Emergent Leaders in Small Groups, , , and , in: IEEE Transactions on Multimedia, 14(3-2):816-832, 2012 |
[DOI] |
Automatic Role Recognition in Multiparty Conversations: an Approach Based on Turn Organization, Prosody and Conditional Random Fields, and , in: IEEE Transactions on Multimedia, 2012 |
IEEE Transactions on Pattern Analysis and Machine Intelligence
A real-time deformable detector, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012 |
|
IEEE Transactions on Robotics
Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, , , and , in: IEEE Transactions on Robotics, 2012 |
[DOI] |
International Journal of Computer and Electrical Engineering
The TA2 Database – A Multi-Modal Database From Home Entertainment, , and , in: International Journal of Computer and Electrical Engineering, 4(5):670-673, 2012 |
[URL] |
Journal of Machine Learning Research
Regularized Bundle Methods for Convex and Non-Convex Risks, and , in: Journal of Machine Learning Research, 13:3539-3583, 2012 |
|
Journal of Multimedia
Assessing Sparse Coding Methods for Contextual Shape Indexing of Maya Hieroglyphs, , and , in: Journal of Multimedia, 7(2):179-192, 2012 |
|
Journal on Multimodal User Interfaces
Emergent leaders through looking and speaking: from audio-visual data to multimodal recognition, , , , and , in: Journal on Multimodal User Interfaces, 2012 |
|
Multimedia Tools and Applications
Audiovisual Diarization Of People In Video Content, , and , in: Multimedia Tools and Applications, 2012 |
|
Neural Networks
Real-time model learning using Incremental Sparse Spectrum Gaussian Process Regression, and , in: Neural Networks, 2012 |
Personal and Ubiquitous Computing
Human Interaction Discovery in Smartphone Proximity Networks, and , in: Personal and Ubiquitous Computing, 2012 |
|
Mining Large-Scale Smartphone Data for Personality Studies, , and , in: Personal and Ubiquitous Computing, 2012 |
|
Speech Communication
Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, , and , in: Speech Communication, 54(1), 2012 |
[DOI] |
Phase AutoCorrelation (PAC) features for noise robust speech recognition, , , and , in: Speech Communication, 54(7):867–880, 2012 |
[DOI] |
ACM Transactions on Intelligent Systems and Technology
Discovering Routines from Large-Scale Human Locations using Probabilistic Topic Models, and , in: ACM Transactions on Intelligent Systems and Technology, 2(1), 2011 |
|
Computer Speech and Language
Automatic Identification of Discourse Markers in Multiparty Dialogues: An In-Depth Study of Like and Well, and , in: Computer Speech and Language, 25(3):499-518, 2011 |
[DOI] |
Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis, , , , , , , , , , , , and , in: Computer Speech and Language, 2011 |
[DOI] [URL] |
Computer Vision and Image Understanding
3D human pose recovery from image by efficient visual feature selection, , , and , in: Computer Vision and Image Understanding, 115(3), 2011 |
|
Fast Human Detection from Joint Appearance and Foreground Feature Subset Covariances, and , in: Computer Vision and Image Understanding, 115(10):1414-1426, 2011 |
|
EURASIP Journal on Advances in Signal Processing
Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions, and , in: EURASIP Journal on Advances in Signal Processing, 2011 |
[DOI] |
IEEE Multimedia
Using Modality Replacement to Facilitate Communication between Visually and Hearing-Impaired People, , , , and , in: IEEE Multimedia, 18(2):26-37, 2011 |
[DOI] |
IEEE Pervasive Computing, Special Issue on Large-Scale Opportunistic Sensing
Sensing the "Health State" of our Society, , , , and , in: IEEE Pervasive Computing, Special Issue on Large-Scale Opportunistic Sensing, 2011 |
|
IEEE Trans. on Pattern Analysis and Machine Intelligence
Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, and , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 33(1):101-116, 2011 |
|
IEEE Transactions on Autonomous Mental Development
Using object affordances to improve object recognition, , , , and , in: IEEE Transactions on Autonomous Mental Development, 2011 |
|
IEEE Transactions on Audio Speech and Language Processing
An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, , and , in: IEEE Transactions on Audio Speech and Language Processing, 19(2), 2011 |
[DOI] |
IEEE Transactions on Audio, Speech, and Language Processing
Estimating Dominance in Multi-Party Meetings Using Speaker Diarization, , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(4):847-860, 2011 |
|
Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(8), 2011 |
|
Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features, , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(8), 2011 |
[DOI] |
IEEE Transactions on Pattern Analysis and Machine Intelligence
Multiple Object Tracking using K-Shortest Paths Optimization, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011 |
IEEE Transactions on Visualization and Computer Graphics
Learning a 3D Human Pose Distance Metric from Geometric Pose Descriptor, , in: IEEE Transactions on Visualization and Computer Graphics, 17(11):1676-1689, 2011 |
|
IEEE Transactions on Audio, Speech, and Language Processing
Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(2):225-241, 2011 |
|
IJST (Springer)
Robustness of Group Delay Representations for Noisy Speech Signals, , and , in: IJST (Springer), 14(4), 2011 |
|
Journal of Machine Learning Research
Natural Language Processing (Almost) from Scratch, , , , , and , in: Journal of Machine Learning Research, 12:2493-2537, 2011 |
|
Journal of Physical Agents
Towards semi-supervised learning of semantic spatial concepts for mobile robots, and , in: Journal of Physical Agents, 2011 |
|
Proceedings of the National Academy of Sciences
Comparing machines and humans on a visual categorization test, , , , , and , in: Proceedings of the National Academy of Sciences, 2011 |
Speech Communication
A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech, , and , in: Speech Communication, 2011 |
[DOI] |
Springer Multimedia Systems Journal
Privacy-sensitive recognition of group conversational context with sociometers, , , and , in: Springer Multimedia Systems Journal, 2011 |
|
Transactions on Multimedia Computing, Communications and Applications
VlogSense: Conversational Behavior and Social Attention in YouTube, and , in: Transactions on Multimedia Computing, Communications and Applications, 2011 |
|
EURASIP Journal on Audio Speech and Music Processing
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , , and , in: EURASIP Journal on Audio Speech and Music Processing, 2010(856280), 2010 |
[DOI] [URL] |
IEEE Journal of Selected Topics in Signal Processing
Measuring the gap between HMM-based ASR and TTS, , and , in: IEEE Journal of Selected Topics in Signal Processing, in print, 2010 |
|
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING
Probabilistic Mining of Socio-Geographic Routines from Mobile Phone Data, and , in: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 4(4), 2010 |
|
IEEE Trans. on Multimedia, Special Issue on Multimodal Affective Interaction
Estimating Cohesion in Small Groups Using Audio-Visual Nonverbal Behavior, and , in: IEEE Trans. on Multimedia, Special Issue on Multimodal Affective Interaction, 12(6):563-575, 2010 |
|
IEEE Transactions on Audio, Speech, and Language Processing
Tuning-Robust Initialization Methods for Speaker Diarization, and , in: IEEE Transactions on Audio, Speech, and Language Processing, 18(8):2028-2037, 2010 |
[DOI] |
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING
Autoregressive Models of Amplitude Modulations in Audio Compression, , and , in: IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2010 |
[URL] |
IEEE Transactions on Multimedia
Mining group nonverbal conversational patterns using probabilistic topic models, and , in: IEEE Transactions on Multimedia, 2010 |
|
Modeling and Understanding Flickr Communities through Topic-based Analysis, and , in: IEEE Transactions on Multimedia, 12(5), 2010 |
[DOI] |
Image and Vision Computing
The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, , and , in: Image and Vision Computing, 2010 |
[DOI] |
Multimedia Tools and Applications, Special issue on Social Media
Inferring competitive role patterns in reality TV show through nonverbal analysis, and , in: Multimedia Tools and Applications, Special issue on Social Media, 2010 |
|
Pattern Recognition
A Multi-class Classification Strategy for Fisher Scores: Application to Signer Independent Sign Language Recognition, and , in: Pattern Recognition, 43(5), 2010 |
[DOI] |
Pattern Recognition Letters
Feature distribution modelling techniques for 3D face recognition, , and , in: Pattern Recognition Letters, 31:1324-1330, 2010 |
|
Speech Communication
Hierarchical and Parallel Processing of Auditory and Modulation Frequencies for Automatic Speech Recognition, , in: Speech Communication, 52(10):790-800, 2010 |
[DOI] |
Multi-Stream Speech Recognition based on Dempster-Shafer Combination Rule, , in: Speech Communication, 52(3):213-222, 2010 |
[DOI] |
EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision
Contextual classification of image patches with latent aspect models, , , and , in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009 |
|
IEEE Signal Processing Letters
A Novel Criterion for Classifiers Combination in Multistream Speech Recognition, , in: IEEE Signal Processing Letters, 16(7), 2009 |
[DOI] |
IEEE Transactions on Audio Speech and Language Processing
An Information Theoretic Approach to Speaker Diarization of Meeting Data, , and , in: IEEE Transactions on Audio Speech and Language Processing, 17(7), 2009 |
[DOI] |
IEEE Transactions on Multimedia
Automatic Role Recognition in Multiparty Recordings: Using Social Affiliation Networks for Feature Extraction, , and , in: IEEE Transactions on Multimedia, 11(7), 2009 |
|
IEEE Transactions on Systems, Man, Cybernetics, Part-B
Recognizing Human Visual Focus of Attention from Head Pose in Meetings, and , in: IEEE Transactions on Systems, Man, Cybernetics, Part-B, 39(1), 2009 |
|
Image & Vision Computing
A novel statistical generative model dedicated to face recognition, and , in: Image & Vision Computing, 2009 |
|
Image and Vision Computing
Classifying Material in the Real World, , , and , in: Image and Vision Computing, accepted for pub, 2009 |
Image and Vision Computing, Special Issue on Human Behavior
Automatic nonverbal analysis of social interaction in small groups: A review, , in: Image and Vision Computing, Special Issue on Human Behavior, 27(12), 2009 |
|
International Journal of Robotics Research
COLD: The COsy Localization Database, and , in: International Journal of Robotics Research, 28(5), 2009 |
|
Journal of Machine Learning Research
Bounded kernel-based perceptrons, , and , in: Journal of Machine Learning Research, Accepted for pub, 2009 |
Linguistica Antverpiensia New Series
The FEMTI guidelines for contextual MT evaluation: principles and tools, , and , in: Linguistica Antverpiensia New Series, 8, 2009 |
Pattern Recognition
On the vulnerability of face verification systems to hill-climbing attacks, , , , and , in: Pattern Recognition, 2009 |
Towards Life-long Learning for Cognitive Systems: Online Independent Support Vector Machine, , , , and , in: Pattern Recognition, Accepted for Pub, 2009 |
Pattern Recognition Letters
Multi-layer Boosting for Pattern Recognition, , in: Pattern Recognition Letters, 30, 2009 |
Signal Processing
Verified Speaker Localization Utilizing Voicing Level in Split-bands, , , and , in: Signal Processing, 89(6):1038-1049, 2009 |
|
Speech Communication
Discriminative Keyword Spotting, , and , in: Speech Communication, 51(4), 2009 |
|
Clinical Neurophysiology
A Brain-Actuated Wheelchair: Asynchronous and Non-Invasive Brain-Computer Interfaces for Continuous Control of Robots, , , , , , and , in: Clinical Neurophysiology, 2008 |
|
Electronic Letters on Computer Vision and Image Analysis
Class specific object recognition using kernel Gibbs distributions, , in: Electronic Letters on Computer Vision and Image Analysis, 7(2), 2008 |
|
IEEE Intelligent Systems
Brain-Controlled Robots, , in: IEEE Intelligent Systems, 2008 |
|
IEEE Signal Processing Letters
Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, , and , in: IEEE Signal Processing Letters, 2008 |
|
IEEE Trans. on Biomedical Engineering
Error-Related EEG Potentials Generated during Simulated Brain-Computer Interaction, and , in: IEEE Trans. on Biomedical Engineering, 55(3), 2008 |
|
IEEE Trans. on Pattern Analysis and Machine Intelligence
Tracking the visual focus of attention for a varying number of wandering people, , , and , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 30(7), 2008 |
|
IEEE Transactions on Audio, Speech and Language Processing
Modeling Dominance in Group Conversations using NonVerbal Activity Cues, , , and , in: IEEE Transactions on Audio, Speech and Language Processing, 2008 |
|
IEEE Transactions on Biomedical Engineering
Fast Recognition of Anticipation Related Potentials, , and , in: IEEE Transactions on Biomedical Engineering, 2008 |
|
IEEE Transactions on Neural Systems & Rehabilitation Engineering
Characterizing the EEG Correlates of Exploratory Behavior, , , and , in: IEEE Transactions on Neural Systems & Rehabilitation Engineering, 2008 |
|
IEEE Transactions on Pattern Analysis and Machine Intelligence
Classification-based Probabilistic Modeling of Texture Transition for Fast Line Search Tracking and Delineation, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008 |
Multi-Camera People Tracking with a Probabilistic Occupancy Map, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(2), 2008 |
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
A Discriminative Kernel-based Model to Rank Images from Text Queries, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), X, 2008 |
|
International Journal of Pattern Recognition and Artificial Intelligence
Non-Invasive Brain-Machine Interaction, , , , and , in: International Journal of Pattern Recognition and Artificial Intelligence, 2008 |
|
Journal of Acoustical Society of America - Express Letters
Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , in: Journal of Acoustical Society of America - Express Letters, 2008 |
|
Journal of Machine Learning Research
SimpleMKL, , , and , in: Journal of Machine Learning Research, 9, 2008 |
|
Stationary Features and Cat Detection, and , in: Journal of Machine Learning Research, 9, 2008 |
Language Resources and Evaluation
Dimensionality of Dialogue Act Tagsets: An Empirical Analysis of Large Corpora, , in: Language Resources and Evaluation, 42(1), 2008 |
[DOI] |
Pattern Recognition Letters
Discriminative cue integration for medical image annotation, , and , in: Pattern Recognition Letters, 2008 |
|
Computational Intelligence and Neuroscience
Context-based Filtering for Assisted Brain-Actuated Wheelchair Driving, , , , , , , and , in: Computational Intelligence and Neuroscience, 2007, 2007 |
|
Vibrotactile Feedback for Brain-Computer Interface Operation, , , , , , , , , , , and , in: Computational Intelligence and Neuroscience, 2007, 2007 |
|
Computer Vision and Image Understanding
Local velocity-adapted motion events for spatio-temporal recognition, , and , in: Computer Vision and Image Understanding, 108(3), 2007 |
|
IEEE Computer
Human-centered Computing: Toward a Human Revolution, , , and , in: IEEE Computer, 40(5), 2007 |
|
IEEE Pattern Analysis and Machine Intelligence
Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps, , and , in: IEEE Pattern Analysis and Machine Intelligence, 2007 |
|
IEEE Signal Processing Letters
Bayesian Factorial Linear Gaussian State-Space Models for Biosignal Decomposition, and , in: IEEE Signal Processing Letters, 2007 |
|
IEEE Transactions on Audio, Speech and Language Processing
Short-Term Spatio-Temporal Clustering Applied to Multiple Moving Speakers, and , in: IEEE Transactions on Audio, Speech and Language Processing, 15(5):15, 2007 |
|
IEEE Transactions on Multimedia
Role Recognition in Broadcast News Using Social Network Analysis and Duration Distribution Modeling, , in: IEEE Transactions on Multimedia, 2007 |
|
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Thousand Words in a Scene, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007 |
|
Modeling semantic aspects for cross-media image indexing, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007 |
|
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Special Issue on Biometrics
Person Authentication using Brainwaves (EEG) and Maximum A Posteriori Model Adaptation, and , in: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Special Issue on Biometrics, 2007 |
|
International Journal on Image and Video Processing Special Issue on Facial Image Processing
On the Recent Use of Local Binary Patterns for Face Authentication, , and , in: International Journal on Image and Video Processing Special Issue on Facial Image Processing, 2007 |
|
Journal of Neuroscience Methods
High-Resolution EEG Techniques for Brain-Computer Interface Applications, , , , , , , , , , , and , in: Journal of Neuroscience Methods, 2007 |
|
Pattern Recognition
A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems, and , in: Pattern Recognition, 2007 |
|
EURASIP Journal on Applied Signal Processing, Special Issue on Advances in Multimicrophone Speech Processing
Sector-Based Detection for Hands-Free Speech Enhancement in Cars, , and , in: EURASIP Journal on Applied Signal Processing, Special Issue on Advances in Multimicrophone Speech Processing, 2006 |
|
IEEE Trans. on Audio, Speech and Language Processing
Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis, and , in: IEEE Trans. on Audio, Speech and Language Processing, 14(5), 2006 |
|
IEEE Trans. on Audio, Speech, and Language Processing, accepted for publication.
Audio-visual probabilistic tracking of multiple speakers in meetings, , , and , in: IEEE Trans. on Audio, Speech, and Language Processing, accepted for publication., 2006 |
|
IEEE Trans. on Neural Systems and Rehabilitation Engineering
The BCI Competition III: Validating Alternative Approaches to Actual BCI Problems, , , , , , , , , and , in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, 14(2), 2006 |
Towards a Robust BCI: Error Potentials and Online Learning, , and , in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, 14(2), 2006 |
|
IEEE Transactions on Image Processing
Embedding Motion in Model-Based Stochastic Tracking, , and , in: IEEE Transactions on Image Processing, 15(11), 2006 |
IEEE Transactions on Multimedia
Application of Information Retrieval Technologies to Presentation Slides, and , in: IEEE Transactions on Multimedia, 8(5), 2006 |
|
Image and Vision Computing Journal
Measuring the Performance of Face Localization Systems, , , and , in: Image and Vision Computing Journal, 24(8), 2006 |
|
International Journal on Image, Systems and Technology
Spin Glass Models of Markov Random Fields, , in: International Journal on Image, Systems and Technology, 16(5), 2006 |
|
Neurocomputing
EEG Classification using Generative Independent Component Analysis, and , in: Neurocomputing, 2006 |
|
NeuroImage
Very High Frequency Oscillations (VHFO) as a Predictor of Movement Intentions, , , , , and , in: NeuroImage, 32(1), 2006 |
|
Speech Communication
On Variable-scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, , and , in: Speech Communication, 48(9), 2006 |
|
User-Customized Password Speaker Verification Using Multiple Reference and Background Models, and , in: Speech Communication, 8, 2006 |
|
Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing in Supervised Learning, , , and , 2005 |
|
Cognitive Processing, Special Issue on Motor Planning in Humans and Neuroprosthesis Control
Non-Invasive Estimation of Local Field Potentials for Neuroprosthesis Control, , , , and , in: Cognitive Processing, Special Issue on Motor Planning in Humans and Neuroprosthesis Control, 6(1), 2005 |
|
IEEE Signal Processing Letters, Volume 12
A Unified Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification, and , in: IEEE Signal Processing Letters, Volume 12, 12(7), 2005 |
|
IEEE Trans. on Signal Processing
How Do Correlation and Variance of Base-Experts Affect Fusion in Biometric Authentication Tasks?, and , in: IEEE Trans. on Signal Processing, 2005 |
|
IEEE Transactions on Signal Processing
User Authentication via Adapted Statistical Models of Face Images, , and , in: IEEE Transactions on Signal Processing, 2005 |
|
IEEE Transactions on Pattern Analysis and Machine Intelligence
Noisy Text Categorization, , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(12), 2005 |
|
International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI)
Monte Carlo Video Text Segmentation, , and , in: International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI), 19(5), 2005 |
|
Mente y Cerebro
Interfaces Cerebrales, , in: Mente y Cerebro, 13(July), 2005 |
|
Neurocomputing
Online Policy Adaptation for Ensemble Classifiers, and , in: Neurocomputing, 2005 |
|
Pattern Recognition (in press)
On transforming statistical models for non-frontal face verification, , and , in: Pattern Recognition (in press), 2005 |
[DOI] |
Pattern Recognition Journal
Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, and , in: Pattern Recognition Journal, 2005 |
|
Pattern Recognition Letters
Application of Information Retrieval Techniques to Single Writer Documents, , in: Pattern Recognition Letters, 26(14-15), 2005 |
|
Video Text Recognition using Sequential Monte Carlo and Error Voting Methods, and , in: Pattern Recognition Letters, 26(9), 2005 |
|
Artificial Intelligence
Brain-Actuated Interaction, , , and , in: Artificial Intelligence, 159(1-2), 2004 |
|
Digital Signal Processing
Identity verification using speech and face information, and , in: Digital Signal Processing, 14(5), 2004 |
[DOI] |
IEEE Trans. on Biomedical Engineering, Special Issue on Brain-Machine Interfaces
Non-Invasive Brain-Actuated Control of a Mobile Robot by Human EEG, , , and , in: IEEE Trans. on Biomedical Engineering, Special Issue on Brain-Machine Interfaces, 51(6), 2004 |
|
IEEE Trans. on Speech and Audio Processing
Speech recognition with auxiliary information, , and , in: IEEE Trans. on Speech and Audio Processing, 4, 2004 |
IEEE Transactions on Pattern Analysis and Machine Intelligence
Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models, , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(6), 2004 |
|
IEEE Transactions on Pattern Analysis and Machine Intelligence (to appear)
Automatic Analysis of Multimodal Group Actions in Meetings, , , , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence (to appear), 2004 |
|
IEEE Transactions on Speech and Audio Processing
A Generative Model for Music Transcription, , and , in: IEEE Transactions on Speech and Audio Processing, 2004 |
|
Information Fusion
Multimodal Speech Processing Using Asynchronous Hidden Markov Models, , in: Information Fusion, 5(2), 2004 |
|
Journal of the Acoustical Society of America (JASA)
Evaluation of Formant-Like Features for Automatic Speech Recognition, , , , , and , in: Journal of the Acoustical Society of America (JASA), 116(3), 2004 |
|
Pattern Recognition
Text Detection and Recognition in Images and Videos, , and , in: Pattern Recognition, 37(3), 2004 |
Signal Processing: Image Communication
A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Contrast Independent Features and Machine Learning Methods, , and , in: Signal Processing: Image Communication, 19(3), 2004 |
|
Communications of the ACM
Adaptive Brain Interfaces, , in: Communications of the ACM, 46(3), 2003 |
Computer Speech & Language
Robust Speech Recognition and Feature Extraction Using HMM2, , , and , in: Computer Speech & Language, 17(2-3), 2003 |
IEEE Signal Processing Letters (to appear)
Robust Speaker Change Detection, , and , in: IEEE Signal Processing Letters (to appear), 2003 |
|
IEEE Trans. on Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interface Technology
Asynchronous BCI and Local Neural Classifiers: An Overview of the Adaptive Brain Interface Project, and , in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interface Technology, 11(2), 2003 |
|
IEEE Transactions on Circuits and Systems for Video Technology
Finding Structure in Home Videos by Probabilistic Hierarchical Clustering, , and , in: IEEE Transactions on Circuits and Systems for Video Technology, 13(6), 2003 |
|
IEEE Transactions on Speech and Audio Processing
Microphone Array Post-filter based on Noise Field Coherence, and , in: IEEE Transactions on Speech and Audio Processing, 11(6), 2003 |
|
International Journal on Pattern Recognition and Artificial Intelligence (IJPRAI)
Scaling Large Learning Problems with Hard Parallel Mixtures, , and , in: International Journal on Pattern Recognition and Artificial Intelligence (IJPRAI), 17(3), 2003 |
|
Neurocomputing
Combining Neural Gas and Learning Vector Quantization for Cursive Character Recognition, and , in: Neurocomputing, 51, 2003 |
|
Pattern Recognition
Automatic Facial Expression Analysis: A Survey, and , in: Pattern Recognition, 36(1), 2003 |
|
Pattern Recognition Letters
Fast features for face authentication under illumination direction changes, and , in: Pattern Recognition Letters, 24(14), 2003 |
[DOI] |
Structurally noise resistant classifier for multi-modal person verification, and , in: Pattern Recognition Letters, 24(16), 2003 |
Speech Communication
Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework, , and , in: Speech Communication, 40, 2003 |
|
to be published in IEEE Signal Processing Letters
Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering, and , in: to be published in IEEE Signal Processing Letters, 2003 |
|
to be published in IEEE Transactions on Speech and Audio Processing
Comparison and Combination of Features in a Hybrid HMM/MLP and a HMM/GMM Speech Recognition System, , , , and , in: to be published in IEEE Transactions on Speech and Audio Processing(48), 2003 |
|
IEEE Transactions on Pattern Analysis and Machine Intelligence
Estimating the Intrinsic Dimension of Data with a Fractal-Based Method, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(10), 2002 |
|
Information Fusion
Confidence Measures for Multimodal Identity Verification, , , and , in: Information Fusion, 3(04), 2002 |
|
Neural Computation
A Parallel Mixture of SVMs for Very Large Scale Problems, , and , in: Neural Computation, 14(05), 2002 |
|
Pattern Recognition
A survey on Off-Line Cursive Word Recognition, , in: Pattern Recognition, 35(07), 2002 |
|
Pattern Recognition Letters
Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, and , in: Pattern Recognition Letters, 23(8), 2002 |
|
Speech Communication
Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model, and , in: Speech Communication, 2002 |
|
Journal of Machine Learning Research
SVMTorch: Support Vector Machines for Large-Scale Regression Problems, and , in: Journal of Machine Learning Research, 1, 2001 |
|
Neural Processing Letters
Intrinsic dimension estimation of data: an approach based on Grassberger-Procaccia's algorithm, and , in: Neural Processing Letters, 14(01), 2001 |
|
Pattern Recognition Letters
A new normalization technique for cursive handwritten words, and , in: Pattern Recognition Letters, 22(09), 2001 |
|
Cursive Character Recognition by Learning Vector Quantization, and , in: Pattern Recognition Letters, 22(6), 2001 |
|
Speech Communication
Multi-stream adaptive evidence combination for noise robust ASR, , , and , in: Speech Communication, 2001 |
IEEE Transactions on Neural Networks special issue on data mining and knowledge discovery
Taking on the Curse of Dimensionality in Joint Distributions Using Neural Networks, and , in: IEEE Transactions on Neural Networks special issue on data mining and knowledge discovery, 2000 |
|
IEEE Transactions on Multimedia
Audio-Visual Speech Modelling for Continuous Speech Recognition, and , in: IEEE Transactions on Multimedia, 2000 |
Journal of Geographic Information and Decision Analysis
Local Machine Learning Models for Spatial Data Analysis, and , in: Journal of Geographic Information and Decision Analysis, 4(01), 2000 |
|
Pattern Recognition
Combining multiple tracking algorithms for improved general performance, , and , in: Pattern Recognition, 34(06), 2000 |
|
Video Indexing and Similarity Retrieval by Largest Common Subgraph Detection using Decision Trees, , and , in: Pattern Recognition, 34(05), 2000 |
|
Annals of Mathematics and Artificial Intelligence
On the Complexity of Recognizing Regions Computable by Two-Layered Perceptrons, , in: Annals of Mathematics and Artificial Intelligence, 1999 |
|
DSP Journal (Special Issue on the Nist Speaker Recognition Workshop)
The ELISA Systems for the NIST'99 Evaluation in Speaker Detection and Tracking, , , , , , , , , , , , , , , , , , , , and , in: DSP Journal (Special Issue on the Nist Speaker Recognition Workshop), 1999 |
IEEE Transactions on Neural Networks
Fusion of Face and Speech Data for Person Identity Verification, , and , in: IEEE Transactions on Neural Networks, 10(05), 1999 |
|
Optical Engineering
Discrete All-Positive Multilayer Perceptrons for Optical Implementation, , and , in: Optical Engineering, 37(4), 1998 |
|
Computer Vision and Image Understanding
Speechreading using Probabilistic Models, and , in: Computer Vision and Image Understanding, 65(02), 1997 |
IEEE Transactions on Neural Networks
High Order and Multilayer Perceptron Initialization, and , in: IEEE Transactions on Neural Networks, 8(02), 1997 |
Neural Processing Letters
Two neural network construction methods, and , in: Neural Processing Letters, 6(01), 1997 |
Neurocomputing
Calendar of meetings (several issues), , in: Neurocomputing, 1997 |
Pattern Recognition Letters
Acoustic-Labial Speaker Verification, , , and , in: Pattern Recognition Letters, 18(09), 1997 |
|
Fusion of audio and video information for multi modal person authentication, , , , and , in: Pattern Recognition Letters, 18(9), 1997 |
Zeitschrift für Kristallographie
The 3-regular nets with 4 and 6 vertices per unit cell, , and , in: Zeitschrift für Kristallographie, 212, 1997 |
Zeolites
Zeolite cycle sequences, and , in: Zeolites, 19, 1997 |
Applied Optics
Incorporation of Liquid-Crystal Light Valve Non-Linearities in Optical Multilayer Neural Networks, , and , in: Applied Optics, 35(26), 1996 |
|
International Journal of Neural Systems
Constructive Training Methods for Feedforward Neural Networks with Binary Weights, and , in: International Journal of Neural Systems, 7(2), 1996 |
|
Journal of the European Optical Society
Time Resolved Polarimetry on an Optical Fiber Ammeter, and , in: Journal of the European Optical Society, 5, 1996 |
Neural Computation
The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks, , and , in: Neural Computation, 8(02), 1996 |
|
Neurocomputing
A Review of MicroNeuro'96, February 12-14, 1996, Lausanne, Switzerland, , in: Neurocomputing, 12(04), 1996 |
Generalized Cauchy Machines, and , in: Neurocomputing, 1996 |
Pattern Recognition
Finding Lines Under Bounded Error, , in: Pattern Recognition, 29(01), 1996 |
SIAM Journal of Discr. Math
On the Power of Democratic Networks, , in: SIAM Journal of Discr. Math, 9(02), 1996 |
|
IEEE Speech and Audio Processing
Automatic Word Recognition in Cars, and , in: IEEE Speech and Audio Processing, 1995 |
Optical Engineering
Adaptive Multilayer Optical Neural Network with Optical Thresholding, and , in: Optical Engineering, 34(08), 1995 |
Computer Standards & Interfaces
Neural Network Classification and Formalization, , in: Computer Standards & Interfaces, 16(03), 1994 |
Publications of type Book
2023
Intelligent Technologies: Concepts, Applications, and Future Directions, Volume 2, and , Springer, volume 1098, 2023 |
[DOI] |
2022
Natural Language Processing in Healthcare, , , , and , Taylor & Francis Group, 2022 |
[DOI] [URL] |
2020
OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation, , , , , and , European Language Resources Association (ELRA), 2020 |
[URL] |
2015
Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT 2015), , , and , Association for Computational Linguistics, 2015 |
[URL] |
2014
Proceedings of the 16th International Conference on Multimodal Interaction, ICMI 2014, Istanbul, Turkey, November 12-16, 2014, , , , , and , ACM, 2014 |
2013
Interactive Multimodal Information Management, and , EPFL Press, 2013 |
Proceedings of the ACL Workshop on Discourse in Machine Translation (DiscoMT 2013), , , and , Association for Computational Linguistics, 2013 |
[URL] |
2012
Multimodal Signal Processing: Human Interactions in Meetings, , , and , Cambridge University Press, 2012 |
[URL] |
Together Anywhere, Together Anytime, Technologies for Intimate Interactions, , , and , Centrum Wiskunde & Informatica, 2012 |
2011
Analysis of Verbal and Nonverbal Communication and Enactment: The Processing Issues, , , , and , Springer Verlag, 2011 |
2010
Human Behavior Understanding, , Springer Verlag, 2010 |
2009
Multimodal Signal Processing: Methods and Techniques to Build Multimodal Interactive Systems, , and , Academic Press, 2009 |
2008
Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, and , John Wiley & Sons, 2008 |
Machine Learning for Audio, Image and Video Analysis, and , Springer Verlag, 2008 |
Machine Learning for Multimodal Interaction IV, , and , Springer-Verlag, LNCS, volume 4892, 2008 |
[DOI] |
Machine Learning for Multimodal Interaction V, and , Springer-Verlag, LNCS, volume 5237, 2008 |
[DOI] |
2007
Towards Brain-Computer Interfacing, , , and , The MIT Press, 2007 |
2002
Proceedings of the Twelfth IEEE Workshop on Neural Networks for Signal Processing (NNSP), IEEE Press, 2002 |
2000
Traitement de la Parole, , , , and , Presses Polytechniques Universitaires Romandes, 2000 |
1997
CRC Comprehensive Dictionary of Electrical Engineering, , CRC Press, 1997 |
1996
Handbook of Neural Computation, Institute of Physics and Oxford University Press, The Computational Intelligence Library, 1996 |
1994
Connectionist Speech Recognition - A Hybrid Approach, and , Kluwer Academic Publishers, 1994 |
|
Ergonomics in Robotics: Advances and Innovations (2025)
Intuitive Robot Programming, , , , and , in: Ergonomics in Robotics: Advances and Innovations, Springer, 2025 |
Human-Robot Collaboration: Unlocking the potential for industrial applications (2023)
Programming industrial robots from few demonstrations, , in: Human-Robot Collaboration: Unlocking the potential for industrial applications, pages 9-37, Institution of Engineering and Technology (IET), 2023 |
Robotic Research (2023)
Reactive Anticipatory Robot Skills with Memory, , and , in: Robotic Research, pages 436-451, Springer, 2023 |
|
Handbook of Biometric Anti-Spoofing (2023)
Robust Face Presentation Attack Detection with Multi-channel Neural Networks, and , in: Handbook of Biometric Anti-Spoofing, Springer, 2023 |
|
Intelligent Technologies: Concepts, Applications, and Future Directions. Studies in Computational Intelligence (2022)
Classifying the Social Media Author Profile Through a Multimodal Representation, , , and , in: Intelligent Technologies: Concepts, Applications, and Future Directions. Studies in Computational Intelligence, Springer, 2022 |
[DOI] [URL] |
SHC Task 63: Solar Neighborhood Planning, Subtask C: Solar Planning Tools (2022)
Identification of existing tools and workflows for solar neighborhood planning, , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: SHC Task 63: Solar Neighborhood Planning, Subtask C: Solar Planning Tools, IEA, 2022 |
[DOI] |
Early Detection of Mental Health Disorders by Social Media Monitoring: The First Five Years of the eRisk Project (2022)
Two Simple and Domain-independent Approaches for Early Detection of Anorexia, , , and , in: Early Detection of Mental Health Disorders by Social Media Monitoring: The First Five Years of the eRisk Project, pages 159-182, Springer International Publishing, 2022 |
[DOI] [URL] |
Deep Learning-Based Face Analytics (2021)
Multi-channel Face Presentation Attack Detection Using Deep Learning, and , in: Deep Learning-Based Face Analytics, Springer International Publishing, 2021 |
|
Machine Learning for Healthcare Applications (2021)
Semantic Behavior Analysis of COVID-19 Patients: A Collaborative Framework, , , and , in: Machine Learning for Healthcare Applications, John Wiley & Sons, Inc. USA and Scrivener Publishing LLC, USA, 2021 |
[URL] |
The role of constituents in multiword expressions. Phraseology and Multiword Expressions (2020)
Compositionality in English deverbal compounds: The role of the head, , and , in: The role of constituents in multiword expressions. Phraseology and Multiword Expressions, Language Science Press, Berlin, 2020 |
Handbook of Biometric Anti-Spoofing (2019)
An Introduction to Vein Presentation Attacks and Detection, , and , in: Handbook of Biometric Anti-Spoofing, Springer International Publishing, 2019 |
[DOI] [URL] |
Mixture Models and Applications (2019)
Interactive Generation of Calligraphic Trajectories from Gaussian Mixtures, , and , in: Mixture Models and Applications, pages 23-38, Springer, 2019 |
[DOI] |
Humanoid Robotics: a Reference (2019)
Learning Control, and , in: Humanoid Robotics: a Reference, pages 1261-1312, Springer, 2019 |
[DOI] [URL] |
Encyclopedia of Robotics (2019)
Learning from Demonstration (Programming by Demonstration), , in: Encyclopedia of Robotics, Springer, 2019 |
[DOI] [URL] |
Mixture Models and Applications (2019)
Mixture Models for the Analysis, Edition, and Synthesis of Continuous Time Series, , in: Mixture Models and Applications, pages 39-57, Springer, 2019 |
[DOI] |
Handbook of Biometric Anti-Spoofing (2019)
Recent Advances in Face Presentation Attack Detection, , , and , in: Handbook of Biometric Anti-Spoofing, Springer, 2019 |
[URL] |
Voice Presentation Attack Detection Using Convolutional Neural Networks, , , , , and , in: Handbook of Biometric Anti-Spoofing, pages 391-415, Springer, 2019 |
[URL] |
Handbook of Biometric Anti-Spoofing: Presentation Attack Detection, 2nd Edition (2018)
A Cross-database Study of Voice Presentation Attack Detection, and , in: Handbook of Biometric Anti-Spoofing: Presentation Attack Detection, 2nd Edition, Springer, 2018 |
Robotics Research (2018)
Robot Learning with Task-Parameterized Generative Models, , in: Robotics Research, pages 111-126, Springer, 2018 |
[DOI] [URL] |
Wiley StatsRef: Statistics Reference Online (2018)
Sequential Design of Computer Experiments, , in: Wiley StatsRef: Statistics Reference Online, Wiley, 2018 |
Digital Polis (2018)
What TripAdvisor Can't Tell: Crowdsourcing Urban Impressions for Whole Cities, , and , in: Digital Polis, L'Oeil d'Or (translated into French), 2018 |
|
Social Signal Processing (2017)
Analysis of Small Groups, , and , in: Social Signal Processing, pages 349-367, Cambridge University Press. Editors J. Burgoon, N. Magnenat-Thalmann, M. Pantic, and A. Vinciarelli, 2017 |
[DOI] |
Arqueologia computacional: Nuevos enfoques para el analisis y la difusion del patrimonio cultural (2017)
MAAYA: Multimedia Methods to Support Maya Epigraphic Analysis, , , , , , and , in: Arqueologia computacional: Nuevos enfoques para el analisis y la difusion del patrimonio cultural, INAH-RedTDPC, 2017 |
|
User-Centric Privacy and Security in Biometrics (2017)
Presentation attack detection in voice biometrics, and , in: User-Centric Privacy and Security in Biometrics, The Institution of Engineering and Technology, 2017 |
|
mODa 11 - Advances in Model-Oriented Design and Analysis (2016)
Design of Computer Experiments Using Competing Distances Between Set-Valued Inputs, , , and , in: mODa 11 - Advances in Model-Oriented Design and Analysis, pages 123-131, Springer International Publishing, 2016 |
[DOI] |
Handbook of Robotics (2016)
Learning From Humans, , and , in: Handbook of Robotics, pages 1995-2014, Springer, 2016 |
[DOI] [URL] |
Monte Carlo and Quasi-Monte Carlo Methods (2016)
On ANOVA Decompositions of Kernels and Gaussian Random Field Paths, , , , and , in: Monte Carlo and Quasi-Monte Carlo Methods, pages 315-330, Springer International Publishing, 2016 |
[DOI] |
Ellerle Konusmak: Türk İşaret Dili Araştırmaları / Research on Turkish Sign Language (2016)
Otomatik İşaret Dili Tanıma ve Türk İşaret Dili için Bilgisayar Uygulamaları, , , , and , in: Ellerle Konusmak: Türk İşaret Dili Araştırmaları / Research on Turkish Sign Language, pages 471-498, Koc University Press, 2016 |
Machine Learning, Optimization, and Big Data (2015)
Differentiating the Multipoint Expected Improvement for Optimal Batch Design, , and , in: Machine Learning, Optimization, and Big Data, pages 37-48, Springer International Publishing, 2015 |
[DOI] |
International Conference on Speech and Computer, SPECOM (2015)
DNN-based Speech Synthesis: Importance of input features and training data, , and , in: International Conference on Speech and Computer, SPECOM, pages 193-200, Springer Berlin Heidelberg, 2015 |
[DOI] |
Machine Learning, Optimization, and Big Data (2015)
Global Optimization with Sparse and Local Gaussian Process Models, and , in: Machine Learning, Optimization, and Big Data, pages 185-196, Springer International Publishing, 2015 |
[DOI] |
Intelligent Robotics and Applications (2015)
Learning the Stiffness of a Continuous Soft Manipulator from Multiple Demonstrations, , , and , in: Intelligent Robotics and Applications, pages 185-195, Springer, 2015 |
[DOI] [URL] |
Computational Linguistics and Intelligent Text Processing (2015)
Rehabilitation of Count-based Models for Word Vector Representations, and , in: Computational Linguistics and Intelligent Text Processing, pages 417-429, Springer International Publishing, 2015 |
Papers dedicated to Jacques Moeschler (2014)
Discourse connectives: theoretical models and empirical validations in humans and computers, and , in: Papers dedicated to Jacques Moeschler, University of Geneva, 2014 |
[URL] |
Handbook of Biometric Anti-Spoofing (2014)
Evaluation Databases, , , and , in: Handbook of Biometric Anti-Spoofing, pages 247-278, Springer-Verlag, 2014 |
[DOI] |
Handbook of Biometric Anti-Spoofing (2014)
Evaluation Methodologies, , and , in: Handbook of Biometric Anti-Spoofing, Springer, 2014 |
Person Re-Identification (2014)
Re-Identification for Improved People Tracking, , and , in: Person Re-Identification, pages 311-336, Springer, 2014 |
Interactive Multimodal Information Management (2013)
Interactive Multimodal Information Management: Shaping the Vision, and , in: Interactive Multimodal Information Management, pages 1-17, EPFL Press, 2013 |
|
Learning to learn new models of human activities in indoor settings, , , and , in: Interactive Multimodal Information Management, EPFL Press, 2013 |
|
Medical image annotation, , in: Interactive Multimodal Information Management, EPFL Press, 2013 |
|
Speech Processing, , in: Interactive Multimodal Information Management, pages 221-245, EPFL Press, 2013 |
Intelligent Video Surveillance Systems (ISTE) (2013)
Unsupervised methods for activity analysis and detection of abnormal events, and , in: Intelligent Video Surveillance Systems (ISTE), Wiley, 2013 |
[DOI] |
Neural Networks: Tricks of the Trade (2012)
Deep Learning via Semi-Supervised Embedding, , , and , in: Neural Networks: Tricks of the Trade, Springer, 2012 |
|
Multimodal Signal Processing: Human Interactions in Meetings (2012)
Evaluation of Meeting Support Technology, and , in: Multimodal Signal Processing: Human Interactions in Meetings, pages 237-252, Cambridge University Press, 2012 |
LNCS Proceedings on Cognitive Behavioural Systems (2012)
From Nonverbal Cues to Perception: Personality and Social Attractiveness, , , , and , in: LNCS Proceedings on Cognitive Behavioural Systems, Springer, 2012 |
Neural Networks: Tricks of the Trade (2012)
Implementing Neural Networks Efficiently, , and , in: Neural Networks: Tricks of the Trade, Springer, 2012 |
|
Multimodal Signal Processing: Human Interactions in Meetings (2012)
Multimodal Signal Processing for Meetings: an Introduction, and , in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012 |
|
Sampling techniques for audio-visual tracking and head pose estimation, and , in: Multimodal Signal Processing: Human Interactions in Meetings, pages 84-102, Cambridge University Press, 2012 |
|
Practical Applications of Sparse Modeling: Biology, Signal Processing and Beyond (2012)
Sparsity in Topic Models, , and , in: Practical Applications of Sparse Modeling: Biology, Signal Processing and Beyond, MIT Press, 2012 |
|
Multimodal Signal Processing: Human Interactions in Meetings (2012)
Speaker Diarization, and , in: Multimodal Signal Processing: Human Interactions in Meetings, Cambridge University Press, 2012 |
[URL] |
User Requirements for Meeting Support Technology, and , in: Multimodal Signal Processing: Human Interactions in Meetings, pages 210-221, Cambridge University Press, 2012 |
Computer Analysis of Human Behavior (2011)
Analysis of Group Conversations: Modeling Social Verticality, and , in: Computer Analysis of Human Behavior, pages 293-322, Springer London, 2011 |
Social Media Computing (2011)
Call me Guru: user categories and large-scale behavior in YouTube, and , in: Social Media Computing, Springer, 2011 |
|
Handbook of Natural Language Processing and Machine Translation (2011)
Data-driven extraction of spectral-dynamics based posteriors, , in: Handbook of Natural Language Processing and Machine Translation, Springer, 2011 |
[URL] |
"Computer Analysis of Human Behavior" by A. Salah and T. Gevers (eds.) (2011)
Introduction to Sequence Analysis for Human Behavior Understanding, and , in: "Computer Analysis of Human Behavior" by A. Salah and T. Gevers (eds.), pages 21-40, Springer Verlag, 2011 |
"Visual Analysis of Humans" by T. B. Moeslund, A. Hilton, V. Krueger and L. Sigal (eds.) (2011)
Social Signal Processing: The Research Agenda, , , , , , , , and , in: "Visual Analysis of Humans" by T. B. Moeslund, A. Hilton, V. Krueger and L. Sigal (eds.), pages 511-538, Springer Verlag, 2011 |
Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues, A. Esposito (ed.) (2010)
More than Words: Inference of Socially Relevant Information from Nonverbal Vocal Cues in Speech, , , and , in: Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues, A. Esposito (ed.), LNCS, Springer, 2010 |
|
"Affective Computing and Interaction: Psychological, Cognitive and Neuroscientific Perspectives" by D. Gokcay & G. Yildirim (eds.) (2010)
Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, and , in: "Affective Computing and Interaction: Psychological, Cognitive and Neuroscientific Perspectives" by D. Gokcay & G. Yildirim (eds.), igi-global, 2010 |
|
H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments (2010)
Visual Attention, Speaking Activity, and Group Conversational Analysis in Multi-Sensor Environments, and , in: H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, 2010 |
Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods (2009)
A Kernel Wrapper for Phoneme Sequence Recognition, and , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009 |
A Large Margin Algorithm for Forced Alignment, , , and , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009 |
A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition, , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009 |
Multimodal Corpora: From Models of Natural Interaction to Systems and Applications (2009)
Accessing a Large Multimodal Corpus using an Automatic Content Linking Device, , , and , in: Multimodal Corpora: From Models of Natural Interaction to Systems and Applications, Springer-Verlag, 2009 |
[DOI] |
Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods (2009)
Discriminative Keyword Spotting, , and , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009 |
Multimodal Signal Processing for Human-Computer Interaction (2009)
Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions, , in: Multimodal Signal Processing for Human-Computer Interaction, Elsevier / Academic Press, 2009 |
J.-P. Thiran, H. Bourlard, and F. Marques (Eds.), Multimodal Signal Processing (2009)
Modeling interest in face-to-face conversations from multimodal nonverbal behavior, , in: J.-P. Thiran, H. Bourlard, and F. Marques (Eds.), Multimodal Signal Processing, Academic Press, 2009 |
|
Multi-camera networks: principles and applications (2009)
Multi-Person Bayesian Tracking with Multiple Cameras, and , in: Multi-camera networks: principles and applications, pages 363-388, Academic Press, 2009 |
|
Applied Signal Processing - A MATLAB approach (2008)
How does a dictation machine recognize speech?, , and , in: Applied Signal Processing - A MATLAB approach, Springer MA, 2008 |
|
Machine Learning for Multimodal Interaction IV (2008)
Towards an Objective Test for Meeting Browsers: the BET4TQB Pilot Experiment, , , and , in: Machine Learning for Multimodal Interaction IV, Springer-Verlag, 2008 |
[DOI] |
Towards Brain-Computer Interfacing (2007)
Adaptation in Brain-Computer Interfaces, , , , , , , , , , and , in: Towards Brain-Computer Interfacing, The MIT Press, 2007 |
Error-Related EEG Potentials in Brain-Computer Interfaces, and , in: Towards Brain-Computer Interfacing, The MIT Press, 2007 |
Non-Invasive Estimates of Local Field Potentials for Brain-Computer Interfaces, , , and , in: Towards Brain-Computer Interfacing, The MIT Press, 2007 |
European Visions for the Knowledge Age (2007)
Tapping the Mind or Resonating Minds?, , in: European Visions for the Knowledge Age, Cheshire Henbury, 2007 |
Towards Brain-Computer Interfacing (2007)
The IDIAP Brain-Computer Interface: An Asynchronous Multi-Class Approach, , and , in: Towards Brain-Computer Interfacing, The MIT Press, 2007 |
2006 IMIA Yearbook of Medical Informatics (2006)
Non-Invasive Brain-Actuated Control of a Mobile Robot by Human EEG, , , and , in: 2006 IMIA Yearbook of Medical Informatics, Schattauer Verlag, 2006 |
The Handbook of Brain Theory and Neural Networks: The Second Edition (2002)
Brain-Computer Interfaces, , in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002 |
|
Hidden Markov Models and other Finite State Automata for Sequence Processing, and , in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002 |
|
Robot Navigation, , in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002 |
|
Mathematical Foundations of Speech Processing and Recognition (2002)
Towards Robust and Adaptive Speech Recognition Models, , and , in: Mathematical Foundations of Speech Processing and Recognition, Springer-Verlag, 2002 |
|
Speech Processing in the Auditory System (2000)
Automatic Speech Recognition: an Auditory Perspective, , and , in: Speech Processing in the Auditory System, Springer Verlag, New York, 2000 |
to be published in The Handbook of Brain Theory and Neural Networks (2000)
Neural Networks in Automatic Speech Recognition, , , and , in: to be published in The Handbook of Brain Theory and Neural Networks, Bradford Books, The MIT Press, 2000 |
Kohonen Maps (1999)
Indexing Audio Documents by using Latent Semantic Analysis and SOM, , in: Kohonen Maps, Elsevier, 1999 |
|
Modern Interface Technology: The Leading Edge (1999)
Speech Reading, , in: Modern Interface Technology: The Leading Edge, Research Studies Press Ltd., 1999 |
Survey of the State of the Art in Human Language Technology (1998)
Connectionist Techniques, and , in: Survey of the State of the Art in Human Language Technology, Cambridge University Press, 1998 |
Adaptive Processing of Sequences and Data Structures (1998)
Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, and , in: Adaptive Processing of Sequences and Data Structures, Springer Verlag, 1998 |
Optical Metrology (1997)
Ellipsometry, , in: Optical Metrology, Artech House, 1997 |
Handbook of Neural Computation (1997)
Neural Network Adaptations to Hardware Implementations, and , in: Handbook of Neural Computation, Institute of Physics Publishing and Oxford University Publishing, 1997 |
|
Speechreading by Humans and Machines (1996)
Active Shape Models for Visual Speech Feature Extraction, , and , in: Speechreading by Humans and Machines, Springer Verlag, 1996 |
|
Machine Recognition and Applications, , and , in: Speechreading by Humans and Machines, Springer Verlag, 1996 |
Handbook of Neural Computation (1996)
Neural Network Topologies, , in: Handbook of Neural Computation, 1996 |
Fondements et perspectives en traitement automatique de la parole (1996)
Reconnaissance et compréhension de la parole: évaluation et applications, , , , and , in: Fondements et perspectives en traitement automatique de la parole, AUPELF-UREF, 1996 |
Handbook of Neural Computation (1996)
Supervised Ontogenic Networks, and , in: Handbook of Neural Computation, 1996 |
The handbook of brain theory and neural networks (1995)
A Hybrid Approach to Continuous Speech Recognition, and , in: The handbook of brain theory and neural networks, The MIT Press, 1995 |
From Natural to Artificial Neural Computation (1995)
An All-Optical Forward Propagation Multilayer Neural Network, and , in: From Natural to Artificial Neural Computation, Springer Verlag, 1995 |
Recent Developments in Computer Vision (1995)
Applying Handwriting Recognition to US Census Forms, , in: Recent Developments in Computer Vision, Springer, 1995 |
|
Spoken Language Resources and Assessment (1995)
Assessment of speaker verification systems, and , in: Spoken Language Resources and Assessment, EAGLES Handbook, 1995 |
Recent Developments in Computer Vision (1995)
Handwriting Recognition, , in: Recent Developments in Computer Vision, Springer, 1995 |
Fondements et perspectives en traitement automatique de la parole (1995)
Les domaines d'application des technologies vocales, , in: Fondements et perspectives en traitement automatique de la parole, GDR-PRC Communication Homme-Machine, 1995 |
From Natural to Artificial Neural Computation (1995)
Neural Network Initialization, and , in: From Natural to Artificial Neural Computation, Springer Verlag, 1995 |
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (2024)
A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference, , and , in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024 |
The IEEE International Joint Conference on Biometrics (2024)
A Novel and Responsible Dataset for Face Presentation Attack Detection on Mobile Devices, , , , , and , in: The IEEE International Joint Conference on Biometrics, Buffalo, New York, pages 8, 2024 |
|
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS Datasets and Benchmarks 2024) (2024)
A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics, , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS Datasets and Benchmarks 2024), 2024 |
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) (2024)
A Symbolic Framework for Systematic Evaluation of Mathematical Reasoning with Transformers, , , and , in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024 |
[URL] |
Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction (2024)
A System for Human-Robot Teaming through End-User Programming and Shared Autonomy, , , , , and , in: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pages 231-239, 2024 |
[DOI] [URL] |
The 18th IEEE International Conference on Automatic Face and Gesture Recognition (2024)
A Unified Model for Gaze Following and Social Gaze Prediction, , , and , in: The 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024 |
|
Interspeech (2024)
Adversarial Robustness Analysis in Automatic Pathological Speech Detection Approaches, and , in: Interspeech, 2024 |
|
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (2024)
Annotator-centric Active Learning for Subjective NLP Tasks, , , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2024 |
Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP (2024)
Are there identifiable structural parts in the sentence embedding whole?, and , in: Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2024 |
Proceedings of IEEE International Joint Conference on Biometrics (2024)
Assessing the Reliability of Biometric Authentication on Virtual Reality Devices, , and , in: Proceedings of IEEE International Joint Conference on Biometrics, 2024 |
|
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024)
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning, , , , and , in: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024 |
[URL] |
Proceedings of the 10th Italian Conference on Computational Linguistics (2024)
BLM-It - Blackbird Language Matrices for Italian: A CALAMITA Challenge, , , and , in: Proceedings of the 10th Italian Conference on Computational Linguistics, 2024 |
Out Of Distribution Generalization in Computer Vision, Workshop at ECCV (2024)
Can We Learn to Select the Right Algorithm for OOD Generalization?, and , in: Out Of Distribution Generalization in Computer Vision, Workshop at ECCV, 2024 |
18th IEEE Int. Conference on Automatic Face and Gesture Recognition (FG), Istanbul (2024)
CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition, , , and , in: 18th IEEE Int. Conference on Automatic Face and Gesture Recognition (FG), Istanbul, 2024 |
|
Proceedings of the European Conference on Computer Vision (ECCV) Workshops (2024)
ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild, , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Proceedings of the 12th European Workshop on Visual Information Processing (2024)
Comparing Stability and Discriminatory Power of Hand-crafted Versus Deep Radiomics: A 3D-Printed Anthropomorphic Phantom Study, , , , , , , , , , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
Robotics: Science and Systems (RSS) (2024)
Configuration Space Distance Fields for Manipulation Planning, , , and , in: Robotics: Science and Systems (RSS), 2024 |
|
ICASSP (2024)
Content-Based Objective Evaluation of Artificially Generated Sign Language Videos, , , , , and , in: ICASSP, 2024 |
|
Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024 (2024)
Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024 |
|
IEEE International Conference on Robotics and Automation (2024)
D-LGP: Dynamic Logic-Geometric Program for Reactive Task and Motion Planning, , and , in: IEEE International Conference on Robotics and Automation, 2024 |
|
Proceedings of the 6th Clinical Natural Language Processing Workshop (2024)
DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews, , , , , and , in: Proceedings of the 6th Clinical Natural Language Processing Workshop, Association for Computational Linguistics, 2024 |
|
49th IEEE International Conference on Acoustics, Speech and Signal Processing (2024)
Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition, , and , in: 49th IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2024 |
[DOI] [URL] |
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (2024)
DiffuCOMET: Contextual Commonsense Knowledge Diffusion, , , , , and , in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, Bangkok, Thailand, pages 4809-4831, Association for Computational Linguistics, 2024 |
[DOI] [URL] |
The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024) (2024)
Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement, , , and , in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (2024)
Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models, , and , in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024 |
International Joint Conference on Biometrics (2024)
Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection, and , in: International Joint Conference on Biometrics, 2024 |
|
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (2024)
Explaining models relating objects and privacy, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024 |
[URL] |
Tenth Italian Conference on Computational Linguistics (2024)
Exploring Italian sentence embeddings properties through multi-tasking, , , and , in: Tenth Italian Conference on Computational Linguistics, 2024 |
Int. Conf. Computer Vision and Pattern Recognition (CVPR), Workshop on Gaze Estimation and Prediction in the Wild (2024)
Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following, , , and , in: Int. Conf. Computer Vision and Pattern Recognition (CVPR), Workshop on Gaze Estimation and Prediction in the Wild, 2024 |
|
Proc. IEEE Intl Conf. on Robotics and Automation (2024)
Extending the Cooperative Dual-Task Space in Conformal Geometric Algebra, and , in: Proc. IEEE Intl Conf. on Robotics and Automation, 2024 |
|
IEEE International Joint Conference on Biometrics (2024)
Face Liveness Detection Competition (LivDet-Face) - 2024, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots (2024)
Feature Representations for Automatic Meerkat Vocalization Classification, , , and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) (2024)
Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2024 |
|
Proceedings of the 3rd Workshop on Knowledge Augmented Methods for NLP (2024)
GADePo: Graph-Assisted Declarative Pooling Transformers for Document-Level Relation Extraction, , , and , in: Proceedings of the 3rd Workshop on Knowledge Augmented Methods for NLP, Association for Computational Linguistics, 2024 |
[DOI] [URL] |
International Conference on Learning Representations (ICLR) (2024)
Generalized Policy Iteration using Tensor Approximation for Hybrid Control, , and , in: International Conference on Learning Representations (ICLR), 2024 |
|
Proceedings of the European Conference on Computer Vision (ECCV) Workshops (2024)
GLoFool: global enhancements and local perturbations to craft adversarial images, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
IEEE International Conference on Acoustics, Speech and Signal Processing (2024)
Heterogeneous Face Recognition Using Domain Invariant Units, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Proceedings of the European Conference on Computer Vision (ECCV) Workshops (2024)
Image-guided topic modeling for interpretable privacy classification, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
EUSIPCO (2024)
Impact of Speech Mode in Automatic Pathological Speech Detection, and , in: EUSIPCO, IEEE, 2024 |
[URL] |
Findings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024) (2024)
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders, , , , and , in: Findings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Proc. IEEE Intl Conf. on Robotics and Biomimetics (ROBIO) (2024)
Learning Goal-oriented Bimanual Dough Rolling Using Dynamic Heterogeneous Graph Based on Human Demonstration, , , , , , , and , in: Proc. IEEE Intl Conf. on Robotics and Biomimetics (ROBIO), 2024 |
Proc. Robotics: Science and Systems (RSS) (2024)
Logic-Skill Programming: An Optimization-based Approach to Sequential Skill Planning, , , and , in: Proc. Robotics: Science and Systems (RSS), 2024 |
|
IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (2024)
Mitigating Demographic Bias in Face Recognition via Regularized Score Calibration, and , in: IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, IEEE/CVF, 2024 |
|
IEEE International Joint Conference on Biometrics (2024)
Modality Agnostic Heterogeneous Face Recognition with Switch Style Modulators, and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024) (2024)
Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions, , and , in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024)
Neural Redshift: Random Networks are not Random Functions, , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 |
First conference on Language Modelling (2024)
Nonparametric Variational Regularisation of Pretrained Transformers, and , in: First conference on Language Modelling, 2024 |
[URL] |
Odyssey 2024: The Speaker and Language Recognition Workshop (2024)
Normalizing Flows for Speaker and Language Recognition Backend, , , , and , in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024 |
|
4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots (2024)
On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis, and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024)
Open-Vocabulary Object 6D Pose Estimation, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 |
[URL] |
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2024)
Parkinson's Disease Detection through Formant and F0 Analysis at Syllable Level, and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
|
The 17th ACM International Conference on Web Search and Data Mining (2024)
ProGAP: Progressive Graph Neural Networks with Differential Privacy Guarantees, and , in: The 17th ACM International Conference on Web Search and Data Mining, 2024 |
|
Proc. Intl Workshop on the Algorithmic Foundations of Robotics (WAFR) (2024)
Recursive Forward Dynamics for Serial Kinematic Chains using Conformal Geometric Algebra, and , in: Proc. Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2024 |
Proceedings of the 12th European Workshop on Visual Information Processing (2024)
Refining Tuberculosis Detection in CXR Imaging: Addressing Bias in Deep Neural Networks via Interpretability, , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
IEEE International Conference on Robotics and Automation (2024)
Representing Robot Geometry as Distance Fields: Applications to Whole-body Manipulation, , , and , in: IEEE International Conference on Robotics and Automation, 2024 |
|
Proceedings of Conference on Robot Learning (2024)
Robust Manipulation Primitive Learning via Domain Contraction, , , and , in: Proceedings of Conference on Robot Learning, 2024 |
|
Proceedings of the European Conference on Computer Vision (ECCV) Workshops (2024)
Segmenting Object Affordances: Reproducibility and Sensitivity to Scale, , , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
International Conference on Machine Learning (ICML) (2024)
Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup, , and , in: International Conference on Machine Learning (ICML), 2024 |
[URL] |
Int. Conference Computer Vision and Pattern Recognition (CVPR), Seattle (2024)
Sharingan: A Transformer Architecture for Multi-Person Gaze Following, , and , in: Int. Conference Computer Vision and Pattern Recognition (CVPR), Seattle, 2024 |
|
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops (2024)
Sparse multi-view hand-object reconstruction for unseen environments, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024 |
[URL] |
Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction (2024)
Synergizing Natural Language Towards Enhanced Shared Autonomy, , and , in: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024 |
[URL] |
Proceedings of Interspeech (2024)
Towards interfacing large language models with ASR systems using confidence measures and prompting, , , , and , in: Proceedings of Interspeech, pages 2980-2984, 2024 |
[DOI] |
Proc. IEEE Intl Conf. on Robotics and Automation (ICRA) (2024)
Towards Robo-Coach: Robot Interactive Stiffness/Position Adaptation for Human Strength and Conditioning Training, , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2024 |
Proc. ACM Workshop on Exploring Innovative Technology for Commensality and Human-Food Interaction (2024)
Towards Wine Tasting Activity Recognition for a Digital Sommelier, , , and , in: Proc. ACM Workshop on Exploring Innovative Technology for Commensality and Human-Food Interaction, 2024 |
Proceedings of the 9th Workshop on Representation Learning for NLP (2024)
Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification, and , in: Proceedings of the 9th Workshop on Representation Learning for NLP, 2024 |
[URL] |
Findings of the European chapter of Association for Computational Linguistics, 2024 (2024)
Understanding the effects of language-specific class imbalance in multilingual fine-tuning, and , in: Findings of the European chapter of Association for Computational Linguistics, 2024 |
|
Proceedings of the International Conference on Pattern Recognition (ICPR) (2024)
Vascular Biometrics Experiments on Candy -- A New Contactless Finger-Vein Dataset, , , , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR), Calcutta (India), 2024 |
|
IEEE International Conference on Acoustics, Speech, and Signal Processing (2024)
Vulnerability of Face Age Verification to Replay Attacks, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024 |
|
NeurIPS 2024 Workshop on Federated Learning (2024)
ZooPFL: Exploring Black-box Foundation Models for Personalized Federated Learning, , , , , , , , and , in: NeurIPS 2024 Workshop on Federated Learning, 2024 |
[URL] |
The 4th Workshop on Recommender Systems for Human Resources, in conjunction with the 18th ACM Conference on Recommender Systems (2024)
Hardware-effective Approaches for Skill Extraction in Job Offers and Resumes, , , , , , and , in: The 4th Workshop on Recommender Systems for Human Resources, in conjunction with the 18th ACM Conference on Recommender Systems, 2024 |
[URL] |
International Conference on Acoustics, Speech and Signal Processing (2024)
Comparing Data-Driven and Handcrafted Features for Dimensional Emotion Recognition, , and , in: International Conference on Acoustics, Speech and Signal Processing, 2024 |
|
Proceedings of Building Simulation 2023: 18th Conference of IBPSA (2023)
A Machine Learning Model for the Prediction of Building Hourly Heating Demand from CityGML Files: Training Workflow and Deployment as an API, , and , in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, pages 2932-2939, 2023 |
[DOI] [URL] |
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS) (2023)
A Multitask and Kernel approach for Learning to Push Objects with a Task-Parameterized Deep Q-Network, , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023 |
|
The Eleventh International Conference on Learning Representations (2023)
A VAE for Transformers with Nonparametric Variational Information Bottleneck, and , in: The Eleventh International Conference on Learning Representations, 2023 |
[URL] |
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) (2023)
Affordance segmentation of hand-occluded containers from exocentric images, , , , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
IEEE International Joint Conference on Biometrics (2023)
Approximating Optimal Morphing Attacks using Template Inversion, , and , in: IEEE International Joint Conference on Biometrics, 2023 |
[DOI] |
13th SESAR Innovation Days (2023)
Automatic Speech Analysis Framework for ATC Communication in HAAWAII, , , , , and , in: 13th SESAR Innovation Days, 2023 |
|
Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023) (2023)
Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers’ Workload, , , , , , , , , , , and , in: Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023), Eurocontrol (Europe), FAA (U.S.), Savannah, Georgia, USA, 2023 |
[URL] |
2023 IEEE Spoken Language Technology Workshop (SLT) (2023)
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) (2023)
Black-box Attacks on Image Activity Prediction and its Natural Language Explanations, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023 |
[DOI] [URL] |
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (2023)
BLESS: Benchmarking Large Language Models on Sentence Simplification, , , , , , and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023 |
|
IJCB (2023)
Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, and , in: IJCB, 2023 |
|
Proc. INTERSPEECH 2023 (2023)
Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding, and , in: Proc. INTERSPEECH 2023, pages 1109-1113, 2023 |
[DOI] |
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (2023)
Can Language Models Learn Analogical Reasoning? Investigating Training Objectives and Comparisons to Human Performance, and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, Association for Computational Linguistics, 2023 |
|
Proceedings of Interspeech (2023)
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?, and , in: Proceedings of Interspeech, 2023 |
|
Proc. Interspeech 2023 (2023)
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice, , , and , in: Proc. Interspeech 2023, 2023 |
[URL] |
Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI) (2023)
Complex Daily Activities, Country-Level Diversity, and Smartphone Sensing: A Study in Denmark, Italy, Mongolia, Paraguay, and UK, , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI), Association for Computing Machinery, 2023 |
[DOI] |
Proc. 13th SESAR Innovation Days (2023)
Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training, , , , , , and , in: Proc. 13th SESAR Innovation Days, Seville, Spain, 2023 |
[DOI] [URL] |
Proc. IEEE Intl Conf. on Robotics and Automation (ICRA) (2023)
Demonstration-guided Optimal Control for Long-term Non-prehensile Planar Manipulation, , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 4999-5005, 2023 |
[DOI] |
Proc. 12th ISCA Speech Synthesis Workshop (SSW 12) (2023)
Diffusion Transformer for Adaptive Text-to-Speech, and , in: Proc. 12th ISCA Speech Synthesis Workshop (SSW 12), 2023 |
[DOI] |
IEEE International Joint Conference on Biometrics (IJCB 2023) (2023)
EFaR 2023: Efficient Face Recognition Competition, , , , and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (2023)
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, , , , , , , , and , in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023 |
|
Proceedings of the IEEE/CVF International Conference on Computer Vision (2023)
Efficient Grapevine Structure Estimation in Vineyards Conditions, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, pages 712-720, 2023 |
[URL] |
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2023)
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation, , , , , , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, 2023 |
[URL] |
Journal of Physics: Conference Series (2023)
Energy assessment of a district by integrating solar thermal in district heating network: a dynamic analysis approach, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), CEUR Workshop Proceedings (2023)
Enhancing Multi-modal Classification of Violent Events using Image Captioning, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), CEUR Workshop Proceedings, Jaén, Spain, 2023 |
[URL] |
Journal of Physics: Conference Series (2023)
Enhancing user acceptance in automated systems with human-centric lighting: the role of visual comfort, personality, and preference, , , , , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR) (2023)
ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), pages 17408-17419, 2023 |
[DOI] |
2nd ACM International Workshop on Multimedia AI against Disinformation (MAD '23), June 12, 2023, Thessaloniki, Greece (2023)
Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework, and , in: 2nd ACM International Workshop on Multimedia AI against Disinformation (MAD '23), June 12, 2023, Thessaloniki, Greece, 2023 |
|
CONCATENATE Workshop at HRI 2023 in Stockholm, Sweden (2023)
Factors that Affect Personalization of Robots for Older Adults, , and , in: CONCATENATE Workshop at HRI 2023 in Stockholm, Sweden, 2023 |
[URL] |
Proceedings of Interspeech (2023)
Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, and , in: Proceedings of Interspeech, pages 156-160, 2023 |
[DOI] [URL] |
Proceedings of the IWSLT conference (2023)
Findings of the IWSLT 2023 evaluation campaign, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of the IWSLT conference, 2023 |
International Conference on Multimedia Retrieval (ICMR '23), June 12-15, 2023, Thessaloniki, Greece (2023)
Framing the News: From Human Perception to Large Language Model Inferences, and , in: International Conference on Multimedia Retrieval (ICMR '23), June 12-15, 2023, Thessaloniki, Greece, 2023 |
|
32nd USENIX Security Symposium (USENIX Security 23) (2023)
GAP: Differentially Private Graph Neural Networks with Aggregation Perturbation, , , and , in: 32nd USENIX Security Symposium (USENIX Security 23), 2023 |
|
2023 IEEE Spoken Language Technology Workshop (SLT) (2023)
How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, , , , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Proceedings of the Human Factors and Ergonomics Society Annual Meeting (2023)
Human-Robot Collaboration in a Sanding Task, , , , , , , , and , in: Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 2023 |
|
Proc. Interspeech 2023 (2023)
HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition, , , and , in: Proc. Interspeech 2023, Ireland, 2023 |
|
Proc. of the 61st Annual Meeting of the Association for Computational Linguistics (2023)
HyperMixer: An MLP-based Low Cost Alternative to Transformers, , , , , , and , in: Proc. of the 61st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Toronto, Canada, pages 15632-15654, 2023 |
[DOI] |
Advances in Neural Information Processing Systems (NeurIPS) (2023)
ID and OOD performance are sometimes inversely correlated on real-world datasets, , , and , in: Advances in Neural Information Processing Systems (NeurIPS), 2023 |
Proc. Interspeech 2023 (2023)
Implementing contextual biasing in GPU decoder for online ASR, , , , , , and , in: Proc. Interspeech 2023, 2023 |
|
Proceedings of the International Conference on Historical Cryptology (2023)
International Conference on the Voynich Manuscript 2022, , , , , , and , in: Proceedings of the International Conference on Historical Cryptology, 2023 |
Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society (2023)
Keep Sensors in Check: Disentangling Country-Level Generalization Issues in Mobile Sensor-Based Models with Diversity Scores, , , and , in: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023 |
|
Findings of the European chapter of Association for Computational Linguistics (2023)
Learning Disentangled Representations for Natural Language Definitions, , , and , in: Findings of the European chapter of Association for Computational Linguistics, 2023 |
|
ICML 2023: The Second Workshop on Spurious Correlations, Invariance and Stability (2023)
Learning diverse features in vision transformers for improved generalization, , , and , in: ICML 2023: The Second Workshop on Spurious Correlations, Invariance and Stability, 2023 |
[URL] |
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS) (2023)
Learning Joint Space Reference Manifold for Reliable Physical Assistance, , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 10412-10417, 2023 |
[DOI] |
The 2023 Conference on Empirical Methods in Natural Language Processing (2023)
Learning to Abstract with Nonparametric Variational Information Bottleneck, , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing, 2023 |
[URL] |
NeurIPS Workshop on Diffusion Models (2023)
Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks, , , , and , in: NeurIPS Workshop on Diffusion Models, 2023 |
[URL] |
Target and Background Signatures IX (2023)
Multi-image deconvolution of thermal images with a boundary condition weighting scheme, , , , and , in: Target and Background Signatures IX, International Society for Optics and Photonics, Amsterdam, pages 149-158, SPIE, 2023 |
[DOI] [URL] |
Proceedings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023) (2023)
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports, , , , , and , in: Proceedings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023 |
Proceedings of Interspeech (2023)
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, , , and , in: Proceedings of Interspeech, 2023 |
|
Findings of the 17th European Chapter of the Association for Computational Linguistics (2023)
On Interventional Probing in High Dimensions: An NLI Case Study, , , and , in: Findings of the 17th European Chapter of the Association for Computational Linguistics, 2023 |
Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23 (2023)
Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, , , , , and , in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023 |
|
Journal of Physics: Conference Series (2023)
Potential for district heating networks from waste heat: an assessment tool and its application to sewage treatment plants in the Canton of Zurich, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI) 2023 (2023)
Quantified Canine: Inferring Dog Personality From Wearables, , , , , and , in: Proceedings of the 2023 ACM CHI Conference on Human Factors in Computing Systems (CHI) 2023, Association for Computing Machinery, 2023 |
[DOI] |
ACM International Conference on Interactive Media Experiences (IMX '23), June 2023, Nantes, France (2023)
Referencing in YouTube Knowledge Communication Videos, and , in: ACM International Conference on Interactive Media Experiences (IMX '23), June 2023, Nantes, France, 2023 |
|
20th International Conference on Ubiquitous Robots (UR) (2023)
Robust Execution of Assembly Policies Using a Pose Invariant Task Representation, , , , , and , in: 20th International Conference on Ubiquitous Robots (UR), Honolulu, HI, USA, IEEE, 2023 |
|
Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023) (2023)
SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data, , , , , and , in: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Association for Computational Linguistics, 2023 |
[DOI] [URL] |
CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (2023)
Situated Participatory Design: A Method for In Situ Design of Robotic Interaction with Older Adults, , and , in: CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023 |
[DOI] [URL] |
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS) (2023)
SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph Transformer, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2023 |
|
Findings of EMNLP (2023)
Strong and Efficient Baselines for Open Domain Conversational Question Answering, , and , in: Findings of EMNLP, Association for Computational Linguistics, 2023 |
[DOI] [URL] |
IEEE International Joint Conference on Biometrics (IJCB 2023) (2023)
SynthDistill: Face Recognition with Knowledge Distillation from Synthetic Data, , and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
[DOI] |
Proc. 18th Blizzard Challenge Workshop (2023)
The Idiap Speech Synthesis System for the Blizzard Challenge 2023, , , and , in: Proc. 18th Blizzard Challenge Workshop, 2023 |
[DOI] |
Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023 (2023)
The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation, and , in: Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 4408-4423, Association for Computational Linguistics, 2023 |
[DOI] |
IEEE International Joint Conference on Biometrics (IJCB 2023) (2023)
The Unconstrained Ear Recognition Challenge 2023: Maximizing Performance and Minimizing Bias, and , in: IEEE International Joint Conference on Biometrics (IJCB 2023), 2023 |
|
IEEE International Joint Conference on Biometrics (2023)
Toward responsible face datasets: modeling the distribution of a disentangled latent space for sampling face images from demographic groups, , and , in: IEEE International Joint Conference on Biometrics, 2023 |
|
Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction (2023)
Towards Improved Replicability of Human Studies in Human-Robot Interaction: Recommendations for Formalized Reporting, , , , , , , , , , and , in: Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, pages 629-633, 2023 |
|
Big Picture Workshop at EMNLP 2023 (2023)
Transformers as Graph-to-Graph Models, , , and , in: Big Picture Workshop at EMNLP 2023, 2023 |
International Conference on Semantic Computing (2023)
Transformers, Tables and Frame Semantics, , , and , in: International Conference on Semantic Computing, 2023 |
25th ACM International Conference on Multimodal Interaction (2023)
Understanding the Social Context of Eating with Multimodal Smartphone Sensing: The Role of Country Diversity, , and , in: 25th ACM International Conference on Multimodal Interaction, 2023 |
[DOI] [URL] |
Proceedings of Interspeech (2023)
Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report, , , , , and , in: Proceedings of Interspeech, pages 4573-4577, 2023 |
[DOI] [URL] |
Proceedings of Building Simulation 2023: 18th Conference of IBPSA (2023)
Verification of PyDHN - a Python library for the thermo-hydraulic simulation of district heating networks - through the DESTEST, , and , in: Proceedings of Building Simulation 2023: 18th Conference of IBPSA, IBPSA, 2023 |
[DOI] [URL] |
Proc. IEEE International Conference on Robotics and Automation (ICRA) (2023)
VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, , , and , in: Proc. IEEE International Conference on Robotics and Automation (ICRA), 2023 |
|
IEEE International Joint Conference on Biometrics (2023)
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes, , , and , in: IEEE International Joint Conference on Biometrics, 2023 |
|
Under review (2023)
Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines, , , , , , and , in: Under review, 2023 |
[URL] |
Shortcut Bias Mitigation via Ensemble Diversity Using Diffusion Probabilistic Models, , , , , and , in: Under review, 2023 |
[URL] |
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) (2023)
ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023 |
|
Proceedings of the Second Workshop on Text Simplification, Accessibility and Readability (2023)
Document-level Text Simplification with Coherence Evaluation, , , and , in: Proceedings of the Second Workshop on Text Simplification, Accessibility and Readability, 2023 |
|
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (2022)
A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022 |
|
Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2022)
A two-step approach to leverage contextual data: speech recognition in air-traffic communications, , , , and , in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 |
|
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2022)
Active Learning by Feature Mixing, , , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022 |
Annual Conference of the International Speech Communication Association (2022)
Adversarial-free speaker identity-invariant representation learning for automatic dysarthric speech classification, and , in: Annual Conference of the International Speech Communication Association, 2022 |
|
21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022) (2022)
An anomaly detection approach for backdoored neural networks: face recognition as a case study, and , in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), Darmstadt, Germany, 2022 |
|
Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In ACL Anthology Proceedings (2022)
An Empirical Comparison of Semantic Similarity Methods for Analyzing down-streaming Automatic Minuting task, , , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In ACL Anthology Proceedings, 2022 |
|
Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology (2022)
An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology, 2022 |
|
Proceedings of ISES and IEA SHC International Conference on Solar Energy for Buildings and Industry (2022)
An exploratory interplay between daylight, general and task lighting for visual comfort and electricity savings in a personal office space, , , , , , and , in: Proceedings of ISES and IEA SHC International Conference on Solar Energy for Buildings and Industry, Kassel, Germany, 2022 |
International Conference on Acoustics, Speech and Signal Processing (2022)
Are GAN-based Morphs Threatening Face Recognition?, , , and , in: International Conference on Acoustics, Speech and Signal Processing, 2022 |
|
Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology (2022)
Automatic Minuting: A Pipeline Method for Generating Minutes, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In proceedings of ACL Anthology, 2022 |
|
Automatic Summarization for Creative Writing, International Conference on Computational Linguistics (COLING 2022) (2022)
Automatic Summarization for Creative Writing: Denoising Auto-Encoder based Pipeline Method for Generating Summary of Movie Scripts, , , , and , in: Automatic Summarization for Creative Writing, International Conference on Computational Linguistics (COLING 2022), 2022 |
|
Proc. Interspeech 2022 (2022)
Bayesian Recurrent Units and the Forward Backward Algorithm, and , in: Proc. Interspeech 2022, pages 4137-4141, 2022 |
[DOI] |
Pacific Asia Conference on Language, Information and Computation (PACLIC 2022), In Proceedings of ACL Anthology (2022)
Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, , , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
International Conference on Pattern Recognition (2022)
Borrowing from yourself: Faster future video segmentation with partial channel update, and , in: International Conference on Pattern Recognition, 2022 |
|
Proceedings of the 29th International Conference on Computational Linguistics (2022)
Case-Based Abductive Natural Language Inference, , and , in: Proceedings of the 29th International Conference on Computational Linguistics, 2022 |
[URL] |
Annual Conference of the International Speech Communication Association (2022)
Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech, , , , , , , , , and , in: Annual Conference of the International Speech Communication Association, pages 2188-2192, 2022 |
[DOI] |
Proceedings of the 13th Language Resources and Evaluation Conference (2022)
Conversational Speech Recognition Needs Data? Experiments with Austrian German, , , and , in: Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, pages 4684-4691, 2022 |
[URL] |
International Conference on Acoustics, Speech, and Signal Processing (2022)
Custom attribution loss for improving generalization and interpretability of deepfake detection, , and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
BlackBoxNLP: Workshop on analyzing and interpreting neural networks for NLP (2022)
Decomposing Natural Logic Inferences for Neural NLI, , , , and , in: BlackBoxNLP: Workshop on analyzing and interpreting neural networks for NLP, 2022 |
Proceedings of the 1st International Conference on the Voynich Manuscript (2022)
Demystifying the Scribes behind the Voynich Manuscript using Computational Linguistic Techniques, , and , in: Proceedings of the 1st International Conference on the Voynich Manuscript, 2022 |
- (2022)
DHgeN: Automated Generation of District Heating Network Layouts for Feasibility Studies, and , in: -, 2022 |
|
arXiv (2022)
EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering, , , , and , in: arXiv, 2022 |
NeurIPS 2022 (2022)
Efficient Training of Low-Curvature Neural Networks, , , and , in: NeurIPS 2022, 2022 |
[URL] |
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2022)
Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization, , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022 |
|
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2022)
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, , , , and , in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022 |
[DOI] |
International Conference on Acoustics, Speech, and Signal Processing (2022)
Experimental investigation on STFT phase Representations for deep learning-based dysarthric speech detection, and , in: International Conference on Acoustics, Speech, and Signal Processing, 2022 |
|
ACM Multimedia (2022)
Face Anthropometry Aware Audio-visual Age Verification, and , in: ACM Multimedia, 2022 |
|
International Conference on Pattern Recognition Workshops (2022)
Fairness Index Measures to Evaluate Bias in Biometric Recognition, and , in: International Conference on Pattern Recognition Workshops, 2022 |
|
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR) (2022)
GeoNeRF: Generalizing NeRF with Geometry Priors, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2022 |
[URL] |
12th SESAR Innovation Days (2022)
Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition, , , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
Findings of Association for Computational Linguistics: ACL 2022 (2022)
Graph Refinement for Coreference Resolution, and , in: Findings of Association for Computational Linguistics: ACL 2022, 2022 |
Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia (2022)
Health Talk: Understanding Practices of Popular Professional YouTubers, , , , and , in: Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia, 2022 |
ACL (2022)
Hierarchical Multi-task learning framework for Isometric-Speech Language Translation, , , and , in: ACL, 2022 |
|
Pacific Asia Conference on Language, Information and Computation (PACLIC 2022), In Proceedings of ACL Anthology (2022)
HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation (2022)
How Did Europe’s Press Cover Covid-19 Vaccination News? A Five-Country Analysis, and , in: MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, 2022 |
[DOI] [URL] |
Thirty-Sixth AAAI Conference on Artificial Intelligence (2022)
Hybrid Autoregressive Inference for Scalable Multi-hop Explanation Regeneration, , , and , in: Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022 |
ACL (2022)
IDIAP Submission@LT-EDI-ACL2022 : Hope Speech Detection for Equality, Diversity and Inclusion, and , in: ACL, 2022 |
|
IDIAP Submission@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text, and , in: ACL, 2022 |
|
ACL Proceedings (2022)
IDIAP Submission@LT-EDI-ACL2022: Homophobia/Transphobia Detection in social media comments, and , in: ACL Proceedings, 2022 |
|
The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022) (2022)
IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
ACL (2022)
IDIAP_TIET@LT-EDI-ACL2022 : Hope Speech Detection in Social Media using Contextualized BERT with Attention Mechanism, , and , in: ACL, 2022 |
|
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS) (2022)
Imitation of Manipulation Skills Using Multiple Geometries, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022 |
|
International Conference on Computational Linguistics (COLING 2022) (2022)
Innovators@SMM4H'22: An Ensembles Approach for self-reporting of COVID-19 Vaccination Status Tweets, , , , and , in: International Conference on Computational Linguistics (COLING 2022), 2022 |
|
Innovators@SMM4H'22: An Ensembles Approach for Stance and Premise Classification of COVID-19 Health Mandates Tweets, , , , and , in: International Conference on Computational Linguistics (COLING 2022), 2022 |
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS) (2022)
Learning to Guide Online Multi-Contact Receding Horizon Planning, , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2022 |
Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022) (2022)
Leveraging Events Sub-Categories for Violent-Events Detection in Social Media, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022 |
[URL] |
SPIE sensors + imaging, Target and Background Signatures VIII (2022)
Local estimation of parametric point spread functions in thermal images via convolutional neural networks, , , and , in: SPIE sensors + imaging, Target and Background Signatures VIII, Berlin, Germany, pages 1227009 1-8, SPIE, 2022 |
[DOI] [URL] |
Proceedings of the 11th Joint Conference on Lexical and Computational Semantics (2022)
Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers, , , , and , in: Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, Seattle, USA, pages 89–100, 2022 |
|
International Conference on Language Resources and Evaluation (LREC 2022) (2022)
Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers, , , and , in: International Conference on Language Resources and Evaluation (LREC 2022), 2022 |
|
21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022) (2022)
On the detection of morphing attacks generated by GANs, and , in: 21st International Conference of the Biometrics Special Interest Group (BIOSIG 2022), 2022 |
|
ACL (2022)
PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models, , , , , , and , in: ACL, 2022 |
|
Proceedings of the workshop on Deep Learning for Low-Resource NLP (2022)
Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese, , , , and , in: Proceedings of the workshop on Deep Learning for Low-Resource NLP, 2022 |
[URL] |
European Conference on Computer Vision (2022)
Predicting is not understanding: Recognizing and addressing underspecification in machine learning, , and , in: European Conference on Computer Vision, pages 458-476, Springer, 2022 |
Union World Conference on Lung Health (2022)
Pulmonary Tuberculosis Screening from Radiological Signs on Chest X-Ray Images Using Deep Models, , and , in: Union World Conference on Lung Health, The Union, 2022 |
The International Symposium on Robotics Research (2022)
Reactive Anticipatory Robot Skills with Memory, , and , in: The International Symposium on Robotics Research, 2022 |
|
11th SESAR Innovation Days (2022)
Readback Error Detection by Automatic Speech Recognition and Understanding -- Results of HAAWAII Project for Isavia’s Enroute Airspace, , , , , , , , , and , in: 11th SESAR Innovation Days, SESAR, pages 9, 2022 |
|
arXiv (2022)
Reasoning over vision and language: Exploring the benefits of supplemental knowledge, , , and , in: arXiv, 2022 |
Advances in Neural Information Processing Systems (2022)
SelecMix: Debiased Learning by Contradicting-pair Sampling, , , , , , and , in: Advances in Neural Information Processing Systems, 2022 |
ICML Workshop on Spurious Correlations, Invariance and Stability (2022)
SelecMix: Debiased Learning by Mixing up Contradicting Pairs, , , , , , and , in: ICML Workshop on Spurious Correlations, Invariance and Stability, 2022 |
3rd Workshop on Computational Approaches to Discourse (CODI) @ COLING (2022)
Shallow Discourse Parsing for Open Information Extraction and Text Simplification, , and , in: 3rd Workshop on Computational Approaches to Discourse (CODI) @ COLING, 2022 |
The 2022 Conference on Empirical Methods in Natural Language Processing (2022)
SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages, , , , , and , in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022 |
|
12th SESAR Innovation Days (2022)
Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator, , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
Advances in Neural Information Processing Systems 35 (2022)
Symmetry-induced Disentanglement on Graphs, , and , in: Advances in Neural Information Processing Systems 35, 2022 |
Findings of the ACL (2022)
Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective, , , , and , in: Findings of the ACL, 2022 |
Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing (2022)
TextGraphs 2022 Shared Task on Natural Language Premise Selection, , , , and , in: Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing, 2022 |
[URL] |
Findings of the ACL (2022)
To be or not to be an Integer? Encoding Variables for Mathematical Text, , , , and , in: Findings of the ACL, 2022 |
ACM International Conference on Multimodal Interaction (2022)
Towards Accessible Sign Language Learning and Assessment, , , and , in: ACM International Conference on Multimodal Interaction, Bangalore, India, pages 626-631, 2022 |
[DOI] |
ACM International Conference on Multimodal Interaction (ICMI Companion) (2022)
Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, , , and , in: ACM International Conference on Multimodal Interaction (ICMI Companion), 2022 |
[DOI] |
ICREC 2022 Conference Proceedings (2022)
Towards energy hubs: an innovative Geographic Information System based approach for cluster definition, , , and , in: ICREC 2022 Conference Proceedings, 2022 |
Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023) (2022)
UM-DFKI Maltese Speech Translation, , , , , , , , , and , in: Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), 2022 |
CEUR Workshop Proceedings (2022)
UNSL at eRisk 2022: Decision policies with history for early classification, , , and , in: CEUR Workshop Proceedings, 2022 |
[URL] |
Proceedings of Interspeech (2022)
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering, , and , in: Proceedings of Interspeech, 2022 |
|
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts (2022)
Vision-Language Pretraining: Current Trends and the Future, , and , in: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2022 |
[URL] |
Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics (2022)
Visually Grounded Interpretation of Noun-Noun Compounds in English, , , and , in: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, Association for Computational Linguistics, 2022 |
37th IEEE International Conference on Data Engineering (ICDE) (2022)
Voyager: Data Discovery for Onboarding in Data Science, , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2022 |
The 2022 Conference on Empirical Methods in Natural Language Processing (2022)
What Do Compressed Multilingual Machine Translation Models Forget?, , , , , and , in: The 2022 Conference on Empirical Methods in Natural Language Processing, 2022 |
|
13th International Conference on the Theory and Application of Diagrams (2022)
Why Scholars Are Diagramming Neural Network Models, , and , in: 13th International Conference on the Theory and Application of Diagrams, 2022 |
EAI Pervasive Health (2022)
Your Day in Your Pocket: Complex Activity Recognition from Smartphone Accelerometers, , and , in: EAI Pervasive Health, 2022 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2021)
A Bayesian Interpretation of the Light Gated Recurrent Unit, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
[DOI] |
Proceedings of SWC 2021: ISES Solar World Congress (2021)
A Comparative Study Of Simulation Tools To Model The Solar Irradiation On Building Façades, , , , , , , , , , , , , , , and , in: Proceedings of SWC 2021: ISES Solar World Congress, ISES, 2021 |
[DOI] [URL] |
2021 IEEE International Conference on Acoustics, Speech and Signal Processing (2021)
A Comparison of Methods for OOV-Word Recognition on a New Public Dataset, , and , in: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Toronto, Ontario, Canada, 2021 |
|
IEEE International Conference on Robotics and Automation (2021)
A Laser-based Dual-arm System for Precise Control of Collaborative Robots, , and , in: IEEE International Conference on Robotics and Automation, 2021 |
|
Proc. of Workshop on Emerging paradigms for robotic manipulation: from the lab to the productive world, ICRA (2021)
An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, , and , in: Proc. of Workshop on Emerging paradigms for robotic manipulation: from the lab to the productive world, ICRA, 2021 |
IEEE/RSJ International Conference on Intelligent Robots and Systems (2021)
An Efficient Image-to-Image Translation HourGlass-based Architecture for Object Pushing Policy Learning, , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021 |
|
IEEE Automatic Speech Recognition and Understanding Workshop (2021)
An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021 |
|
Proceedings of ITG Conference on Speech Communication (2021)
An Objective Evaluation Framework for Pathological Speech Synthesis, , , , , and , in: Proceedings of ITG Conference on Speech Communication, 2021 |
|
Proceedings of the 2021 International Conference on Multimodal Interaction (2021)
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, , , , and , in: Proceedings of the 2021 International Conference on Multimodal Interaction, ACM, 2021 |
[DOI] |
2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC) (2021)
Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning, , , , , , , and , in: 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, pages 1-9, IEEE, 2021 |
[DOI] |
IEEE International Conference on Acoustics, Speech and Signal Processing (2021)
Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech, , , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2021 |
|
Proceeding of International Conference on Information Technology (OCIT) (2021)
Automatic Dialect Detection for Low Resource Santali Language, , , , , and , in: Proceeding of International Conference on Information Technology (OCIT), 2021 |
|
45th International Conference on Acoustics, Speech, and Signal Processing (2021)
Automatic Dysarthric Speech Detection Exploiting Pairwise Distance-Based Convolutional Neural Networks, , and , in: 45th International Conference on Acoustics, Speech, and Signal Processing, Toronto, Canada, pages 7328–7332, 2021 |
|
Proceedings of 9th OpenSky Symposium 2020 (2021)
Automatic processing pipeline for collecting and annotating air-traffic voice communication data, , , , , , , , , and , in: Proceedings of 9th OpenSky Symposium 2020, OpenSky Network, Brussels, Belgium, pages 1-9, MDPI, 2021 |
|
Proceedings of the IEEE/CVF International Conference on Computer Vision (2021)
Beyond question-based biases: Assessing multimodal shortcut learning in visual question answering, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Interspeech 2021 (2021)
Boosting of contextual information in ASR for air-traffic call-sign recognition, , , , , , , and , in: Interspeech 2021, 2021 |
|
SafeAI 2021 - AAAI's Workshop on Artificial Intelligence Safety (2021)
Challenges for Using Impact Regularizers to Avoid Negative Side Effects, , and , in: SafeAI 2021 - AAAI's Workshop on Artificial Intelligence Safety, 2021 |
|
NeurIPS (2021)
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers, , and , in: NeurIPS, 2021 |
|
Proceedings of Interspeech (2021)
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, , and , in: Proceedings of Interspeech, 2021 |
[URL] |
Interspeech 2021 (2021)
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , in: Interspeech 2021, 2021 |
[URL] |
37th IEEE International Conference on Data Engineering (ICDE) (2021)
Cost–effective Variational Active Entity Resolution, , , , and , in: 37th IEEE International Conference on Data Engineering (ICDE), 2021 |
[URL] |
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2021)
Cross Modal Focal Loss for RGBD Face Anti-Spoofing, and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 |
|
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) (2021)
DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6039-6048, 2021 |
[URL] |
XV Brazilian Congress on Computational Intelligence (2021)
Development of a lung segmentation algorithm for analog imaged chest X-Ray: preliminary results, , , , and , in: XV Brazilian Congress on Computational Intelligence, Joinville, Brazil, 2021 |
[URL] |
The 2021 Conference on Empirical Methods in Natural Language Processing (2021)
Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders, and , in: The 2021 Conference on Empirical Methods in Natural Language Processing, 2021 |
Journal of Physics: Conference Series (2021)
District heating network modelling for future integration of solar thermal energy, , , and , in: Journal of Physics: Conference Series, pages 012089, IOP Publishing, 2021 |
[DOI] |
14th International Conference on Computational Semantics (2021)
Do Natural Language Explanations Represent Valid Logical Arguments? Verifying Entailment in Explainable NLI Gold Standards, , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
59th Annual Meeting of the Association for Computational Linguistics (Demonstration track) (2021)
Does My Representation Capture X? Probe-Ably, , , , and , in: 59th Annual Meeting of the Association for Computational Linguistics (Demonstration track), 2021 |
[URL] |
14th International Conference on Computational Semantics (2021)
Encoding Explanatory Knowledge for Zero-shot Science Question Answering, , , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
International Symposium on Biomedical Imaging, 2021 (2021)
Estimating Nonplanar Flow from 2D Motion-blurred Widefield Microscopy Images via Deep Learning, and , in: International Symposium on Biomedical Imaging, 2021, 2021 |
|
59th Annual Meeting of the Association for Computational Linguistics (ACL Findings) (2021)
Explainable Inference Over Grounding-Abstract Chains for Science Questions, , and , in: 59th Annual Meeting of the Association for Computational Linguistics (ACL Findings), 2021 |
|
16th conference of the European Chapter of the Association for Computational Linguistics (EACL) (2021)
Explainable Natural Language Reasoning via Conceptual Unification, , and , in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |
[URL] |
International Joint Conference on Biometrics (2021)
Face Liveness Detection Competition (LivDet-Face) - 2021, , , , , , , , , and , in: International Joint Conference on Biometrics, 2021 |
2nd Multimodal Sentiment Analysis Challenge (MuSe '21), October 24, 2021, Virtual Event, China (2021)
Fusion of Acoustic and Linguistic Information Using Supervised Autoencoder for Improved Emotion Recognition, , and , in: 2nd Multimodal Sentiment Analysis Challenge (MuSe '21), October 24, 2021, Virtual Event, China, 2021 |
[DOI] |
1st ISCA Symposium on Security and Privacy in Speech Communication (2021)
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10-13, 2021 |
[DOI] |
Proceedings of Interspeech (2021)
Handling acoustic variation in dysarthric speech recognition systems through model combination, and , in: Proceedings of Interspeech, 2021 |
|
Identification of F1 and F2 in speech using modified zero frequency filtering, and , in: Proceedings of Interspeech, 2021 |
|
Proceedings of the IEEE/CVF International Conference on Computer Vision (2021)
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models, , , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
Journal of Physics: Conference Series (2021)
Implementation of machine learning techniques for the quasi real-time blind and electric lighting optimization in a controlled experimental facility, , , , , and , in: Journal of Physics: Conference Series, IOP Publishing, 2021 |
[DOI] [URL] |
Proceedings of the 25th Conference on Computational Natural Language Learning (2021)
Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning, , , and , in: Proceedings of the 25th Conference on Computational Natural Language Learning, Online, pages 337-348, Association for Computational Linguistics, 2021 |
11th ISCA Speech Synthesis Workshop (2021)
Improving Emotional TTS with an Emotion Intensity Input from Unsupervised Extraction, and , in: 11th ISCA Speech Synthesis Workshop, 2021 |
[URL] |
International Workshop on Multimedia Signal Processing (2021)
Improving Generalization of Deepfake Detection by Training for Attribution, , and , in: International Workshop on Multimedia Signal Processing, 2021 |
|
Proceedings of Interspeech 2021 (2021)
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, , , , , and , in: Proceedings of Interspeech 2021, ISCA-International Speech Communication Association 2021, 2021 |
|
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (2021)
Lattice-Free MMI Adaptation of Self-Supervised Pretrained Acoustic Models, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021 |
[URL] |
IEEE Automatic Speech Recognition and Understanding Workshop (2021)
Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, Cartagena, Colombia, IEEE, 2021 |
|
Proceedings of Building Simulation 2021 (2021)
Machine learning techniques for the daylight and electric lighting performance predictions, , and , in: Proceedings of Building Simulation 2021, 2021 |
Proceedings of Interspeech (2021)
Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, , and , in: Proceedings of Interspeech, 2021 |
[DOI] |
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2021)
Multi-Adversarial Learning for Cross-Lingual Word Embeddings, , and , in: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Online, pages 463-472, 2021 |
Proceedings of Interspeech 2021 (2021)
Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction, , and , in: Proceedings of Interspeech 2021, 2021 |
European Signal Processing Conference, EUSIPCO 2021 (2021)
Multi-task Single Channel Speech Enhancement Using Speech Presence Probability As A Secondary Task Training Target, , and , in: European Signal Processing Conference, EUSIPCO 2021, 2021 |
|
Proceedings of the First Workshop on Multimodal Machine Translation for Low Resource Languages (MMTLRL 2021) (2021)
Multimodal Neural Machine Translation System for English to Bengali, , , , , , and , in: Proceedings of the First Workshop on Multimodal Machine Translation for Low Resource Languages (MMTLRL 2021), Online (Virtual Mode), pages 31-39, INCOMA Ltd., 2021 |
[URL] |
18th Extended Semantic Web Conference (ESWC) (2021)
Natural Language Inference over Tables: Enabling Explainable Data Exploration on Data Lakes, , , and , in: 18th Extended Semantic Web Conference (ESWC), 2021 |
[URL] |
Proceedings of the 8th Workshop on Asian Translation (WAT2021) (2021)
NLPHut's Participation at WAT2021, , , , , , , and , in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 146-154, Association for Computational Linguistics, 2021 |
[URL] |
Proceedings of Interspeech (2021)
On Modeling Glottal Source Information for Phonation Assessment in Parkinson’s Disease, , , , and , in: Proceedings of Interspeech, 2021 |
|
International Joint Conference on Biometrics (IJCB 2021) (2021)
On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, and , in: International Joint Conference on Biometrics (IJCB 2021), 2021 |
|
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (2021)
On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning, , , and , in: Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021 |
International Joint Conference on Biometrics (IJCB 2021) (2021)
On the use of automatically generated synthetic image datasets for benchmarking face recognition, , and , in: International Joint Conference on Biometrics (IJCB 2021), 2021 |
|
Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas (2021)
Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), , , , , , , , and , in: Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, pages 218–223, Association for Computational Linguistics, 2021 |
[DOI] [URL] |
1st ISCA Symposium on Security and Privacy in Speech Communication (2021)
Open-Set Speaker Identification pipeline in live criminal investigations, and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021 |
|
2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) (2021)
Optics Versus Computation: Influence of Illumination and Reconstruction Model Accuracy in Focal-Plane-Scanning Optical Projection Tomography, and , in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), Nice, France, pages 567-570, IEEE, 2021 |
[DOI] |
Proc. IEEE Intl Conf. on Advanced Robotics (ICAR) (2021)
Optimal Control Combining Emulation and Imitation to Acquire Physical Assistance Skills, , and , in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021 |
|
Optimization of robot configurations for motion planning in industrial riveting, , , and , in: Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021 |
|
Proceedings of the 8th Workshop on Asian Translation (WAT2021) (2021)
Overview of the 8th Workshop on Asian Translation, , , , , , , , , , , , , , , and , in: Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 1--45, Association for Computational Linguistics, 2021 |
[URL] |
Security + Defence, Target and Background Signatures VII, Proc. of SPIE (2021)
Perspectives and limitations of visible-thermal image pair synthesis via generative adversarial networks, , , , and , in: Security + Defence, Target and Background Signatures VII, Proc. of SPIE, online only, pages 1186509-1--1186509-8, SPIE, 2021 |
[DOI] [URL] |
Proceedings of European Signal Processing Conference (EUSIPCO) (2021)
Phoneme based Respiratory Analysis of Read Speech, , , and , in: Proceedings of European Signal Processing Conference (EUSIPCO), 2021 |
|
International Conference on Computer Vision - Workshops (2021)
Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers, , and , in: International Conference on Computer Vision - Workshops, 2021 |
|
International Conference on Intelligent Robots and Systems (2021)
Probabilistic Iterative LQR for Short Time Horizon MPC, and , in: International Conference on Intelligent Robots and Systems, pages 579-585, 2021 |
[DOI] |
Proceedings of Robotics: Science and Systems (2021)
PROMPT: Probabilistic Motion Primitives based Trajectory Planning, , , and , in: Proceedings of Robotics: Science and Systems, 2021 |
[DOI] [URL] |
Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021) (2021)
Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety, , , , , , , , , , , , , and , in: Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021), The United States Federal Aviation Administration (FAA), EUROCONTROL, pages 10, 2021 |
[URL] |
International Conference on Learning Representations (2021)
Rethinking the Role of Gradient-based Attribution Methods for Model Interpretability, and , in: International Conference on Learning Representations, 2021 |
|
Interspeech (2021)
Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances, , , , , , , and , in: Interspeech, 2021 |
|
Interspeech Show and Tell 2021 (2021)
ROXANNE Research Platform: Automate criminal investigations, , , , , and , in: Interspeech Show and Tell 2021, 2021 |
|
1st ISCA Symposium on Security and Privacy in Speech Communication (2021)
ROXSD: a Simulated Dataset of Communication in Organized Crime, , , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021 |
|
Diagrams (2021)
Scholarly AI system diagrams as an access point to mental models, , and , in: Diagrams, 2021 |
Interspeech (2021)
Speech Activity Detection Based on Multilingual Speech Recognition System, , and , in: Interspeech, 2021 |
|
16th conference of the European Chapter of the Association for Computational Linguistics (EACL) (2021)
STAR: Cross-modal Statement Representation for Selecting Relevant Mathematical Premises, and , in: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |
Diagrams (2021)
Structuralist analysis for neural network system diagrams, , and , in: Diagrams, 2021 |
The International Conference on Acoustics, Speech, and Signal Processing (2021)
Subjective and objective evaluation of deepfake videos, and , in: The International Conference on Acoustics, Speech, and Signal Processing, 2021 |
|
ITG Conference on Speech Communication (2021)
Supervised Speech Representation Learning for Parkinson's Disease Classification, and , in: ITG Conference on Speech Communication, 2021 |
|
Natural Logic Meets Machine Learning Workshop (2021)
Supporting Context Monotonicity Abstractions in Neural NLI Models, , , , and , in: Natural Logic Meets Machine Learning Workshop, 2021 |
[URL] |
14th International Conference on Computational Semantics (2021)
Switching Contexts: Transportability Measures for NLP, , , and , in: 14th International Conference on Computational Semantics, 2021 |
[URL] |
arXiv (2021)
Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling, and , in: arXiv, 2021 |
|
Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies (2021)
The DCU-EPFL Enhanced Dependency Parser at the IWPT 2021 Shared Task, , , , and , in: Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies, Online, pages 204-212, Association for Computational Linguistics, 2021 |
|
Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (2021)
The Theory, Practice, and Ethical Challenges of Designing a Diversity-Aware Platform for Social Relations, , , , , , , and , in: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, pages 11, ACM, 2021 |
[DOI] |
International Conference on Advanced Robotics (2021)
Trajectory Prediction with Compressed 3D Environment Representation using Tensor Train Decomposition, , , and , in: International Conference on Advanced Robotics, 2021 |
|
Proc. Int. Conf. on Human-Computer Interaction (2021)
Trust indicators and explainable AI: A study on user perceptions, , , , , , and , in: Proc. Int. Conf. on Human-Computer Interaction, Bari, Italy, 2021 |
|
2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) (2021)
Unequivocal cardiac phase sorting from alternating ramp- and pulse-illuminated microscopy image sequences, , , , and , in: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), pages 868-872, 2021 |
[DOI] [URL] |
16th conference of the European Chapter of the Association for Computational Linguistics (2021)
Unification-based Reconstruction of Multi-hop Explanations for Science Questions, , and , in: 16th conference of the European Chapter of the Association for Computational Linguistics, 2021 |
[URL] |
Proceedings of the IEEE/CVF International Conference on Computer Vision (2021)
Unshuffling data for improved generalization in visual question answering, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 |
|
ICLR (2021)
Variational Information Bottleneck for Effective Low-Resource Fine-Tuning, , and , in: ICLR, 2021 |
|
Biometrics Special Interest Group (BIOSIG 2021) (2021)
Vein Enhancement with Deep Auto-Encoders to improve Finger Vein Recognition, , and , in: Biometrics Special Interest Group (BIOSIG 2021), 2021 |
|
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (2021)
Visual Focus of Attention Estimation in 3D Scene with an Arbitrary Number of Targets, and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 9, IEEE, 2021 |
|
EMNLP, 2nd Workshop on Evaluation and Comparison of NLP Systems (2021)
What is SemEval evaluating? A Systematic Analysis of Evaluation Campaigns in NLP, , , and , in: EMNLP, 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021 |
IEEE International Conference on Robotics and Automation (2021)
Whole Body Model Predictive Control with a Memory of Motion: Experiments on a Torque-Controlled Talos, , , , , , , , , , , and , in: IEEE International Conference on Robotics and Automation, 2021 |
|
Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data (2021)
Zurich Like New: Analyzing Open Urban Multimodal Data, , and , in: Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data, 2021 |
|
ACL (2021)
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks, , , and , in: ACL, 2021 |
|
Proceedings of Interspeech (2020)
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition, , , , , , , , , , and , in: Proceedings of Interspeech, pages 2182-2186, 2020 |
|
International Conference on Robotics and Automation (2020)
A memory of motion for visual predictive control tasks, , and , in: International Conference on Robotics and Automation, 2020 |
|
Companion Publication of the 2020 International Conference on Multimodal Interaction (ICMI '20 Companion) (2020)
A Phonology-based Approach for Isolated Sign Production Assessment in Sign Language, , , and , in: Companion Publication of the 2020 International Conference on Multimodal Interaction (ICMI '20 Companion), 2020 |
|
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (2020)
Active Improvement of Control Policies with Bayesian Gaussian Mixture Model, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020 |
19th International Conference on Mobile and Ubiquitous Multimedia (2020)
Alone or With Others? Understanding Eating Episodes of College Students with Mobile Sensing, , and , in: 19th International Conference on Mobile and Ubiquitous Multimedia, Essen, Germany, pages 162–166, Association for Computing Machinery, 2020 |
[DOI] [URL] |
Proceedings of the International Conference on Language Resources and Evaluation LREC 2020 (2020)
An HMM Approach with Inherent Model Selection for Sign Language and Gesture Recognition, , and , in: Proceedings of the International Conference on Language Resources and Evaluation LREC 2020, 2020 |
|
Proceedings of the 5th IBPSA-England Conference on Building Simulation and Optimization (Virtual) (2020)
An Integrated and strategic evaluation of automatic blind controls to achieve energy and occupant's comfort objectives, and , in: Proceedings of the 5th IBPSA-England Conference on Building Simulation and Optimization (Virtual), Loughborough, UK, 2020 |
[URL] |
Proceedings of 8th OpenSky Symposium 2020 (2020)
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, , , , , , , , , , , , , , , and , in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020 |
[DOI] [URL] |
Interspeech (2020)
Automatic Discrimination of Apraxia of Speech and Dysarthria using a Minimalistic Set of Handcrafted Features, , , and , in: Interspeech, 2020 |
|
Proc. Interspeech 2020 (2020)
Automatic Speech Recognition Benchmark for Air-Traffic Communications, , , , and , in: Proc. Interspeech 2020, pages 2297-2301, 2020 |
[DOI] |
Proceedings of the 17th International Conference on Natural Language Processing (2020)
BertAA: BERT fine-tuning for Authorship Attribution, , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
IEEE International Conference on Image Processing (2020)
CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR, and , in: IEEE International Conference on Image Processing, 2020 |
|
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2020)
Detection of S1 and S2 locations in phonocardiogram signals using zero frequency filter, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |
|
Proceedings of the 17th International Conference on Natural Language Processing (2020)
Detection of Similar Languages and Dialects Using Deep Supervised Autoencoders, , , , and , in: Proceedings of the 17th International Conference on Natural Language Processing, 2020 |
|
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2020)
Dysarthric Speech Recognition with Lattice-Free MMI, and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109-6113, 2020 |
[DOI] [URL] |
ACL (2020)
End-to-End Bias Mitigation by Modelling Biases in Corpora, , and , in: ACL, 2020 |
|
Machine Learning for Engineering Modeling, Simulation, and Design Workshop at Neural Information Processing Systems 2020 (2020)
Exact Preimages of Neural Network Aircraft Collision Avoidance Systems, and , in: Machine Learning for Engineering Modeling, Simulation, and Design Workshop at Neural Information Processing Systems 2020, 2020 |
|
Proceedings of the International Conference on Neural Information Processing Systems (2020)
Fast Transformers with Clustered Attention, , and , in: Proceedings of the International Conference on Neural Information Processing Systems, 2020 |
International Joint Conference on Biometrics (2020)
Generating Master Faces for Use in Performing Wolf Attacks on Face Recognition Systems, , , and , in: International Joint Conference on Biometrics, 2020 |
|
Proc. Conference on Robot Learning (CoRL) (2020)
Generative adversarial training of product of policies for robust and adaptive movement primitives, , and , in: Proc. Conference on Robot Learning (CoRL), 2020 |
|
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (2020)
Graph-to-Graph Transformer for Transition-based Dependency Parsing, and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2020 |
[URL] |
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings (2020)
Graph-to-Graph Transformer for Transition-based Dependency Parsing, and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, ACL, Online, pages 3278–3289, Association for Computational Linguistics, 2020 |
[URL] |
Proceedings of the GermEval 2020 Shared Task on the Classification and Regression of Cognitive and Motivational style from Text (2020)
Idiap & UAM participation at GermEval 2020: Classification and Regression of Cognitive and Motivational Style from Text, , , , and , in: Proceedings of the GermEval 2020 Shared Task on the Classification and Regression of Cognitive and Motivational style from Text, 2020 |
[URL] |
Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020) co-located with 36th Conference of the Spanish Society for Natural Language Processing (SEPLN 2020) (2020)
Idiap and UAM Participation at MEX-A3T Evaluation Campaign, , , , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020) co-located with 36th Conference of the Spanish Society for Natural Language Processing (SEPLN 2020), pages 6, CEUR Workshop Proceedings, 2020 |
[URL] |
Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS) (2020)
Idiap Submission to Swiss-German Language Detection Shared Task, , , , and , in: Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), CEUR Workshop Proceedings, 2020 |
[URL] |
International Joint Conference on Biometrics (IJCB 2020) (2020)
Iris Liveness Detection Competition (LivDet-Iris) – The 2020 Edition, , , , , , , , , , , , , and , in: International Joint Conference on Biometrics (IJCB 2020), 2020 |
[URL] |
International Conference on Robotics and Automation (2020)
Learning How to Walk: Warm-starting Optimal Control Solver with Memory of Motion, , , , and , in: International Conference on Robotics and Automation, 2020 |
|
Proc. ACM/IEEE Intl Conf. on Human-Robot Interaction (HRI) (2020)
Learning, Generating and Adapting Wave Gestures for Expressive Human-Robot Interaction, , and , in: Proc. ACM/IEEE Intl Conf. on Human-Robot Interaction (HRI), pages 386-388, 2020 |
[DOI] [URL] |
Symposium on Eye Tracking Research and Applications (2020)
ManiGaze: a Dataset for Evaluating Remote Gaze Estimator in Object Manipulation Situations, , and , in: Symposium on Eye Tracking Research and Applications, Stuttgart, Germany, pages 5, ACM, 2020 |
[DOI] |
Proceedings of the 7th Workshop on Asian Translation (2020)
ODIANLP's Participation in WAT2020, , , , , , , , and , in: Proceedings of the 7th Workshop on Asian Translation, ACL Anthology, 2020 |
|
Proceedings of the 37th International Conference on Machine Learning (2020)
Optimizer Benchmarking Needs to Account for Hyperparameter Tuning, , , , and , in: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, 2020 |
[URL] |
Proceedings of the 7th Workshop on Asian Translation (2020)
Overview of the 7th Workshop on Asian Translation, , , , , , , , , , , and , in: Proceedings of the 7th Workshop on Asian Translation, Association for Computational Linguistics, 2020 |
[URL] |
Proceedings of the Third Workshop on Computational Models of Reference, Anaphora and Coreference (2020)
Partially-supervised Mention Detection, and , in: Proceedings of the Third Workshop on Computational Models of Reference, Anaphora and Coreference, 2020 |
|
IEEE International Conference on Robotics and Automation (2020)
Plucking Motions for Tea Harvesting Robots Using Probabilistic Movement Primitives, , , and , in: IEEE International Conference on Robotics and Automation, 2020 |
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (2020)
Plug and Play Autoencoders for Conditional Text Generation, , , , and , in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Online, 2020 |
|
19th International Conference on Mobile and Ubiquitous Multimedia (2020)
Protecting Mobile Food Diaries from Getting too Personal, , and , in: 19th International Conference on Mobile and Ubiquitous Multimedia, Essen, Germany, pages 212–222, Association for Computing Machinery, 2020 |
[DOI] [URL] |
IEEE International Conference on Acoustics, Speech, and Signal Processing (2020)
pyannote.audio: neural building blocks for speaker diarization, , , , , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020 |
[URL] |
Asian Conference on Computer Vision (2020)
Real-Time Segmentation Networks should be Latency Aware, and , in: Asian Conference on Computer Vision, 2020 |
|
2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2020)
Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks, , , , , and , in: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, pages 7499-7503, 2020 |
[DOI] |
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2020)
Synthetic Speech References for Automatic Pathological Speech Intelligibility Assessment, , and , in: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2020 |
|
Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication (2020)
The MuMMER data set for Robot Perception in multi-party HRI Scenarios, , , and , in: Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, 2020 |
|
Proceedings of the International Conference on Computational Creativity (2020)
The societal and ethical relevance of computational Creativity, , and , in: Proceedings of the International Conference on Computational Creativity, 2020 |
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020)
The Unstoppable Rise of Computational Linguistics in Deep Learning, , in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, pages 6294-6306, Association for Computational Linguistics, 2020 |
[DOI] [URL] |
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2020)
Towards Multilingual Sign Language Recognition, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |
|
Proceedings of International Conference on Machine Learning (2020)
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention, , , and , in: Proceedings of International Conference on Machine Learning, 2020 |
Proc. ACM Int. Conf. on Multimodal Interaction (ICMI) (2020)
Understanding Applicants' Reactions to Asynchronous Video Interviews through Self-Reports and Nonverbal Cues, , , , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction (ICMI), Utrecht, 2020 |
|
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020)
Unsupervised Representation Learning for Gaze Estimation, and , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020 |
|
International Conference on Robotics and Automation (2020)
Variational Inference with Mixture Model Approximation for Applications in Robotics, , and , in: International Conference on Robotics and Automation, 2020 |
|
2019 ACM Symposium on Eye Tracking Research and Applications (2019)
A Deep Learning Approach for Robust Head Pose Independent Eye Movements Recognition from Videos, , and , in: 2019 ACM Symposium on Eye Tracking Research and Applications, pages 5, ACM, 2019 |
[DOI] |
A Learning-Based Framework for Quantized Compressed Sensing (2019)
A Learning-Based Framework for Quantized Compressed Sensing, , and , in: A Learning-Based Framework for Quantized Compressed Sensing, 2019 |
|
Journal of Physics: Conference Series (2019)
A morphological based PV generation and energy consumption predictive model for Singapore neighbourhood, and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
A smart luminaire in an office environment: impact on light distribution, user interactions and comfort, , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019) (2019)
Abstract Text Summarization: A Low Resource Challenge, and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), Hong Kong, China, pages 5, Association for Computational Linguistics (ACL), 2019 |
|
Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems (2019)
Adaptation of Assistant Based Speech Recognition to New Domains and Its Acceptance by Air Traffic Controllers, , , , , , , , , and , in: Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems, San Diego, California, USA, pages 820-826, 2019 |
[DOI] |
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2019)
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019 |
[DOI] |
Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER) (2019)
Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task, , and , in: Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), Hong Kong, pages 27-33, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (2019)
AM-FM Decomposition of Speech Signal: Applications for Speech Privacy and Diagnosis, , , , and , in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019 |
[URL] |
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2019)
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, , , , and , in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pages 7040-7044, IEEE, 2019 |
[DOI] [URL] |
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2019)
Automatic Diagnosis of Alzheimer's Disease Using Neural Network Language Models, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Proc. ACM Int. Conf. on Interactive Experiences for Television and Online Video (TVX) (2019)
BookTubing Across Regions: Examining Differences based on Nonverbal and Verbal Cues, , and , in: Proc. ACM Int. Conf. on Interactive Experiences for Television and Online Video (TVX), Salford, England, 2019 |
[DOI] |
Proceedings of 4th Building Simulation Applications Conference - BSA 2019 (2019)
Building energy models with Morphological urban-scale parameters: a case study in Turin, , , , , and , in: Proceedings of 4th Building Simulation Applications Conference - BSA 2019, 2019 |
[URL] |
International Conference on Learning Representations (2019)
CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, , and , in: International Conference on Learning Representations, New Orleans, Louisiana, USA, 2019 |
[URL] |
Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (2019)
CityLearn v1.0: An OpenAI Gym Environment for Demand Response with Deep Reinforcement Learning, , , and , in: Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, New York, USA, pages 356-357, ACM, 2019 |
[DOI] |
Journal of Physics: Conference Series (2019)
CO2 experimental measurements towards the development of a predictive framework using user actions in smart buildings, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Proceedings of APSIPA ASC 2019 (2019)
Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, , , , , and , in: Proceedings of APSIPA ASC 2019, 2019 |
Proceedings of 7th IAPR/IEEE International Workshop on Biometrics and Forensics (2019)
Custom Silicone Face Masks - Vulnerability of Commercial Face Recognition Systems & Presentation Attack Detection, , , , , , and , in: Proceedings of 7th IAPR/IEEE International Workshop on Biometrics and Forensics, 2019 |
|
Journal of Physics: Conference Series (2019)
Daylight regulated by automated external Venetian blinds based on HDR sky luminance mapping in winter, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
International Conference on Biometrics (2019)
Deep Pixel-wise Binary Supervision for Face Presentation Attack Detection, and , in: International Conference on Biometrics, 2019 |
|
Proceedings of the 36th International Conference on Machine Learning (ICML) (2019)
Deep Residual Output Layers for Neural Language Generation, and , in: Proceedings of the 36th International Conference on Machine Learning (ICML), 2019 |
|
UbiComp/ISWC '19 Adjunct: Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers (2019)
Discovering Eating Routines in Context with a Smartphone App, , , and , in: UbiComp/ISWC '19 Adjunct: Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers, London, pages 422-429, 2019 |
[DOI] |
International Conference on Biometrics 2019, IEEE (2019)
Domain Adaptation in Multi-Channel Autoencoder based Features for Robust Face Anti-Spoofing, , and , in: International Conference on Biometrics 2019, IEEE, 2019 |
|
Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (2019)
Empirical Evaluation and Combination of Punctuation Prediction Models Applied to Broadcast News, and , in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019 |
|
International Conference on Speech and Language Processing, Interspeech (2019)
End-to-End Accented Speech Recognition, , and , in: International Conference on Speech and Language Processing, Interspeech, ISCA, Graz, Austria, pages 2140-2144, 2019 |
[DOI] |
Advances in Neural Information Processing Systems (2019)
Full-Gradient Representation for Neural Network Visualization, and , in: Advances in Neural Information Processing Systems, 2019 |
[URL] |
Proceedings of the SPIE Conference Optics and Photonics, Wavelets and Sparsity XVIII (2019)
Generalized temporal sampling with active illumination in optical microscopy, and , in: Proceedings of the SPIE Conference Optics and Photonics, Wavelets and Sparsity XVIII, San Diego, California, United States, SPIE, 2019 |
|
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2019)
HMM-based Approaches to Model Multichannel Information in Sign Language inspired from Articulatory Features-based Speech Processing, , , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Proceedings of the 4th edition of the Swiss Text Analytics Conference (2019)
Idiap Abstract Text Summarization System for German Text Summarization Task, and , in: Proceedings of the 4th edition of the Swiss Text Analytics Conference, 2019 |
[URL] |
Proceedings of the 6th Workshop on Asian Translation (2019)
Idiap NMT System for WAT 2019 Multimodal Translation Task, and , in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 175–180, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
Proceedings of the 32nd International Florida Artificial Intelligence Research Society Conference (2019)
Implicit discourse relation classification with syntax-aware contextualized word representations, , , and , in: Proceedings of the 32nd International Florida Artificial Intelligence Research Society Conference, 2019 |
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2019)
Improving Children Speech Recognition through Feature Learning from Raw Speech Signal, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Proc. IEEE Intl Conf. on Robotics and Automation (2019)
Improving dual-arm assembly by master-slave compliance, , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation, pages 8676-8682, 2019 |
|
Proceedings of ICASSP 2019 (2019)
Incremental Transfer Learning in Two-Pass Information Bottleneck Based Speaker Diarization System for Meetings, , , and , in: Proceedings of ICASSP 2019, pages 6291-6295, 2019 |
IEEE International Conference on Acoustics, Speech and Signal Processing (2019)
Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, pages 795--799, 2019 |
Proceedings of the IEEE International Conference on Computer Vision (2019)
Learning an event sequence embedding for event-based deep stereo, , , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2019 |
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2019)
Learning voice source related information for depression detection, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
Journal of Physics: Conference Series (2019)
Multi-agent reinforcement learning for adaptive demand response in smart cities, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019) (2019)
Multi-Spectral Widefield Microscopy of the Beating Heart through Post-Acquisition Synchronization and Unmixing, , , and , in: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), Venice, Italy, pages 1382-1385, 2019 |
[DOI] |
IEEE Automatic Speech Recognition and Understanding Workshop (2019)
Multilingual Bottleneck Features for Query by Example Spoken Term Detection, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2019 |
|
Proceedings of the 6th Workshop on Asian Translation (2019)
Overview of the 6th Workshop on Asian Translation, , in: Proceedings of the 6th Workshop on Asian Translation, Hong Kong, China, pages 1–35, Association for Computational Linguistics, 2019 |
[DOI] [URL] |
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2019)
Pathological Speech Intelligibility Assessment Based on the Short-Time Objective Intelligibility Measure, , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, UK, pages 6405-6409, 2019 |
|
Proceedings of International Conference on Machine Learning (2019)
Processing Megapixel Images with Deep Attention-Sampling Models, and , in: Proceedings of International Conference on Machine Learning, 2019 |
[URL] |
Proceedings of the international conference on Neural Information Processing Systems (2019)
Reducing Noise in GAN Training with Variance Reduced Extragradient, , , and , in: Proceedings of the international conference on Neural Information Processing Systems, 2019 |
IEEE International Conference on Intelligent Robots and Systems (2019)
Reinforcement learning of trajectory distributions: Applications in assisted teleoperation and motion planning, , , , , and , in: IEEE International Conference on Intelligent Robots and Systems, 2019 |
Journal of Physics: Conference Series (2019)
Retrofitting, district heating and energy storage: neighborhood energy planning, , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Proc. 2019 Conférence sur l'Apprentissage automatique (2019)
SATokE: How can Syntax-Aware Contextualized Word Representations Benefit Implicit Discourse Relation Classification?, , , and , in: Proc. 2019 Conférence sur l'Apprentissage automatique, 2019 |
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2019)
Segment-level training of ANNs based on acoustic confidence measures for hybrid HMM/ANN Speech Recognition, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
|
WNGT EMNLP (2019)
Selecting, Planning, and Rewriting: A Modular Approach for Data-to-Document Generation and Translation, , and , in: WNGT EMNLP, 2019 |
|
Proc. Interspeech 2019 (2019)
Self-attention for Speech Emotion Recognition, , and , in: Proc. Interspeech 2019, 2019 |
[DOI] |
Proceedings of Interspeech (2019)
Spectral Subspace Analysis for Automatic Assessment of Pathological Speech Intelligibility, , and , in: Proceedings of Interspeech, Graz, Austria, pages 3038-3042, 2019 |
|
Proceedings of TSD (2019)
Spoken language identification using language bottleneck features, , , , , and , in: Proceedings of TSD, 2019 |
|
IEEE International Conference on Acoustics, Speech and Signal Processing (2019)
Super-gaussianity of Speech Spectral Coefficients as a Potential Biomarker for Dysarthric Speech Detection, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2019 |
International Conference on Machine Learning (2019)
Tampered Speaker Inconsistency Detection with Phonetically Aware Audio-visual Features, , , , , , , and , in: International Conference on Machine Learning, 2019 |
|
Proceedings of Building Simulation 2019: 16th Conference of IBPSA (2019)
The Simulation of Mean Radiant Temperature in Outdoor Conditions: A Review of Architectural Tools Calculation Assumptions, , , and , in: Proceedings of Building Simulation 2019: 16th Conference of IBPSA, 2019 |
IEEE/RSJ International Conference on Intelligent Robots and Systems (2019)
Uncertainty-aware imitation learning using kernelized movement primitives, , , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019 |
|
Proceedings of Interspeech (2019)
Understanding and Visualizing Raw Waveform-based CNNs, , , and , in: Proceedings of Interspeech, 2019 |
|
Journal of Physics: Conference Series (2019)
Understanding the performance gap: a machine learning approach on residential buildings in Turin, Italy, , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2019 |
[DOI] |
Proceedings of Interspeech (2019)
Using Speech Production Knowledge for Raw Waveform Modelling based Styrian Dialect Identification, and , in: Proceedings of Interspeech, 2019 |
|
IAPR International Conference on Biometrics (2019)
Vulnerability assessment and detection of Deepfake videos, and , in: IAPR International Conference on Biometrics, 2019 |
|
International Conference on Biometrics for Borders (2019)
Vulnerability of Face Recognition to Deep Morphing, and , in: International Conference on Biometrics for Borders, 2019 |
|
Proc. 2019 Conference on Empirical Methods in Natural Language Processing (2019)
Weakly-Supervised Concept-based Adversarial Learning for Cross-lingual Word Embeddings, , and , in: Proc. 2019 Conference on Empirical Methods in Natural Language Processing, 2019 |
29th British Machine Vision Conference (2018)
A Differential Approach for Gaze Estimation with Calibration, , , and , in: 29th British Machine Vision Conference, 2018 |
|
Proc. Interspeech 2018 (2018)
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: Proc. Interspeech 2018, pages 3147-3151, 2018 |
[DOI] |
MLSLP-18 Proceedings (2018)
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: MLSLP-18 Proceedings, Hyderabad, 2018 |
[URL] |
Proceedings of Interspeech 2018 (2018)
Analysis of Language Dependent Front-End for Speaker Recognition, , and , in: Proceedings of Interspeech 2018, Hyderabad, India, pages 1101-1105, 2018 |
[DOI] |
Proceedings of the Third Conference on Machine Translation (WMT) (2018)
Beyond Weight Tying: Learning Joint Input-Output Embeddings for Neural Machine Translation, , and , in: Proceedings of the Third Conference on Machine Translation (WMT), 2018 |
|
Proc. of the IEEE-RAS Intl Conf. on Humanoid Robots (Humanoids) (2018)
Bimanual Skill Learning with Pose and Joint Space Constraints, , , and , in: Proc. of the IEEE-RAS Intl Conf. on Humanoid Robots (Humanoids), 2018 |
|
Conference: SESAR Innovation Days 2018 (2018)
Building Blocks of Assistant Based Speech Recognition for Air Traffic Management Applications, , , , , , , and , in: Conference: SESAR Innovation Days 2018, European Union, Eurocontrol, Salzburg, Austria, SESARJU, 2018 |
[URL] |
Proceedings of Interspeech (2018)
CNN based Query by Example Spoken Term Detection, , and , in: Proceedings of Interspeech, 2018 |
|
Proc. International Conference on Acoustics, Speech, and Signal Processing (2018)
Complexity reduction of eigenvalue decomposition-based diffuse power spectral density estimators using the power method, , and , in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 451-455, 2018 |
|
2018 IEEE International Conference on Robotics and Automation (ICRA) (2018)
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia, pages 74-79, 2018 |
[DOI] |
Proceedings of Interspeech 2018 (2018)
Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech, , , , , and , in: Proceedings of Interspeech 2018, Hyderabad, India, pages 292-296, 2018 |
[DOI] |
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) (2018)
Document-Level Neural Machine Translation with Hierarchical Attention Networks, , , and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018 |
|
Technology Enhanced Assessment Conference (2018)
Enhancing Trust in eAssessment - the TeSLA System Solution, , , , and , in: Technology Enhanced Assessment Conference, 2018 |
|
Proc. EuroNoise 2018 (2018)
Experimental evaluation of speech enhancement methods in remote microphone systems for hearing aids, , , , and , in: Proc. EuroNoise 2018, Crete, Greece, pages 351-358, 2018 |
|
Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia (2018)
Facing Employers and Customers: What Do Gaze and Expressions Tell About Soft Skills?, , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 121-126, Association for Computing Machinery, 2018 |
[DOI] |
13th Intl Workshop on the Algorithmic Foundations of Robotics (WAFR) (2018)
Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models, , , , , , , and , in: 13th Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018 |
|
Proceedings of the International Conference on Machine Learning (2018)
Geodesic Convolutional Shape Optimization, , , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Proceedings of Interspeech 2018 (2018)
Implementing Fusion Techniques for the Classification of Paralinguistic Information, , , and , in: Proceedings of Interspeech 2018, pages 526-530, 2018 |
|
European Conference on Computer Vision - Workshops (2018)
Investigating Depth Domain Adaptation for Efficient Human Pose Estimation, , , and , in: European Conference on Computer Vision - Workshops, 2018 |
|
Proc. ITG conference on Speech Communication (2018)
Iterative alternating least-squares approach to jointly estimate the RETFs and the diffuse PSD, , and , in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018 |
|
Proceedings of Interspeech 2018 (2018)
Iterative Learning of Speech Recognition Models for Air Traffic Control, , , , , , and , in: Proceedings of Interspeech 2018, ISCA, Hyderabad, India, pages 3519-3523, 2018 |
[DOI] |
Proc. International Conference on Acoustics, Speech, and Signal Processing (2018)
Joint late reverberation and noise power spectral density estimation in a spatially homogeneous noise field, and , in: Proc. International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, pages 441-445, 2018 |
|
Proceedings of the International Conference on Machine Learning (2018)
Knowledge Transfer with Jacobian Matching, and , in: Proceedings of the International Conference on Machine Learning, 2018 |
[URL] |
Kronecker Recurrent Units, , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2018)
Leveraging Convolutional Pose Machines for Fast and Accurate Head Pose Estimation, , and , in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, pages 1089-1094, IEEE, 2018 |
|
The Speaker and Language Recognition Workshop (Odyssey) (2018)
Low-latency speaker spotting with online diarization and detection, , , , , , , , and , in: The Speaker and Language Recognition Workshop (Odyssey), 2018 |
|
Proc. Interspeech (2018)
Multilingual bottleneck features for subword modeling in zero-resource languages, and , in: Proc. Interspeech, pages 2668-2672, 2018 |
[DOI] |
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (2018)
Nasal Speech Sounds Detection Using Connectionist Temporal Classification, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2018 |
|
Proceedings of International Conference on Machine Learning (2018)
Not All Samples Are Created Equal: Deep Learning with Importance Sampling, and , in: Proceedings of International Conference on Machine Learning, 2018 |
|
Proceedings of Interspeech (2018)
On Learning to Identify Genders from Raw Speech Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, India, pages 287-291, 2018 |
[DOI] |
On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, India, pages 1116-1120, 2018 |
|
International Conference on Identity, Security and Behavior Analysis (2018)
On the Use of Convolutional Neural Networks for Speech Presentation Attack Detection, , , , and , in: International Conference on Identity, Security and Behavior Analysis, 2018 |
|
Proceedings of Interspeech (2018)
Phonological Posterior Hashing for Query by Example Spoken Term Detection, , and , in: Proceedings of Interspeech, 2018 |
|
Proceedings of the international conference on Neural Information Processing Systems (2018)
Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching, , and , in: Proceedings of the international conference on Neural Information Processing Systems, 2018 |
Proc. of the IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS) (2018)
Probabilistic Learning of Torque Controllers from Kinematic and Force Constraints, , , , and , in: Proc. of the IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 6552-6559, 2018 |
|
EUSIPCO (2018)
Real-Time DCT Learning-based Reconstruction of Neural Signals, , and , in: EUSIPCO, 2018 |
|
Proceedings of Interspeech (2018)
Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization, and , in: Proceedings of Interspeech, Hyderabad, India, pages 2257-2261, 2018 |
[DOI] |
International Conference on Intelligent Robots (2018)
SCALAR - Simultaneous Calibration of 2D Laser And Robot's Kinematic Parameters Using Three Planar Constraints, , and , in: International Conference on Intelligent Robots, 2018 |
|
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) (2018)
Self-Attentive Residual Decoder for Neural Machine Translation, , , and , in: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018 |
|
2018 25th IEEE International Conference on Image Processing (ICIP) (2018)
Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation by Use of Convolutional Neural Networks, and , in: 2018 25th IEEE International Conference on Image Processing (ICIP), pages 3818-3822, IEEE, 2018 |
[DOI] |
37th AIAA/IEEE Digital Avionics Systems Conference (2018)
Semi-supervised Adaptation of Assistant Based Speech Recognition Models for different Approach Areas, , , , , , , , , , and , in: 37th AIAA/IEEE Digital Avionics Systems Conference, AIAA/IEEE, London, 2018 |
[URL] |
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2018)
SGAN: An Alternative Training of Generative Adversarial Networks, and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 9407-9415, IEEE, 2018 |
[DOI] |
Big Data and Artificial Intelligence for Military Decision Making (2018)
SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, , , , , , , , , and , in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018 |
[DOI] [URL] |
Proc. Annual Conference of the International Speech Communication Association (2018)
Single-channel late reverberation power spectral density estimation using denoising autoencoders, and , in: Proc. Annual Conference of the International Speech Communication Association, Hyderabad, India, 2018 |
|
Language Resources and Evaluation Conference (2018)
SMILE Swiss German Sign Language Dataset, , , , , , , , , , , and , in: Language Resources and Evaluation Conference, 2018 |
European Signal Processing Conference (2018)
Speaker Inconsistency Detection in Tampered Video, and , in: European Signal Processing Conference, 2018 |
|
Proceedings of BTAS2018 (2018)
Spoofing Deep Face Recognition With Custom Silicone Masks, , and , in: Proceedings of BTAS2018, 2018 |
|
Proc. ITG conference on Speech Communication (2018)
Statistical modeling of speech spectral coefficients in patients with Parkinson's disease, and , in: Proc. ITG conference on Speech Communication, Oldenburg, Germany, 2018 |
|
International Conference on Machine Learning (ICML) workshop on Theoretical Foundations and Applications of Deep Generative Models (2018)
Stochastic Variance Reduced Gradient Optimization of Generative Adversarial Networks, , , and , in: International Conference on Machine Learning (ICML) workshop on Theoretical Foundations and Applications of Deep Generative Models, 2018 |
IEEE International Conference on Acoustics, Speech and Signal Processing (2018)
Towards directly modeling raw speech signal for speaker verification using CNNs, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, Canada, pages 4884-4888, 2018 |
|
2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis (ISBA) (2018)
Towards Quantifying the Entropy of Fingervein Patterns across Different Feature Extractors, and , in: 2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis (ISBA), 2018 |
|
IEEE International Conference on Advanced Video and Signal-based Surveillance Workshop (2018)
UNICITY: A depth maps database for people detection in security airlocks, , , , , , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance Workshop, 2018 |
|
Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia (2018)
Vlogging Over Time: Longitudinal Impressions and Behavior in YouTube, , , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 37-46, 2018 |
[DOI] |
IEEE International Conference on Advanced Video and Signal-based Surveillance (2018)
WatchNet: Efficient and Depth-based Network for People Detection in Video Surveillance Systems, , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance, Auckland, New Zealand, pages 109-114, 2018 |
|
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2018)
WILDTRACK: A Multi-camera HD Dataset for Dense Unscripted Pedestrian Detection, , , , , , , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pages 5030-5039, 2018 |
[DOI] |
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (2018)
Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes, , and , in: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2018 |
|
ACM International Conference on Multimodal Interaction (2017)
A Domain Adaptation Approach to Improve Speaker Turn Embedding Using Face Representation, and , in: ACM International Conference on Multimodal Interaction, Glasgow, Scotland, ACM, 2017 |
|
IEEE/RSJ International Conference on Intelligent Robots and Systems (2017)
A Generative Model for Intention Recognition and Manipulation Assistance in Teleoperation, and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017 |
|
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (2017)
A Sub-Quadratic Exact Medoid Algorithm, and , in: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017 |
IEEE Winter Conference on Applications of Computer Vision (WACV) (2017)
Active Online Anomaly Detection using Dirichlet Process Mixture Model and Gaussian Process Classification, , , , and , in: IEEE Winter Conference on Applications of Computer Vision (WACV), Washington, 2017 |
|
Proc. of Interspeech (2017)
An Investigation of Deep Neural Networks for Multilingual Speech Recognition Training and Adaptation, , and , in: Proc. of Interspeech, 2017 |
|
Bob Speaks Kaldi, , , , and , in: Proc. of Interspeech, 2017 |
|
Proceedings of the MICCAI Workshop on Ophthalmic Medical Image Analysis (2017)
Boosted Exudate Segmentation in Retinal Images using Residual Nets, , , and , in: Proceedings of the MICCAI Workshop on Ophthalmic Medical Image Analysis, 2017 |
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL) (2017)
Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, , and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
|
IEEE/IAPR International Joint Conference on Biometrics (2017)
Cross-Eyed 2017: Cross-Spectral Iris/Periocular Recognition Competition., , , , , , , , , , , , , and , in: IEEE/IAPR International Joint Conference on Biometrics, Denver, Colorado, USA, IEEE, 2017 |
Proceedings of the IEEE International Conference on Machine Learning and Applications (2017)
Deep Multi-Camera People Detection, and , in: Proceedings of the IEEE International Conference on Machine Learning and Applications, 2017 |
Proceedings of the IEEE International Conference on Computer Vision (2017)
Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection, , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Intl Workshop on movement and computing (MOCO) (2017)
Dynamic Graffiti Stylisation with Stochastic Optimal Control, , and , in: Intl Workshop on movement and computing (MOCO), London, UK, pages 1-8, ACM, 2017 |
[DOI] [URL] |
Proceedings of MMHealth (2017)
Elderly People Living Alone: Detecting Home Visits with Ambient and Wearable Sensing, , , and , in: Proceedings of MMHealth, 2017 |
|
International Joint Conference on Biometrics (2017)
End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, , and , in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017 |
|
Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia (2017)
Examining Linguistic Content and Skill Impression Structure for Job Interview Analytics in Hospitality, and , in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017 |
|
Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (2017)
Exploiting Sequence Information for Text-Dependent Speaker Verification, , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, pages 5370-5374, 2017 |
|
Proceedings of the thematic conference on computational vision and medical image processing (2017)
Exploratory Study on Direct Prediction of Diabetes using Deep Residual Networks, , , and , in: Proceedings of the thematic conference on computational vision and medical image processing, 2017 |
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (2017)
Gaussian Mixture Regression on Symmetric Positive Definite Matrices Manifolds: Application to Wrist Motion Estimation with sEMG, and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 59-64, 2017 |
[URL] |
Proc. 43rd Conf. on Graphics Interface (2017)
Generating Calligraphic Trajectories with Model Predictive Control, , and , in: Proc. 43rd Conf. on Graphics Interface, Edmonton, AB, Canada, pages 132-139, 2017 |
[DOI] |
Proceedings of 19th ACM International Conference on Multimodal Interaction (2017)
How May I Help You? Behavior and Impressions in Hospitality Service Encounters, , and , in: Proceedings of 19th ACM International Conference on Multimodal Interaction, 2017 |
|
Proceedings of Interspeech 2017 (2017)
Implementing gender-dependent vowel-level analysis for boosting speech-based depression recognition, , , and , in: Proceedings of Interspeech 2017, 2017 |
|
Proc. of the Myoelectric Control Symposium (2017)
Improving hand and wrist activity detection using tactile sensors and tensor regression methods on Riemannian manifolds, , and , in: Proc. of the Myoelectric Control Symposium, 2017 |
[URL] |
ICCV Workshop on Computer Vision for Audio-Visual Media (2017)
Improving speaker turn embedding by crossmodal transfer learning from face embedding, and , in: ICCV Workshop on Computer Vision for Audio-Visual Media, 2017 |
|
International Conference on Multimedia Retrieval (2017)
Insiders and Outsiders: Comparing Urban Impressions between Population Groups, , and , in: International Conference on Multimedia Retrieval, ACM, 2017 |
[DOI] |
Proceedings of the international conference on Neural Information Processing Systems (2017)
K-Medoids For K-Means Seeding, and , in: Proceedings of the international conference on Neural Information Processing Systems, 2017 |
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (2017)
Learning Manipulability Ellipsoids for Task Compatibility in Robot Manipulation, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3183-3189, 2017 |
[URL] |
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS) (2017)
Learning Task-Space Synergies using Riemannian Geometry, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Vancouver, Canada, pages 73-78, IEEE, 2017 |
[URL] |
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL) (2017)
Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
|
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2017)
Multi-Modal Mean-Fields via Cardinality-Based Clamping, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017 |
Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017) (2017)
Multi-view Representation Learning Via GCCA for Multimodal Analysis of Parkinson's Disease, , , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP) (2017)
Multilingual Hierarchical Attention Networks for Document Classification, and , in: Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP), pages 1015-1025, 2017 |
|
Proceedings of the IEEE International Conference on Computer Vision (2017)
Non-Markovian Globally Consistent Multi-Object Tracking, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Wavelets and Sparsity XVII (2017)
Non-parametric warping via local scale estimation for non-stationary Gaussian process modelling, , , and , in: Wavelets and Sparsity XVII, pages 1039421, International Society for Optics and Photonics, 2017 |
[DOI] [URL] |
Proceedings of the 25th ACM International Conference on Multimedia (2017)
On Job Training: Automated Interpersonal Behavior Assessment & Real-Time Feedback, , in: Proceedings of the 25th ACM International Conference on Multimedia, 2017 |
|
16th International Conference of the Biometrics Special Interest Group (2017)
On the Generalization of Fused Systems in Voice Presentation Attack Detection, , , , and , in: 16th International Conference of the Biometrics Special Interest Group, 2017 |
|
Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017) (2017)
On the Impact of Non-modal Phonation On Phonological Features, , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
23ème Congrès Français de Mécanique, 28 août - 1er septembre 2017, Lille, France (FR), AFM (2017)
Planification adaptative d'expériences numériques par paquets en contexte non stationnaire pour une étude de fissuration mécanique, , , , and , in: 23ème Congrès Français de Mécanique, 28 août - 1er septembre 2017, Lille, France (FR), AFM, 2017 |
[URL] |
Proceedings of Second Conference on Machine Translation (WMT17) (2017)
Sense-Aware Statistical Machine Translation using Adaptive Context-Dependent Clustering, , and , in: Proceedings of Second Conference on Machine Translation (WMT17), 2017 |
|
15th International Workshop on Content-Based Multimedia Indexing (2017)
Shape Representations for Maya Codical Glyphs: Knowledge-driven or Deep?, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2017)
Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition, , , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017 |
Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS) (2017)
Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment, , , and , in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017 |
|
ECEM COGAIN Symposium (2017)
Supervised Gaze Bias Correction for Gaze Coding in Interactions, and , in: ECEM COGAIN Symposium, pages 3, 2017 |
|
Proc. IEEE Intl Conf. on Robotics and Automation (ICRA) (2017)
Supervisory teleoperation with online learning and optimal control, and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1534-1540, IEEE, 2017 |
[URL] |
European Intelligence and Security Informatics Conference (EISIC) 2017 (2017)
Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, , , , , , , , , and , in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athens, Greece, pages 32-39, IEEE Computer Society, 2017 |
[DOI] [URL] |
15th International Workshop on Content-Based Multimedia Indexing (2017)
Towards large scale multimedia indexing: A case study on person discovery in broadcast news, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
Proc. IEEE Intl Conf. on Robotics and Automation (ICRA) (2017)
Trajectory and Foothold Optimization using Low-Dimensional Models for Rough Terrain Locomotion, , , , , , , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Singapore, pages 1096-1103, IEEE, 2017 |
[URL] |
Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017) (2017)
Using Coreference Links to Improve Spanish-to-English Machine Translation, and , in: Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), Valencia, Spain, pages 30-40, Association for Computational Linguistics (ACL), 2017 |
|
Proceedings of the Third Workshop on Discourse in Machine Translation (DiscoMT) (2017)
Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), and , in: Proceedings of the Third Workshop on Discourse in Machine Translation (DiscoMT), Copenhagen, Denmark, Association for Computational Linguistics (ACL), 2017 |
|
Proceedings of the 25th ACM International Conference on Multimedia, ACM, 2017 (2017)
Venues in Social Media: Examining Ambiance Perception Through Scene Semantics, , and , in: Proceedings of the 25th ACM International Conference on Multimedia, ACM, 2017, 2017 |
|
Proceedings of the IEEE International Conference on Computer Vision (2017)
Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction, , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2017 |
Proceedings of the 16th International Conference on Biometrics Special Interest Group (2017)
What you can't see can help you -- extended-range imaging for 3D-mask presentation attack detection, and , in: Proceedings of the 16th International Conference on Biometrics Special Interest Group, Darmstadt (Germany), Gesellschaft fuer Informatik e.V. (GI), 2017 |
|
European Association for Machine Translation (2016)
A Contextual Language Model to Improve Machine Translation of Pronouns by Re-ranking Translation Hypotheses, and , in: European Association for Machine Translation, 2016 |
Proceedings of the British Machine Vision Conference (2016)
A MultiPath Network for Object Detection, , , , , , and , in: Proceedings of the British Machine Vision Conference, BMVA Press, 2016 |
[URL] |
2016 IEEE International Symposium on Biomedical Imaging (2016)
A Point-Spread-Function-Aware Filtered Backprojection Algorithm for Focal-Plane-Scanning Optical Projection Tomography, and , in: 2016 IEEE International Symposium on Biomedical Imaging, 2016 |
Digital Humanities (DH) (2016)
Ancient Maya Writings as High-Dimensional Data: a Visualization Approach, , , and , in: Digital Humanities (DH), Krakow, 2016 |
|
Proceedings of the 7th Int. Workshop on Human Behavior Understanding, ACM Multimedia, 2016 (2016)
Anomaly detection in elderly daily behavior in ambient sensing environments, , , and , in: Proceedings of the 7th Int. Workshop on Human Behavior Understanding, ACM Multimedia, 2016, Amsterdam, Netherlands, 2016 |
|
Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR) (2016)
Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph, , and , in: Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), ACM, New York, NY, ACM Press, 2016 |
Natural Language Processing and Information Systems: 21st International Conference on Applications of Natural Language to Information Systems, NLDB 2016 (2016)
Comparing Two Strategies for Query Expansion in a News Monitoring System, and , in: Natural Language Processing and Information Systems: 21st International Conference on Applications of Natural Language to Information Systems, NLDB 2016, pages 267-275, Springer-Verlag, 2016 |
[DOI] |
Interspeech (2016)
Cross-database evaluation of audio-based spoofing detection systems, and , in: Interspeech, San Francisco, USA, 2016 |
[URL] |
Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016) (2016)
DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5050-5054, IEEE, 2016 |
|
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (2016)
Deep Neural Networks for Syntactic Parsing of Morphologically Rich Languages, and , in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016 |
|
IFAC Conference on Control Applications in Marine Systems (CAMS) (2016)
Dexterous Undersea Interventions with Far Distance Onshore Supervision: the DexROV Project, , , , , , , , , , , , , , , , , , , , , , and , in: IFAC Conference on Control Applications in Marine Systems (CAMS), Trondheim, Norway, pages 414-419, 2016 |
[DOI] [URL] |
Interspeech (2016)
Efficient Posterior Exemplar Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , in: Interspeech, San Francisco, CA, 2016 |
|
9th ISCA Speech Synthesis Workshop (2016)
Emphasis Recreation for TTS using Intonation Atoms, and , in: 9th ISCA Speech Synthesis Workshop, pages 14-20, 2016 |
[DOI] |
Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016) (2016)
Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5690-5694, IEEE, 2016 |
|
Proceedings of the International Conference on Machine Learning (ICML) (2016)
Fast K-Means with Accurate Bounds, and , in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016 |
Proceedings of WMT 2016 (First Conference on Machine Translation) (2016)
Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction, , , , , , , , and , in: Proceedings of WMT 2016 (First Conference on Machine Translation), Association for Computational Linguistics, Berlin, Germany, pages 525–542, 2016 |
[URL] |
IEEE Computer Society Workshop on Biometrics (2016)
Heterogeneous Face Recognition using Inter-Session Variability Modelling, and , in: IEEE Computer Society Workshop on Biometrics, Las Vegas - USA, IEEE, 2016 |
|
Proceedings of the IEEE International Conference of Robotics and Automation (2016)
Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences, , , , and , in: Proceedings of the IEEE International Conference of Robotics and Automation, 2016 |
Proceedings of Interspeech (2016)
HMM-based Non-native Accent Assessment using Posterior Features, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
|
Proceedings of the EMNLP 2016 Workshop on Natural Language Processing for Social Media (2016)
Human versus Machine Attention in Document Classification: A Dataset with Crowdsourced Annotations, and , in: Proceedings of the EMNLP 2016 Workshop on Natural Language Processing for Social Media, Austin, USA, 2016 |
|
Proceedings of the International Conference on Machine Learning (ICML) (2016)
Importance Sampling Tree for Large-scale Empirical Expectation, , and , in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016 |
Proceedings of the First Conference on Machine Translation (WMT16) (2016)
Improving Pronoun Translation by Modeling Coreference Uncertainty, and , in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, 2016 |
|
Proceedings of Interspeech (2016)
Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery, and , in: Proceedings of Interspeech, 2016 |
|
Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016) (2016)
INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5580-5584, IEEE, 2016 |
|
Proceedings of the 24th ACM International Conference on Multimedia (2016)
InnerView: Learning Place Ambiance from Social Media Images, , and , in: Proceedings of the 24th ACM International Conference on Multimedia, ACM, 2016 |
[DOI] |
Proceedings of the INTERSPEECH (2016)
Inter-task System Fusion for Speaker Recognition, , , , and , in: Proceedings of the INTERSPEECH, 2016 |
|
9th ISCA Speech Synthesis Workshop (2016)
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , in: 9th ISCA Speech Synthesis Workshop, 2016 |
|
IEEE International Conference on Biometrics: Theory, Applications and Systems (2016)
Joint Operation of Voice Biometrics and Presentation Attack Detection, and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016 |
[URL] |
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2016)
Large Scale Hard Sample Mining with Monte Carlo Tree Search, and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016 |
|
Proc. IEEE International Symposium on Safety, Security and Rescue Robotics (2016)
Learning assistive teleoperation behaviors from demonstration, and , in: Proc. IEEE International Symposium on Safety, Security and Rescue Robotics, pages 258-263, 2016 |
|
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (2016)
Learning dynamic graffiti strokes with a compliant robot, , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3981-3986, 2016 |
[URL] |
TextLink: Structuring Discourse in Multilingual Europe (Handbook of the Second Action Conference) (2016)
Manual and automatic labeling of discourse connectives for machine translation (Keynote paper), , in: TextLink: Structuring Discourse in Multilingual Europe (Handbook of the Second Action Conference), Budapest, Hungary, pages 16-20, 2016 |
[URL] |
Proc. of EUSIPCO (2016)
Modeling Unvoiced Sounds In Statistical Parametric Speech Synthesis with a Continuous Vocoder, , , and , in: Proc. of EUSIPCO, Budapest, Hungary, 2016 |
|
Proceedings of the International Conference on Multimedia Retrieval (ICMR) (2016)
Multilingual Visual Sentiment Concept Matching, , , , , , and , in: Proceedings of the International Conference on Multimedia Retrieval (ICMR), 2016 |
|
Proceedings of NIPS (2016)
Nested Mini-Batch K-Means, and , in: Proceedings of NIPS, 2016 |
Proceedings of the ACL 1st Conference on Machine Translation (2016)
Neural Network-based Word Alignment through Score Aggregation, , and , in: Proceedings of the ACL 1st Conference on Machine Translation, 2016 |
|
NIPS workshop on Advances in Approximate Bayesian Inference (2016)
Online Inference in Bayesian Non-Parametric Mixture Models under Small Variance Asymptotics, and , in: NIPS workshop on Advances in Approximate Bayesian Inference, Barcelona, Spain, pages 1-5, 2016 |
[URL] |
Proc. IEEE Intl Conf. on Systems, Man, and Cybernetics (2016)
Online motion synthesis with minimal intervention control and formal safety guarantees, , , and , in: Proc. IEEE Intl Conf. on Systems, Man, and Cybernetics, Budapest, Hungary, 2016 |
|
IEEE International Conference on Biometrics: Theory, Applications and Systems (2016)
Overview of BTAS 2016 Speaker Anti-spoofing Competition, , , , , , , , , , , , , , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016 |
[URL] |
Proceeding on the 7th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT) (2016)
PAoS Markers: Trajectory Analysis of Selective Phonological Posteriors for Assessment of Progressive Apraxia of Speech, , and , in: Proceeding on the 7th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), 2016 |
|
Interspeech (2016)
PhonVoc: A Phonetic and Phonological Vocoding Toolkit, and , in: Interspeech, San Francisco, USA, 2016 |
|
Proceedings of the 12th Workshop on Multiword Expressions (2016)
Phrase Representations for Multiword Expressions, and , in: Proceedings of the 12th Workshop on Multiword Expressions, 2016 |
|
International Conference of the Biometrics Special Interest Group (BIOSIG) (2016)
Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification, , and , in: International Conference of the Biometrics Special Interest Group (BIOSIG), 2016 |
|
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2016)
Principled Parallel Mean-Field Inference for Discrete Random Fields, , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016 |
Proceedings of Interspeech (2016)
Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
|
Proceedings of the First Conference on Machine Translation (WMT16) (2016)
Pronoun Language Model and Grammatical Heuristics for Aiding Pronoun Prediction, and , in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, ACL, 2016 |
|
ECCV 2016 (2016)
Scalable Metric Learning via Weighted Approximate Rank Component Analysis, and , in: ECCV 2016, 2016 |
|
Interspeech (2016)
Sound Pattern Matching for Automatic Prosodic Event Detection, , , , and , in: Interspeech, San Francisco, USA, 2016 |
|
Intl Workshop on Human-Friendly Robotics (2016)
Stochastic learning and control in multiple coordinate systems, , in: Intl Workshop on Human-Friendly Robotics, Genoa, Italy, pages 1-5, 2016 |
|
Interspeech (2016)
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, , and , in: Interspeech, 2016 |
|
Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016) (2016)
SYSTEM FUSION AND SPEAKER LINKING FOR LONGITUDINAL DIARIZATION OF TV SHOWS, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5495-5499, IEEE, 2016 |
|
Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing (2016)
The Night is Young: Urban Crowdsourcing of Nightlife Patterns, , , , , , and , in: Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, ACM, 2016 |
[DOI] |
Proceedings of the International Conference on Biometrics Special Interest Group (2016)
The REPLAY-MOBILE Face Presentation-Attack Database, , , and , in: Proceedings of the International Conference on Biometrics Special Interest Group, 2016 |
|
Proceedings of Interspeech (2016)
The SIWIS database: a multilingual speech database with acted emphasis, , , , , , , , , , , and , in: Proceedings of Interspeech, San Francisco, USA, pages 1532-1535, 2016 |
[DOI] |
Proceedings of the 18th ACM International Conference on Multimodal Interaction (2016)
Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens, , , , , , and , in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, Tokyo, Japan, pages 21-28, ACM, 2016 |
[DOI] |
Proc. ECCV Workshop on Computer Vision for Art Analysis (2016)
Transferring Neural Representations for Low-dimensional Indexing of Maya Hieroglyphic Art, , , , , , , , and , in: Proc. ECCV Workshop on Computer Vision for Art Analysis, Amsterdam, pages 842-855, Springer, 2016 |
[DOI] [URL] |
Proceedings of Interspeech 2016 (2016)
Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, , , and , in: Proceedings of Interspeech 2016, pages 2199-2203, 2016 |
IAPR Int. Workshops on Structural and Syntactic Pattern Recognition (SSPR) (2016)
Unsupervised Interpretable Pattern Discovery in Time Series Using Autoencoders, , , and , in: IAPR Int. Workshops on Structural and Syntactic Pattern Recognition (SSPR), 2016 |
|
Proceedings of the 10th Language Resources and Evaluation Conference (LREC) (2016)
Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation, and , in: Proceedings of the 10th Language Resources and Evaluation Conference (LREC), Portoroz, Slovenia, 2016 |
|
Proc. of the IEEE Intl Conf. on Robotics and Automation (ICRA) (2016)
Variable Duration Movement Encoding with Minimal Intervention Control, , and , in: Proc. of the IEEE Intl Conf. on Robotics and Automation (ICRA), pages 497-503, 2016 |
|
Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2016)
When Naïve Bayes Nearest Neighbors Meet Convolutional Neural Networks, , and , in: Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, 2016 |
|
Proceedings of CSEDU 2016 (2016)
Wiki-LDA: A Mixed-Method Approach for Effective Interest Mining on Twitter Data, , , and , in: Proceedings of CSEDU 2016, 2016 |
|
Computer Vision - ECCV 2016 (2016)
Learning to Refine Object Segments, , , and , in: Computer Vision - ECCV 2016, Amsterdam, pages 75-91, Springer, 2016 |
[DOI] [URL] |
German Conference on Pattern Recognition (2015)
A Deeper Look at Dataset Bias, , , and , in: German Conference on Pattern Recognition, Aachen, Germany, pages 504–516, Springer International Publishing, 2015 |
[DOI] |
Proc. of Interspeech (2015)
An Empirical Model of Emphatic Word Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 573-577, ISCA, 2015 |
|
International Conference on Acoustics, Speech and Signal Processing (2015)
An HMM-Based Formalism for Automatic Subword Unit Derivation and Pronunciation Generation, and , in: International Conference on Acoustics, Speech and Signal Processing, pages 4639-4643, IEEE, 2015 |
[DOI] |
Proceedings of Interspeech (2015)
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , in: Proceedings of Interspeech, ISCA, Dresden, pages 11-15, ISCA, 2015 |
|
Annotators' agreement and spontaneous emotion classification performance, and , in: Proceedings of Interspeech, pages 1546-1550, 2015 |
|
IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2015)
Atom Decomposition-based Intonation Modelling, , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4744-4748, IEEE, 2015 |
[DOI] |
Proceedings of Interspeech (2015)
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , in: Proceedings of Interspeech, 2015 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2015)
Building context-dependent DNN acoustic models using Kullback-Leibler divergence-based state tying, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2015 |
|
Proceedings of Interspeech (2015)
Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition, , , , and , in: Proceedings of Interspeech, pages 741-745, 2015 |
|
Proceedings of ICASSP 2015 (2015)
COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, , and , in: Proceedings of ICASSP 2015, pages 4834-4837, 2015 |
|
Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services (2015)
CommuniSense: Crowdsourcing Road Hazards in Nairobi, , , , , , and , in: Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services, Copenhagen, Denmark, pages 445-456, ACM, 2015 |
[DOI] [URL] |
International Conference on Acoustics, Speech and Signal Processing (2015)
Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, , and , in: International Conference on Acoustics, Speech and Signal Processing, IEEE, South Brisbane, QLD, pages 4295-4299, IEEE, 2015 |
|
Proceedings of ACII (2015)
Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies (Extended Abstract), , , , , , and , in: Proceedings of ACII, Xi'an, pages 470-476, IEEE, 2015 |
[DOI] |
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction (2015)
Deciphering the Silent Participant. On the Use of Audio-Visual Cues for the Classification of Listener Categories in Group Discussions, , , and , in: Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, ACM, Seattle, Washington, USA, pages 107-114, ACM, 2015 |
[DOI] |
IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV) (2015)
DexROV: Dexterous Undersea Inspection and Maintenance in Presence of Communication Latencies, , , , , , , , , , , , , , , and , in: IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV), pages 218-223, 2015 |
Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015 (2015)
Dictionary Learning for Sparse Representation of Neural Network Exemplars in Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015, pages 1093, 2015 |
|
2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (2015)
EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, , , and , in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015 |
[URL] |
Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on (2015)
Estimation of Divergence-Free 3D Cardiac Blood Flow in a Zebrafish Larva Using Multi-View Microscopy, and , in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, IEEE, Brooklyn, NY, USA, pages 385-388, 2015 |
[DOI] |
Proceedings of ACII (2015)
Exploring Dataset Similarities using PCA-based Feature Selection, , , and , in: Proceedings of ACII, Xi'an, pages 387-393, IEEE, 2015 |
[DOI] |
IEEE International Conference on Biometrics: Theory, Applications and Systems (2015)
Finger vein Liveness Detection Using Motion Magnification, , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-7, 2015 |
[DOI] |
Computer Vision and Pattern Recognition (CVPR) (2015)
From Image-level to Pixel-level Labeling with Convolutional Networks, and , in: Computer Vision and Pattern Recognition (CVPR), Boston, MA, pages 1713-1721, IEEE, 2015 |
[DOI] [URL] |
Scandinavian Conference on Image Analysis (2015)
Gender Classification by LUT based boosting of Overlapping Block Patterns, , and , in: Scandinavian Conference on Image Analysis, pages 530-542, Springer International Publishing, 2015 |
[DOI] [URL] |
Proceedings of the ICCV 2015 (2015)
Head Nod Detection from a Full 3D Model, , and , in: Proceedings of the ICCV 2015, pages 528-536, 2015 |
|
International Conference on Acoustics, Speech and Signal Processing (2015)
Integrated Pronunciation Learning for Automatic Speech Recognition Using Probabilistic Lexical Modeling, , and , in: International Conference on Acoustics, Speech and Signal Processing, South Brisbane, QLD, pages 5176-5180, 2015 |
[DOI] |
Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on (2015)
Intensity-Based Point-Spread-Function-Aware Registration for Multi-View Applications in Optical Microscopy, , and , in: Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, pages 306-309, IEEE, 2015 |
[DOI] |
International Conference on Mobile and Ubiquitous Multimedia (2015)
Happy and Agreeable? Multi-Label Classification of Impressions in Social Video, , and , in: International Conference on Mobile and Ubiquitous Multimedia, Linz, Austria, pages 109-120, ACM, 2015 |
[DOI] [URL] |
Proceedings of ICLR 2015 (2015)
Joint RNN-Based Greedy Parsing and Word Composition, and , in: Proceedings of ICLR 2015, 2015 |
|
Proceedings of ICASSP 2015 (2015)
KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, and , in: Proceedings of ICASSP 2015, pages 4435-4439, 2015 |
|
Proceedings of the international conference on Neural Information Processing Systems (2015)
Kullback-Leibler Proximal Variational Inference, , , and , in: Proceedings of the international conference on Neural Information Processing Systems, pages 3402-3410, 2015 |
|
Proc. of the 8th Intl Workshop on Human-Friendly Robotics (2015)
Learned Minimal Intervention Control Synthesis based on Hidden Semi-Markov Models, , and , in: Proc. of the 8th Intl Workshop on Human-Friendly Robotics, pages 17, 2015 |
|
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS) (2015)
Learning bimanual end-effector poses from demonstrations using task-parameterized dynamical systems, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 464-470, 2015 |
|
IEEE International Conference on Acoustics, Speech, and Signal Processing (2015)
Learning Feature Mapping using Deep Neural Network Bottleneck Features for Distant Large Vocabulary Speech Recognition, , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4540-4544, 2015 |
[DOI] |
Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS) (2015)
Learning Optimal Controllers in Human-robot Cooperative Transportation Tasks with Position and Force Constraints, , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), Hamburg, Germany, pages 1024-1030, 2015 |
|
Advances in Neural Information Processing Systems (2015)
Learning to Segment Object Candidates, , and , in: Advances in Neural Information Processing Systems, Montreal, Canada, pages 1990-1998, Curran Associates, Inc., 2015 |
[URL] |
Proceedings of the ACL-IJCNLP 2015 Student Research Workshop (2015)
Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, , , , , and , in: Proceedings of the ACL-IJCNLP 2015 Student Research Workshop, Beijing, China, pages 8-15, 2015 |
|
Proceedings of the 2015 Annual Symposium on Computing for Development (2015)
Looking at Cities in Mexico with Crowds, , and , in: Proceedings of the 2015 Annual Symposium on Computing for Development, London, United Kingdom, pages 127-135, ACM, 2015 |
[DOI] [URL] |
Proceedings of the 23rd ACM International Conference on Multimedia (2015)
Loud and Trendy: Crowdsourcing Impressions of Social Ambiance in Popular Indoor Urban Places, and , in: Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia, pages 211-220, ACM, 2015 |
[DOI] [URL] |
International Conference on Learning Representations (2015)
N-gram-Based Low-Dimensional Representation for Document Classification, and , in: International Conference on Learning Representations, 2015 |
[URL] |
Proc. of Interspeech (2015)
Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 1191-1195, ISCA, 2015 |
|
IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2015)
Novel GCC-PHAT Model in Diffuse Sound Field for Microphone Array Pairwise Distance Based Calibration, , , , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2669-2673, 2015 |
|
On Application Of Non-Negative Matrix Factorization for Ad Hoc Microphone Array Calibration from Incomplete Noisy Distances, , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2694-2698, IEEE, 2015 |
[DOI] |
Proceeding of Interspeech (2015)
On Compressibility of Neural Network phonological Features for Low Bit Rate Speech Coding, , and , in: Proceeding of Interspeech, pages 418-422, ISCA, 2015 |
|
The 8th IAPR International Conference on Biometrics (ICB) (2015)
On the Vulnerability of Palm Vein Recognition to Spoofing Attacks, and , in: The 8th IAPR International Conference on Biometrics (ICB), pages 319 - 325, 2015 |
[DOI] [URL] |
IEEE International Conference on Biometrics: Theory, Applications and Systems (2015)
On the Vulnerability of Speaker Verification to Realistic Voice Spoofing, , , and , in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-8, IEEE, 2015 |
[DOI] [URL] |
6th IEEE Conference on Cognitive Infocommunications (2015)
Overlapping Speech, Utterance Duration and Affective Content in HHI and HCI - an Comparison, , , , and , in: 6th IEEE Conference on Cognitive Infocommunications, Gyor, pages 83-88, 2015 |
[DOI] |
IEEE International Conference of the Biometrics Special Interest Group (2015)
Palm Vein Database and Experimental Framework for Reproducible Research, and , in: IEEE International Conference of the Biometrics Special Interest Group, pages 1-7, 2015 |
[DOI] [URL] |
Proceedings of the ACM International Conference on Multimodal Interaction (2015)
Personality Trait Classification via Co-Occurrent Multiparty Multimodal Event Discovery, , and , in: Proceedings of the ACM International Conference on Multimodal Interaction, Seattle, Washington, USA, pages 15-22, ACM, 2015 |
[DOI] |
IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2015)
Phonological Vocoding Using Artificial Neural Networks, , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, pages 4844-4848, IEEE, 2015 |
[DOI] |
International Conference on Machine Learning (ICML) (2015)
Phrase-based Image Captioning, , and , in: International Conference on Machine Learning (ICML), Lille, France, pages 2085–2094, JMLR, 2015 |
[URL] |
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2015)
Probability Occupancy Maps for Occluded Depth Images, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2829-2837, 2015 |
Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT) (2015)
Pronoun Translation and Prediction with or without Coreference Links, , and , in: Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), Lisbon, Portugal, pages 94–100, 2015 |
|
4th Biennial Workshop on Less-Resourced Languages (2015)
Pronunciation Lexicon Development for Under-Resourced Languages Using Automatically Derived Subword Units: A Case Study on Scottish Gaelic, , and , in: 4th Biennial Workshop on Less-Resourced Languages, 2015 |
|
Proceedings of NLDB 2015 (20th International Conference on Applications of Natural Language to Information Systems) (2015)
Query Refinement Using Conversational Context: a Method and an Evaluation Resource, and , in: Proceedings of NLDB 2015 (20th International Conference on Applications of Natural Language to Information Systems), Passau, Germany, pages 89-102, Springer-Verlag Berlin, 2015 |
[DOI] |
Proc. Intl Symp. on Robotics Research (2015)
Robot Learning with Task-Parameterized Generative Models, , in: Proc. Intl Symp. on Robotics Research, 2015 |
|
IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2015)
Robust Microphone Placement for Source Localization from Noisy Distance Measurements, , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2579-2583, IEEE, 2015 |
[DOI] |
Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations (2015)
Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-Based Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015 |
|
Proceedings of Interspeech (2015)
Sparse Modeling of Posterior Exemplars for Keyword Detection, , , and , in: Proceedings of Interspeech, pages 3690-3694, 2015 |
|
The 8th IAPR International Conference on Biometrics (ICB) (2015)
The 1st Competition on Counter Measures to Finger Vein Spoofing Attacks, , , , , , , , , and , in: The 8th IAPR International Conference on Biometrics (ICB), pages 513 - 518, 2015 |
[DOI] [URL] |
IEEE Automatic Speech Recognition and Understanding Workshop (2015)
Towards utterance-based neural network adaptation in acoustic modeling, , , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015 |
|
Image Analysis and Processing - ICIAP 2015 (2015)
Transfer Learning through Greedy Subset Selection, , and , in: Image Analysis and Processing - ICIAP 2015, Genoa, Italy, pages 3-14, Springer International Publishing, 2015 |
[DOI] |
Proceedings of the ACM International Conference on Multimedia (2015)
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology, , , , , and , in: Proceedings of the ACM International Conference on Multimedia, pages 159-168, 2015 |
|
Proceedings of Interspeech (2015)
Weighted Correlation based Atom Decomposition Intonation Modelling, , , and , in: Proceedings of Interspeech, Dresden, Germany, pages 1601-1605, 2015 |
|
Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2014)
A Conditional Random field approach for audio-visual people diarization, , , , and , in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 116 - 120, IEEE, 2014 |
[DOI] |
Proc. of the AAAI Symp. on Knowledge, Skill, and Behavior Transfer in Autonomous Robots (2014)
A Skill Transfer Approach for Continuum Robots - Imitation of Octopus Reaching Motion with the STIFF-FLOP Robot, , , and , in: Proc. of the AAAI Symp. on Knowledge, Skill, and Behavior Transfer in Autonomous Robots, Arlington, VA, USA, pages 49-52, 2014 |
[URL] |
Proc. IEEE Intl Conf. on Robotics and Automation (ICRA) (2014)
A task-parameterized probabilistic model with minimal intervention control, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3339 - 3344, IEEE, 2014 |
[DOI] |
Proceedings of the 4th Joint Workshop on Hands-free speech communication and Microphone Arrays (2014)
Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, , , and , in: Proceedings of the 4th Joint Workshop on Hands-free speech communication and Microphone Arrays, Villers-les-Nancy, pages 1 - 5, IEEE, 2014 |
[DOI] |
IEEE Spoken Language Technology workshop (2014)
Artificial neural network features for speaker diarization, , and , in: IEEE Spoken Language Technology workshop, South Lake Tahoe, USA, 2014 |
|
International Joint Conference on Biometrics (2014)
Audio-Visual Gender Recognition in Uncontrolled Environment Using Variability Modeling Techniques, , and , in: International Joint Conference on Biometrics, Clearwater, Florida, USA, pages 1 - 8, IEEE, 2014 |
[DOI] [URL] |
Proc. ACM Int. Conf. on Multimodal Interaction (2014)
Automatic Blinking Detection towards Stress Discovery, , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 307-310, ACM New York, 2014 |
[DOI] |
ACM MM (2014)
Automatic Maya Hieroglyph Retrieval Using Shape and Context Information, , , , and , in: ACM MM, pages 4, 2014 |
[URL] |
Proceedings of Interspeech (2014)
Automatic Speech Recognition and Translation of a Swiss German Dialect: Walliserdeutsch, , and , in: Proceedings of Interspeech, 2014 |
|
Proc. ACM Int. Conf. on Multimodal Interaction (2014)
Capturing Upper Body Motion in Conversation: an Appearance Quasi-Invariant Approach, , , and , in: Proc. ACM Int. Conf. on Multimodal Interaction, Istanbul, pages 327-334, ACM New York, 2014 |
[DOI] |
12th International Workshop on Content-Based Multimedia Indexing (2014)
Comparison of Two Methods for Unsupervised Person Identification in TV Shows, , , , and , in: 12th International Workshop on Content-Based Multimedia Indexing, 2014 |
|
IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BioMS) (2014)
Cross-Database Evaluation With an Open Finger Vein Sensor, , , and , in: IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BioMS), Rome, Italy, pages 30-35, IEEE, 2014 |
[DOI] |
9th Edition of the Language Resources and Evaluation Conference (2014)
Cross-linguistic annotation of narrativity for English/French verb tense disambiguation, and , in: 9th Edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
INTERSPEECH (2014)
Detecting and Labeling Speakers on Overlapping Speech using Vector Taylor Series, , and , in: INTERSPEECH, 2014 |
|
Proceedings of Interspeech (2014)
Detecting speaker roles and topic changes in multiparty conversations using latent topic models, and , in: Proceedings of Interspeech, 2014 |
|
Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014) (2014)
Development of Bilingual ASR System for MediaParl Corpus, , , and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014 |
|
The 15th Annual Conference of the International Speech Communication Association (2014)
Dialect Levelling in Finnish: A Universal Speech Attribute Approach, , , , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
INTERSPEECH 2014 (2014)
Diarizing Large Corpora using Multi-modal Speaker Linking, , , and , in: INTERSPEECH 2014, 2014 |
|
International Conference on Machine Learning (2014)
Dynamic Programming Boosting for Discriminative Macro-Action Discovery, and , in: International Conference on Machine Learning, 2014 |
|
ICMI Workshop on Understanding and Modeling Multiparty Multimodal Interactions (2014)
Effect of nonverbal behavioral patterns on the performance of small groups, and , in: ICMI Workshop on Understanding and Modeling Multiparty Multimodal Interactions, Istanbul, Turkey, 2014 |
|
Proceedings of the 6th Asian Conference on Machine Learning (ACML) (2014)
Efficient Sample Mining for Object Detection, and , in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014 |
|
Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics) (2014)
Enforcing Topic Diversity in a Document Recommender for Conversations, and , in: Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics), Dublin, Ireland, pages 746-759, IEEE, 2014 |
|
The Ninth Language Resources and Evaluation Conference (2014)
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling, , and , in: The Ninth Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) (2014)
Explaining the Stars: Weighted Multiple-Instance Learning for Aspect-Based Sentiment Analysis, and , in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 2014 |
|
9th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (2014)
Exploiting Scene Cues for Dropped Object Detection, , and , in: 9th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2014 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2014)
Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322 - 2326, IEEE, 2014 |
[DOI] |
Proceedings of the ACM Symposium on Eye Tracking Research and Applications (2014)
EYEDIAP: A Database for the Development and Evaluation of Gaze Estimation Algorithms from RGB and RGB-D Cameras, , and , in: Proceedings of the ACM Symposium on Eye Tracking Research and Applications, Safety Harbor, Florida, United States of America, ACM, 2014 |
[DOI] |
IEEE International Conference on Image Processing 2014 (2014)
Face identification from overlaid texts using Local Face Recurrent Patterns and CRF models, , , , and , in: IEEE International Conference on Image Processing 2014, Paris, IEEE, 2014 |
|
Proc. of Interspeech 2014 (2014)
Feature Switching in the i-vector Framework for Speaker Verification, , , , and , in: Proc. of Interspeech 2014, pages 5, 2014 |
IEEE Computer Vision and Pattern Recognition Conference (2014)
Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras, and , in: IEEE Computer Vision and Pattern Recognition Conference, Columbus, Ohio, USA, pages 1773-1780, IEEE, 2014 |
[DOI] |
Odyssey: The Speaker and Language Recognition Workshop (2014)
Hierarchical speaker clustering methods for the NIST i-vector Challenge, , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
Human Behavior Understanding (2014)
How Do You Like Your Virtual Agent?: Human-Agent Interaction Experience through Nonverbal Features and Personality Traits, , and , in: Human Behavior Understanding, pages 1-15, Springer, 2014 |
|
International Conference on Image Processing (2014)
Improving Head and Body Pose Estimation through Semi-supervised Manifold Alignment, , , , and , in: International Conference on Image Processing, 2014 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2014)
Improving Speaker Diarization using social role information, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2014 |
|
ICASSP (2014)
Inferring social relationships in a phone call from a single party's speech, , and , in: ICASSP, Florence, IT, pages 4843 - 4847, IEEE, 2014 |
[DOI] |
Information Bottleneck based Speaker Diarization of Meetings using Non-speech as Side Information, and , in: ICASSP, Florence, IT, pages 96 - 100, IEEE, 2014 |
[DOI] |
The 15th Annual Conference of the International Speech Communication Association (2014)
Introducing I-Vectors for Joint Anti-spoofing and Speaker Verification, , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
|
Proc. ACM Int. Workshop on Crowdsourcing for Multimedia (2014)
Is That a Jaguar? Segmenting Ancient Maya Glyphs via Crowdsourcing, , and , in: Proc. ACM Int. Workshop on Crowdsourcing for Multimedia, Orlando, pages 37-40, ACM New York, 2014 |
[DOI] |
Global Conference on Signal and Information Processing (2014)
Joint Phoneme Segmentation Inference and Classification using CRFs, , and , in: Global Conference on Signal and Information Processing, Atlanta, GA, pages 587 - 591, IEEE, 2014 |
[DOI] |
International Conference on Artificial Intelligence and Statistics (2014)
Jointly Informative Feature Selection, and , in: International Conference on Artificial Intelligence and Statistics, pages 567–575, 2014 |
|
Proc. IEEE Intl Conf. on Development and Learning and on Epigenetic Robotics (ICDL-EpiRob) (2014)
Learning adaptive movements from demonstration and self-guided exploration, , and , in: Proc. IEEE Intl Conf. on Development and Learning and on Epigenetic Robotics (ICDL-EpiRob), Genoa, Italy, pages 160-165, 2014 |
|
Proc. IEEE Intl Symposium on Robot and Human Interactive Communication (Ro-Man) (2014)
Learning Force and Position Constraints in Human-robot Cooperative Transportation, , and , in: Proc. IEEE Intl Symposium on Robot and Human Interactive Communication (Ro-Man), Edinburgh, Scotland, UK, pages 619-624, 2014 |
|
Proc. IEEE Intl Conf. on Robotics and Automation (ICRA) (2014)
Learning from demonstrations with partially observable task parameters, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3309 - 3314, IEEE, 2014 |
[DOI] |
Proceedings of the Computer Vision and Pattern Recognition (2014)
Learning to Learn, from Transfer Learning to Domain Adaptation: A Unifying Perspective, and , in: Proceedings of the Computer Vision and Pattern Recognition, Columbus, OH, pages 1442-1449, IEEE, 2014 |
[DOI] |
Proceedings of the International Conference on 3D vision (2014)
LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images, , , and , in: Proceedings of the International Conference on 3D vision, pages 517–524, 2014 |
International Workshop on Content-Based Multimedia Indexing (2014)
Mode of Teaching Based Segmentation and Annotation of Video Lectures, , and , in: International Workshop on Content-Based Multimedia Indexing, 2014 |
|
2014 IEEE International Conference on Acoustics, Speech and Signal Processing (2014)
Model-based Sparse Component Analysis for Reverberant Speech Localization, , , and , in: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1439 - 1443, IEEE, 2014 |
[DOI] |
Odyssey: The Speaker and Language Recognition Workshop (2014)
Modeling Overlapping Speech using Vector Taylor Series, , and , in: Odyssey: The Speaker and Language Recognition Workshop, Joensuu, Finland, 2014 |
|
Proceedings of the International Conference on Pattern Recognition (2014)
Multi-Source Adaptive Learning for Fast Control of Prosthetics Hand, , and , in: Proceedings of the International Conference on Pattern Recognition, Stockholm, pages 2769 - 2774, 2014 |
[DOI] |
INTERSPEECH (2014)
Multi-source Posteriors for Speech Activity Detection on Public Talks, and , in: INTERSPEECH, 2014 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2014)
Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation, , , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, pages 7639-7643, IEEE, 2014 |
[DOI] |
ACM International Conference on Multimedia Retrieval (2014)
Multimodal Reranking of Content-based Recommendations for Hyperlinking Video Snippets, , , and , in: ACM International Conference on Multimedia Retrieval, 2014 |
|
Proc. IEEE Intl Conf. on Robotics and Automation (ICRA) (2014)
Null space redundancy learning for a flexible surgical robot, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 2443 - 2448, IEEE, 2014 |
[DOI] |
International Conference on Acoustics, Speech, and Signal Processing (2014)
On Modeling Context-Dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, , and , in: International Conference on Acoustics, Speech, and Signal Processing, Florence, IT, pages 7659 - 7663, IEEE, 2014 |
[DOI] |
Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014) (2014)
On Recognition of Non-Native Speech Using Probabilistic Lexical Model, and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), 2014 |
|
IEEE International Conference of the Biometrics Special Interest Group (BIOSIG) (2014)
On the Vulnerability of Finger Vein Recognition to Spoofing, , and , in: IEEE International Conference of the Biometrics Special Interest Group (BIOSIG), Darmstadt, Germany, pages 1 - 10, IEEE, 2014 |
|
ImageCLEF 2014: Overview and analysis of the results (2014)
Overview of the ImageCLEF 2014 Domain Adaptation Task, and , in: ImageCLEF 2014: Overview and analysis of the results, 2014 |
|
Interspeech (2014)
Phoneme Background Model for Information Bottleneck based Speaker Diarization, , and , in: Interspeech, Singapore, 2014 |
|
Proceedings of Interspeech (2014)
Posterior-based Sparse Representation for Automatic Speech Recognition, , , and , in: Proceedings of Interspeech, 2014 |
|
Speech Prosody (2014)
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, , , and , in: Speech Prosody, 2014 |
|
31st International Conference on Machine Learning (ICML) (2014)
Recurrent Convolutional Neural Networks for Scene Labeling, and , in: 31st International Conference on Machine Learning (ICML), Beijing, China, pages 82-90, JMLR, 2014 |
[URL] |
Proceedings of ECML 2014 (2014)
Recurrent Greedy Parsing with Neural Networks, and , in: Proceedings of ECML 2014, pages 130-144, Springer Berlin Heidelberg, 2014 |
[DOI] |
Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE (2014)
Rewards-driven control of robot arm by decoding EEG signals, , and , in: Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE, pages 1658-1661, IEEE, 2014 |
[DOI] [URL] |
Proceedings of the 2014 Workshop on Roadmapping the Future of Multimodal Interaction Research including Business Opportunities and Challenges (RFMIR '14) (2014)
ROCKIT: Roadmap for Conversational Interaction Technologies, , , , , , , , , , , , , , and , in: Proceedings of the 2014 Workshop on Roadmapping the Future of Multimodal Interaction Research including Business Opportunities and Challenges (RFMIR '14), pages 39-42, ACM, 2014 |
[DOI] |
Proceedings of the 6th Asian Conference on Machine Learning (ACML) (2014)
Sample Distillation for Object Detection and Image Classification, , and , in: Proceedings of the 6th Asian Conference on Machine Learning (ACML), Nha Trang, Vietnam, 2014 |
|
Proceedings of the 22nd International Conference on Pattern Recognition (2014)
Scene Recognition with Naive Bayes Non-linear Learning, and , in: Proceedings of the 22nd International Conference on Pattern Recognition, Stockholm, pages 3404 - 3409, IEEE, 2014 |
[DOI] |
Proc. of the Intl Conf. on Ubiquitous Robots and Ambient Intelligence (URAI) (2014)
Skills Learning in Robots by Interaction with Users and Environment, , in: Proc. of the Intl Conf. on Ubiquitous Robots and Ambient Intelligence (URAI), Kuala Lumpur, Malaysia, pages 161-162, 2014 |
[URL] |
Interspeech (2014)
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, , , and , in: Interspeech, 2014 |
|
Speech Prosody (2014)
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , in: Speech Prosody, 2014 |
|
Odyssey: The Speaker and Language Recognition Workshop (2014)
Swiss French Regional Accent Identification, , , , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
Nouveaux cahiers de linguistique française (2014)
Syllable-based Regional Swiss French Accent Identification using Prosodic Features, , , and , in: Nouveaux cahiers de linguistique française, 2014 |
|
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) (2014)
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues, , , , , , , , , , , , , and , in: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, European Language Resources Association (ELRA), 2014 |
[URL] |
Proc. Digital Humanities Conference (2014)
The MAAYA Project: Multimedia Analysis and Access for Documentation and Decipherment of Maya Epigraphy, , , , , , and , in: Proc. Digital Humanities Conference, Lausanne, 2014 |
|
DOGS2014 - Digital speech and image processing (2014)
The SP2 SCOPES Project on Speech Prosody, , , , , , , , and , in: DOGS2014 - Digital speech and image processing, 2014 |
|
Proceedings of the ACM International Conference on Multimedia (2014)
The Workshop on Computational Personality Recognition 2014, , , , , and , in: Proceedings of the ACM International Conference on Multimedia, 2014 |
|
Proceedings of the First International Conference on IoT in Urban Space (2014)
The Young and the City: Crowdsourcing Urban Awareness in a Developing Country, , and , in: Proceedings of the First International Conference on IoT in Urban Space, pages 74-79, 2014 |
[DOI] [URL] |
Proceedings of the European Conference on Computer Vision (2014)
Tracking Interacting Objects Optimally Using Integer Programming, , , and , in: Proceedings of the European Conference on Computer Vision, pages 17-32, 2014 |
|
International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (2014)
What to Show? Automatic Stream Selection Among Multiple Sensors, , and , in: International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2014 |
|
International Conference on Multimodal Interaction, Understanding and Modeling Multiparty, Multimodal Interactions Workshop (2014)
Who Will Get the Grant? A Multimodal Corpus for the Analysis of Conversational Behaviours in Group Interviews, , , , and , in: International Conference on Multimodal Interaction, Understanding and Modeling Multiparty, Multimodal Interactions Workshop, Istanbul, Turkey, ACM, 2014 |
[DOI] |
International Workshop on Multimedia Signal Processing (2014)
Within- and Cross-Database Evaluations for Gender Classification via BeFIT Protocols, , and , in: International Workshop on Multimedia Signal Processing, pages 1-6, 2014 |
[DOI] [URL] |
14th Conference of the European Chapter of the Association for Computational Linguistics (2014)
Word Embeddings through Hellinger PCA, and , in: 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014 |
|
Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction (2013)
3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, , in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013 |
[DOI] |
Signal Processing with Adaptive Sparse Structured Representations SPARS (2013)
A Multipath Sparse Beamforming Method, , , and , in: Signal Processing with Adaptive Sparse Structured Representations SPARS, 2013 |
|
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2013)
A Probabilistic Framework for Multiple Speaker Localization, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013 |
|
15th ACM International Conference on Multimodal Interaction (2013)
A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, , , and , in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013 |
[DOI] |
CVPR 2013 Workshop on Structured Prediction (2013)
Accelerated Training of Linear Object Detectors, and , in: CVPR 2013 Workshop on Structured Prediction, 2013 |
[URL] |
The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2013)
Accent Adaptation Using Subspace Gaussian Mixture Models, , , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013 |
[DOI] |
Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP) (2013)
Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, , and , in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013 |
|
INTERSPEECH (2013)
An Open-source State-of-the-art Toolbox for Broadcast News Diarization, , , , , and , in: INTERSPEECH, 2013 |
|
Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Workshop on Biometrics (2013)
Anti-spoofing in action: joint operation with a verification system, , and , in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Workshop on Biometrics, Portland, Oregon, 2013 |
|
Are ACT's scores increasing with better translation quality? (2013)
Are ACT's scores increasing with better translation quality?, , in: Are ACT's scores increasing with better translation quality?, pages 6, 2013 |
|
14th International Conference on Intelligent Text Processing and Computational Linguistics (2013)
Assessing the Accuracy of Discourse Connective Translations: Validation of an Automatic Metric, and , in: 14th International Conference on Intelligent Text Processing and Computational Linguistics, University of the Aegean, Samos, Greece, pages 236-247, Springer, 2013 |
[DOI] |
Proceedings of Interspeech (2013)
Automatic Social Role Recognition In Professional Meetings Using Conditional Random Fields, and , in: Proceedings of Interspeech, 2013 |
|
International Conference on Affective Computing and Intelligent Interaction (2013)
Automatic Staging of Audio with Emotions, and , in: International Conference on Affective Computing and Intelligent Interaction, 2013 |
Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition (2013)
Body communicative cue extraction for conversational analysis, , , , and , in: Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, 2013 |
|
Proceedings of the 11th International Workshop on Content Based Multimedia Indexing (2013)
Combining Content with User Preferences for TED Lecture Recommendation, and , in: Proceedings of the 11th International Workshop on Content Based Multimedia Indexing, Veszprém, Hungary, IEEE, 2013 |
|
International Joint Conference on Artificial Intelligence (2013)
Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, and , in: International Joint Conference on Artificial Intelligence, 2013 |
|
Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction (2013)
Context Aware Addressee Estimation for Human Robot Interaction, , , and , in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013 |
Human Behavior Understanding (2013)
Creative Applications of Human Behavior Understanding, , , and , in: Human Behavior Understanding, pages 1-14, 2013 |
15th ACM International Conference on Multimodal Interaction (2013)
Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013) (2013)
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013 |
|
British Machine Vision Conference (2013)
Deformable Part Models with Individual Part Scaling, and , in: British Machine Vision Conference, 2013 |
|
Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics) (2013)
Detecting Narrativity to Improve English to French Translation of Simple Past Verbs, , and , in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 33-42, 2013 |
|
European Geosciences Union (2013)
Discovering Temporal Patterns in Water Quality Time Series, Focusing on Floods with the LDA method, , , , , , , and , in: European Geosciences Union, 2013 |
|
Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics (2013)
Distinguishing the Popularity Between Topics: A System for Up-to-date Opinion Retrieval and Mining in the Web, , and , in: Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics, LNCS, Samos, Greece, ACM, 2013 |
[URL] |
Proceedings of the ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Short Papers (2013)
Diverse Keyword Extraction from Conversations, and , in: Proceedings of the ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Short Papers, Sofia, Bulgaria, pages 651-657, ACL, 2013 |
|
Proceedings of Interspeech (2013)
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, , and , in: Proceedings of Interspeech, 2013 |
|
Proceedings IEEE International Conference On Digital Signal Processing (2013)
Euclidean Distance Matrix Completion for Ad-hoc Microphone Array Calibration, , , and , in: Proceedings IEEE International Conference On Digital Signal Processing, 2013 |
|
Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications (2013)
Evaluating Intra- and Crosslingual Adaptation for Non-native Speech Recognition in a Bilingual Environment, and , in: Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications, IEEE, Budapest, Hungary, pages 357-361, 2013 |
|
Proc. Mexican Conf. on Pattern Recognition (2013)
Evaluating Shape Descriptors for Detection of Maya Hieroglyphs, , and , in: Proc. Mexican Conf. on Pattern Recognition, Queretaro, 2013 |
|
International Conference on Rehabilitation Robotics (2013)
Exploiting Accelerometers to Improve Movement Classification for Prosthetics, and , in: International Conference on Rehabilitation Robotics, 2013 |
|
Proceedings of the Conference on Computer Vision and Pattern Recognition (2013)
Fast Object Detection with Entropy-Driven Evaluation, , , and , in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013 |
The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2013)
Feature and Score Level Combination of Subspace Gaussians in LVCSR Task, , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013 |
[DOI] |
The 7th International AAAI Conference on Weblogs and Social Media (2013)
From Foursquare to my Square: Learning Check-in Behavior from Multiple Sources, , and , in: The 7th International AAAI Conference on Weblogs and Social Media, 2013 |
|
Proceedings of the Conference on Computer Vision and Pattern Recognition (2013)
From N to N+1: Multiclass Transfer Incremental Learning, , and , in: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2013 |
|
Proceedings of the 3rd ACM conference on International conference on multimedia retrieval (2013)
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , in: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, Dallas, Texas, USA, pages 97-104, ACM, 2013 |
|
Proceedings of IEEE TENCON (2013)
Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition, , and , in: Proceedings of IEEE TENCON, 2013 |
|
Proceedings of Human Robot Interaction (HRI) Conference (2013)
Given that, Should I Respond? Contextual Addressee Estimation in Multi-Party Human-Robot Interactions, and , in: Proceedings of Human Robot Interaction (HRI) Conference, 2013 |
|
IEEE International Conference on Acoustics, Speech and Signal Processing (2013)
Grapheme and Multilingual Posterior Features for Under-Resourced Speech Recognition: A Study on Scottish Gaelic, , and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
INTERSPEECH (2013)
I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: INTERSPEECH, Lyon, France, 2013 |
|
MediaEval 2013 Workshop (2013)
Idiap at MediaEval 2013: Search and Hyperlinking Task, , , and , in: MediaEval 2013 Workshop, Barcelona, Spain, CEUR-WS.org, 2013 |
|
Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (2013)
Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition, , , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013 |
|
Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics) (2013)
Implicitation of Discourse Connectives in (Machine) Translation, and , in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 19-26, 2013 |
|
ICASSP (2013)
Improved Overlap Speech Diarization of Meeting Recordings using Long-term Conversational Features, and , in: ICASSP, 2013 |
|
Proceedings of Interspeech (2013)
Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, and , in: Proceedings of Interspeech, 2013 |
|
12th International Conference on Mobile and Ubiquitous Multimedia (2013)
Inferring Mood in Ubiquitous Conversational Video, , , , , and , in: 12th International Conference on Mobile and Ubiquitous Multimedia, Luleå, Sweden, ACM Press, 2013 |
|
15th ACM International Conference on Multimodal Interaction (2013)
Inferring social activities with mobile sensor networks, , , , and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Affective Computing and Intelligent Interaction (2013)
Investigating the Impact of Language Style and Vocal Expression on Social Roles of Participants in Professional Meetings, and , in: Affective Computing and Intelligent Interaction, Geneva, pages 324-329, IEEE, 2013 |
[DOI] |
Mining and Learning with Graphs (2013)
Learning to Rank on Network Data, , and , in: Mining and Learning with Graphs, 2013 |
|
Proceedings of the 15th ACM on International Conference on Multimodal Interaction (2013)
Leveraging the robot dialog state for visual focus of attention recognition, , , , and , in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013 |
Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics) (2013)
Machine Translation with Many Manually Labeled Discourse Connectives, and , in: Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), Sofia, Bulgaria, pages 43-50, 2013 |
|
IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (2013)
Manifold Sparse Beamforming, , and , in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013 |
[DOI] |
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (2013)
MLP-based Factor Analysis for Tandem Speech Recognition, and , in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
Proceedings of the 21st ACM International Conference on Multimedia (2013)
Multi-factor Segmentation for Topic Visualization and Recommendation: the MUST-VIS System, , , , , , , and , in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 365-368, ACM, 2013 |
[DOI] [URL] |
JMLR W&CP, Volume 29: Asian Conference on Machine Learning (2013)
Multiclass Latent Locally Linear Support Vector Machines, , and , in: JMLR W&CP, Volume 29: Asian Conference on Machine Learning, Canberra, Australia, pages 229-244, 2013 |
[URL] |
15th ACM International Conference on Multimodal Interaction Proceedings (2013)
Multimodal Analysis of Body Communication Cues in Employment Interviews, , , and , in: 15th ACM International Conference on Multimodal Interaction Proceedings, 2013 |
|
Proceedings of the AIA-DAGA 2013 International Conference on Acoustics (2013)
Noise Intrusiveness Factors in Speech Telecommunications, , , and , in: Proceedings of the AIA-DAGA 2013 International Conference on Acoustics, Merano, Italy, pages 436-439, 2013 |
|
Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP) (2013)
On the (Un)importance of the Contextual Factors In HMM-Based Speech Synthesis, , and , in: Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, pages 8140-8143, 2013 |
|
Biometric Technologies in Forensic Science (2013)
On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, , , and , in: Biometric Technologies in Forensic Science, Nijmegen, The Netherlands, 2013 |
|
15th ACM International Conference on Multimodal Interaction (2013)
One of a Kind: Inferring Personality Impressions in Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
|
Working Notes, CLEF 2013 (2013)
Overview of the ImageCLEF 2013 Robot Vision Task, , , and , in: Working Notes, CLEF 2013, 2013 |
|
IEEE Workshop on Performance Evaluation of Tracking and Surveillance (2013)
Parameter Estimation and Contextual Adaptation for a Multi-Object Tracking CRF Model, and , in: IEEE Workshop on Performance Evaluation of Tracking and Surveillance, 2013 |
|
International Conference on Image Processing (2013)
Person Independent 3D Gaze Estimation From Remote RGB-D Cameras, and , in: International Conference on Image Processing, Melbourne, Australia, IEEE, 2013 |
[DOI] |
Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (2013)
Probabilistic Lexical Modeling and Unsupervised Training for Zero-Resourced ASR, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013 |
|
Workshop on Speech, Language and Audio in Multimedia (2013)
Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project, , , , , , , , and , in: Workshop on Speech, Language and Audio in Multimedia, 2013 |
|
Proceedings of the international conference on Neural Information Processing Systems (2013)
Reservoir Boosting: Between Online and Offline Ensemble Learning, and , in: Proceedings of the international conference on Neural Information Processing Systems, 2013 |
|
Proceedings of the 2013 ACM Conference on Pervasive and Ubiquitous Computing Adjunct Publication (2013)
Revisiting the Generality of the Rank-based Human Mobility Model, and , in: Proceedings of the 2013 ACM Conference on Pervasive and Ubiquitous Computing Adjunct Publication, Zurich, Switzerland, pages 1209-1218, ACM, 2013 |
[DOI] [URL] |
36th ACM SIGIR Conference on Research and Development in Information Retrieval (2013)
Sentiment Analysis of User Comments for One-Class Collaborative Filtering over TED Talks, and , in: 36th ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, ACM, 2013 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2013)
Speaker adaptive Kullback-Leibler divergence based hidden Markov models, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
Proceedings of the 21st ACM International Conference on Multimedia (2013)
Speaking Swiss: Languages and Venues in Foursquare, and , in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 501-504, ACM, 2013 |
[DOI] [URL] |
International Conference of the Biometrics Special Interest Group (2013)
Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, and , in: International Conference of the Biometrics Special Interest Group, Darmstadt, Germany, 2013 |
|
Biometrics: Theory, Applications and Systems (2013)
Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, and , in: Biometrics: Theory, Applications and Systems, Washington DC, USA, 2013 |
|
International Conference on Machine Learning (2013)
Stability and Hypothesis Transfer Learning, and , in: International Conference on Machine Learning, 2013 |
|
Signal Processing with Adaptive Sparse Structured Representations SPARS (2013)
Structured Sparse Acoustic Modeling for Speech Separation, , , and , in: Signal Processing with Adaptive Sparse Structured Representations SPARS, SPARS, 2013 |
|
Proc. of Interspeech 2013 (2013)
Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, , and , in: Proc. of Interspeech 2013, Lyon, France, 2013 |
|
The 6th IAPR International Conference on Biometrics (2013)
The 2013 Face Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: The 6th IAPR International Conference on Biometrics, 2013 |
|
The 2013 Speaker Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: The 6th IAPR International Conference on Biometrics, 2013 |
|
International Conference of Biometrics 2013 (2013)
The 2nd competition on counter measures to 2D face spoofing attacks, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: International Conference of Biometrics 2013, Madrid, Spain, 2013 |
|
Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction (2013)
The vernissage corpus: a conversational human-robot-interaction dataset, , , , , , , , , and , in: Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction, 2013 |
|
IEEE International Conference on Image Processing (2013)
Time-Sensitive Topic Models for Action Recognition in Videos, , and , in: IEEE International Conference on Image Processing, 2013 |
|
IEEE/RSJ International Conference on Intelligent Robots and Systems (2013)
Transfer in Inverse Reinforcement Learning for Multiple Strategies, and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan, pages 3244-3250, IEEE, 2013 |
[DOI] [URL] |
ISCA Speech Synthesis Workshop (2013)
Understanding Factors in Emotion Perception, and , in: ISCA Speech Synthesis Workshop, 2013 |
|
Proc. of Interspeech 2013 (2013)
Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, and , in: Proc. of Interspeech 2013, 2013 |
International Conference on Multimodal Interaction (2013)
Who is Persuasive? The Role of Perceived Personality and Communication Modality in Social Multimedia, , , , and , in: International Conference on Multimodal Interaction, 2013 |
ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing (2012)
A tree-based distance between distributions: application to classification of neurons, and , in: ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2012)
Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
|
24th IEEE International Conference on Tools with Artificial Intelligence (2012)
An Agent-Based Focused Crawling Framework for Topic- and Genre-Related Web Document Discovery, , and , in: 24th IEEE International Conference on Tools with Artificial Intelligence, Athens, Greece, IEEE, 2012 |
[URL] |
Computer Vision - ECCV 2012. Workshops and Demonstrations (2012)
An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, , and , in: Computer Vision - ECCV 2012. Workshops and Demonstrations, Idiap Research Institute, Heidelberg, pages 547-556, Springer Berlin, 2012 |
[DOI] [URL] |
Proceedings of Interspeech 2012 (2012)
Annotation and Recognition of Personality Traits in Spoken Conversations from the AMI Meetings Corpus, , and , in: Proceedings of Interspeech 2012, 2012 |
|
INTERSPEECH (2012)
Automatic detection of conflict escalation in spoken conversations, , and , in: INTERSPEECH, ISCA, Portland, Oregon, USA, 2012 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2012)
Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012 (2012)
Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012, 2012 |
|
Working Notes of the ImageCLEF 2012 Laboratory (2012)
Baseline Multimodal Place Classifier for the 2012 Robot Vision Task, , and , in: Working Notes of the ImageCLEF 2012 Laboratory, 2012 |
|
Asian Conference on Computer Vision (2012)
Beyond Dataset Bias: Multi-task Unaligned Shared Knowledge Transfer, , , and , in: Asian Conference on Computer Vision, 2012 |
|
Proceedings of the 21st International Conference on Pattern Recognition (2012)
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , in: Proceedings of the 21st International Conference on Pattern Recognition, 2012 |
|
IEEE ICME Workshop on Hot Topics in Mobile Multimedia (2012)
Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, , , , , , , , , , , , , and , in: IEEE ICME Workshop on Hot Topics in Mobile Multimedia, 2012 |
|
Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages (2012)
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, , and , in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60-67, 2012 |
|
IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA (2012)
Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA, 2012 |
Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics (2012)
Building the NinaPro Database: a Resource for the Biorobotics Community, , , , , , , , and , in: Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2012 |
|
Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia (2012)
Checking In or Checked In: Comparing Large-Scale Manual and Automatic Location Disclosure Patterns, , and , in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012 |
International Symposium on Communications, Control, and Signal Processing (2012)
Collecting data for socially intelligent surveillance and monitoring approaches: the case of conflict in competitive conversations, , , and , in: International Symposium on Communications, Control, and Signal Processing, 2012 |
|
Proceedings of Interspeech (2012)
Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR, , , , , and , in: Proceedings of Interspeech, 2012 |
|
Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation, and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
IEEE International Conference on Acoustics, Speech, and Signal Processing (2012)
Combining transcription-based and acoustic-based speaker identifications for broadcast news, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012 |
|
Proceedings in International conference on Speech and Signal processing (2012)
COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH HIERARCHICAL LINEAR TRANSFORMATIONS, , , and , in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012 |
|
Proceedings of Interspeech (2012)
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2012)
Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012 |
|
Proceedings of the 14th ACM International Conference on Ubiquitous Computing (2012)
Contextual Conditional Models for Smartphone-based Human Mobility Prediction, and , in: Proceedings of the 14th ACM International Conference on Ubiquitous Computing, 2012 |
|
Proceedings of Interspeech (2012)
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , in: Proceedings of Interspeech, Portland, Oregon, USA, 2012 |
|
Proceedings of International ACM Workshop on Crowdsourcing for Multimedia (2012)
Crowdsourcing Micro-Level Multimedia Annotations: The Challenges of Evaluation and Interface, , , and , in: Proceedings of International ACM Workshop on Crowdsourcing for Multimedia, 2012 |
IEEE Content Based Multimedia Indexing (2012)
Detecting and Labeling Folk Literature in Spoken Cultural Heritage Archives using Structural and Prosodic Features, and , in: IEEE Content Based Multimedia Indexing, 2012 |
|
Proceedings of Interspeech (2012)
DiarTk: An Open Source Toolkit for Research in Multistream Speaker Diarization and its Application to Meetings Recordings, and , in: Proceedings of Interspeech, 2012 |
|
Proceedings of the eighth international conference on Language Resources and Evaluation (LREC) (2012)
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns, , , , and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), pages 5, 2012 |
|
8th Joint ACL-ISO Workshop on Interoperable Semantic Annotation (2012)
Empirical validations of multilingual annotation schemes for discourse relations, , , and , in: 8th Joint ACL-ISO Workshop on Interoperable Semantic Annotation, 2012 |
|
Proceedings of the European Conference on Computer Vision (2012)
Exact Acceleration of Linear Object Detectors, and , in: Proceedings of the European Conference on Computer Vision, 2012 |
|
Proceedings of the eighth international conference on Language Resources and Evaluation (LREC) (2012)
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies, and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), Istanbul, TR, pages 6, 2012 |
|
12th International Conference on Knowledge Management and Knowledge Technologies (2012)
Extracting Informative Textual Parts from Web Pages Containing User-Generated Content, , and , in: 12th International Conference on Knowledge Management and Knowledge Technologies, ACM ICPS, Graz, Austria, pages 4:1-4:8, ACM, 2012 |
[URL] |
Artificial Neural Networks and Machine Learning (2012)
Face Recognition with Disparity Corrected Gabor Phase Differences, , and , in: Artificial Neural Networks and Machine Learning, Heidelberg, pages 411-418, Springer Berlin, 2012 |
[DOI] |
Proceedings of the 11th International Conference of the Biometrics Special Interest Group (2012)
Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, , and , in: Proceedings of the 11th International Conference of the Biometrics Special Interest Group, Darmstadt, Germany, pages 397-408, GI-Edition, 2012 |
|
Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI) (2012)
FaceTube: predicting personality from facial expressions of emotion in online conversational video, , and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2012 |
|
Proceedings of ACM Multimedia 2012 (2012)
From Speech to Personality: Mapping Voice Quality and Intonation into Personality Differences, , , and , in: Proceedings of ACM Multimedia 2012, 2012 |
|
IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition (2012)
Gaze Estimation From Multimodal Kinect Data, and , in: IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, Providence, RI, USA, 2012 |
[DOI] |
Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. (2012)
Generating Exact Lattices in The WFST Framework, , , , , , , , , , , , and , in: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing., The 37th International Conference on Acoustics, Speech, and Signal Processing, Kyoto, JP, Kyoto, Japan, pages 4213-4216, IEEE Signal Processing Society, 2012 |
[DOI] |
Actes de la conférence conjointe JEP-TALN-RECITAL 2012 (2012)
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , in: Actes de la conférence conjointe JEP-TALN-RECITAL 2012, Grenoble, France, pages 193-200, ATALA/AFCP, 2012 |
|
Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing (2012)
IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, , and , in: Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, Japan, pages 4413-4416, 2012 |
Proceedings of the British Machine Vision Conference (2012)
Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, and , in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012 |
|
Int Conf. on Multimodal Interaction (ICMI), Santa Monica (2012)
Investigating the Midline Effect for Visual Focus of Attention Recognition, and , in: Int Conf. on Multimodal Interaction (ICMI), Santa Monica, 2012 |
|
Proceedings of the 21st ACM Conference on Information and Knowledge Management (2012)
Iterative Relevance Feedback with Adaptive Exploration/Exploitation Trade-off, and , in: Proceedings of the 21st ACM Conference on Information and Knowledge Management, pages 1323-1331, 2012 |
|
Proceedings of the British Machine Vision Conference (2012)
Leveraging over prior knowledge for online learning of visual categories, , , and , in: Proceedings of the British Machine Vision Conference, 2012 |
|
Proceedings of the International Conference on Multimodal Interaction (ICMI), Santa Monica, USA (2012)
Linking Speaking and Looking Behavior Patterns with Group Composition, Perception, and Performance, , , , and , in: Proceedings of the International Conference on Multimodal Interaction (ICMI), Santa Monica, USA, 2012 |
|
Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA) (2012)
Machine Translation of Labeled Discourse Connectives, , , and , in: Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), pages 10, 2012 |
|
International Conference on Machine Learning and Applications (2012)
Macro-Action Discovery Based on Change Point Detection and Boosting, and , in: International Conference on Machine Learning and Applications, 2012 |
|
Proceedings of the 2012 IEEE Workshop on Spoken Language Technology (2012)
MediaParl: Bilingual mixed language accented speech database, , , , , and , in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263-268, 2012 |
|
IEEE 7th Sensor Array and Multichannel Signal Processing Workshop (SAM) (2012)
Microphone Array Beampattern Characterization for Hands-free Speech Applications, , and , in: IEEE 7th Sensor Array and Multichannel Signal Processing Workshop (SAM), Hoboken, NJ, USA, pages 473-476, 2012 |
|
Proceedings of International Conference on Multimodal Interaction, ICMI 2012, Santa Monica, CA (2012)
Modeling dominance effects on nonverbal behaviors using granger causality, , , , , and , in: Proceedings of International Conference on Multimodal Interaction, ICMI 2012, Santa Monica, CA, 2012 |
|
Proceedings International Conference on MultiMedia Modeling (2012)
Multimodal Cue Detection Engine for Orchestrated Entertainment, , , and , in: Proceedings International Conference on MultiMedia Modeling, Klagenfurt, Austria, 2012 |
|
Proceedings of INTERSPEECH 2012 (2012)
On Speaker-Independent Personality Perception and Prediction from Speech, , , , , and , in: Proceedings of INTERSPEECH 2012, 2012 |
|
34th Annual Conference of the IEEE Engineering in Medicine & Biology Society (2012)
On the Challenge of Classifying 52 Hand Movements from Surface Electromyography, , and , in: 34th Annual Conference of the IEEE Engineering in Medicine & Biology Society, 2012 |
|
Proceedings of the 11th International Conference of the Biometrics Special Interest Group (2012)
On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, , and , in: Proceedings of the 11th International Conference of the Biometrics Special Interest Group, 2012 |
|
Working Notes of the ImageCLEF 2012 Laboratory (2012)
Overview of the ImageCLEF 2012 Robot Vision Task, , and , in: Working Notes of the ImageCLEF 2012 Laboratory, 2012 |
|
ACM Multimedia (2012)
Predicting the Conflict Level in Television Political Debates: an Approach Based on Crowdsourcing, Nonverbal Communication and Gaussian Processes, , , and , in: ACM Multimedia, 2012 |
Workshop on Child, Computer and Interaction (2012)
Reading Companion: The Technical and Social Design of an Automated Reading Tutor, , , , , and , in: Workshop on Child, Computer and Interaction, Portland, Oregon, U.S.A., 2012 |
|
IEEE International Conference on Intelligent Robots and Systems (IROS) - Human Behavior Understanding Workshop (IROS-HBU) (2012)
Recognizing the Visual Focus of Attention for Human Robot Interaction, , and , in: IEEE International Conference on Intelligent Robots and Systems (IROS) - Human Behavior Understanding Workshop (IROS-HBU), 2012 |
|
Proceedings of 5th International Conference on Cognitive Systems (2012)
Robot-to-group Interaction in a Vernissage: Architecture & Dataset for Multi-party Dialog, , , , , , , and , in: Proceedings of 5th International Conference on Cognitive Systems, 2012 |
|
Proceedings of Interspeech (2012)
Robust triphone mapping for acoustic modeling, , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
International Symposium on Wearable Computers (2012)
Socio-Technical Network Analysis from Wearable Interactions, , and , in: International Symposium on Wearable Computers, 2012 |
|
Proceedings of the IEEE Workshop on Spoken Language Technology (2012)
Speaker Diarization and Linking of Large Corpora, and , in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012 |
|
Proceedings of International Conference on Acoustic, Speech and Signal Processing (2012)
Speaker Diarization of Meetings based on large TDOA feature vectors, and , in: Proceedings of International Conference on Acoustic, Speech and Signal Processing, 2012 |
|
INTERSPEECH (2012)
Speaker diarization of overlapping speech based on silence distribution in meeting recordings, and , in: INTERSPEECH, Portland, Oregon, USA, 2012 |
|
SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition (2012)
Structured Sparse Coding for Microphone Array Location Calibration, , , and , in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012 |
|
Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech) (2012)
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , in: Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech), Portland, Oregon, 2012 |
|
Proceedings of Interspeech (2012)
Supervised and unsupervised Web-based language model domain adaptation, , , and , in: Proceedings of Interspeech, Portland, Oregon, USA, 2012 |
|
Synthetic References for Template-based ASR using Posterior Features, , and , in: Proceedings of Interspeech, Portland, Oregon, USA, 2012 |
|
SAPA-SCALE Conference, International Speech Communication Association (2012)
Template-based ASR using Posterior features and synthetic references: comparing different TTS systems, , and , in: SAPA-SCALE Conference, International Speech Communication Association, 2012 |
|
Proceedings of AAAI International Conference on Weblogs and Social Media (2012)
The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012 |
|
NIST Speaker Recognition Conference (2012)
The I4U Submission to the 2012 NIST Speaker Recognition Evaluation, , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: NIST Speaker Recognition Conference, 2012 |
The Idiap Speaker Recognition Evaluation System at NIST SRE 2012, , and , in: NIST Speaker Recognition Conference, NIST, Orlando, USA, 2012 |
|
Proceedings of INTERSPEECH (2012)
The INTERSPEECH 2012 Speaker Trait Challenge, , , , , , , , , , , and , in: Proceedings of INTERSPEECH, 2012 |
Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA) (2012)
Translating English Discourse Connectives into Arabic: a Corpus-based Analysis and an Evaluation Metric, and , in: Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), 2012 |
|
European Conference on Computer Vision (2012)
Unsupervised Activity Analysis and Monitoring algorithms for Effective Surveillance Systems, , , , , , , and , in: European Conference on Computer Vision, 2012 |
|
RecSys, Recommendation Utility Evaluation (RUE 2012) (2012)
Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, and , in: RecSys, Recommendation Utility Evaluation (RUE 2012), Dublin, Ireland, pages 15-20, 2012 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2012)
Using KL-divergence and multilingual information to improve ASR for under-resourced languages, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869-4872, 2012 |
|
Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra) (2012)
Using Sense-labeled Discourse Connectives for Statistical Machine Translation, and , in: Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra), Avignon, FR, pages 129-138, 2012 |
|
Proceedings of Interspeech (2012)
Using Sparse Classification Outputs as Feature Observations for Noise Robust ASR, , , , , and , in: Proceedings of Interspeech, 2012 |
|
European Signal Processing Conference (2011)
A Bimodal Sound Source Model for Vehicle Tracking in Traffic Monitoring, , , and , in: European Signal Processing Conference, 2011 |
|
Proceedings of the 19th European Signal Processing Conference (EUSIPCO) (2011)
A BSS-based Approach for Localization of Simultaneous Speakers in Reverberant Conditions, , , and , in: Proceedings of the 19th European Signal Processing Conference (EUSIPCO), 2011 |
|
Proceedings of International Symposium on Artificial Intelligence and Signal Processing (2011)
A Compressive Sensing Based Compressed Neural Network for Sound Source Localization, , and , in: Proceedings of International Symposium on Artificial Intelligence and Signal Processing, 2011 |
|
Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?" (2011)
A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation, , , and , in: Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?", Hamburg, Germany, pages 5, 2011 |
|
IEEE International Workshop on Socially Intelligent Surveillance and Monitoring (2011)
A Joint Estimation of Head and Body Orientation Cues in Surveillance Video, , and , in: IEEE International Workshop on Socially Intelligent Surveillance and Monitoring, 2011 |
|
SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session (2011)
A Just-in-Time Document Retrieval System for Dialogues or Monologues, , , and , in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, Portland, OR, pages 350-352, 2011 |
|
Proceedings of the 22nd British Machine Vision Conference (2011)
A Large-Scale Database of Images and Captions for Automatic Face Naming, , , and , in: Proceedings of the 22nd British Machine Vision Conference, 2011 |
|
Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics) (2011)
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), Portland, OR, pages 80-86, 2011 |
[URL] |
Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future (2011)
An Audio Visual Corpus for Emergent Leader Analysis, , and , in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011 |
The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays (2011)
An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, , , , and , in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011 |
|
Proceedings of Interspeech 2011 (2011)
Analysis and Comparison of Recent MLP Features for LVCSR Systems, , and , in: Proceedings of Interspeech 2011, 2011 |
|
Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (2011)
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , in: Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, Edinburgh, UK, 2011 |
|
1st International SystemsX.ch Conference on Systems Biology (2011)
Automated Quantification of Morphodynamics for High-Throughput Live Cell Imaging Datasets, , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
Proceedings International Conference on Signal Acquisition and Processing (2011)
Automatic Time Skew Detection and Correction, , in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Proceedings of the Neural Information Processing Systems Conference (2011)
Boosting with Maximum Adaptive Sampling, and , in: Proceedings of the Neural Information Processing Systems Conference, 2011 |
Proceedings of Corpus Linguistics Conference (2011)
Building 'directional corpora' for unbiased contrastive analysis, and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 29-30, 2011 |
|
Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA (2011)
Competition on Counter Measures to 2-D Facial Spoofing Attacks, , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011 |
|
International Conference on Artificial Intelligence and Statistics (2011)
Deep Learning for Efficient Discriminative Parsing, , in: International Conference on Artificial Intelligence and Statistics, 2011 |
|
The Eleventh IEEE International Workshop on Visual Surveillance (2011)
Detection-Based Multi-Human Tracking Using a CRF Model, , and , in: The Eleventh IEEE International Workshop on Visual Surveillance, 2011 |
|
Proceedings of Corpus Linguistics Conference (2011)
Disambiguating discourse connectives using parallel corpora: senses vs. translations, , , , , and , in: Proceedings of Corpus Linguistics Conference, Birmingham, UK, pages 104-105, 2011 |
|
Proceedings of ACL-HLT 2011 Student Session (2011)
Disambiguating Temporal-Contrastive Discourse Connectives for Machine Translation, , in: Proceedings of ACL-HLT 2011 Student Session, Association for Computational Linguistics, Portland, OR, pages 46--51, 2011 |
|
Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue (2011)
Engagement-based Multi-party Dialog with a Humanoid Robot, , , , , , and , in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011 |
|
International Conference on Ambient Computing, Applications, Services and Technologies (2011)
Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, , in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011 |
IEEE Conference on Automatic Face and Gesture Recognition (2011)
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 525-530, IEEE, 2011 |
|
Exploiting observers' judgements for nonverbal group interaction analysis, , and , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 6, IEEE, 2011 |
|
IEEE Conference on Computer Vision and Pattern Recognition (2011)
Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2011 |
|
Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (2011)
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011 |
|
IAPR IEEE International Joint Conference on Biometrics (2011)
Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, , and , in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011 |
|
IEEE/ACM 13th International Conference on Multimodal Interaction (2011)
Finding Audio-Visual Events in Informal Social Gatherings, , , and , in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011 |
|
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2011)
FlowBoost - Appearance Learning from Sparsely Annotated Video, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2011 |
Proceedings of Interspeech (2011)
Grapheme-based Automatic Speech Recognition using KL-HMM, , , and , in: Proceedings of Interspeech, 2011 |
|
15th annual International Symposium on Wearable Computers (2011)
GroupUs: Smartphone Proximity Data and Human Interaction Type Mining, and , in: 15th annual International Symposium on Wearable Computers, San Francisco, USA, 2011 |
|
Proceedings of the IEEE International Conference on Computer Vision (2011)
HEAT: Iterative Relevance Feedback with One Million Images, and , in: Proceedings of the IEEE International Conference on Computer Vision, pages 2118-2125, 2011 |
Proceedings of Interspeech (2011)
Hierarchical Tandem Features for ASR in Mandarin, , and , in: Proceedings of Interspeech, 2011 |
Proceedings of 4th Workshop on Building and Using Comparable Corpora (2011)
How Comparable are Parallel Corpora? Measuring the Distribution of General Vocabulary and Connectives, , , and , in: Proceedings of 4th Workshop on Building and Using Comparable Corpora, ACL, Portland, OR, pages 78--86, 2011 |
|
Proceeding of IEEE Int Conference on Systems, Man, and Cybernetics - Special Sessions (2011)
Humans as Feature Extractors: Combining Prosody and Personality Perception for Better Speaking Style Recognition, and , in: Proceeding of IEEE Int Conference on Systems, Man, and Cybernetics - Special Sessions, 2011 |
|
Proceedings European Signal Processing Conference (2011)
Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, , in: Proceedings European Signal Processing Conference, Barcelona, Spain, 2011 |
|
Artificial Neural Networks and Machine Learning - ICANN 2011 (2011)
Improving Articulatory Feature and Phoneme Recognition using Multitask Learning, and , in: Artificial Neural Networks and Machine Learning - ICANN 2011, pages 299-306, Springer Berlin / Heidelberg, 2011 |
[DOI] [URL] |
Proceedings of Interspeech (2011)
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , in: Proceedings of Interspeech, Florence, Italy, pages 537-540, 2011 |
|
Neural Information Processing Systems (NIPS) Workshop on Modeling Human Communication Dynamics (HCD) (2011)
Inferring truth from multiple annotators for social interaction analysis, , and , in: Neural Information Processing Systems (NIPS) Workshop on Modeling Human Communication Dynamics (HCD), pages 4, 2011 |
|
Interspeech (2011)
Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings, and , in: Interspeech, Florence, Italy, pages 953-956, 2011 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP (2011)
Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pages 5192-5195, 2011 |
[DOI] |
International Joint Conference on Biometrics (2011)
Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, , , and , in: International Joint Conference on Biometrics, 2011 |
British Machine Vision Conference (2011)
Joint Adaptive Colour Modelling and Skin, Hair and Clothing Segmentation Using Coherent Probabilistic Index Maps, and , in: British Machine Vision Conference, British Machine Vision Association, Dundee, UK, 2011 |
|
Proceedings of International Conference on Document Analysis and Recognition (2011)
Joint Optimization of Hidden Conditional Random Fields and Non Linear Feature Extraction, , and , in: Proceedings of International Conference on Document Analysis and Recognition, 2011 |
Proceedings IEEE International Conference on Multimedia & Expo (2011)
Just-in-Time Multimodal Association and Fusion from Home Entertainment, , , and , in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2011)
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prague, CZ, pages 5012-5015, 2011 |
|
Proceedings of Interspeech (2011)
Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus, and , in: Proceedings of Interspeech, 2011 |
|
Conference on Artificial Intelligence (2011)
Learning Structured Embeddings of Knowledge Bases, , , and , in: Conference on Artificial Intelligence, 2011 |
|
Proceedings of International Conference on Ambient Intelligence (2011)
Look at who's talking, , , , and , in: Proceedings of International Conference on Ambient Intelligence, pages 68-76, 2011 |
Interspeech (2011)
LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, , and , in: Interspeech, 2011 |
|
1st International SystemsX.ch Conference on Systems Biology (2011)
Machine learning techniques to analyse complex, computer vision-extracted, dynamic cellular phenotypes, , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
2011 IEEE International Conference on Acoustics, Speech and Signal Processing (2011)
Model-based Compressive Sensing for Multi-party Distant Speech Recognition, , and , in: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 2011 |
|
1st International SystemsX.ch Conference on Systems Biology (2011)
Morphodynamic profiling to explore spatio-temporal signaling networks regulating neurite outgrowth, , , , , , and , in: 1st International SystemsX.ch Conference on Systems Biology, 2011 |
8th IEEE International Conference on Advanced Video and Signal-Based Surveillance (2011)
Multi-camera Open Space Human Activity Discovery for Anomaly Detection, , and , in: 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011 |
|
Proceedings of Interspeech (2011)
Multi-party Speech Recovery Exploiting Structured Sparsity Models, , , and , in: Proceedings of Interspeech, 2011 |
|
Proceedings of the 13th International Conference on Computer Vision (2011)
Multiclass Transfer Learning from Unconstrained Priors, , and , in: Proceedings of the 13th International Conference on Computer Vision, 2011 |
|
Proceedings of 12th SIGdial Meeting on Discourse and Dialogue (2011)
Multilingual Annotation and Disambiguation of Discourse Connectives for Machine Translation, , , and , in: Proceedings of 12th SIGdial Meeting on Discourse and Dialogue, Association for Computational Linguistics, Portland, OR, pages 194--203, 2011 |
|
Proceedings of International Conference on Acoustics, Speech and Signal Processing (2011)
MULTISTREAM SPEAKER DIARIZATION THROUGH INFORMATION BOTTLENECK SYSTEM OUTPUTS COMBINATION, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011 |
|
Pervasive (2011)
Pervasive Sensing to Model Political Opinions in Face-to-Face Networks, , , and , in: Pervasive, San Francisco, 2011 |
|
IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011 (2011)
Phoneme Recognition using Boosted Binary Features, , and , in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011 |
|
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (2011)
Posterior Features for Template-based ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, 2011 |
|
Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (2011)
Recent Developments in Social Signal Processing, , and , in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 380-385, 2011 |
13th International Conference on Multimodal Interaction (2011)
Smartphone usage in the wild: a large-scale analysis of applications and context, , and , in: 13th International Conference on Multimodal Interaction, 2011 |
|
Proceedings IEEE International Conference on Multimedia & Expo (2011)
Social Focus of Attention as a Time Function Derived from Multimodal Signals, and , in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011 |
|
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2011)
Speaker Diarization of Meetings based on Speaker Role N-gram Models, , and , in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011 |
|
IEEE 2011 Workshop on Automatic Speech Recognition and Understanding (2011)
The Kaldi Speech Recognition Toolkit, , , , , , , , , , , , and , in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011 |
|
European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (2011)
The MASH Project, , , and , in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011 |
|
International Conference on Signal Acquisition and Processing (2011)
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , in: International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
BigLearn, NIPS Workshop (2011)
Torch7: A Matlab-like Environment for Machine Learning, , and , in: BigLearn, NIPS Workshop, 2011 |
|
Proceedings of the IEEE International Conference on Social Computing (2011)
Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, , , , , and , in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011 |
IEEE International Conference on Robotics and Automation (2011)
Towards semi-supervised learning of semantic spatial concepts, and , in: IEEE International Conference on Robotics and Automation, 2011 |
|
Proceedings of the IEEE International Conference on Computer Vision (2011)
Tracking Multiple Objects under Global Appearance Constraints, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2011 |
Visual Surveillance Workshop at ICCV (2011)
Transferring Activities: Updating Human Behavior Analysis, , , , and , in: Visual Surveillance Workshop at ICCV, 2011 |
|
Proceedings of the 28th International Conference on Machine Learning (2011)
Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, and , in: Proceedings of the 28th International Conference on Machine Learning, 2011 |
|
Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (2011)
Understanding Social Signals in Multi-party Conversations: Automatic Recognition of Socio-Emotional Roles in the AMI Meeting Corpus, , , and , in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 374-379, 2011 |
Graph-based Methods for Natural Language Processing (2011)
Using a Wikipedia-based Semantic Relatedness Measure for Document Clustering, and , in: Graph-based Methods for Natural Language Processing, 2011 |
|
International Symposium on Wearable Computing (2011)
Who's Who with Big-Five: Analyzing and Classifying Personality Traits with Smartphones, , and , in: International Symposium on Wearable Computing, pages 8, 2011 |
|
Proceedings of AAAI International Conference on Weblogs and Social Media (2011)
You Are Known by How You Vlog: Personality Impressions and Nonverbal Behavior in YouTube, , and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Barcelona, 2011 |
|
Proceedings of Interspeech, Japan (2010)
A Comparative Study of MLP Front-ends for Mandarin ASR, , , , and , in: Proceedings of Interspeech, Japan, 2010 |
|
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (2010)
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010 |
|
CLEF 2010 Notebook Papers/LABs/Workshops (2010)
A Multi Cue Discriminative Approach to Semantic Place Classification, , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
|
LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010 (2010)
A Multimodal Corpus for Studying Dominance in Small Group Conversations, , and , in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010 |
|
Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), Carnegie Mellon University, Pittsburgh, PA, USA (2010)
A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, and , in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), Carnegie Mellon University, Pittsburgh, PA, USA, 2010 |
|
NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions (2010)
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, , and , in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010 |
|
Proceedings of Interspeech (2010)
Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , in: Proceedings of Interspeech, 2010 |
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010 |
|
An Alternative Scanning Strategy to Detect Faces, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
Proceedings of Interspeech (2010)
An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
2010 IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
Analysis of Phone Posterior Feature Space Exploiting Class Specific Sparsity and MLP-based Similarity Measure, , and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Application of Out-Of-Language Detection To Spoken-Term Detection, and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Proceedings of the 33rd Annual ACM SIGIR Conference (2010)
Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives, , , , , and , in: Proceedings of the 33rd Annual ACM SIGIR Conference, Geneva, Switzerland, pages 703, 2010 |
[DOI] |
Proceedings of the ACM International Conference on Multimedia (2010)
Automatic Role Recognition Based on Conversational and Prosodic Behaviour, , , and , in: Proceedings of the ACM International Conference on Multimedia, 2010 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
2010 IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, , and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010 |
|
20th International Conference on Pattern Recognition, Istanbul, Turkey (2010)
Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010 |
|
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2010)
Delineating Trees in Noisy 2D Images and 3D Image Stacks, , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2799–2806, 2010 |
Proceedings of 5th International Symposium on Telecommunications (2010)
Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain, , , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
|
Proceedings of 9th International Conference on Mobile and Ubiquitous Multimedia (MUM), Limassol, Cyprus (2010)
Discovering Human Places of Interest from Multimodal Mobile Phone Data, and , in: Proceedings of 9th International Conference on Mobile and Ubiquitous Multimedia (MUM), Limassol, Cyprus, 2010 |
|
Proceedings of Interspeech, Makuhari, Japan, 2010 (2010)
English Spoken Term Detection in Multilingual Recordings, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010 |
|
ICASSP 2010 (2010)
Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, , , and , in: ICASSP 2010, 2010 |
|
NIPS workshop on Learning and Planning from Batch Time Series Data (2010)
Extracting Motifs from Time Series Generated by Concurrent Activities, , and , in: NIPS workshop on Learning and Planning from Batch Time Series Data, 2010 |
|
ECCV, Workshop on Face Detection: Where we are, and what next? (2010)
Fast Bounding Box Estimation based Face Detection, and , in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010 |
[URL] |
International Conference on Speech and Language Processing, Interspeech (2010)
Floor Holder Detection and End of Speaker Turn Prediction in Meetings, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010 |
|
20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010 (2010)
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010, Istanbul, Turkey, 2010 |
|
Proceedings of Interspeech (2010)
Hands Free Audio Analysis from Home Entertainment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Hierarchical Multilayer Perceptron based Language Identification, , and , in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010 |
|
Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction (2010)
Identifying Emergent Leadership in Small Groups using Nonverbal Communicative Cues, , , and , in: Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, Beijing, ACM, New York, NY, USA, 2010 |
[DOI] |
Proceedings of ISCA Speech Synthesis Workshop (2010)
Implementation of VTLN for Statistical Speech Synthesis, , , and , in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
|
Proceedings of ACM Multimedia Workshop on Social Signal Processing (2010)
Improving Speech Processing trough Social Signals: Automatic Speaker Segmentation of Political Debates using Role based Turn-Taking Patterns., and , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010 |
|
IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems (2010)
Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, and , in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010 |
|
Proceedings of the Neural Information Processing Systems Conference (2010)
Joint Cascade Optimization Using a Product Of Boosted Classifiers, and , in: Proceedings of the Neural Information Processing Systems Conference, pages 1315–1323, 2010 |
Proc. of the 18th Intl. Conf. on Multimedia (2010)
Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, , and , in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010 |
Advances in Neural Information Processing Systems 23 (NIPS10) (2010)
Learning from Candidate Labeling Sets, and , in: Advances in Neural Information Processing Systems 23 (NIPS10), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2010 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
Leveraging speaker diarization for meeting recognition from distant microphones, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390--4393, 2010 |
2010 IEEE Second International Conference on Social Computing, SIN Symposium (2010)
Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, and , in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010 |
|
Proceedings of the International Workshop on Mobile HCI (2010)
Mobile Social Signal Processing: vision and research issues, , and , in: Proceedings of the International Workshop on Mobile HCI, Lisbon, pages 513-516, 2010 |
|
International Conference on Acoustics, Speech, and Signal Processing (2010)
Multistream Speaker Diarization beyond Two Acoustic Feature Streams, , and , in: International Conference on Acoustics, Speech, and Signal Processing, 2010 |
|
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (2010)
Neural conditional random fields, and , in: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna, Sardinia, Italy, JMLR: W&CP, 2010 |
|
International Conference on Intelligent Robots and Systems (2010)
Object Recognition using Visuo-Affordance Maps, , , and , in: International Conference on Intelligent Robots and Systems, Taipei, pages 1572-1578, IEEE, 2010 |
[DOI] |
In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop (2010)
OM-2: An Online Multi-class Multi-kernel Learning Algorithm, , , , and , in: In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop, 2010 |
|
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2010)
Online-Batch Strongly Convex Multi Kernel Learning, , and , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010 |
|
Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus (2010)
Recognizing conversational context in group interaction using privacy-sensitive mobile sensors, , , and , in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010 |
|
Proceedings of IEEE Computer Vision and Pattern Recognition Conference (2010)
Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer, , and , in: Proceedings of IEEE Computer Vision and Pattern Recognition Conference, San Francisco, CA, pages 3081-3088, IEEE, 2010 |
[DOI] |
Proceedings of 5th International Symposium on Telecommunications (2010)
Single Channel Speech Separation with a Frame-based Pitch Range Estimation Method in Modulation Frequency, , , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
|
Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands) (2010)
Social Signal Processing: Understanding Nonverbal Communication in Social Interactions, and , in: Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands), 2010 |
|
Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring (2010)
Socially Intelligent Surveillance and Monitoring: Analysing Social Dimensions of Physical Space, , and , in: Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, San Francisco, pages 51-58, 2010 |
|
Proceedings of Interspeech (2010)
Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Proceedings of 5th International Symposium on Telecommunications (2010)
Speech Enhancement using an Improved MMSE Estimator with Laplacian Prior, , and , in: Proceedings of 5th International Symposium on Telecommunications, 2010 |
|
ACM Multimedia Workshop on Searching Spontaneous Conversational Speech (2010)
The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites, , , and , in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010 |
|
CLEF 2010 Notebook Papers/LABs/Workshops (2010)
The Robot Vision Track at ImageCLEF 2010, , , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
[URL] |
Proceedings of ACM Multimedia Workshop on Social Signal Processing (2010)
The Voice of Personality: Mapping Nonverbal Vocal Behavior into Trait Attributions, , and , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010 |
|
ACM Multimedia (2010)
The Wolf Corpus: Exploring group behaviour in a competitive role-playing game, and , in: ACM Multimedia, 2010 |
|
DIRAC Workshop at the European Conference on Machine Learning (2010)
Towards a quantitative measure of rareness, and , in: DIRAC Workshop at the European Conference on Machine Learning, pages 129-136, Springer Berlin Heidelberg, 2010 |
[DOI] |
7th International Conference on Language Resources and Evaluation (2010)
Towards a standard for dialogue act annotation, , , , , , , , , , , and , in: 7th International Conference on Language Resources and Evaluation, Malta, 2010 |
[URL] |
Proceedings of Interspeech (2010)
Towards mixed language speech recognition systems, , and , in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010 |
|
Proc. ACM Int. Conf. on Pervasive Services (ICPS), Berlin (2010)
Towards rich mobile phone datasets: Lausanne data collection campaign, , , , and , in: Proc. ACM Int. Conf. on Pervasive Services (ICPS), Berlin, 2010 |
|
International Conference on Acoustics, Speech and Signal Processing (2010)
Using Audio and Visual Cues for Speaker Diarisation Initialisation, and , in: International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
Proceedings of ICASSP (2010)
VARIATIONAL BAYESIAN SPEAKER DIARIZATION OF MEETING RECORDINGS, , and , in: Proceedings of ICASSP, 2010 |
|
Proc. Int. Conf. on Computer Vision Theory and Applications (2010)
View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, and , in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010 |
|
ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland (2010)
Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, and , in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010 |
|
Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI) (2010)
Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010 |
|
Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC (2010)
Voices of Vlogging, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010 |
|
Proceedings of ICASSP (2010)
VTLN Adaptation for Statistical Speech Synthesis, , , and , in: Proceedings of ICASSP, Dallas, Texas, 2010 |
|
International Conference on Speech and Language Processing, Interspeech (2010)
Audio–Visual Synchronisation for Speaker Diarisation, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010 |
|
Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction) (2009)
A Multimedia Retrieval System Using Speech Input, , , , , , , , , , and , in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009 |
|
International Conference on Developmental Learning (2009)
A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems, , , and , in: International Conference on Developmental Learning, 2009 |
|
Proceeding of The 9th Asian Conference on Computer Vision (2009)
An online framework for learning novel concepts over multiple cues, , and , in: Proceeding of The 9th Asian Conference on Computer Vision, Xi'an, China, 2009 |
|
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09. (2009)
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009 |
[URL] |
10th Annual Conference of the International Speech Communication Association (2009)
Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, , and , in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, ISCA 2009, 2009 |
|
ACM International Conference on Multimedia (2009)
Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models, , and , in: ACM International Conference on Multimedia, 2009 |
|
10th Annual Conference of the International Speech Communication Association (2009)
Automatic vs. human question answering over multimedia meeting recordings, and , in: 10th Annual Conference of the International Speech Communication Association, 2009 |
|
International Conference on Biometrics (2009)
Bayesian Networks to Combine Intensity and Color Information in Face Recognition, and , in: International Conference on Biometrics, Springer, 2009 |
|
Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing) (2009)
Canal9: A database of political debates for analysis of social interactions, , , and , in: Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing), Amsterdam, Netherlands, 2009 |
[DOI] |
Proceedings ICME 2009 (2009)
Characterising Conversational Group Dynamics Using Nonverbal Behaviour, , and , in: Proceedings ICME 2009, 2009 |
|
Proceedings ICMI-MLMI (2009)
Discovering Group Nonverbal Conversational Patterns with Topics, and , in: Proceedings ICMI-MLMI, 2009 |
|
Proceedings of the British Machine Vision Conference (2009)
Dynamic Partitioned Sampling For Tracking With Discriminative Features, , and , in: Proceedings of the British Machine Vision Conference, London, 2009 |
|
12th International Conference on Text, Speech and Dialogue, TSD 2009 (2009)
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer - Verlag, Berlin Heidelberg 2009, 2009 |
|
Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance (2009)
Evaluation of Probabilistic Occupancy Map People Detection for Surveillance Systems, , , , and , in: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009 |
Proceedings of the 17th ACM International Conference on Multimedia (2009)
Flickr Hypergroups, , , , and , in: Proceedings of the 17th ACM International Conference on Multimedia, 2009 |
|
British Machine Vision Conference 2009 (2009)
Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, and , in: British Machine Vision Conference 2009, 2009 |
|
Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech) (2009)
Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, , , and , in: Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009 |
|
Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS) (2009)
Hill-Climbing Attack to an Eigenface-Based Face Verification System, , , , and , in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009 |
|
Proceedings of Interspeech 2009 (2009)
Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, , , and , in: Proceedings of Interspeech 2009, 2009 |
|
Proceedings of the ACM International Conference on Multimedia (2009)
Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, , , and , in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009 |
|
Proceedings of the IEEE International Conference on Computer Vision (2009)
Joint Pose Estimator and Feature Learning for Object Detection, , , and , in: Proceedings of the IEEE International Conference on Computer Vision, 2009 |
10th Annual Conference of the International Speech Communication Association (2009)
KL Realignment for Speaker Diarization with Multiple Feature Streams, , and , in: 10th Annual Conference of the International Speech Communication Association, 2009 |
ICMI-MLMI (2009)
Learning and Predicting Multimodal Daily Life Patterns from Cell Phones, and , in: ICMI-MLMI, 2009 |
|
IEEE Int. Conference on Image Processing, Cairo, Egypt (2009)
Learning Large Margin Likelihood for Realtime Head Pose Tracking, and , in: IEEE Int. Conference on Image Processing, Cairo, Egypt, IEEE, 2009 |
|
Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2009)
Learning Rotational Features for Filament Detection, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2009 |
Audio Engineering Society (AES), 127th Convention (2009)
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , in: Audio Engineering Society (AES), 127th Convention, Audio Engineering Society (AES), Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA, 2009 |
[URL] |
Proceedings International ICST Conference on User Centric Media (2009)
Memoirs of Togetherness from Audio Logs, , in: Proceedings International ICST Conference on User Centric Media, Venice, Italy, 2009 |
|
Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (2009)
MLP Based Hierarchical System for Task Adaptation in ASR, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009 |
|
IEEE International conference on Robotics and Automation (2009)
Model adaptation with least-square SVM for adaptive hand prosthetics, , , , and , in: IEEE International conference on Robotics and Automation, 2009 |
|
International Conference on Audio, Speech and Signal Processing (2009)
MULTI-MODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING COMPRESSED-DOMAIN VIDEO FEATURES, , and , in: International Conference on Audio, Speech and Signal Processing, 2009 |
|
Proceedings of International Conference on Acoustics, Speech and Signal Processing (2009)
MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009 |
|
Proceedings of International conference on acoustics speech and signal processing (2009)
Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, , and , in: Proceedings of International conference on acoustics speech and signal processing, 2009 |
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2009)
Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009 |
|
Proceedings IADIS International Conference Applied Computing (2009)
Out-of-Scene AV Data Detection, , in: Proceedings IADIS International Conference Applied Computing, Rome, Italy, 2009 |
|
Workshop of the Cross-Language Evaluation Forum (2009)
Overview of the CLEF 2009 medical image annotation track, , , , and , in: Workshop of the Cross-Language Evaluation Forum, Corfu, Greece, pages 85-93, Springer Berlin Heidelberg, 2009 |
[DOI] |
Proceedings of IEEE/IAPR International Conference on Biometrics (2009)
Parts-Based Face Verification using Local Frequency Bands, and , in: Proceedings of IEEE/IAPR International Conference on Biometrics, 2009 |
|
Proc. Int. Conf. on Multimodal Interfaces, Workshop on Multimodal Sensor-Based Systems and Mobile Phones for Social Computing (2009)
Predicting Remote Versus Collocated Group Interactions using Nonverbal Cues, , and , in: Proc. Int. Conf. on Multimodal Interfaces, Workshop on Multimodal Sensor-Based Systems and Mobile Phones for Social Computing, Cambridge, 2009 |
[DOI] |
IEEE International Conference on Acoustic, Speech, and Signal Processing (2009)
Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks, , , , , and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, 2009 |
|
Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (2009)
Robust Speaker Diarization for Short Speech Recordings, and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, pages 432-437, 2009 |
|
Proceedings of Interspeech (2009)
Robustness of Phase based Features for Speaker Recognition, , and , in: Proceedings of Interspeech, 2009 |
|
Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII) (2009)
Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings, , in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), 2009 |
|
Proceedings of ICMI-MLMI 2009 (2009)
Speaker Change Detection with Privacy-Preserving Audio Cues, , , and , in: Proceedings of ICMI-MLMI 2009, 2009 |
|
Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention (2009)
Steerable Features for Statistical 3D Dendrite Detection, , , , and , in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention, 2009 |
IEEE Proc. Int. Conf. on Multimedia and Expo (2009)
Structure and appearance features for robust 3D facial actions tracking, and , in: IEEE Proc. Int. Conf. on Multimedia and Expo, IEEE, 2009 |
|
British Machine Vision Conference (2009)
The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories, and , in: British Machine Vision Conference, 2009 |
|
9th International Workshop in Visual Surveillance (2009)
Topic Models for Scene Analysis and Abnormality Detection, and , in: 9th International Workshop in Visual Surveillance, IEEE, Kyoto, Japan, 2009 |
|
International Conference on Image Analysis and Processing (2009)
Towards a theoretical framework for learning multi-modal patterns for embodied agents, , , , , , , and , in: International Conference on Image Analysis and Processing, 2009 |
|
International Conference on Multimedia & Expo (2009)
Visual Activity Context For Focus of Attention Estimation in Dynamic Meetings, , and , in: International Conference on Multimedia & Expo, 2009 |
|
ACM Multimedia (2009)
Visual Speaker Localization Aided by Acoustic Models, , and , in: ACM Multimedia, 2009 |
Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2009)
Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009 |
|
Proceedings of the 17th ACM International Conference on Multimedia (2009)
Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior, and , in: Proceedings of the 17th ACM International Conference on Multimedia, ACM, 2009 |
|
Advances in Neural Information Processing Systems 22 (NIPS09) (2009)
Who's Doing What: Joint Modeling of Names and Verbs for Simultaneous Face and Pose Annotation, , and , in: Advances in Neural Information Processing Systems 22 (NIPS09), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2009 |
|
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Taiwan (2009)
YOU ARE FIRED! NONVERBAL ROLE ANALYSIS IN COMPETITIVE MEETINGS, , and , in: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Taiwan, 2009 |
|
International Conference on Robotic and Systems (2009)
You live, you learn, you forget: continuous learning of visual places with a forgetting mechanism, , and , in: International Conference on Robotic and Systems, 2009 |
10th Annual Conference of the International Speech Communication Association (2009)
Automatic Out-of-Language Detection Based on Confidence Measures Derived from LVCSR Word and Phone Lattices, , in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009 |
|
3rd ACM/IEEE Conf on Human-Robot Interaction (HRI08) (2008)
A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, , , , and , in: 3rd ACM/IEEE Conf on Human-Robot Interaction (HRI08), 2008 |
|
25th International Conference on Machine Learning (ICML) (2008)
A Distance Model for Rhythms, , , and , in: 25th International Conference on Machine Learning (ICML), 2008 |
|
Workshop of the Cross-Language Evaluation Forum (2008)
An SVM Confidence-Based Approach to Medical Image Annotation, , and , in: Workshop of the Cross-Language Evaluation Forum, 2008 |
|
Proc. of the Intl. Conf. on Image and Video Retrieval (2008)
Analyzing Flickr Groups, and , in: Proc. of the Intl. Conf. on Image and Video Retrieval, ACM, 2008 |
Int Conf Spatial Cognition 2008 (2008)
Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, , , , and , in: Int Conf Spatial Cognition 2008, 2008 |
|
First IEEE Workshop on CVPR for Human Communicative Behavior Analysis (2008)
Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, , , and , in: First IEEE Workshop on CVPR for Human Communicative Behavior Analysis, 2008 |
|
16th European Signal Processing Conference (2008)
Asynchronous detection and classification of oscillatory brain activity, , and , in: 16th European Signal Processing Conference, 2008 |
|
Proceedings of the European Conference on Computer Vision (2008)
Automated Delineation of Dendritic Networks in Noisy Image Stacks, , and , in: Proceedings of the European Conference on Computer Vision, 2008 |
AES 124th Convention, Audio Engineering Society (2008)
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , in: AES 124th Convention, Audio Engineering Society, 2008 |
|
Proceedings of the first International Conference on Cognitive Systems (2008)
Biologically Motivated Audio-Visual Cue Integration for Object, , , , , , , , and , in: Proceedings of the first International Conference on Cognitive Systems, 2008 |
|
Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts (2008)
Brain-Computer Interfaces for HCI and Games, , , , , and , in: Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, 2008 |
|
European Conf. on Computer Vision (2008)
Calibration from statistical properties of the visual world, , and , in: European Conf. on Computer Vision, 2008 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2008)
COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Proceedings of Interspeech (2008)
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , in: Proceedings of Interspeech, 2008 |
|
Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008) (2008)
Composite Kernel Learning, , and , in: Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), Omnipress, 2008 |
|
Proceedings of the 4th Intl. Brain-Computer Interface Workshop and Training Course (2008)
Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, , , , , , and , in: Proceedings of the 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008 |
|
Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers (2008)
Cue Integration for Medical Image Annotation, , and , in: Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Springer-Verlag, 2008 |
|
Workshop on Machine Learning and Multimodal Interaction (MLMI08) (2008)
Daily Routine Classification from Mobile Phone Data, and , in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), 2008 |
|
Proc. Int. Conf. on Pattern Recognition (ICPR) (2008)
Detecting queues at vending machines: a statistical layered approach, and , in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008 |
|
IEEE International Symposium on Wearable Computers (ISWC) (2008)
Discovering Human Routines from Cell Phone Data with Topic Models, and , in: IEEE International Symposium on Wearable Computers (ISWC), 2008 |
|
Proc. 16th European Signal Processing Conference (EUSIPCO) (2008)
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , in: Proc. 16th European Signal Processing Conference (EUSIPCO), 2008 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2008)
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Text, Speech and Dialogue (2008)
Exploiting Contextual Information for Speech/Non-Speech Detection, , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Workshop on Searching Spontaneous Conversational Speech at SIGIR (2008)
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , in: Workshop on Searching Spontaneous Conversational Speech at SIGIR, 2008 |
|
European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS) (2008)
Fast human detection from videos using covariance features, and , in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008 |
|
Interspeech 2008 (2008)
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , in: Interspeech 2008, 2008 |
|
MobileHCI 2008 (10th International Conference on Human-Computer Interaction with Mobile Devices and Services, Demonstrations Session) (2008)
Graphical representation of meetings on mobile devices, , and , in: MobileHCI 2008 (10th International Conference on Human-Computer Interaction with Mobile Devices and Services, Demonstrations Session), Amsterdam, 2008 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2008)
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
MLMI 2008 (2008)
Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , in: MLMI 2008, 2008 |
|
Interspeech 2008 (2008)
Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, , and , in: Interspeech 2008, 2008 |
|
International Conference on Automatic Face and Gesture Recognition (2008)
Identifying Dominant People in Meetings from Audio-Visual Sensors, and , in: International Conference on Automatic Face and Gesture Recognition, Amsterdam, The Netherlands, 2008 |
|
6th International Conference on Language Resources and Evaluation (2008)
Improving Contextual Quality Models for MT Evaluation Based on Evaluators' Feedback, , and , in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008 |
Interspeech 2008 (2008)
Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, , and , in: Interspeech 2008, 2008 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , in: Interspeech 2008, 2008 |
|
International Conference on Multi-modal Interfaces (2008)
Investigating Automatic Dominance Estimation in Groups From Visual Attention and Speaking Activity, , , , and , in: International Conference on Multi-modal Interfaces, 2008 |
|
16th European Signal processing Conference (EUSIPCO) (2008)
Multi-camera 3d person tracking with particle filter in a surveillance environment, and , in: 16th European Signal processing Conference (EUSIPCO), 2008 |
|
European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2) (2008)
Multi-camera multi-person 3d space tracking with mcmc in surveillance scenarios, and , in: European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2), Marseille, 2008 |
|
Proceedings of the European Conference on Computer Vision (2008)
Multi-Camera Tracking and Atypical Motion Detection with Behavioral Maps, , and , in: Proceedings of the European Conference on Computer Vision, 2008 |
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2008)
Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
International Conference on Computer Vision Systems (ICVS08) (2008)
Object Category Detection using Audio-visual Cues, , , , and , in: International Conference on Computer Vision Systems (ICVS08), 2008 |
|
Interspeech 2008 (2008)
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , in: Interspeech 2008, 2008 |
|
Text, Speech and Dialogue (2008)
Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, , , , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
ACM MM 2008 (2008)
Predicting the Dominant Clique in Meetings through Fusion of Nonverbal Cues, , , and , in: ACM MM 2008, 2008 |
|
Proceedings - ICMI 2008 (2008)
Predicting Two Facets of Social Verticality in Meetings from Five-Minute Time Slices and Nonverbal Cues, , , and , in: Proceedings - ICMI 2008, 2008 |
|
Proceedings of the International Conference on Computer Vision Theory and Applications (2008)
Principled Detection-by-classification from Multiple Views, , and , in: Proceedings of the International Conference on Computer Vision Theory and Applications, 2008 |
LangTech 2008 (2008)
Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, and , in: LangTech 2008, 2008 |
|
Proceedings of the 4th Intl. Brain-Computer Interface Workshop and Training Course (2008)
Recognition of Anticipatory Behavior from Human EEG, , and , in: Proceedings of the 4th Intl. Brain-Computer Interface Workshop and Training Course, 2008 |
|
LREC 2008 ELRA Workshop on Evaluation (2008)
Reference-based vs. task-based evaluation of human language technology, , in: LREC 2008 ELRA Workshop on Evaluation, ELRA, Marrakech, Morocco, 2008 |
|
11th International Conference on Text, Speech, and Dialogue (2008)
Reverse Correlation for analyzing MLP Posterior Features in ASR, , and , in: 11th International Conference on Text, Speech, and Dialogue, 2008 |
|
ACM International Conference on Multimedia (2008)
Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, , , , and , in: ACM International Conference on Multimedia, Vancouver, Canada, 2008 |
|
International Conference on Multimodal Interfaces (2008)
Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, , , and , in: International Conference on Multimodal Interfaces, Chania, Greece, 2008 |
|
Interspeech (2008)
Silence Models in Weighted Finite-State Transducers, , in: Interspeech, 2008 |
|
Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course (2008)
Simultaneous Real-Time Detection of Motor Imagery and Error-Related Potentials for Improved BCI Accuracy, and , in: Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008 |
|
Proceedings of the ACM International Conference on Multimedia (2008)
Social Signal Processing: State-of-the-Art and Future Perspectives of an Emerging Domain, , , and , in: Proceedings of the ACM International Conference on Multimedia, 2008 |
|
Proceedings of International Conference on Multimodal Interfaces (to appear) (2008)
Social Signals, their Function, and Automatic Analysis: A Survey, , , and , in: Proceedings of International Conference on Multimodal Interfaces (to appear), 2008 |
|
INTERSPEECH 2008 (2008)
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , in: INTERSPEECH 2008, 2008 |
|
EUSIPCO 2008 (2008)
Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, , and , in: EUSIPCO 2008, 2008 |
|
Proceedings of the 22nd Annual Conference on Neural Information Processing Systems (2008)
Support Vector Machines with a Reject Option, , , and , in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008 |
|
Proceedings of the IEEE International Conference on Robotics and Automation (ICRA08) (2008)
SVM-based Discriminative Accumulation Scheme for Place Recognition, , and , in: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA08), 2008 |
|
6th International Conference on Language Resources and Evaluation (2008)
Task-based evaluation of meeting browsers: from BET task elicitation to user behavior analysis, , , and , in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2008)
Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008 |
|
Machine Learning for Multimodal Interaction V (2008)
The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings, , , , , , , and , in: Machine Learning for Multimodal Interaction V, Utrecht, Springer-Verlag, 2008 |
[DOI] |
Proceedings of the International Conference on Multimodal Interfaces (2008)
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , in: Proceedings of the International Conference on Multimodal Interfaces, 2008 |
|
Int. Conf. on Machine Learning (2008)
The Projectron: a Bounded Kernel-Based Perceptron, , and , in: Int. Conf. on Machine Learning, 2008 |
|
Int. Conf. on Music Information Retrieval (ISMIR) (2008)
Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, , in: Int. Conf. on Music Information Retrieval (ISMIR), 2008 |
|
MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia (2008)
Topickr: Flickr Groups and Users Reloaded, and , in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008 |
European Conference on Computer Vision Workshop on Multi-camera and Multi-modal Sensor Fusion (2008)
Towards Audio-Visual On-line Diarization Of Participants In Group Meetings, and , in: European Conference on Computer Vision Workshop on Multi-camera and Multi-modal Sensor Fusion, 2008 |
|
IEEE International Conference on Robotics and Automation (2008)
Towards Robust Place Recognition for Robot Localization, , , , , and , in: IEEE International Conference on Robotics and Automation, 2008 |
|
11th International IEEE Conference on Intelligent Transportation Systems (ITSC) (2008)
Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis, , , , , , and , in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Beijing, 2008 |
|
International Conference on Multi-media & Expo (2008)
Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, and , in: International Conference on Multi-media & Expo, 2008 |
|
ACM International Conference on Multimedia (ACMMM) (2008)
What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, and , in: ACM International Conference on Multimedia (ACMMM), 2008 |
|
ICASSP'08 (2008)
Hierarchical Integration of Phonetic and Lexical Knowledge in Phone Posterior Estimation, and , in: ICASSP'08, 2008 |
|
ICSLP'08 (2008)
In-Context Phone Posteriors as Complementary Features for Tandem ASR, and , in: ICSLP'08, 2008 |
|
International Conference on Multi-Media & Expo (ICME07) (2007)
A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, and , in: International Conference on Multi-Media & Expo (ICME07), 2007 |
|
NIPS Workshop on Brain, Music and Cognition (2007)
A Generative Model for Rhythms, , , and , in: NIPS Workshop on Brain, Music and Cognition, 2007 |
|
European Symposium on Artificial Neural Networks, ESANN (2007)
A supervised learning approach based on STDP and polychronization in spiking neuron networks, , and , in: European Symposium on Artificial Neural Networks, ESANN, 2007 |
|
Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics (2007)
Adaptive Shared Control of a Brain-Actuated Simulated Wheelchair, , , , , , , and , in: Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, 2007 |
|
IEEE Automatic Speech Recognition and Understanding Workshop (2007)
AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2007 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2007)
An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Proceedings of the 13th International Symposium on Robotics Research (2007)
An Asynchronous and Non-Invasive Brain-Actuated Wheelchair, , , , , , , and , in: Proceedings of the 13th International Symposium on Robotics Research, 2007 |
|
Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications (2007)
Augmenting Astronaut's Capabilities through Brain-Machine Interfaces, , , and , in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007 |
|
3rd European Conference on Mobile Robots (ECMR 2007) (2007)
Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, , , and , in: 3rd European Conference on Mobile Robots (ECMR 2007), 2007 |
|
7th International Workshop on Multiple Classifier Systems, MCS (2007)
Biometric Person Authentication IS A Multiple Classifier Problem, and , in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007 |
|
Proceedings of the 12th International Conference on Human-Computer Interaction (2007)
Brain-Machine Interfaces through Control of Electroencephalographic Signals and Vibrotactile Feedback, , , , , , , , and , in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007 |
|
ACM International Conference on Multimedia (2007)
Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, and , in: ACM International Conference on Multimedia, 2007 |
|
Proceedings of ImageCLEF 2007 -LNCS (2007)
CLEF2007 Image Annotation Task: an SVM-based Cue Integration Approach, , and , in: Proceedings of ImageCLEF 2007 -LNCS, 2007 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2007)
Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
IEEE International Conference on Intelligent RObot Systems (IROS) (2007)
Confidence-based Cue Integration for Visual Place Recognition, and , in: IEEE International Conference on Intelligent RObot Systems (IROS), 2007 |
|
2nd Workshop on Speech in Mobile and Pervasive Environments (SiMPE) (2007)
Detection and Recognition of Number Sequences in Spoken Utterances, and , in: 2nd Workshop on Speech in Mobile and Pervasive Environments (SiMPE), 2007 |
|
Workshop on Non-Linear Speech Processing (2007)
Discriminative Keyword Spotting, , and , in: Workshop on Non-Linear Speech Processing, Paris, France, 2007 |
|
Advances in Neural Information Processing Systems 21 (2007)
EEG-Based Brain-Computer Interaction: Improved Accuracy by Automatic Single-Trial Error Detection, and , in: Advances in Neural Information Processing Systems 21, 2007 |
|
IEEE International Conference on Acoustics, Speech, and Signal Processing (2007)
ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007 |
2007
IEEE / IAPR Intl. Conf. On Biometrics (ICB) (2007)
Face Authentication with Salient Local Features and Static Bayesian Network, and , in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007 |
|
Proceedings of the IEEE International Symposium on Intelligent Signal Processing (2007)
Feature Extraction for Multi-Class BCI using Canonical Variates Analysis, , , , and , in: Proceedings of the IEEE International Symposium on Intelligent Signal Processing, 2007 |
|
4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI) (2007)
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding, , , and , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007 |
|
Interspeech 2007 (2007)
Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , in: Interspeech 2007, 2007 |
|
Advances in Neural Information Processing Systems 21 (2007)
Hierarchical Penalization, , and , in: Advances in Neural Information Processing Systems 21, 2007 |
|
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS07) (2007)
Incremental Learning for Place Recognition in Dynamic Environments, , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS07), 2007 |
|
18th British Machine Vision Conference (BMVC07) (2007)
Indoor Place Recognition using Online Independent Support Vector Machines, , , , and , in: 18th British Machine Vision Conference (BMVC07), 2007 |
|
International Conference on Speech Communication and Technology (INTERSPEECH) (2007)
Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, and , in: International Conference on Speech Communication and Technology (INTERSPEECH), 2007 |
|
International Conference on Machine Learning (ICML) (2007)
More Efficiency in Multiple Kernel Learning, , , and , in: International Conference on Machine Learning (ICML), 2007 |
|
CVPR 2007 Workshop on Visual Surveillance (VS2007) (2007)
Multi-Layer Background Subtraction Based on Color and Texture, and , in: CVPR 2007 Workshop on Visual Surveillance (VS2007), 2007 |
|
Interspeech 2007 (2007)
Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, , and , in: Interspeech 2007, 2007 |
|
Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence (2007)
Non-Invasive Brain-Actuated Interaction, , , , and , in: Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence, 2007 |
|
Interspeech-Eurospeech (2007)
Non-linear Spectral Contrast Stretching for In-car Speech Recognition, and , in: Interspeech-Eurospeech, 2007 |
|
Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD) (2007)
Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes, , , and , in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2007 |
|
2007
On Confusions in a Phoneme Recognizer, , and , 2007 |
|
4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI) (2007)
Posterior-Based Features and Distances in Template Matching for Speech Recognition, and , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007 |
|
Classification of Events, Activities and Relationship Evaluation and Workshop (2007)
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, and , in: Classification of Events, Activities and Relationship Evaluation and Workshop, 2007 |
|
Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'07 (2007)
Recognition and Understanding of Meetings The AMI and AMIDA Projects, , and , in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'07, 2007 |
|
IEEE International Conference on Multimedia and Expo (ICME) (2007)
Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, , and , in: IEEE International Conference on Multimedia and Expo (ICME), 2007 |
|
International Conference on Machine Learning (ICML) (2007)
Sparse Probabilistic Classifiers, and , in: International Conference on Machine Learning (ICML), 2007 |
|
International Conference on Computer Vision Systems (ICVS07) (2007)
SVM-based Transfer of Visual Knowledge Across Robotic Platforms, , and , in: International Conference on Computer Vision Systems (ICVS07), 2007 |
|
In the book, Constructing Ambient Intelligence: AmI-07 Workshops Proceedings, Max Mühlhäuser, Alois Ferscha, and Erwin Aitenbichler (Eds.), LNCS, Springer Verlag, 2008. (2007)
The use of brain-computer interfacing for ambient intelligence, , , , , and , in: In the book, Constructing Ambient Intelligence: AmI-07 Workshops Proceedings, Max Mühlhäuser, Alois Ferscha, and Erwin Aitenbichler (Eds.), LNCS, Springer Verlag, 2008., 2007 |
|
1st International Conference on Cognitive Neurodynamics (ICCN 2007) (2007)
To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, , and , in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2007)
Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
2007
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, , , , , , , , and , 2007 |
|
Proceedings of the 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (2007)
Vibrotactile Feedback in the Context of Mu-Rhythm based BCI, , , , , , , , , , , and , in: Proceedings of the 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2007 |
|
Proceedings of the 1st International Conference on Cognitive Neurodynamics (2007)
Visuo-Spatial Attention Frame Recognition for Brain-Computer Interfaces, , , , , , and , in: Proceedings of the 1st International Conference on Cognitive Neurodynamics, 2007 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2007)
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Classification of Events, Activities, and Relationships (CLEAR) 2006 (2006)
2D Multi-Person Tracking: A Comparative Study in AMI Meetings, , , , , and , in: Classification of Events, Activities, and Relationships (CLEAR) 2006, 2006 |
|
European Conference on Machine Learning (ECML) (2006)
A Discriminative Approach for the Retrieval of Images from Text Queries, , and , in: European Conference on Machine Learning (ECML), 2006 |
|
IEEE International Conference on Intelligent RObot Systems (IROS) (2006)
A Discriminative Approach to Robust Visual Place Recognition, , , and , in: IEEE International Conference on Intelligent RObot Systems (IROS), 2006 |
|
Second Workshop on Multimodal User Authentication, MMUA (2006)
A Max Kernel For Text-Independent Speaker Verification Systems, and , in: Second Workshop on Multimodal User Authentication, MMUA, 2006 |
|
International Conference on Artificial Neural Networks (ICANN) (2006)
A Neural Network to Retrieve Images from Text Queries, and , in: International Conference on Artificial Neural Networks (ICANN), 2006 |
|
3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06) (2006)
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, and , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006 |
|
IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI) (2006)
Analyzing Group Interactions in Conversations: a Review, , in: IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI), 2006 |
|
Workshop on Multimodal User Authentication (MMUA) (2006)
Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, , , and , in: Workshop on Multimodal User Authentication (MMUA), 2006 |
|
the Springer series of Lecture Notes in Computer Science (2006)
Constructing visual models with a latent space approach, , , and , in: the Springer series of Lecture Notes in Computer Science, 2006 |
|
Proceedings of International Conference on Pattern Recognition (ICPR) (2006)
Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, , and , in: Proceedings of International Conference on Pattern Recognition (ICPR), 2006 |
|
IEEE Performance Evaluation of Tracking and Surveillance Workshop (PETS) (2006)
Detecting Abandoned Luggage Items in a Public Space, , and , in: IEEE Performance Evaluation of Tracking and Surveillance Workshop (PETS), 2006 |
|
In the Eighth International Conference on Multimodal Interfaces (ICMI'06) (2006)
Detection and Application of Influence Rankings in Small Group Meetings, , , and , in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006 |
|
International Conference on Spoken Language Processing (2006)
Discriminant linear processing of time-frequency plane, and , in: International Conference on Spoken Language Processing, 2006 |
|
The 9th International Conference on Spoken Language Processing (INTERSPEECH) (2006)
Discriminative Kernel-Based Phoneme Sequence Recognition, , , , and , in: The 9th International Conference on Spoken Language Processing (INTERSPEECH), Pittsburgh, PA, 2006 |
|
In the Eighth International Conference on Multimodal Interfaces (ICMI'06) (2006)
Exploring Contextual Information in a Layered Framework for Group Action Recognition, , and , in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006 |
|
9th European Conference on Computer Vision (ECCV) (2006)
Face Authentication Using Adapted Local Binary Pattern Histograms, and , in: 9th European Conference on Computer Vision (ECCV), 2006 |
|
ACM Int. Conf. on Human-Centered Multimedia (HCM) (2006)
Finding groups of people in Google news, and , in: ACM Int. Conf. on Human-Centered Multimedia (HCM), 2006 |
|
IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR) (2006)
Hand Posture Classification and Recognition using the Modified Census Transform, , and , in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006 |
|
Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006 (2006)
Haptic Feedback Compared with Visual Feedback for BCI, , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
High Frequency Bands and Estimated Local Field Potentials to Improve Single-Trial Classification of Electroencephalographic Signals, , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Proceedings du Colloque International Francophone sur l'Ecrit et le Document (CIFED06) (2006)
Indexation de Documents Manuscrits, , in: Proceedings du Colloque International Francophone sur l'Ecrit et le Document (CIFED06), 2006 |
|
International Conference on Spoken Language Processing (2006)
Infinite Models for Speaker Clustering, , in: International Conference on Spoken Language Processing, 2006 |
|
Beyond Patches Workshop, in conjunction with CVPR (2006)
Integrating co-occurrence and spatial contexts on patch-based scene segmentation, , , and , in: Beyond Patches Workshop, in conjunction with CVPR, 2006 |
|
Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL) (2006)
Investigating Lexical Substitution Scoring for Subtitle Generation, , , , and , in: Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL), 2006 |
|
3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms MLMI'06 (2006)
Juicer: A Weighted Finite-State Transducer speech decoder, , , , , and , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms MLMI'06, 2006 |
|
Medical Informatics in Europe (MIE) (2006)
Kernel Methods for Melanoma Recognition, , and , in: Medical Informatics in Europe (MIE), 2006 |
|
Proceedings of Workshop on Computer Vision Approaches to Medical Image Analysis (CVAMIA 2006) (2006)
Kernel Methods for Melanoma Recognition, , and , in: Proceedings of Workshop on Computer Vision Approaches to Medical Image Analysis (CVAMIA 2006), 2006 |
|
International Workshop on Adaptive Multimedia Retrieval (AMR) (2006)
Learning to Retrieve Images from Text Queries with a Discriminative Model, , and , in: International Workshop on Adaptive Multimedia Retrieval (AMR), 2006 |
|
IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR) (2006)
Local Binary Patterns as an Image Preprocessing for Face Authentication, , and , in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006 |
|
International Workshop on Computer Vision Applications for Medical Image Analysis (2006)
Melanoma Recognition Using Representative and Discriminative Kernel Classifiers, , and , in: International Workshop on Computer Vision Applications for Medical Image Analysis, 2006 |
|
Proc. IEEE International Conference on Multimedia & Expo (ICME), 2006 (2006)
Modeling Interactions from Email Communication, , and , in: Proc. IEEE International Conference on Multimedia & Expo (ICME), 2006, 2006 |
|
Multimodal Interaction and Related Machine Learning Algorithms (MLMI) (2006)
Multi-Person Tracking in Meetings: A Comparative Study, , , , , and , in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006 |
|
Proceedings of ISCA International Conference on Spoken Language Processing (ICSLP) (2006)
Multi-stream ASR: An Oracle Perspective, , and , in: Proceedings of ISCA International Conference on Spoken Language Processing (ICSLP), 2006 |
|
Conference on Image and Video Retrieval CIVR (2006)
Natural Scene Image Modeling using Color and Texture Visterms, and , in: Conference on Image and Video Retrieval CIVR, 2006 |
|
Int. Conf. on Artificial Neural Networks (ICANN) (2006)
Nearly optimal exploration-exploitation decision thresholds, , in: Int. Conf. on Artificial Neural Networks (ICANN), 2006 |
|
Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006 (2006)
Non-Invasive Brain Computer Interface for Mental Control of a Simulated Wheelchair, , , , , , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
Online Classifier Adaptation in High Frequency EEG, , and , in: Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, 2006 |
|
International Conference on Spoken Language Processing (ICSLP) (2006)
Posterior Based Keyword Spotting with A Priori Thresholds, , , and , in: International Conference on Spoken Language Processing (ICSLP), 2006 |
|
Proceedings of the 57th International Astronautical Conference (2006)
Prospects on Brain-Machine Interfaces for Space System Control, , , , , , , , , , , , , , , , , and , in: Proceedings of the 57th International Astronautical Conference, 2006 |
|
Multimodal User Authentication (MMUA) (2006)
Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, , and , in: Multimodal User Authentication (MMUA), 2006 |
|
Proceedings of the IEEE Conference on Multimedia and Expo (ICME 2006) (2006)
Sociometry Based Multiparty Audio Recordings Segmentation, , in: Proceedings of the IEEE Conference on Multimedia and Expo (ICME 2006), 2006 |
|
Proceedings of International Conference on Pattern Recognition (ICPR 2006) (2006)
Sociometry Based Multiparty Audio Recordings Summarization, , in: Proceedings of International Conference on Pattern Recognition (ICPR 2006), 2006 |
|
Proc. Int. Conf. on Multimodal Interfaces (ICMI) (2006)
Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2006 |
|
Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD) (2006)
Speech Coding based on Spectral Dynamics, , , and , in: Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2006 |
|
Proceedings of International Cognitive Vision Workshop (ICVW 2006) (2006)
The More you Learn, the Less you Store: Memory-Controlled Incremental SVM, and , in: Proceedings of International Cognitive Vision Workshop (ICVW 2006), 2006 |
|
Int. Conf. on Spoken Language Processing (Interspeech ICSLP) (2006)
The segmentation of multi-channel meeting recordings for automatic speech recognition, , and , in: Int. Conf. on Spoken Language Processing (Interspeech ICSLP), 2006 |
|
Proceedings of ICASSP 2006 (2006)
Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, , and , in: Proceedings of ICASSP 2006, 2006 |
|
International Conference on Multimodal Interfaces (ICMI06) (2006)
Tracking the Multi Person Wandering Visual Focus of Attention, , , and , in: International Conference on Multimodal Interfaces (ICMI06), 2006 |
|
NIPS (2006)
Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, and , in: NIPS, 2006 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2006)
Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006 |
|
Using more informative posterior probabilities for speech recognition, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006 |
|
Proceedings of ICASSP, 2006 (2006)
Using Pitch as Prior Knowledge in Template-Based Speech Recognition, , and , in: Proceedings of ICASSP, 2006, 2006 |
|
International Conference on Spoken Language Processing (2006)
Using Posterior-Based Features in Template Matching for Speech Recognition, , and , in: International Conference on Spoken Language Processing, 2006 |
|
Seventh IAPR Workshop on Document Analysis Systems, DAS (2006)
Writer Identification for Smart Meeting Room Systems, , , , , and , in: Seventh IAPR Workshop on Document Analysis Systems, DAS, 2006 |
|
Proceedings of the 22nd International Conference on Machine Learning (2005)
A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, , , and , in: Proceedings of the 22nd International Conference on Machine Learning, 2005 |
|
CHI '92: Proceedings of the SIGCHI conference on Human factors in computing systems (2005)
A Meeting Browser Evaluation Test, , , and , in: CHI '92: Proceedings of the SIGCHI conference on Human factors in computing systems, Portland, OR, USA, ACM Press, 2005 |
|
International Conference on Artificial Neural Networks, ICANN (2005)
A Neural Network for Text Representation, and , in: International Conference on Artificial Neural Networks, ICANN, 2005 |
|
Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA (2005)
A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
Advances in Neural Information Processing Systems, NIPS 15 (2005)
A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, , and , in: Advances in Neural Information Processing Systems, NIPS 15, 2005 |
|
Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR) (2005)
A Probabilistic Model for Chord Progressions, , and , in: Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR), 2005 |
|
ACM ICMI Workshop on Multimodal Multiparty Meeting Processing (MMMP) (2005)
A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, and , in: ACM ICMI Workshop on Multimodal Multiparty Meeting Processing (MMMP), 2005 |
|
Proceedings of ICASSP 2005 (2005)
A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, and , in: Proceedings of ICASSP 2005, 2005 |
|
Proceedings of INTERSPEECH 2005 (2005)
A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR, , and , in: Proceedings of INTERSPEECH 2005, 2005 |
|
Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag (2005)
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , in: Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag, 2005 |
|
Advances in Neural Information Processing Systems, NIPS 18. MIT Press (2005)
Benchmarking Non-Parametric Statistical Tests, , and , in: Advances in Neural Information Processing Systems, NIPS 18. MIT Press, 2005 |
|
IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP (2005)
Boosting word error rates, and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2005 |
Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA (2005)
Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) (2005)
Detecting Group Interest-level in Meetings, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005 |
|
Proceedings of Interspeech (2005)
Developing and Enhancing Posterior Based Speech Recognition Systems, , , and , in: Proceedings of Interspeech, 2005 |
|
Sixth International Workshop on Multiple Classifier System (MCS2005) (2005)
EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, and , in: Sixth International Workshop on Multiple Classifier System (MCS2005), 2005 |
|
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo (ICME-05) (2005)
Effect of Segmentation Method on Video Retrieval Performance, and , in: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo (ICME-05), 2005 |
|
International Conference on Multimedia & Expo ICME 2005 (2005)
Evaluation of Multiple Cues Head Pose Tracking Algorithm in Natural Environments, and , in: International Conference on Multimedia & Expo ICME 2005, 2005 |
|
NIPS Workshop on Learning to Rank (2005)
Exploiting Hyperlinks to Learn a Retrieval Model, and , in: NIPS Workshop on Learning to Rank, 2005 |
|
7th ACM SIGMM International Workshop on Multimedia Information Retrieval (2005)
Extracting Information from Multimedia Meeting Collections, , and , in: 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005 |
|
Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-05) (2005)
F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, and , in: Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-05), 2005 |
|
European Symposium on Artificial Neural Networks ESANN (2005)
Generative Independent Component Analysis for EEG Classification, and , in: European Symposium on Artificial Neural Networks ESANN, 2005 |
|
The 2nd International IEEE EMBS Conference On Neural Engineering (2005)
Generative Temporal ICA for Classification in Asynchronous BCI Systems, and , in: The 2nd International IEEE EMBS Conference On Neural Engineering, 2005 |
|
PASCAL Workshop on Principled Methods of Trading Exploration and Exploitation (2005)
Gradient estimates of return distributions, and , in: PASCAL Workshop on Principled Methods of Trading Exploration and Exploitation, 2005 |
|
Proceedings MLMI workshop (2005)
Hierarchical Multi-Stream Posterior Based Speech Recognition System, , and , in: Proceedings MLMI workshop, 2005 |
|
Proceedings of INTERSPEECH 2005 (2005)
Implicit Control of Noise Canceller for Speech Enhancement, , and , in: Proceedings of INTERSPEECH 2005, 2005 |
|
Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA (2005)
Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, and , in: Fifth Int'l. Conf. Audio- and Video-Based Biometric Person Authentication AVBPA, 2005 |
|
Proceedings of Interspeech, 2005 (2005)
Improving Speech Recognition Using a Data-Driven Approach, , and , in: Proceedings of Interspeech, 2005, 2005 |
|
ACM Conference on Information and Knowledge Management (2005)
Inferring Document Similarity from Hyperlinks, and , in: ACM Conference on Information and Knowledge Management, 2005 |
|
NIPS (2005)
Learning influence among interacting Markov chains, , , and , in: NIPS, 2005 |
|
2005
Machine Learning for Multimodal Interaction: First International Workshop, MLMI'2004, Springer-Verlag Heidelberg, 2005 |
IEEE Int. Conf. on Computer Vision (2005)
Modeling Scenes with Local Descriptors and Latent Aspects, , , , , and , in: IEEE Int. Conf. on Computer Vision, 2005 |
|
Proceedings of Interspeech 2005 (2005)
Multi-resolution RASTA filtering for TANDEM-based ASR, and , in: Proceedings of Interspeech 2005, 2005 |
|
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2005)
Multi-resolution Spectral Entropy Based Feature for Robust ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2005 |
|
Proceedings of HSCMA 2005 (2005)
Multichannel Speech Enhancement in Cars: Explicit vs. Implicit Adaptation Control, , and , in: Proceedings of HSCMA 2005, 2005 |
|
MLMI (2005)
Multimodal Integration for Meeting Group Action Segmentation and Recognition, , , , , and , in: MLMI, 2005 |
|
Proc. Int. Conf. on Multimodal Interfaces (ICMI) (2005)
Multimodal Multispeaker Probabilistic Tracking in Meetings, , , and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2005 |
|
IEEE International Conference on Information Technology and Applications, ICITA (2005)
On Accuracy/Robustness/Complexity Trade-Offs in Face Verification, , and , in: IEEE International Conference on Information Technology and Applications, ICITA, 2005 |
|
Proc. IEEE CVPR (2005)
Semi-supervised Adapted HMMs for Unusual Event Detection, , , and , in: Proc. IEEE CVPR, 2005 |
|
Proc. IEEE ICME (2005)
Semi-supervised Meeting Event Recognition with Adapted HMMs, , and , in: Proc. IEEE ICME, 2005 |
|
Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech) (2005)
Spectral Entropy Feature in Full-Combination Multi-Stream for Robust ASR, and , in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2005 |
|
Proc. IEEE ICME (2005)
Speech Acquisition in Meetings with an Audio-Visual Sensor Array, , , , and , in: Proc. IEEE ICME, 2005 |
|
Machine Learning for Multimodal Interaction: Second International Workshop, MLMI'2005 (2005)
The AMI Meeting Corpus: a Pre-Announcement, , , , , , , , , , , , , , , , and , in: Machine Learning for Multimodal Interaction: Second International Workshop, MLMI'2005, 2005 |
|
International Conference on Machine Learning, ICML, Workshop on ROC Analysis in Machine Learning (2005)
The Expected Performance Curve, , and , in: International Conference on Machine Learning, ICML, Workshop on ROC Analysis in Machine Learning, 2005 |
|
Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005 (2005)
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), , and , in: Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005, 2005 |
|
Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), invited paper (2005)
Tracking People in Meetings with Particles, , , , and , in: Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), invited paper, 2005 |
|
Proceedings of the 2005 IEEE ASRU Workshop (2005)
Unsupervised Spectral Subtraction for Noise-Robust ASR, , , and , in: Proceedings of the 2005 IEEE ASRU Workshop, 2005 |
|
Proceedings of the 19th International Joint Conference on Artificial Intelligence (2005)
You Are Wrong!---Automatic Detection of Interaction Errors from Brain Waves, and , in: Proceedings of the 19th International Joint Conference on Artificial Intelligence, 2005 |
|
IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP (2004)
A Gentle Hessian for Efficient Gradient Descent, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004 |
|
17th Int. Conf. Pattern Recognition (ICPR) (2004)
A probabilistic framework for joint head tracking and pose estimation, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
Proceedings of the 2004 SAPA Workshop (2004)
A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, and , in: Proceedings of the 2004 SAPA Workshop, 2004 |
|
Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop (2004)
A Statistical Significance Test for Person Authentication, and , in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004 |
|
Proceedings of the 6th International Conference on Automatic Face and Gesture Recognition (2004)
A Symmetric Transformation for LDA-based Face Verification, , in: Proceedings of the 6th International Conference on Automatic Face and Gesture Recognition, IEEE Computer Society Press, 2004 |
|
Int'l Conf. on Biometric Authentication (2004)
An Investigation of Spectral Subband Centroids for Speaker Authentication, , and , in: Int'l Conf. on Biometric Authentication, 2004 |
|
Int. Conf. on Image and Video Retrieval (CIVR) (2004)
Assessing Scene Structuring in Consumer Videos, , , , and , in: Int. Conf. on Image and Video Retrieval (CIVR), 2004 |
|
IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP (2004)
Boosting HMMs with an application to speech recognition, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004 |
|
Biometric Authentication Workshop of the 8th European Conference on Computer Vision, BIOAW2004 (2004)
Boosting Pixel-based Classifiers for Face Verification, and , in: Biometric Authentication Workshop of the 8th European Conference on Computer Vision, BIOAW2004, Springer-Verlag, 2004 |
|
ICASSP (2004)
Clustering And Segmenting Speakers And Their Locations In Meetings, , and , in: ICASSP, 2004 |
|
Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04) (2004)
Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
International Conference on Computer Vision and Pattern Recognition (2004)
Cue integration through discriminative accumulation, and , in: International Conference on Computer Vision and Pattern Recognition, 2004 |
|
Proceedings of International Workshop on Frontiers in Handwriting Recognition (2004)
Effect of Recognition Errors on Information Retrieval Performance, , in: Proceedings of International Workshop on Frontiers in Handwriting Recognition, 2004 |
|
17th Int. Conf. Pattern Recognition (ICPR) (2004)
Embedding motion in model-based stochastic tracking, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
Proceedings of the INTERSPEECH-ICSLP-04 (2004)
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , in: Proceedings of the INTERSPEECH-ICSLP-04, 2004 |
|
IEEE International Conference on Image Processing, ICIP (2004)
Estimating the Quality of Face Localization for Face Verification, , , and , in: IEEE International Conference on Image Processing, ICIP, 2004 |
|
The 6th International Conference on Automatic Face and Gesture Recognition, FG2004 (2004)
Face Verification Using Adapted Generative Models, , and , in: The 6th International Conference on Automatic Face and Gesture Recognition, FG2004, IEEE, 2004 |
|
Proceedings IEEE WIAMIS 2004 (5th International Workshop on Image Analysis for Multimedia Interactive Services), 21-23 April, 2004, Lisboa, Portugal (2004)
Fusion of Structural and Color Local Descriptors for Enhanced Object Recognition, and , in: Proceedings IEEE WIAMIS 2004 (5th International Workshop on Image Analysis for Multimedia Interactive Services), 21-23 April, 2004, Lisboa, Portugal, 2004 |
|
European Symposium on Artificial Neural Networks ESANN (2004)
HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, and , in: European Symposium on Artificial Neural Networks ESANN, 2004 |
|
Proceedings of ICASSP (2004)
Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, , and , in: Proceedings of ICASSP, 2004 |
|
International Conference on Machine Learning, ICML (2004)
Links Between Perceptrons, MLPs and SVMs, and , in: International Conference on Machine Learning, ICML, 2004 |
|
IEEE Transaction on Multimedia, June, 2006 (2004)
Modeling Individual and Group Actions in Meetings With Layered HMMs, , , , and , in: IEEE Transaction on Multimedia, June, 2006, 2004 |
|
the Second IEEE Workshop on Event Mining: Detection and Recognition of Events in Video, In Association with CVPR (2004)
Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, , , , and , in: the Second IEEE Workshop on Event Mining: Detection and Recognition of Events in Video, In Association with CVPR, 2004 |
|
Proceedings of ICSLP (2004)
Modelling Auxiliary Features in Tandem Systems, , , and , in: Proceedings of ICSLP, 2004 |
|
ACM 2nd International Workshop on Video Surveillance & Sensor Networks in conjunction with 12th ACM International Conference on Multimedia (2004)
Multimodal Group Action Clustering in Meetings, , , , and , in: ACM 2nd International Workshop on Video Surveillance & Sensor Networks in conjunction with 12th ACM International Conference on Multimedia, 2004 |
|
Proceedings of International Conference on Spoken Language Processing (ICSLP) (2004)
New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, , , and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP), 2004 |
|
The Speaker and Language Recognition Workshop (2004)
Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, and , in: The Speaker and Language Recognition Workshop, 2004 |
|
Proceedings of International Conference on Pattern Recognition (ICPR) (2004)
Noisy Text Categorization, , in: Proceedings of International Conference on Pattern Recognition (ICPR), 2004 |
|
17th International Conference on Pattern Recognition (ICPR) (2004)
On Performance Evaluation of Face Detection and Localization Algorithms, , , and , in: 17th International Conference on Pattern Recognition (ICPR), 2004 |
|
Proceedings of the International Joint Conference on Neural Networks (2004)
On the Need for On-Line Learning in Brain-Computer Interfaces, , in: Proceedings of the International Joint Conference on Neural Networks, 2004 |
|
Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04) (2004)
On Use of Task Independent Training Data in Tandem Feature Extraction, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
12th European Symposium on Artificial Neural Networks, ESANN 04 (2004)
Online Policy Adaptation for Ensemble Classifiers, and , in: 12th European Symposium on Artificial Neural Networks, ESANN 04, 2004 |
British Machine Vision Conference (BMVC) (2004)
Order Matters: A Distributed Sampling Method for Multi-Object Tracking, , in: British Machine Vision Conference (BMVC), 2004 |
|
Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04) (2004)
Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, , , and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
Proc. ACM Int. Conf. on Multimedia (ACM MM) (2004)
PLSA-based Image Auto-Annotation: Constraining the Latent Space, and , in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2004 |
|
International Conference on Spoken Language Processing (ICSLP 2004) (2004)
Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, and , in: International Conference on Spoken Language Processing (ICSLP 2004), 2004 |
|
Proceedings of SST 2004 (10th Australian International Conference on Speech Science & Technology), Sydney, Australia, 2004 (2004)
Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, , in: Proceedings of SST 2004 (10th Australian International Conference on Speech Science & Technology), Sydney, Australia, 2004, 2004 |
|
the International Conference on Pattern Recognition (ICPR) (2004)
Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, , and , in: the International Conference on Pattern Recognition (ICPR), 2004 |
|
Proc. of the sixth International Conference on Automatic Face and Gesture Recognition (2004)
Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, , and , in: Proc. of the sixth International Conference on Automatic Face and Gesture Recognition, 2004 |
|
Proceedings of the 4th Forum of European Neuroscience (2004)
Restoring Locomotion with a Thought Controlled Mobile Robot, , in: Proceedings of the 4th Forum of European Neuroscience, 2004 |
Proc. 17th International Conference on Pattern Recognition (ICPR 2004) (2004)
Robust Playfield Segmentation using MAP Adaptation, and , in: Proc. 17th International Conference on Pattern Recognition (ICPR 2004), 2004 |
|
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2004)
Spectral Entropy Based Feature for Robust ASR, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2004 |
|
Proceedings of the INTERSPEECH-ICSLP-04 (2004)
Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, , , and , in: Proceedings of the INTERSPEECH-ICSLP-04, 2004 |
|
Proceedings of the IEEE International Conference on Image Processing (ICIP) (2004)
Statistical Transformations of Frontal Models for Non-Frontal Face Verification, and , in: Proceedings of the IEEE International Conference on Image Processing (ICIP), 2004 |
|
Proceedings of ICSLP (2004)
Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, and , in: Proceedings of ICSLP, 2004 |
|
17th Int. Conf. Pattern Recognition (ICPR) (2004)
Tangent Vector Kernels for Invariant Image Classification with SVMs, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop (2004)
The Expected Performance Curve: a New Assessment Measure for Person Authentication, and , in: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, 2004 |
|
Pascal Workshop on Text Mining and Understanding (2004)
Theme Topic Mixture Model: A Graphical Model for Document Representation, and , in: Pascal Workshop on Text Mining and Understanding, 2004 |
|
2004
Proceedings of the 2004 ICASSP-NIST Meeting Recognition Workshop (2004)
Unsupervised Location-Based Segmentation of Multi-Party Speech, , and , in: Proceedings of the 2004 ICASSP-NIST Meeting Recognition Workshop, 2004 |
|
Proceedings of ICSLP, 2004 (2004)
Using RASTA in task independent TANDEM feature extraction, , and , in: Proceedings of ICSLP, 2004, 2004 |
|
Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04) (2004)
Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, and , in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004 |
|
2004
An Online Audio Indexing System, , and , 2004 |
|
LP-TRAP: Linear predictive temporal patterns, , and , 2004 |
|
Proceedings of the 9th International Conference on Human-Computer Interaction (INTERACT-2003) (2003)
A Hierarchical Keyframe User Interface for Browsing Video over the Internet, , , and , in: Proceedings of the 9th International Conference on Human-Computer Interaction (INTERACT-2003), IOS Press, 2003 |
|
IEEE Int. Conf. on Computer Vision Workshop on Multimedia Technologies for E-Learning and Collaboration (ICCV-WOMTEC) (2003)
A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, , , and , in: IEEE Int. Conf. on Computer Vision Workshop on Multimedia Technologies for E-Learning and Collaboration (ICCV-WOMTEC), 2003 |
|
IEEE Automatic Speech Recognition Understanding Workshop (2003)
A Robust Speaker Clustering Algorithm, and , in: IEEE Automatic Speech Recognition Understanding Workshop, 2003 |
|
Proceedings of the 10th International Conference on Human-Computer Interaction (2003)
Adaptive Brain Interfaces for Communication and Control, , in: Proceedings of the 10th International Conference on Human-Computer Interaction, 2003 |
|
Advances in Neural Information Processing Systems, NIPS 15 (2003)
An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, , in: Advances in Neural Information Processing Systems, NIPS 15, MIT Press, 2003 |
|
British Machine Vision Conference (BMVC) (2003)
An Implicit Motion Likelihood for Tracking with Particle Filters, , and , in: British Machine Vision Conference (BMVC), Springer Verlag, 2003 |
|
IEEE International Conference on Image Processing (ICIP) (2003)
Audio-Visual Speaker Tracking with Importance Particle Filters, , , , and , in: IEEE International Conference on Image Processing (ICIP), 2003 |
|
Proceedings of the 2003 Workshop on Multimodal User Authentication (MMUA'03) (2003)
Augmenting Frontal Face Models for Non-Frontal Verification, and , in: Proceedings of the 2003 Workshop on Multimodal User Authentication (MMUA'03), 2003 |
International Conference on Artificial Neural Networks, ICANN/ICONIP 2003 (2003)
Client Dependent GMM-SVM Models for Speaker Verification, and , in: International Conference on Artificial Neural Networks, ICANN/ICONIP 2003, Springer Verlag, 2003 |
|
Proceedings of the 1st International IEEE EMBS Conference on Neural Engineering (2003)
Comparison of different feature classifiers for brain computer interfaces, , , , , , , , and , in: Proceedings of the 1st International IEEE EMBS Conference on Neural Engineering, 2003 |
4th International Conference on Audio- and Video-Based Biometric Person Authentication (2003)
Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, , and , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, 2003 |
|
Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech) (2003)
Confusion Matrix Based Entropy Correction in Multi-stream Combination, and , in: Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech), 2003 |
|
Proceedings of the 9th International Conference on Functional Mapping of the Human Brain (2003)
Direct Non-Invasive Brain Computer Interfaces, , , , and , in: Proceedings of the 9th International Conference on Functional Mapping of the Human Brain, 2003 |
Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03) (2003)
Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
IEEE International Conference on Acoustics, Speech, and Signal Processing (2003)
Improving Face Authentication Using Virtual Samples, , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003 |
|
Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03) (2003)
Location Based Speaker Segmentation, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
IEEE ASRU (2003)
Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, , , and , in: IEEE ASRU, 2003 |
|
Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03) (2003)
Microphone Array Speech Recognition: Experiments on Overlapping Speech in Meetings, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
Proceedings of International Conference on Acoustics, Speech and Signal Processing (2003)
Modeling Human Interaction in Meetings, , , , , , , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2003 |
|
GRETSI conference, Signal and Image Processing (2003)
Modélisation implicite du mouvement en suivi par filtrage de Monte Carlo séquentiel, and , in: GRETSI conference, Signal and Image Processing, 2003 |
|
Proc. IEEE Workshop on Neural Networks for Signal Processing (NNSP) (2003)
Multi-Modal Audio-Visual Event Recognition for Football Analysis, , and , in: Proc. IEEE Workshop on Neural Networks for Signal Processing (NNSP), 2003 |
|
4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA (2003)
Multimodal Authentication using Asynchronous HMMs, , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2003)
New Entropy Based Combination Rules in HMM/ANN Multi-stream ASR, , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003 |
|
Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03) (2003)
Noise Resistant Audio-Visual Verification via Structural Constraints, and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
Proceedings of the 18th International Joint Conference on Artificial Intelligence (2003)
Non-Invasive Brain-Actuated Control of a Mobile Robot, , , and , in: Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003 |
|
Workshop on Multimodal User Authentication (2003)
Non-Linear Variance Reduction Techniques in Biometric Authentication, and , in: Workshop on Multimodal User Authentication, 2003 |
|
Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop 2003 (2003)
Nonlinear Spectral Transformations for Robust Speech Recognition, , and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop 2003, 2003 |
|
Proceedings of International Conference on Document Analysis and Recognition (ICDAR) (2003)
Offline Recognition of Large Vocabulary Cursive Handwritten Text, , and , in: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), 2003 |
|
IEEE International Conference on Image Processing (ICIP) (2003)
On automatic annotation of meeting databases, , , , and , in: IEEE International Conference on Image Processing (ICIP), 2003 |
|
Eurospeech (2003)
On Factorizing Spectral Dynamics for Robust Speech Recognition, , , and , in: Eurospeech, 2003 |
|
Proc. ACM Int. Conf. on Multimedia (ACM MM) (2003)
On Image Auto-Annotation with Latent Space Models, and , in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2003 |
|
European Conference On Speech, Communication and Technology (EUROSPEECH'03) (2003)
On the Combination of Speech and Speaker Recognition, and , in: European Conference On Speech, Communication and Technology (EUROSPEECH'03), 2003 |
|
Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03) (2003)
Phase AutoCorrelation (PAC) derived Robust Speech Features, , and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
|
Proceedings of IEEE ASRU (2003)
Phoneme-Grapheme Based Speech Recognition System, , , and , in: Proceedings of IEEE ASRU, 2003 |
|
Proceedings of 4th International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA-03) (2003)
Robust Features for Frontal Face Authentication in Difficult Image Conditions, and , in: Proceedings of 4th International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA-03), 2003 |
4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA (2003)
Scalability Analysis of Audio-Visual Person Identity Verification, , , and , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
Proceedings of Eurospeech 2003 (2003)
Segmenting Multiple Concurrent Speakers Using Microphone Arrays, , and , in: Proceedings of Eurospeech 2003, 2003 |
|
ICIP (2003)
Sequential Monte Carlo Video Text Segmentation, and , in: ICIP, 2003 |
|
International Conference on Image and Video Retrieval (CIVR'03) (2003)
Spectral Structuring of Home Videos, , and , in: International Conference on Image and Video Retrieval (CIVR'03), Springer Verlag, 2003 |
|
Proceedings of the 2003 IEEE International Conference on Multimedia & Expo (ICME-03) (2003)
Speech & Face Based Biometric Authentication at IDIAP, , , , , , , and , in: Proceedings of the 2003 IEEE International Conference on Multimedia & Expo (ICME-03), 2003 |
Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03) (2003)
Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, , and , in: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03), 2003 |
Proceedings of the Conference of the International Society for Brain Electromagnetic Topography (2003)
Studying Phase Synchrony for Classification of Mental Tasks in Brain Machine Interfaces, , , and , in: Proceedings of the Conference of the International Society for Brain Electromagnetic Topography, 2003 |
4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA (2003)
The BANCA Database and Evaluation Protocol, , , , , , , , , , , and , in: 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA, Springer-Verlag, 2003 |
|
large part published in Proceedings of ASRU-2003 (2003)
TRAP-TANDEM: Data-driven extraction of temporal features from speech, , in: large part published in Proceedings of ASRU-2003, 2003 |
|
Proceedings of Eurospeech (2003)
Using pitch frequency information in speech recognition, , and , in: Proceedings of Eurospeech, 2003 |
|
Pattern Recognition and Image Analysis: First Iberian Conference, IbPRIA 2003, Springer-Verlag LNCS (2003)
Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, and , in: Pattern Recognition and Image Analysis: First Iberian Conference, IbPRIA 2003, Springer-Verlag LNCS, 2003 |
|
3rd Workshop on Content-Based Multimedia Indexing (CBMI) (2003)
Video Shot Clustering using Spectral Methods, , and , in: 3rd Workshop on Content-Based Multimedia Indexing (CBMI), 2003 |
|
International Conference on Spoken Language Processing ICSLP (2002)
A Comparative Study of Adaptation Methods for Speaker Verification, and , in: International Conference on Spoken Language Processing ICSLP, 2002 |
|
IEEE International Workshop on Neural Networks for Signal Processing (NNSP) (2002)
A Multi-sample Multi-source Model for Biometric Authentication, , and , in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002 |
|
Advances in Neural Information Processing Systems, NIPS 14 (2002)
A Parallel Mixture of SVMs for Very Large Scale Problems, , and , in: Advances in Neural Information Processing Systems, NIPS 14, MIT Press, 2002 |
|
Proceedings of the COST275 Workshop on The Advent of Biometrics on the Internet (2002)
A State-of-the-art Neural Network for Robust Face Verification, , and , in: Proceedings of the COST275 Workshop on The Advent of Biometrics on the Internet, 2002 |
|
Seventh International Conference on Spoken Language Processing (ICSLP 2002) (2002)
Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, , and , in: Seventh International Conference on Spoken Language Processing (ICSLP 2002), 2002 |
|
IEEE International Workshop on Neural Networks for Signal Processing (NNSP) (2002)
Conditional Gaussian Mixture Models for Environmental Risk Mapping, , and , in: IEEE International Workshop on Neural Networks for Signal Processing (NNSP), 2002 |
|
2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP 2002) (2002)
Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, , , and , in: 2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP 2002), 2002 |
|
International Conference on Spoken Language Processing (ICSLP 2002) (2002)
Evaluation of Formant-Like Features for ASR, , , , , and , in: International Conference on Spoken Language Processing (ICSLP 2002), 2002 |
|
Proceedings of the International Federation for Medical and Biological Engineering (2002)
Evolution of the Mental States Operating a Brain-Computer Interface, , , and , in: Proceedings of the International Federation for Medical and Biological Engineering, 2002 |
|
XI Journees NeuroSciences et Sciences pour l'Ingenieur (NSI 2002) (2002)
Face Verification using MLP and SVM, and , in: XI Journees NeuroSciences et Sciences pour l'Ingenieur (NSI 2002), 2002 |
|
International IEEE Workshop on Neural Networks for Signal Processing (NNSP 02) (2002)
Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, , in: International IEEE Workshop on Neural Networks for Signal Processing (NNSP 02), 2002 |
|
International IEEE Conference on Multimodal Interfaces (ICMI 02) (2002)
Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, , in: International IEEE Conference on Multimodal Interfaces (ICMI 02), 2002 |
|
Proceedings of the 16th International Conference on Pattern Recognition (2002)
Improving Face Verification using Skin Color Information, and , in: Proceedings of the 16th International Conference on Pattern Recognition, IEEE Computer Society Press, 2002 |
|
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 02) (2002)
Increasing Speech Recognition Noise Robustness with HMM2, , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 02), 2002 |
|
IEEE International Conference on Multimedia and Expo (2002)
Linking Objects in Videos by Importance Sampling, and , in: IEEE International Conference on Multimedia and Expo, 2002 |
|
Proc. ICSLP (2002)
Low cost duration modelling for noise robust speech recognition, , and , in: Proc. ICSLP, 2002 |
|
Proceedings of International Conference on Acoustics, Speech and Signal Processing (2002)
Microphone Array Post-filter for Diffuse Noise Field, and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2002 |
|
International Conference on Pattern Recognition (ICPR 2002) (2002)
Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, , and , in: International Conference on Pattern Recognition (ICPR 2002), 2002 |
|
Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 02) (2002)
Multiscale Facial Expression Recognition using Convolutional Neural Networks, , in: Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 02), 2002 |
|
IEEE Workshop on Motion and Video Computing (2002)
Object Localization in Metric Spaces for Video Linking, and , in: IEEE Workshop on Motion and Video Computing, 2002 |
|
Proceedings of International Conference on Pattern Recognition (2002)
Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, and , in: Proceedings of International Conference on Pattern Recognition, 2002 |
|
IEEE International Conference on Image Processing (2002)
Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, , and , in: IEEE International Conference on Image Processing, 2002 |
|
Proceedings of the International Conference on Pattern Recognition (ICPR 02) (2002)
Robust Face Analysis using Convolutional Neural Networks, , in: Proceedings of the International Conference on Pattern Recognition (ICPR 02), 2002 |
|
ICASSP (2002)
Robust HMM-Based Speech/Music Segmentation, , and , in: ICASSP, 2002 |
|
Proceedings of International Conference on Speech and Language Processing (ICSLP) (2002)
Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, , and , in: Proceedings of International Conference on Speech and Language Processing (ICSLP), 2002 |
|
International Workshop on Pattern Recognition with Support Vector Machines, SVM'2002 (2002)
Scaling Large Learning Problems with Hard Parallel Mixtures, , and , in: International Workshop on Pattern Recognition with Support Vector Machines, SVM'2002, 2002 |
|
Proceedings of the 2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP-02) (2002)
Speaker Normalization using HMM2, , and , in: Proceedings of the 2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP-02), 2002 |
|
Int. Conf. Pattern Recognition 2002 (2002)
Text Segmentation and Recognition in Complex Background Based on Markov Random Field, , and , in: Int. Conf. Pattern Recognition 2002, 2002 |
|
ICSLP (2002)
Unknown-Multiple Speaker clustering using HMM, , , and , in: ICSLP, 2002 |
|
Proceedings of the COST275 Workshop on the Advent of Biometrics on the Internet (2002)
User-Customized Password HMM Based Speaker Verification, and , in: Proceedings of the COST275 Workshop on the Advent of Biometrics on the Internet, 2002 |
|
International Conference on Spoken Language Processing (ICSLP 2002) (2002)
User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, and , in: International Conference on Spoken Language Processing (ICSLP 2002), 2002 |
|
Int. Conf. Image Processing 2002 (2002)
Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, and , in: Int. Conf. Image Processing 2002, 2002 |
|
Proceedings of 8th International Conference on Frontiers on Handwriting Recognition (2002)
Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, and , in: Proceedings of 8th International Conference on Frontiers on Handwriting Recognition, 2002 |
|
ICASSP (2001)
Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, , and , in: ICASSP, 2001 |
|
2001 Annual Conference of the IAMG (2001)
Confidence Evaluation for Risk Prediction, , and , in: 2001 Annual Conference of the IAMG, 2001 |
|
Proc. CRAC (workshop on Consistent & Reliable Acoustic Cues for sound analysis) (2001)
Data utility modelling for mismatch reduction, , in: Proc. CRAC (workshop on Consistent & Reliable Acoustic Cues for sound analysis), 2001 |
|
Proc. World Congress on Neuroinformatics (2001)
EEG pattern recognition through multi-stream evidence combination, , and , in: Proc. World Congress on Neuroinformatics, 2001 |
|
EUROSPEECH (2001)
Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, and , in: EUROSPEECH, 2001 |
|
Proc. WISP (2001)
From missing data to maybe useful data: soft data modelling for noise robust ASR, , and , in: Proc. WISP, 2001 |
|
European Conference on Speech Communication and Technology (Eurospeech 2001) (2001)
HMM2- Extraction of Formant Features and their Use for Robust ASR, , and , in: European Conference on Speech Communication and Technology (Eurospeech 2001), 2001 |
|
IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP (2001)
Learning the Decision Function for Speaker Verification, and , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2001 |
|
Proc. Eurospeech (2001)
MAP Combination of Multi-Stream HMM or HMM/ANN Experts, , and , in: Proc. Eurospeech, 2001 |
|
7th European Conference on Speech Communication and Technology (Eurospeech 2001) (2001)
Modeling Auxiliary Information in Bayesian Network Based ASR, , and , in: 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 2001 |
|
Advances in Neural Information Processing Systems 13 (2001)
New Approaches Towards Robust and Adaptive Speech Recognition, , and , in: Advances in Neural Information Processing Systems 13, MIT Press, 2001 |
|
Proc. ICASSP 2001 (2001)
Signal modeling with Non Uniform Topology lattice filters, and , in: Proc. ICASSP 2001, 2001 |
|
Automatic Speech Recognition and Understanding Workshop (2001)
Speech Recognition Using Advanced HMM2 Features, , and , in: Automatic Speech Recognition and Understanding Workshop, 2001 |
|
Proceedings of the 11th International Conference on Image Analysis and Processing (2001)
Text Enhancement with Asymmetric Filter for Video OCR, , and , in: Proceedings of the 11th International Conference on Image Analysis and Processing, 2001 |
|
Proceedings of the Int. Conf. on computer vision and pattern recognition (2001)
Text Identification in Complex Background using SVM, , and , in: Proceedings of the Int. Conf. on computer vision and pattern recognition, 2001 |
Proceedings of the 8th IEEE International Conference on Mechatronics and Machine Vision in Practice (2001)
Video OCR for Sport Video Annotation and Retrieval, , and , in: Proceedings of the 8th IEEE International Conference on Mechatronics and Machine Vision in Practice, 2001 |
|
Int. Conf. on Spoken Language Processing (ICSLP) (2000)
A front-end using the harmonicity cue for speech enhancement in loud noise, , and , in: Int. Conf. on Spoken Language Processing (ICSLP), 2000 |
ICSLP (2000)
A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, , and , in: ICSLP, 2000 |
|
Proc. ICSLP (2000)
A neural network for classification with incomplete data: application to robust ASR, , , , and , in: Proc. ICSLP, 2000 |
|
Journées d'Études sur la Parole, Aussois (2000)
Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, and , in: Journées d'Études sur la Parole, Aussois, 2000 |
|
2000
Audio visual speech recognition, , , , , , , and , Johns Hopkins University-CLSP, 2000 |
6th International Conference on Spoken Language Processing: ICSLP 2000 (Interspeech 2000) (2000)
Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, , , and , in: 6th International Conference on Spoken Language Processing: ICSLP 2000 (Interspeech 2000), 2000 |
|
ICASSP2000 - IEEE International Conference on Acoustics, Speech, and Signal Processing (2000)
Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, , , , , and , in: ICASSP2000 - IEEE International Conference on Acoustics, Speech, and Signal Processing, 2000 |
|
ICONIP, 7th IEEE Int. Conf. on Neural Information Processing (2000)
Blind acoustic source separation for cocktail party speech recognition, , , and , in: ICONIP, 7th IEEE Int. Conf. on Neural Information Processing, 2000 |
ICSLP (2000)
Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, and , in: ICSLP, 2000 |
|
Neural Computation 2000 (2000)
Comparison of Unsupervised and Supervised Training of RBF Neural Networks. Case Study: Mapping of Contamination Data, and , in: Neural Computation 2000, 2000 |
Geostatistical congress 2000 (2000)
Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, , , and , in: Geostatistical congress 2000, 2000 |
Journées d'Études sur la Parole, Aussois (2000)
Etudes comparatives des robustesses au bruit de l'approche 'Full Combination' et de son approximation, and , in: Journées d'Études sur la Parole, Aussois, 2000 |
|
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'2000 (2000)
Fast latent semantic indexing of spoken documents by using self-organizing maps, , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'2000, 2000 |
|
ISCA ITRW ASR2000 (2000)
From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, , and , in: ISCA ITRW ASR2000, 2000 |
|
International Conference on Spoken Language Processing (ICSLP 2000) (2000)
HMM2- A Novel Approach to HMM Emission Probability Estimation, , and , in: International Conference on Spoken Language Processing (ICSLP 2000), 2000 |
|
Proceedings of the Sixth ACM International Conference on Knowledge Discovery and Data Mining (2000)
Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, , and , in: Proceedings of the Sixth ACM International Conference on Knowledge Discovery and Data Mining, ACM, Boston, MA, USA, 2000 |
|
Proceedings of the European Signal Processing Conference EUSIPCO'2000 (2000)
Indexing spoken audio by LSA and SOMs, , in: Proceedings of the European Signal Processing Conference EUSIPCO'2000, 2000 |
Geostatistical congress 2000 (2000)
Indoor Radon Risk Assessment with Geostatistics and Artificial Neural Networks, , , , , , and , in: Geostatistical congress 2000, 2000 |
Proc. ICSLP 2000 (2000)
Inverse lattice filtering of speech with adapted non-uniform delays, and , in: Proc. ICSLP 2000, 2000 |
|
Proceedings of the IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing (2000)
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , in: Proceedings of the IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 2000 |
Proc. 5th Speech Production Seminar (2000)
LPC modeling with speech production constraints, , in: Proc. 5th Speech Production Seminar, 2000 |
|
Int. Conf. on Spoken Language Processing (ICSLP) (2000)
Multichannel signal separation for cocktail party speech recognition: a dynamic recurrent network, , , and , in: Int. Conf. on Spoken Language Processing (ICSLP), no IDIAP RR, see RESPITE www, 2000 |
Proceedings of the 4th International Workshop on Document Analysis System (2000)
Multiple Hypotheses Video OCR, and , in: Proceedings of the 4th International Workshop on Document Analysis System, 2000 |
|
KONVENS 2000 / Sprachkommunikation (2000)
Multiple Timescale Feature Combination towards Robust Speech Recognition, , in: KONVENS 2000 / Sprachkommunikation, 2000 |
|
Neural Computation 2000 (2000)
Neural Network Residual Stochastic Co-simulation for Environmental Data Analysis, , , , and , in: Neural Computation 2000, 2000 |
Proceedings of 7th International Workshop on Frontiers in Handwriting Recognition (2000)
Off-Line Cursive Script Recognition Based on Continuous Density HMM, and , in: Proceedings of 7th International Workshop on Frontiers in Handwriting Recognition, 2000 |
Proceedings of the International Conference on Pattern Recognition (ICPR 2000) (2000)
Recognition of Asymmetric Facial Action Unit Activities and Intensities, and , in: Proceedings of the International Conference on Pattern Recognition (ICPR 2000), 2000 |
|
Proceedings of JEP'2000 (2000)
Reconnaissance de la parole dans le bruit après renforcement fondé sur l'harmonicité, and , in: Proceedings of JEP'2000, no IDIAP RR, see RESPITE www, 2000 |
Proc. ICSLP 2000 (2000)
Relating LPC modeling to a factor-based articulatory model, , in: Proc. ICSLP 2000, 2000 |
|
Phonus No. 5, Dec. 2000, ISSN 0949-1791, Proc. Workshop on Phonetics and Phonology in ASR (2000)
Some applications of a priori knowledge in multi-stream HMM and HMM/ANN based ASR, , in: Phonus No. 5, Dec. 2000, ISSN 0949-1791, Proc. Workshop on Phonetics and Phonology in ASR, 2000 |
|
Int. Conf. on Spoken Language Processing (ICSLP) (2000)
Test of several external posterior weighting functions for multiband Full Combination ASR, and , in: Int. Conf. on Spoken Language Processing (ICSLP), 2000 |
ICSLP (2000)
Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, and , in: ICSLP, 2000 |
|
Proc. IEEE Int. Conference on Speech Processing (ICSP) (1999)
A CASA front-end using the localisation cue for segregation and then cocktail-party speech recognition, , , and , in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999 |
Proc. European Conf. on Speech Communication and Technology (EUROSPEECH) (1999)
A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition, , and , in: Proc. European Conf. on Speech Communication and Technology (EUROSPEECH), 1999 |
Proceedings of the International Conference on Artificial Neural Networks (ICANN'99) (1999)
A comparison of mixture models for density estimation, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999 |
|
6th European Conference on Speech Communication and Technology --- Eurospeech'99 (1999)
A comparison of two strategies for ASR in additive noise: Missing Data and Spectral Subtraction, and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI) (1999)
A measure of speech and pitch reliability from voicing, and , in: Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI), Scandinavian AI Society, 1999 |
Proc. Int. Congress on Phonetic Sciences (ICPhS) (1999)
A new SNR-feature mapping for robust multistream speech recognition, and , in: Proc. Int. Congress on Phonetic Sciences (ICPhS), 1999 |
6th European Conference on Speech Communication and Technology --- Eurospeech'99 (1999)
An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, , , , , , , , , , , and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 1999, Fort Collins, USA (1999)
Audio-Visual Person Verification, , , , and , in: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 1999, Fort Collins, USA, 1999 |
|
Proc. IEEE Int. Conference on Speech Processing (ICSP) (1999)
Blind separation of delayed and superimposed acoustic sources: learning algorithms and experimental study, , , , and , in: Proc. IEEE Int. Conference on Speech Processing (ICSP), IEEE, 1999 |
Proceedings of the International Conference on Artificial Neural Networks (ICANN'99) (1999)
Classification using localized mixtures of experts, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'99), London: IEE, 1999 |
|
6th European Conference on Speech Communication and Technology --- Eurospeech'99 (1999)
Client / World Model Synchronous Alignment for Speaker Verification, , , and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
Principles of Data Mining and Knowledge Discovery: third european conference; proceedings / PKDD'99 (1999)
Combinatorial Approach for Data Binarization, and , in: Principles of Data Mining and Knowledge Discovery: third european conference; proceedings / PKDD'99, Springer, 1999 |
|
Proceedings of the ICML-99 Workshop: From Machine Learning to Knowledge Discovery in Databases (1999)
Data binarization by discriminant elimination, , and , in: Proceedings of the ICML-99 Workshop: From Machine Learning to Knowledge Discovery in Databases, 1999 |
|
Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07 (1999)
Decision-Oriented Environmental Mapping with Radial Basis Function Neural Networks, , , , and , in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999 |
Proceedings of the European Conference on Speech Communication and Technology (1999)
Deliberate Imposture: a challenge for automatic speaker verification systems, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
Robust Methods for Speech Recognition in Adverse Conditions (1999)
Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR, , and , in: Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
|
Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07 (1999)
Environmental and Pollution Spatial Data Classification with Support Vector Machines and Geostatistics, , , and , in: Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999 |
8th Int. Conf. Computer Analysis of Images and Patterns (1999)
Evaluating the Complexity of Databases for Person Identification and Verification, , and , in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999 |
|
Proceedings of the European Conference on Speech Communication and Technology (1999)
Experimental evaluation of text-dependent speaker verification on laboratory and field test databases in the M2VTS project, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
Extraction of Articulators in X-Ray Image Sequences, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
Proc. Second International Conference on Audio and Video-based Biometric Person Authentication (AVBPA'99) (1999)
Fast Face Detection using MLP and FFT, , and , in: Proc. Second International Conference on Audio and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
|
Pattern Recognition and Image Understanding (1999)
Illumination-robust Pattern Matching Using Distorted Color Histograms, and , in: Pattern Recognition and Image Understanding, Infix, 1999 |
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'99), Phoenix, Arizona, USA (1999)
Incremental Enrollment of Speech Recognizers, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'99), Phoenix, Arizona, USA, 1999 |
Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU'99) Workshop (1999)
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU'99) Workshop, 1999 |
ESCA ETRW workshop on Accessing Information in Spoken Audio (1999)
Latent Semantic Indexing by Self-Organizing Map, and , in: ESCA ETRW workshop on Accessing Information in Spoken Audio, 1999 |
|
Proc. Eurospeech'99 (1999)
LPC-based inversion of the DRM articulatory model, , in: Proc. Eurospeech'99, 1999 |
|
IEEE International Conference on Multimedia Computing and Systems (1999)
Multi Modal Verification for Teleservices and Security Applications, , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE International Conference on Multimedia Computing and Systems, 1999 |
Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99) (1999)
Multi-Modal Data Fusion for Person Authentication using SVM, , in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
|
Proc. of the ESCA Workshop on Robust Methods for Speech Recognition in Adverse Conditions (1999)
Non-Stationary Multi-Channel (Multi-Stream) Processing Towards Robust and Adaptive ASR, , in: Proc. of the ESCA Workshop on Robust Methods for Speech Recognition in Adverse Conditions, 1999 |
Proceedings of the European Conference on Speech Communication and Technology (1999)
Robust Person Verification based on Speech and Facial Images, and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
IEEE Workshop on Automatic Advanced Technologies (1999)
The Elisa'99 Speaker Recognition and Tracking Systems, , , , , , , , , , , , , , , and , in: IEEE Workshop on Automatic Advanced Technologies, 1999 |
6th European Conference on Speech Communication and Technology --- Eurospeech'99 (1999)
The full combination sub-bands approach to noise robust HMM/ANN based ASR, , and , in: 6th European Conference on Speech Communication and Technology --- Eurospeech'99, 1999 |
|
Automatic Speech Recognition and Understanding (ASRU) workshop (1999)
Towards introducing long-term statistics in MUSE for robust speech recognition, and , in: Automatic Speech Recognition and Understanding (ASRU) workshop, 1999 |
|
8th Int. Conf. Computer Analysis of Images and Patterns (1999)
Tracking Articulators in X-ray Movies of the Vocal Tract, , in: 8th Int. Conf. Computer Analysis of Images and Patterns, Springer Verlag, 1999 |
|
Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99) (1999)
XM2VTSDB: The Extended M2VTS Database, , , , and , in: Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999 |
ICASSP 98 (1998)
A comparison of a priori threshold setting procedures for speaker verification in the CAVE project, , , , , , and , in: ICASSP 98, 1998 |
Reconnaissance du locuteur et ses applications commerciales et criminalistiques (1998)
An overview of the cave project research activities in speaker verification, , , , , and , in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998 |
Proceedings of Workshop on Text, Speech and Dialog (TSD'98) Brno, Czech Republic (1998)
Confidence Measures in Hybrid HMM/ANN Speech Recognition, and , in: Proceedings of Workshop on Text, Speech and Dialog (TSD'98) Brno, Czech Republic, 1998 |
Proceedings of IK'98, Interdisziplinäres Kolleg, Spring School, Günne am Möhnesee, Germany, March 7-14 (1998)
Connectionist speech recognition, , in: Proceedings of IK'98, Interdisziplinäres Kolleg, Spring School, Günne am Möhnesee, Germany, March 7-14, 1998 |
Proc. 5th European Conference on Computer Vision (1998)
Continuous Audio-Visual Speech Recognition, and , in: Proc. 5th European Conference on Computer Vision, Springer Verlag, 1998 |
|
1st International Conference on Multisource-Multisensor Data Fusion (1998)
Decision fusion using a multi-linear classifier, , and , in: 1st International Conference on Multisource-Multisensor Data Fusion, 1998 |
Machine Learning: ECML-98 (1998)
Improved Pairwise Coupling Classification With Correcting Classifiers, and , in: Machine Learning: ECML-98, Springer, 1998 |
|
Proceedings of International Conference on Spoken Language Processing (ICSLP'98) Sydney, Australia (1998)
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP'98) Sydney, Australia, 1998 |
|
TSD'98-Text, Speech and Dialog International Workshop (1998)
Interfacing of CASA and Multistream recognition, , , and , in: TSD'98-Text, Speech and Dialog International Workshop, Brno, Czech Republic, 1998 |
|
ICSLP'98 (1998)
Interfacing of CASA and partial recognition based on a multistream technique, , , and , in: ICSLP'98, Sydney, 1998 |
|
Reconnaissance du locuteur et ses applications commerciales et criminalistiques (1998)
POLYCOST: a telephone-speech database for speaker recognition, , , and , in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998 |
Journées Etude Parole - Martigny (1998)
Reconnaissance multi-bandes de la parole bruitée par couplage entre les niveaux primitifs et d'identification, , , and , in: Journées Etude Parole - Martigny, 1998 |
|
Neurosciences et Sciences de l'Ingénieur'98 - Munster, CNRS (1998)
Reconnaissance robuste de la parole par segmentation signal/bruit en sous-bandes, , , and , in: Neurosciences et Sciences de l'Ingénieur'98 - Munster, CNRS, 1998 |
|
Proceedings of ICSLP, Sydney (1998)
Speech pre-processing against intentional imposture in speaker recognition, and , in: Proceedings of ICSLP, Sydney, 1998 |
Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing --- ICASSP'98 (1998)
Text dependent speaker verification using binary classifiers, , and , in: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing --- ICASSP'98, IEEE, 1998 |
|
Proc. 5th Int. Conf. on Spoken Language Processing (1998)
Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database, and , in: Proc. 5th Int. Conf. on Spoken Language Processing, 1998 |
|
Proceedings of International Phonetic Science conference IPS98, Washington (1998)
Voice transformation, a tool for imposture of speaker verification, and , in: Proceedings of International Phonetic Science conference IPS98, Washington, 1998 |
IEEE 4th Workshop on Interactive Voice Technology for Telecommunications Applications (IVTTA'98) September 29-30, Torino, Italy (1998)
Voice-B System, , , , and , in: IEEE 4th Workshop on Interactive Voice Technology for Telecommunications Applications (IVTTA'98) September 29-30, Torino, Italy, 1998 |
Proceedings of the Fifth International Workshop on Artificial Intelligence for High Energy Physics (1997)
A Connectionist System for Two-Dimensional Representation of Multivariate Location Data, and , in: Proceedings of the Fifth International Workshop on Artificial Intelligence for High Energy Physics, AIHENP, Lausanne, Switzerland, Elsevier Science, 1997 |
Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97) (1997)
Acoustic-Labial Speaker Verification, , , and , in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997 |
Proceedings of the International Conference on Neural Networks (1997)
Adapting the 2-Class Recursive Deterministic Perceptron Neural Network to m Classes, , , and , in: Proceedings of the International Conference on Neural Networks, IEEE, 1997 |
Proceedings of the Workshop on Optics and Computer Science (1997)
An Optical Thresholding Perceptron, , , , and , in: Proceedings of the Workshop on Optics and Computer Science, Geneva, Switzerland, 1997 |
|
EUROSPEECH'97 (1997)
Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems, , , and , in: EUROSPEECH'97, 1997 |
|
Proceedings of the International Conference on Artificial Neural Networks (ICANN'97) (1997)
Handwritten Digit Recognition with Binary Optical Perceptron, , , and , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
|
International School on Neural Nets: Adaptive Processing of Temporal Information (1997)
Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, and , in: International School on Neural Nets: Adaptive Processing of Temporal Information, Springer Verlag, 1997 |
|
IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing (1997)
Hybrid HMM/ANN Systems for Training Independent Tasks: Experiments on 'Phonebook' and Related Improvements, , , , and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
Proceedings of the European Conference on Speech Communication and Technology (1997)
Integrating Acoustic and Labial Information for Speaker Identification and Verification, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1997 |
Eurospeech 97 (1997)
Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, and , in: Eurospeech 97, 1997 |
|
Proceedings of the International Conference on Artificial Neural Networks (ICANN'97) (1997)
Mixtures of Experts Estimate A Posteriori Probabilities, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
|
On the Complexity of Recognizing Iterated Differences of Polyhedra, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
|
Proceedings of The Fourteenth International Conference on Machine Learning (1997)
On the Decomposition of Polychotomies into Dichotomies, and , in: Proceedings of The Fourteenth International Conference on Machine Learning, Morgan Kaufmann, 1997 |
|
Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97) (1997)
Person Authentication by Fusing Face and Speech Information, , , and , in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997 |
Proc. of the ESCA-NATO Workshop on Robust Speech Recognition for Unknown Communication Channels (1997)
Robust Speech Recognition based on Multi-Stream Features, , and , in: Proc. of the ESCA-NATO Workshop on Robust Speech Recognition for Unknown Communication Channels, 1997 |
|
Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'97) (1997)
Speaker Verification in the Telephone Network: Research Activities in the CAVE Project, , , , , and , in: Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'97), 1997 |
|
IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing (1997)
Speaker-Dependent Speech Recognition Based on Phone-Like Unit Model -- Application to Voice Dialing, and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
Proceedings of the International Conference on Artificial Neural Networks (ICANN'97) (1997)
State-of-the-Art and Recent Progress in Hybrid HMM/ANN Speech Recognition, , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), Springer-Verlag, 1997 |
IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing (1997)
Subband-Based Speech Recognition, and , in: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1997 |
|
Proceedings of the European Conference on Speech Communication and Technology (1997)
Towards Speaker Independent Continuous Speechreading, , in: Proceedings of the European Conference on Speech Communication and Technology, 1997 |
|
EUROSPEECH'97 (1997)
Using Multiple Time Scales in a Multi-Stream Speech Recognition System, and , in: EUROSPEECH'97, 1997 |
|
Proceedings of the 8th IEEE International Conference on Tools with Artificial Intelligence (1996)
A Boolean Approach to Construct Neural Networks for Non-Boolean Problems, and , in: Proceedings of the 8th IEEE International Conference on Tools with Artificial Intelligence, IEEE, 1996 |
Proceedings of the Third IEEE International Conference on Electronics, Circuits, and Systems (1996)
A Method for All-Positive Optical Multilayer Perceptrons, , and , in: Proceedings of the Third IEEE International Conference on Electronics, Circuits, and Systems, University of Patras, Rhodos, Greece, IEEE, 1996 |
|
Journées d'études sur la parole (1996)
Amélioration des performances de vérification du locuteur par combinaison de méthodes, , , and , in: Journées d'études sur la parole, JEP, 1996 |
Proceedings of ESANN'96 (1996)
Bounds on the Degree of High Order Binary Perceptrons, , in: Proceedings of ESANN'96, D facto, 1996 |
|
Proceedings of The Fourth International Conference on Spoken Language Processing (1996)
Combining methods to improve speaker verification decision, , , and , in: Proceedings of The Fourth International Conference on Spoken Language Processing, ICSLP, 1996 |
|
Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing (1996)
Connectionist Quantization Functions, , and , in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996 |
|
Proceedings of JEP'96: XXIèmes Journées d'étude sur la Parole (1996)
ETC_vérif : un environnement multi-agents de reconnaissance automatique de la parole en continu, and , in: Proceedings of JEP'96: XXIèmes Journées d'étude sur la Parole, 1996 |
|
Proceedings of the International Conference on Neural Information Processing (1996)
Extended Cauchy Machines, and , in: Proceedings of the International Conference on Neural Information Processing, 1996 |
Proceedings of the Fifth International Conference on Microelectronics for Neural Networks and Fuzzy Systems: MicroNeuro'96 (1996)
Hardware-Friendly Learning Algorithms for Neural Networks: An Overview, and , in: Proceedings of the Fifth International Conference on Microelectronics for Neural Networks and Fuzzy Systems: MicroNeuro'96, EPFL and CSEM, Lausanne, Switzerland, IEEE Computer Society Press, 1996 |
|
Proceedings ISAI /IFIS 1996 (1996)
Image Classification by Neural Networks for the Quality Control of Watches, , and , in: Proceedings ISAI /IFIS 1996, ITESM, Cancun, Mexico, ITESM, 1996 |
Proceedings of the International Conference on Pattern Recognition (ICPR'96) (1996)
Learning to recognise talking faces, , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR'96), IAPR, 1996 |
|
Locating and tracking facial speech features, , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR'96), IAPR, 1996 |
|
European Conference on Multimedia Applications, Services and Techniques (1996)
Multi-modal person verification tools using speech and images, , in: European Conference on Multimedia Applications, Services and Techniques, 1996 |
The 1st Workshop on Soft Computing (1996)
Neural Network Pruning and Pruning Parameters, and , in: The 1st Workshop on Soft Computing, Dept. of Information Electronics Nagoya University, 1996 |
|
Proceedings of the 8th European Signal Processing Conference (Eusipco'96) (1996)
New time-frequency derived cepstral coefficients for automatic speech recognition, and , in: Proceedings of the 8th European Signal Processing Conference (Eusipco'96), 1996 |
|
Proceedings of the First International Symposium on Neuro-Fuzzy Systems (AT'96) (1996)
Overcoming Inaccuracies in Optical Multilayer Perceptrons, , and , in: Proceedings of the First International Symposium on Neuro-Fuzzy Systems (AT'96), Lausanne, Switzerland, AATI, 1996 |
1996
Polycost Database, , and , 1996 |
Proceedings of IVTTA 1996 IEEE Third Workshop Interactive Voice Technology for Telecommunications Applications (1996)
Secured vocal access to telephone servers, , , , and , in: Proceedings of IVTTA 1996 IEEE Third Workshop Interactive Voice Technology for Telecommunications Applications, 1996 |
Application of speaker recognition techniques in telephony (1996)
Semi-automatic HMM-based annotation of the PolyCOST Database, , , and , in: Application of speaker recognition techniques in telephony, COST250, 1996 |
Proceedings of the International Conference on Neural Networks (1996)
Sparse Initial Topologies for High Order Perceptrons, , and , in: Proceedings of the International Conference on Neural Networks, IEEE, 1996 |
Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96) (1996)
Speechreading using shape and intensity information, , and , in: Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), 1996 |
|
Speaker identification by lipreading, , and , in: Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), 1996 |
|
Proceedings of the 8th European Signal Processing Conference (Eusipco'96) (1996)
Statistical lip modelling for visual speech recognition, , and , in: Proceedings of the 8th European Signal Processing Conference (Eusipco'96), 1996 |
|
Proceedings of Workstations und ihre Anwendungen, SIWORK'96 (1996)
Sun Workstation and SwissNet Platform for Speech Recognition and Speaker Verification over the Telephone, , , , and , in: Proceedings of Workstations und ihre Anwendungen, SIWORK'96, 1996 |
|
Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing (1996)
Superceptron Construction, , and , in: Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing, Scientific and Parallel Computing Group, University of Geneva, Geneva, Switzerland, 1996 |
Proceedings of The 3rd Slovenian-German and 2nd SDRV Workshop, Speech and Image Understanding (1996)
Swiss PolyPhone and PolyVar: Building Databases for Speech Recognition and Speaker Verification, and , in: Proceedings of The 3rd Slovenian-German and 2nd SDRV Workshop, Speech and Image Understanding, 1996 |
4ème Colloque National sur l'Écrit et le Document (CNED'96) (1996)
Traitement préliminaire de l'image d'un texte manuscrit en vue de sa reconnaissance: une méthode de sur-segmentation, , and , in: 4ème Colloque National sur l'Écrit et le Document (CNED'96), 1996 |
Proceedings of JEP'96: XXIèmes Journées d'étude sur la Parole (1996)
Un système prédictif de la structuration syntaxico-rythmique d'un énoncé à l'aide d'informations prosodiques, , and , in: Proceedings of JEP'96: XXIèmes Journées d'étude sur la Parole, 1996 |
|
Proceedings of ICSLP 96 (1996)
Validating Different Flexible Vocabulary Approaches on the Swiss French PolyPhone and PolyVar databases, , , and , in: Proceedings of ICSLP 96, 1996 |
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96) (1996)
Visual Speech Recognition using Active Shape Models and Hidden Markov Models, , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96), 1996 |
|
Proc. of WOz'95: International Workshop on Oz Programming (1995)
A graphical tool for monitoring Oz objects activity, and , in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995 |
International Congress of Phonetic Sciences (1995)
A study of Intra- and Inter-Speaker Variability in the Voices of Twins for Speaker Verification, and , in: International Congress of Phonetic Sciences, 1995 |
SIPAR Workshop'95 Parallel and Distributed Systems (1995)
Boolean Logic Inspired High Order Perceptron Construction, , and , in: SIPAR Workshop'95 Parallel and Distributed Systems, SIPAR SI Group for Parallel Systems, Biel School of Engineering, Computer Science Department, 1995 |
|
4th European Conference on Speech Communication and Technology (1995)
Discrimination of the voices of twins and siblings for speaker verification, and , in: 4th European Conference on Speech Communication and Technology, 1995 |
Actes des 3èmes Journées Francophones sur l'Intelligence Artificielle Distribuée et les Systèmes Multi-agents (1995)
Environnement multi-agents de reconnaissance automatique de la parole en continu, and , in: Actes des 3èmes Journées Francophones sur l'Intelligence Artificielle Distribuée et les Systèmes Multi-agents, 1995 |
Proc. of WOz'95: International Workshop on Oz Programming (1995)
ETC_vérif, a Prototype of a Cooperative Automatic Speech Recognition System, and , in: Proc. of WOz'95: International Workshop on Oz Programming, IDIAP, Uni. Fribourg, 1995 |
1995 International Symposium on Artificial Neural Networks (ISANN'95) (1995)
Evaluating pruning methods, and , in: 1995 International Symposium on Artificial Neural Networks (ISANN'95), 1995 |
|
Proceedings of the International Conference on Neural Networks (1995)
Gain Elimination from Backpropagation Neural Networks, , and , in: Proceedings of the International Conference on Neural Networks, IEEE, Perth, IEEE, 1995 |
Second Asian Conference on Computer Vision (ACCV'95), Singapore (1995)
Handwriting Recognition, , in: Second Asian Conference on Computer Vision (ACCV'95), Singapore, 1995 |
International Congress of Phonetic Sciences (1995)
Lexical filtering by means of prosodic information, , and , in: International Congress of Phonetic Sciences, 1995 |
4th European Conference on Speech Communication and Technology (1995)
Microprosodic study of isolated French word corpora, , in: 4th European Conference on Speech Communication and Technology, 1995 |
ICASSP (1995)
Neural nets approaches to Speaker Verification: comparison with Second Order Statistical Measure, and , in: ICASSP, 1995 |
Proceedings of the International Conference on Neural Networks (1995)
Non-Ontogenic Sparse Neural Networks, , and , in: Proceedings of the International Conference on Neural Networks, IEEE, IEEE, 1995 |
Proceedings of the SIPAR Workshop '95: Parallel and Distributed Systems (1995)
Ontogenic High Order Cauchy Machines, and , in: Proceedings of the SIPAR Workshop '95: Parallel and Distributed Systems, Biel School of Engineering, 1995 |
Optics and Information (1995)
Optical Multilayer Perceptrons based on Liquid Crystal Devices, , , and , in: Optics and Information, Cercle SFO/SEE d'Opto-informatique, Mulhouse, France, European Optical Society (EOS), 1995 |
4th European Conference on Speech Communication and Technology (1995)
Reliability in a Multi-agent Spoken Language Recognition System, and , in: 4th European Conference on Speech Communication and Technology, 1995 |
Linguistic Databases (1995)
Swiss-French Polyphone: a Telephone Speech Database to develop Interactive Voice Servers, , , and , in: Linguistic Databases, 1995 |
Proceedings of the International Conference on Artificial Neural Networks (ICANN'95 and NeuroNîmes'95) (1995)
The Effects of Optical Thresholding in Backpropagation Neural Networks, , and , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN'95 and NeuroNîmes'95), ENNS, Paris, France, EC2 & Cie, 1995 |
International Congress of Phonetic Sciences (1995)
The use of prosodic agents in a cooperative automatic speech recognition system, and , in: International Congress of Phonetic Sciences, 1995 |
International Conference on Pattern Recognition (ICPR), Jerusalem (1994)
A system for the off-line recognition of handwritten text, , in: International Conference on Pattern Recognition (ICPR), Jerusalem, 1994 |
IAPR Workshop on Document Analysis Systems (1994)
Design and Implementation of a System for the Recognition of Handwritten Responses on US Census Forms, , in: IAPR Workshop on Document Analysis Systems, 1994 |
Proceedings of the International Conference on Artificial Neural Networks (ICANN 94) (1994)
Modular Object-Oriented Neural Network Simulators and Topology Generalizations, , and , in: Proceedings of the International Conference on Artificial Neural Networks (ICANN 94), Sorrento, Italy, Springer-Verlag, 1994 |
|
Proceedings of the '94 SIPAR-Workshop on Parallel and Distributed Computing (1994)
Results on the Steepness in Backpropagation Neural Networks, , and , in: Proceedings of the '94 SIPAR-Workshop on Parallel and Distributed Computing, SI Group for Parallel Systems, 1994 |
Proceedings of the '94 SIPAR-Workshop on Parallel and Distributed Computing (1994)
Weight Initialization for High Order and Multilayer Perceptrons, and , in: Proceedings of the '94 SIPAR-Workshop on Parallel and Distributed Computing, SI Group for Parallel Systems, 1994 |
International Conference on Artificial Neural Networks (1993)
Do Backpropagation trained neural networks have normal weight distributions?, and , in: International Conference on Artificial Neural Networks, 1993 |
|
Proc. IEEE Conf. on Computer Vision and Pattern Recognition (1993)
Higher-Order Statistics in Visual Object Recognition, , in: Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 1993 |
|
International Conference on Document Analysis and Retrieval (ICDAR), Tsukuba Science City, Japan (1993)
Recognition of Handprinted Digits using Optimal Bounded Error Matching, , in: International Conference on Document Analysis and Retrieval (ICDAR), Tsukuba Science City, Japan, 1993 |
Publications of type Mastersthesis
1995
Optimisation de réseaux de neurones, , EPFL, Lausanne, Switzerland, 1995 |
Publications of type Phdthesis
2024
A Stochastic Approach to Contact-rich Manipulation, , Ecole Polytechnique Fédérale de Lausanne, 2024 |
|
Biologically Inspired Spiking Neural Networks for Speech Recognition, , EPFL/EDEE, 2024 |
[DOI] |
Performing And Detecting Backdoor Attacks on Face Recognition Algorithms, , Ecole Polytechnique Fédérale de Lausanne, 2024 |
|
Robot Learning using Tensor Networks, , Ecole Polytechnique Fédérale de Lausanne, 2024 |
[DOI] |
Safe Deep Neural Networks, , EPFL, 2024 |
|
2023
Data-driven urban building energy modeling in Satom (CH): The energy savings potential and use of available renewable energy sources., , and , Politecnico di Torino, 2023 |
[URL] |
Generalization and Personalization of Machine Learning for Multimodal Mobile Sensing in Everyday Life, , EPFL, 2023 |
|
Learning and Optimization of Anticipatory Feedback Controllers for Robot Manipulation, , École Polytechnique Fédérale de Lausanne, 2023 |
[DOI] |
Modeling Structured Data in Attention-based Models, , EPFL, 2023 |
[URL] |
Novel Methods For Detection And Analysis Of Atypical Aspects In Speech, , École Polytechnique Fédérale de Lausanne, 2023 |
[DOI] |
On matching data and model in LF-MMI-based dysarthric speech recognition, , École polytechnique fédérale de Lausanne, 2023 |
[DOI] [URL] |
Practical computational imaging by use of spatiotemporal light modulation: from simulations to applications in biological microscopy, , EPFL, 2023 |
[DOI] |
Privacy-Preserving Machine Learning on Graphs, , EPFL, 2023 |
[DOI] |
Sparse Autoencoders for Speech Modeling and Recognition, , École polytechnique fédérale de Lausanne, 2023 |
[DOI] |
Text Representation Learning for Low Cost Natural Language Understanding, , École polytechnique fédérale de Lausanne, 2023 |
[DOI] [URL] |
2022
Automatic pathological speech assessment, , École polytechnique fédérale de Lausanne, 2022 |
[DOI] |
Memory of Motion for Initializing Optimization in Robotics, , École Polytechnique Fédérale de Lausanne, 2022 |
|
Stop Wasting my FLOPS: Improving the Efficiency of Deep Learning Models, , École Polytechnique Fédérale de Lausanne, 2022 |
[DOI] |
Using synthetic fingerprint images to test the performance of an AFIS system, , Université de Lausanne, 2022 |
|
2021
Deep Learning Approaches for Auditory Perception in Robotics, , École polytechnique fédérale de Lausanne, 2021 |
|
Efficient Depth-based Deep Learning Methods for Multi-Party Pose Estimation, , École polytechnique fédérale de Lausanne, 2021 |
[DOI] |
Explainable Phonology-based Approach for Sign Language Recognition and Assessment, , Ecole Polytechnique Fédérale de Lausanne, 2021 |
|
Gradient-based Methods for Deep Model Interpretability, , École polytechnique fédérale de Lausanne, 2021 |
[DOI] |
Learning strategies and representations for intuitive robot learning from demonstration, , EPFL, 2021 |
|
Modeling and Inferring Attention between Humans or for Human-Robot Interactions, , Ecole Polytechnique Federale de Lausanne, 2021 |
[DOI] [URL] |
Novel Methods for Incorporating Prior Knowledge for Automatic Speech Assessment, , École polytechnique fédérale de Lausanne (EPFL), 2021 |
|
2020
Accurate Nod and 3D Gaze Estimation for Social Interaction Analysis, , EDEE, EPFL, 2020 |
|
Active Illumination and Computational Methods for Temporal and Spectral Super-Resolution Microscopy, , EPFL, 2020 |
[DOI] |
Context is Everything: Using a Smartphone App to Capture Young People's Drinking Behaviours, Cognitions, Environments, and Consequences, , La Trobe University, Melbourne, Australia, 2020 |
[DOI] |
Deep Generative Models and Applications, and , EPFL, 2020 |
[DOI] [URL] |
Detection of disguised speech in forensic science by humans and automatic systems, , Université de Lausanne Ecole des Sciences Criminelles, 2020 |
|
Discourse Phenomena in Machine Translation, , École polytechnique fédérale de Lausanne, 2020 |
|
Product of experts for robot learning from demonstration, , EPFL, 2020 |
Robot skills learning with Riemannian manifolds : Leveraging geometry-awareness in robot learning, optimization and control, , Ecole Polytechnique Fédérale de Lausanne, 2020 |
|
2019
Automated Daylighting Control System based on Sky Luminance Monitoring and Lighting Computing, , and , EPFL, 2019 |
[DOI] |
Language Independent Query by Example Spoken Term Detection, , École Polytechnique Fédérale de Lausanne, 2019 |
|
Multimodal Person Recognition in Audio-Visual Streams, , EPFL, 2019 |
[DOI] |
Sparse and Low-rank Modeling for Automatic Speech Recognition, , EPFL, 2019 |
[DOI] |
Trustworthy speaker recognition with minimal prior knowledge using neural networks, , Ecole polytechnique fédérale de Lausanne (EPFL), 2019 |
[DOI] [URL] |
2018
Generative Models for Learning Robot Manipulation Skills from Humans, , Ecole Polytechnique Federale de Lausanne, 2018 |
[DOI] |
Learning embeddings: efficient algorithms and applications, , École Polytechnique Fédérale de Lausanne, 2018 |
[DOI] |
Novel Algorithms for Clustering, , École polytechnique fédérale de Lausanne, 2018 |
[DOI] |
Theory and Algorithms for Hypothesis Transfer Learning, , EPFL, 2018 |
[DOI] |
Word Sense Consistency in Statistical and Neural Machine Translation, , École Polytechnique Fédérale de Lausanne, 2018 |
|
2017
Intonation Modelling for Speech Synthesis and Emphasis Preservation, , École Polytechnique Fédérale de Lausanne, 2017 |
[DOI] |
Large-Scale Image Segmentation with Convolutional Networks, , Sciences et Techniques de l’Ingénieur (STI), 2017 |
|
Object Detection with Active Sample Harvesting, , École Polytechnique Fédérale de Lausanne, 2017 |
|
On Modeling the Synergy Between Acoustic and Lexical Information for Pronunciation Lexicon Development, , École polytechnique fédérale de Lausanne (EPFL), 2017 |
[DOI] |
Visual Analysis of Maya Glyphs via Crowdsourcing and Deep Learning, , École Polytechnique Fédérale de Lausanne, 2017 |
[DOI] |
2016
"Can you hear me now?" - Automatic assessment of background noise intrusiveness and speech intelligibility in telecommunications, , Sciences et Techniques de l’Ingénieur (STI), 2016 |
[DOI] |
Building Word Embeddings for Solving Natural Language Processing, , École Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
Computational Analysis of Urban Places Using Mobile Crowdsensing, , Ecole Polytechnique Federale de Lausanne, 2016 |
[DOI] |
Learning Explainable User Sentiment and Preferences for Information Filtering, , École Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
Towards End-to-End Speech Recognition, , Ecole polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
Word Sequence Modeling using Deep Learning: an End-to-end Approach and its Applications, , EPFL, 2016 |
[DOI] |
2015
3D Gaze Estimation from Remote RGB-D Sensors, , École Polytechnique Fédérale de Lausanne, 2015 |
[DOI] |
Automatic social role recognition and its application in structuring multiparty interactions, , EPFL, 2015 |
|
Computational Analysis Of Behavior In Employment Interviews And Video Resumes, , École Polytechnique Fédérale de Lausanne, 2015 |
|
Enabling speech applications using Ad-Hoc Microphone Arrays, , École Polytechnique Fédérale de Lausanne, 2015 |
|
Statistical Models in Automatic Speech Recognition, , University of Fribourg, Department of Mathematics, 2015 |
|
Trustworthy Biometric Verification under Spoofing Attacks: Application to the Face Mode, , École Polytechnique Fédérale de Lausanne, 2015 |
[URL] |
2014
Grapheme-based Automatic Speech Recognition using Probabilistic Lexical Modeling, , École polytechnique fédérale de Lausanne, 2014 |
[DOI] |
Human Tracking and Pose Estimation in Open Spaces, , École Polytechnique Fédérale de Lausanne (EPFL), 2014 |
|
Inferring Visual Attention and Addressee in Human Robot Interaction, , École Polytechnique Fédérale de Lausanne (EPFL), 2014 |
|
Saliency-based Representations and Multi-component Classifiers for Visual Scene Recognition, , École Polytechnique Fédérale de Lausanne (EPFL), 2014 |
|
Scalable Probabilistic Models for Face and Speaker Recognition, , École Polytechnique Fédérale de Lausanne (EPFL), 2014 |
[URL] |
Tractable Approaches to Learning and Planning in High Dimensions, , EPFL, 2014 |
[DOI] |
2013
Automatic Personality Perception: Inferring Personality Traits from Nonverbal Vocal Behavior, , Electrical Engineering Department, EPFL, 2013 |
|
Learning to Learn by Exploiting Prior Knowledge, , EDIC, 2013 |
|
Mining Conversational Social Video, , EPFL, 2013 |
|
Model-based Sparse Component Analysis for Multiparty Distant Speech Recognition, , École Polytechnique Fédérale de Lausanne, 2013 |
|
Multilingual speech recognition A posterior based approach, , École Polytechnique Fédérale de Lausanne (EPFL), 2013 |
|
Object Classification and Detection in High Dimensional Feature Space, , Programme doctoral en Informatique, Communications et Information, 2013 |
|
Similarity Learning Over Large Collaborative Networks, , and , EPFL, 2013 |
|
2012
Alternative search techniques for face detection using location estimation and binary features, , ECOLE POLYTECHNIQUE FEDERALE DE LAUSANNE, 2012 |
|
Data-Driven Enhancement of State Mapping-Based Cross-Lingual Speaker Adaptation, , École Polytechnique Fédérale de Lausanne, 2012 |
|
Sequential Topic Models for Mining Recurrent Activities and their Relationships : Application to long term video recordings, , École Polytechnique Fédérale de Lausanne, 2012 |
|
Statistical Shape Descriptors for Ancient Maya Hieroglyphs Analysis, , École Polytechnique Fédérale de Lausanne, 2012 |
|
Unified Framework Of Feature Based Adaptation For Statistical Speech Synthesis And Recognition, , Ecole Polytechnique Federale de Lausanne (EPFL), 2012 |
|
2011
A Probabilistic Approach to Socio-Geographic Reality Mining, , Ecole Polytechnique Fédérale de Lausanne, 2011 |
|
Bayesian Approaches to Uncertainty in Speech Processing, , School of Computing Sciences, University of East Anglia, 2011 |
|
Boosting Localized Features for Speaker and Speech Recognition, , Ecole Polytechnique Federale de Lausanne (EPFL), 2011 |
|
Computational modeling of face-to-face social interaction using nonverbal behavioral cues, , Ecole Polytechnique Fédérale de Lausanne, 2011 |
|
Modeling and understanding communities in online social media using probabilistic methods, , Ecole polytechnique fédérale de Lausanne, 2011 |
[DOI] [URL] |
Open-ended Learning of Visual and Multi-modal Patterns, , Ecole polytechnique fédérale de Lausanne, 2011 |
|
Privacy-Sensitive Audio Features for Conversational Speech Processing, , Ecole Polytechnique Fédérale de Lausanne, 2011 |
|
2010
An Information Theoretic Approach to Speaker Diarization of Meeting Recordings, , Ecole polytechnique fédérale de Lausanne, 2010 |
|
Multilayer Perceptron Based Hierarchical Acoustic Modeling for Automatic Speech Recognition, , Ecole polytechnique fédérale de Lausanne, 2010 |
|
Social Network Analysis for Automatic Role Recognition, , Ecole Polytechnique Fédérale de Lausanne, 2010 |
|
2009
On the design of audio features robust to the album-effect for music information retrieval., , Ecole Polytechnique Fédérale de Lausanne, 2009 |
2008
Acoustic Models for Posterior Features in Speech Recognition, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
|
Enhancing posterior based speech recognition systems, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
|
Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
|
Machine Learning for Information Retrieval, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
|
Methods for Asynchronous and Non-Invasive EEG-Based Brain-Computer Interfaces. Towards Intelligent Brain-Actuated Wheelchairs, , University of Barcelona, 2008 |
|
Probabilistic models for music, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
[URL] |
2007
Bayesian methods for visual multi-object tracking with applications to human activity recognition, , École Polytechnique Fédérale de Lausanne, 2007 |
|
Error-related EEG potentials in brain-computer interfaces, , École Polytechnique Fédérale de Lausanne, 2007 |
|
Joint Head Tracking and Pose Estimation for Visual Focus of Attention Recognition, , École Polytechnique Fédérale de Lausanne, 2007 |
|
Learning the structure of image collections with latent aspect models, , École Polytechnique Fédérale de Lausanne, 2007 |
|
Scene image classification and segmentation with quantized local descriptors and latent aspect modeling, , École Polytechnique Fédérale de Lausanne, 2007 |
|
2006
Analysis and Classification of EEG Signals using Probabilistic Models for Brain Computer Interfaces, , École Polytechnique Fédérale de Lausanne, 2006 |
[DOI] [URL] |
Ensembles for Sequence Learning, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Face Detection and Verification using Local Binary Patterns, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Machine Learning Approaches to Text Representation using Unlabeled Data, , Ecole Polytechnique Fédérale de Lausanne, 2006 |
|
Multi-stream Processing for Noise Robust Speech Recognition, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Multi-system Biometric Authentication: Optimal Fusion and User-Specific Information, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Prior Knowledge in Kernel Methods, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Probabilistic Graphical Models for Human Interaction Analysis, , École Polytechnique Fédérale de Lausanne, 2006 |
|
Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, , Ecole Polytechnique Fédérale de Lausanne, 2006 |
|
Two-Handed Gestures for Human-Computer Interaction, , École Polytechnique Fédérale de Lausanne, 2006 |
|
2005
Face Authentication Based on Local Features and Generative Models, , École Polytechnique Fédérale de Lausanne, 2005 |
|
Joint Speech and Speaker Recognition, , École Polytechnique Fédérale de Lausanne, Computer Science Department, 2005 |
|
Multimedia event modelling and recognition, , École Polytechnique Fédérale de Lausanne, 2005 |
Using Auxiliary Sources of Knowledge for Automatic Speech Recognition, , École Polytechnique Fédérale de Lausanne, Computer Science Department, 2005 |
|
2004
Large Scale Machine Learning, , Université de Paris VI, 2004 |
|
Nonlinear Feature Transformations for Noise Robust Speech Recognition, , Ecole Polytechnique Fédérale de Lausanne, 2004 |
|
Robust Audio Segmentation, , and , École Polytechnique Fédérale de Lausanne, 2004 |
|
2003
HMM Mixtures (HMM2) for Robust Speech Recognition, , Ecole Polytechnique Federale de Lausanne, 2003 |
|
Speech Recognition with Auxiliary Information, , École Polytechnique Fédérale de Lausanne, Computer Science Department, 2003 |
|
Text detection and recognition in images and video sequences, , École Polytechnique Fédérale de Lausanne, 2003 |
|
2001
PhD Thesis: Speech Analysis with Production Constraints, , École Polytechnique Fédérale de Lausanne, 2001 |
|
Robust speech recognition based on multi-stream processing, , École Polytechnique Fédérale de Lausanne, 2001 |
|
2000
Mixture Models for Unsupervised and Supervised Learning, , École Polytechnique Fédérale de Lausanne, Computer Science Department, 2000 |
|
The use of Boolean concepts in general classification contexts, , Ecole Polytechnique Federale de Lausanne, 2000 |
|
1999
Reconnaissance et Transformation de Locuteurs, , École Polytechnique Fédérale de Lausanne, 1999 |
|
1997
Optimization of high order perceptrons, , École Polytechnique Fédérale de Lausanne, 1997 |
|
Visual Speech and Speaker Recognition, , University of Sheffield, 1997 |
|