Keywords:
- 3D face model
- abnormality detection
- acoustic generators
- Acoustic signal processing
- activity
- appearance based methods
- Appearance based model
- appearance model
- Archaeology
- Artificial Neural Networks
- attention
- audio-visual speaker recognition
- Autism
- backchannels
- Bayesian modeling
- behavior analysis.
- bias correction
- blink
- bobbing estimation
- camera network
- Children
- clustering
- cognition
- computer vision
- Content-based multimedia indexing
- conversation
- convolutional network
- Convolutional neural network.
- Convolutional Neural Networks
- corpus
- covariance matrices
- Crowdsourcing
- Cultural heritage
- dataset
- deep learning
- Deep Metric Learning.
- deep neural networks
- Delays
- diarization
- Dimensionality reduction
- direction-of-arrival estimation
- DOA estimation
- domain adaptation
- embedding
- embedding learning
- Encoding
- entrainment to music
- Epigraphy
- Estimation
- eye movements
- eye tracking
- eye-gaze
- Face
- face clustering
- Face dirarization
- Face Recognition
- Face tracking
- Facial animations
- Feature extraction
- Feature-based tracking
- first impressions
- focus of attention
- gait
- Gaze
- Gaze Coding
- gaze detection
- Gaze estimation
- generative models
- geometric method
- grapevine pruning
- group dynamics
- HCI
- head nods
- Head pose
- Head pose tracking
- head-pose invariance
- HHI
- hieroglyph
- Histogram of orientation
- HOOSC
- HRI
- human activity recognition
- human behaviour analysis
- human detection
- Human pose estimation
- human-robot interaction
- image rectification
- image retrieval
- image segmentation
- indexing
- information fusion
- information visualization
- internet of things
- involvement
- keyframe extraction
- language
- learning
- likelihood-based encoding
- listener categories
- machine learning
- manipulation
- Maya civilization
- Maya culture
- Maya glyph
- maya glyphs
- Metric learning
- microphone arrays
- Microphones
- mixed activity
- Monte Carlo methods
- motif mining
- multi-camera
- multi-object tracking
- Multimodal
- multimodal identification
- Multimodal interaction
- Multimodal person diarization
- multiple face tracking
- multiple sound sources
- multiple speaker detection
- multivariate time series
- network output
- neural nets
- neural network-based sound source localization methods
- neural networks
- non parametric models
- non-verbal cues
- Nonverbal behavior
- OCR
- online calibration.
- particle filter
- person diarization
- person discovery
- person identification
- person invariance
- Person Tracking
- plant skeleton
- pLSA
- Position measurement
- precision viticulture
- real-time
- remote
- remote recording
- remote sensing
- remote sensor
- representation learning
- RGB-D
- RGB-D camera
- RGB-D cameras
- road vehicles
- Robots
- saccade
- Sampling
- scene analysis
- segmentation
- shape classification
- Shape descriptor
- shape recognition
- Shape retrieval
- shot boundary detection
- simultaneous detection
- single sound source
- sketch
- skin colour
- social computing
- sound mixtures
- sound source localization
- sparse autoencoder
- sparse coding
- spatial spectrum-based approaches
- speaker
- Speaker Diarization
- Speaker identification
- speaker recognition
- speaker verification
- spectral shot clustering
- Speech
- surveillance
- topic models
- tracking
- training
- transfer learning
- triplet loss
- unsupervised
- Unsupervised · Latent sequential patterns · Topic models · PLSA · Video surveillance · Activity analysis
- unsupervised activity analysis
- unsupervised calibration
- unsupervised learning
- usability
- user study
- variational inference
- ve- hicle detection.
- VFOA
- video
- video processing
- video structuring
- vineyard
- virtual agents
- visual focus of attention
- Visual similarity
- weakly-supervised learning.
Publications of Jean-Marc Odobez
2014
A Conditional Random field approach for audio-visual people diarization, , , , and , in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 116 - 120, IEEE, 2014 |
[DOI] |
Automated Bobbing and Phase Analysis to Measure Walking Entrainment, , , , , , and , in: IEEE International Conference on Image Processing (ICIP), Paris, 2014 |
|
Automatic Maya Hieroglyph Retrieval Using Shape and Context Information, , , , and , in: ACM MM, pages 4, 2014 |
[URL] |
Comparison of Two Methods for Unsupervised Person Identification in TV Shows, , , , and , in: 12th International Workshop on Content-Based Multimedia Indexing, 2014 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , Idiap-RR-06-2014 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , Idiap-RR-05-2014 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , in: Transactions on Image Processing, 2014 |
|
Exploiting Scene Cues for Dropped Object Detection, , and , in: 9th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications., 2014 |
|
EYEDIAP Database: Data Description and Gaze Tracking Evaluation Benchmarks, , and , Idiap-RR-08-2014 |
|
EYEDIAP: A Database for the Development and Evaluation of Gaze Estimation Algorithms from RGB and RGB-D Cameras, , and , in: Proceedings of the ACM Symposium on Eye Tracking Research and Applications, Safety Harbor, Florida, United States of America, ACM, 2014 |
[DOI] |
Face identification from overlaid texts using Local Face Recurrent Patterns and CRF models, , , , and , in: IEEE International Conference on Image Processing 2014, Paris, IEEE, 2014 |
|
Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras, and , in: IEEE Computer Vision and Pattern Recognition Conference, Columbus, Ohio,USA, pages 1773-1780, IEEE, 2014 |
[DOI] |
Improving Head and Body Pose Estimation through Semi-supervised Manifold Alignment, , , , and , in: International Conference on Image Processing, 2014 |
|
Is That a Jaguar? Segmenting Ancient Maya Glyphs via Crowdsourcing, , and , in: Proc. ACM Int. Workshop on Crowdsourcing for Multimedia, Orlando, pages 37-40, ACM New York, 2014 |
[DOI] |
Leveraging Colour Segmentation for Upper-Body Detection, and , in: Pattern Recognition, 47(6):2222-2230, 2014 |
|
Temporal Analysis of Motif Mixtures using Dirichlet Processes, , and , in: IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), 36(1), 2014 |
|
The MAAYA Project: Multimedia Analysis and Access for Documentation and Decipherment of Maya Epigraphy, , , , , , and , in: Proc. Digital Humanities Conference, Lausanne, 2014 |
|
What to Show? Automatic Stream Selection Among Multiple Sensors, , and , in: International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2014 |
|
Who Will Get the Grant ? A Multimodal Corpus for the Analysis of Conversational Behaviours in Group Interviews, , , , and , in: International Conference on Multimodal Interaction, Understanding and Modeling Multiparty, Multimodal Interactions Workshop, Istanbul, Turkey, ACM, 2014 |
[DOI] |
2013
A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, , , and , in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013 |
[DOI] |
A Sequential Topic Model for Mining Recurrent Activities from Long Term Video Logs, , and , in: International Journal of Computer Vision, 103(1):100-126, 2013 |
|
A Track Creation and Deletion Framework for Long-Term Online Multi-Face Tracking, and , in: IEEE Transactions on Image Processing, 2013 |
|
Analyse non supervisée d'activités en vidéo surveillance pour l'analyse de scène et la détection d'événements anormaux, and , Idiap-RR-20-2013 |
[URL] |
Clustering flood events from water quality time-series using Latent Dirichlet Allocation model, , , , , , , , , and , in: Water Resources Research, 2013 |
[DOI] |
Context Aware Addressee Estimation for Human Robot Interaction, , , and , in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013 |
Discovering Temporal Patterns in Water Quality Time Series, Focusing on Floods with the LDA method, , , , , , , and , in: European Geosciences Union, 2013 |
|
Evaluating Shape Descriptors for Detection of Maya Hieroglyphs, , and , in: in Proc. Mexican Conf. on Pattern Recognition, Queretaro, 2013 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , in: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, Dallas, Texas, USA, pages 97-104, ACM, 2013 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , Idiap-RR-31-2013 |
|
Given that, Should I Respond? Contextual Addressee Estimation in Multi-Party Human-Robot Interactions, and , in: Proceedings of Human Robot Interaction (HRI) Conference, 2013 |
|
Investigating time-sensitive topic model approaches for action recognition, , and , Idiap-RR-26-2013 |
|
Leveraging the robot dialog state for visual focus of attention recognition, , , , and , in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013 |
Observation of Vehicle Axles Through Pass-by Noise: A Strategy of Microphone Array Design, , , , and , in: IEEE Trans. on Intelligent Transportation Systems, 2013 |
|
Parameter Estimation and Contextual Adaptation for a Multi-Object Tracking CRF Model, and , in: IEEE Workshop on Performance Evaluation of Tracking and Surveillance, 2013 |
|