Keywords:
- 3D face model
- abnormality detection
- acoustic generators
- Acoustic signal processing
- activity
- Animal behavior recognition
- appearance based methods
- Appearance based model
- appearance model
- Archaeology
- Artificial Neural Networks
- attention
- audio-visual speaker recognition
- Autism
- backchannels
- Bayesian modeling
- behavior analysis.
- bias correction
- blink
- bobbing estimation
- camera network
- Children
- Chimpanzees
- clustering
- cognition
- computer vision
- Content-based multimedia indexing
- conversation
- convolutional network
- Convolutional neural network.
- Convolutional Neural Networks
- corpus
- covariance matrices
- Crowdsourcing
- Cultural heritage
- dataset
- deep learning
- Deep Metric Learning.
- deep neural networks
- Delays
- diarization
- Dimensionality reduction
- direction-of-arrival estimation
- DOA estimation
- domain adaptation
- embedding
- embedding learning
- Encoding
- entrainment to music
- Epigraphy
- Estimation
- eye movements
- eye tracking
- eye-gaze
- Face
- face clustering
- Face dirarization
- Face Recognition
- Face tracking
- Facial animations
- Feature extraction
- Feature-based tracking
- first impressions
- focus of attention
- gait
- Gaze
- Gaze Coding
- gaze detection
- Gaze estimation
- generative models
- geometric method
- grapevine pruning
- Great apes
- group dynamics
- HCI
- head nods
- Head pose
- Head pose tracking
- head-pose invariance
- HHI
- hieroglyph
- Histogram of orientation
- HOOSC
- HRI
- human activity recognition
- human behaviour analysis
- human detection
- Human pose estimation
- human-robot interaction
- image rectification
- image retrieval
- image segmentation
- indexing
- information fusion
- information visualization
- internet of things
- involvement
- keyframe extraction
- language
- learning
- likelihood-based encoding
- listener categories
- machine learning
- manipulation
- Maya civilization
- Maya culture
- Maya glyph
- maya glyphs
- Metric learning
- microphone arrays
- Microphones
- mixed activity
- Monte Carlo methods
- motif mining
- multi-camera
- multi-object tracking
- Multimodal
- multimodal identification
- Multimodal interaction
- Multimodal person diarization
- multiple face tracking
- multiple sound sources
- multiple speaker detection
- multivariate time series
- network output
- neural nets
- neural network-based sound source localization methods
- neural networks
- non parametric models
- Non-human primates
- non-verbal cues
- Nonverbal behavior
- OCR
- online calibration.
- particle filter
- person diarization
- person discovery
- person identification
- person invariance
- Person Tracking
- plant skeleton
- pLSA
- pose estimation
- Position measurement
- precision viticulture
- real-time
- remote
- remote recording
- remote sensing
- remote sensor
- representation learning
- RGB-D
- RGB-D camera
- RGB-D cameras
- road vehicles
- Robots
- saccade
- Sampling
- scene analysis
- segmentation
- shape classification
- Shape descriptor
- shape recognition
- Shape retrieval
- shot boundary detection
- simultaneous detection
- single sound source
- Skeleton-based action recognition
- sketch
- skin colour
- social computing
- sound mixtures
- sound source localization
- sparse autoencoder
- sparse coding
- spatial spectrum-based approaches
- speaker
- Speaker Diarization
- Speaker identification
- speaker recognition
- speaker verification
- spectral shot clustering
- Speech
- surveillance
- topic models
- tracking
- training
- transfer learning
- triplet loss
- unsupervised
- Unsupervised · Latent sequential patterns · Topic models · PLSA · Video surveillance · Activity analysis
- unsupervised activity analysis
- unsupervised calibration
- unsupervised learning
- usability
- user study
- variational inference
- ve- hicle detection.
- VFOA
- video
- Video foundation models
- video processing
- video structuring
- vineyard
- virtual agents
- visual focus of attention
- Visual similarity
- weakly-supervised learning.
Publications of Jean-Marc Odobez sorted by title
S
| Shape Representations for Maya Codical Glyphs: Knowledge-driven or Deep?, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
| Sharingan: A Transformer Architecture for Multi-Person Gaze Following, , and , in: Int. Conference Computer Vision and Pattern Recognition (CVPR), Seatle, 2024 |
|
| Short-Term Spatio-Temporal Clustering Applied to Multiple Moving Speakers, and , in: IEEE Transactions on Audio, Speech and Language Processing, 15(5):15, 2007 |
|
| Short-Term Spatio-Temporal Clustering of Sporadic and Concurrent Events, , and , Idiap-RR-14-2004 |
|
| Sparsity in Topic Models, , and , in: Practical Applications of Sparse Modeling: Biology, Signal Processing and Beyond, MIT Press, 2012 |
|
| Spectral Structuring of Home Videos, , and , in: International Conference on Image and Video Retrieval (CIVR'03), Springer Verlag, 2003 |
|
| Sports Event Recognition using Layered HMMs, and , Idiap-RR-07-2005 |
|
| Statistical Shape Descriptors for Ancient Maya Hieroglyphs Analysis, , École Polytechnique Fédérale de Lausanne, 2012 |
|
| Structure and appearance features for robust 3D facial actions tracking, and , in: IEEE Proc. Int. Conf. on Multimedia and Expo, IEEE, 2009 |
|
| Supervised Gaze Bias Correction for Gaze Coding in Interactions, and , Idiap-RR-23-2017 |
|
| Supervised Gaze Bias Correction for Gaze Coding in Interactions, and , in: ECEM COGAIN Symposium, pages 3, 2017 |
|
T
| Temporal Analysis of Motif Mixtures using Dirichlet Processes, , and , in: IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), 36(1), 2014 |
|
| Temporally Subsampled Detection for Accurate and Efficient Face Tracking and Diarization, , , and , in: International Conference on Pattern Recognition, Cancun, Mexico, IEEE, 2016 |
|
| Text Detection and Recognition in Images and Videos, , and , in: Pattern Recognition, 37(3), 2004 |
| Text Detection and Recognition in Images and Videos, , and , Idiap-RR-61-2002 |
|
| Text Segmentation and Recognition in Complex Background Based on Markov Random Field, , and , in: Int. Conf. Pattern Recognition 2002, 2002 |
|
| Text Segmentation and Recognition in Complex Background Based on Markov Random Field, , and , Idiap-RR-17-2002 |
|
| The AI4Autism Project: A Multimodal and Interdisciplinary Approach to Autism Diagnosis and Stratification, , , , , , , and , in: Companion Publication of the 25th International Conference on Multimodal Interaction, Paris, France, pages 414–425, Association for Computing Machinery, 2023 |
[DOI] [URL] |
| The MAAYA Project: Multimedia Analysis and Access for Documentation and Decipherment of Maya Epigraphy, , , , , , and , in: Proc. Digital Humanities Conference, Lausanne, 2014 |
|
| The MuMMER data set for Robot Perception in multi-party HRI Scenarios, , , and , in: Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, 2020 |
|
| The vernissage corpus: a conversational human-robot-interaction dataset, , , , , , , , , and , in: Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction, 2013 |
|
| The Vernissage Corpus: A Multimodal Human-Robot-Interaction Dataset, , , , , , , , , and , Idiap-RR-33-2012 |
|
| Theories and Models of Teams and Group, , , , and , in: Small Group Research, 48(5):544--567, 2017 |
[DOI] |
| Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, , , and , Idiap-RR-52-2005 |
|
| Time-Sensitive Topic Models for Action Recognition in Videos, , and , in: IEEE International Conference on Image Processing, 2013 |
|
| Topic Models for Scene Analysis and Abnormality Detection, and , in: 9th International Workshop in Visual Surveillance, IEEE, Kyoto, Japan, IEEE, 2009 |
|
| Toward Semantic Gaze Target Detection, , , and , in: 38th Conf. on Neural Information Processing System, 2024 |
|
| Towards an Engagement-Aware Attentive Artificial Listener for Multi-Party Interactions, , , , , and , in: Frontiers in Robotics and AI, 8:189, 2021 |
[DOI] [URL] |
| Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens, , , , , , and , in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, Tokyo, Japan, pages 21-28, ACM, 2016 |
[DOI] |
| Towards large scale multimedia indexing: A case study on person discovery in broadcast news, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
| Towards Smart Pruning: ViNet, a Deep-Learning Approach for Grapevine Structure Estimation, , , and , in: Computers and Electronics in Agriculture, 207:107736, 2023 |
[DOI] [URL] |
| Towards the Use of Social Interaction Conventions as Prior for Gaze Model Adaptation, , and , in: Proceedings of 19th ACM International Conference on Multimodal Interaction, pages 9, ACM, 2017 |
[DOI] |
| Tracking Attention for Multiple People: Wandering Visual Focus of Attention Estimation, , , and , Idiap-RR-40-2006 |
|
| Tracking People in Meetings with Particles, , , , and , in: Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS,',','), invited paper, 2005 |
|
| Tracking People in Meetings with Particles, , , , and , Idiap-RR-71-2004 |
|
| Tracking the Multi Person Wandering Visual Focus of Attention, , , and , in: International Conference on Multimodal Interfaces (ICMI06), 2006 |
|
| Tracking the Multi Person Wandering Visual Focus of Attention, , , and , Idiap-RR-80-2005 |
|
| Tracking the visual focus of attention for a varying number of wandering people, , , and , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 30(7), 2008 |
|
| Training on the Job: Behavioral Analysis of Job Interviews in Hospitality, , , , , and , in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 84-91, 2016 |
|
| Transferring Neural Representations for Low-dimensional Indexing of Maya Hieroglyphic Art, , , , , , , , and , in: Proc. ECCV Workshop on Computer Vision for Art Analysis, Amsterdam, pages 842-855, Springer, 2016 |
[DOI] [URL] |