Keywords:
- 3D face model
- abnormality detection
- acoustic generators
- Acoustic signal processing
- activity
- appearance based methods
- Appearance based model
- appearance model
- Archaeology
- Artificial Neural Networks
- attention
- audio-visual speaker recognition
- Autism
- backchannels
- Bayesian modeling
- behavior analysis
- bias correction
- blink
- bobbing estimation
- camera network
- Children
- clustering
- cognition
- computer vision
- Content-based multimedia indexing
- conversation
- convolutional network
- Convolutional neural network
- Convolutional Neural Networks
- corpus
- covariance matrices
- Crowdsourcing
- Cultural heritage
- dataset
- deep learning
- Deep Metric Learning
- deep neural networks
- Delays
- diarization
- Dimensionality reduction
- direction-of-arrival estimation
- DOA estimation
- domain adaptation
- embedding
- embedding learning
- Encoding
- entrainment to music
- Epigraphy
- Estimation
- eye movements
- eye tracking
- eye-gaze
- Face
- face clustering
- Face diarization
- Face Recognition
- Face tracking
- Facial animations
- Feature extraction
- Feature-based tracking
- first impressions
- focus of attention
- gait
- Gaze
- Gaze Coding
- gaze detection
- Gaze estimation
- generative models
- geometric method
- grapevine pruning
- group dynamics
- HCI
- head nods
- Head pose
- Head pose tracking
- head-pose invariance
- HHI
- hieroglyph
- Histogram of orientation
- HOOSC
- HRI
- human activity recognition
- human behaviour analysis
- human detection
- Human pose estimation
- human-robot interaction
- image rectification
- image retrieval
- image segmentation
- indexing
- information fusion
- information visualization
- internet of things
- involvement
- keyframe extraction
- language
- learning
- likelihood-based encoding
- listener categories
- machine learning
- manipulation
- Maya civilization
- Maya culture
- Maya glyph
- maya glyphs
- Metric learning
- microphone arrays
- Microphones
- mixed activity
- Monte Carlo methods
- motif mining
- multi-camera
- multi-object tracking
- Multimodal
- multimodal identification
- Multimodal interaction
- Multimodal person diarization
- multiple face tracking
- multiple sound sources
- multiple speaker detection
- multivariate time series
- network output
- neural nets
- neural network-based sound source localization methods
- neural networks
- non parametric models
- non-verbal cues
- Nonverbal behavior
- OCR
- online calibration
- particle filter
- person diarization
- person discovery
- person identification
- person invariance
- Person Tracking
- plant skeleton
- pLSA
- Position measurement
- precision viticulture
- real-time
- remote
- remote recording
- remote sensing
- remote sensor
- representation learning
- RGB-D
- RGB-D camera
- RGB-D cameras
- road vehicles
- Robots
- saccade
- Sampling
- scene analysis
- segmentation
- shape classification
- Shape descriptor
- shape recognition
- Shape retrieval
- shot boundary detection
- simultaneous detection
- single sound source
- sketch
- skin colour
- social computing
- sound mixtures
- sound source localization
- sparse autoencoder
- sparse coding
- spatial spectrum-based approaches
- speaker
- Speaker Diarization
- Speaker identification
- speaker recognition
- speaker verification
- spectral shot clustering
- Speech
- surveillance
- topic models
- tracking
- training
- transfer learning
- triplet loss
- unsupervised
- Unsupervised
- Latent sequential patterns
- Topic models
- PLSA
- Video surveillance
- Activity analysis
- unsupervised activity analysis
- unsupervised calibration
- unsupervised learning
- usability
- user study
- variational inference
- vehicle detection
- VFOA
- video
- video processing
- video structuring
- vineyard
- virtual agents
- visual focus of attention
- Visual similarity
- weakly-supervised learning
Publications of Jean-Marc Odobez
2010
View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010
Visual Attention, Speaking Activity, and Group Conversational Analysis in Multi-Sensor Environments, in: H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, 2010
2009
Contextual classification of image patches with latent aspect models, in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009
Dynamic Partitioned Sampling For Tracking With Discriminative Features, in: Proceedings of the British Machine Vision Conference, London, 2009
Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets, Idiap-RR-19-2009
Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009
Learning Large Margin Likelihood for Realtime Head Pose Tracking, in: IEEE Int. Conference on Image Processing, Cairo, Egypt, IEEE, 2009
Multi-Person Bayesian Tracking with Multiple Cameras, in: Multi-camera networks: principles and applications, pages 363-388, Academic Press, 2009
Recognizing Human Visual Focus of Attention from Head Pose in Meetings, in: IEEE Transactions on Systems, Man, and Cybernetics, Part B, Vol. 39, No. 1, 2009
Retrieving Ancient Maya Glyphs with Shape Context, in: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, Kyoto, Japan, IEEE, 2009
Structure and appearance features for robust 3D facial actions tracking, in: IEEE Proc. Int. Conf. on Multimedia and Expo, IEEE, 2009
Topic Models for Scene Analysis and Abnormality Detection, in: 9th International Workshop on Visual Surveillance, Kyoto, Japan, IEEE, 2009
Visual activity context for focus of attention estimation in dynamic meetings, Idiap-RR-02-2009
Visual Activity Context For Focus of Attention Estimation in Dynamic Meetings, in: International Conference on Multimedia & Expo, 2009
2008
Detecting queues at vending machines: a statistical layered approach, Idiap-RR-04-2008
Detecting queues at vending machines: a statistical layered approach, in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008
Fast human detection from videos using covariance features, in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008
Investigating Automatic Dominance Estimation in Groups From Visual Attention and Speaking Activity, in: International Conference on Multimodal Interfaces, 2008
Multi-camera 3D person tracking with particle filter in a surveillance environment, in: 16th European Signal Processing Conference (EUSIPCO), 2008
Multi-camera multi-person 3D space tracking with MCMC in surveillance scenarios, in: European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2), Marseille, 2008
Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008
Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, Idiap-RR-47-2008
Predicting Two Facets of Social Verticality in Meetings from Five-Minute Time Slices and Nonverbal Cues, in: Proceedings of ICMI 2008, 2008
Tracking the visual focus of attention for a varying number of wandering people, in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 30(7), 2008
Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis, Idiap-RR-38-2008
Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis, in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Beijing, 2008
Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, in: International Conference on Multimedia & Expo, 2008
2007
A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, in: International Conference on Multimedia & Expo (ICME07), 2007
A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, Idiap-RR-20-2007
A Thousand Words in a Scene, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007
Dynamical Dirichlet Mixture Model, Idiap-RR-02-2007
Fast Human Detection from Videos Using Covariance Features, Idiap-RR-68-2007
Multi-Layer Background Subtraction Based on Color and Texture, in: CVPR 2007 Workshop on Visual Surveillance (VS2007), 2007
Multi-Layer Background Subtraction Based on Color and Texture, Idiap-RR-67-2007
Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, Idiap-RR-50-2007
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, in: Classification of Events, Activities and Relationships Evaluation and Workshop, 2007
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, Idiap-RR-21-2007
Short-Term Spatio-Temporal Clustering Applied to Multiple Moving Speakers, in: IEEE Transactions on Audio, Speech and Language Processing, 15(5):15, 2007
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, 2007
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, Idiap-RR-29-2007
Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, Idiap-RR-75-2007
2006
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, Idiap-RR-10-2006