Keywords:
- AI competencies
- AI Literacy
- AI skills
- Alcohol
- Alcohol Consumption
- alcohol use
- Ambiance
- App-based survey
- Archaeology
- Assessment reactivity
- attractiveness
- attrition
- audio-visual feature extraction
- behavior
- BERT
- Big-Five
- booktube
- Casual Drinking
- Characterizing small groups
- Citizen Sourcing
- classifiers
- clustering
- cohesion
- collective classification
- Computational linguistics
- computer algorithm
- computer vision
- Convolutional Neural Networks
- corpus
- Crowdsourcing
- Cultural heritage
- data collection
- Data Literacy
- deep learning
- depression detection
- developing country
- Differential Privacy
- digital biomarker
- Dimensionality reduction
- Distribution Shift
- diversity
- domain adaptation
- Drink size
- Drinking behaviours
- drinking context
- Eating Behavior
- Ecological momentary assessment
- Ecological momentary assessment (EMA)
- Emergent leadership
- Emerging power hierarchies
- Energy Expenditure Estimation
- Epigraphy
- explainability
- Facial & Body landmarks
- first impressions
- Flickr groups LDA
- Flickr PLSA topic-model communities
- Food Consumption
- food diaries
- foursquare
- generalization
- Generative Ai
- Generative AI Literacy
- Gestures
- Graph Convolutional Networks
- Graph Neural Networks
- Graph Representation Learning
- group interaction
- Group performance.
- health
- Heavy Drinking
- hieroglyph
- hirability
- Histogram of orientation
- Home Spaces
- HOOSC
- human mobility
- image retrieval
- impressions
- indexing
- information visualization
- Inter-pretable Models
- interaction
- Interview
- job interviews
- keyframe extraction
- language
- Language Production
- language style
- late-life depression
- Lausanne Data Collection Campaign
- LIWC
- location-based social networks
- loneliness
- Long-tail data distribution
- machine learning
- Maya civilization
- Maya culture
- Maya glyph
- maya glyphs
- meal
- meetings
- Mental Lexicon
- mHealth
- Mobile And Wearable Sensing
- Mobile Crowdsensing
- mobile crowdsourcing
- mobile mining
- mobile phone data
- mobile sensing
- mobile survey
- mobility models
- model adaptation
- mood
- motion capture
- multimodal cues
- Multimodal interaction
- Multimodal Sensing
- narcissism
- Natural conversation
- natural language porcessing
- neural networks
- Nightlife
- nonverbal be- havior
- Nonverbal behavior
- online video
- passive sensing
- perceived dominance
- performance
- Personality
- personality impressions
- Personality traits inference
- personalized service
- pervasive computing
- Physical Activity
- Picture annotation
- place extraction
- place labeling
- place visit
- prediction
- Prompt engineering
- psychological correlates
- Recruitment method
- reliability analysis
- Remote meetings
- Response burden
- review
- routines
- self-training
- sensing
- Sentiment Analysis
- shape classification
- Shape descriptor
- shape recognition
- Shape retrieval
- shot boundary detection
- sketch
- small group interactions
- Smartphone application
- smartphone data
- smartphone sensing
- Smartphones
- Snack and Meal
- social computing
- social context
- social media
- social networks
- social psychology
- Social video
- sparse autoencoder
- sparse coding
- spatial data
- spectral shot clustering
- Speech activity
- Standard drink
- stress
- support
- survey
- Task-competence
- telemonitoring
- Text classification
- thin slices
- transfer learning
- Ubiquitous Computing
- unsupervised methods
- urban awareness
- Urban Computing
- Urban Crowdsourcing
- usability
- user aggregated behavior
- user behavior
- user modeling
- user satisfaction
- user study
- verbal analysis
- Verbal content
- video
- Video annotation
- video resumes
- video structuring
- video-sharing
- Visual similarity
- vlog
- vlogging
- vlogs
- wearable sensing
- well being
- well-being
- young adults
- Youth
- youtube
- YouTubeYouTube
Publications of Daniel Gatica-Perez
2006
Modeling Interactions from Email Communication, , and , in: Proc. IEEE International Conference on Multimedia & Expo (ICME,',','), 2006, 2006 |
|
Multi-Person Tracking in Meetings: A Comparative Study, , , , , and , in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006 |
|
Multi-Person Tracking in Meetings: A Comparative Study, , , , , and , Idiap-RR-38-2006 |
|
Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2006 |
|
Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, and , Idiap-RR-29-2006 |
|
Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array, , and , Idiap-RR-24-2006 |
|
Tracking Attention for Multiple People: Wandering Visual Focus of Attention Estimation, , , and , Idiap-RR-40-2006 |
|
Tracking the Multi Person Wandering Visual Focus of Attention, , , and , in: International Conference on Multimodal Interfaces (ICMI06), 2006 |
|
Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, , and , Idiap-RR-57-2006 |
|
2005
A Thousand Words in a Scene, , , and , Idiap-RR-40-2005 |
|
Audio-visual probabilistic tracking of multiple speakers in meetings, , , and , Idiap-RR-27-2005 |
|
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , in: Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag, 2005 |
|
Constructing visual models with a latent space approach, , , and , Idiap-RR-14-2005 |
|
Detecting Group Interest-level in Meetings, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005 |
|
Extracting Information from Multimedia Meeting Collections, , and , in: 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005 |
|
Extracting Information from Multimedia Meeting Collections, , and , Idiap-RR-50-2005 |
|
Finding groups of people in Google news, and , Idiap-RR-68-2005 |
|
Integrating co-occurrence and spatial contexts on patch-based scene segmentation, , , and , Idiap-RR-30-2005 |
|
Learning influence among interacting Markov chains, , , and , in: NIPS, 2005 |
|
Learning influence among interacting Markov chains, , , and , Idiap-RR-48-2005 |
|
Modeling Interactions from Email Communication, , , and , Idiap-RR-51-2005 |
|
Modeling Scenes with Local Descriptors and Latent Aspects, , , , , and , in: IEEE Int. Conf. on Computer Vision, 2005 |
|
Modeling semantic aspects for cross-media image indexing, and , Idiap-RR-56-2005 |
|
Multimodal Integration for Meeting Group Action Segmentation and Recognition, , , , , and , in: MLMI, 2005 |
|
Multimodal Integration for Meeting Group Action Segmentation and Recognition, , , , , and , Idiap-RR-31-2005 |
|
Multimodal Multispeaker Probabilistic Tracking in Meetings, , , and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2005 |
|
Semi-supervised Adapted HMMs for Unusual Event Detection, , , and , in: Pro. IEEE CVPR, 2005 |
|
Semi-supervised Meeting Event Recognition with Adapted HMMs, , and , in: Pro. IEEE ICME, 2005 |
|
Semi-supervised Meeting Event Recognition with Adapted HMMs, , and , Idiap-RR-15-2005 |
|
Speech Acquisition in Meetings with an Audio-Visual Sensor Array, , , , and , in: Pro. IEEE ICME, 2005 |
|
Speech Acquisition in Meetings with an Audio-Visual Sensor Array, , , , and , Idiap-RR-03-2005 |
|
Tracking People in Meetings with Particles, , , , and , in: Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS,',','), invited paper, 2005 |
|
Tracking the Multi Person Wandering Visual Focus of Attention, , , and , Idiap-RR-80-2005 |
|
2004
Assessing Scene Structuring in Consumer Videos, , , , and , in: Int. Conf. on Image and Video Retrieval (CIVR), 2004 |
|
Assessing Scene Structuring in Consumer Videos, , , , and , Idiap-RR-11-2004 |
|
Automatic Analysis of Multimodal Group Actions in Meetings, , , , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence (to appear), 2004 |
|
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , Idiap-RR-28-2004 |
|
Detecting Group Interest-level in Meetings, , , and , Idiap-RR-51-2004 |
|
Embedding motion in model-based stochastic tracking, and , in: 17th Int. Conf. Pattern Recognition (ICPR), 2004 |
|
Modeling Individual and Group Actions in Meetings With Layered HMMs, , , , and , in: IEEE Transaction on Multimedia, June, 2006, 2004 |
|
Modeling Individual and Group Actions in Meetings With Layered HMMs, , , , and , Idiap-RR-33-2004 |
|
Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, , , , and , in: the Second IEEE Workshop on Event Mining: Detection and Recognition of Events in Video, In Association with CVPR, 2004 |
|
Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, , , , and , Idiap-RR-09-2004 |
|
Modeling Scenes with Local Descriptors and Latent Aspects, , , , , and , Idiap-RR-79-2004 |
|
Motion likelihood and proposal modeling in Model-Based Stochastic Tracking, and , Idiap-RR-61-2004 |
|
Multimodal Group Action Clustering in Meetings, , , , and , in: ACM 2nd International Workshop on Video Surveillance & Sensor Networks in conjunction with 12th ACM International Conference on Multimedia, 2004 |
|
Multimodal Group Action Clustering in Meetings, , , , and , Idiap-RR-24-2004 |
|
Multimodal Multispeaker Probabilistic Tracking in Meetings, , , and , Idiap-RR-66-2004 |
|
On the Use of Information Retrieval Measures for Speech Recognition Evaluation, , , , , , and , Idiap-RR-73-2004 |
|
PLSA-based Image Auto-Annotation: Constraining the Latent Space, and , in: Proc. ACM Int. Conf. on Multimedia (ACM MM), 2004 |
|