Keywords:
- AI competencies
- AI Literacy
- AI skills
- Alcohol
- Alcohol Consumption
- alcohol use
- Ambiance
- App-based survey
- Archaeology
- Assessment reactivity
- attractiveness
- attrition
- audio-visual feature extraction
- behavior
- BERT
- Big-Five
- booktube
- Casual Drinking
- Characterizing small groups
- Citizen Sourcing
- classifiers
- clustering
- cohesion
- collective classification
- Computational linguistics
- computer algorithm
- computer vision
- Convolutional Neural Networks
- corpus
- Crowdsourcing
- Cultural heritage
- data collection
- Data Literacy
- deep learning
- depression detection
- developing country
- Differential Privacy
- digital biomarker
- Dimensionality reduction
- Distribution Shift
- diversity
- domain adaptation
- Drink size
- Drinking behaviours
- drinking context
- Eating Behavior
- Ecological momentary assessment
- Ecological momentary assessment (EMA)
- Emergent leadership
- Emerging power hierarchies
- Energy Expenditure Estimation
- Epigraphy
- explainability
- Facial & Body landmarks
- first impressions
- Flickr groups LDA
- Flickr PLSA topic-model communities
- Food Consumption
- food diaries
- foursquare
- generalization
- Generative AI
- Generative AI Literacy
- Gestures
- Graph Convolutional Networks
- Graph Neural Networks
- Graph Representation Learning
- group interaction
- Group performance
- health
- Heavy Drinking
- hieroglyph
- hirability
- Histogram of orientation
- Home Spaces
- HOOSC
- human mobility
- image retrieval
- impressions
- indexing
- information visualization
- Interpretable Models
- interaction
- Interview
- job interviews
- keyframe extraction
- language
- Language Production
- language style
- late-life depression
- Lausanne Data Collection Campaign
- LIWC
- location-based social networks
- loneliness
- Long-tail data distribution
- machine learning
- Maya civilization
- Maya culture
- Maya glyph
- maya glyphs
- meal
- meetings
- Mental Lexicon
- mHealth
- Mobile And Wearable Sensing
- Mobile Crowdsensing
- mobile crowdsourcing
- mobile mining
- mobile phone data
- mobile sensing
- mobile survey
- mobility models
- model adaptation
- mood
- motion capture
- multimodal cues
- Multimodal interaction
- Multimodal Sensing
- narcissism
- Natural conversation
- natural language processing
- neural networks
- Nightlife
- nonverbal behavior
- Nonverbal behavior
- online video
- passive sensing
- perceived dominance
- performance
- Personality
- personality impressions
- Personality traits inference
- personalized service
- pervasive computing
- Physical Activity
- Picture annotation
- place extraction
- place labeling
- place visit
- prediction
- Prompt engineering
- psychological correlates
- Recruitment method
- reliability analysis
- Remote meetings
- Response burden
- review
- routines
- self-training
- sensing
- Sentiment Analysis
- shape classification
- Shape descriptor
- shape recognition
- Shape retrieval
- shot boundary detection
- sketch
- small group interactions
- Smartphone application
- smartphone data
- smartphone sensing
- Smartphones
- Snack and Meal
- social computing
- social context
- social media
- social networks
- social psychology
- Social video
- sparse autoencoder
- sparse coding
- spatial data
- spectral shot clustering
- Speech activity
- Standard drink
- stress
- support
- survey
- Task-competence
- telemonitoring
- Text classification
- thin slices
- transfer learning
- Ubiquitous Computing
- unsupervised methods
- urban awareness
- Urban Computing
- Urban Crowdsourcing
- usability
- user aggregated behavior
- user behavior
- user modeling
- user satisfaction
- user study
- verbal analysis
- Verbal content
- video
- Video annotation
- video resumes
- video structuring
- video-sharing
- Visual similarity
- vlog
- vlogging
- vlogs
- wearable sensing
- well being
- well-being
- young adults
- Youth
- youtube
- YouTube
Publications of Daniel Gatica-Perez sorted by recency
Semi-supervised Adapted HMMs for Unusual Event Detection, , , and , in: Proc. IEEE CVPR, 2005 |
|
Multimodal Multispeaker Probabilistic Tracking in Meetings, , , and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2005 |
|
Multimodal Integration for Meeting Group Action Segmentation and Recognition, , , , , and , in: MLMI, 2005 |
|
Modeling Scenes with Local Descriptors and Latent Aspects, , , , , and , in: IEEE Int. Conf. on Computer Vision, 2005 |
|
Modeling Interactions from Email Communication, , and , in: Proc. IEEE International Conference on Multimedia & Expo (ICME), 2006 |
|
Learning influence among interacting Markov chains, , , and , in: NIPS, 2005 |
|
Finding groups of people in Google news, and , in: ACM Int. Conf. on Human-Centered Multimedia (HCM), 2006 |
|
Extracting Information from Multimedia Meeting Collections, , and , in: 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005 |
|
Constructing visual models with a latent space approach, , , and , in: Springer Lecture Notes in Computer Science, 2006 |
|
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , in: Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard (Eds.), Springer Verlag, 2005 |
|
Audio-visual probabilistic tracking of multiple speakers in meetings, , , and , in: IEEE Trans. on Audio, Speech, and Language Processing, accepted for publication, 2006 |
|
Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, , and , Idiap-RR-57-2006 |
|
Tracking the Multi Person Wandering Visual Focus of Attention, , , and , in: International Conference on Multimodal Interfaces (ICMI06), 2006 |
|
Tracking Attention for Multiple People: Wandering Visual Focus of Attention Estimation, , , and , Idiap-RR-40-2006 |
|
Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array, , and , Idiap-RR-24-2006 |
|
Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, and , Idiap-RR-29-2006 |
|
Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, and , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2006 |
|
Multi-Person Tracking in Meetings: A Comparative Study, , , , , and , in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006 |
|
Integrating co-occurrence and spatial contexts on patch-based scene segmentation, , , and , in: Beyond Patches Workshop, in conjunction with CVPR, 2006 |
|
Exploring Contextual Information in a Layered Framework for Group Action Recognition, , and , in: Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006 |
|
Detection and Application of Influence Rankings in Small Group Meetings, , , and , in: Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006 |
|
Detecting Abandoned Luggage Items in a Public Space, , and , in: IEEE Performance Evaluation of Tracking and Surveillance Workshop (PETS), 2006 |
|
Analyzing Group Interactions in Conversations: a Review, , in: IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI), 2006 |
|
2D Multi-Person Tracking: A Comparative Study in AMI Meetings, , , , , and , in: Classification of Events, Activities, and Relationships (CLEAR), 2006 |
|
A Thousand Words in a Scene, , , and , Idiap-RR-40-2005 |
|
Modeling semantic aspects for cross-media image indexing, and , Idiap-RR-56-2005 |
|
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, , , , , , , , and , Idiap-RR-29-2007 |
|
A Thousand Words in a Scene, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007 |
|
Analyzing Flickr Groups, and , Idiap-RR-03-2008 |
|
Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Modeling semantic aspects for cross-media image indexing, and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007 |
|
Human-Centered Computing: Toward a Human Revolution, , , and , Idiap-RR-57-2007 |
|
Human-centered Computing: Toward a Human Revolution, , , and , in: IEEE Computer, 40(5), 2007 |
|
Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies, , , and , Idiap-RR-60-2007 |
|
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, , , , , , , , and , in: "", 2007 |
|