Keywords:
- Age prediction
- Alzheimer's disease
- Author profiling
- automatic speech recognition (ASR)
- Bag of Words
- BERT
- bias
- bias aware
- Computational linguistics
- Contextual Adaptation
- Contextual Entrepreneurship
- Cross-modal Alignment
- Cross-modal Attention
- Data Selection
- deep learning
- deep learning models
- depression detection
- Domain Classification
- Early Risk Detection
- Entrepreneurial Challenges
- explainability
- F1 score
- fact checking
- factual reporting
- finite-state transducers
- Gender prediction
- GPU decoding
- Graph Convolutional Networks
- Graph Neural Networks
- health care
- Human-Computer Interaction
- Industry Context
- information verification
- Intelligent Systems
- Intent Classification
- Interpretability
- Interpretable Models
- knowledge distillation
- Language Production
- Large Language Models
- limited training data
- Location prediction
- low-resource domains
- machine learning
- media bias
- Medical Sector
- Mental Health
- Mental Lexicon
- Mexican Tourist Text
- Multi-modal Approach
- multimodal analysis
- Multimodal classification
- multitask learning
- multitask training
- Nahuatl and Spanish utterances
- named entity recognition
- Natural language processing
- Natural Language Understanding
- news media
- node weighted graphs
- Occupation prediction
- online speech recognition
- Operant Motive Test
- pseudo-labelling
- Psycholinguistics
- Raw Speech
- real-time speech recognition
- Recommendation System
- reinforcement learning
- reliability estimation
- resources and evaluation
- Sentiment Analysis
- Service Robots
- Sexual predators identification
- shallow fusion
- slot filling
- social media analysis
- Speaker change detection
- speaker turn detection
- speech recognition
- Spoken Language Understanding
- streaming transducer
- Supervised Autoencoders
- Text classification
- Text Information Organization Schemes
- Text Mining
- Text Representation
- topic modeling
- transformers
- whisper
- Word Consensus Networks
- Word-Confusion-Networks
- XLSR-Transducer
- Zipformer
Publications of Esaú Villatoro-Tello sorted by first author
T
Temporal fine-tuning for early risk detection, in: Memorias De Las JAIIO, Argentina, pages 137-149, 2024
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024
V
Enhancing Multi-modal Classification of Violent Events using Image Captioning, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), CEUR Workshop Proceedings, Jaén, Spain, 2023
Leveraging Events Sub-Categories for Violent-Events Detection in Social Media, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, Idiap-RR-09-2021
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, in: Proceedings of Interspeech 2021, ISCA-International Speech Communication Association 2021, 2021
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, Idiap-RR-06-2022
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022
Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12617-12621, IEEE, 2024
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023
Applying Attention Based Models for Detecting Cognitive Processes and Mental Health Conditions, Idiap-RR-01-2022
Applying Attention-Based Models for Detecting Cognitive Processes and Mental Health Conditions, in: Cognitive Computation:18, 2021
Idiap & UAM participation at GermEval 2020: Classification and Regression of Cognitive and Motivational Style from Text, in: Proceedings of the GermEval 2020 Shared Task on the Classification and Regression of Cognitive and Motivational style from Text, 2020
Inferring Highly-dense Representations for Clustering Broadcast Media Content, in: The Prague Bulletin of Mathematical Linguistics, 2020
Broadcast Media Content Categorization Using Low-Resolution Concepts, Idiap-RR-06-2021
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, in: Proceedings of the 2021 International Conference on Multimodal Interaction, ACM, 2021
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, Idiap-RR-19-2021
Idiap and UAM Participation at MEX-A3T Evaluation Campaign, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020) co-located with 36th Conference of the Spanish Society for Natural Language Processing (SEPLN 2020), pages 6, CEUR Workshop Proceedings, 2020