Keywords:
- Age prediction
- Alzheimer's disease
- Author profiling
- Bag of Words
- BERT
- bias aware
- Computational linguistics
- Contextual Adaptation
- Contextual Entrepreneurship
- Cross-modal Alignment
- Cross-modal Attention
- deep learning
- depression detection
- Entrepreneurial Challenges
- F1 score
- finite-state transducers
- Gender prediction
- GPU decoding
- Graph Convolutional Networks
- Graph Neural Networks
- health care
- Human-Computer Interaction
- Industry Context
- Intent Classification
- Interpretability
- Interpretable Models
- knowledge distillation
- Language Production
- Large Language Models
- limited training data
- Location prediction
- machine learning
- Medical Sector
- Mental Lexicon
- Mexican Tourist Text
- Multi-modal Approach
- multimodal analysis
- Multimodal classification
- multitask learning
- multitask training
- named entity recognition
- Natural language processing
- node weighted graphs
- Occupation prediction
- online speech recognition
- Operant Motive Test
- pseudo-labelling
- Psycholinguistics
- Raw Speech
- real-time speech recognition
- Recommendation System
- reinforcement learning
- reliability estimation
- resources and evaluation
- Sentiment Analysis
- Sexual predators identification
- shallow fusion
- social media analysis
- Speaker change detection
- speaker turn detection
- speech recognition
- Spoken Language Understanding
- streaming transducer
- Supervised Autoencoders
- Text classification
- Text Information Organization Schemes
- Text Mining
- Text Representation
- topic modeling
- Word Consensus Networks
- Word-Confusion-Networks
- XLSR-Transducer
Publications of Esaú Villatoro-Tello
2024
DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews, , , , , and , in: Proceedings of the 6th Clinical Natural Language Processing Workshop, Association for Computational Linguistics, 2024 |
|
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , Idiap-RR-10-2024 |
|
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024 |
|
Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions, , , and , in: Experimental IR Meets Multilinguality, Multimodality, and Interaction - 14th International Conference of the CLEF Association, CLEF, 2024, Grenoble, France, September 9-12, 2024, Proceedings, 2024 |
|
Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, 2024 |
Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, 2024 |
|
Reliability Estimation of News Media Sources: Birds of a Feather Flock Together, , , and , in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics, 2024 |
|
Sentiment Analysis using pretrained LLMs, , and , Idiap-RR-05-2024 |
|
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, , , , , , , , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024 |
[URL] |
TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR, , , , , , , , and , Idiap-RR-07-2024 |
[URL] |
XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, , , , , , , and , Idiap-RR-08-2024 |
[URL] |
2023
A lexical-availability-based framework from short communications for automatic personality identification, , , , and , in: Cognitive Systems Research, 79:126-137, 2023 |
[DOI] [URL] |
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, , , , , , , , and , in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023 |
|
Enhancing Multi-modal Classification of Violent Events using Image Captioning, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), CEUR Workshop Proceedings, Jaén, Spain, 2023 |
[URL] |
Implementing contextual biasing in GPU decoder for online ASR, , , , , , and , Idiap-RR-02-2023 |
|
Implementing contextual biasing in GPU decoder for online ASR, , , , , , and , in: Proc. Interspeech 2023, 2023 |
|
Intelligent Technologies: Concepts, Applications, and Future Directions, Volume 2, and , Springer, volume 1098, 2023 |
[DOI] |
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, , , and , Idiap-RR-03-2023 |
|
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, , , and , in: Proceedings of Interspeech, 2023 |
|
2022
Applying Attention Based Models for Detecting Cognitive Processes and Mental Health Conditions, , , and , Idiap-RR-01-2022 |
|
Classifying the Social Media Author Profile Through a Multimodal Representation, , , and , in: Intelligent Technologies: Concepts, Applications, and Future Directions. Studies in Computational Intelligence, Springer, 2022 |
[DOI] [URL] |
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, , , , and , Idiap-RR-06-2022 |
|
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, , , , and , in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022 |
[DOI] |
IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, , , , , , and , Idiap-RR-13-2022 |
|
IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, , , , , , and , Idiap-RR-12-2022 |
|
IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, , , , , , and , in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022 |
[URL] |
Leveraging Events Sub-Categories for Violent-Events Detection in Social Media, , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022 |
[URL] |
Natural Language Processing in Healthcare, , , , and , Taylor & Francis Group, 2022 |
[DOI] [URL] |
The Winning Approach for the Recommendation Systems Shared Task @REST_MEX 2022, , , , and , in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), 2022 |
[URL] |
2021
Analysis of Vector Representations in Maintenance Logs in the Industry: Towards an Information Retrieval System, , , and , in: Journal of Research in Computing Science, 2021 |
Applying Attention-Based Models for Detecting Cognitive Processes and Mental Health Conditions, , , and , in: Cognitive Computation:18, 2021 |
[DOI] [URL] |
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, , , , and , Idiap-RR-19-2021 |
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, , , , and , in: Proceedings of the 2021 International Conference on Multimodal Interaction, ACM, 2021 |
[DOI] |
Automatic Dialect Detection for Low Resource Santali Language, , , , , and , in: Proceeding of International Conference on Information Technology (OCIT), 2021 |
|