logo Idiap Research Institute        
 [BibTeX] [Marc21]
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition
Type of publication: Idiap-RR
Citation: VILLATORO-TELLO_Idiap-RR-09-2021
Number: Idiap-RR-09-2021
Year: 2021
Month: 7
Institution: Idiap
Note: Paper accepted for Publication in Interspeech 2021
Abstract: Mental disorders, e.g. depression and dementia, are categorized as priority conditions according to the World Health Organization (WHO). When diagnosing, psychologists employ structured questionnaires/interviews, and different cognitive tests. Although accurate, there is an increasing necessity of developing digital mental health support technologies to alleviate the burden faced by professionals. In this paper, we propose a multi-modal approach for modeling the communication process employed by patients being part of a clinical interview or a cognitive test. The language-based modality, inspired by the Lexical Availability (LA) theory from psycho-linguistics, identifies the most accessible vocabulary of the interviewed subject and use it as features in a classification process. The acoustic-based modality is processed by a Convolutional Neural Network (CNN) trained on signals of speech that predominantly contained voice source characteristics. At the end, a late fusion technique, based on majority voting, assigns the final classification. Results show the complementarity of both modalities, reaching an overall Macro-F1 of 84% and 90% for Depression and Alzheimer's dementia respectively.
Keywords: Alzheimer's disease, depression detection, Mental Lexicon, Multi-modal Approach, Raw Speech
Projects Idiap
TAPAS
Authors Villatoro-Tello, Esaú
Dubagunta, S. Pavankumar
Fritsch, Julian
Ramírez-de-la-Rosa, Gabriela
Motlicek, Petr
Magimai.-Doss, Mathew
Crossref by VILLATORO-TELLO_INTERSPEECH2021_2021
Added by: [ADM]
Total mark: 0
Attachments
  • VILLATORO-TELLO_Idiap-RR-09-2021.pdf (MD5: 79823b2447d95b7446b85656e6c78374)
Notes