logo Idiap Research Institute        
 [BibTeX] [Marc21]
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition
Type of publication: Conference paper
Citation: VILLATORO-TELLO_INTERSPEECH2021_2021
Publication status: Accepted
Booktitle: Proceedings of Interspeech 2021
Year: 2021
Month: August
Publisher: ISCA-International Speech Communication Association 2021
Crossref: VILLATORO-TELLO_Idiap-RR-09-2021:
Abstract: Mental disorders, e.g. depression and dementia, are categorized as priority conditions according to the World Health Organization (WHO). When diagnosing, psychologists employ structured questionnaires/interviews, and different cognitive tests. Although accurate, there is an increasing necessity of developing digital mental health support technologies to alleviate the burden faced by professionals. In this paper, we propose a multi-modal approach for modeling the communication process employed by patients being part of a clinical interview or a cognitive test. The language-based modality, inspired by the Lexical Availability (LA) theory from psycho-linguistics, identifies the most accessible vocabulary of the interviewed subject and use it as features in a classification process. The acoustic-based modality is processed by a Convolutional Neural Network (CNN) trained on signals of speech that predominantly contained voice source characteristics. In the end, a late fusion technique, based on majority voting, assigns the final classification. Results show the complementarity of both modalities, reaching an overall Macro-F1 of 84% and 90% for Depression and Alzheimer's dementia respectively.
Keywords:
Projects Idiap
TAPAS
Authors Villatoro-Tello, Esaú
Dubagunta, S. Pavankumar
Fritsch, Julian
Ramírez-de-la-Rosa, Gabriela
Motlicek, Petr
Magimai.-Doss, Mathew
Added by: [UNK]
Total mark: 0
Attachments
  • VILLATORO-TELLO_INTERSPEECH2021_2021.pdf
Notes