On Detection of Depression in Parkinson's Disease Patients' Speech: Handcrafted Features vs. Speech Foundation Models
| Type of publication: | Book chapter |
| Citation: | Purohit_SPRINGERNATURE_2025 |
| Booktitle: | Automatic Assessment of Parkinsonian Speech |
| Edition: | 1 |
| Series: | Communications in Computer and Information Science (CCIS) |
| Volume: | 2646 |
| Year: | 2025 |
| Publisher: | Springer Nature Switzerland AG |
| Address: | Cambridge, MA, USA |
| URL: | https://link.springer.com/book... |
| Abstract: | This study investigates speech-based depression detection in individuals with Parkinson’s disease (PD), comparing two feature representation approaches: interpretable handcrafted acoustic features and non-interpretable representations derived from Speech Foundation Models (SFMs). The study utilizes the DAIC-WOZ corpus for typical depression (non-PD speech) and the PD-Depression corpus for depression in atypical speech (PD speech). We first examine the viability of handcrafted features and then analyse how acoustic descriptors differ across these conditions. We then evaluate SFM-based representations on the PD-Depression dataset using a cross-validation-based layer selection methodology. Results suggest that handcrafted features yield better classification performance for depression in PD speech compared to SFM embeddings. Analysis shows that while typical depression is marked by pitch instability and reduced vocal quality, speech from PD patients with depression exhibits broader spectral variability, likely due to motor impairments associated with hypokinetic dysarthria. |
| Main Research Program: | AI for Life |
| Additional Research Programs: |
AI for Everyone |
| Keywords: | depression detection, Interpretable features, Parkinson’s disease, Speech Foundation Models |
| Projects: |
EMIL |
| Authors: | |
| Editors | |
| Added by: | [UNK] |
| Total mark: | 0 |
|
Attachments
|
|
|
Notes
|
|
|
|
|