LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization
Type of publication: | Idiap-RR |
Citation: | Parthasarathi_Idiap-RR-14-2011 |
Number: | Idiap-RR-14-2011 |
Year: | 2011 |
Month: | 5 |
Institution: | Idiap |
Abstract: | We present a comprehensive study of linear prediction residual for speaker diarization on single and multiple distant microphone conditions in privacy-sensitive settings, a requirement to analyze a wide range of spontaneous conversations. Two representations of the residual are compared, namely real-cepstrum and MFCC, with the latter performing better. Experiments on RT06eval show that residual with subband information from 2.5 kHz to 3.5 kHz and spectral slope yields a performance close to traditional MFCC features. As a way to objectively evaluate privacy in terms of linguistic information, we perform phoneme recognition. Residual features yield low phoneme accuracies compared to traditional MFCC features. |
Keywords: | |
Projects |
Idiap SNSF-MULTI |
Authors | |
Added by: | [ADM] |
Total mark: | 0 |
Attachments
|
|
Notes
|
|
|