Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Idiap-RR
Citation:	Parthasarathi_Idiap-RR-28-2012
Number:	Idiap-RR-28-2012
Year:	2012
Month:	9
Institution:	Idiap
Abstract:	This paper investigates robust privacy-sensitive audio features for speaker diarization in multiparty conversations: ie., a set of audio features having low linguistic information for speaker diarization in a single and multiple distant microphone scenarios. We systematically investigate Linear Prediction (LP) residual. Issues such as prediction order and choice of representation of LP residual are studied. Additionally, we explore the combination of LP residual with subband information from 2.5 kHz to 3.5 kHz and spectral slope. Next, we propose a supervised framework using deep neural architecture for deriving privacy-sensitive audio features. We benchmark these approaches against the traditional Mel Frequency Cepstral Coefficients (MFCC) features for speaker diarization in both the microphone scenarios. Experiments on the RT07 evaluation dataset show that the proposed approaches yield diarization performance close to the MFCC features on the single distant microphone dataset. To objectively evaluate the notion of privacy in terms of linguistic information, we perform human and automatic speech recognition tests, showing that the proposed approaches to privacy-sensitive audio features yield much lower recognition accuracies compared to MFCC features.
Keywords:
Projects	Idiap FP 7 SNSF-MULTI
Authors	Parthasarathi, Sree Hari Krishnan Bourlard, Hervé Gatica-Perez, Daniel
Crossref by	Parthasarathi_TASLP_2012
Added by:	[ADM]
Total mark:	0
Attachments
Parthasarathi_Idiap-RR-28-2012.pdf (MD5: 4852204bf6218e130533ebae4a3f8e6a)
Notes

processing time: 0.0003 seconds.