Idiap Research Institute
Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios
Type of publication: Idiap-RR
Citation: Parthasarathi_Idiap-RR-01-2010
Number: Idiap-RR-01-2010
Year: 2010
Month: 1
Institution: Idiap
Abstract: Personal audio logs are often recorded in multiple environments. This poses challenges for robust front-end processing, including speech/nonspeech detection (SND). Motivated by this, we investigate the robustness of four privacy-sensitive features for SND: energy, zero-crossing rate, spectral flatness, and kurtosis. We study early and late fusion of these features in conjunction with modeling temporal context. These combinations are evaluated in mismatched conditions on a dataset of nearly 450 hours. While both combinations yielded improvements over individual features, generally feature combinations performed better. Comparisons with a state-of-the-art spectral-based feature set and a privacy-sensitive feature set are also provided.
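Illustration (not part of the report): the four features named in the abstract can be sketched per frame as below. This is a minimal sketch using textbook definitions, and it assumes time-domain sample kurtosis, a naive DFT for the power spectrum, and simple averaging as the late-fusion rule. The report's actual feature extraction, temporal-context modeling, and fusion schemes may differ. The function names frame_features, early_fusion, and late_fusion are hypothetical.

```python
import cmath
import math


def frame_features(frame):
    """Return (energy, zcr, spectral_flatness, kurtosis) for one frame of samples."""
    n = len(frame)

    # Short-time energy: mean of the squared samples.
    energy = sum(x * x for x in frame) / n

    # Zero-crossing rate: fraction of adjacent sample pairs whose signs differ.
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    zcr = crossings / (n - 1)

    # Power spectrum via a naive O(n^2) DFT, skipping the DC bin.
    power = []
    for k in range(1, n // 2):
        s = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        power.append(abs(s) ** 2 + 1e-12)  # small floor avoids log(0)

    # Spectral flatness: geometric mean over arithmetic mean of the power spectrum.
    geo_mean = math.exp(sum(math.log(p) for p in power) / len(power))
    flatness = geo_mean / (sum(power) / len(power))

    # Kurtosis: fourth central moment divided by the squared variance.
    mean = sum(frame) / n
    var = sum((x - mean) ** 2 for x in frame) / n
    kurt = (sum((x - mean) ** 4 for x in frame) / n) / (var * var + 1e-12)

    return energy, zcr, flatness, kurt


def early_fusion(features):
    """Early (feature-level) fusion: one joint vector for a single classifier."""
    return list(features)


def late_fusion(scores):
    """Late (score-level) fusion: average per-feature classifier scores (assumed rule)."""
    return sum(scores) / len(scores)
```

None of these features retains enough spectral detail to reconstruct the spoken words, which is the sense in which such features are called privacy-sensitive.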
Keywords:
Projects: SNSF-MULTI, IM2
Authors: Parthasarathi, Sree Hari Krishnan; Magimai.-Doss, Mathew; Bourlard, Hervé; Gatica-Perez, Daniel
Attachments: Parthasarathi_Idiap-RR-01-2010.pdf (MD5: 87d0f108f2b53410a294c0d1ab94aca2)