Idiap Research Institute
Implementing Fusion Techniques for the Classification of Paralinguistic Information
Type of publication: Conference paper
Citation: Vlasenko_INTERSPEECH2018_2018
Booktitle: Proceedings of Interspeech 2018
Volume: 1-6
Year: 2018
Pages: 526-530
ISSN: 2308-457X
ISBN: 978-1-5108-7221-9
Abstract: This work tests several classification techniques and acoustic features and combines them using late fusion to classify paralinguistic information for the ComParE 2018 challenge. We use Multiple Linear Regression (MLR) with Ordinary Least Squares (OLS) analysis to select the most informative features for the Self-Assessed Affect (SSA) sub-challenge. We also propose using raw-waveform convolutional neural networks (CNNs) in the context of three paralinguistic sub-challenges. By using the combined evaluation split to estimate the codebook, we obtain a better representation for the Bag-of-Audio-Words approach. We preprocess the speech into vocalized segments to improve classification performance. To fuse our leading classification techniques, we apply weighted late fusion to their confidence scores. We estimate the optimal fusion weight using two mismatched evaluation phases, obtained by exchanging the training and development sets. Weighted late fusion provides better performance on the development sets than the baseline techniques. Raw-waveform techniques perform comparably to the baseline.
Keywords: feature selection, fusion, multiple linear regression, raw-waveform CNN
Projects: Idiap
Authors: Vlasenko, Bogdan; Sebastian, Jilt; Kumar, D S Pavan; Magimai.-Doss, Mathew
Attachments
  • Vlasenko_INTERSPEECH2018_2018.pdf