logo Idiap Research Institute        
 [BibTeX] [Marc21]
Subspace Regularized Dynamic Time Warping for Spoken Query Detection
Type of publication: Conference paper
Citation: Ram_SPARS_2017
Publication status: Accepted
Booktitle: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS)
Year: 2017
Note: Proceeding of Abstracts
Abstract: Deep neural network posterior probabilities are the best features for query detection in speech archives. Dynamic time warping (DTW) is the state-of-the-art solution for this task. Posterior features live in low-dimensional subspaces whereas, the current DTW methods do not incorporate this global structure of the data and rely on local feature distances. We exploit the query example as the dictionary for sparse recovery. Local DTW scores are integrated with the sparse reconstruction scores to obtain a subspace regularized distance matrix for DTW. The proposed method yields a substantial performance gain over the baseline system.
Keywords:
Projects PHASER-QUAD
Authors Ram, Dhananjay
Asaei, Afsaneh
Bourlard, Hervé
Added by: [UNK]
Total mark: 0
Attachments
  • Ram_SPARS_2017.pdf
Notes