logo Idiap Research Institute        
 [BibTeX] [Marc21]
Domain Adaptation and Investigation of Robustness of DNN-based Embeddings for Text-Independent Speaker Verification Using Dilated Residual Networks
Type of publication: Idiap-RR
Citation: Sarfjoo_Idiap-RR-10-2019
Number: Idiap-RR-10-2019
Year: 2019
Month: 9
Institution: Idiap
Address: Centre du Parc, Rue Marconi 19, P.O. Box 592, CH - 1920 Martigny
Abstract: Robustness of extracted embeddings in cross-database scenarios is one of the main challenges in text-independent speaker verification (SV) systems. In this paper, we investigate this robustness via performing structural cross-database experiments with or without additive noise. This noise can be added from the seen set, where the noise type is similar to the noise which is used in data augmentation for training the SV model, or unseen set, where distribution of additive noise in train and evaluation sets are different. For extracting the robust embeddings, we investigate applying the time dilation in the ResNet architecture, so-called dilated residual network (DRN). Dimension and number of segment level layers are tuned in this architecture. The proposed model with time dilation significantly outperformed the ResNet model and is comparable with the state-of-the-art SV systems on Voxceleb1 dataset. In addition, this architecture showed significant robustness in out of domain scenarios. Language mismatch is part of domain mismatch which recently is one of the main focuses of research in SV systems. Similar to image recognition field, we hypothesize that low-level convolutional neural network (CNN) layers are domain-specific features while high-level CNN layers are domain-independent and have more discriminative power. For adapting these domain-specific units, combination of triplet and intra-class losses are investigated. The adapted model on the evaluation part of the CMN2 dataset, relatively outperformed the DRN and x-vector SV systems without adaptation with 8.0 and 20.5 %, respectively in equal error-rate.
Projects Tesla
Authors Sarfjoo, Seyyed Saeed
Magimai.-Doss, Mathew
Marcel, S├ębastien
Editors Sarfjoo, Seyyed Saeed
Added by: [ADM]
Total mark: 0
  • Sarfjoo_Idiap-RR-10-2019.pdf (MD5: 9e9a1f5536a8efa7b9bdfc9de0433a46)