End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Conference paper
Citation:	Muckenhirn_IJCB_2017
Publication status:	Accepted
Booktitle:	International Joint Conference on Biometrics
Year:	2017
Location:	Denver, Colorado, USA
Abstract:	Development of countermeasures to detect attacks performed on speaker verification systems through presentation of forged or altered speech samples is a challenging and open research problem. Typically, this problem is approached by extracting features through conventional short-term speech processing and feeding them to a binary classifier. In this article, we develop a convolutional neural network-based approach that learns in an end-to-end manner both the features and the binary classifier from the raw signal. Through investigations on two publicly available databases, namely, ASVspoof and AVspoof, we show that it yields systems comparable to or better than the state-of-the-art approaches for both physical access attacks and logical access attacks. Furthermore, the approach is shown to be complementary to a spectral statistics-based approach, which, similarly to the proposed approach, does not use prior assumptions related to speech signals.
Keywords:
Projects	Idiap UNITS SWAN
Authors	Muckenhirn, Hannah Magimai-Doss, Mathew Marcel, Sébastien
Added by:	[UNK]
Total mark:	0
Attachments
Muckenhirn_IJCB_2017.pdf
Notes

processing time: 0.0002 seconds.