Detection of disguised speech in forensic science by humans and automatic systems

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Thesis
Citation:	Pettinato_THESIS_2020
Year:	2020
Month:	July
School:	Université de Lausanne Ecole des Sciences Criminelles
Note:	Master Thesis
Abstract:	The step preceding the speaker identification process consists in the determination of the authenticity of a speech sample. The focus of this thesis is on the performance of humans in detecting altered samples from replay, speech synthesis (TTS) and voice conversion (VC) systems. A listening test was constructed on the online survey platform LimeSurvey. The participants were asked to assess a series of recordings by first giving a binary evaluation (“authentic” or “altered”) and then by specifying their level of confidence on a 5-point scale. Moreover, the logic behind the aural approach was studied by inspecting the criteria used by the respondents in the assessment of the recordings. The same samples were also evaluated with an automatic LFCC-GMM-based system, trained on two different datasets, in order to make a comparison. The results show that the human’s performance (EER=0.10) surpasses the machine’s (EER=0.35 and EER=0.46) in the detection of the altered samples. However, sophisticated voice disguise systems have now reached levels of quality that can also easily fool most humans, which makes them a real threat to security and the identification process.
Keywords:	authentication, Biometrics, speaker recognition, Spoofing, Voice disguise
Projects	Idiap
Authors	Pettinato, Michela
Added by:	[UNK]
Total mark:	0
Attachments
Pettinato_THESIS_2020.pdf
Notes

processing time: 0.0011 seconds.