<?xml version="1.0" encoding="UTF-8"?>
<collection xmlns="http://www.loc.gov/MARC21/slim">
	<record>
		<datafield tag="980" ind1=" " ind2=" ">
			<subfield code="a">REPORT</subfield>
		</datafield>
		<datafield tag="970" ind1=" " ind2=" ">
			<subfield code="a">Motlicek_Idiap-RR-16-2015/IDIAP</subfield>
		</datafield>
		<datafield tag="245" ind1=" " ind2=" ">
			<subfield code="a">EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION</subfield>
		</datafield>
		<datafield tag="700" ind1=" " ind2=" ">
			<subfield code="a">Motlicek, Petr</subfield>
		</datafield>
		<datafield tag="700" ind1=" " ind2=" ">
			<subfield code="a">Dey, Subhadeep</subfield>
		</datafield>
		<datafield tag="700" ind1=" " ind2=" ">
			<subfield code="a">Madikeri, Srikanth</subfield>
		</datafield>
		<datafield tag="700" ind1=" " ind2=" ">
			<subfield code="a">Burget, Lukas</subfield>
		</datafield>
		<datafield tag="653" ind1="1" ind2=" ">
			<subfield code="a">Automatic Speech Recognition</subfield>
		</datafield>
		<datafield tag="653" ind1="1" ind2=" ">
			<subfield code="a">i-vectors</subfield>
		</datafield>
		<datafield tag="653" ind1="1" ind2=" ">
			<subfield code="a">speaker recognition</subfield>
		</datafield>
		<datafield tag="653" ind1="1" ind2=" ">
			<subfield code="a">subspace Gaussian mixture models</subfield>
		</datafield>
		<datafield tag="856" ind1="4" ind2="0">
			<subfield code="i">EXTERNAL</subfield>
			<subfield code="u">http://publications.idiap.ch/attachments/reports/2015/Motlicek_Idiap-RR-16-2015.pdf</subfield>
			<subfield code="x">PUBLIC</subfield>
		</datafield>
		<datafield tag="088" ind1=" " ind2=" ">
			<subfield code="a">Idiap-RR-16-2015</subfield>
		</datafield>
		<datafield tag="260" ind1=" " ind2=" ">
			<subfield code="c">2015</subfield>
			<subfield code="b">Idiap</subfield>
			<subfield code="a">Rue Marconi 19, Martigny</subfield>
		</datafield>
		<datafield tag="771" ind1="2" ind2=" ">
			<subfield code="d">June 2015</subfield>
		</datafield>
		<datafield tag="520" ind1=" " ind2=" ">
			<subfield code="a">This paper presents Subspace Gaussian Mixture Model (SGMM)
approach employed as a probabilistic generative model to estimate
speaker vector representations to be subsequently used in the speaker
verification task. SGMMs have already been shown to significantly
outperform traditional HMM/GMMs in Automatic Speech Recognition
(ASR) applications. An extension to the basic SGMM framework
allows to robustly estimate low-dimensional speaker vectors
and exploit them for speaker adaptation. We propose a speaker verification
framework based on low-dimensional speaker vectors estimated
using SGMMs, trained in ASR manner using manual transcriptions.
To test the robustness of the system, we evaluate the
proposed approach with respect to the state-of-the-art i-vector extractor
on the NIST SRE 2010 evaluation set and on four different
length-utterance conditions: 3sec-10sec, 10 sec-30 sec, 30 sec-60 sec
and full (untruncated) utterances. Experimental results reveal that
while i-vector system performs better on truncated 3sec to 10sec and
10 sec to 30 sec utterances, noticeable improvements are observed
with SGMMs especially on full length-utterance durations. Eventually,
the proposed SGMM approach exhibits complementary properties
and can thus be efficiently fused with i-vector based speaker
verification system.</subfield>
		</datafield>
	</record>
</collection>