Unknown-Multiple Speaker clustering using HMM

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Idiap-RR
Citation:	ajmera-rr-02-07
Number:	Idiap-RR-07-2002
Year:	2002
Institution:	IDIAP
Address:	Martigny, Switzerland
Note:	ICSLP, Denver, Colorado, 2002
Abstract:	An HMM-based speaker clustering framework is presented, where the number of speakers and segmentation boundaries are unknown \emph{a priori}. Ideally, the system aims to create one pure cluster for each speaker. The HMM is ergodic in nature with a minimum duration topology. The final number of clusters is determined automatically by merging closest clusters and retraining this new cluster, until a decrease in likelihood is observed. In the same framework, we also examine the effect of using only the features from highly voiced frames as a means of improving the robustness and computational complexity of the algorithm. The proposed system is assessed on the 1996 HUB-4 evaluation test set in terms of both cluster and speaker purity. It is shown that the number of clusters found often correspond to the actual number of speakers.
Userfields:	ipdinar={2002}, ipdmembership={speech}, language={English},
Keywords:
Projects	Idiap
Authors	Ajmera, Jitendra Bourlard, Hervé Lapidot, I. McCowan, Iain A.
Crossref by	ajmera2002icslp
Added by:	[UNK]
Total mark:	0
Attachments
rr02-07.pdf rr02-07.ps.gz
Notes

processing time: 0.0003 seconds.