CONF
SyncAlign-Esp99/IDIAP
CLIENT / WORLD MODEL SYNCHRONOUS ALIGNEMENT FOR SPEAKER VERIFICATION
Mariéthoz, Johnny
Genoud, Dominique
Bimbot, Frédéric
Mokbel, Chafic
EXTERNAL
https://publications.idiap.ch/attachments/papers/1999/SyncAlignEurospeech99.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/syncalignesp-irr99
Related documents
6th European Conference on Speech Communication and Technology --- Eurospeech'99
1999
Budapest, Hungary
IDIAP-RR 99-23
In speaker verification, two independent stochastic models, i.e. a client model and a non-client (world) model, are generally used to verify the claimed identity using a likelihood ratio score. This paper investigates a variant of this approach based on a common hidden process for both models. In this framework, both models share the same topology, which is conditioned by the underlying phonetic structure of the utterance. Then, two different output distributions are defined corresponding to the client vs. world hypotheses. Based on this idea, a synchronous decoding algorithm and the corresponding training algorithm are derived. Our first experiments on the SESP telephone database indicate a slight improvement with respect to a baseline system using independent alignments. Moreover, synchronous alignment offers a reduced complexity during the decoding process. Interesting perspectives can be expected. Keywords : Stochastic Modeling, HMM, Synchronous Alignment, EM algorithm
REPORT
SyncAlignEsp-IRR99/IDIAP
CLIENT / WORLD MODEL SYNCHRONOUS ALIGNEMENT FOR SPEAKER VERIFICATION
Mariéthoz, Johnny
Genoud, Dominique
Bimbot, Frédéric
Mokbel, Chafic
EXTERNAL
https://publications.idiap.ch/attachments/reports/1999/rr99-23.pdf
PUBLIC
Idiap-RR-23-1999
1999
IDIAP
In speaker verification, two independent stochastic models, i.e. a client model and a non-client (world) model, are generally used to verify the claimed identity using a likelihood ratio score. This paper investigates a variant of this approach based on a common hidden process for both models. In this framework, both models share the same topology, which is conditioned by the underlying phonetic structure of the utterance. Then, two different output distributions are defined corresponding to the client vs. world hypotheses. Based on this idea, a synchronous decoding algorithm and the corresponding training algorithm are derived. Our first experiments on the SESP telephone database indicate a slight improvement with respect to a baseline system using independent alignments. Moreover, synchronous alignment offers a reduced complexity during the decoding process. Interesting perspectives can be expected. Keywords : Stochastic Modeling, HMM, Synchronous Alignment, EM algorithm