A Kernel Wrapper for Phoneme Sequence Recognition

Type of publication:	Book chapter
Citation:	Keshet_WILEY-2_2009
Booktitle:	Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods
Year:	2009
Publisher:	John Wiley and Sons
Abstract:	We describe a kernel wrapper: a Mercer kernel for phoneme sequence recognition that is built from operations on the Gaussian kernel and is suitable for any sequence kernel classifier. We start by presenting a kernel-based algorithm for phoneme sequence recognition that aims to minimize the Levenshtein (edit) distance between the predicted and the true phoneme sequences. Motivated by the good results of frame-based phoneme classification using SVMs with a Gaussian kernel, we devise a kernel for speech utterances and phoneme sequences that generalizes the kernel function for frame-based phoneme classification and adds timing constraints in the form of transition and duration constraints. The kernel function has three parts, corresponding to a phoneme acoustic model, a phoneme duration model, and a phoneme transition model. We present encouraging initial experimental results on the TIMIT corpus.
Keywords:
Projects:	Idiap
Authors:	Keshet, Joseph; Chazan, Dan
Editors:	Keshet, Joseph; Bengio, Samy