Integrated Pronunciation Learning for Automatic Speech Recognition Using Probabilistic Lexical Modeling

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Conference paper
Citation:	Rasipuram_ICASSP_2015
Publication status:	Published
Booktitle:	International Conference on Acoustics, Speech and Signal Processing
Year:	2015
Month:	April
Pages:	5176-5180
Location:	South Brisbane, QLD
DOI:	10.1109/ICASSP.2015.7178958
Abstract:	Standard automatic speech recognition (ASR) systems use phoneme-based pronunciation lexicon prepared by linguistic experts. When the hand crafted pronunciations fail to cover the vocabulary of a new domain, a grapheme-to-phoneme (G2P) converter is used to extract pronunciations for new words and then a phoneme- based ASR system is trained. G2P converters are typically trained only on the existing lexicons. In this paper, we propose a grapheme-based ASR approach in the framework of probabilistic lexical modeling that integrates pronunciation learning as a stage in ASR system training, and exploits both acoustic and lexical resources (not necessarily from the domain or language of interest). The proposed approach is evaluated on four lexical resource constrained ASR tasks and compared with the conventional two stage approach where G2P training is followed by ASR system development.
Keywords:
Projects	Idiap
Authors	Rasipuram, Ramya Razavi, Marzieh Magimai-Doss, Mathew
Added by:	[UNK]
Total mark:	0
Attachments
Rasipuram_ICASSP_2015.pdf
Notes

processing time: 0.0009 seconds.