REPORT
Rasipuram_Idiap-RR-38-2011/IDIAP
Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM
Rasipuram, Ramya
Magimai-Doss, Mathew
grapheme
grapheme-to-phoneme converter
Lexicon
multi- layer perceptron
phoneme
EXTERNAL
https://publications.idiap.ch/attachments/reports/2012/Rasipuram_Idiap-RR-38-2011.pdf
PUBLIC
Idiap-RR-38-2011
2011
Idiap
December 2011
This paper proposes a novel grapheme-to-phoneme (G2P) conversion approach where first the probabilistic relation between graphemes and phonemes is captured from acoustic data using
Kullback-Leibler divergence based hidden Markov model (KL-HMM) system. Then, through a simple decoding framework the information in this probabilistic relation is integrated with the sequence information in the orthographic transcription of the word to infer the phoneme sequence. One of the main application of the proposed G2P approach is in the area of low linguistic resource based automatic speech recognition or text-to-speech systems. We demonstrate this potential through a simulation study where linguistic resources from one domain is used to create linguistic resources for a different domain.