Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Conference paper
Citation:	Lecorve_INTERSPEECH-2_2012
Booktitle:	Proceedings of Interspeech
Year:	2012
Month:	September
Pages:	to appear
Location:	Portland, Oregon, USA
Crossref:	Lecorve_Idiap-RR-21-2012: Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Lecorvé, Gwénolé and Motlicek, Petr, Idiap-RR-21-2012
Abstract:	Recurrent neural network language models (RNNLMs) have recently shown to outperform the venerable n-gram language models (LMs). However, in automatic speech recognition (ASR), RNNLMs were not yet used to directly decode a speech signal. Instead, RNNLMs are rather applied to rescore N-best lists generated from word lattices. To use RNNLMs in earlier stages of the speech recognition, our work proposes to transform RNNLMs into weighted finite state transducers approximating their underlying probability distribution. While the main idea consists in discretizing continuous representations of word histories, we present a first implementation of the approach using clustering techniques and entropy-based pruning. Achieved experimental results on LM perplexity and on ASR word error rates are encouraging since the performance of the discretized RNNLMs is comparable to the one of n-gram LMs.
Keywords:	ASR, Automatic Speech Recognition, Language Models, recurrent neural network, speech decoding, weighted finite state transducer, WFST
Projects	Idiap
Authors	Lecorvé, Gwénolé Motlicek, Petr
Added by:	[UNK]
Total mark:	0
Attachments
Lecorve_INTERSPEECH-2_2012.pdf
Notes

processing time: 0.0009 seconds.