REPORT
Szaszak_Idiap-RR-23-2013/IDIAP
Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language
Szaszak, Gyorgy
Beke, Andras
EXTERNAL
https://publications.idiap.ch/attachments/reports/2013/Szaszak_Idiap-RR-23-2013.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/Szaszak_INTERSPEECH2013_2013
Related documents
Idiap-RR-23-2013
2013
Idiap
June 2013
This paper investigates the usage of prosody for the improvement of keyword spotting, focusing on the highly agglutinating Hungarian language, where keyword spotting cannot be effectively performed using LVCSR, as such systems are either unavailable or hard to operate due to high OOV rates and poor N-gram language modelling capabilities. Therefore, the applied keyword spotting system is based on confidence scores computed as a ratio of acoustic scores obtained in two ways: firstly, by decoding with an universal background model; and secondly, by decoding with a keyword model embedded into filler models. Prosody is used to perform an automatic phonological phrase alignment for speech, proven to be useful for automatic partial word boundary detection in fixed stress languages. Several features deduced from the phonological phrase alignment are investigated to rescore baseline confidence scores both in a rule-based and in a data-driven manner. Results show that in relevant operating points of the system, a false alarm reduction of 10% - 40% can be reached by the same miss probability rates.
CONF
Szaszak_INTERSPEECH2013_2013/IDIAP
Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language
Szaszak, Gyorgy
Beke, Andras
Proc. of Interspeech 2013
2013