CONF
Cernak_INTERSPEECH_2015/IDIAP
An Empirical Model of Emphatic Word Detection
Cernak, Milos
Honnet, Pierre-Edouard
Speech Analysis
EXTERNAL
https://publications.idiap.ch/attachments/papers/2015/Cernak_INTERSPEECH_2015.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/Cernak_Idiap-RR-11-2015
Related documents
Proc. of Interspeech
Dresden, Germany
2015
ISCA
573-577
The paper presents an empirical model of emphatic word detection, as an alternative to conventional machine-learning-based methods. The model is based on the Probabilistic Amplitude Demodulation (PAD) that is iteratively applied for getting syllable and stress modulations, i.e., using the cascaded PAD method. The emphatic words are detected by prominent peaks of the stress modulation and by considering the peaks that are stressed or accented. The cascaded demodulation steered with general purpose values derived from 200ms long average syllable duration, yields to detection accuracy of 81%-83%. Speaker-dependent cascaded demodulation, considering specific speaking rate of the speakers, yields to detection accuracy of 86%-91%. The advantages of the proposed empirical detection model are (i) noise-robustness, (ii) language-independence and (iii) it does not require a training phase.
REPORT
Cernak_Idiap-RR-11-2015/IDIAP
An Empirical Model of Emphatic Word Detection
Cernak, Milos
Honnet, Pierre-Edouard
probabilistic amplitude demodulation
speech emphasis
EXTERNAL
https://publications.idiap.ch/attachments/reports/2015/Cernak_Idiap-RR-11-2015.pdf
PUBLIC
Idiap-RR-11-2015
2015
Idiap
June 2015