
<subfield code="a">REPORT</subfield>

</datafield>

<subfield code="a">Roy_Idiap-RR-35-2011/IDIAP</subfield>

</datafield>

<subfield code="a">Continuous Speech Recognition using Boosted Binary Features</subfield>

</datafield>

<subfield code="a">Roy, Anindya</subfield>

</datafield>

<subfield code="a">Magimai-Doss, Mathew</subfield>

</datafield>

<subfield code="a">Marcel, Sébastien</subfield>

</datafield>

<subfield code="a">continuous speech recognition, boosted binary features, resource management</subfield>

</datafield>

<subfield code="i">EXTERNAL</subfield>

<subfield code="u">http://publications.idiap.ch/attachments/reports/2011/Roy_Idiap-RR-35-2011.pdf</subfield>

<subfield code="x">PUBLIC</subfield>

</datafield>

<subfield code="a">Idiap-RR-35-2011</subfield>

</datafield>

<subfield code="b">Idiap</subfield>

</datafield>

<subfield code="d">October 2011</subfield>

</datafield>

<subfield code="a">A novel parts-based, binary-valued feature termed the Boosted Binary Feature (BBF) was recently proposed for automatic speech recognition (ASR). Such features examine specific pairs of time-frequency bins in the spectro-temporal plane. The most discriminative of these features are selected by boosting and integrated into a standard HMM-based system using either a multilayer perceptron (MLP) or a single-layer perceptron (SLP). Previous studies on the TIMIT phoneme recognition task showed that BBFs yield similar or better performance compared to cepstral features. This work extends that study to a continuous speech recognition task on the DARPA Resource Management database. Results show that, using an MLP, BBFs achieve a word error rate (5.5%) comparable to that of standard cepstral features (5.1%). With an SLP, the error rate for BBFs degrades far less (from 5.5% to 7.1%) than that for cepstral features (from 5.1% to 14.7%). In addition, BBFs can be selected well using auxiliary data.</subfield>

</datafield>

</record>

</collection>