REPORT morris-COM-02-03/IDIAP An information theoretic measure of sequence recognition performance Morris, Andrew contingency tables likelihood ratio mutual information word error rate EXTERNAL http://publications.idiap.ch/attachments/reports/2002/com02-03.pdf PUBLIC Idiap-Com-03-2002 2002 IDIAP Sequence recognition performance is often summarised first in terms of the number of hits (H), substitutions (S), deletions (D) and insertions (I), and then as a single statistic by the "word error rate" WER = 100(S + D + I)/(H + S + D). While in common use, WER has two disadvantages as a performance measure. One is that it has no upper bound, so it doesn't tell you how good a system is, only that one is better than another. The other is that it is not D/I symmetric, although deletions and insertions are equally disadvantageous. At low error rates these limitations can be ignored. However, for the high error rates which can occur during tests for speech recognition in noise, the WER measure starts to misbehave, giving far more weight to insertions than to deletions and regularly "exceeding 100%". Here we derive an alternative summary statistic for sequence recognition accuracy: WIP = H^2/((H + S + D)(H + S + I)). The WIP (word information preserved) measure results from an approximation to the proportion of the information about the true sequence which is preserved in the recognised sequence. It has comparable simplicity to WER but neither of its disadvantages.
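
The two statistics named in the abstract can be compared on a small worked example. The following is a minimal sketch rather than code from the report: it assumes the formulas read WER = 100(S + D + I)/(H + S + D) and WIP = H^2/((H + S + D)(H + S + I)), with the plus signs restored, and the function names `wer` and `wip` as well as the counts used are hypothetical, chosen only for illustration.

```python
def wer(h: int, s: int, d: int, i: int) -> float:
    """Word error rate in percent: 100 * (S + D + I) / (H + S + D)."""
    n_ref = h + s + d  # number of words in the true (reference) sequence
    if n_ref == 0:
        raise ValueError("reference sequence is empty")
    return 100.0 * (s + d + i) / n_ref


def wip(h: int, s: int, d: int, i: int) -> float:
    """Word information preserved: H^2 / ((H + S + D) * (H + S + I))."""
    n_ref = h + s + d  # number of words in the true sequence
    n_hyp = h + s + i  # number of words in the recognised sequence
    if n_ref == 0 or n_hyp == 0:
        raise ValueError("reference or recognised sequence is empty")
    return (h * h) / (n_ref * n_hyp)


# Illustrative counts (not taken from the report): many insertions.
print(wer(2, 1, 1, 6))  # 200.0, i.e. WER exceeds 100%
print(wip(2, 1, 1, 6))  # 4/36, stays within [0, 1]

# Swapping the deletion and insertion counts changes WER but not WIP.
print(wer(2, 1, 6, 1))  # 800/9, roughly 88.9
print(wip(2, 1, 6, 1))  # 4/36 again
```

With these counts, exchanging deletions and insertions moves WER from 200% to roughly 89% while WIP is unchanged, which is the D/I asymmetry the abstract describes.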

<subfield code="a">REPORT</subfield>

</datafield>

<subfield code="a">morris-COM-02-03/IDIAP</subfield>

</datafield>

<subfield code="a">An information theoretic measure of sequence recognition performance</subfield>

</datafield>

<subfield code="a">Morris, Andrew</subfield>

</datafield>

<subfield code="a">contingency tables</subfield>

</datafield>

<subfield code="a">likelihood ratio</subfield>

</datafield>

<subfield code="a">mutual information</subfield>

</datafield>

<subfield code="a">word error rate</subfield>

</datafield>

<subfield code="i">EXTERNAL</subfield>

<subfield code="u">http://publications.idiap.ch/attachments/reports/2002/com02-03.pdf</subfield>

<subfield code="x">PUBLIC</subfield>

</datafield>

<subfield code="a">Idiap-Com-03-2002</subfield>

</datafield>

<subfield code="b">IDIAP</subfield>

</datafield>

<subfield code="a">Sequence recognition performance is often summarised first in terms of the number of hits (H), substitutions (S), deletions (D) and insertions (I), and then as a single statistic by the "word error rate" WER = 100(S + D + I)/(H + S + D). While in common use, WER has two disadvantages as a performance measure. One is that it has no upper bound, so it doesn't tell you how good a system is, only that one is better than another. The other is that it is not D/I symmetric, although deletions and insertions are equally disadvantageous. At low error rates these limitations can be ignored. However, for the high error rates which can occur during tests for speech recognition in noise, the WER measure starts to misbehave, giving far more weight to insertions than to deletions and regularly "exceeding 100%". Here we derive an alternative summary statistic for sequence recognition accuracy: WIP = H^2/((H + S + D)(H + S + I)). The WIP (word information preserved) measure results from an approximation to the proportion of the information about the true sequence which is preserved in the recognised sequence. It has comparable simplicity to WER but neither of its disadvantages.</subfield>

</datafield>

</record>

</collection>