REPORT Szaszak_Idiap-RR-25-2013/IDIAP Automatic Speech Indexing System of Bilingual Video Parliament Interventions Szaszak, Gyorgy Cernak, Milos Garner, Philip N. Motlicek, Petr Nanchen, Alexandre Tarsetti, Flavio EXTERNAL http://publications.idiap.ch/attachments/reports/2013/Szaszak_Idiap-RR-25-2013.pdf PUBLIC Idiap-RR-25-2013 2013 Idiap July 2013 This paper presents the development and evaluation of an automatic audio indexing system designed for a special task: work in a bilingual environment in the Parliament of the Canton of Valais in Switzerland, with two official languages, German and French. As several speakers are bilingual, language changes may occur within speaker or even within utterance. Two audio indexing approaches are presented and compared: in the first, speech indexing is based on bilingual automatic speech recognition; in the second, language identification is used after speaker diarization in order to select the corresponding monolingual speech recognizer for decoding. The approaches are later combined. Speaker adaptive training is also addressed and evaluated. Accuracy of language identification and speech recognition for the monolingual and bilingual cases are presented and compared, in parallel with a brief description of the system and the user interface. Finally, the audio indexing system is also evaluated from an information retrieval point of view.

<subfield code="a">REPORT</subfield>

</datafield>

<subfield code="a">Szaszak_Idiap-RR-25-2013/IDIAP</subfield>

</datafield>

<subfield code="a">Automatic Speech Indexing System of Bilingual Video Parliament Interventions</subfield>

</datafield>

<subfield code="a">Szaszak, Gyorgy</subfield>

</datafield>

<subfield code="a">Cernak, Milos</subfield>

</datafield>

<subfield code="a">Garner, Philip N.</subfield>

</datafield>

<subfield code="a">Motlicek, Petr</subfield>

</datafield>

<subfield code="a">Nanchen, Alexandre</subfield>

</datafield>

<subfield code="a">Tarsetti, Flavio</subfield>

</datafield>

<subfield code="i">EXTERNAL</subfield>

<subfield code="u">http://publications.idiap.ch/attachments/reports/2013/Szaszak_Idiap-RR-25-2013.pdf</subfield>

<subfield code="x">PUBLIC</subfield>

</datafield>

<subfield code="a">Idiap-RR-25-2013</subfield>

</datafield>

<subfield code="b">Idiap</subfield>

</datafield>

</datafield>

<subfield code="a">This paper presents the development and evaluation of an automatic audio indexing system designed for a special task: work in a bilingual environment in the Parliament of the Canton of Valais in Switzerland, with two official languages, German and French. As several speakers are bilingual, language changes may occur within speaker or even within utterance. Two audio indexing approaches are presented and compared: in the first, speech indexing is based on bilingual automatic speech recognition; in the second, language identification is used after speaker diarization in order to select the corresponding monolingual speech recognizer for decoding. The approaches are later combined. Speaker adaptive training is also addressed and evaluated. Accuracy of language identification and speech recognition for the monolingual and bilingual cases are presented and compared, in parallel with a brief description of the system and the user interface. Finally, the audio indexing system is also evaluated from an information retrieval point of view.</subfield>

</datafield>

</record>

</collection>