CONF Sarfjoo_SPSC_2021/IDIAP Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data Fabien, Mael Sarfjoo, Seyyed Saeed Madikeri, Srikanth Motlicek, Petr http://publications.idiap.ch/index.php/publications/showcite/Fabien_Idiap-RR-01-2023 Related documents 1st ISCA Symposium on Security and Privacy in Speech Communication 2021 10--13 10.21437/SPSC.2021-3 doi Criminal investigations mostly rely on the collection of speech conversational data in order to identify speakers and build or enrich an existing criminal network. Social network analysis tools are then applied to identify the central characters and the different communities within the network. This paper introduces a new method, Graph2Speak, to re-rank individuals after applying a speaker identification step, by leveraging the frequency of previous interactions extracted from a graph. We deploy our method on two candidate datasets for criminal conversational data, Crime Scene Investigation (CSI), a television show, and the ROXANNE simulated data. We demonstrate that our method can reduce the error rates of the speaker identification baseline by up to 12% (relative). REPORT Fabien_Idiap-RR-01-2023/IDIAP Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data Fabien, Mael Sarfjoo, Seyyed Saeed Madikeri, Srikanth Motlicek, Petr EXTERNAL http://publications.idiap.ch/attachments/reports/2020/Fabien_Idiap-RR-01-2023.pdf PUBLIC Idiap-RR-01-2023 2023 Idiap 19 rue Marconi, 1920 Lausanne January 2023 Criminal investigations mostly rely on the collection of speech conversational data in order to identify speakers and build or enrich an existing criminal network. Social network analysis tools are then applied to identify the most central characters and the different communities within the network. We introduce two candidate datasets for criminal conversational data, Crime Scene Investigation (CSI), a television show, and the ROXANNE simulated data. We also introduce the metric of conversation accuracy in the context of criminal investigations. By re-ranking candidate speakers based on the frequency of previous interactions, we improve the speaker identification baseline by 1.2% absolute (1.3% relative), and the conversation accuracy by 2.6% absolute (3.4% relative) on CSI data, and by 1.1% absolute (1.2% relative), and 2% absolute (2.5% relative) respectively on the ROXANNE simulated data. https://arxiv.org/abs/2006.02093 URL

</datafield>

<subfield code="a">Sarfjoo_SPSC_2021/IDIAP</subfield>

</datafield>

<subfield code="a">Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data</subfield>

</datafield>

<subfield code="a">Fabien, Mael</subfield>

</datafield>

<subfield code="a">Sarfjoo, Seyyed Saeed</subfield>

</datafield>

<subfield code="a">Madikeri, Srikanth</subfield>

</datafield>

<subfield code="a">Motlicek, Petr</subfield>

</datafield>

<subfield code="u">http://publications.idiap.ch/index.php/publications/showcite/Fabien_Idiap-RR-01-2023</subfield>

<subfield code="z">Related documents</subfield>

</datafield>

<subfield code="a">1st ISCA Symposium on Security and Privacy in Speech Communication</subfield>

</datafield>

</datafield>

</datafield>

</datafield>

<subfield code="a">Criminal investigations mostly rely on the collection of speech conversational data in order to identify speakers and build or enrich an existing criminal network. Social network analysis tools are then applied to identify the central characters and the different communities within the network. This paper introduces a new method, Graph2Speak, to re-rank individuals after applying a speaker identification step, by leveraging the frequency of previous interactions extracted from a graph. We deploy our method on two candidate datasets for criminal conversational data, Crime Scene Investigation (CSI), a television show, and the ROXANNE simulated data. We demonstrate that our method can reduce the error rates of the speaker identification baseline by up to 12% (relative).</subfield>

</datafield>

</record>

<subfield code="a">REPORT</subfield>

</datafield>

<subfield code="a">Fabien_Idiap-RR-01-2023/IDIAP</subfield>

</datafield>

<subfield code="a">Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data</subfield>

</datafield>

<subfield code="a">Fabien, Mael</subfield>

</datafield>

<subfield code="a">Sarfjoo, Seyyed Saeed</subfield>

</datafield>

<subfield code="a">Madikeri, Srikanth</subfield>

</datafield>

<subfield code="a">Motlicek, Petr</subfield>

</datafield>

<subfield code="i">EXTERNAL</subfield>

<subfield code="u">http://publications.idiap.ch/attachments/reports/2020/Fabien_Idiap-RR-01-2023.pdf</subfield>

<subfield code="x">PUBLIC</subfield>

</datafield>

<subfield code="a">Idiap-RR-01-2023</subfield>

</datafield>

<subfield code="b">Idiap</subfield>

<subfield code="a">19 rue Marconi, 1920 Lausanne</subfield>

</datafield>

<subfield code="d">January 2023</subfield>

</datafield>

<subfield code="a">Criminal investigations mostly rely on the collection of speech conversational data in order to identify speakers and build or enrich an existing criminal network. Social network analysis tools are then applied to identify the most central characters and the different communities within the network. We introduce two candidate datasets for criminal conversational data, Crime Scene Investigation (CSI), a television show, and the ROXANNE simulated data. We also introduce the metric of conversation accuracy in the context of criminal investigations. By re-ranking candidate speakers based on the frequency of previous interactions, we improve the speaker identification baseline by 1.2% absolute (1.3% relative), and the conversation accuracy by 2.6% absolute (3.4% relative) on CSI data, and by 1.1% absolute (1.2% relative), and 2% absolute (2.5% relative) respectively on the ROXANNE simulated data.</subfield>

</datafield>

<subfield code="u">https://arxiv.org/abs/2006.02093</subfield>

</datafield>

</record>

</collection>