ARTICLE
Habibi_DKE_2016/IDIAP
Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information
Habibi, Maryam
Mahdabi, Parvaz
Popescu-Belis, Andrei
https://publications.idiap.ch/index.php/publications/showcite/Habibi_Idiap-RR-16-2016
Related documents
Data & Knowledge Engineering Journal
2016
REPORT
Habibi_Idiap-RR-16-2016/IDIAP
Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information
Habibi, Maryam
Mahdabi, Parvaz
Popescu-Belis, Andrei
Context Modeling
Evaluation of Information Retrieval.
Query Expansion
Query Refinement
Speech-based Information Retrieval
EXTERNAL
https://publications.idiap.ch/attachments/reports/2016/Habibi_Idiap-RR-16-2016.pdf
PUBLIC
Idiap-RR-16-2016
2016
Idiap
June 2016
accepted version by the journal (before copy-editing)
This paper introduces a query refinement method applied to questions asked by users to a system during a meeting or a conversation that they have with other users. To answer the questions, the proposed method leverages the local context of the conversation along with semantic resources, either WordNet or word embeddings from word2vec. The method first represents the local context by extracting keywords from the transcript of the conversation, which is obtained from a real-time Automatic Speech Recognition (ASR) system and may contain noise. It then expands the queries with keywords that best represent the topic of the query, i.e.\ expansion keywords accompanied by weights indicating their topical similarity to the query. Finally, semantically related terms are added, using two options: either synonymous terms drawn from WordNet or similar words based on distributed representations in a low-dimensional word embedding space learned using word2vec. To evaluate the system, we introduce a dataset (named AREX for AMI Requests for Explanations) and an evaluation metric based on relevance judgments collected by crowdsourcing. We compare our query expansion approach with other methods, over queries from the AREX dataset, showing the superiority of our method when either manual or automatic transcripts of the AMI Meeting Corpus are used.