Exploiting Hyperlinks to Learn a Retrieval Model

Type of publication:	Conference paper
Citation:	grangier:2005:nips_workshop
Booktitle:	NIPS Workshop on Learning to Rank
Year:	2005
Month:	12
Address:	Whistler, Canada
Crossref:	grangier:2005:idiap-05-21: Inferring Document Similarity from Hyper-links, Grangier, David and Bengio, Samy, Idiap-RR-21-2005
Abstract:	Information Retrieval (IR) aims at solving a ranking problem: given a query $q$ and a corpus $C$, the documents of $C$ should be ranked such that the documents relevant to $q$ appear above the others. This task is generally performed by ranking the documents $d \in C$ according to their similarity with respect to $q$, $sim(q,d)$. The identification of an effective function $a,b \to sim(a,b)$ could be performed using a large set of queries with their corresponding relevance assessments. However, such data are especially expensive to label; as an alternative, we propose to rely on hyperlink data, which convey analogous semantic relationships. We then empirically show that a measure $sim$ inferred from hyperlinked documents can actually outperform the state-of-the-art {\em Okapi} approach when applied over a non-hyperlinked retrieval corpus.
Userfields:	ipdmembership={speech},
Keywords:
Projects:	Idiap
Authors:	Grangier, David and Bengio, Samy
Attachments
grangier-nips-ranking-workshop.pdf grangier-nips-ranking-workshop.ps.gz