Idiap Research Institute
Making Retrieval Faster Through Document Clustering
Type of publication: Idiap-RR
Citation: grangier:2004:idiap-04-02
Number: Idiap-RR-02-2004
Year: 2004
Institution: IDIAP
Abstract: This work addresses the problem of reducing the time between query submission and result output in a retrieval system. The goal is achieved by considering as small a fraction of the database as possible during the retrieval process. Our approach is based on a new clustering technique, which is compared with other clustering methods presented in the literature. Our algorithm is shown to outperform the other techniques: retrieval performance close to that obtained with the whole corpus is achieved by selecting only 5% of the data.
Userfields: ipdmembership={speech}
Projects: Idiap
Authors: Grangier, David; Vinciarelli, Alessandro
Attachments
  • rr04-02.pdf
  • rr04-02.ps.gz