Making Retrieval Faster Through Document Clustering
Type of publication: | Idiap-RR |
Citation: | grangier:2004:idiap-04-02 |
Number: | Idiap-RR-02-2004 |
Year: | 2004 |
Institution: | IDIAP |
Abstract: | This work addresses the problem of reducing the time between query submission and results output in a retrieval system. The goal is achieved by considering only a database fraction as small as possible during the retrieval process. Our approach is based on a new clustering technique and comparisons with other clustering methods presented in the literature are performed. Our algorithm is shown to outperform the other techniques: retrieval performances close to those obtained with the whole corpus are achieved by selecting only 5\% of the data. |
Userfields: | ipdmembership={speech}, |
Keywords: | |
Projects |
Idiap |
Authors | |
Added by: | [UNK] |
Total mark: | 0 |
Attachments
|
|
Notes
|
|
|