REPORT grangier:2004:idiap-04-02/IDIAP Making Retrieval Faster Through Document Clustering Grangier, David Vinciarelli, Alessandro EXTERNAL https://publications.idiap.ch/attachments/reports/2004/rr04-02.pdf PUBLIC Idiap-RR-02-2004 2004 IDIAP This work addresses the problem of reducing the time between query submission and results output in a retrieval system. The goal is achieved by considering only a database fraction as small as possible during the retrieval process. Our approach is based on a new clustering technique and comparisons with other clustering methods presented in the literature are performed. Our algorithm is shown to outperform the other techniques: retrieval performances close to those obtained with the whole corpus are achieved by selecting only 5\% of the data.