Finding Near-Duplicate Web Pages: A Large-Scale Evaluation of Algorithms

Monika Henzinger
Proc. SIGIR, ACM (2006)

Abstract