Detecting near-duplicates for web crawling

Gurmeet Singh Manku
Arvind Jain
Anish Das Sarma
WWW 2007 (16th International Conference on the World Wide Web), ACM, Banff, pp. 141-150

Abstract