minwise hashing ninh pham rasmus pagh michael mitzenmacher bloom filter odd sketches jaccard coefficient set similarity
Tout plus