A framework to exponentially improve space and time of min-wise based algorithms.
Gets around the class="mathmlsrc">class="formulatext stixSupport mathImg" data-mathURL="/science?_ob=MathURL&_method=retrieve&_eid=1-s2.0-S0022000016300848&_mathId=si2.gif&_user=111111111&_pii=S0022000016300848&_rdoc=1&_issn=00220000&md5=cee42811d1e042484854439744da68ac" title="Click to view the MathML source">Ω(log1/ϵ)class="mathContainer hidden">class="mathCode"> lower bound needed by min-wise hash functions.
Only a constant degree of independence is required for the space and time improvements.
Demonstrates how to apply the framework for similarity and rarity over data stream.