Mining frequent items in the time fading model
详细信息    查看全文
文摘

We introduce FDCMSS, a novel sketch-based algorithm for frequent items working in the time fading model. The algorithm cleverly combines key ideas borrowed from forward decay, the Count-Min and the Space Saving algorithms.

We formally prove the correctness of our algorithm.

We experimentally validate the algorithm on synthetic data distributed using a Zipf distribution, and also on real datasets.

We compare the performances and the error committed by our algorithm against λ-HCount, an algorithm recently proposed by Chen and Mei. Extensive experimental results show that FDCMSS outperforms λ-HCount with regard to speed, space used, precision attained and error committed on both synthetic and real datasets.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700