Video hashing based on appearance and attention features fusion via DBN
详细信息    查看全文
文摘
Video hashing has attracted increasing attention in the field of large-scale video retrieval. However, only low-level features or their combinations, referred to as appearance features, are used to generate the video hash in most of the existing video hashing algorithms and these kinds of features are referred to as appearance features. In this paper, a visual attention model is used to extract visual attention features, and the video hash is generated from a fusion of visual-appearance and visual-attention features via a deep belief network (DBN) to obtain representative video features. In addition, hash distance is taken as a vector to measure the similarity between hashes. BER is used as the amplitude of hash distance and the vector cosine similarity is used as the angle of hash distance. Experimental results demonstrate that the fusion of visual appearance and attention features brings about better performance of video hash on recall and precision rates, and the angle of hash distance is useful to improve the accuracy of hash matching.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700