Hashing-based clustering in high dimensional data

读者指南

远程访问

科技查新

公务邮箱

NSTL服务站

Hashing-based clustering in high dimensional data

详细信息查看全文

作者：Juan Zamora^a ; ^{juan.zamora@usm.cl" class="auth_mail" title="E-mail the corresponding author} ; Marcelo Mendoza ; ^b ; ^{mmendoza@inf.utfsm.cl" class="auth_mail" title="E-mail the corresponding author} ; ^{marcelo.mendoza@usm.cl" class="auth_mail" title="E-mail the corresponding author} ; Hé ; ctor Allende^a ; ^{hallende@inf.utfsm.cl" class="auth_mail" title="E-mail the corresponding author}
关键词：Locality sensitive hashing ; High dimensional clustering ; Min-wise hashing ; Random hyperplanes
刊名：Expert Systems with Applications
出版年：2016
出版时间：15 November 2016
年：2016
卷：62
期：Complete
页码：202-211
全文大小：518 K

文摘

•: We modify hashing strategies to cluster high dimensional documents.
•: We estimate the Jaccard similarity by counting bucket collisions between documents.
•: We introduce a penalized Hamming function to approximate the cosine similarity.
•: Both strategies allow improving the quality of the detected clusters.

网站地图　|　常见问题　|　交通位置　|　联系我们　|　OA远程办公　|　English

© 2004-2018 中国地质图书馆版权所有京ICP备05064691号京公网安备11010802017129号

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700