Centralized vs. distributed feature selection methods based on data complexity measures
Abstract
We propose a methodology for distributing the feature selection process based on several data complexity measures. We consider two strategies for partitioning the datasets: horizontal (i.e. by samples) and vertical (i.e. by features). We present an experimental study on 11 datasets (five of them microarrays), comparing the approaches in terms of number of selected features, classification accuracy and running time. The novel procedures significantly reduce the running time while maintaining (or even improving) the classification performance.
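To make the two partitioning strategies concrete, here is a minimal Python sketch of horizontal (by samples) and vertical (by features) splits of a small data matrix. It illustrates the partitioning step only; the function names, the contiguous near-equal chunking, and the toy data are assumptions for illustration and are not taken from the paper, which may assign samples or features to partitions differently.

```python
def split_indices(n_items, n_parts):
    """Split range(n_items) into n_parts contiguous, near-equal chunks (assumed scheme)."""
    base, extra = divmod(n_items, n_parts)
    chunks, start = [], 0
    for i in range(n_parts):
        size = base + (1 if i < extra else 0)  # first `extra` chunks get one more item
        chunks.append(list(range(start, start + size)))
        start += size
    return chunks


def horizontal_partition(X, n_parts):
    """Partition by samples: each part keeps all features for a subset of rows."""
    return [[X[r] for r in rows] for rows in split_indices(len(X), n_parts)]


def vertical_partition(X, n_parts):
    """Partition by features: each part keeps all rows for a subset of columns."""
    return [
        [[row[c] for c in cols] for row in X]
        for cols in split_indices(len(X[0]), n_parts)
    ]


# Toy data (hypothetical): 4 samples, 6 features.
X = [[r * 10 + c for c in range(6)] for r in range(4)]
h_parts = horizontal_partition(X, 2)  # 2 parts, each 2 samples x 6 features
v_parts = vertical_partition(X, 3)    # 3 parts, each 4 samples x 2 features
```

In a distributed setting, a feature selection method would then be run independently on each part and the partial results combined; how they are combined is the role of the complexity measures in the paper and is not reproduced here.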
