The Research of Imbalanced Data Set of Sample Sampling Method Based on K-Means Cluster and Genetic Algorithm
详细信息查看全文 | 推荐本文 |
摘要
The classification favors seriously to the most kinds when we use the traditional sorter to classify the imbalanced data set. In order to effectively enhance classified performance of the minority kind in the imbalanced data set, we proposed one kind minority kind of sample sampling method based on the K-means cluster and the genetic algorithm in view of this question. We used K-means algorithm to cluster and group the minority kind of sample, and in each cluster we use the genetic algorithm to gain the new sample and to carry on the valid confirmation. Finally, through using KNN and SVM sorter we proved the method validity in the simulation experiment.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700