Multi-objective semi-supervised clustering of tissue samples for cancer diagnosis
详细信息    查看全文
  • 作者:Sriparna Saha ; Kuldeep Kaushik ; Abhay Kumar Alok ; Sudipta Acharya
  • 刊名:Soft Computing - A Fusion of Foundations, Methodologies and Applications
  • 出版年:2016
  • 出版时间:September 2016
  • 年:2016
  • 卷:20
  • 期:9
  • 页码:3381-3392
  • 全文大小:1,374 KB
  • 刊物类别:Engineering
  • 刊物主题:Numerical and Computational Methods in Engineering
    Theory of Computation
    Computing Methodologies
    Mathematical Logic and Foundations
    Control Engineering
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1433-7479
  • 卷排序:20
文摘
In the domain of bioinformatics, the clustering of gene expression profiles of different tissue samples over different experimental conditions has gained importance with the invention of micro-array based technology. This study also has some impact on cancer diagnosis. The proper classification of cancer tissue samples generated using the micro-array technology helps in detecting cancers in an automated way. In the current paper we have developed a semi-supervised clustering technique for proper partitioning of these gene expression data sets. Semi-supervised clustering is a combination of unsupervised and supervised classification techniques. It uses some amount of supervised information and a large collection of unsupervised data. Here a multi-objective based semi-supervised clustering technique is developed for solving the cancer tissue classification problem. Different combinations of objective functions are used. As the supervised information we assume that class labels of 10 % data are available. The proposed technique is evaluated for three open source benchmark cancer data sets (brain tumor data set, adult malignancy and small round blood cell tumors). Two classification quality measures, viz., Adjusted Rand Index and Classification Accuracy are used to measure the goodness of the obtained partitionings. Obtained results are compared with several state-of-the-art clustering techniques. Moreover, significant gene markers have been identified and demonstrated visually from the clustering solutions obtained.KeywordsCancer tissue sample classificationGene expression data clusteringMulti-objective optimizationSemi-supervised clustering

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700