弹性核k-NN分类算法及其在药物构效关系中的应用
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Elastic Kernel k-Nearest Neighbor Classifier and Its Application in Drug Structure-Activity Relationship
  • 作者:黄新 ; 罗逸平 ; 王梦贤 ; 周密
  • 英文作者:HUANG Xin;LUO Yiping;WANG Mengxian;ZHOU Mi;Management School, Hunan City University;
  • 关键词:核方法 ; k-最近邻 ; 构效关系 ; 弹性核
  • 英文关键词:kernel methods;;k-Nearest Neighbor(k-NN);;Structure-Activity Relationship(SAR);;elastic kernel
  • 中文刊名:HNCG
  • 英文刊名:Journal of Hunan City University(Natural Science)
  • 机构:湖南城市学院管理学院;
  • 出版日期:2019-07-15
  • 出版单位:湖南城市学院学报(自然科学版)
  • 年:2019
  • 期:v.28;No.100
  • 基金:湖南省哲学社会科学基金项目(18YBA065)
  • 语种:中文;
  • 页:HNCG201904010
  • 页数:5
  • CN:04
  • ISSN:43-1428/TU
  • 分类号:50-54
摘要
核方法利用核函数可以有效地解决非线性问题,在药物构效关系领域得到了广泛的应用.本文提出了一种新的弹性核k-最近邻算法(EKk-NN).首先,基于加权多项式核和径向基函数核构造了一种信息丰富的弹性核,所构造的弹性核能有效地利用局部核和全局核的优点,同时也为构造核函数提供了一种可行的方法;然后,在核方法的框架下,将弹性核耦合到k-最近邻算法.实际数据集的实验和分析表明,EKk-NN能明显提高分类性能.
        The kernel approaches have been gaining popularity in the field of drug Structure-Activity Relationship, which could effectively solve nonlinear problems by using the kernel function. In the present study, a novel Elastic Kernel k-Nearest Neighbor algorithm(EKk-NN) has been proposed. First, an informative novel elastic kernel is constructed based on polynomial kernel and radial basis function kernel.The constructed elastic kernel can effectively integrate the advantages of local kernel and global kernel, which can provide a feasible way for building the kernel function. Then, under the framework of kernel methods,this elastic kernel is extended to the k-Nearest Neighbor algorithm. Compared with the traditional kernel k-NN,experiments and analysis on the real data sets have shown that EKk-NN can significantly improve the performance of classification, which is really an attractive alternative technique.
引文
[1]COURANT R,HILBERT D.Methods of mathematical physics:vol.1[M].New York:Interscience Publication,1953.
    [2]HART P E.The condensed nearest neighbor rule[J].IEEETransactions on Information Theory,1968,14(3):515-516.
    [3]WILSON D L.Asymptotic properties of nearest neighbor rules using edited data[J].IEEE Transactions on Systems,Man and Cybernetics,1972,SMC-2(3):408-421.
    [4]ZHENG W F,TROPSHA A.Novel variable selection quantitative Structure-Property Relationship approach based on the k-Nearest-Neighbor principle[J].Journal of Chemical Information and Computer Sciences,2000,40(1):185-194.
    [5]AJMANI S,JADHAV K,KULKARNI S A.Threedimensional QSAR using the k-Nearest Neighbor method and its interpretation[J].Journal of Chemical Information and Modeling,2006,46(1):24-31.
    [6]刘松华,张军英,许进,等.Kernel-k NN:基于信息能度量的核k-最近邻算法[J].自动化学报,2010,36(12):1681-1688.
    [7]YU K,JI L,ZHANG X G.Kernel Nearest-Neighbor algorithm[J].Neural Processing Letters,2002,15(2):147-156.
    [8]HASTIE T,TIBSHIRANI R,FRIEDMAN J.The elements of statistical learning:data mining,inference and prediction:second edition[M].Berlin:Springer,2009.
    [9]TAYLOR J S,CRISTIANINI N.Kernel methods for pattern analysis[M].London:Cambridge University Press,2004.
    [10]SCH?LKOPF B,SMOLA A J.Learning with kernels[M].Cambridge:MIT Press,2002.
    [11]XUE Y,LI Z R,YAP C W,et al.Effect of molecular descriptor feature selection in support vector machine classification of pharmacokinetic and toxicological properties of chemical agents[J].Journal of Chemical Information&Computer Sciences,2004,44(5):1630-1638.
    [12]FAWCETT T.An introduction to ROC analysis[J].Pattern Recognition Letters,2006,27(8):861-874.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700