Kernel k-nearest neighbor algorithm as a flexible SAR modeling tool

Authors: Dong-Sheng Cao^a ; Jian-Hua Huang^a ; Jun Yan^a ; Liang-Xiao Zhang^c ; Qian-Nan Hu^d ; Qing-Song Xu^b (dasongxu@gmail.com) ; Yi-Zeng Liang^a (yizeng_liang@263.net)
Keywords: k-nearest neighbor (k-NN) ; Kernel methods ; String kernel ; Structure-activity relationship (SAR)
Journal: Chemometrics and Intelligent Laboratory Systems
Year: 2012
Journal code: 21_01697439
Category: ch
Publication date: 15 May 2012
Volume: 114
Issue: Complete
Pages: 19-23
File size: 373 K

Abstract

A kernel version of the k-nearest neighbor (k-NN) algorithm has been developed to model the complex relationship between molecular descriptors and the bioactivities of compounds. Kernel k-NN performs the original k-NN algorithm after mapping the training samples from the input space into a high-dimensional feature space. It is easily constructed, because the distance between samples in the feature space can be derived directly from evaluations of the chosen kernel. The resulting kernel k-NN handles complex nonlinear relationships flexibly; more importantly, it can also cope with non-vectorial data simply by defining an appropriate kernel. Results on several real SAR datasets indicate that the performance of kernel k-NN is comparable to that of support vector machine methods. It can therefore be regarded as an alternative modeling technique for several chemical problems, including the study of structure-activity relationships (SAR). The source code implementing kernel k-NN in R is freely available at .
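
The idea in the abstract can be illustrated with a minimal sketch. It is written in Python for illustration and is not the authors' R implementation. It relies on the standard identity for the squared distance in feature space, d^2(x, y) = k(x, x) - 2 k(x, y) + k(y, y), so the feature map never has to be computed explicitly. The function names, the Gaussian kernel, the p-spectrum string kernel (a stand-in, since the abstract does not say which string kernel the paper uses), and the toy data are all assumptions made for this example.

```python
import math
from collections import Counter

def rbf_kernel(x, y, gamma=0.5):
    """Gaussian (RBF) kernel on equal-length numeric vectors (assumed choice)."""
    sq = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq)

def spectrum_kernel(s, t, p=2):
    """p-spectrum string kernel: dot product of substring counts of length p.
    A stand-in for non-vectorial data; not necessarily the paper's kernel."""
    cs = Counter(s[i:i + p] for i in range(len(s) - p + 1))
    ct = Counter(t[i:i + p] for i in range(len(t) - p + 1))
    return float(sum(cs[g] * ct[g] for g in cs))

def kernel_distance_sq(x, y, kernel):
    """Squared feature-space distance: k(x,x) - 2*k(x,y) + k(y,y)."""
    return kernel(x, x) - 2.0 * kernel(x, y) + kernel(y, y)

def kernel_knn_predict(train_X, train_y, query, kernel, k=3):
    """Majority vote among the k training samples closest to the query
    in the kernel-induced feature space."""
    scored = sorted(
        ((kernel_distance_sq(x, query, kernel), label)
         for x, label in zip(train_X, train_y)),
        key=lambda pair: pair[0],
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]

# Hypothetical toy data: two descriptor-space clusters.
vec_X = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
vec_y = ["inactive"] * 3 + ["active"] * 3

# Hypothetical toy data: SMILES-like strings, classified without descriptors.
str_X = ["CCCC", "CCCO", "CCCCC", "c1ccccc1", "c1ccccc1O", "c1ccncc1"]
str_y = ["aliphatic"] * 3 + ["aromatic"] * 3

vec_pred = kernel_knn_predict(vec_X, vec_y, (0.5, 0.5), rbf_kernel, k=3)
str_pred = kernel_knn_predict(str_X, str_y, "CCCCO", spectrum_kernel, k=3)
print(vec_pred, str_pred)
```

Only the kernel argument changes between the vector and the string case, which is the sense in which a kernel k-NN can handle non-vectorial data by redefining the kernel.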
