Choosing Feature Selection and Learning Algorithms in QSAR
详细信息    查看全文
  • 作者:Martin Eklund ; Ulf Norinder ; Scott Boyer ; Lars Carlsson
  • 刊名:Journal of Chemical Information and Modeling
  • 出版年:2014
  • 出版时间:March 24, 2014
  • 年:2014
  • 卷:54
  • 期:3
  • 页码:837-843
  • 全文大小:284K
  • 年卷期:v.54,no.3(March 24, 2014)
  • ISSN:1549-960X
文摘
Feature selection is an important part of contemporary QSAR analysis. In a recently published paper, we investigated the performance of different feature selection methods in a large number of in silico experiments conducted using real QSAR datasets. However, an interesting question that we did not address is whether certain feature selection methods are better than others in combination with certain learning methods, in terms of producing models with high prediction accuracy. In this report we extend our work from the previous investigation by using four different feature selection methods (wrapper, ReliefF, MARS, and elastic nets), together with eight learners (MARS, elastic net, random forest, SVM, neural networks, multiple linear regression, PLS, kNN) in an empirical investigation to address this question. The results indicate that state-of-the-art learners (random forest, SVM, and neural networks) do not gain prediction accuracy from feature selection, and we found no evidence that a certain feature selection is particularly well-suited for use in combination with a certain learner.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700