Comparative Assessment of Scoring Functions on a Diverse Test Set
详细信息    查看全文
  • 作者:Tiejun Cheng ; Xun Li ; Yan Li ; Zhihai Liu ; Renxiao Wang
  • 刊名:Journal of Chemical Information and Modeling
  • 出版年:2009
  • 出版时间:April 27, 2009
  • 年:2009
  • 卷:49
  • 期:4
  • 页码:1079-1093
  • 全文大小:377K
  • 年卷期:v.49,no.4(April 27, 2009)
  • ISSN:1549-960X
文摘
Scoring functions are widely applied to the evaluation of protein−ligand binding in structure-based drug design. We have conducted a comparative assessment of 16 popular scoring functions implemented in main-stream commercial software or released by academic research groups. A set of 195 diverse protein−ligand complexes with high-resolution crystal structures and reliable binding constants were selected through a systematic nonredundant sampling of the PDBbind database and used as the primary test set in our study. All scoring functions were evaluated in three aspects, that is, “docking power”, “ranking power”, and “scoring power”, and all evaluations were independent from the context of molecular docking or virtual screening. As for “docking power”, six scoring functions, including GOLD::ASP, DS::PLP1, DrugScorePDB, GlideScore-SP, DS::LigScore, and GOLD::ChemScore, achieved success rates over 70% when the acceptance cutoff was root-mean-square deviation < 2.0 Å. Combining these scoring functions into consensus scoring schemes improved the success rates to 80% or even higher. As for “ranking power” and “scoring power”, the top four scoring functions on the primary test set were X-Score, DrugScoreCSD, DS::PLP, and SYBYL::ChemScore. They were able to correctly rank the protein−ligand complexes containing the same type of protein with success rates around 50%. Correlation coefficients between the experimental binding constants and the binding scores computed by these scoring functions ranged from 0.545 to 0.644. Besides the primary test set, each scoring function was also tested on four additional test sets, each consisting of a certain number of protein−ligand complexes containing one particular type of protein. Our study serves as an updated benchmark for evaluating the general performance of today’s scoring functions. Our results indicate that no single scoring function consistently outperforms others in all three aspects. Thus, it is important in practice to choose the appropriate scoring functions for different purposes.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700