PostDOCK: A Structural, Empirical Approach to Scoring Protein Ligand Complexes
In this work we introduce a postprocessing filter (PostDOCK) that distinguishes true binding ligand-protein complexes from docking artifacts created by DOCK 4.0.1. PostDOCK is a pattern recognition system that relies on (1) a database of complexes, (2) biochemical descriptors of those complexes, and (3) machine learning tools. We use the Protein Data Bank (PDB) as the structural database of complexes and create diverse training and validation sets from it based on the "families of structurally similar proteins" (FSSP) hierarchy. For the biochemical descriptors, we consider terms from the DOCK score, empirical scoring, and buried solvent accessible surface area. For the machine learners, we use a random forest classifier and logistic regression. Our results were obtained on a test set of 44 structurally diverse protein targets. Our highest performing descriptor combinations obtained ~19-fold enrichment (39 of 44 binding complexes were correctly identified, while only 2 of 44 decoy complexes were allowed through), and our best overall accuracy was 92%.
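The reported figures can be checked with a short calculation. The sketch below assumes that enrichment is the ratio of the true-positive rate to the false-positive rate; the abstract does not define it, but this reading reproduces the ~19-fold value from the stated counts. The function name `enrichment_and_accuracy` is hypothetical and used only for illustration. The same counts also give an accuracy of about 92%, although the abstract does not say that both figures come from the same descriptor combination.

```python
def enrichment_and_accuracy(true_pos, n_binders, false_pos, n_decoys):
    """Return (enrichment, accuracy) for a binary binder/decoy filter.

    Assumption: enrichment = true-positive rate / false-positive rate.
    This definition is not given in the abstract; it is chosen because
    it matches the reported ~19-fold value.
    """
    tpr = true_pos / n_binders   # fraction of true binders accepted
    fpr = false_pos / n_decoys   # fraction of decoys wrongly accepted
    enrichment = tpr / fpr
    true_neg = n_decoys - false_pos  # decoys correctly rejected
    accuracy = (true_pos + true_neg) / (n_binders + n_decoys)
    return enrichment, accuracy


# Counts taken from the abstract: 39 of 44 binders kept, 2 of 44 decoys kept.
enrichment, accuracy = enrichment_and_accuracy(39, 44, 2, 44)
print(f"enrichment: {enrichment:.1f}-fold")
print(f"accuracy: {accuracy:.1%}")
```

Under this assumption the enrichment is (39/44) / (2/44) = 19.5 and the accuracy is (39 + 42) / 88, or roughly 92%.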
