使用相关向量机及一致性建模预测大鼠经口急性毒性
详细信息    查看官网全文
摘要
急性毒性的测定是药物开发中最重要的步骤,可以用半数致死量(LD_(50))来表示。由于急性毒性的体内试验非常昂贵和耗时,因此亟需开发其计算机预测模型。本研究中,我们基于7314个多样性化合物及其大鼠经口LD_(50)值,建立了一系列相关向量机(RVM)回归模型以预测化合物的经口急性毒性。与此同时,我们还将其与最邻近回归、随机森林(RF)、支持向量机、局部近似高斯过程、多层感知器集群和extreme梯度推进算法等六种方法构建的模型进行了比较。在各模型中,卡方统计量被用于分子描述符和结构指纹(PubChemFP或SubFP)的选择。单个模型的测试集预测精度q_(ext)~2为0.572~0.659。从测试集的总体预测精度来看,拉普拉斯核RVM和RF算法预测能力更好。我们还将各单独模型平均以建立四个一致性模型,取得了较好的预测精度(q_(ext)~2=0.669-0.689)。最后,我们还识别和分析了一系列急性毒性的描述符和子结构警报。我们认为最优一致性模型的预测精度较高,可以作为一个可靠的大鼠经口急性毒性预测工具。
In this study,based on 7314 diverse chemicals with rat oral LD_(50) values,relevance vector machine(RVM)technique was employed to build the regression models for the prediction of oral acute toxicity in rat,which were compared with those built using other six approaches,including k-nearest-neighbor,random forest(RF),support vector machine,local approximate Gaussian process,multilayer perceptron ensemble,and eXtreme gradient boosting.Dimension reduction was done by the Chi-squared statistics.The prediction capabilities(q_(ext)~2) of individual models ranged 0.572-0.659 for the test set.Considering the overall prediction accuracy for the test set,RVM with Laplacian kernel and RF were recommended to build models of rat oral acute toxicity.Then four consensus models were developed and predicted better for the test set(q_(ext)~2 = 0.669-0.689).Finally,some property or substructure alerts to oral acute toxicity were identified and analyzed.We believe that the best consensus model can be used as a reliable tool to filter out compounds with high rat oral acute toxicity.
引文
[1]Lei,T.;Li,Y.;Song,Y;Li,D.;Sun,H.;Hou,T.,ADMET evaluation in drug discovery:15.Accurate prediction of rat oral acute toxicity using relevance vector machine and consensus modeling.J.Cheminformatics 2016,8:1-19.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700