Similarity−Potency Trees: A Method to Search for SAR Information in Compound Data Sets and Derive SAR Rules
详细信息    查看全文
  • 作者:Mathias Wawer ; Jrgen Bajorath
  • 刊名:Journal of Chemical Information and Modeling
  • 出版年:2010
  • 出版时间:August 23, 2010
  • 年:2010
  • 卷:50
  • 期:8
  • 页码:1395-1409
  • 全文大小:796K
  • 年卷期:v.50,no.8(August 23, 2010)
  • ISSN:1549-960X
文摘
An intuitive and generally applicable analysis method, termed similarity−potency tree (SPT), is introduced to mine structure−activity relationship (SAR) information in compound data sets of any source. Only compound potency values and nearest-neighbor similarity relationships are considered. Rather than analyzing a data set as a whole, in part overlapping compound neighborhoods are systematically generated and represented as SPTs. This local analysis scheme simplifies the evaluation of SAR information and SPTs of high SAR information content are easily identified. By inspecting only a limited number of compound neighborhoods, it is also straightforward to determine whether data sets contain only little or no interpretable SAR information. Interactive analysis of SPTs is facilitated by reading the trees in two directions, which makes it possible to extract SAR rules, if available, in a consistent manner. The simplicity and interpretability of the data structure and the ease of calculation are characteristic features of this approach. We apply the methodology to high-throughput screening and lead optimization data sets, compare the approach to standard clustering techniques, illustrate how SAR rules are derived, and provide some practical guidance how to best utilize the methodology. The SPT program is made freely available to the scientific community.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700