Algorithms,Data Structures,and Statistical Analyses for Improving Chemical Database Searches.
详细信息   
  • 作者:Nasr ; Ramzi J.
  • 学历:Doctor
  • 年:2011
  • 导师:Baldi, Pierre,eadvisorHirschberg, Danielecommittee memberHertel, Klemensecommittee member
  • 毕业院校:University of California
  • Department:Computer Science - Ph.D
  • ISBN:9781124951089
  • CBH:3477971
  • Country:USA
  • 语种:English
  • FileSize:16239617
  • Pages:222
文摘
The objective of many cheminformatics applications is to effectively search large databases of molecules. For example, in ligand-based screening, the searches can identify lead molecules that are most likely to bind to a drug target, typically a protein receptor or enzyme, at the earliest stage of drug discovery. Identification of lead molecules in those early stages may decrease the cost of drug discovery and speed up the overall development time. The ongoing expansion of databases of molecules, as new compounds are discovered or synthesized, and the potential exploration of the large chemical space of virtual compounds, creates a need for improving search quality and efficiency. The work presented here addresses the need in three ways: (1) computing the statistical significance of similarity scores efficiently to give them biological relevance; (2) developing algorithms and data structures for speeding up the searches of very large databases by many folds; and (3) formulating and validating new similarity metrics to find biologically relevant molecules given a set of multiple query molecules.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700