New Approaches to Improving Organisms Detection and Gene Prediction in Metagenomes.
详细信息   
  • 作者:Yok ; Non G.
  • 学历:Ph.D.
  • 年:2011
  • 导师:Rosen, Gail,eadvisor
  • 毕业院校:Drexel University
  • ISBN:9781124715728
  • CBH:3459570
  • Country:USA
  • 语种:English
  • FileSize:4995873
  • Pages:129
文摘
This thesis investigates microorganism detection and gene prediction on the DNA level from environmental samples such as air, water, soil, and the human body. In applications such as homeland security, it is important to detect pathogens and their expressed genes in the air or ground. Chemical solutions have difficulty detecting trace amounts and are costly. Technologies such as DNA microarrays and next-generation sequencing are becoming cost-efficient, thus it is of interest to use these technologies to discover organisms and genes present in a sample, especially when in “fine” amounts. The thesis will impact organism and gene discovery, which will aid pharmaceutical and bio-fuel discovery, as well as new methods in forensics and homeland security. To enable microarrays to detect more organisms utilizing fewer probes, we discuss the advantage of using specially-designed compressive sensing microarrays GSM), which is a DNA-based sensor array that operates using group testing and compressive sensing CS) principles. We improve compressive sensing DNA microarrays by introducing probe picking algorithm that iteratively searches for the most optimal probe. For predicting reads from next-generation sequencing of DNA, we introduce a new hybrid algorithm, Homology-Abinitio, which combines ab-initio annotation, using improved models, with sequence similarity comparisons. After benchmarking well-known gene prediction programs for metagenomic reads—GeneMark, MetaGeneAnnotator MGA) and Orphelia—we then combined them to gain better accuracy f-measure), especially for short reads. In addition, we show how using metatranscriptomic data can significantly improve gene prediction specificity. Each previous algorithm and Homology-Abinitio is tested with the most diverse test set to-date, including genes from 96 species.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700