摘要
为了更加便捷地分析基因数据和序列,抓取NCBI的细胞器基因组相关数据做进一步研究。本文采用基于Python的网络爬虫,实现了细胞器基因组数据的信息抓取和相关基因信息的数据分析,提供了分类提取,封装导入数据库的功能。
引文
[1]Kerfeld Cheryl A, Sawaya Michael R,Tanaka Shiho, Nguyen Chau V,Phillips Mart in, Beeby Morgan, Yeates Todd 0. Protein structures forming the shell of primitive bacterial organelles[J]. Science(New York,N.Y.),2005,309(5736):936-938.
[2] Ingman M, Kaessmann H, Paabo S. Gyllensten U. Mitochondrial genome variation and the origin of modern humans[J].Nature, 2000, 408(6813):708-71 3.
[3] Van O M, Kayser M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation.[J]. Human Mutation, 2009, 30(2):386-394.
[4]邢少辰.CLARKE,CLARKE JIHONG LIU.叶绿体基因组研究进展[J].生物化学与生物物理进展,2008, 35(01):21-28.
[5]Boo re Jeffrey L. Requirements and standards for organelle genome databases[J].Omics:a journal of integrative biology, 2006, 10(2):119-126.
[6]O'Brien E A,ZHANG Y,WANG E,et al.GOBASE:an organelle genomedatabase[J]. Nucleic Acids Research, 2009, 37(Database issue):946-950.
[7]Mizrachi I.GenBank:The Nucleotide Sequence Database[M]. National Center for Biotechnology Information(US), 2003. in life science[J]. Nucleic acids research,2002, 30(1):27-30.
[8]Lobo.Basic Local Alignment Search Tool(BLAST)[J].Journal of Molecular Biology,2008,215(3):403-410.