刺激下拟南芥基因相关网络构建与分析
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
研究拟南芥受刺激后基因组表达的改变,对于理解植物在受到外界环境刺激后基因组应对刺激作出反应的生物学机理具有重要意义。本文利用逆向网络建模方法,分别建立了拟南芥的正常对照组和刺激下的实验组的抽象基因和开花相关基因互信息相关网络,并对网络的结构性质进行了分析和研究,具体内容如下:
     对于抽象基因互信息网络,运用复杂网络统计量的分析方法进行比对,发现实验组与对照组的网络结构具有明显差异,并给出可以有效区分这两类网络结构差异的参数:平均度、聚类系数、模块度、非孤立点所占比例等。提出了多维参数空间中显著性分类的方法,并从网络结构的整体角度证实了上述差异性的存在。
     对于开花时间相关基因网络,通过比对实验组和对照组网络结构,发现可以区分对照组与实验组网络结构的特征参数:平均度、平均核数。给出从网络结构角度发掘对开花有重要影响的“结构性关键基因”的有效方法。得到对度和核数库模式贡献最大的结构性关键基因的个数分别为8个和11个。发现基于度库模式得到的8个基因都在基于核数库模式得到的11个基因中。结构决定功能的思想启示,与库模式相同的基因可能对开花起促进作用,与库模式相反的基因可能起抑制作用。TAIR数据库中资料表明11个基因中8个符合上述规律,即该模型预测的有效率为72.73%。因此,该模型可以预测基因对开花的促进或抑制功能。研究中发现规律:当基因在长光照实验中的核数和度相对短光照情况下较高时,基因可能对开花起促进作用;当基因在长光照实验中的核数和度相对正常条件下较低时,基因可能对开花起抑制作用。TAIR数据库中资料表明8个基因中5个符合上述规律,即该模型预测的有效率为62.5%。所以,该模型也可以预测基因对开花的促进或抑制功能。
     上述网络建模与模型分析的方法用于其他植物刺激下的数据表达谱数据时,对于理解基因组应对刺激的机理研究应该具有普遍的意义。
It is important to understand the physiological mechanism of genome response to external stimulus by studying the change of Arabidopsis genome expression under stimulus. In this dissertation, by the method of reverse network modeling, the mutual information networks of normal (control group) and stimulus (experimental groups) of Arabidopsis abstract genes and flowering-related genes are constructed respectively. Then analyze and study the structural properties of the networks. The details are as follows:
     For the mutual information networks of Arabidopsis abstract genes, through statistical analysis and comparison, the significant difference is found between the network structures of control group and experimental groups, and statistics are given which can distinguish the two types of networks:average degree, clustering coefficient, modularity, the proportion of non-isolated node. What's more, the method to classify the training sets of normal and each stimuli in multi-parameter space is proposed. The existence of differences is confirmed from the node of view of the whole network structure.
     For mutual information network of Arabidopsis flowering-related genes, through statistical analysis and comparison, the significant difference of the network structures between control group and experimental groups is found, and the statistics that can distinguish the two types of networks are given:average degree, average coreness. Then, the method to mine "the structural key genes" of the flowering is given. Using the method, the patterns of coreness and degree "the structural key genes" are obtained which contribution the two types of patterns the largest. The numbers of key genes are 8 and 11 respectively. Furthermore, the 8 genes are fully included in the 11 genes. That structure determines function inspirits the gene whose pattern is the same to the pattern of database may play positive role on flowering and the gene whose pattern is contrary to the pattern of database may play negative role on flowering. The information from TAIR database indicates that there are 8 genes out of 11 genes satisfying the law. That is, the efficiency to predict of the method is 72.73%. Therefore, the method can predict the functions of the remaining genes. In the research, the law is found:when the coreness and degree of gene in the long day experiment is higher than normal condition, the genes play a positive role on flowering; conversely, when the coreness and degree of gene in the long day experiment is lower than normal condition, the gene inhibits the flowering. The information from TAIR database indicates that there are 5 genes out of 8 genes satisfying the law. That is, the efficiency to predict of the method is 62.5%. Thus, the method can predict the functions of genes, too.
     When the methods of network modeling and model analysis are used for analyzing other plants' expression profile data under stimulus, it is universal significant to understand the mechanism of genome under external stimulus.
引文
1.陈琳.植物遗传实验的好材料—拟南芥简介[J],教学仪器与实验,2002,18(11):30.
    2.陈璋.拟南芥—植物分子生物学研究的模式物种[J],植物学通报,1994,11(1):6-11.
    3. Kazuo Shinozaki. Gene networks involved in drought stress response and tolerance[J], Experimental Botany,2007,58(2):221-227.
    4. Hee-Yeon Park, Hye-Yeon Seok, Bo-Kyung Park, Sun-Ho Kim, Chang-Hyo Goh, Byeong-ha Lee, Choon-Hwan Lee, Yong-Hwan Moon. Overexpression of Arabidopsis ZEP enhances tolerance to osmotic stress[J], Biochemical and Biophysical Research Communications,2008,375:80-85.
    5. Zhu J K. Plant salt tolerance[J], Trends Plant Sci,2001,2:66-71.
    6.刘康.野生型拟南芥及其sos2盐敏感突变体基因表达的差异以及At2g29080基因功能的研究[J],植物生理学,2002,10(4):312-316.
    7. Zhu Jiankang. Regulation of ion homeostasis under stress[J], Current Opinion in Plant Biology,2003,6:441-445.
    8. Hyun Ju Jung, Hunseung Kang. Expression and functional analyses of microRNA417 in Arabidopsis thaliana under stress conditions[J], Science Direct,2007,45:805-811.
    9.秦玉芝,李旭.钙传感蛋白互作激酶CIPK14参与拟南芥盐和ABA胁迫应答调节[J],中国科学,2008,38(5):446-457.
    10.张素芝,左建儒.拟南芥开花时间调控的研究进展[J],生物化学与生物物理进展,2006,33(4):301-309.
    11. Putterill J, Robson F, Lee K, Simon R, Coupland G. The constant genes of Arabidopsis promotes flowering and encodes a protein showing similarities to zinc finger transcription factors[J], Cell,1995,180(6):847-857.
    12. Suarez-Lopez P, Wheatley K, Robson F, Onouchi H, Valverde F, Coupland G. CONSTANS mediates between the circadian clock and the control of flowering in Arabidopsis[J], Nature,2001,410(6832):1116-1120.
    13. Yoo SK, Chung KS, Kim J, Lee JH, Hong SM, Yoo SJ, Yoo SY, Lee JS, Ahn JH. Constans activates suppressor of overexpression of constants 1 through flowering locus T to promote flowering in Arabidopsis[J], Plant Physiol,2005,139(2):770-778.
    14. Imaizumi T, Tran HG, Swartz TE, Briggs WR, Kay SA. FKF1 is essential for photoperiodic-specific light signaling in Arabidopsis[J], Nature,2003,426(6964): 302-306.
    15. Abe M, Kobayashi Y, Yamamoto S, Daimon Y, Yamaguchi A, Ikeda Y, Ichinoki H, Notaguchi M, Goto K, Araki T. FD. A bZIP protein mediating signals from the floral pathway integrator FT at the shoot apex[J], Science,2005,309(5737):1052-1056.
    16. Wigge PA, Kim MC, Jaeger KE, Busch W, Schmid M, Lohmann JU, Weigel D. Integration of spatial and temporal information during floral induction in Arabidopsis[J], Science, 2005,309 (5737):1056-1059.
    17. Mizoguchi T, Wright L, Fujiwara S, Cremer F, Lee K, Onouchi H, Mouradov A, Fowler S, Kamada H, Putterill J, Coupland G. Distinct roles of GIGANTEA in promoting flowering and regulating circadian rhythms in Arabidopsis[J], Plant Cell,2005,17(8):2255-2270.
    18. Imaizumi T, Schultz TF, Harmon FG, Ho LA, Kay SA. FKF1 F-box protein mediates cyclic degradation of a repressor of CONSTANS in Arabidopsis[J], Science,2005, 309(5732):293-297.
    19. Ahn JH, Miller D, W inter VJ, Banfield M J, Lee JH, Yoo SY, Henz SR, Brady RL, Weigel D. A divergent external loop confers antagonistic activity on floral regulators FT and TFL1. EMBO[J],2006,25(3):605-614.
    20. Paolo Pesaresi. Arabidopsis STN7 Kinase Provides a Link between Short-and Long-Term Photosynthetic Acclimation[J]. The Plant Cell,2009,21:2402-2423.
    21. Rogers HJ. Cell death and organ development in plants[J], Curr Top Dev Biol.,2005,71: 225-261.
    22.王东平.玉米转录因子ABP4在转基因拟南芥抗病和发育中的功能分析[D],北京:中国农科院博士论文,2007.
    23. Adriano V. Werhli, Marco Grzegorczyk, Dirk Husmeier. Comparative evaluation of reverse engineering gene regulatory networks with relevance networks graphical gaussian modelsand bayesian networks[J], Bioinformatics,2006,22(20):2523-2531.
    24. Theodore J. Perkins, Johannes Jaeger, John Reinitz, Leon Glass. Reverse Engineering the Gap Gene Network of Drosophila melanogaster[J], Plos Comput Biol.,2006,2(5): 0417-0428.
    25. Javier Carrera, Guillermo Rodrigo, Alfonso Jaramillo Santiago F Elena. Reverse-engineering the Arabidopsis thaliana transcriptional network under changing environmental conditions [J], Genome Biology,2009,10(9):1-15.
    26.黄继风,杨晶晶,王冠,余庆波,杨仲南.拟南芥花药表达基因调控关系的预测[J], 中国科学,2008,53(17):2054-2059.
    27. Shudong Wang, Yan Chen, Qingyun Wang, Eryan Li, Yansen Su, Dazhi Meng. Analysis for stimuli to shoot genes of arabidopsis thaliana based on logical relationships[J], The third international symposium on optimization and systems biology 2009 ORSC & APORC, pp.435-447.
    28.冯端,冯少彤.溯源探幽:熵的世界[M],北京:科学出版社,2005,33-39.
    29. Cover T M., Thomas, J A. Elements of Information Theory[M], New York:John Wiley, 1991.
    30.黄成玉,阮晓钢,李建更.基于熵互信息理论的基因调控网络的研究[J],华北科技学院学报,2008,5(1):82-84.
    31.郭衍游,侯一筠,刘战庆.互信息在海洋资料相关分析中的应用[J],海洋科学,2004,28(10):29-33.
    32.曾翎,刘斐,乔辉.基于互信息的功能磁共振图像配准[J],电子科技大学学报,2008,37(1):138-140.
    33.郭雷,许晓鸣.复杂网络[M],上海:上海科技教育出版社,2006,1-9.
    34. Watts D.J., Strogatz S.H.. Collective dynamics of small-world networks[J], Nature,1998, 393:440-442.
    35. Newman M.E.J., Watts D.J.. Renormalization group analysis of the small-world network model[J], Phys. Lette. A,1999,263:341-346.
    36. Albert R., Jeong H., AL Barabasi. Diameter of the world-wide web[J], Nature,1999,401: 130-131.
    37. AL Barabasi, Albert R., Jeong H.. Mean-field theory for scale-free random networks[J], Physica A,1999,272:173-187.
    38. Albert R., Jeong H., AL Barabasi. Statistical mechanics of complex networks[J], Rev. Mod. Phys.,2002,74:47-97.
    39. T Zhou, BH Wang, PL Zhou, CX Yang, J Liu-Physical Review E. Self-organized Boolean game on networks[J], Phys Rev E Stat Nonlin Soft Matter Phys.,2005,72:1539-3755.
    40.汪小帆,李翔,陈关荣.复杂网络理论及其应用[M],北京:清华大学出版社,2006.
    41. L. Diambra, L. da, F. Costa. Complex networks approach to gene expression driven phenotype imaging[J], Bioinformatics,2005,21(20):3846-3851.
    42. S.H. Yook, H. Jeong and A.-L. Barabasi. Weighted evolving networks[J], Phys. Rev. Lett., 2001,86(1):5835-5838.
    43.韩著.数据挖掘—概念与技术[M],北京:高等教育出版社,2006,219-222,261-269.
    44.晏庆华.支持向量机算法综述[J],计算机科学,2008,35(4):345-347.