群智能优化算法及其在PPI网络中的应用研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着生物信息学的不断发展,人们发现蛋白质并不是独立地在细胞中发挥作用的,它们通常是通过与其它蛋白质之间的相互作用,形成一个功能整体才能发挥相应的功能。在一个生命体内所有蛋白质的相互作用,被称为蛋白质相互作用网络(PPI网络)。通过对PPI网络的研究,能了解生命个体的生理代谢原理、循环作用,也能对病理进入深层次的分析,揭示生命的奥义,使人类对生物界的认识迈入一个新的台阶。群智能优化算法的理论基础在于仿生学,在该类算法思想被提出至今,已经被各界学者们进行了深入的研究和广泛的应用,其已在解决TSP问题、数据挖掘、图象处理、函数优化等领域形成了成熟的应用。现阶段在不断完善现有算法的同时,也不断涌现出新的群智能优化算法。因此,群智能优化算法是智能计算领域的一大研究热点。
     本文首先对群智能算法进行了概述并对主流的群智能优化算法给出了相关的描述。在此基础上指出现有群智能算法的优点和不足及需改进的地方。然后对国内外PPI网络以及群智能算法的研究现状进行了分析,对PPI网络进行了简要的介绍,并对PPI网络在计算分析领域的主要分析方法——聚类算法,进行了简要的介绍。
     其次,针对PPI网络中的无尺度、小世界等特性,提出目前在PPI数据聚类中效果较好的功能流聚类算法中的不足之处。对PPI数据中的桥结点提出一种识别和处理的方法,将特殊结点进行单独处理,以提高功能流算法的精准度。并在此基础上,将群智能优化的思想引入到聚类算法当中,对其中的一些需要根据经验人为设定的参数进行自动寻优,以获得最佳适应值,使聚类算法的结果最优化并具有稳定性。改进后的算法命名为IQ-Flow算法。针对MIPS的PPI数据集,采用IQ-Flow算法进行仿真实验,结果表明改进后的算法在算法精度和算法的稳定性上都有了一定程度的提高。
     在群智能优化算法方面,对人工蜂群算法的不足进行了改进研究。在综合其它群智能算法的改进方法的基础上,引入了惯性权重、收缩因子以及随机扰动因子的概念,对人工蜂群算法中的更新公式进行改进,提出了一种改进的人工蜂群算法(IABC算法)。通过测试函数及仿真实验结果表明,IABC算法比标准人工蜂群算法及在实验中参与对比的其它几种群钾能优化算法,在精度和收敛速度上都具有较明显的优势。针对MIPS的PPI数据集,采用IABC算法进行阈值自动寻优,克服了人工给定阈值的主观性,并且蛋白质识别的准确率有一定程度的提高。
     另外,根据PPI网络的拓扑结构特性提出了一种基于连接强度的PPI网络聚类蚁群算法(Joint Strength Based Ant Colony Optimization Algorithm, JSACO),算法根据PPI网络数据的特性,引入了连接强度的概念对蚁群聚类算法中的拾起/放下规则加以改进,以降低对PPI网络数据聚类的时间开销以及提高结果的正确率。在仿真实验中使用了MIPS的PPI数据进行实验,将改进的蚁群算法与PPI标准结果库及PPI网络数据聚类中的其它算法进行比较,结果表明改进的蚁群算法时间开销降低且准确率有所提高。
     最后,文章对全文进行了总结并指出今后研究工作需要继续改进的地方和工作的重心。
As the development of bioinformatics, it is found that proteins don't work separately. They usually interact with others and gather as an entirety to function. All the interactions in a life together is called protein protein interaction network (PPI network). The research of PPI network could help know the biological metabolism and circulatory system, deeply analyze disease, and enlighten the meaning of life. This would lead to a new stage of our knowledge of living life. Swarm intelligent algorithms are based on bionics. They have been profoundly studied, and matured applications of swarm intelligence are widely seen in solving TSP problem, data mining, image process, function optimization etc. The existing algorithms are being improved and the new algorithms are being proposed. As a result, swarm intelligent algorithms are one of the headlines of computational intelligence.
     This paper firstly introduced the main concepts of swarm intelligence and gave some descriptions of several typical algorithms. In addition, it gave the advantages and disadvantages of these algorithms and pointed out where to be improved. This paper also did researches on bioinformatics and gave a brief introduction on PPI network. In the mean time, as main computation analyzing methods, clustering algorithms were described.
     According to the scale free and small world characteristics of PPI network, this paper analyzed the insufficiencies of the functional flow algorithm which performs comparatively well. The thought of recognizing and dealing with bridging nodes were proposed to improve the performance of the algorithm. At the basis of this, swarm intelligence was introduced into the functional flow algorithm to automatically optimize some parameter which is manually set by experience. This would help raise the stability of the results and the improved algorithm was named as IQ-Flow algorithm. MIPS PPI data was used to test the IQ-Flow algorithm, the simulation results showed that the precision and stability were raised.
     In swarm intelligence, this paper studied artificial bee colony (ABC) algorithm. On synthesizing kinds of improve strategies of other swarm intelligent algorithms, weight factor, contracted factor and random disturbance were introduced to improve the ABC algorithm. The improved algorithm was called IABC algorithm. The simulation results showed that IABC algorithm performs better than the original ABC and other participated swarm intelligent algorithms both in speed and precision. IABC was also used to automatically search the threshold in order to overcome the subjectivities and the precision was raised.
     In addition, an ACO algorithm based on joint strength (Joint Strength Based Ant Colony Optimization Algorithm, JSACO) was proposes. According to the characters of PPI data, joint strength was introduced to modify the pickup/drop rules in order to reduce the time consuming and raise the precision of the algorithm. In the simulation of this paper, MIPS PPI data were used to test the JSACO algorithm. Also the simulation result was compared with other PPI clustering methods. The simulation results show that the JSACO algorithm costs less in time consume and performs better in precision.
     Finally, this paper gave a conclusion and made a prospect of the upcoming work.
引文
[1]刘学礼.试论生物学发展的内在动力[J].医学与社会.第17卷第4期.2008:19-21
    [2]张春霆.生物信息学的现状与发展.世界科技研究与发展[J].第22卷第6期.2000:17-20
    [3]白胜欣.迈向21世纪的生物信息学.中华医学图书馆杂志[J].第10卷第5期.2001:1,48
    [4]郝柏林,张淑誉.生物信息学手册第2版[M].上海科学技术出版社.2002:17-32
    [5]孙晶晶.粒子群算法的改进及其应用研究[D].西安:陕西师范大学.2010
    [6]王大将,蔡瑞英,徐新伟.群智能优化算法研究[J].电脑知识与技术.第6卷第21期.2010:5845-5846
    [7]胡琼琼.群智能优化算法在QoS网络路由优化中的应用[D].西安:陕西师范大学.2010
    [8]Qianzhi Ma, Xiujuan Lei. Application of Artificial Fish School Algorithm in UCAV Path Planning [C]. Proc 2010 IEEE Fifth International conference on Bio-Inspired Computing:Theories and Applications.2010:555-559
    [9]魏剑简,白振兴.生物智能算法在机器人路径规划中的应用研究[J].计算机仿真.第26卷第10期.2009:182-185,206
    [10]付阿利,雷秀娟.基于改进PSO算法的最大熵阈值图象分割[J].计算机工程与应用.2008,44(29):174-176,187
    [11]张学习,杨宜民.混合智能算法在彩色图像分割中的应用研究[J].计算机工程与设计.第29卷第16期.2008:4260-4262
    [12]关薇,王建,贺福初.大规模蛋白质相互作用研究进展[J].生命科学.第18卷第5期.2006:507-512
    [13]U. Guldener et al. CYGD:the comprehensive yeast genome database [J].Nucleic Acids Research,2005,33:D364-D368.
    [14]Berman HM, Westbrook J, Feng ZK, et al. The protein data bank [J]. Nucleic Acids Research 2000,28(1):235-242
    [15]Schwikowski B, Uetz P, Fields S. A network of protein-protein interactions in yeast [J]. Nature Biotechnology,2000,18(12):1257-1261
    [16]Vazquez A, Flammini A, Maritan A, et al. Global protein function prediction from protein-protein interaction networks [J]. Nature Biotechnology,2003,21(6): 697-700
    [17]Xiaohong Zhou, Ming-Chih J Kao, Wing Hung-wong. Transitive functional annotation by shortest-path analysis of gene expression data [C]. In:Proc National Academy of Science, USA,2002,99(20):12783-12788
    [18]Young-Rae Cho, Woochang Hwang, Murali Ramanathan, Aidong Zhang. Semantic integration to identify overlapping functional modules in protein interaction networks [J]. BMC Bioinformatics,2007,8(265):1-13.
    [19]WU Yue-wen, YAN Hua-yun. Unifying model for Sierpinski networks with scale-free small world properties [J]. Computer Engineering and Applications, 2009,45(1):99-102
    [20]Mering C von, Krause R, Snel B, et al. Comparative assessment of large-scale data sets of protein-protein interactions [J]. Nature,2002,417:399-403
    [21]Dorigo M, Gambardella L M. Ant colony system:a cooperative learning approach to the traveling salesman problem [J]. IEEE Transactions on Evolutionary Computation.1997, 1(1):53-66.
    [22]段海滨.蚁群算法原理及其应用[M].科学出版社.2005.12
    [23]Kennedy J, Eberhart RC. Particle swarm optimization [C]. In:Proc. IEEE International Conference:Neural Networks. Piscataway, NJ:IEEE Service Center, 1995.1942-1948.
    [24]Qun Zhang, Xiujuan Lei, Xu Huang, Aidong Zhang. An Improved Projection Pursuit Clustering Model and its Application based on Quantum-behaved PSO [C]. 2010 Sixth International Conference on Natural Computation (ICNC'10),2010, Vol.5:2581-2585.
    [25]张敏慧.改进的粒子群计算智能算法及其多目标优化的应用研究[D].浙江大学.2005
    [26]D Karaboga, B Basturk. A powerful and Efficient Algorithm for Numerical Function Optimization:Artificial Bee Colony (ABC) Algorithm [J]. Journal of Global Optimization, Nov.2007,39(3):459-171.
    [27]Xiujuan Lei, Xu Huang, Aidong Zhang. Improved Artificial Bee Colony Algorithm and Its Application in Data Clustering [C]. Proc 2010 IEEE Fifth International conference on Bio-Inspired Computing:Theories and Applications. 2010:514-521
    [28]李晓磊,邵之江,钱积新.一种基于动物自治体的寻优模式:鱼群算法[J].系统工程理论与实践.2002,22(11):32-38.
    [29]Aidong Zhang. Protein Interaction Networks [M]. Cambridge Press.2009
    [30]Bauer A, Kuster B. Affinity purification—mass spectrometry. Powerful tools for the characterization of protein complexes [J]. European Journal of Biochemistry.2003,270(4):570-578
    [31]刘喆.人类蛋白质相互作用数据库可靠性的衡量[D].2009
    [32]Legrain P, Wojcik J, Gauthier J M. Protein-protein interaction maps:A lead towards cellular functions [J]. Trends Genet.2001,17(6):346-352
    [33]Eisenberg D, Marcotte E M, Xenarios I, et al. Protein function in the post-genomic era [J]. Nature,2000,405(6788):823-826
    [34]彭利红.基于蛋白质相互作用网络的聚类和稀疏点检测算法研究[D].湖南大学.2008
    [35]孙景春,徐晋麟,李亦学,石铁流.大规模蛋白质相互作用数据的分析与应用[J].科学通报.Vol 50(19):2055-2060
    [36]Chen Y, Xu D. Computational analyses of high-throughput protein-protein interaction data [J]. Current Protein and Peptide Science.2003,4(3):159-181
    [37]Liu zhihui. Big-world Contrary to Reason and Small-world Phenomenon [J]. Chongqing Library and Information Science Research (2006)1:1-8
    [38]Milgram S. The small world problem [J]. Psychology Today.1967:60-67
    [39]刘昊,廖波,彭利红.基于蛋白质相互作用网络的聚类算法研究[J].计算机工程与应用.2008,Vol 40(33):142-144,245
    [40]Jiawei Han, Micheline Kamber. Data Mining:Concepts and Techniques [M]. Morgan Kaufmann Press.2001:231-232
    [41]蔡晓妍,戴冠中,杨黎斌.谱聚类算法综述[J].计算机科学.2008 Vol 35(7):14-18
    [42]Luxburg von U. A tutorial on spectral clustering [J]. Statistics and Computing. 2007,17 (4):395-416.
    [43]NGA, Jordan M, Weiss Y. On spectral clustering:analysis and an algorithm [C]. Advances in Neural Information Processing Systems (NIPS). Cambridge. MA: MIT Press,2002.
    [44]Elena Nabieva, Kam Jim, Amit Agarwal, et al. Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps. Bioinformatics, 2005,21(1):i302-i310
    [45]Koza J R. Genetic Programming:On the Programming of Computers by Means of Natural Selection [M]. Cambridge:MIT Press.1992
    [46]Hackwood S, Beni G. Self-organization of Sensors for Swarm Intelligence [C]. In: IEEE International conference on Robotics and Automation. Piscataway, NJ: IEEE Press,1992:819-829
    [47]Reynolds CW. Flocks, Herds and Schools:A Distributed Behavioral Model [J]. Computer Graphics,1987,21(4):25-34
    [48]Colorni A, Dorigo M, Maniezzo V. Distributed Optimization by Ant Colonies [C]. In:The First European conference on Artificial Life. France:Elsevier,1991: 134-142
    [49]Kennedy J, Eberhart R.C. Swarm Intelligence [M]. San Francisco:Morgan Kaufmann division of Academic Press,2001
    [50]潘正军,康立山,陈毓屏.演化计算[M].北京:清华大学出版社.1998
    [51]张青,康立山,李大农.群智能算法及其应用[J].黄冈师范学院学报.第28卷第6期.2008:44-48
    [52]Bonabeau E, Theraulaz G. Swarm smarts [M]. Scientific American 2000:72-79
    [53]任伟建,陈建玲,韩冬,王凤好.蚁群算法综述[C].2007中国控制与学术年会论文集.2007:357-362
    [54]Dorigo M, Gambardella LM. Ant Colonies for the traveling salesman Problem [J]. Biosystems,43,1997:73-81
    [55]夏天扬.蚁群算法在聚类分析中的应用研究[D].武汉理工大学.2010
    [56]XIA Xiao-yun, LI Yun-hao, WANG Feng. TSP Problem Solution Based on Ant Colony Optimization Algorithm [J]. Journal of Jiangxi university of science and technology. Aug.2009, Vol.30, No.4:30-39
    [57]Jamaludin Sallim, Rosni Abdullah, Ahamad Tajudin Khader. ACOPIN:An ACO Algorithm with TSP Approach for Clustering Proteins from Protein Interaction Network [C]. Second UKSIM European Symposium on Computer Modeling and Simulation. EMS'08,2008:203-208
    [58]Amita Lanjewar, Vaishali N Sahare, Nilesh Sahare. An Approach based on clustering method for Object Finding Mobile Robots using ACO [C].2010 Second International Conference on Machine Learning and Computing (ICMLC), 9-11 Feb.2010:161-165
    [59]张利彪.基于粒子群优化算法的研究[D].吉林大学.2004
    [60]J. Sun, B. Feng, W.B. Xu. Particle swarm optimization with particles having quantum behavior [C]. IEEE Proc. of Congress on Evolutionary Computation, 2004,326-331.
    [61]Clerc M. The Swarm and the Queen:Towards a Deterministic and Adaptive Particle Swarm Optimization [C]. Proc. of IEEE International Congress on Evolutionary Computation.2000:84-88
    [62]Higashi N, Iba H. Particle Swarm Optimization with Gaussian Mutation [C]. Piscataway, NJ:IEEE Press,2003:72-79
    [63]刘金洋,郭茂祖,邓超.基于雁群启示的粒子群优化算法[J].计算机科学.2006,33(11):166-168
    [64]贾亚军,丛爽.粒子群与模拟退火的混合算法求解旅行商问题[C].2010系统仿真技术及其应用学术会议论文集.2010:508-513
    [65]孙晶晶,雷秀娟.基于压缩速度范围PSO的图像自适应增强[J].计算机工程.第36卷第21期.2010:228-233
    [66]陈永刚.粒子群算法及其在函数优化和路径优化问题上的应用[D].吉林大学.2006
    [67]D. Karaboga. An Idea Based On Honey Bee Swarm For Numerical Optimization [R]. Technical Report-Tr06. Erciyes University, Engineering Faculty, Computer Engineering Department,2005
    [68]Tsai, P.-W., Pan, J.-S., Liao, B.-Y., Chu, S.-C. Enhanced artificial bee colony optimization [J]. International Journal of Innovative Computing, Information and Control, Dec.2009,5(12):5081-5092.
    [69]马苗,郭华磊,郭敏,贺姣,丁生荣.基于人工蜂群算法与FFCM聚类的图像分割[C]Proceedings of 2010 International Conference on Circuit and Signal Processing 2010 Second IITA International Joint Conference on Artificial Intelligence (Volume 2).2010:504-507
    [70]Xiujuan Lei, Jingjing Sun, Xiaojun Xu, Ling Guo. Artificial Bee Colony Algorithm for Solving Multiple Sequence Alignment [C]. Proc 2010 IEEE Fifth International conference on Bio-Inspired Computing:Theories and Applications. 2010:337-342
    [71]Soon-Hyung Yook, Zoltan N. Oltvai, Albert-Laszlo Barabasi. Functional and topological characterization of protein interaction networks [J]. Proteomics,2004, 4:928-942.
    [72]Dan Melamed, Ryan Green, Joseph P Turian. Precision and Recall of Machine Translation [C]. Proceedings of HLT-NAACL,2003
    [73]Eberhart RC and Shi Y. Comparing inertia weights and constriction factors in Particle Swarm Optimization [C]. Proceedings of the Congress on Evolutionary Computing,2000:84-88.
    [74]CHEN Bing-rui FENG Xia-ting. Particle Swarm Optimization with Contracted Ranges of Both Search Space and Velocity [J]. Journal of Northeastern University (Natural Science),2005,26(5):488-491.
    [75]Lumer E, Faieta B. Diversity and adaptation in populations of clustering ants [C]. Proceeding of the 3rd international conference on simulation of adaptive behavior: from animal to animals,1994:499-508
    [76]李敏.蛋白质网络中复合物和功能模块挖掘算法研究[D].中南大学博士学位论文2008
    [77]Cui GU, Chen Y, Huang DS, etal. An algorithm for finding functional modules and protein complexes inprotein-protein interaction networks [J]. Journal of Biomedicine and Biotechnology.2008,3:1-10
    [78]李敏,王建新,陈建二.基于距离测定的蛋白质复合物识别算法[J].吉林大学学报.Vol 40(5)2010:1318-1323

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700