基因表达式编程理论及其监督机器学习模型研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
基因表达式编程是进化算法的新成员,虽然得到了广泛而深入的应用研究,但至今尚未有系统和完善的理论研究成果,还无法揭示其运行的数学机理,对于一些关键技术的改进也就缺少了理论支撑,这对基因表达式编程和进化算法的研究和发展都是非常不利的。此外,目前经典的机器学习方法在解决分类和复杂函数关系发现问题时所构建的监督机器学习模型,在可理解性、精度以及泛化能力上,还存在诸多问题。
     有鉴于此,本文对基因表达式编程理论和关键技术进行了系统的研究。首先,通过分析基因表达式编程的运行机理,利用一系列的数学推理,构建了基因表达式编程的模式理论,给出了基因表达式编程的基因块假设,分析了基因表达式编程的模式收敛性,其次,基于模式定理成立的必要条件,提出了自适应遗传算子的设计方案。最后,在理论研究的基础上,对基本基因表达式编程中的染色体生成、个体解码求值等关键技术提出了改进方法,进行了理论和实验分析后给出了改进的基因表达式编程算法。
     在监督机器学习模型研究方面,为了提高学习模型的噪声数据处理能力和泛化能力本文基于改进的基因表达式编程算法构建了监督机器学习模型,对算法的适应值函数和算法提前终止条件进行了重点研究,并通过两类分类、多维分类、复杂函数关系发现问题等实验对此监督机器学习模型的有效性进行了验证。
     本文的主要研究成果包括:
     (1)证明了基因表达式编程有效的同时为更深入地研究基因表达式编程奠定了理论基础。本文通过研究和分析遗传算法、进化规划、进化编程、遗传编程、基因表达式编程各自的特点,对基因表达式编程进行了形式化定义,在此基础上,证明了个体编码的合法性,给出了基因模式和基因表达式编程模式的定义,通过一系列的数学推理过程,构建了基因表达式编程的模式理论,给出了基因表达式编程的模式定理和基因块假设,利用数学分析的方法揭示了基因表达式编程的运行机理,最后,给出了基因表达式编程的模式收敛性证明,指出了采用精英保留策略可以防止基因表达式编程中最优模式丢失的问题。
     (2)提出了自适应遗传算子方案。本文深入研究了优势模式在相关遗传算子作用下的存活概率,给出了基因表达式编程模式定理成立的必要条件,在满足模式定理成立的必要条件下,对遗传策略和遗传算子的参数设计进行了研究,指出了相关遗传算子的参数选择上限,并提出了自适应遗传算子设计方案,最后就自适应变异算子进行了相关实验研究,验证了自适应遗传算子设计方案的可行性;
     (3)提出了改进的基因表达式编程算法。本文在理论分析的基础上,结合实验研究,对基本基因表达式编程中的染色体生成、多样化种群、个体解码求值等关键技术提出了改进方法,给出了相异结构染色体生成、个体快速解码求值等算法,并进行了相关理论和实验分析,最后,基于这些改进思想给出了改进的基因表达式编程算法;
     (4)构建了有效的监督机器学习模型。本文基于改进的基因表达式编程算法构建了监督机器学习模型,用于解决分类和复杂函数关系发现问题。通过构造独特的适应值函数和交叉验证方法来获得改进的基因表达式编程算法的提前终止条件,提高了所构建的机器学习模型的噪声数据处理能力和泛化能力。Monk's problems、乳腺X光片微钙化点检测、wine recognition、复杂函数关系发现、定量构效关系建模等相关实验结果进一步验证了此机器学习模型的有效性。
Gene Expression Programming (GEP) is a new member of Evolutionary Algorithms, although it has been widely applied in many fields, there are not systematic and comprehensive theoretical research and mathematic analysis of the operation mechanism, and the improving of the key technologies lack of theoretical basis, which is very unfavorable to the research and development of gene expression programming and evolutionary algorithm. In addition, there are still many problems for the classical machine learning methods in solving classification and complex function finding problems in supervised machine learning field, such as difficult to understand, not very precise and lack of generalization.
     To bridge the gap, in the paper, we studied the theory of GEP thoroughly. By analyzing the operation mechanism of GEP, using a series of mathematical reasoning, we constructed a theory of GEP schema and presented a building block hypothesis. Through analysis of the convergence of GEP schema and further study of GEP schemas, we obtained the necessary conditions of schema theorem, which conducted to the design of adaptive genetic operators. Finally, some key technologies of GEP, such as the production of chromosome, the chromosome decoding and fitness evaluation had been studied, which will be the mainly part of the Revising Gene Expression Programming (RGEP).
     In the supervised machine learning, we applied the RGEP to construct the model of supervised machine learning, studied the fitness function and stopping criterion, finally, we provided the experimental results of two types of classification, multi-dimensional classification, and complex function finding which verified the validity of the RGEP model.
     The main contributions of the paper include:
     (1) The work proved that the GEP is effective as well as laid a theoretical foundation for further study of GEP. After analysis of genetic algorithms, evolution programming, evolution programming, genetic programming, gene expression programming, we presented a formal definition of GEP. The legitimacy of the individual coding was proposed and the gene schema and GEP schema were also provided. After a series of mathematical reasoning, we presented a GEP schema theorem and building block hypothesis, and proved that the convergence of GEP schema and proposed that elitist strategy of GEP can prevent the loss of the optimal schema.
     (2) Proposed an adaptive genetic operator algorithm. After studied the survival rate of the better individuals, we presented the requirement of schema theorem and the parameter limit of genetic operator, also, we proposed a adaptive genetic operator algorithm and conduct a experiment for adaptive mutation operator, which shown the algorithm is effective.
     (3) Proposed a revising gene expression programming. After a fundamental analysis of the GEP, we presented a different structure chromosome generation algorithm and individual quick decoding and evaluating algorithm to improve the population diversity and efficiency. Relevant analysis and experiment were made to verify the validity of the improving, later we proposed a revising gene expression programming based on these two algorithms.
     (4) Presented an efficient supervised machine learning model. We construct a supervised machine learning model for solving classification and function finding problems based on RGEP. The model used a unique fitness function and cross-validation method to obtain the conditions for early stopping criterion so as to improve the noise immunity and generalization ability. Finally, we verified the validity of the RGEP based model by experiments of monk's problems, detection of micro-calcification in mammogram, wine recognition, complex function finding and quantitative structure activity relationship modeling.
引文
[1]Turing A M. Computing machinery and intelligence. Computing Machinery and Intelligence[J]. Mind,1950,59:433-460.
    [2]McCarthy John, Minsky Marvin. Rochester Nathan, Shannon Claude. A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence. 1956. http://www-formal.stanford.edu/j mc/hi story/dartmouth/dartmouth.html.
    [3]A. L. Samuel. AI:Where it has been and where it is going. In Proceedings of the Eighth International Joint Conference on Artificial Intelligence, Morgan Kaufmann, Los Altos, CA,1983:1152-1157.
    [4]Mitchell, T. Machine Learning, McGraw Hill,1997.
    [5]ACM Computing Classification System:Artificial intelligence. ACM,1998.
    [6]Winston, Patrick Henry. Artificial Intelligence. Reading, Massachusetts: Addison-Wesley,1984.
    [7]Poole David, Mackworth Alan, Goebel Randy. Computational Intelligence:A Logical Approach. New York:Oxford University Press,1998.
    [8]Nilsson Nils. Artificial Intelligence:A New Synthesis. Morgan Kaufmann Publishers,1998.
    [9]Russell Stuart J., Norvig Peter. Artificial Intelligence:A Modern Approach (2nd ed.), Upper Saddle River, New Jersey:Prentice Hall,2003.
    [10]Luger George, Stubblefield William. Artificial Intelligence:Structures and Strategies for Complex Problem Solving (5th ed.). The Benjamin/Cummings Publishing Company,2004.
    [11]Sergios Theodoridis, Konstantinos Koutroumbas. Pattern Recognition,4th Edition, Academic Press,2009.
    [12]Toby Segaran. Programming Collective Intelligence, O'Reilly ISBN 0-596-52932-5.
    [13]Ray Solomonoff. An Inductive Inference Machine. A privately circulated report from the 1956 Dartmouth Summer Research Conference on AI.
    [14]Ray Solomonoff, An Inductive Inference Machine, IRE Convention Record, Section on Information Theory,1957, Part 2:56-62.
    [15]Ryszard S. Michalski, Jaime G. Carbonell, Tom M. Mitchell. Machine Learning: An Artificial Intelligence Approach, Tioga Publishing Company,1983.
    [16]Ryszard S. Michalski, Jaime G. Carbonell, Tom M. Mitchell. Machine Learning: An Artificial Intelligence Approach, Volume Ⅱ, Morgan Kaufmann,1986.
    [17]Yves Kodratoff, Ryszard S. Michalski. Machine Learning:An Artificial Intelligence Approach, Volume Ⅲ, Morgan Kaufmann,1990.
    [18]Ryszard S. Michalski, George Tecuci. Machine Learning:A Multistrategy Approach, Volume Ⅳ, Morgan Kaufmann,1994.
    [19]Bhagat P. M.. Pattern Recognition in Industry, Elsevier,2005.
    [20]Mierswa Ingo, Wurst Michael, Klinkenberg Ralf, Scholz Martin, Euler Timm. YALE:Rapid Prototyping for Complex Data Mining Tasks, in Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-06).
    [21]Ethem Alpaydin. Introduction to Machine Learning (Adaptive Computation and Machine Learning), MIT Press,2004.
    [22]Christopher M. Bishop. Pattern Recognition and Machine Learning, Springer, 2006.
    [23]MacKay D. J. C. Information Theory, Inference, and Learning Algorithms, Cambridge University Press,2003.
    [24]Pearl J. Bayesian Networks:A Model of Self-Activated Memory for Evidential Reasoning. Proceedings of the 7th Conference of the Cognitive Science Society, University of California, Irvine, CA,1985:329-334.
    [25]Wasserman P.D. Neural computing theory and practice. Van Nostrand Reinhold, 1989.
    [26]Hastie T., Tibshirani R., Friedman J. H. The elements of statistical learning Data mining, inference, and prediction. New York:Springer Verlag,2001.
    [27]C. Cortes, V. Vapnik. Support vector networks. Machine Learning,1995,20(3): 273-297.
    [28]Ingo Steinwart, Andreas Christmann. Support Vector Machines. Springer-Verlag, New York,2008.
    [29]S. Kotsiantis, Supervised Machine Learning:A Review of Classification Techniques, Informatica Journal,31 (2007):249-268.
    [30]Richard A. Berk, Regression Analysis:A Constructive Critique, Sage Publications,2004.
    [31]Aldrich, John. Fisher and Regression. Statistical Science,2005,20 (4):401-417.
    [32]YangJing Long. Human age estimation by metric learning for regression problems. Proc. International Conference on Computer Analysis of Images and Patterns,74-82.
    [33]Corder G.W., Foreman D.I. Nonparametric Statistics for Non-Statisticians:A Step-by-Step Approach Wiley F. Sebastiani, Machine learning in automated text categorization, ACM Computing Surveys,2002,34(1).
    [34]D. Tax and R. Duin. Using two-class classifiers for multi-class classification.in International Conference on Pattern Recognition, (Quebec City, QC, Canada), August 2002.
    [35]Nilsson J., Sha F., Jordan M.I. Regression on manifolds using kernel dimension reduction. In IEEE Conf. ICML,2007:265-272.
    [36]A.Colorni, M. Dorigo, V. Maniezzo. Distributed Optimization by Ant Colonies, Proceedings of European Conference on Artificial Intelligence, Paris, France, Elsevier Publishing,1991:134-142.
    [37]M. Dorigo. Optimization, Learning and Natural Algorithms. PhD thesis, Politecnico di Milano, Italie,1992.
    [38]T. Khanna. Foundations of Neural Networks [M], Addison-Wesley,1990.
    [39]杨行峻,郑君里.人工神经网络[M].北京:高等教育出版社,1992.
    [40]阎平凡,张长水.人工神经网络与模拟进化计算[M].北京:清华大学出版社,2005.
    [41]李元香,康立山,陈毓屏.格子气自动机[M].北京:清华大学出版社,1994.
    [42]J.D. Farmer, N. Packard, A. Perelson. The immune system adaptation and machine learning. Physica D,1986(2):187-204.
    [43]de Castro, Leandro N, Timmis Jonathan. Artificial Immune Systems:A New Computational Intelligence Approach. Springer,2002:57-58.
    [44]V. Cutello, G. Nicosia, M. Pavone, J. Timmis. An Immune Algorithm for Protein Structure Prediction on Lattice Models, IEEE Transactions on Evolutionary Computation,2007,11(1):101-117.
    [45]刘勇,康立山,陈琉屏.非数值并行算法(第二册)——遗传算法[M].北京:科学出版社,1998.
    [46]Kenneth De Jong, Riccardo Poli, Jonathan Rowe. Foundations of Genetic Algorithms (Volume 7) [C], Morgan Kaufmann,2003.
    [47]Z. Michaelwicz. Genetic Algorithms+Data Structures=Evolution Programs[M]. Springer-Verlag, Berlin, Herdelberg, New York,1996.
    [48]T. Back, D. B. Fogel, Z. Michaelwicz. Handbook of Evolutionary Computation [M], Oxford University Press, New York,1997.
    [49]潘正君,康立山,陈毓屏.演化计算[M].北京:清华大学出版社,1998.
    [50]康立山,陈毓屏.演化计算.数值计算与计算机应用[M],1995,3:22-27.
    [51]康立山,谢云,尤矢勇,罗祖华.非线性并行算法(第一册)——模拟退火算法[M].北京:科学出版社,1994.
    [52]Timmis, J.; Neal, M.; Hunt, J.. An artificial immune system for data analysis. BioSystems,2000,55 (1):143-150.
    [53]W.N. Chen, J. ZHANG. Ant Colony Optimization Approach to Grid Workflow Scheduling Problem with Various QoS Requirements, IEEE Transactions on Systems, Man, and Cybernetics-Part C:Applications and Reviews,2009,31(1):29-43.
    [54]S. Kotsiantis, P. Pintelas. Selective Averaging of Regression Models. Annals of Mathematics, Computing & TeleInformatics,2005,1(3):66-75.
    [55]Maker, Meg Houston. AI @ 50:AI Past, Present, Future. Dartmouth College.2006.
    [56]Luger George, Stubblefield, William. Artificial Intelligence:Structures and Strategies for Complex Problem Solving (5th ed.). The Benjamin Cummings Publishing Company, Inc.2004.
    [57]John Johnston. The Allure of Machinic Life:Cybernetics, Artificial Life, and the New AI, MIT Press.2008.
    [58]Courtney Boyd Myers. The AI Report. Forbes June 2009.
    [59]A.E. Eiben and J.E. Smith, Introduction to Evolutionary Computing, Springer,2003.
    [60]K.A De Jong, Evolutionary computation:a unified approach. MIT Press, Cambridge MA,2006.
    [61]Fogel David B. Evolutionary Computation:Toward a New Philosophy of Machine Intelligence, IEEE Press, Piscataway, NJ. Third Edition,2006.
    [62]Ferreira C. Gene Expression Programming in Problem Solving.6th Online World Conference on Soft Computing in Industrial Applications[C],2001.
    [63]Holland John H. Adaptation in Natural and Artificial Systems, University of Michigan Press, Ann Arbor,1975.
    [64]J. R. Koza, Genetic Programming:On the Programming of Computers by Means of Natural Delection[M], MIT Press, Cambridge,1992.
    [65]Ferreira Candida. Gene Expression programming:mathematical modeling by an artificial intelligence. Springer-Verlag,2006.
    [66]左劫.基因表达式编程核心技术研究[D].四川大学博士学位论文,2004.
    [67]元昌安,唐常杰,左劫等.基于基因表达式编程的函数挖掘-收敛性分析与残差制导进化算法.四川大学学报(工程科学版),2004,36(6):100-105.
    [68]王悦,唐常杰,杨宁,陈瑜,徐开阔.基于基因表达式编程的进化模式定理.四川大学学报(工程科学版),2009,41(2):167-172.
    [69]Ingo Rechenberg. Evolutionsstrategie-Optimierung technischer Systeme nach Prinzipien der biologischen Evolution (PhD thesis),1971.
    [70]L. J. Fogel, A. J. Owens, M. J. Walsh. Artificial Intelligence through Simulated Evolution [M], John Wiley, New York,1966.
    [71]孙瑞祥.进化计算与智能诊断[D].西安交通大学博士学位论文,2000.
    [72]Fraser, Alex;Donald Burnell. Computer Models in Genetics. New York: McGraw-Hill,1970.
    [73]Goldberg D. E..Genetic algorithms in search, optimization & machine learning. Reading, MA:Addison-Wesley,1989.
    [74]Illinois Genetic Algorithms Lab (IlliGAL). http://www.illigal.uiuc.edu/web/
    [75]Goldberg D. E.. The Design of Innovation:Lessons from and for Competent GeneticAlgorithms. Kluwer Academic Publishers, Norwell, MA,2002.
    [76]Goldberg D. E., Deb K., Clark J. H.. Genetic algorithms, noise, and the sizing of populations. Complex Systems,1992,6:333-362.
    [77]Goldberg D. E., Voessner S.. Optimizing Global-Local Search Hybrids. In Proceedings of the Genetic and Evolutionary Computation Conference, Morgan Kaufmann,1999(1):220-228.
    [78]Goldberg D., Sastry K., Llora X.. Toward routine billion-variable optimization using genetic algorithms:Short Communication. Complexity,2007,12 (3):27-29.
    [79]Christopher R. Stephens, Henri Waelbroeck. Effective degrees of freedom in genetic algorithms and the block Hypothesis. Proceedings of the Seventh International Conference on Genetic Algorithms, ed. Morgan Kaufmann, San Mateo,1997.
    [80]Christopher R. Stephens, Henri Waelbroeck. Schemata Evolution and Building Blocks. Evolutionary Computation,1999,7(2):109-124.
    [81]李敏强,寇纪淞,林丹,李全书.遗传算法的基本理论与应用[M].北京:科学出版社,2002.
    [82]Whitley L. D.,Vose M. D. (Eds.).. Foundations of genetic algorithms 3. Morgan Kaufmann,1995.
    [83]Mitchell M. An introduction to genetic algorithms. Cambridge, MA:MIT Press, 1996.
    [84]Vose M D, Liepins G F. Punctuated equilibria in genetic search.complex Systems,5:31-44.
    [85]Holland J.H., Reitman J.H. Cognitive Systems Based in Adaptive Algorithms. In: Pattern-directed Inference Systems. Academic Press,1978.
    [86]De Jong. K.:Learning with genetic algorithms:An overview. Mach. Learn. (1988):121-138.
    [87]De Jong K.A., Spears W.M.. Learning concept classification rules using genetic algorithms. In:Proceedings of the International Joint Conference on Artificial Intelligence, Morgan Kaufmann,1991:651-656.
    [88]Goldberg D. E.. Computer-Aided Gas Pipeline Operation using Genetic Algorithms and Rule Learning. PhD thesis. The University of Michigan, Ann Arbor, MI,1983.
    [89]Wilson S. W. On the retino-cortical mapping. International Journal of Man-Machine Studies,1983,18:361-389.
    [90]Peter W. Frey, David J. Slate. Letter recognition using Holland-style adaptive classifiers. Machine learning,1991,6:161-182.
    [91]Wilson, S. W. ZCS:A zeroth level classifier system. Evolutionary Computation, 1994,2:1-18.
    [92]Wilson, S. W. Classifier fitness based on accuracy. Evolutionary Computation, 1995,3:149-175.
    [93]Wilson, S. W. Classifiers that approximate functions. Natural Computing,2002, 1:211-234.
    [94]Hai H. Dam, Hussien A. Abbass, Chris Lokan.BCS:Bayesian Learning Classifier System.ALAR Technical Report Series:TR-ALAR-200604005.The Artificial Life and Adaptive Robotics Laboratory. School of Information Technology and Electrical Engineering University of New South Wales Northcott Drive, Campbell, Canberra, ACT 2600, Australia.
    [95]Jorge Casillas, Brian Carse, Larry Bull. Fuzzy-XCS:A Michigan Genetic Fuzzy System. IEEE transactions on fuzzy systems,2007,15(4):536-550.
    [96]Bernado Mansilla E., Garrell Guiu J.M. Accuracy-based learning classifier systems:Models, analysis, and applications to classification tasks. Evolutionary Computation,2003,11:209-238.
    [97]Bernado Mansilla E.,Llora X., Garrell J.M. XCS and GALE:A comparative study of two learning classifier systems and six other learning algorithms on classification tasks. Advances in Learning Classifier Systems (LNAI 2321), Berlin Heidelberg:Springer-Verlag,2002:115-132,
    [98]Bacardit J., Goldberg D., Butz M., Llora X., Garrell J.M.:Speeding-up pittsburgh learning classifier systems:Modeling time and accuracy. In:Parallel problem solving from Nature-PPSN 2004, Springer-Verlag, LNCS 3242: 1021-1031.
    [99]Bacardit, J. Pittsburgh Genetics-Based Machine Learning in the Data Mining era: Representations, generalization, and run-time. PhD thesis, Ramon Llull University, Barcelona, Catalonia, Spain,2004.
    [100]Bacardit J., Butz M.V. Data mining in learning classifier systems:Comparing xcs with gassist. In:Advances at the frontier of Learning Classifier Systems. Springer-Verlag,2007:282-290.
    [101]Bacardit, J., Garrell, J.M. Bloat control and generalization pressure using the minimum description length principle for a pittsburgh approach learning classifier system. In:Proceedings of the 6th International Workshop on Learning Classifier Systems, LNAI, Springer-Verlag,2003.
    [102]Bacardit J., Goldberg D.E., Butz M.V. Improving the performance of a pittsburgh learning classifier system using a default rule. In:Learning Classifier Systems, Revised Selected Papers of the International Workshop on Learning Classifier Systems 2003-2005. Springer-Verlag, LNCS 4399.2007:291-307.
    [103]Bacardit J., Krasnogor N. Smart crossover operator with multiple parents for a pittsburgh learning classifier system. In:GECCO'06:Proceedings of the 8th annual conference on Genetic and evolutionary computation, New York, NY, USA, ACM Press,2006:1441-1448.
    [104]Bernado-Mansilla E., Llora X., Traus I. Multiobjective Learning Classifier Systems. In:Multi-Objective Machine Learning. Volume 16 of Studies in Computational Intelligence, Springer,2006:261-288.
    [105]Ghosh A., Nath B. Multi-objective rule mining using genetic algorithms. Information Sciences 163,2004:123-133.
    [106]Hurst J., Bull L. A neural learning classifier system with self-adaptive constructivism for mobile robot learning. Artificial Life 12,2006:1-28.
    [107]L. Bull M., Studley A.J.B., Whittley I. On the use of rule sharing in learning classifier system ensembles. In Proceedings of the 2005 Congress on Evolutionary Computation,2005.
    [108]H. P. Schwefel. Numerical Optimization of Computer Models [M], John Wiley, Chichester, UK,1981.
    [109]王小平,曹立明.遗传算法—理论、应用与软件实现[M].西安:西安交通大学出版社,2002.
    [110]Joint Conference on Neural Network'90[C]. Washington D. C.,I-601-I-605, 1990.
    [111]D. B. Fogel, L. J. Fogel, V. W. Porto. Evolving Neural Networks[J], Biol. Cybern.,63,1990:487-493.
    [112]N. Saravanan, D. B. Fogel. Evolving Neurocontrollers Using Evolutionary Programming, Proceedings of the 1st IEEE International Conference on Evolutionary Computation (ICEC'94) [C], Orlando, Florida, USA, IEEE Press, 1994:217-222.
    [113]John R. Koza. Genetic Programming Ⅱ:Automatic Discovery of Reusable Programs. MIT Press, Cambridge Massachusetts,1994.
    [114]John R. Koza, David Andre, Forrest H Bennett Ⅲ, Martin Keane. Genetic Programming 3:Darwinian Invention and Problem Solving. Morgan Kaufman, 1999.
    [115]John R. Koza, Martin A. Keane, Matthew J. Streeter, William Mydlowec, Jessen Yu, Guido Lanza. Genetic Programming Ⅳ:Routine Human-Competitive Machine Intelligence. Kluwer Academic Publishers,2003.
    [116]Lee Altenberg. The Schema Theorem and Price's Theorem. In Foundations of Genetic Algorithms 3,1994:23-49, Estes Park, Colorado, USA, Morgan Kaufmann.
    [117]O'Reilly U-M., Oppacher R.. Program search with a hierarchical variable lenth representation:Genetic programming, simulated annealing, and hill climbing. springer-verlag. Storer T W. Exercise in chronic pulmonary disease:resistance exercise prescription. Medicine and Science in Sports and Exercise,2001,33(7): 680-686.
    [118]Riccardo Poli, W. B. Langdon. A Review of Theoretical and Experimental Results on Schemata in Genetic Programming. In ET'97 Theory and Application of Evolutionary Computation, University College London, UK,1997:29-43.
    [119]Justinian P. Rosca. Analysis of Complexity Drift in Genetic Programming. In Genetic Programming 1997:Proceedings of the Second Annual Conference, Stanford University, CA, USA, Morgan Kaufmann,1997:286-294.
    [120]P. A. Whigham. A Schema Theorem for Context-Free Grammars. In 1995 IEEE Conference on Evolutionary Computation, Perth, Australia,1995(1):178-181.
    [121]Riccardo Poli. Exact Schema Theorem and Effective Fitness for GP with One-Point Crossover. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2000), Las Vegas, Nevada, USA, Morgan Kaufmann,2000:469-476.
    [122]R. Poli. Hyperschema Theory for GP with One-Point Crossover, Building Blocks, and Some New Results in GA Theory. In Genetic Programming, Proceedings of EuroGP'2000, Edinburgh, Springer-Verlag,2000,1802:163-180.
    [123]Nicholas Freitag McPhee, Riccardo Poli. A schema theory analysis of the evolution of size in genetic programming with linear representations. In Genetic Programming, Proceedings of EuroGP'2001, Lake Como, Italy, Springer-Verlag, 2001,2038:108-125.
    [124]Marcos I. Quintana, Riccardo Poli, Ela Claridge. On Two Approaches to Image Processing Algorithm Design for Binary Images using GP. In Applications of Evolutionary Computing, EvoWorkshops2003:EvoBIO, EvoCOP, EvoIASP, EvoMUSART, EvoROB, EvoSTIM, University of Essex, England, UK, Springer-Verlag,2003,2611:422-431.
    [125]Riccardo Poli. A Simple but Theoretically-motivated Method to Control Bloat in Genetic Programming. In Genetic Programming, Proceedings of EuroGP'2003, Essex, Springer-Verlag,2003,2610:204-217.
    [126]Riccardo Poli, Christopher R. Stephens. Constrained Molecular Dynamics as a Search and Optimization Tool. In Genetic Programming 7th European Conference, EuroGP 2004, Proceedings, Coimbra, Portugal, Springer-Verlag, 2004,3003:150-161.
    [127]T.E. Davis, J.C. Principe. A Markov chain framework for the simple genetic algorithm. Evol. Comput,1993:269-288.
    [128]Allen E. Nix1, Michael D. Vosel. Modeling genetic algorithms with Markov chains. Annals of Mathematics and Artificial Intelligence,2005,5(1):79-98.
    [129]Boris Mitavskiy, Jon Rowe. Some Results about the Markov Chains Associated to GPs and to General EAs. Theoretical Computer Science,2006,361(1): 72-110,
    [130]Riccardo Poli, Jonathan E. Rowe, Nicholas Freitag McPhee. Markov Chain Models for GP and Variable-length GAs with Homologous Crossover. In Proceedings of the Genetic and Evolutionary Computation Conference, San Francisco, California, USA, Morgan Kaufmann,2001:112-119.
    [131]Adam Prugel-Bennett, Jonathan L. Shapiro. Analysis of genetic algorithms using statistical mechanics. Phys. Rev. Lett.72:1305-1309.
    [132]Smith Jeff S.. Evolving a Better Solution, Developers Network Journal,2002.
    [133]Shu-Heng Chen.. Genetic Programming:An Emerging Engineering Tool,International Journal of Knowledge-based Intelligent Engineering System, 2008,12(1):1-2.
    [134]Nordin J.P.. Evolutionary Program Induction of Binary Machine Code and its Application. Krehl Verlag, Muenster, Germany,1997.
    [135]Korns Michael. Large Scale Time Constrained Symbolic Regression Classification in Genetic Programming Theory and Practice V. Springer, New York,2007.
    [136]Korns Michael. Symbolic Regression of Conditional Target Expressions, in Genetic Programming Theory and Practice Ⅶ. Springer, New York,2009.
    [137]Brameier M., Banzhaf W. Linear Genetic Programming, Springer, New York, 2007.
    [138]Xinye Cai, Stephen L. Smith, Andy M. Tyrrell. Positional Independence and Recombination in Cartesian Genetic Programming. Proceedings of the 9th European Conference on Genetic Programming, Budapest, Hungary, Springer, 2006,3905:351-360.
    [139]Steven Gustafson, Edmund K. Burke, Natalio Krasnogor. On Improving Genetic Programming for Symbolic Regression. Proceedings of the 2005 IEEE Congress on Evolutionary Computation, Edinburgh, UK, IEEE Press, volume 1,2005: 912-919.
    [140]Maarten Keijzer. Scaled Symbolic Regression. Genetic Programming and Evolvable Machines,2004,5(3):259-269.
    [141]T. L. Lew and A. B. Spencer, F. Scarpa,K. Worden, A. Rutherford, F. Hemez. Identification of response surface models using genetic programming. Mechanical Systems and Signal Processing,2006,20(8):1819-1831.
    [142]Liu B, HsuW, Ma Y. Integrating classification and associ2ation rule mining[C]. Proceedings of the KDD. New York,1998:80-86.
    [143]Carvalho D R, FreitasA A. A hybrid decision tree genetic algorithm for coping with the problem of small disjuncts in data mining [C]. Proceedings of Genetic and Evolutionary Computation Conference. Las Vegas,2000:1061-1068.
    [144]FidelisM V, Lop s H S, Freitas A A. Discovering comp rehensible ruleswith a genetic algorithm[C].Proceedings of the Congress on Evolutionary Computation, New York,2000:805-810.
    [145]ZUO J, TANG CJ, ZHANG TQ. Mining Predicate Association Rule by Gene Expression Programming[J]. Berling Heidelberg:Springer Verlag, Lecture Notes In Computer science,2002,2419:92-103.
    [146]FERREIRA C. Gene Expression Programming:A New Adaptive Algorithm for Solving Problems. Http://www.Gene-expression-programming.com/webpapers/ GEPfirst.pdf,2004.
    [147]FERREIRA C. Gene Expression Programming[M]. Portugal, Angrado Heroismo, 2002.
    [148]FERREIRA C.. Gene Expression Programming in Problem Solving.6th Online World Conference on Soft Computing in Industrial Applications[C],2001.
    [149]FERREIRA C. Analyzing the Founder Effect in Simulated Evolutionary Processes Using Gene Expression Programming. Soft Computing Systems: Design, Management and Applications[C]. IOS Press, Netherlands,2002: 153-162.
    [150]FERREIRA C. Linear and nonlinear genetic algorithms for solving problems such as optimization, function finding, planning and logic synthesis. USA Patent Application N09/899,282,2001.
    [151]Mehmet Saltan, Serdal Terzi. Comparative analysis of using artificial neural networks (ANN) and gene expression programming (GEP) in backcalculation of pavement layer thickness[A]. Indian Journal of Engineering & Materials Sciences,2005,12.
    [152]ZUO J, TANG CJ, LI C. Time Series Prediction based on Gene Expression Programming [A]. International Conference for Web Information Age 2004[C]. Lecture Notes In Computer science,2004.
    [153]FERREIRA C. Discovery of the Boolean Functions to the Best Density Classification Rules Using Gene Expression Programming [A]. Proceedings of the 4th European Conference on Genetic Programming[C]. Springer, Verlag,Berlin,Germany,2002,2278:51-60.
    [154]Adil Baykasoglu, Lale Ozbakir. MEPAR-miner:Multi-expression programming for classification rule mining[A]. European Journal of Operational Research, 2006.
    [155]Vassilios K. Karakasis, Andreas Stafylopatis. Data Mining based on Gene Expression Programming and Clonal Selection,2006 IEEE Congress on Evolutionary Computation Sheraton Vancouver Wall Centre Hotel, Vancouver, BC, Canada,2006:16-21.
    [156]Zhou C., Xiao W., Nelson P.C., Tiipak T.M.. Evolving Accurate and Compact Classification Rules with Gene Expression Programming, IEEE Transactions on Evolutionary Computation,2003,7(6):519-531.
    [157]Zhou C., Nelson P.C., Xiao W., Tirpak T.M. Discovery of Classification Rules by Using Gene Expression Programming, in Proceedings of the International Conference on Artificial Intelligence, Las Vegas, USA,2002:1355-1361.
    [158]Li. Q., Cai Z., Jianq S., Zhu L.. Gene expression programming in prediction, in Proceedings of the World Congress on Intelligent Control and Automation, 2004:2171-2175.
    [159]Zhuli Xie, Xin Li, Barbara Di Eugenio, Weimin Xiao, Thomas M. Tirpak, Peter C.Nelson. Using Gene Expression Programming to Construct Sentence Ranking Functions for Text Summarization. In Proceedings of the 20th International Conference on Computational Linguistics, Geneva, Switzerland, August 2004, COLING-2004:1381-1384.
    [160]Stewart W. Wilson. Classifier Conditions Using Gene Expression Programming. IlliGAL Report No.2008001, January,2008.
    [161]Kejun Zhang, Shouqian Sun, Hongzong Si. Prediction of Retention Times for a Large Set of Pesticides Based on Improved Gene Expression Programming. In Proceedings of the 10th annual conference on Genetic and evolutionary computation, Atlanta, GA, USA, Association for Computing Machinery,2008: 1725-1726.
    [162]Kejun Zhang, Shouqian Sun, Chunlei Chai, Hongzong Si. Quantitative structure activity relationship models for the prediction of rat lethal dose 50% of aldehydes based on heuristic method and support vector machine. Chinese Journal of Analytical Chemistry,2007,35(9):1263-1268.
    [163]Kejun Zhang, Shouqian Sun, Yongbo Tang, Hongzong Si. Gene Expression Programming for the Prediction of Acute Toxicity of Aldehydes. Chinese Journal of Analytical Chemistry,2009,37(3):425-428.
    [164]SI Hongzong, ZHANG Kejun, HU Zhide, FAN Botao. QSAR study of 1,4-dihydropyridine calcium channel antagonists based on gene expression programming. Bioorganic & Medicinal Chemistry.2006,14(15),4834-4841.
    [165]方旺盛,张克俊,邵利平.基于改进的基因表达式编程的复杂函数建模.计算机工程,2006,32(21):188-190.
    [166]Kejun Zhang, Yuxia Hu, Gang Liu. An improved gene expression programming for solving inverse problem. Proceedings of the World Congress on Intelligent Control and Automation (WCICA), Proceedings of the World Congress on Intelligent Control and Automation (WCICA),2006(1):3371-3375.
    [167]Hongzong Si, Ning Lian, Shuping Yuan, Aiping Fu, Yun-Bo Duan, Kejun Zhang, Xiaojun Yao. Predicting the activity of drugs for a group of imidazopyridine anticoccidial compounds. European Journal of Medicinal Chemistry,44 (2009):4044-4050.
    [168]Si H.Z., Zhang K.J. Hu Z.D., Fan B.T. QSAR Model for Prediction Capacity Factor of Molecular Imprinting Polymer Based on Gene Expression Programming. QSAR & Combinatorial Science,2007,26(1):41-50.
    [169]SI Hongzong, YUAN Shuping, ZHANG Kejun, FU Aiping, DUANYun-Bo, HU Zhide. Quantitative structure activity relationship study on EC50 of anti-HIV drugs. Chemometrics and Intelligent Laboratory Systems,2008,90:15-24.
    [170]Wang T, Si HZ, Chen PP, Zhang KJ, YAO Xiaojun. QSAR models for the dermal penetration of polycyclic aromatic hydrocarbons based on Gene Expression Programming. QSAR & Combinatorial Science,2008,27(7): 913-921.
    [171]Hongzong Si, Tao Wang, Kejun Zhang, Yun-Bo Duana, Shuping Yuan, Aiping Fu, Zhide Hu. Quantitative structure activity relationship model for predicting the depletion percentage of skin allergic chemical substances of glutathione. Analytica Chimica Acta,591(2007):255-264.
    [172]Kimura, M. The Neutral Theory of Molecular Evolution, Cambridge University Press, Cambridge, UK.1983.
    [173]彭京,唐常杰,李川,胡建军M-GEP:基于多层染色体基因表达式编程的遗传进化算法.计算机学报[J],2005,28(9):1459-1466.
    [174]Darrell Whitley. A genetic algorithm tutorial. Technical Report CS-93-103, Department of Computer Science, Colorado State University, August 1993.
    [175]杨海军,李敏强.进化算法中的模式定理及建筑块[J].计算机学报,2003,26(11):1550-1554.
    [176]J. Schaffer. Genetic Algorithms and Simulated Annealing. Los Altos, CA: Morgan Kaufmann,1987.
    [177]Goldberg D E, Sast ry K. A practical schema theorem for genetic Algorithms design and tuning[R]. Urbana:University of Illino is at Urbana Champaign, 2001.
    [178]Tobias Blickle, Lothar Thiela. A mathmatical analysis of tournament selection Proc.6th Int. Conf. on Genetic Algorithms, San mateo, CA:Morgan kaufmann, 9-16.
    [179]Mayr E..1954. Change of Genetic Environment and Evolution. In J. Huxley, A. C. Hardy, and E. B. Ford, eds., Evolution as a Process, Allen and Unwin, London,157-180.
    [180]Mayr, E.. Animal Species and Evolution, Harvard University Press, Cambridge, Massachusetts,1963.
    [181]胡建军,唐常杰,段磊,左劫,彭京,元昌安.基因表达式编程初始种群的多样化策略.计算机学报[J],2007,30(2):305-310.
    [182]胡建军,吴晓云.基因表达式编程中的优势种群产生策略.小型微型计算机系统,2009,30(8):1660-1662.
    [183]蒋思伟,蔡之华,曾丹,李曲,程远方.基于模拟退火的并行基因表达式编程算法研究.电子学报[J],2005,33(11):2017-2021.
    [184]姜玥,唐常杰,郑明秀,叶尚玉,吴江.基因表达式编程中动态适应的远缘繁殖策略.四川大学学报(工程科学版),2007,39(2):121-126.
    [185]黄隆胜,廖颀.GEP软件设计及其K表达式快速求值算法.计算机工程与设计[J],2007,28(4):775-776.
    [186]Murphy P., Aha, D. UCI repository of machine learning databases,1994.
    [187]S. B. Thrun, T. Mitchell, J. Cheng. The monk's problems-A performance comparison of different learning algorithms. Carnegie Mellon University, Computer Science Department, CS-CMU-91-197,1991.
    [188]University of South Florida. Digital database for screening mammography. http://marathon.csee.usf.edu/Mammography/Database.html,2001.
    [189]王磊,朱淼良,邓丽萍,袁昕.一种基于二维粒子的自动检测乳腺钼靶片上微钙化点簇的方法[J].计算机研究与发展,2009,46(9):1438-1445.
    [190]Forina M. R Leardi, C Armanino, S Lanteri. PARVUS-An Extendible Package for Data Exploration, Classification and Correlation. Institute of Pharmaceutical and Food Analysis and Technologies, Via Brigata Salerno,16147 Genoa, Italy.
    [191]Mark T., Cronin D., Wayne Schultz T. Structure Toxicity Relationships for Three Mechanisms of Action of Toxicity to Vibrio fischeri. Ecotox. Environ. Safe,1998, 39(1):65-69.
    [192]Schultz T.W., Bryant S.E., Lin D. T. Structure-toxicity relationships for Tetrahymena:Aliphatic aldehydes. Bull. Environ. Contam.Toxicol,1994, 52(2):279-285.
    [193]Byvatov E., Fechner U., Sadowski J., Schneider G. J. Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification. Chem. Inf. Comput. Sci,2003,43(4):1882-1889.
    [194]Liu H. X., Zhang R. S., Luan F., Yao X. J., Liu M. C., Hu Z. D., Fan B. D. Diagnosing Breast Cancer Based on Support Vector Machines. J. Chem. Inf. Comput. Sci,2003,43(3):900-907.
    [195]Burbidge R., Trotter M., Buxton B., S Holden. Drug design by machine learning: support vector machines for pharmaceutical data analysisComput. Chem,2001, 26(21):5-14.
    [196]Liu H. X., Zhang R. S., Yao X. J. Liu M. C., Hu D., Fan B. T.. QSAR Study of Ethyl 2-[(3-Methyl-2,5-dioxo(3-pyrrolinyl)) amino]-4-(trifluoromethyl) pyrimidine-5-carboxylate:An Inhibitor of AP-1 and NF-κB Mediated Gene Expression Based on Support Vector MachinesJ. Chem. Inf. Comput. Sci,2003, 43(4):1288-1296.
    [197]HyperChem. Release 4.0 for Windows, Hypercube, Inc.,1995.
    [198]Dewar M. J. S., Zoebisch E. G., Healy E. F., Stewart J. J. P.. Development and use of quantum mechanical molecular models.76. AM1:a new general purpose quantum mechanical molecular model. J.Am.Chem.Soc,1985,107(13): 3902-3909.
    [199]Stewart J. P. P. MOPAC 6.0, Quantum Chemistry Program Exchange;QCPE, No. 455, Indiana University, Bloomington,1989.
    [200]Amsterdam. Analytical methods for Pesticide residues in foodstuff. In:General Inspectorate for Health Protect,6th ed. Ministry of Health, Welfare and Spot, The Netherlands,1996.
    [201]Chun O.K., Kang H.G.. Estimation of risks of pesticide exposure, by food intake, to Koreans. Food Chem. Toxicol.2003,41:1063-1076.
    [202]Stajnbaher D., Zupancic-Kralj L. Multi-residue method for determination of 90 pesticides in fresh fruits and vegetables using solid-phase extraction and gas chromatography-mass spectrometry. J. Chromatogr.2003,1015:185-198.
    [203]Xiuyong Li, Feng Luan, Hongzong Si, Zhide Hu, Mancang Liu. Prediction of retention times for a large set of pesticides or toxicants based on support vector machine and the heuristic method. Toxicology Letters,2007(175):136-144.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700