时间序列分析的研究与应用
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
时间序列分析是动态数据分析处理的一种重要的方法,以概率统计学作为理论基础来分析随机数据序列(或称动态数据序列),并对其建立数学模型,以及进一步应用于预测、自适应控制等诸多方面,是一个具有相当高的实际价值的应用研究领域。
     随着时间序列分析方法的日趋成熟,其应用领域越来越广泛,对模型的精度提出了更高的要求。由于时间序列建模过程中的各种参数估计方法最终都会归结为高度非线性函数的优化问题,传统的优化方法估计出的参数可信度较低,因此本文将模拟生物进化过程的遗传算法作为求解复杂优化问题的手段应用时间序列模型的参数估计中来。遗传算法作为一种全局的优化算法,在处理高度非线性函数的优化问题中有着传统优化方法不可比拟的优越性,但在实际应用中也存在一定的缺陷,主要表现在算法的早熟现象、局部寻优能力差、收敛速度慢等,因此本文应用了改进的实数遗传算法(1RGA)对时间序列模型中的参数进行估计。
     本文主要做了以下四方面的研究工作:(1)论述了研究时间序列分析的背景和意义,总结了目前的学科发展以及参数估计方法的研究现状:(2)讨论了时间序列分析建模的理论方法;(3)阐述了遗传算法的发展和研究状况,给出了改进的实数遗传算法的原理和运算流程;(4)在时间序列分析和遗传算法的理论指导下,将遗传算法引入时间序列建模过程中,进行深入的讨论与研究。研究内容及成果主要包括以下几个方面:
     1.介绍了时间序列及其相关的基本概念,分析了随机时间序列的特性、研究中的方法性工具以及时间序列的特征函数,分析并讨论了ARIMA模型体系。介绍了拟合模型参数估计的常用方法,对目前模型参数估计方法存在的不足作了一定的探讨。介绍了Box-Jenkins建模的理论和流程,以及线性最小方差预测的理论方法。
     2.对遗传算法作了概述,简要介绍了基本遗传算法(SGA)的构成要素、运算流程和实现。阐述了实数遗传算法(RGA)研究的发展和研究状况,讨论了实数遗传算法特点,比较全面地描述了目前实数编码遗传算法中常用的选择、交叉、变异算子及适应度函数,给出了改进的实数遗传算法(IRGA)的原理和运算流程。
     3.进行实证分析,将基本遗传算法和改进实数遗传算法分别应用到时间序列模型参数辨识中,并与传统随机时间序列分析所辨识的模型精度进行比较,得出结论:在选定的误差指标下,应用本文中的改进实数遗传算法辨识参数后得到的时间序列模型拟合精度最高。
     4.提出了IRGA-ARIMA建模方法,并将研究的成果应用于电力系统负荷时间序列的建模和预测上,验证了模型的适用性,并取得了较好的预测效果。
Time series analysis is the important approach to solve and analyze the dynamic data. It is actual value to applied research study, which is basic on the probability statistics to analyze random data series (or called dynamic data series), develop the mathematic model, and further apply to forecast, adaptive control, and so on.
     With the development of the method of time series analysis, the applied field is more and more wide, thus the precision of the model is required to higher level. Various method of parameter estimation is come down to solve the optimization problem of the highly nonlinear function, and the confidence about parameter estimation by the traditional optimization approach is lower, therefore, in the paper, the complex optimization problem is solved by applying genetic algorithm (GA) to the parameter estimation of the time series model. GA which is an universal algorithm is more advantageous than the traditional optimization approach when the optimization problem of the highly nonlinear function is achieved. But this algorithm exists some limitation in practice, which appears the phenomena of the prematurity, the bad capability to obtain the local solution of the optimization problem and the slowly converge rate, and so on.. Hence the modified method of GA when we estimate parameters of the time series model by GA is provided.
     In this thesis four aspects are studied. Firstly, illustrate the background and the meaning to analyze the time series, and provide the research actuality of the development and the approach of parameter estimation. Secondly, analyze the theory method of modeling the time series analysis. Thirdly, illustrate the research actuality of the development and the approach of GA and capture the theory and the operator flow of improved real coding based genetic algorithm (IRGA). The last one, according to the theory of the time series analysis and GA, the GA is introduced in the process of modeling the time series, which is discussed and studied. The following conclusions are achieved:
     1. The concepts of time series and the related concepts are provided. It is analyzed which is the characteristics of time series , the research tool and the eigenfunction of time series. ARIMA model system is analyzed and discussed. The usual approach of fitting of model parameter estimation is introduced and the limitation of parameter estimation is probed. The theory and the flow of the Box-Jenkins modeling are introduced, and the theory method of the linear minimum variance forecast is introduced.
     2. GA is illustrated in this paper, some are also simple introduced, thus the components, algorithm flow and the realization of simple genetic algothm (SGA). Contemporarily, the research development and the characteristics of RGA are illustrated and selection operator, crossover operator、mutation operator and fitness function of RGA is captured. The principle and the algorithm flow of IRGA are provided.
     3. By the demonstration, the precision of SGA and IRGA, which are respectively applied to the parameter estimation of the time series model, are contrasted with the precision of the approach of the traditional time series analysis. In the end, the conclusion is achieved. In the case of the selected error index, the fitting of precision of the time series model, which is obtained by the parameter estimation used IRGA, is the highest.
     4. the research result is applied on modeling and forecasting time series about the electric power system load. The applicability is tested and the forecast effect is obtained.
引文
安德洪,柳湘月,刘嘉煜,许树荆.2002.基于季节ARIMA模型的电力负荷建模与预报.38(2):184~188
    陈建.2006.电力负荷预测方法[J].渤海大学学报(自然科学版).28(2):142
    陈小平,于盛林.2003.实数遗传算法交叉策略的改进[J].电子学报.31(1):71~74
    陈晓梅,杨成祥.2005.遗传进化算法在时间序列建模中的应用.计算机工程与应用.5:215
    邓华玲.2003.概率统计方法与应用[M].中国农业出版社.163~285
    邓祖新.2006.数据分析方法和SAS系统[M].上海财经大学出版社.391~461
    高惠旋.2000.实用统计方法与SAS系统[M].北京大学出版社.188~209
    顾岚.1994.时间序列分析在经济分析中的应用[M].中国统计出版社.8~103
    何大阔,王福利.2003.一种提高遗传算法全局收敛性的方法[J].东北大学学报(自然科学版).24 (6):511-514
    金菊良,杨晓华,丁晶.2001.标准遗传算法的改进方案——加速遗传算法[J].系统工程理论与实践.4:8~13
    靖稳峰,魏红,段惠娣.2000.遗传算法及其发展现状[J].西安工业学院学报.20(3):230~235
    来鹏,陈平.2004.基于遗传算法的时序混合模型的参数估计.山西师范大学学报(自然科学版).18(3):15~17
    雷英杰,张善文等.2006.Matlab遗传算法工具箱及其应用[M].西安电子科技大学出版社.1~207
    刘芳.2003.基于免疫算法的AR时间序列模型的参数估计[J].计算机科学.13(5):17~20
    彭新竹.2002.遗传算法的改进策略及其应用[J].华东船舶工业学院学报,16(3):53~58
    孙靖,程大章.2004.基于季节性时间序列模型的空调负荷预测.19(3):88~93
    孙晓云,蔡远利.2004.利用改进遗传算法的参数估计.自动化技术与应用.23(1):23~26
    孙晓云,高鑫,王鹏.2005.新型并行遗传算法及其在参数估计中的应用.计算机工程与应用.19:50~52
    田小梅,龚静.2005.实数编码遗传算法的评述,湖南环境生物职业技术学院学报[J].11(1):025~031
    王吉权,赵玉林,吕艳玲.2004.季节周期预测法在电力系统季负荷预测中的应用[M].35(4):476~479
    王福林,王吉权,吴昌友,吴秋峰.2006.实数遗传算法的改进研究[J].生物数学学报.21(1):153~158
    王燕.2005.应用时间序列分析[M].中国人民大学出版社.1~239
    王振龙.2000.时间序列分析[M].中国统计出版社.2~4
    吴怀宇.2004.时间序列分析与综合.武汉大学出版社.10~98
    吴喜.2006.时间序列建模与模型选择的应用研究
    项静恬等.1986.动态数据处理与时间序列分析[M].气象出版社.1~121
    杨叔子,吴雅等.1991.时间序列分析的工程应用[M].华中理工大学出版社.1~242
    于洪彦.2001.Excel统计分析与决策[M].高等教育出版社.26~48
    于良义,张银生.2002.实用概率统计.中国人民大学出版社.123~146
    张金槐.1999.线性模型参数估计及其改进[M].国防科技大学出版社.12~87
    张树京,齐立心.2003.时间序列分析简明教程[M].清华大学出版社,北方交通大学出版社.1~155
    赵云珍.2005.遗传算法及其改进.16~24
    钟小鹏,杨湘,王跃科.2001.MATLAB在动态数据建模中的应用[J].电脑开发与应用.14(10):21~23
    周明,孙树栋.1991.遗传算法原理及应用[M].国防工业出版社.18~78
    G. E. RBox, G. N. Jenkins and G. C. Reinsel著.顾岚主译.1997.时间序列分析、预测与控制[M].中国统计出版社.12~287
    Frantisek S. 2002. Predictions in Time Series using Regression Models[M]. Springer-Verlag.2~38
    Kristinsson K. 1992. System identigication and control using genetic algorithm[J]. IEEE Trans Sys Man Cybernetics. 22(5): 1033~1046
    Michalewicz Z. 1994. Genetic algorithm +data structure=evolution programs[M] . Springer--Verlag. 16~42
    Schoukens J, Pintelon R. 1991. Identigication of linear systems a practical guideline to accurate modeling[M]. Pressman. 221~229
    User's Guide. 2004. Genetic Algorithm and Direct Search Toolbox for Use with MATLAB[J]. The Math Works. 22 (1): 18~23
    Zhang G P. 2003. Time series forecasting using a hybird ARIMA and neural network model[J]. Journal of Liaoning Technical University. 22(2): 18~19

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700