一类未知非线性系统中有限域最优跟踪控制的迭代DHP算法

英文篇名：The Iterative DHP Algorithm for Finite-Horizon Optimal Tracking Control for a Class of Nonlinear Systems
作者：吴天庆 ; 陈全发 ; 廖芳芳
英文作者：WU Tian-qing;CHEN Quan-fa;LIAO Fang-fang;School of Computer Engineering,Shangqiu University;School of Mathematics and Finance,Xiangnan University;
关键词：离散时间系统 ; 最优跟踪 ; 自适应动态规划 ; 双启发式动态规划 ; 有限时域最优跟踪控制 ; 神经网络 ; 神经控制
英文关键词：discrete time system;;optimal tracking;;approximate dynamic programming;;dual heuristic dynamic programming;;finite-horizon optimal tracking control;;neural networks;;neural control
中文刊名：XYDZ
英文刊名：Journal of Xiangtan University(Natural Science Edition)
机构：商丘学院计算机工程学院;湘南学院数学与金融学院;
出版日期：2019-02-15
出版单位：湘潭大学学报(自然科学版)
年：2019
期：v.41;No.150
基金：国家自然科学基金项目NNSF(11701487);; 湖南省教育厅优秀青年项目(17B258)
语种：中文;
页：XYDZ201901008
页数：11
CN：01
ISSN：43-1549/N
分类号：86-96

摘要

针对一类未知非线性离散时间系统,提出了一种无模型时域有限差分最优跟踪控制方案.在有限时域最优控制理论的框架下,将跟踪控制问题转化为误差动态调节器,引入迭代自适应动态规划(ADP)算法,通过双启发式动态规划(DHP)技术,分别用三个神经网络逼近误差动力学、成本函数和控制率,结合成本函数和控制率的收敛性分析,得到有限时域最优控制器.通过仿真实例验证了跟踪控制方案的有效性.
A model-free finite-horizon optimal tracking control scheme for a class of unknown nonlinear discrete-time systems is proposed in this paper.The tracking control problem is converted into designing a regulator for the tracking error dynamics under the framework of finite-horizon optimal control theory.The iterative adaptive dynamic programming(ADP)algorithm is introduced,via dual heuristic dynamic programming(DHP)technique,three neural networks are taken to approximate the error dynamics,the cost function,and the control law,respectively,with convergence analysis in terms of cost function and control law,obtain the finite-horizon optimal controller.The effectiveness of the tracking control scheme is verified by simulation.

引文

[1]刘德荣,李宏亮,王鼎.基于数据的自学习优化控制:研究进展与展望[J].自动化学报,2013,39(11):1858-1870.
    [2]Rushikesh K,Huyen D,Shubhendu B,et al.Approximate optimal trajectory tracking for continuous-time nonlinear systems[J].Automatica,2015,51:40-48.
    [3]Kyriakos G V,Arman M,Henrique F.Event-triggered optimal tracking control of nonlinear systems[J].International Journal of Robust and Nonlinear Control,2017,27(4):598-319.
    [4]唐功友,刘毅敏,张勇.非线性离散系统的近似最优跟踪控制[J].控制理论与应用,2010,27:1293-1302.
    [5]Rushikesh K,Lindsey A,Patrick W,et al.Model-based reinforcement learning for infinite-horizon approximate optimal tracking[J].IEEE Transactions on Neural Networks and Learning Systems,2017,28:753-758.
    [6]Jagannathan S.Neural network control of nonlinear discrete-time systems[M].Boca Raton:CRC Press,2006.
    [7]张华光,张欣,罗艳红,等.自适应动态规划综述[J].自动化学报,2013,39(4):303-311.
    [8]Ruano A E.Intelligent control systems using computational intelligence techniques[M].London:The Institute of Engineering and Technology,2008.
    [9]Al-Tamimi A,Lewi F L,Abu-Khalaf M.Discrete-time nonlinear HJB solution using approximate dynamic programming:convergence proof[J].IEEE Transactions on System,Man,Cybernetics,Part B,Cybernetics,2008,38:943-949.
    [10]Zhang H,Luo Y,Liu D.Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints[J].IEEE Transactions on Neural Networks,2009,20:1490-1503.
    [11]Wang F Y,Jin N,Liu D,et al.Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with error bound[J].IEEE Transactions on Neural Networks,2011,22:24-36.
    [12]Wang D,Liu D,Zhang Q,et al.Data-based adaptive critic designs for nonlinear robust optimal control with uncertain dynamics[J].IEEE Transactions on Systems,Man,and Cybernetics:Systems,2016,46:1544-1555.
    [13]Dierks T,Jagannathan S.Optimal tracking control of affine nonliner discrete-time systems with unknown internal dynamics[C].Proceedings of Joint 48th IEEE Conference on Decision and Controland 28th Chinese Control Conference,Shanghai,P R China,Dec 2009:6750-6755.
    [14]Park Y M,Choi M S,Lee K Y.An optimal tracking neurocontroller for nonlinear dynamic systems[J].IEEE Transactions on Neural Networks,1996(7):1099-1110.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700