CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:334/56  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Comprehensive comparison of online ADP algorithms for continuous-time optimal control 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  收藏  |  浏览/下载:434/190  |  提交时间:2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 卷号: 64, 期号: 5, 页码: 4101-4109
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
浏览  |  Adobe PDF(2325Kb)  |  收藏  |  浏览/下载:581/226  |  提交时间:2017/09/12
Actor-critic-identifier  Concurrent Learning  Constrained Input  Event-triggered (Et) Control  Hamilton-jacobi-bellman (Hjb) Equation  
Error Bound Analysis of Q-Function for Discounted Optimal Control Problems With Policy Iteration 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 卷号: 47, 期号: 7, 页码: 1207-1216
作者:  Yan, Pengfei;  Wang, Ding;  Li, Hongliang;  Liu, Derong
Adobe PDF(625Kb)  |  收藏  |  浏览/下载:383/92  |  提交时间:2017/09/12
Adaptive Dynamic Programming (Adp)  Error Analysis  Nonlinear Systems  Policy Iteration  Q-function  
Online identifier-actor-critic algorithm for optimal control of nonlinear systems 期刊论文
OPTIMAL CONTROL APPLICATIONS & METHODS, 2017, 卷号: 38, 期号: 3, 页码: 317-335
作者:  Lin, Hanquan;  Wei, Qinglai;  Liu, Derong
浏览  |  Adobe PDF(2888Kb)  |  收藏  |  浏览/下载:314/81  |  提交时间:2017/07/18
Adaptive Dynamic Programming  Optimal Control  Discrete-time  Nonlinear System  Neural Network  Online Learning  Lyapunov Method  
Intelligent Critic Control With Disturbance Attenuation for Affine Dynamics Including an Application to a Microgrid System 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 卷号: 64, 期号: 6, 页码: 4935-4944
作者:  Wang, Ding;  He, Haibo;  Mu, Chaoxu;  Liu, Derong
Adobe PDF(728Kb)  |  收藏  |  浏览/下载:376/122  |  提交时间:2017/07/18
Adaptive/approximate Dynamic Programming  Adaptive Critic Control  Disturbance Attenuation  Intelligent Control  Neural Identification  Smart Microgrid  
Adaptive near-optimal controllers for non-linear decentralised feedback stabilisation problems 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2017, 卷号: 11, 期号: 6, 页码: 799-806
作者:  Wang, Ding;  He, Haibo;  Zhao, Bo;  Liu, Derong
Adobe PDF(3178Kb)  |  收藏  |  浏览/下载:375/107  |  提交时间:2017/07/18
Adaptive Control  Optimal Control  Continuous Time Systems  Nonlinear Control Systems  Decentralised Control  Feedback  Stability  Control System Synthesis  Adaptive-critic-based Near-optimal Controller  Continuous-time Nonlinear Decentralised Feedback Stabilisation Problem  Decentralised Feedback Control Problem  Updating Rule  Approximation Property  Learning Mechanism  
A neural-network-based online optimal control approach for nonlinear robust decentralized stabilization 期刊论文
SOFT COMPUTING, 2016, 卷号: 20, 期号: 2, 页码: 707-716
作者:  Wang, Ding;  Liu, Derong;  Li, Hongliang;  Ma, Hongwen;  Li, Chao
Adobe PDF(837Kb)  |  收藏  |  浏览/下载:375/89  |  提交时间:2016/06/14
Adaptive Dynamic Programming  Approximate Dynamic Programming  Neural Networks  Online Optimal Control  Robust Decentralized Stabilization  Uncertain Nonlinear Systems  
Reinforcement learning solution for HJB equation arising in constrained optimal control problem 期刊论文
NEURAL NETWORKS, 2015, 卷号: 71, 期号: 0, 页码: 150-158
作者:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen;  Liu, Derong
浏览  |  Adobe PDF(530Kb)  |  收藏  |  浏览/下载:496/210  |  提交时间:2016/03/30
Constrained Optimal Control  Data-based  Off-policy Reinforcement Learning  Hamilton-jacobi-bellman Equation  The Method Of Weighted Residuals