CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 10, 页码: 3341-3354
作者:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning;  Wang, Ding;  Lewis, Frank L.
浏览  |  Adobe PDF(3217Kb)  |  收藏  |  浏览/下载:572/204  |  提交时间:2016/11/09
Adaptive Control  Adaptive Dynamic Programming (Adp)  Data-based  Off-policy Learning  Optimal Control  Policy Gradient  
Improving the Critic Learning for Event-Based Nonlinear H-infinity Control Design 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 10, 页码: 3417-3428
作者:  Wang, Ding;  He, Haibo;  Liu, Derong
浏览  |  Adobe PDF(1068Kb)  |  收藏  |  浏览/下载:415/109  |  提交时间:2018/03/03
H-infinity Control  Adaptive Systems  Adaptive/approximate Dynamic Programming  Critic Network  Event-based Design  Learning Criterion  Neural Control  
Online identifier-actor-critic algorithm for optimal control of nonlinear systems 期刊论文
OPTIMAL CONTROL APPLICATIONS & METHODS, 2017, 卷号: 38, 期号: 3, 页码: 317-335
作者:  Lin, Hanquan;  Wei, Qinglai;  Liu, Derong
浏览  |  Adobe PDF(2888Kb)  |  收藏  |  浏览/下载:269/72  |  提交时间:2017/07/18
Adaptive Dynamic Programming  Optimal Control  Discrete-time  Nonlinear System  Neural Network  Online Learning  Lyapunov Method  
Model-Free Optimal Tracking Control via Critic-Only Q-Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 10, 页码: 2134-2144
作者:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Wang, Ding;  Luo,Biao
浏览  |  Adobe PDF(1521Kb)  |  收藏  |  浏览/下载:569/283  |  提交时间:2016/10/24
Critic-only Q-learning (Coql)  Model-free  Nonaffine Nonlinear Systems  Optimal Tracking Control  
Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 3, 页码: 840-853
作者:  Wei, Qinglai;  Liu, Derong;  Lin, Hanquan;  Derong Liu
浏览  |  Adobe PDF(2015Kb)  |  收藏  |  浏览/下载:372/162  |  提交时间:2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neural Networks  Neuro-dynamic Programming  Optimal Control  Reinforcement Learning  Value Iteration