CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共6条,第1-6条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 10, 页码: 3341-3354
作者:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning;  Wang, Ding;  Lewis, Frank L.
浏览  |  Adobe PDF(3217Kb)  |  收藏  |  浏览/下载:569/204  |  提交时间:2016/11/09
Adaptive Control  Adaptive Dynamic Programming (Adp)  Data-based  Off-policy Learning  Optimal Control  Policy Gradient  
Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning 期刊论文
INFORMATION SCIENCES, 2016, 卷号: 369, 页码: 731-747
作者:  Yang, Xiong;  Liu, Derong;  Luo, Biao;  Li, Chao
收藏  |  浏览/下载:214/0  |  提交时间:2016/12/26
Adaptive Dynamic Programming  Input Constraint  Neural Networks  Optimal Control  Reinforcement Learning  Robust Control  
Model-Free Optimal Tracking Control via Critic-Only Q-Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 10, 页码: 2134-2144
作者:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Wang, Ding;  Luo,Biao
浏览  |  Adobe PDF(1521Kb)  |  收藏  |  浏览/下载:569/283  |  提交时间:2016/10/24
Critic-only Q-learning (Coql)  Model-free  Nonaffine Nonlinear Systems  Optimal Tracking Control  
An Approximate Optimal Control Approach for Robust Stabilization of a Class of Discrete-Time Nonlinear Systems With Uncertainties 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2016, 卷号: 46, 期号: 5, 页码: 713-717
作者:  Wang, Ding;  Liu, Derong;  Li, Hongliang;  Luo, Biao;  Ma, Hongwen
浏览  |  Adobe PDF(328Kb)  |  收藏  |  浏览/下载:367/110  |  提交时间:2016/04/08
Adaptive Dynamic Programming (Adp)  Generalized Hamilton-jacobi-bellman (Ghjb) Equation  Neural Networks  Optimal Control  Robust Control  Successive Approximation Method  Uncertainties  
Bipartite output consensus in networked multi-agent systems of high-order power integrators with signed digraph and input noises 期刊论文
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2016, 卷号: 47, 期号: 13, 页码: 3116-3131
作者:  Ma, Hongwen;  Liu, Derong;  Wang, Ding;  Luo, Biao
Adobe PDF(1913Kb)  |  收藏  |  浏览/下载:364/117  |  提交时间:2016/10/20
Bipartite Output Consensus  High-order  Input Noises  Networked Multi-agent Systems  Power Integrator  Signed Digraph  
Reinforcement learning solution for HJB equation arising in constrained optimal control problem 期刊论文
NEURAL NETWORKS, 2015, 卷号: 71, 期号: 0, 页码: 150-158
作者:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen;  Liu, Derong
浏览  |  Adobe PDF(530Kb)  |  收藏  |  浏览/下载:447/195  |  提交时间:2016/03/30
Constrained Optimal Control  Data-based  Off-policy Reinforcement Learning  Hamilton-jacobi-bellman Equation  The Method Of Weighted Residuals