CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共13条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Event-Triggered Adaptive Dynamic Programming for Zero-Sum Game of Partially Unknown Continuous-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 9, 页码: 3189-3199
作者:  Xue, Shan;  Luo, Biao;  Liu, Derong
收藏  |  浏览/下载:203/0  |  提交时间:2020/09/28
Adaptive dynamic programming (ADP)  event-triggered control  Hamilton-Jacobi-Isaacs (HJI) equation  neural network (NN) identifier  zero-sum (ZS) game  
Output Tracking Control Based on Adaptive Dynamic Programming With Multistep Policy Evaluation 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 卷号: 49, 期号: 10, 页码: 2155-2165
作者:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Liu, Jiangjiang
收藏  |  浏览/下载:205/0  |  提交时间:2019/12/16
Adaptive dynamic programming (ADP)  Bellman equation  heuristic dynamic programming  neural networks (NNs)  output tracking control  
Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 12, 页码: 3337-3348
作者:  Luo, Biao;  Yang, Yin;  Liu, Derong
收藏  |  浏览/下载:261/0  |  提交时间:2019/01/08
Data-based  experience replay  neural networks (NNs)  off-policy  optimal control  Q-learning (QL)  
Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties 期刊论文
INFORMATION SCIENCES, 2018, 卷号: 463, 页码: 307-322
作者:  Yang, Xiong;  He, Haibo;  Wei, Qinglai;  Luo, Biao
收藏  |  浏览/下载:188/0  |  提交时间:2018/10/10
Adaptive Dynamic Programming  Neural Networks  Optimal Control  Reinforcement Learning  Robust Control  Unmatched Uncertainty  
Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 6, 页码: 2099-2111
作者:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning
浏览  |  Adobe PDF(1045Kb)  |  收藏  |  浏览/下载:369/113  |  提交时间:2018/10/10
Adaptive Control  Adaptive Dynamic Programming  Constraints  Critic-only  Data-based  Optimal Control  Q-learning  
Multi-step heuristic dynamic programming for optimal control of nonlinear discrete-time systems 期刊论文
INFORMATION SCIENCES, 2017, 卷号: 411, 期号: 0, 页码: 66-83
作者:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Yang, Xiong;  Ma, Hongwen
浏览  |  Adobe PDF(1092Kb)  |  收藏  |  浏览/下载:368/124  |  提交时间:2017/09/12
Optimal Control  Multi-step Heuristic Dynamic Programming  Adaptive Dynamic Programming  Nonlinear Systems  Discrete-time  Neural Networks  
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 10, 页码: 3341-3354
作者:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning;  Wang, Ding;  Lewis, Frank L.
浏览  |  Adobe PDF(3217Kb)  |  收藏  |  浏览/下载:560/204  |  提交时间:2016/11/09
Adaptive Control  Adaptive Dynamic Programming (Adp)  Data-based  Off-policy Learning  Optimal Control  Policy Gradient  
Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning 期刊论文
INFORMATION SCIENCES, 2016, 卷号: 369, 页码: 731-747
作者:  Yang, Xiong;  Liu, Derong;  Luo, Biao;  Li, Chao
收藏  |  浏览/下载:212/0  |  提交时间:2016/12/26
Adaptive Dynamic Programming  Input Constraint  Neural Networks  Optimal Control  Reinforcement Learning  Robust Control  
Model-Free Optimal Tracking Control via Critic-Only Q-Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 10, 页码: 2134-2144
作者:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Wang, Ding;  Luo,Biao
浏览  |  Adobe PDF(1521Kb)  |  收藏  |  浏览/下载:561/283  |  提交时间:2016/10/24
Critic-only Q-learning (Coql)  Model-free  Nonaffine Nonlinear Systems  Optimal Tracking Control  
Reinforcement learning solution for HJB equation arising in constrained optimal control problem 期刊论文
NEURAL NETWORKS, 2015, 卷号: 71, 期号: 0, 页码: 150-158
作者:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen;  Liu, Derong
浏览  |  Adobe PDF(530Kb)  |  收藏  |  浏览/下载:442/194  |  提交时间:2016/03/30
Constrained Optimal Control  Data-based  Off-policy Reinforcement Learning  Hamilton-jacobi-bellman Equation  The Method Of Weighted Residuals