CASIA OpenIR

Browse/Search results: 5 items in total, showing items 1-5

Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 6, Pages: 2929-2943
Authors: Song, Ruizhuo; Wei, Qinglai; Zhang, Huaguang; Lewis, Frank L.
Views/Downloads: 210/0  |  Submitted: 2021/08/15
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  discrete-time  nonzero-sum (NZS)  off-policy  reinforcement learning (RL)  
Discrete-Time Impulsive Adaptive Dynamic Programming (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2020, Volume: 50, Issue: 10, Pages: 4293-4306
Authors: Wei, Qinglai; Song, Ruizhuo; Liao, Zehua; Li, Benkai; Lewis, Frank L.
Views/Downloads: 259/0  |  Submitted: 2021/01/07
Optimal control  Performance analysis  Nonlinear systems  Dynamic programming  Heuristic algorithms  Indexes  Adaptive critic designs  adaptive dynamic programming (ADP)  approximate dynamic programming  impulsive control  nonlinear systems  optimal control  
Discrete-Time Deterministic Q-Learning: A Novel Convergence Analysis (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 5, Pages: 1224-1237
Authors: Wei, Qinglai; Lewis, Frank L.; Sun, Qiuye; Yan, Pengfei; Song, Ruizhuo
Views/Downloads: 217/0  |  Submitted: 2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Neural Networks (NNs)  Neuro-dynamic Programming  Optimal Control  Q-learning  
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3341-3354
Authors: Luo, Biao; Liu, Derong; Wu, Huai-Ning; Wang, Ding; Lewis, Frank L.
Adobe PDF(3217Kb)  |  Views/Downloads: 591/211  |  Submitted: 2016/11/09
Adaptive Control  Adaptive Dynamic Programming (ADP)  Data-based  Off-policy Learning  Optimal Control  Policy Gradient  
Off-Policy Actor-Critic Structure for Optimal Control of Unknown Systems With Disturbances (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2016, Volume: 46, Issue: 5, Pages: 1041-1050
Authors: Song, Ruizhuo; Lewis, Frank L.; Wei, Qinglai; Zhang, Huaguang
Views/Downloads: 209/0  |  Submitted: 2016/10/20
Adaptive Critic Designs  Adaptive/approximate Dynamic Programming (ADP)  Dynamic Programming  Off-policy  Optimal Control  Unknown System