CASIA OpenIR

Browse/Search results: 12 items in total, showing items 1-10

Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms (Journal article)
NEURAL NETWORKS, 2024, Volume: 169, Pages: 764-777
Authors:  Chen, Yurou;  Zhang, Fengyi;  Liu, Zhiyong
Views/Downloads: 16/0  |  Submitted: 2024/02/22
Reinforcement Learning  Policy gradient  Actor-critic  Value function  Bias-variance trade-off  
Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 6, Pages: 2929-2943
Authors:  Song, Ruizhuo;  Wei, Qinglai;  Zhang, Huaguang;  Lewis, Frank L.
Views/Downloads: 194/0  |  Submitted: 2021/08/15
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  discrete-time  nonzero-sum (NZS)  off-policy  reinforcement learning (RL)  
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1021Kb)  |  Views/Downloads: 402/119  |  Submitted: 2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2018, Volume: 48, Issue: 12, Pages: 3337-3348
Authors:  Luo, Biao;  Yang, Yin;  Liu, Derong
Views/Downloads: 278/0  |  Submitted: 2019/01/08
Data-based  experience replay  neural networks (NNs)  off-policy  optimal control  Q-learning (QL)  
An off-policy iteration algorithm for robust stabilization of constrained-input uncertain nonlinear systems (Journal article)
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2018, Volume: 28, Issue: 18, Pages: 5747-5765
Authors:  Yang, Xiong;  Wei, Qinglai
Views/Downloads: 297/0  |  Submitted: 2019/01/08
constrained input  mismatched uncertainties  off-policy iteration  reinforcement learning  robust stabilization  
Comprehensive comparison of online ADP algorithms for continuous-time optimal control (Journal article)
ARTIFICIAL INTELLIGENCE REVIEW, 2018, Volume: 49, Issue: 4, Pages: 531-547
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  Views/Downloads: 405/180  |  Submitted: 2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, Volume: 28, Issue: 3, Pages: 704-713
Authors:  Song, Ruizhuo;  Lewis, Frank L.;  Wei, Qinglai
Views/Downloads: 179/0  |  Submitted: 2017/05/05
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Integral Reinforcement Learning (IRL)  Nonlinear Systems  Nonzero Sum (NZS)  Off-policy
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3341-3354
Authors:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning;  Wang, Ding;  Lewis, Frank L.
Adobe PDF(3217Kb)  |  Views/Downloads: 571/204  |  Submitted: 2016/11/09
Adaptive Control  Adaptive Dynamic Programming (ADP)  Data-based  Off-policy Learning  Optimal Control  Policy Gradient
Off-Policy Actor-Critic Structure for Optimal Control of Unknown Systems With Disturbances (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2016, Volume: 46, Issue: 5, Pages: 1041-1050
Authors:  Song, Ruizhuo;  Lewis, Frank L.;  Wei, Qinglai;  Zhang, Huaguang
Views/Downloads: 206/0  |  Submitted: 2016/10/20
Adaptive Critic Designs  Adaptive/Approximate Dynamic Programming (ADP)  Dynamic Programming  Off-policy  Optimal Control  Unknown System
Off-policy reinforcement learning for H∞ control design (Journal article)
IEEE Transactions on Cybernetics, 2015, Volume: 45, Issue: 1, Pages: 65-76
Authors:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen
Adobe PDF(680Kb)  |  Views/Downloads: 211/59  |  Submitted: 2016/04/08
Off-policy