CASIA OpenIR

Browse/Search Results:  1-4 of 4 Help

Selected(0)Clear Items/Page:    Sort:
Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 12, 页码: 3337-3348
Authors:  Luo, Biao;  Yang, Yin;  Liu, Derong
Favorite  |  View/Download:14/0  |  Submit date:2019/01/08
Data-based  experience replay  neural networks (NNs)  off-policy  optimal control  Q-learning (QL)  
Comprehensive comparison of online ADP algorithms for continuous-time optimal control 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  Favorite  |  View/Download:60/24  |  Submit date:2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Deep reinforcement learning with Experience Replay based on SARSA 会议论文
, *, 2016-9
Authors:  Zhao,Dongbin(赵冬斌);  Wang,Haitao;  Shao,Kun;  Zhu,Yuanheng
View  |  Adobe PDF(1288Kb)  |  Favorite  |  View/Download:61/22  |  Submit date:2018/01/04
Deep Learning  Reinforcement Learning  Experience Replay  q Learning  Sarsa Learning  
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 3, 页码: 854-865
Authors:  Zhao, Dongbin;  Zhang, Qichao;  Wang, Ding;  Zhu, Yuanheng
View  |  Adobe PDF(1769Kb)  |  Favorite  |  View/Download:106/48  |  Submit date:2016/06/14
Adaptive Dynamic Programming (Adp)  Experience Replay  Nonzero-sum (Nzs) Games  Optimal Control  Unknown Dynamics