CASIA OpenIR

Browse/Search Results:  1-7 of 7 Help

Selected(0)Clear Items/Page:    Sort:
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 8, 页码: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
View  |  Adobe PDF(1021Kb)  |  Favorite  |  View/Download:13/0  |  Submit date:2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games 会议论文
, Guangzhou China, November 14–18
Authors:  Zhang,Qichao;  Zhao,Dongbin;  Zhang,Sibo
View  |  Adobe PDF(119Kb)  |  Favorite  |  View/Download:74/15  |  Submit date:2017/12/28
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs 期刊论文
NEUROCOMPUTING, 2017, 卷号: 238, 期号: *, 页码: 377-386
Authors:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
View  |  Adobe PDF(1508Kb)  |  Favorite  |  View/Download:158/84  |  Submit date:2017/05/04
Adaptive Dynamic Programming  Optimal Control  Neural Network  Fully Cooperative Games  Data-driven  Constrained Input  
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 3, 页码: 714-725
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
View  |  Adobe PDF(547Kb)  |  Favorite  |  View/Download:137/71  |  Submit date:2017/05/05
Adaptive Dynamic Programming (Adp)  H-infinity Control  Policy Iteration (Pi)  Zero-sum Game (Zsg)  
Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 3, 页码: 704-713
Authors:  Song, Ruizhuo;  Lewis, Frank L.;  Wei, Qinglai
Favorite  |  View/Download:37/0  |  Submit date:2017/05/05
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Integral Reinforcement Learning (Irl)  Nonlinear Systems  Nonzero Sum (Nzs)  Off-policy  
Data-driven adaptive dynamic programming for two-player nonzero-sum game 会议论文
, Chongqing, China, 2017-5
Authors:  Zhang, Qichao;  Zhao, Dongbin
View  |  Adobe PDF(141Kb)  |  Favorite  |  View/Download:124/66  |  Submit date:2017/05/04
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 3, 页码: 854-865
Authors:  Zhao, Dongbin;  Zhang, Qichao;  Wang, Ding;  Zhu, Yuanheng
View  |  Adobe PDF(1769Kb)  |  Favorite  |  View/Download:144/71  |  Submit date:2016/06/14
Adaptive Dynamic Programming (Adp)  Experience Replay  Nonzero-sum (Nzs) Games  Optimal Control  Unknown Dynamics