CASIA OpenIR
(Note: the search results are based on claimed items)

Browse/Search Results:  1-10 of 17 Help

Filters        
Selected(0)Clear Items/Page:    Sort:
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 8, 页码: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
View  |  Adobe PDF(1021Kb)  |  Favorite  |  View/Download:16/0  |  Submit date:2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, 卷号: 3, 期号: 1, 页码: 73-84
Authors:  Kun Shao;  Yuanheng Zhu;  Dongbin Zhao
View  |  Adobe PDF(4125Kb)  |  Favorite  |  View/Download:66/43  |  Submit date:2019/04/22
Reinforcement Learning, Transfer Learning, Curriculum Learning, Neural Network, Game Ai  
Deep reinforcement learning based automatic exploration for navigation in unknown environment 期刊论文
IEEE Transactions on Neural Network and Learning Systems, 2019, 期号: early acess, 页码: 1-13
Authors:  Li Haoran;  Zhang Qichao;  Zhao Dongbin
View  |  Adobe PDF(2946Kb)  |  Favorite  |  View/Download:54/35  |  Submit date:2019/10/09
Automatic Exploration  Deep Reinforcement Learning  Optimal Decision  Partial Observation  
A Review of Computational Intelligence for StarCraft AI 会议论文
, Bangalore, India, 18-21 Nov. 2018
Authors:  Tang, Zhentao;  Shao, Kun;  Zhu, Yuanheng;  Li, Dong;  Zhao, Dongbin;  Huang, Tingwen
View  |  Adobe PDF(131Kb)  |  Favorite  |  View/Download:65/47  |  Submit date:2019/04/25
Event-Based Robust Control for Uncertain Nonlinear Systems Using Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 1, 页码: 37-50
Authors:  Zhang, Qichao;  Zhao, Dongbin;  Wang, Ding
View  |  Adobe PDF(2233Kb)  |  Favorite  |  View/Download:179/117  |  Submit date:2017/05/04
Adaptive Dynamic Programming (Adp)  Event-based Control  Neural Network (Nn)  Robust Control  Unmatched Uncertainties  
FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 6, 页码: 1367-1379
Authors:  Zhang, Zhen;  Zhao, Dongbin;  Gao, Junwei;  Wang, Dongqing;  Dai, Yujie
Favorite  |  View/Download:78/0  |  Submit date:2017/07/18
Multiagent Reinforcement Learning (Marl)  Nash Equilibrium  Q-learning  Repeated Game  
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs 期刊论文
NEUROCOMPUTING, 2017, 卷号: 238, 期号: *, 页码: 377-386
Authors:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
View  |  Adobe PDF(1508Kb)  |  Favorite  |  View/Download:167/90  |  Submit date:2017/05/04
Adaptive Dynamic Programming  Optimal Control  Neural Network  Fully Cooperative Games  Data-driven  Constrained Input  
Cooperative Reinforcement Learning for Multiple Units Combat in StarCraft 会议论文
, Honolulu, Hawaii, USA, Nov. 27 to Dec 1, 2017
Authors:  Shao K(邵坤);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
View  |  Adobe PDF(1378Kb)  |  Favorite  |  View/Download:179/109  |  Submit date:2017/09/20
Policy Iteration for Hinfinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE Transactions on Cybernetics, 2017, 期号: PP, 页码: 1-9
Authors:  Yuanheng Zhu;  Zhao DB(赵冬斌)
View  |  Adobe PDF(894Kb)  |  Favorite  |  View/Download:73/39  |  Submit date:2017/09/13
Adaptive Dynamic Programming (Adp)  H∞ Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2016, 卷号: 46, 期号: 11, 页码: 1544-1555
Authors:  Wang, Ding;  Liu, Derong;  Zhang, Qichao;  Zhao, Dongbin
View  |  Adobe PDF(1082Kb)  |  Favorite  |  View/Download:128/65  |  Submit date:2017/02/14
Adaptive Critic Designs  Adaptive Dynamic Programming  Intelligent Control  Neural Networks  Policy Iteration  Robust Optimal Control  System Identification  Uncertain Nonlinear Systems