CASIA OpenIR

Browse/search results: 5 items in total, showing items 1-5

FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks (journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 6, Pages: 1367-1379
Authors: Zhang, Zhen; Zhao, Dongbin; Gao, Junwei; Wang, Dongqing; Dai, Yujie
Views/Downloads: 285/0 | Submitted: 2017/07/18
Keywords: Multiagent Reinforcement Learning (MARL); Nash Equilibrium; Q-learning; Repeated Game
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, Volume: 28, Issue: 3, Pages: 714-725
Authors: Zhu, Yuanheng; Zhao, Dongbin; Li, Xiangjun
Adobe PDF (547 KB) | Views/Downloads: 447/186 | Submitted: 2017/05/05
Keywords: Adaptive Dynamic Programming (ADP); H-infinity Control; Policy Iteration (PI); Zero-sum Game (ZSG)
Event-Triggered H-infinity Control for Continuous-Time Nonlinear System via Concurrent Learning (journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, Volume: 47, Issue: 7, Pages: 1071-1081
Authors: Zhang, Qichao; Zhao, Dongbin; Zhu, Yuanheng
Adobe PDF (2937 KB) | Views/Downloads: 536/240 | Submitted: 2017/05/04
Keywords: Concurrent Learning; Event-triggered Control; H-infinity Optimal Control; Neural Networks (NNs); Zero-sum (ZS) Game
Event-based input-constrained nonlinear H infinity state feedback with adaptive critic and neural implementation (journal article)
NEUROCOMPUTING, 2016, Volume: 214, Issue: *, Pages: 848-856
Authors: Wang, Ding; Mu, Chaoxu; Zhang, Qichao; Liu, Derong
Adobe PDF (1090 KB) | Views/Downloads: 354/135 | Submitted: 2017/02/14
Keywords: Adaptive Critic Learning (ACL); Adaptive Dynamic Programming (ADP); Event-based Control; Hamilton-Jacobi-Isaacs (HJI) Equation; Input Constraints; Neural Networks; Nonlinear H-infinity Control; State Feedback
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics (journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2016, Volume: 46, Issue: 3, Pages: 854-865
Authors: Zhao, Dongbin; Zhang, Qichao; Wang, Ding; Zhu, Yuanheng
Adobe PDF (1769 KB) | Views/Downloads: 510/197 | Submitted: 2016/06/14
Keywords: Adaptive Dynamic Programming (ADP); Experience Replay; Nonzero-sum (NZS) Games; Optimal Control; Unknown Dynamics