CASIA OpenIR
(This search is based on users' claimed works)

Browse/Search results: 6 items in total, showing items 1-6

Policy Gradient Methods with Gaussian Process Modelling Acceleration (Conference paper)
Anchorage, AK, USA, 14-19 May 2017
Authors:  Li, Dong;  Zhao, Dongbin;  Zhang, Qichao;  Luo, Chaomin
Adobe PDF (720Kb)  |  Views/Downloads: 298/93  |  Submitted: 2017/12/28
Event-Triggered H-infinity Control for Continuous-Time Nonlinear System via Concurrent Learning (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, Volume: 47, Issue: 7, Pages: 1071-1081
Authors:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
Adobe PDF (2937Kb)  |  Views/Downloads: 521/238  |  Submitted: 2017/05/04
Concurrent Learning; Event-triggered Control; H-infinity Optimal Control; Neural Networks (NNs); Zero-sum (ZS) Game
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs (Journal article)
NEUROCOMPUTING, 2017, Volume: 238, Issue: *, Pages: 377-386
Authors:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
Adobe PDF (1508Kb)  |  Views/Downloads: 606/264  |  Submitted: 2017/05/04
Adaptive Dynamic Programming; Optimal Control; Neural Network; Fully Cooperative Games; Data-driven; Constrained Input
ADP with MCTS algorithm for Gomoku (Conference paper)
Athens, Greece, 6-9 Dec. 2016
Authors:  Tang Zhentao;  Zhao Dongbin;  Shao Kun;  Lv Le
Adobe PDF (866Kb)  |  Views/Downloads: 651/304  |  Submitted: 2017/05/08
Data-driven adaptive dynamic programming for two-player nonzero-sum game (Conference paper)
Chongqing, China, 2017-5
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF (141Kb)  |  Views/Downloads: 336/181  |  Submitted: 2017/05/04
FMR-GA -- A cooperative multi-agent reinforcement learning algorithm based on gradient ascent (Journal article)
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10634), 2017, Issue: *, Pages: 840-848
Authors:  Zhen Zhang;  Dongqing Wang;  Dongbin Zhao;  Tingting Song
Views/Downloads: 137/0  |  Submitted: 2017/12/31
Reinforcement Learning; Multi-agent; Gradient Ascent; Q-learning