CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Policy Gradient Methods with Gaussian Process Modelling Acceleration 会议论文
, Anchorage, AK, USA, 14-19 May 2017
作者:  Li, Dong;  Zhao, Dongbin;  Zhang, Qichao;  Luo, Chaomin
浏览  |  Adobe PDF(720Kb)  |  收藏  |  浏览/下载:338/108  |  提交时间:2017/12/28
Cooperative Reinforcement Learning for Multiple Units Combat in StarCraft 会议论文
, Honolulu, Hawaii, USA, Nov. 27 to Dec 1, 2017
作者:  Shao K(邵坤);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(1378Kb)  |  收藏  |  浏览/下载:574/278  |  提交时间:2017/09/20
ADP with MCTS algorithm for Gomoku 会议论文
, Athens, Greece, 6-9 Dec. 2016
作者:  Tang Zhentao;  Zhao Dongbin;  Shao Kun;  Lv Le
浏览  |  Adobe PDF(866Kb)  |  收藏  |  浏览/下载:717/324  |  提交时间:2017/05/08
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 3, 页码: 714-725
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
浏览  |  Adobe PDF(547Kb)  |  收藏  |  浏览/下载:477/195  |  提交时间:2017/05/05
Adaptive Dynamic Programming (Adp)  H-infinity Control  Policy Iteration (Pi)  Zero-sum Game (Zsg)  
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs 期刊论文
NEUROCOMPUTING, 2017, 卷号: 238, 期号: *, 页码: 377-386
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(1508Kb)  |  收藏  |  浏览/下载:669/282  |  提交时间:2017/05/04
Adaptive Dynamic Programming  Optimal Control  Neural Network  Fully Cooperative Games  Data-driven  Constrained Input