CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共10条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Reinforcement Learning for Build-Order Production in StarCraft II 会议论文
, Cordoba, Granada, and Seville, Spain, 30 June-6 July 2018
作者:  Zhentao Tang;  Dongbin Zhao;  Yuanheng Zhu;  Ping Guo
Adobe PDF(2680Kb)  |  收藏  |  浏览/下载:197/59  |  提交时间:2021/07/07
深度强化学习进展: 从 AlphaGo 到 AlphaGo Zero 期刊论文
控 制 理 论 与 应 用, 2017, 卷号: 34, 期号: 12, 页码: 1529-1546
作者:  唐振韬;  邵 坤;  赵冬斌;  朱圆恒
Adobe PDF(8232Kb)  |  收藏  |  浏览/下载:283/46  |  提交时间:2021/07/05
深度强化学习  AlphaGo Zero  深度学习  强化学习  人工智能  
Visual navigation with Actor-Critic deep reinforcement learning 会议论文
, Rio, Brazil, 2018-01
作者:  Kun Shao;  Dongbin Zhao;  Yuanheng Zhu;  Qichao Zhang
Adobe PDF(1827Kb)  |  收藏  |  浏览/下载:341/138  |  提交时间:2019/04/22
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, 卷号: 3, 期号: 1, 页码: 73-84
作者:  Kun Shao;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(4125Kb)  |  收藏  |  浏览/下载:364/137  |  提交时间:2019/04/22
Reinforcement Learning, Transfer Learning, Curriculum Learning, Neural Network, Game Ai  
Cooperative Reinforcement Learning for Multiple Units Combat in StarCraft 会议论文
, Honolulu, Hawaii, USA, Nov. 27 to Dec 1, 2017
作者:  Shao K(邵坤);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(1378Kb)  |  收藏  |  浏览/下载:573/278  |  提交时间:2017/09/20
深度强化学习综述:兼论计算机围棋的发展 期刊论文
控制理论与应用, 2016, 卷号: 33, 期号: 6, 页码: 701-717
作者:  赵冬斌;  邵坤;  朱圆恒;  李栋;  陈亚冉;  王海涛;  刘德荣;  周彤;  王成红
浏览  |  Adobe PDF(2816Kb)  |  收藏  |  浏览/下载:1807/659  |  提交时间:2017/09/13
深度强化学习  初弈号  深度学习  强化学习  人工智能  
ADP with MCTS algorithm for Gomoku 会议论文
, Athens, Greece, 6-9 Dec. 2016
作者:  Tang Zhentao;  Zhao Dongbin;  Shao Kun;  Lv Le
浏览  |  Adobe PDF(866Kb)  |  收藏  |  浏览/下载:714/323  |  提交时间:2017/05/08
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs 期刊论文
NEUROCOMPUTING, 2017, 卷号: 238, 期号: *, 页码: 377-386
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(1508Kb)  |  收藏  |  浏览/下载:668/282  |  提交时间:2017/05/04
Adaptive Dynamic Programming  Optimal Control  Neural Network  Fully Cooperative Games  Data-driven  Constrained Input  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
作者:  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2156Kb)  |  收藏  |  浏览/下载:294/116  |  提交时间:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation  
Self-teaching adaptive dynamic programming for Gomoku 期刊论文
NEUROCOMPUTING, 2012, 卷号: 78, 期号: 1, 页码: 23-29
作者:  Zhao, Dongbin;  Zhang, Zhen;  Dai, Yujie
收藏  |  浏览/下载:208/0  |  提交时间:2015/08/12
Gomoku  Reinforcement Learning  Adaptive Dynamic Programming  Temporal Difference Learning  Neural Network