CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共5条,第1-5条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Visual navigation with Actor-Critic deep reinforcement learning 会议论文
, Rio, Brazil, 2018-01
作者:  Kun Shao;  Dongbin Zhao;  Yuanheng Zhu;  Qichao Zhang
浏览  |  Adobe PDF(1827Kb)  |  收藏  |  浏览/下载:296/123  |  提交时间:2019/04/22
Reinforcement Learning for Build-Order Production in StarCraft II 会议论文
, Cordoba, Granada, and Seville, Spain, 30 June-6 July 2018
作者:  Zhentao Tang;  Dongbin Zhao;  Yuanheng Zhu;  Ping Guo
Adobe PDF(2680Kb)  |  收藏  |  浏览/下载:159/49  |  提交时间:2021/07/07
Move Prediction in Gomoku Using Deep Learning 会议论文
, Wuhan, China, November 11-13, 2016
作者:  Shao, Kun;  Zhao, Bongbin;  Tang, Zhentao;  Zhu, Yuanheng
浏览  |  Adobe PDF(321Kb)  |  收藏  |  浏览/下载:828/397  |  提交时间:2017/12/29
Gomoku  Move Prediction  Deep Learning  Deep Convolutional Network  
连续状态系统的近似最优在线强化学习 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2015
作者:  朱圆恒
Adobe PDF(2679Kb)  |  收藏  |  浏览/下载:499/0  |  提交时间:2015/09/02
强化学习  最优控制  近似策略迭代  概率近似最优  连续状态系统  收敛性  在线学习  Kd树  Reinforcement Learning  Optimal Control  Approximate Policy Iteration  Probably Approximately Correct  Continuous-state System  Convergence  Online Learning  Kd-tree  
Online reinforcement learning for continuous-state systems 专著章节/文集论文
出自: Frontiers of Intelligent Control and Information Processing, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore:World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, 2014
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
Adobe PDF(24150Kb)  |  收藏  |  浏览/下载:242/27  |  提交时间:2017/09/13