CASIA OpenIR

浏览/检索结果: 共7条,第1-7条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Deep reinforcement learning with Experience Replay based on SARSA 会议论文
, *, 2016-9
作者:  Zhao,Dongbin(赵冬斌);  Wang,Haitao;  Shao,Kun;  Zhu,Yuanheng
浏览  |  Adobe PDF(1288Kb)  |  收藏  |  浏览/下载:381/172  |  提交时间:2018/01/04
Deep Learning  Reinforcement Learning  Experience Replay  q Learning  Sarsa Learning  
Move Prediction in Gomoku Using Deep Learning 会议论文
, Wuhan, China, November 11-13, 2016
作者:  Shao, Kun;  Zhao, Bongbin;  Tang, Zhentao;  Zhu, Yuanheng
浏览  |  Adobe PDF(321Kb)  |  收藏  |  浏览/下载:802/393  |  提交时间:2017/12/29
Gomoku  Move Prediction  Deep Learning  Deep Convolutional Network  
深度强化学习综述:兼论计算机围棋的发展 期刊论文
控制理论与应用, 2016, 卷号: 33, 期号: 6, 页码: 701-717
作者:  赵冬斌;  邵坤;  朱圆恒;  李栋;  陈亚冉;  王海涛;  刘德荣;  周彤;  王成红
浏览  |  Adobe PDF(2816Kb)  |  收藏  |  浏览/下载:1716/633  |  提交时间:2017/09/13
深度强化学习  初弈号  深度学习  强化学习  人工智能  
Convolutional fitted Q iteration for vision-based control problems 会议论文
, Vancouver, BC, Canada, 24-29 July 2016
作者:  Zhao Dongbin;  Zhu Yuanheng;  Lv Le;  Chen Yaran;  Zhang Qichao
浏览  |  Adobe PDF(240Kb)  |  收藏  |  浏览/下载:336/115  |  提交时间:2017/05/08
Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations 会议论文
, Vancouver, Canada, 2016-7
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng;  Chen, Xi
浏览  |  Adobe PDF(339Kb)  |  收藏  |  浏览/下载:260/85  |  提交时间:2017/05/04
Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2016, 卷号: 10, 期号: 12, 页码: 1339-1347
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
浏览  |  Adobe PDF(976Kb)  |  收藏  |  浏览/下载:373/161  |  提交时间:2016/12/26
Nonlinear Control Systems  Continuous Time Systems  Learning (Artificial Intelligence)  Optimal Control  Dynamic Programming  Lyapunov Methods  Linear Systems  Reinforcement Learning  Continuous-time Problem  Nonlinear Optimal Tracking Problem  Adaptive Dynamic Programming  Model-free Adaptive Optimal Tracking Algorithm  Lyapunov Analysis  Linear System  
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 3, 页码: 854-865
作者:  Zhao, Dongbin;  Zhang, Qichao;  Wang, Ding;  Zhu, Yuanheng
浏览  |  Adobe PDF(1769Kb)  |  收藏  |  浏览/下载:478/191  |  提交时间:2016/06/14
Adaptive Dynamic Programming (Adp)  Experience Replay  Nonzero-sum (Nzs) Games  Optimal Control  Unknown Dynamics