CASIA OpenIR

浏览/检索结果: 共26条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Observer based policy iteration algorithm for fault tolerant control of nonlinear systems with actuator faults 会议论文
, Guilin, China, 2016.6
作者:  Zhao B(赵博)
Adobe PDF(1291Kb)  |  收藏  |  浏览/下载:138/35  |  提交时间:2018/01/11
Adaptive dynamic programming based fault compensation control for nonlinear systems with actuator failures 会议论文
, Vancouver, Canada, 2016.7.6
作者:  Zhao B(赵博)
浏览  |  Adobe PDF(274Kb)  |  收藏  |  浏览/下载:121/59  |  提交时间:2018/01/11
Deep reinforcement learning with Experience Replay based on SARSA 会议论文
, *, 2016-9
作者:  Zhao,Dongbin(赵冬斌);  Wang,Haitao;  Shao,Kun;  Zhu,Yuanheng
Adobe PDF(1288Kb)  |  收藏  |  浏览/下载:390/174  |  提交时间:2018/01/04
Deep Learning  Reinforcement Learning  Experience Replay  q Learning  Sarsa Learning  
Two Intersections Traffic Signal Control Method Based on ADHDP 会议论文
, China, 2016
作者:  Cao Lin;  Hu Bin;  Dong Xisong;  XIONG Gang;  Zhu Fenghua;  Shen Zhen;  Shen Dong;  Liu Yuliang
Adobe PDF(438Kb)  |  收藏  |  浏览/下载:317/110  |  提交时间:2017/12/31
深度强化学习综述:兼论计算机围棋的发展 期刊论文
控制理论与应用, 2016, 卷号: 33, 期号: 6, 页码: 701-717
作者:  赵冬斌;  邵坤;  朱圆恒;  李栋;  陈亚冉;  王海涛;  刘德荣;  周彤;  王成红
浏览  |  Adobe PDF(2816Kb)  |  收藏  |  浏览/下载:1725/634  |  提交时间:2017/09/13
深度强化学习  初弈号  深度学习  强化学习  人工智能  
Convolutional fitted Q iteration for vision-based control problems 会议论文
, Vancouver, BC, Canada, 24-29 July 2016
作者:  Zhao Dongbin;  Zhu Yuanheng;  Lv Le;  Chen Yaran;  Zhang Qichao
Adobe PDF(240Kb)  |  收藏  |  浏览/下载:339/115  |  提交时间:2017/05/08
Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations 会议论文
, Vancouver, Canada, 2016-7
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng;  Chen, Xi
浏览  |  Adobe PDF(339Kb)  |  收藏  |  浏览/下载:265/87  |  提交时间:2017/05/04
Neural-network-based robust optimal control of uncertain nonlinear systems using model-free policy iteration algorithm 会议论文
, Vancouver, BC, Canada, 24-29 July 2016
作者:  Li, Chao;  Wang, Ding;  Liu, Derong
浏览  |  Adobe PDF(201Kb)  |  收藏  |  浏览/下载:217/75  |  提交时间:2017/05/03
Policy Iteration for Optimal Control of Weakly Coupled Nonlinear Systems with Completely Unknown Dynamics 会议论文
, Boston, MA, USA, July 6-8, 2016
作者:  Li, Chao;  Wang, Ding;  Liu, Derong;  He, Haibo
浏览  |  Adobe PDF(206Kb)  |  收藏  |  浏览/下载:191/73  |  提交时间:2017/05/03
Guaranteed cost neural tracking control for a class of uncertain nonlinear systems using adaptive dynamic programming 期刊论文
Neurocomputing, 2016, 卷号: 198, 期号: wu, 页码: 80
作者:  Yang, Xiong;  Liu, Derong;  Wei, Qinglai;  Wang, Ding
Adobe PDF(1487Kb)  |  收藏  |  浏览/下载:292/122  |  提交时间:2017/02/23
Adaptive Dynamic Programming  Guaranteed Cost Control  Hamilton-jacobi-bellman Equation  Neural Network  Nonlinear System  Reinforcement Learning