CASIA OpenIR

浏览/检索结果: 共43条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Traffic Signal Timing via Deep Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2016, 期号: 3, 页码: 247-254
作者:  Li Li;  Lv YS(吕宜生);  Fei-Yue Wang
Adobe PDF(509Kb)  |  收藏  |  浏览/下载:79/39  |  提交时间:2022/04/08
Traffic control , reinforcement learning , deep learning , deep reinforcement learning  
Adaptive dynamic programming for residential energy scheduling with solar energy 会议论文
, Guilin, China, 12-15 June 2016
作者:  Xu YC(徐延才);  Liu DR(刘德荣);  Wei QL(魏庆来);  Luo B(罗彪);  Yancai Xu
浏览  |  Adobe PDF(519Kb)  |  收藏  |  浏览/下载:331/107  |  提交时间:2018/01/18
Decentralized stabilization for nonlinear systems with unknown mismatched interconnections 会议论文
, Kyoto, Japan, 2016.10
作者:  Zhao B(赵博)
浏览  |  Adobe PDF(312Kb)  |  收藏  |  浏览/下载:48/11  |  提交时间:2018/01/11
Observer based policy iteration algorithm for fault tolerant control of nonlinear systems with actuator faults 会议论文
, Guilin, China, 2016.6
作者:  Zhao B(赵博)
Adobe PDF(1291Kb)  |  收藏  |  浏览/下载:187/49  |  提交时间:2018/01/11
Adaptive dynamic programming based fault compensation control for nonlinear systems with actuator failures 会议论文
, Vancouver, Canada, 2016.7.6
作者:  Zhao B(赵博)
浏览  |  Adobe PDF(274Kb)  |  收藏  |  浏览/下载:129/61  |  提交时间:2018/01/11
Online fault compensation control based on policy iteration algorithm for a class of affine nonlinear systems with actuator failures 期刊论文
IET Control Theory & Applications, 2016, 卷号: 10, 期号: 15, 页码: 1816-1823
作者:  Zhao B(赵博)
Adobe PDF(1674Kb)  |  收藏  |  浏览/下载:227/72  |  提交时间:2018/01/11
自适应动态规划  容错控制  
Deep reinforcement learning with Experience Replay based on SARSA 会议论文
, *, 2016-9
作者:  Zhao,Dongbin(赵冬斌);  Wang,Haitao;  Shao,Kun;  Zhu,Yuanheng
Adobe PDF(1288Kb)  |  收藏  |  浏览/下载:416/182  |  提交时间:2018/01/04
Deep Learning  Reinforcement Learning  Experience Replay  q Learning  Sarsa Learning  
深度强化学习综述:兼论计算机围棋的发展 期刊论文
控制理论与应用, 2016, 卷号: 33, 期号: 6, 页码: 701-717
作者:  赵冬斌;  邵坤;  朱圆恒;  李栋;  陈亚冉;  王海涛;  刘德荣;  周彤;  王成红
Adobe PDF(2816Kb)  |  收藏  |  浏览/下载:1807/659  |  提交时间:2017/09/13
深度强化学习  初弈号  深度学习  强化学习  人工智能  
An adaptive dynamic programming based method for optimization of electricity consumption in office buildings 会议论文
, Vancouver, Canada, Jul. 23–29, 2016
作者:  Shi, Guang;  Wei, Qinglai;  Liu, Derong
浏览  |  Adobe PDF(215Kb)  |  收藏  |  浏览/下载:273/74  |  提交时间:2017/05/25
Convolutional fitted Q iteration for vision-based control problems 会议论文
, Vancouver, BC, Canada, 24-29 July 2016
作者:  Zhao Dongbin;  Zhu Yuanheng;  Lv Le;  Chen Yaran;  Zhang Qichao
浏览  |  Adobe PDF(240Kb)  |  收藏  |  浏览/下载:385/130  |  提交时间:2017/05/08