CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Convolutional fitted Q iteration for vision-based control problems 会议论文
, Vancouver, BC, Canada, 24-29 July 2016
作者:  Zhao Dongbin;  Zhu Yuanheng;  Lv Le;  Chen Yaran;  Zhang Qichao
浏览  |  Adobe PDF(240Kb)  |  收藏  |  浏览/下载:335/114  |  提交时间:2017/05/08
Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations 会议论文
, Vancouver, Canada, 2016-7
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng;  Chen, Xi
浏览  |  Adobe PDF(339Kb)  |  收藏  |  浏览/下载:260/85  |  提交时间:2017/05/04
Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2016, 卷号: 46, 期号: 11, 页码: 1544-1555
作者:  Wang, Ding;  Liu, Derong;  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(1082Kb)  |  收藏  |  浏览/下载:423/193  |  提交时间:2017/02/14
Adaptive Critic Designs  Adaptive Dynamic Programming  Intelligent Control  Neural Networks  Policy Iteration  Robust Optimal Control  System Identification  Uncertain Nonlinear Systems  
Fuzzy-Based Goal Representation Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2016, 卷号: 24, 期号: 5, 页码: 1159-1175
作者:  Tang, Yufei;  He, Haibo;  Ni, Zhen;  Zhong, Xiangnan;  Zhao, Dongbin;  Xu, Xin
收藏  |  浏览/下载:197/0  |  提交时间:2017/02/14
Adaptive Dynamic Programming (Adp)  Fuzzy Hyperbolic Model (Fhm)  Goal Representation Adaptive Dynamic Programming (Gradp)  Internal Goal Representation  Multimachine Power Systems  Stability Analysis  
Deep reinforcement learning with Experience Replay based on SARSA 会议论文
, *, 2016-9
作者:  Zhao,Dongbin(赵冬斌);  Wang,Haitao;  Shao,Kun;  Zhu,Yuanheng
浏览  |  Adobe PDF(1288Kb)  |  收藏  |  浏览/下载:381/172  |  提交时间:2018/01/04
Deep Learning  Reinforcement Learning  Experience Replay  q Learning  Sarsa Learning  
A Visual Attention based Convolutional Neural Network for Image Classification 会议论文
, Guilin, China, 12-15 June 2016
作者:  Chen Yaran;  Zhao Dongbin;  Lv Le;  Li Chengdong
浏览  |  Adobe PDF(540Kb)  |  收藏  |  浏览/下载:408/169  |  提交时间:2017/05/08
Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2016, 卷号: 10, 期号: 12, 页码: 1339-1347
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
浏览  |  Adobe PDF(976Kb)  |  收藏  |  浏览/下载:370/161  |  提交时间:2016/12/26
Nonlinear Control Systems  Continuous Time Systems  Learning (Artificial Intelligence)  Optimal Control  Dynamic Programming  Lyapunov Methods  Linear Systems  Reinforcement Learning  Continuous-time Problem  Nonlinear Optimal Tracking Problem  Adaptive Dynamic Programming  Model-free Adaptive Optimal Tracking Algorithm  Lyapunov Analysis  Linear System  
Online reinforcement learning control by Bayesian inference 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2016, 卷号: 10, 期号: 12, 页码: 1331-1338
作者:  Xia, Zhongpu;  Zhao, Dongbin;  Dongbin Zhao
浏览  |  Adobe PDF(1559Kb)  |  收藏  |  浏览/下载:326/112  |  提交时间:2016/06/15
Learning Systems  Bayes Methods  Gaussian Processes  Optimal Control  Online Reinforcement Learning Control  Bayesian Inference  Self-learning Control  Probability  Action Value Function  Gaussian Process  Bayesian-state-action-reward-state-action Algorithm  
A perturbed Gaussian process regression with chunk sparsification for tracking non-stationary systems 会议论文
, Yinchuan, China, 28-30 May 2016
作者:  Li, Dong;  Zhao, Dongbin;  Xia, Zhongpu
浏览  |  Adobe PDF(195Kb)  |  收藏  |  浏览/下载:223/51  |  提交时间:2017/12/28
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 3, 页码: 854-865
作者:  Zhao, Dongbin;  Zhang, Qichao;  Wang, Ding;  Zhu, Yuanheng
浏览  |  Adobe PDF(1769Kb)  |  收藏  |  浏览/下载:476/191  |  提交时间:2016/06/14
Adaptive Dynamic Programming (Adp)  Experience Replay  Nonzero-sum (Nzs) Games  Optimal Control  Unknown Dynamics