CASIA OpenIR

Browse/Search results: 5 items in total, showing items 1-5

Comprehensive comparison of online ADP algorithms for continuous-time optimal control (Journal article)
ARTIFICIAL INTELLIGENCE REVIEW, 2018, Volume: 49, Issue: 4, Pages: 531-547
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  Views/Downloads: 406/180  |  Submitted: 2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Deep Reinforcement Learning With Visual Attention for Vehicle Classification (Journal article)
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2017, Volume: 9, Issue: 4, Pages: 356-367
Authors:  Zhao, Dongbin;  Chen, Yaran;  Lv, Le
Adobe PDF(3192Kb)  |  Views/Downloads: 1026/537  |  Submitted: 2017/05/08
Convolutional Neural Network (CNN)  Reinforcement Learning  Vehicle Classification  Visual Attention  
FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 6, Pages: 1367-1379
Authors:  Zhang, Zhen;  Zhao, Dongbin;  Gao, Junwei;  Wang, Dongqing;  Dai, Yujie
Views/Downloads: 281/0  |  Submitted: 2017/07/18
Multiagent Reinforcement Learning (MARL)  Nash Equilibrium  Q-learning  Repeated Game  
Online reinforcement learning control by Bayesian inference (Journal article)
IET CONTROL THEORY AND APPLICATIONS, 2016, Volume: 10, Issue: 12, Pages: 1331-1338
Authors:  Xia, Zhongpu;  Zhao, Dongbin
Adobe PDF(1559Kb)  |  Views/Downloads: 339/113  |  Submitted: 2016/06/15
Learning Systems  Bayes Methods  Gaussian Processes  Optimal Control  Online Reinforcement Learning Control  Bayesian Inference  Self-learning Control  Probability  Action Value Function  Gaussian Process  Bayesian-state-action-reward-state-action Algorithm  
Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics (Journal article)
IET CONTROL THEORY AND APPLICATIONS, 2016, Volume: 10, Issue: 12, Pages: 1339-1347
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
Adobe PDF(976Kb)  |  Views/Downloads: 387/161  |  Submitted: 2016/12/26
Nonlinear Control Systems  Continuous Time Systems  Learning (Artificial Intelligence)  Optimal Control  Dynamic Programming  Lyapunov Methods  Linear Systems  Reinforcement Learning  Continuous-time Problem  Nonlinear Optimal Tracking Problem  Adaptive Dynamic Programming  Model-free Adaptive Optimal Tracking Algorithm  Lyapunov Analysis  Linear System