CASIA OpenIR

Browse/Search results: 13 items in total, showing 1-10

Traffic Signal Timing via Deep Reinforcement Learning (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2016, Issue: 3, Pages: 247-254
Authors:  Li Li;  Lv YS(吕宜生);  Fei-Yue Wang
Adobe PDF(509Kb)  |  Views/Downloads: 73/36  |  Submitted: 2022/04/08
Traffic Control  Reinforcement Learning  Deep Learning  Deep Reinforcement Learning
Minimum parameter learning method for an N-link manipulator with nonlinear disturbance observer (Journal article)
International Journal of Robotics and Automation, 2016, Volume: 31, Issue: 3, Pages: 206-212
Authors:  Hongjun Yang;  Jinkun Liu
Views/Downloads: 100/0  |  Submitted: 2019/09/23
Minimum Parameter Learning  Adaptive Control  Disturbance Observer  RBF Neural Networks  N-link Manipulator
Guaranteed cost neural tracking control for a class of uncertain nonlinear systems using adaptive dynamic programming (Journal article)
Neurocomputing, 2016, Volume: 198, Issue: wu, Pages: 80
Authors:  Yang, Xiong;  Liu, Derong;  Wei, Qinglai;  Wang, Ding
Adobe PDF(1487Kb)  |  Views/Downloads: 300/125  |  Submitted: 2017/02/23
Adaptive Dynamic Programming  Guaranteed Cost Control  Hamilton-Jacobi-Bellman Equation  Neural Network  Nonlinear System  Reinforcement Learning
Decentralized guaranteed cost control of interconnected systems with uncertainties: A learning-based optimal control strategy (Journal article)
NEUROCOMPUTING, 2016, Volume: 214, Pages: 297-306
Authors:  Wang, Ding;  Liu, Derong;  Mu, Chaoxu;  Ma, Hongwen
Adobe PDF(1113Kb)  |  Views/Downloads: 412/122  |  Submitted: 2017/02/14
Adaptive Dynamic Programming  Decentralized Control  Guaranteed Cost Control  Interconnected Systems  Learning Control  Neural Networks  Optimal Control  Uncertain Plant  
Event-based input-constrained nonlinear H infinity state feedback with adaptive critic and neural implementation (Journal article)
NEUROCOMPUTING, 2016, Volume: 214, Issue: *, Pages: 848-856
Authors:  Wang, Ding;  Mu, Chaoxu;  Zhang, Qichao;  Liu, Derong
Adobe PDF(1090Kb)  |  Views/Downloads: 353/135  |  Submitted: 2017/02/14
Adaptive Critic Learning (ACL)  Adaptive Dynamic Programming (ADP)  Event-based Control  Hamilton-Jacobi-Isaacs (HJI) Equation  Input Constraints  Neural Networks  Nonlinear H-infinity Control  State Feedback
Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics (Journal article)
IET CONTROL THEORY AND APPLICATIONS, 2016, Volume: 10, Issue: 12, Pages: 1339-1347
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
Adobe PDF(976Kb)  |  Views/Downloads: 407/167  |  Submitted: 2016/12/26
Nonlinear Control Systems  Continuous Time Systems  Learning (Artificial Intelligence)  Optimal Control  Dynamic Programming  Lyapunov Methods  Linear Systems  Reinforcement Learning  Continuous-time Problem  Nonlinear Optimal Tracking Problem  Adaptive Dynamic Programming  Model-free Adaptive Optimal Tracking Algorithm  Lyapunov Analysis  Linear System  
Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning (Journal article)
INFORMATION SCIENCES, 2016, Volume: 369, Pages: 731-747
Authors:  Yang, Xiong;  Liu, Derong;  Luo, Biao;  Li, Chao
Views/Downloads: 217/0  |  Submitted: 2016/12/26
Adaptive Dynamic Programming  Input Constraint  Neural Networks  Optimal Control  Reinforcement Learning  Robust Control  
Model-Free Optimal Tracking Control via Critic-Only Q-Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, Volume: 27, Issue: 10, Pages: 2134-2144
Authors:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Wang, Ding
Adobe PDF(1521Kb)  |  Views/Downloads: 581/284  |  Submitted: 2016/10/24
Critic-only Q-learning (CoQL)  Model-free  Nonaffine Nonlinear Systems  Optimal Tracking Control
Online reinforcement learning control by Bayesian inference (Journal article)
IET CONTROL THEORY AND APPLICATIONS, 2016, Volume: 10, Issue: 12, Pages: 1331-1338
Authors:  Xia, Zhongpu;  Zhao, Dongbin
Adobe PDF(1559Kb)  |  Views/Downloads: 346/115  |  Submitted: 2016/06/15
Learning Systems  Bayes Methods  Gaussian Processes  Optimal Control  Online Reinforcement Learning Control  Bayesian Inference  Self-learning Control  Probability  Action Value Function  Gaussian Process  Bayesian-state-action-reward-state-action Algorithm  
Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach (Journal article)
SOFT COMPUTING, 2016, Volume: 20, Issue: 2, Pages: 697-706
Authors:  Wei, Qinglai;  Liu, Derong;  Xu, Yancai
Adobe PDF(790Kb)  |  Views/Downloads: 451/165  |  Submitted: 2016/06/14
Adaptive Dynamic Programming  Approximate Dynamic Programming  Adaptive Critic Designs  Optimal Control  Neural Networks  Nonlinear Systems  Reinforcement Learning