CASIA OpenIR

Browse/search results: 11 items in total, items 1-10 shown

Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning [Journal article]
INFORMATION SCIENCES, 2016, Volume: 369, Pages: 731-747
Authors: Yang, Xiong; Liu, Derong; Luo, Biao; Li, Chao
Views/Downloads: 214/0 | Submitted: 2016/12/26
Adaptive Dynamic Programming  Input Constraint  Neural Networks  Optimal Control  Reinforcement Learning  Robust Control  
A Virtual Character Learns to Defend Himself in Sword Fighting Based on Q-Network [Conference paper]
San Jose, CA, USA, 2016-11-6
Authors: Wang YM (王雨萌); Li E (李尔); Wang F (王丰); Xu B (徐波)
Views/Downloads: 71/0 | Submitted: 2020/10/27
Human-Computer Interaction  Reinforcement Learning  Q-Network  Character Animation
Deep reinforcement learning with Experience Replay based on SARSA [Conference paper]
2016-9
Authors: Zhao, Dongbin (赵冬斌); Wang, Haitao; Shao, Kun; Zhu, Yuanheng
Adobe PDF (1288 KB) | Views/Downloads: 392/174 | Submitted: 2018/01/04
Deep Learning  Reinforcement Learning  Experience Replay  Q Learning  SARSA Learning
Online reinforcement learning control by Bayesian inference [Journal article]
IET CONTROL THEORY AND APPLICATIONS, 2016, Volume: 10, Issue: 12, Pages: 1331-1338
Authors: Xia, Zhongpu; Zhao, Dongbin
Adobe PDF (1559 KB) | Views/Downloads: 340/113 | Submitted: 2016/06/15
Learning Systems  Bayes Methods  Gaussian Processes  Optimal Control  Online Reinforcement Learning Control  Bayesian Inference  Self-learning Control  Probability  Action Value Function  Gaussian Process  Bayesian-state-action-reward-state-action Algorithm  
Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics [Journal article]
IET CONTROL THEORY AND APPLICATIONS, 2016, Volume: 10, Issue: 12, Pages: 1339-1347
Authors: Zhu, Yuanheng; Zhao, Dongbin; Li, Xiangjun
Adobe PDF (976 KB) | Views/Downloads: 390/162 | Submitted: 2016/12/26
Nonlinear Control Systems  Continuous Time Systems  Learning (Artificial Intelligence)  Optimal Control  Dynamic Programming  Lyapunov Methods  Linear Systems  Reinforcement Learning  Continuous-time Problem  Nonlinear Optimal Tracking Problem  Adaptive Dynamic Programming  Model-free Adaptive Optimal Tracking Algorithm  Lyapunov Analysis  Linear System  
Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2016, Volume: 46, Issue: 3, Pages: 840-853
Authors: Wei, Qinglai; Liu, Derong; Lin, Hanquan
Adobe PDF (2015 KB) | Views/Downloads: 374/162 | Submitted: 2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Neural Networks  Neuro-dynamic Programming  Optimal Control  Reinforcement Learning  Value Iteration
Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach [Journal article]
SOFT COMPUTING, 2016, Volume: 20, Issue: 2, Pages: 697-706
Authors: Wei, Qinglai; Liu, Derong; Xu, Yancai
Adobe PDF (790 KB) | Views/Downloads: 430/159 | Submitted: 2016/06/14
Adaptive Dynamic Programming  Approximate Dynamic Programming  Adaptive Critic Designs  Optimal Control  Neural Networks  Nonlinear Systems  Reinforcement Learning  
Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, Volume: 27, Issue: 2, Pages: 444-458
Authors: Wei, Qinglai; Song, Ruizhuo; Yan, Pengfei
Adobe PDF (2204 KB) | Views/Downloads: 404/137 | Submitted: 2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Recurrent Neural Network (RNN)  Reinforcement Learning
Online approximate solution of HJI equation for unknown constrained-input nonlinear continuous-time systems [Journal article]
INFORMATION SCIENCES, 2016, Volume: 328, Pages: 435-454
Authors: Yang, Xiong; Liu, Derong; Ma, Hongwen; Xu, Yancai
Adobe PDF (833 KB) | Views/Downloads: 406/116 | Submitted: 2016/01/18
Adaptive Dynamic Programming  Hamilton-Jacobi-Isaacs Equation  Input Constraint  Neural Network  Optimal Control  Reinforcement Learning
Traffic Signal Timing via Deep Reinforcement Learning [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2016, Issue: 3, Pages: 247-254
Authors: Li Li; Lv YS (吕宜生); Fei-Yue Wang
Adobe PDF (509 KB) | Views/Downloads: 71/34 | Submitted: 2022/04/08
Traffic Control  Reinforcement Learning  Deep Learning  Deep Reinforcement Learning