CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Generalized Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 卷号: 45, 期号: 12, 页码: 1577-1591
作者:  Liu, Derong;  Wei, Qinglai;  Yan, Pengfei
浏览  |  Adobe PDF(1540Kb)  |  收藏  |  浏览/下载:218/67  |  提交时间:2016/03/19
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration  Neural Networks  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
A Basal Ganglia Network Centric Autonomous Learning Model and Its Application in Unmanned Aerial Vehicle 会议论文
Conferences on the 7th International Conference on Brain-inspired Cognitive System, 安徽合肥, 2015年12月11-13日
作者:  Yi, Zeng;  Guixiang, Wang;  Bo, Xu;  Yi Zeng
Microsoft Word(6365Kb)  |  收藏  |  浏览/下载:472/134  |  提交时间:2016/12/09
Autonomous Learning Model  Basal Ganglia Network  Precise Encoding  Uav Autonomous Learning  Reinforcement Learning  Interactive Environment.  
Reinforcement learning solution for HJB equation arising in constrained optimal control problem 期刊论文
NEURAL NETWORKS, 2015, 卷号: 71, 期号: 0, 页码: 150-158
作者:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen;  Liu, Derong
浏览  |  Adobe PDF(530Kb)  |  收藏  |  浏览/下载:445/195  |  提交时间:2016/03/30
Constrained Optimal Control  Data-based  Off-policy Reinforcement Learning  Hamilton-jacobi-bellman Equation  The Method Of Weighted Residuals  
Online Reinforcement Learning by Bayesian Inference 会议论文
Proceedings of International Joint Conference on Neural Networks 2015, Ireland, 2015年7月
作者:  Xia ZP(夏中谱);  Dongbin Zhao
浏览  |  Adobe PDF(751Kb)  |  收藏  |  浏览/下载:277/89  |  提交时间:2016/06/15
Reinforcement Learning  Bayesian Inference  Gaussian Processes  
Reinforcement-Learning-Based Robust Controller Design for Continuous-Time Uncertain Nonlinear Systems Subject to Input Constraints 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 卷号: 45, 期号: 7, 页码: 1372-1385
作者:  Liu, Derong;  Yang, Xiong;  Wang, Ding;  Wei, Qinglai
浏览  |  Adobe PDF(1179Kb)  |  收藏  |  浏览/下载:461/242  |  提交时间:2015/09/17
Approximate Dynamic Programming (Adp)  Neural Networks (Nns)  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning (Rl)  Robust Control  
基于监督式自适应动态规划的车辆智能巡航控制 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2015
作者:  王滨
Adobe PDF(2069Kb)  |  收藏  |  浏览/下载:740/2  |  提交时间:2015/09/02
自适应巡航控制  自适应动态规划  监督式强化学习  智能控制  Dspace  Adaptive Cruise Control  Adaptive Dynamic Programming  Supervised Reinforcement Learning  Intelligent Control  Dspace  
连续状态系统的近似最优在线强化学习 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2015
作者:  朱圆恒
Adobe PDF(2679Kb)  |  收藏  |  浏览/下载:485/0  |  提交时间:2015/09/02
强化学习  最优控制  近似策略迭代  概率近似最优  连续状态系统  收敛性  在线学习  Kd树  Reinforcement Learning  Optimal Control  Approximate Policy Iteration  Probably Approximately Correct  Continuous-state System  Convergence  Online Learning  Kd-tree  
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2015, 卷号: 26, 期号: 4, 页码: 775-787
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1331Kb)  |  收藏  |  浏览/下载:247/59  |  提交时间:2015/09/21
Reinforcement Learning  Probably Approximately Correct  Kd-tree  
Infinite Horizon Self-Learning Optimal Control of Nonaffine Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 866-879
作者:  Wei, Qinglai;  Liu, Derong;  Yang, Xiong
Adobe PDF(2408Kb)  |  收藏  |  浏览/下载:277/109  |  提交时间:2015/09/21
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration  Neural Networks (Nns)  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 3, 页码: 614-627
作者:  Ni, Zhen;  He, Haibo;  Zhao, Dongbin;  Xu, Xin;  Prokhorov, Danil V.
收藏  |  浏览/下载:176/0  |  提交时间:2015/09/21
Adaptive Control  Adaptive Dynamic Programming (Adp)  Dual Heuristic Dynamic Programming (Dhp)  General Utility Function  Goal Representation  Reinforcement Learning (Rl)