CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
A Novel Iterative theta-Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 卷号: 11, 期号: 4, 页码: 1176-1190
作者:  Wei, Qinglai;  Liu, Derong;  Derong Liu
浏览  |  Adobe PDF(4364Kb)  |  收藏  |  浏览/下载:264/98  |  提交时间:2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming  Approximate Dynamic Programming  Neural Networks  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Policy Iteration  Reinforcement Learning  Value Iteration  
Online Synchronous Approximate Optimal Learning Algorithm for Multiplayer Nonzero-Sum Games With Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2014, 卷号: 44, 期号: 8, 页码: 1015-1027
作者:  Liu, Derong;  Li, Hongliang;  Wang, Ding
Adobe PDF(20912Kb)  |  收藏  |  浏览/下载:217/79  |  提交时间:2015/08/12
Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Multiplayer Nonzero-sum Games  Neural Networks  Neuro-dynamic Programming  Policy Iteration  
Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 卷号: 11, 期号: 3, 页码: 706-714
作者:  Li, Hongliang;  Liu, Derong;  Wang, Ding
Adobe PDF(1753Kb)  |  收藏  |  浏览/下载:279/100  |  提交时间:2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming  Approximate Dynamic Programming  Reinforcement Learning  Policy Iteration  Zero-sum Games  
基于强化学习的非线性系统自适应优化控制研究 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2014
作者:  杨雄
Adobe PDF(2445Kb)  |  收藏  |  浏览/下载:1110/0  |  提交时间:2015/09/02
非线性系统  强化学习  神经网络  最优控制  智能控制  Nonlinear System  Reinforcement Learning  Neural Network  Optimal Control  Intelligent Control  
Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 卷号: 25, 期号: 3, 页码: 621-634
作者:  Liu, Derong;  Wei, Qinglai
Adobe PDF(2635Kb)  |  收藏  |  浏览/下载:200/81  |  提交时间:2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Discrete-time Policy Iteration  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
Online reinforcement learning for continuous-state systems 专著章节/文集论文
出自: Frontiers of Intelligent Control and Information Processing, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore:World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, 2014
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
Adobe PDF(24150Kb)  |  收藏  |  浏览/下载:242/27  |  提交时间:2017/09/13