CASIA OpenIR

Browse/search results: 11 items in total, showing items 1-10

Finite-Approximation-Error-Based Discrete-Time Iterative Adaptive Dynamic Programming (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2014, Volume: 44, Issue: 12, Pages: 2820-2833
Authors:  Wei, Qinglai;  Wang, Fei-Yue;  Liu, Derong;  Yang, Xiong
Adobe PDF (1826Kb)  |  Views/Downloads: 300/112  |  Submitted: 2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Approximation Error  Neural Networks  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  Value Iteration
Online Synchronous Approximate Optimal Learning Algorithm for Multiplayer Nonzero-Sum Games With Unknown Dynamics (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2014, Volume: 44, Issue: 8, Pages: 1015-1027
Authors:  Liu, Derong;  Li, Hongliang;  Wang, Ding
Adobe PDF (20912Kb)  |  Views/Downloads: 220/80  |  Submitted: 2015/08/12
Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Multiplayer Nonzero-sum Games  Neural Networks  Neuro-dynamic Programming  Policy Iteration
Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics (Journal article)
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, Volume: 11, Issue: 3, Pages: 706-714
Authors:  Li, Hongliang;  Liu, Derong;  Wang, Ding
Adobe PDF (1753Kb)  |  Views/Downloads: 282/100  |  Submitted: 2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming  Approximate Dynamic Programming  Reinforcement Learning  Policy Iteration  Zero-sum Games  
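Research on Adaptive Optimal Control of Nonlinear Systems Based on Reinforcement Learning (基于强化学习的非线性系统自适应优化控制研究) (Thesis)
Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2014
Author:  Yang Xiong (杨雄)
Adobe PDF (2445Kb)  |  Views/Downloads: 1112/0  |  Submitted: 2015/09/02
Nonlinear System  Reinforcement Learning  Neural Network  Optimal Control  Intelligent Control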
Data-based Suboptimal Neuro-control Design with Reinforcement Learning for Dissipative Spatially Distributed Processes (Journal article)
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2014, Volume: 53, Issue: 19, Pages: 8106-8119
Authors:  Luo, Biao;  Wu, Huai-Ning;  Li, Han-Xiong
Adobe PDF (5283Kb)  |  Views/Downloads: 232/70  |  Submitted: 2015/08/12
Data-based  
Stable iterative adaptive dynamic programming algorithm with approximation errors for discrete-time nonlinear systems (Journal article)
NEURAL COMPUTING & APPLICATIONS, 2014, Volume: 24, Issue: 6, Pages: 1355-1367
Authors:  Wei, Qinglai;  Liu, Derong
Adobe PDF (791Kb)  |  Views/Downloads: 206/54  |  Submitted: 2015/08/12
Adaptive Dynamic Programming  Approximate Dynamic Programming  Adaptive Critic Designs  Optimal Control  Neural Networks  Nonlinear Systems  
Local joint information based active fault tolerant control for reconfigurable manipulator (Journal article)
Nonlinear Dynamics, 2014, Volume: 77, Issue: 3, Pages: 859-876
Author:  Zhao B (赵博)
Adobe PDF (1165Kb)  |  Views/Downloads: 175/46  |  Submitted: 2018/10/14
Reconfigurable Manipulators  Local Joint Information  Fault Detection And Identification  Active Fault Tolerant Control  Nonlinear Velocity Observer  Radial Basis Function Neural Network  
Workflow performance analysis and simulation based on multidimensional workflow net (Journal article)
COMPUTERS IN INDUSTRY, 2014, Volume: 65, Issue: 2, Pages: 333-344
Authors:  Liu Sheng;  Fan Yushun
Adobe PDF (565Kb)  |  Views/Downloads: 248/75  |  Submitted: 2015/08/12
MWF-net  Performance Analysis  Dwelling Time  Probability Density
A data-based online reinforcement learning algorithm with high-efficient exploration (Conference paper)
Orlando, FL, USA, Dec 2014
Authors:  Yuanheng Zhu;  Zhao DB (赵冬斌)
Adobe PDF (407Kb)  |  Views/Downloads: 204/79  |  Submitted: 2017/09/13
Online reinforcement learning for continuous-state systems (Book chapter / Collection paper)
In: Frontiers of Intelligent Control and Information Processing, Singapore: World Scientific, 2014
Authors:  Yuanheng Zhu;  Zhao DB (赵冬斌)
Adobe PDF (24150Kb)  |  Views/Downloads: 243/27  |  Submitted: 2017/09/13