CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共12条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 3, 页码: 854-865
作者:  Zhao, Dongbin;  Zhang, Qichao;  Wang, Ding;  Zhu, Yuanheng
浏览  |  Adobe PDF(1769Kb)  |  收藏  |  浏览/下载:500/195  |  提交时间:2016/06/14
Adaptive Dynamic Programming (Adp)  Experience Replay  Nonzero-sum (Nzs) Games  Optimal Control  Unknown Dynamics  
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2015, 卷号: 26, 期号: 4, 页码: 775-787
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1331Kb)  |  收藏  |  浏览/下载:252/60  |  提交时间:2015/09/21
Reinforcement Learning  Probably Approximately Correct  Kd-tree  
Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems 期刊论文
NEUROCOMPUTING, 2015, 卷号: 149, 页码: 124-131
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Liu, Derong
浏览  |  Adobe PDF(860Kb)  |  收藏  |  浏览/下载:266/99  |  提交时间:2015/10/13
Discrete-time Nonlinear System  T-s Fuzzy System  Hdp  
A data-based online reinforcement learning algorithm with high-efficient exploration 会议论文
, Orlando, FL, USA, Dec, 2014
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(407Kb)  |  收藏  |  浏览/下载:201/79  |  提交时间:2017/09/13
Online reinforcement learning for continuous-state systems 专著章节/文集论文
出自: Frontiers of Intelligent Control and Information Processing, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore:World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, 2014
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
Adobe PDF(24150Kb)  |  收藏  |  浏览/下载:242/27  |  提交时间:2017/09/13
A neural-network-based iterative GDHP approach for solving a class of nonlinear optimal control problems with control constraints 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2013, 卷号: 22, 期号: 2, 页码: 219-227
作者:  Wang, Ding;  Liu, Derong;  Zhao, Dongbin;  Huang, Yuzhu;  Zhang, Dehua
Adobe PDF(507Kb)  |  收藏  |  浏览/下载:237/66  |  提交时间:2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming  Approximate Dynamic Programming  Neural Dynamic Programming  Neural Networks  Optimal Control  Reinforcement Learning  
Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming 期刊论文
AUTOMATICA, 2012, 卷号: 48, 期号: 8, 页码: 1825-1832
作者:  Wang, Ding;  Liu, Derong;  Wei, Qinglai;  Zhao, Dongbin;  Jin, Ning
Adobe PDF(598Kb)  |  收藏  |  浏览/下载:352/138  |  提交时间:2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming  Approximate Dynamic Programming  Globalized Dual Heuristic Programming  Intelligent Control  Neural Network  Optimal Control  
Neural-Network-Based Optimal Control for a Class of Unknown Discrete-Time Nonlinear Systems Using Globalized Dual Heuristic Programming 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2012, 卷号: 9, 期号: 3, 页码: 628-634
作者:  Liu, Derong;  Wang, Ding;  Zhao, Dongbin;  Wei, Qinglai;  Jin, Ning
Adobe PDF(364Kb)  |  收藏  |  浏览/下载:312/112  |  提交时间:2015/08/12
Adaptive Dynamic Programming  Approximate Dynamic Programming  Globalized Dual Heuristic Programming  Intelligent Control  Neural Networks  Optimal Control  
Neural and Fuzzy Dynamic Programming for Under-actuated Systems 会议论文
International Joint Conference on Neural Networks (IJCNN), Brisbane, Australia, 2012
作者:  Zhao, Dongbin;  Zhu, Yuanheng;  He, Haibo
浏览  |  Adobe PDF(955Kb)  |  收藏  |  浏览/下载:249/68  |  提交时间:2015/08/19
DHP Method for Ramp Metering of Freeway Traffic 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 卷号: 12, 期号: 4, 页码: 990-999
作者:  Zhao, Dongbin;  Bai, Xuerui;  Wang, Fei-Yue;  Xu, Jing;  Yu, Wensheng;  Fei-Yue Wang
Adobe PDF(827Kb)  |  收藏  |  浏览/下载:237/73  |  提交时间:2015/08/12
Congestion  Dual Heuristic Programming (Dhp)  Ramp Metering  Traffic Control