CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共12条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems 期刊论文
COGNITIVE COMPUTATION, 2015, 卷号: 7, 期号: 6, 页码: 763-771
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
Adobe PDF(809Kb)  |  收藏  |  浏览/下载:245/38  |  提交时间:2016/01/18
Approximate Policy Iteration  Approximation Error  Optimal Control  Fuzzy Approximator  
Event-Triggered H∞ Control for Continuous-Time Nonlinear System 会议论文
, South Korea, 2015-11
作者:  Zhao, Dongbin;  Zhang, Qichao;  Li, Xiangjun;  Kong, Lingda
Adobe PDF(365Kb)  |  收藏  |  浏览/下载:279/129  |  提交时间:2017/05/04
Model-Free Optimal Control for Affine Nonlinear Systems With Convergence Analysis 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2015, 卷号: 12, 期号: 4, 页码: 1461-1468
作者:  Zhao, Dongbin;  Xia, Zhongpu;  Wang, Ding
Adobe PDF(1985Kb)  |  收藏  |  浏览/下载:335/87  |  提交时间:2015/11/12
Action Dependent Heuristic Dynamic Programming  Adaptive Dynamic Programming  Model-free Optimal Control  Neural Networks  Policy Iteration  
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2015, 卷号: 26, 期号: 4, 页码: 775-787
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1331Kb)  |  收藏  |  浏览/下载:252/60  |  提交时间:2015/09/21
Reinforcement Learning  Probably Approximately Correct  Kd-tree  
Online Synchronous Policy Iteration Based on Concurrent Learning to Solve Continuous-time Optimal Control Problem 会议论文
Proceedings of International Conference on Information Science and Technology, Changsha, 2015.4.25~4.27
作者:  Haitao Wang;  Dongbin Zhao;  Chengdong Li
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:260/97  |  提交时间:2016/06/15
GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 3, 页码: 614-627
作者:  Ni, Zhen;  He, Haibo;  Zhao, Dongbin;  Xu, Xin;  Prokhorov, Danil V.
收藏  |  浏览/下载:184/0  |  提交时间:2015/09/21
Adaptive Control  Adaptive Dynamic Programming (Adp)  Dual Heuristic Dynamic Programming (Dhp)  General Utility Function  Goal Representation  Reinforcement Learning (Rl)  
Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems 期刊论文
NEUROCOMPUTING, 2015, 卷号: 149, 页码: 124-131
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Liu, Derong
Adobe PDF(860Kb)  |  收藏  |  浏览/下载:266/99  |  提交时间:2015/10/13
Discrete-time Nonlinear System  T-s Fuzzy System  Hdp  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
作者:  Zhao, Dongbin;  Zhu, Yuanheng
Adobe PDF(2156Kb)  |  收藏  |  浏览/下载:258/106  |  提交时间:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation  
Clique-based cooperative multiagent reinforcement learning using factor graphs 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2015, 卷号: 3, 期号: 1, 页码: 248-256
作者:  Zhang,Zhen;  Zhao DB(赵冬斌)
Adobe PDF(707Kb)  |  收藏  |  浏览/下载:199/84  |  提交时间:2017/12/30
Reinforcement Learning  Factor Graphs  
Event-Triggered H∞ Control for Continuous-Time Nonlinear System 会议论文
, Jeju, South Korea, October 15-18
作者:  Zhao,Dongbin;  Zhang,Qichao;  Li,Xiangjun;  Kong,Lingda
Adobe PDF(365Kb)  |  收藏  |  浏览/下载:173/43  |  提交时间:2017/12/28