CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共10条,第1-10条 帮助

限定条件                                
已选(0)清除 条数/页:   排序方式:
computational intelligence for changing environments 期刊论文
IEEE Computational Intelligence Magazine, 2015, 卷号: 11, 期号: 10(4), 页码: 10-11
作者:  Amir,Hussain;  Dacheng,Tao;  Jonathan,Wu;  Zhao,Dongbin(赵冬斌)
浏览  |  Adobe PDF(194Kb)  |  收藏  |  浏览/下载:245/65  |  提交时间:2017/12/30
Behavioral Science  Guest Editorial  Computational Intelligence  
Clique-based cooperative multiagent reinforcement learning using factor graphs 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2015, 卷号: 3, 期号: 1, 页码: 248-256
作者:  Zhang,Zhen;  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(707Kb)  |  收藏  |  浏览/下载:242/98  |  提交时间:2017/12/30
Reinforcement Learning  Factor Graphs  
Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems 期刊论文
COGNITIVE COMPUTATION, 2015, 卷号: 7, 期号: 6, 页码: 763-771
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
Adobe PDF(809Kb)  |  收藏  |  浏览/下载:277/43  |  提交时间:2016/01/18
Approximate Policy Iteration  Approximation Error  Optimal Control  Fuzzy Approximator  
Machine Learning with Applications to Autonomous Systems 期刊论文
MATHEMATICAL PROBLEMS IN ENGINEERING, 2015
作者:  Xu, Xin;  He, Haibo;  Zhao, Dongbin;  Sun, Shiliang;  Busoniu, Lucian;  Yang, Simon X.
收藏  |  浏览/下载:196/0  |  提交时间:2016/01/18
Model-Free Optimal Control for Affine Nonlinear Systems With Convergence Analysis 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2015, 卷号: 12, 期号: 4, 页码: 1461-1468
作者:  Zhao, Dongbin;  Xia, Zhongpu;  Wang, Ding
浏览  |  Adobe PDF(1985Kb)  |  收藏  |  浏览/下载:380/101  |  提交时间:2015/11/12
Action Dependent Heuristic Dynamic Programming  Adaptive Dynamic Programming  Model-free Optimal Control  Neural Networks  Policy Iteration  
Computational Energy Management in Smart Grids 期刊论文
NEUROCOMPUTING, 2015, 卷号: 170, 页码: 267-269
作者:  Squartini, Stefano;  Liu, Derong;  Piazza, Francesco;  Zhao, Dongbin;  He, Haibo
收藏  |  浏览/下载:290/0  |  提交时间:2015/11/04
Energy Management  Computational Intelligence  Smart Grids  
Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems 期刊论文
NEUROCOMPUTING, 2015, 卷号: 149, 页码: 124-131
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Liu, Derong
浏览  |  Adobe PDF(860Kb)  |  收藏  |  浏览/下载:319/115  |  提交时间:2015/10/13
Discrete-time Nonlinear System  T-s Fuzzy System  Hdp  
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2015, 卷号: 26, 期号: 4, 页码: 775-787
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1331Kb)  |  收藏  |  浏览/下载:286/64  |  提交时间:2015/09/21
Reinforcement Learning  Probably Approximately Correct  Kd-tree  
GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 3, 页码: 614-627
作者:  Ni, Zhen;  He, Haibo;  Zhao, Dongbin;  Xu, Xin;  Prokhorov, Danil V.
收藏  |  浏览/下载:203/0  |  提交时间:2015/09/21
Adaptive Control  Adaptive Dynamic Programming (Adp)  Dual Heuristic Dynamic Programming (Dhp)  General Utility Function  Goal Representation  Reinforcement Learning (Rl)  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
作者:  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2156Kb)  |  收藏  |  浏览/下载:300/118  |  提交时间:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation