CASIA OpenIR

Browse/Search Results:  1-7 of 7 Help

Filters    
Selected(0)Clear Items/Page:    Sort:
Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems 期刊论文
COGNITIVE COMPUTATION, 2015, 卷号: 7, 期号: 6, 页码: 763-771
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
Adobe PDF(809Kb)  |  Favorite  |  View/Download:61/9  |  Submit date:2016/01/18
Approximate Policy Iteration  Approximation Error  Optimal Control  Fuzzy Approximator  
Thermal Comfort Control Based on MEC Algorithm for HVAC System 会议论文
, Killarney, Ireland, 12-17 July 2015
Authors:  Li, Dong;  Zhao, Dongbin;  Zhu, Yuanheng;  Xia, Zhongpu
View  |  Adobe PDF(895Kb)  |  Favorite  |  View/Download:38/6  |  Submit date:2017/12/28
连续状态系统的近似最优在线强化学习 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2015
Authors:  朱圆恒
Adobe PDF(2679Kb)  |  Favorite  |  View/Download:226/0  |  Submit date:2015/09/02
强化学习  最优控制  近似策略迭代  概率近似最优  连续状态系统  收敛性  在线学习  Kd树  Reinforcement Learning  Optimal Control  Approximate Policy Iteration  Probably Approximately Correct  Continuous-state System  Convergence  Online Learning  Kd-tree  
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2015, 卷号: 26, 期号: 4, 页码: 775-787
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1331Kb)  |  Favorite  |  View/Download:91/22  |  Submit date:2015/09/21
Reinforcement Learning  Probably Approximately Correct  Kd-tree  
Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems 期刊论文
NEUROCOMPUTING, 2015, 卷号: 149, 页码: 124-131
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  Liu, Derong
View  |  Adobe PDF(860Kb)  |  Favorite  |  View/Download:56/21  |  Submit date:2015/10/13
Discrete-time Nonlinear System  T-s Fuzzy System  Hdp  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
Authors:  Zhao, Dongbin;  Zhu, Yuanheng
View  |  Adobe PDF(2156Kb)  |  Favorite  |  View/Download:73/26  |  Submit date:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation  
Model-Free Adaptive Algorithm for Optimal Control of Continuous-Time Nonlinear System 会议论文
, Wuhan, China, 2015
Authors:  Yuanheng Zhu;  Zhao DB(赵冬斌)
View  |  Adobe PDF(1399Kb)  |  Favorite  |  View/Download:19/5  |  Submit date:2017/09/13