CASIA OpenIR
(This search is based on the works claimed by the user)

Browse/search results: 17 items in total, showing items 1-10

Reinforcement Learning for Build-Order Production in StarCraft II [Conference paper]
, Cordoba, Granada, and Seville, Spain, 30 June-6 July 2018
Authors: Zhentao Tang; Dongbin Zhao; Yuanheng Zhu; Ping Guo
Adobe PDF (2680 KB) | Views/Downloads: 147/46 | Submitted: 2021/07/07
Event-Triggered H-infinity Control for Continuous-Time Nonlinear System via Concurrent Learning [Journal article]
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, Volume: 47, Issue: 7, Pages: 1071-1081
Authors: Zhang, Qichao; Zhao, Dongbin; Zhu, Yuanheng
Adobe PDF (2937 KB) | Views/Downloads: 511/238 | Submitted: 2017/05/04
Concurrent Learning  Event-triggered Control  H-infinity Optimal Control  Neural Networks (NNs)  Zero-sum (ZS) Game
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2016, Volume: 46, Issue: 3, Pages: 854-865
Authors: Zhao, Dongbin; Zhang, Qichao; Wang, Ding; Zhu, Yuanheng
Adobe PDF (1769 KB) | Views/Downloads: 482/191 | Submitted: 2016/06/14
Adaptive Dynamic Programming (ADP)  Experience Replay  Nonzero-sum (NZS) Games  Optimal Control  Unknown Dynamics
Model-Free Optimal Control for Affine Nonlinear Systems With Convergence Analysis [Journal article]
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2015, Volume: 12, Issue: 4, Pages: 1461-1468
Authors: Zhao, Dongbin; Xia, Zhongpu; Wang, Ding
Adobe PDF (1985 KB) | Views/Downloads: 323/83 | Submitted: 2015/11/12
Action Dependent Heuristic Dynamic Programming  Adaptive Dynamic Programming  Model-free Optimal Control  Neural Networks  Policy Iteration  
Online Reinforcement Learning by Bayesian Inference [Conference paper]
Proceedings of International Joint Conference on Neural Networks 2015, Ireland, July 2015
Authors: Xia ZP (夏中谱); Dongbin Zhao
Adobe PDF (751 KB) | Views/Downloads: 277/89 | Submitted: 2016/06/15
Reinforcement Learning  Bayesian Inference  Gaussian Processes  
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle [Journal article]
NEURAL COMPUTING & APPLICATIONS, 2015, Volume: 26, Issue: 4, Pages: 775-787
Authors: Zhu, Yuanheng; Zhao, Dongbin
Adobe PDF (1331 KB) | Views/Downloads: 247/59 | Submitted: 2015/09/21
Reinforcement Learning  Probably Approximately Correct  Kd-tree  
Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems [Journal article]
NEUROCOMPUTING, 2015, Volume: 149, Pages: 124-131
Authors: Zhu, Yuanheng; Zhao, Dongbin; Liu, Derong
Adobe PDF (860 KB) | Views/Downloads: 259/99 | Submitted: 2015/10/13
Discrete-time Nonlinear System  T-S Fuzzy System  HDP
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, Volume: 26, Issue: 2, Pages: 346-356
Authors: Zhao, Dongbin; Zhu, Yuanheng
Adobe PDF (2156 KB) | Views/Downloads: 254/105 | Submitted: 2015/09/18
Efficient Exploration  Probably Approximately Correct (PAC)  Reinforcement Learning (RL)  State Aggregation
Clique-based cooperative multiagent reinforcement learning using factor graphs [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2015, Volume: 3, Issue: 1, Pages: 248-256
Authors: Zhang, Zhen; Zhao DB (赵冬斌)
Adobe PDF (707 KB) | Views/Downloads: 195/82 | Submitted: 2017/12/30
Reinforcement Learning  Factor Graphs  
Full-range adaptive cruise control based on supervised adaptive dynamic programming [Journal article]
NEUROCOMPUTING, 2014, Volume: 125, Pages: 57-67
Authors: Zhao, Dongbin; Hu, Zhaohui; Xia, Zhongpu; Alippi, Cesare; Zhu, Yuanheng; Wang, Ding
Adobe PDF (2228 KB) | Views/Downloads: 381/113 | Submitted: 2015/08/12
Adaptive Dynamic Programming  Supervised Reinforcement Learning  Neural Networks  Adaptive Cruise Control  Stop And Go