CASIA OpenIR

Browse/Search results: 16 items in total, showing items 1-10

Clique-based cooperative multiagent reinforcement learning using factor graphs [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2015, Volume: 3, Issue: 1, Pages: 248-256
Authors:  Zhang, Zhen;  Zhao DB (赵冬斌)
Adobe PDF (707 KB)  |  Views/Downloads: 205/86  |  Submitted: 2017/12/30
Reinforcement Learning  Factor Graphs  
A data-based online reinforcement learning algorithm with high-efficient exploration [Conference paper]
Orlando, FL, USA, Dec, 2014
Authors:  Yuanheng Zhu;  Zhao DB (赵冬斌)
Adobe PDF (407 KB)  |  Views/Downloads: 208/80  |  Submitted: 2017/09/13
Optimal control for discrete-time systems with actuator saturation [Journal article]
OPTIMAL CONTROL APPLICATIONS & METHODS, 2017, Volume: 38, Issue: 6, Pages: 1071-1080
Authors:  Lin, Qiao;  Wei, Qinglai;  Zhao, Bo
Adobe PDF (348 KB)  |  Views/Downloads: 535/260  |  Submitted: 2017/05/04
Approximate Dynamic Programming  Discrete Time  Generalized Policy Iteration  Optimal Control  Saturating Actuators  
The T-ITS Awards and Future Transportation [Journal article]
IEEE Transactions on Intelligent Transportation Systems, 2014, Volume: 15, Issue: 6, Pages: 2353-2359
Authors:  Fei-Yue Wang
Adobe PDF (81 KB)  |  Views/Downloads: 305/140  |  Submitted: 2017/03/07
T-ITS Awards  Future Transportation
Hybrid feedback control of vehicle longitudinal acceleration [Conference paper]
Proceedings of Chinese Control Conference, Hefei, China, 2012-7
Authors:  Xia ZP (夏中谱);  Dongbin Zhao
Adobe PDF (310 KB)  |  Views/Downloads: 248/89  |  Submitted: 2016/06/15
Acceleration Control  Fuzzy Inference  PID Controller  Vehicle
DynaCAS: Computational Experiments and Decision Support for ITS [Journal article]
IEEE INTELLIGENT SYSTEMS, 2008, Volume: 23, Issue: 6, Pages: 19-23
Authors:  Zhang, Nan;  Wang, Fei-Yue;  Zhu, Fenghua;  Zhao, Dongbin;  Tang, Shuming
Adobe PDF (586 KB)  |  Views/Downloads: 328/76  |  Submitted: 2015/11/08
DynaCAS  Computational Experiments  Decision Support  ITS
DynaCAS: Computational Experiments and Decision Support for ITS [Journal article]
IEEE INTELLIGENT SYSTEMS, 2008, Volume: 23, Issue: 6, Pages: 19-23
Authors:  Zhang, Nan;  Wang, Fei-Yue;  Zhu, Fenghua;  Zhao, Dongbin;  Tang, Shuming
Adobe PDF (586 KB)  |  Views/Downloads: 281/51  |  Submitted: 2015/11/08
DynaCAS  Computational Experiments  Decision Support  ITS
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle [Journal article]
NEURAL COMPUTING & APPLICATIONS, 2015, Volume: 26, Issue: 4, Pages: 775-787
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF (1331 KB)  |  Views/Downloads: 256/60  |  Submitted: 2015/09/21
Reinforcement Learning  Probably Approximately Correct  k-d Tree
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, Volume: 26, Issue: 2, Pages: 346-356
Authors:  Zhao, Dongbin;  Zhu, Yuanheng
Adobe PDF (2156 KB)  |  Views/Downloads: 267/110  |  Submitted: 2015/09/18
Efficient Exploration  Probably Approximately Correct (PAC)  Reinforcement Learning (RL)  State Aggregation
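Coordinated Control of Urban Regional Traffic Signals (城市区域交通信号协调控制) [Thesis]
Institute of Automation, Chinese Academy of Sciences: Graduate University of Chinese Academy of Sciences, 2012
Authors:  Dai Yujie (戴钰桀)
Adobe PDF (1887 KB)  |  Views/Downloads: 195/0  |  Submitted: 2015/09/02
Traffic Signal Control  Coordination  Intelligent Control  Adaptive Dynamic Programming  Reinforcement Learning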