CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共6条,第1-6条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文
, 昆士兰, 2023-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:218/71  |  提交时间:2023/06/29
multi-agent  reinforcement learning  policy gradient  
Thermal Comfort Control Based on MEC Algorithm for HVAC System 会议论文
, Killarney, Ireland, 12-17 July 2015
作者:  Li, Dong;  Zhao, Dongbin;  Zhu, Yuanheng;  Xia, Zhongpu
浏览  |  Adobe PDF(895Kb)  |  收藏  |  浏览/下载:194/78  |  提交时间:2017/12/28
ADP with MCTS algorithm for Gomoku 会议论文
, Athens, Greece, 6-9 Dec. 2016
作者:  Tang Zhentao;  Zhao Dongbin;  Shao Kun;  Lv Le
浏览  |  Adobe PDF(866Kb)  |  收藏  |  浏览/下载:663/307  |  提交时间:2017/05/08
Consensus of Heterogeneous Multi-agent Systems With Switching Topologies Using Input-output Feedback Linearization 会议论文
, Hangzhou, China, 2015-7
作者:  Zhang,Qichao;  Zhao, Dongbin;  Wei, Qinglai;  Li, Chengdong
浏览  |  Adobe PDF(282Kb)  |  收藏  |  浏览/下载:246/77  |  提交时间:2017/05/04
Multi-agent Systems  Switching Topologies  Nonlinear Heterogeneous Systems  Communication Failures  Input-output Feedback  
Online Reinforcement Learning by Bayesian Inference 会议论文
Proceedings of International Joint Conference on Neural Networks 2015, Ireland, 2015年7月
作者:  Xia ZP(夏中谱);  Dongbin Zhao
浏览  |  Adobe PDF(751Kb)  |  收藏  |  浏览/下载:284/90  |  提交时间:2016/06/15
Reinforcement Learning  Bayesian Inference  Gaussian Processes  
An high-efficient online reinforcement learning algorithm for continuous-state systems 会议论文
IEEE World Congresson Intelligent Control and Automation (WCICA), Shenyang, China, 2014
作者:  Yuanheng Zhu;  Dongbin Zhao;  Haibo He
Adobe PDF(764Kb)  |  收藏  |  浏览/下载:254/79  |  提交时间:2015/08/19