CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共13条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:24/10  |  提交时间:2024/07/04
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文
, Xiamen, China, 2019-12-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  收藏  |  浏览/下载:156/49  |  提交时间:2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Comparison of methods to efficient graph SLAM under general optimization framework 会议论文
YAC 2017
作者:  Haoran Li;  Qichao Zhang;  Dongbin Zhao
Adobe PDF(151Kb)  |  收藏  |  浏览/下载:950/532  |  提交时间:2017/12/31
Optimization  Slam  Pose Graph  
Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games 会议论文
, Guangzhou China, November 14–18
作者:  Zhang,Qichao;  Zhao,Dongbin;  Zhang,Sibo
Adobe PDF(119Kb)  |  收藏  |  浏览/下载:279/107  |  提交时间:2017/12/28
A data-based online reinforcement learning algorithm with high-efficient exploration 会议论文
, Orlando, FL, USA, Dec, 2014
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
Adobe PDF(407Kb)  |  收藏  |  浏览/下载:219/82  |  提交时间:2017/09/13
Convolutional fitted Q iteration for vision-based control problems 会议论文
, Vancouver, BC, Canada, 24-29 July 2016
作者:  Zhao Dongbin;  Zhu Yuanheng;  Lv Le;  Chen Yaran;  Zhang Qichao
Adobe PDF(240Kb)  |  收藏  |  浏览/下载:386/130  |  提交时间:2017/05/08
Data-driven adaptive dynamic programming for two-player nonzero-sum game 会议论文
, Chongqing, China, 2017-5
作者:  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(141Kb)  |  收藏  |  浏览/下载:377/199  |  提交时间:2017/05/04
Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations 会议论文
, Vancouver, Canada, 2016-7
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng;  Chen, Xi
Adobe PDF(339Kb)  |  收藏  |  浏览/下载:298/103  |  提交时间:2017/05/04
Consensus of Heterogeneous Multi-agent Systems With Switching Topologies Using Input-output Feedback Linearization 会议论文
, Hangzhou, China, 2015-7
作者:  Zhang,Qichao;  Zhao, Dongbin;  Wei, Qinglai;  Li, Chengdong
Adobe PDF(282Kb)  |  收藏  |  浏览/下载:282/89  |  提交时间:2017/05/04
Multi-agent Systems  Switching Topologies  Nonlinear Heterogeneous Systems  Communication Failures  Input-output Feedback  
Online Synchronous Policy Iteration Based on Concurrent Learning to Solve Continuous-time Optimal Control Problem 会议论文
Proceedings of International Conference on Information Science and Technology, Changsha, 2015.4.25~4.27
作者:  Haitao Wang;  Dongbin Zhao;  Chengdong Li
浏览  |  Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:293/105  |  提交时间:2016/06/15