CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共18条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文
, Xiamen, China, 2019-12-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  收藏  |  浏览/下载:120/39  |  提交时间:2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Empirical Policy Optimization for n-Player Markov Games 期刊论文
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:95/38  |  提交时间:2023/04/26
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:245/26  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
A Review of Computational Intelligence for StarCraft AI 会议论文
, Bangalore, India, 18-21 Nov. 2018
作者:  Tang, Zhentao;  Shao, Kun;  Zhu, Yuanheng;  Li, Dong;  Zhao, Dongbin;  Huang, Tingwen
浏览  |  Adobe PDF(131Kb)  |  收藏  |  浏览/下载:494/226  |  提交时间:2019/04/25
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, 卷号: 3, 期号: 1, 页码: 73-84
作者:  Kun Shao;  Yuanheng Zhu;  Dongbin Zhao
浏览  |  Adobe PDF(4125Kb)  |  收藏  |  浏览/下载:341/132  |  提交时间:2019/04/22
Reinforcement Learning, Transfer Learning, Curriculum Learning, Neural Network, Game Ai  
Deep reinforcement learning with Experience Replay based on SARSA 会议论文
, *, 2016-9
作者:  Zhao,Dongbin(赵冬斌);  Wang,Haitao;  Shao,Kun;  Zhu,Yuanheng
浏览  |  Adobe PDF(1288Kb)  |  收藏  |  浏览/下载:395/174  |  提交时间:2018/01/04
Deep Learning  Reinforcement Learning  Experience Replay  q Learning  Sarsa Learning  
Cooperative Reinforcement Learning for Multiple Units Combat in StarCraft 会议论文
, Honolulu, Hawaii, USA, Nov. 27 to Dec 1, 2017
作者:  Shao K(邵坤);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(1378Kb)  |  收藏  |  浏览/下载:539/266  |  提交时间:2017/09/20
A data-based online reinforcement learning algorithm with high-efficient exploration 会议论文
, Orlando, FL, USA, Dec, 2014
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(407Kb)  |  收藏  |  浏览/下载:208/80  |  提交时间:2017/09/13
Online Model-Free RLSPI Algorithm for Nonlinear Discrete-Time Non-affine Systems 会议论文
, Daegu, Korea, November 3-7, 2013
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
Adobe PDF(276Kb)  |  收藏  |  浏览/下载:164/51  |  提交时间:2017/09/13
Online reinforcement learning for continuous-state systems 专著章节/文集论文
出自: Frontiers of Intelligent Control and Information Processing, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore:World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, 2014
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
Adobe PDF(24150Kb)  |  收藏  |  浏览/下载:249/27  |  提交时间:2017/09/13