CASIA OpenIR

Browse/Search Results: 4 items in total, showing items 1-4

A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents (Journal article)
IEEE ACCESS, 2018, Volume: 6, Pages: 70223-70235
Authors:  Zhang, Zhen;  Wang, Dongqing;  Zhao, Dongbin;  Han, Qiaoni;  Song, Tingting
Views/Downloads: 232/0  |  Submitted: 2019/07/12
Multi-agent reinforcement learning  gradient ascent  Q-learning  cooperative tasks  
An Appearance-and-Structure Fusion Network for Object Viewpoint Estimation (Conference paper)
, Stockholm, Sweden, 2018
Authors:  Yueying Kao;  Weiming Li;  Zairan Wang;  Dongqing Zou;  Ran He(赫然);  Qiang Wang;  Minsu Ahn;  Sunghoon Hong
Views/Downloads: 167/0  |  Submitted: 2018/06/07
FMR-GA -- A cooperative multi-agent reinforcement learning algorithm based on gradient ascent (Journal article)
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10634), 2017, Issue: *, Pages: 840–848
Authors:  Zhen Zhang;  Dongqing Wang;  Dongbin Zhao;  Tingting Song
Views/Downloads: 135/0  |  Submitted: 2017/12/31
Reinforcement Learning  Multi-agent  Gradient Ascent  Q-learning  
FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 6, Pages: 1367-1379
Authors:  Zhang, Zhen;  Zhao, Dongbin;  Gao, Junwei;  Wang, Dongqing;  Dai, Yujie
Views/Downloads: 280/0  |  Submitted: 2017/07/18
Multiagent Reinforcement Learning (MARL)  Nash Equilibrium  Q-learning  Repeated Game