CASIA OpenIR

浏览/检索结果: 共2条,第1-2条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions 期刊论文
IEEE ACCESS, 2019, 卷号: 7, 页码: 96148-96165
作者:  Dong, Bo;  An, Tianjiao;  Zhou, Fan;  Liu, Keping;  Yu, Weibo;  Li, Yuanchun
收藏  |  浏览/下载:274/0  |  提交时间:2019/12/16
Adaptive dynamic programming  collision identification  decentralized optimal control  modular robot manipulators  zero-sum game  
A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents 期刊论文
IEEE ACCESS, 2018, 卷号: 6, 页码: 70223-70235
作者:  Zhang, Zhen;  Wang, Dongqing;  Zhao, Dongbin;  Han, Qiaoni;  Song, Tingting
收藏  |  浏览/下载:260/0  |  提交时间:2019/07/12
Multi-agent reinforcement learning  gradient ascent  Q-learning  cooperative tasks