CASIA OpenIR

Browse/search results: 29 items in total, showing items 1-10

An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game (Conference paper)
Online, 2020-5
Authors: Liu MS (刘民颂); Zhu YH (朱圆恒); Zhao DB (赵冬斌)
Adobe PDF (727Kb) | Views/Downloads: 34/14 | Submitted: 2024/07/04
Potential Driven Reinforcement Learning for Hard Exploration Tasks (Conference paper)
Online, 2020-4
Authors: Zhao EM (赵恩民); Deng SH (邓诗弘); Zang YF (臧一凡); Kang YX (康永欣); Li K (李凯); Xing JL (兴军亮)
Adobe PDF (1999Kb) | Views/Downloads: 131/50 | Submitted: 2023/06/29
Stable Training of Bellman Error in Reinforcement Learning (Conference paper)
Thailand, November 18–22
Authors: Gong C (龚晨); Bai YP (白云鹏); Hou XW (侯新文); Ji XH (季晓慧)
Adobe PDF (2416Kb) | Views/Downloads: 139/40 | Submitted: 2023/06/27
Robot Navigation among External Autonomous Agents through Deep Reinforcement Learning using Graph Attention Network (Conference paper)
Berlin, Germany, July 12-17, 2020
Authors: Zhang TL (张天乐); Qiu TH (丘腾海); Pu ZQ (蒲志强); Liu Z (刘振); Yi JQ (易建强)
Adobe PDF (496Kb) | Views/Downloads: 141/43 | Submitted: 2023/06/12
Wd3: Taming the estimation bias in deep reinforcement learning (Conference paper)
Baltimore, MD, USA, 2020-12
Authors: He Q (何强); Hou XW (侯新文)
Adobe PDF (2006Kb) | Views/Downloads: 244/55 | Submitted: 2022/06/27
Keywords: deep reinforcement learning; estimation bias; neural networks
Cyber-Physical-Social Systems for Smart City: An Implementation Based on Intelligent Loop (Conference paper)
Beijing, 2020-12-5
Authors: Xiong, Gang; Chen, Xiaoyu; Shuo, Nan; Lv, Yisheng; Zhu, Fenghua; Qu, Tianci; Ye, Peijun
Adobe PDF (457Kb) | Views/Downloads: 219/65 | Submitted: 2022/06/16
Multi-robot cooperative target encirclement through learning distributed transferable policy (Conference paper)
Online, July 19-24
Authors: Zhang Tianle; Liu Zhen; Wu Shiguang; Pu Zhiqiang; Yi Jianqiang
Adobe PDF (949Kb) | Views/Downloads: 238/73 | Submitted: 2022/06/16
Multi-Agent Cooperation and Competition with Two-Level Graph Attention Network (Conference paper)
Online, 2020-11
Authors: Shiguang, Wu; Zhiqiang, Pu; Jianqiang, Yi; Huimu, Wang
Adobe PDF (1185Kb) | Views/Downloads: 176/1 | Submitted: 2021/06/24
STGA-LSTM: A Spatial-Temporal Graph Attentional LSTM Scheme for Multi-Agent Cooperation (Conference paper)
Online, 2020-11
Authors: Huimu Wang; Zhen Liu; Zhiqiang Pu; Jianqiang Yi
Adobe PDF (916Kb) | Views/Downloads: 106/0 | Submitted: 2021/06/24
Multi-Agent Formation Control with Obstacles Avoidance under Restricted Communication through Graph Reinforcement Learning (Conference paper)
Online, 2020.06
Authors: Huimu, Wang; Tenghai, Qiu; Zhen, Liu; Zhiqiang, Pu; Jianqiang, Yi
Adobe PDF (1461Kb) | Views/Downloads: 228/47 | Submitted: 2021/06/24