| An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game Conference paper, Online, 2020-5 Authors: Liu MS(刘民颂) ; Zhu YH(朱圆恒) ; Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  Views/Downloads: 34/14  |  Submitted: 2024/07/04 |
| Potential Driven Reinforcement Learning for Hard Exploration Tasks Conference paper, Online, 2020-4 Authors: Zhao EM(赵恩民) ; Deng SH(邓诗弘) ; Zang YF(臧一凡) ; Kang YX(康永欣) ; Li K(李凯) ; Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  Views/Downloads: 131/50  |  Submitted: 2023/06/29 |
| Stable Training of Bellman Error in Reinforcement Learning Conference paper, Thailand, November 18–22 Authors: Gong C(龚晨) ; Bai YP(白云鹏) ; Hou XW(侯新文) ; Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  Views/Downloads: 139/40  |  Submitted: 2023/06/27 |
| Robot Navigation among External Autonomous Agents through Deep Reinforcement Learning using Graph Attention Network Conference paper, Berlin, Germany, July 12-17, 2020 Authors: Zhang TL(张天乐) ; Qiu TH(丘腾海) ; Pu ZQ(蒲志强) ; Liu Z(刘振) ; Yi JQ(易建强)
Adobe PDF(496Kb)  |  Views/Downloads: 141/43  |  Submitted: 2023/06/12 |
| Wd3: Taming the estimation bias in deep reinforcement learning Conference paper, Baltimore, MD, USA, 2020-12 Authors: He Q(何强) ; Hou XW(侯新文)
Adobe PDF(2006Kb)  |  Views/Downloads: 244/55  |  Submitted: 2022/06/27  |  Keywords: deep reinforcement learning; estimation bias; neural networks |
| Cyber-Physical-Social Systems for Smart City: An Implementation Based on Intelligent Loop Conference paper, Beijing, 2020-12-5 Authors: Xiong, Gang ; Chen, Xiaoyu ; Shuo, Nan ; Lv, Yisheng ; Zhu, Fenghua ; Qu, Tianci ; Ye, Peijun
Adobe PDF(457Kb)  |  Views/Downloads: 219/65  |  Submitted: 2022/06/16 |
| Multi-robot cooperative target encirclement through learning distributed transferable policy Conference paper, Online, July 19-24 Authors: Zhang Tianle ; Liu Zhen ; Wu Shiguang ; Pu Zhiqiang ; Yi Jianqiang
Adobe PDF(949Kb)  |  Views/Downloads: 238/73  |  Submitted: 2022/06/16 |
| Multi-Agent Cooperation and Competition with Two-Level Graph Attention Network Conference paper, Online, 2020-11 Authors: Shiguang, Wu ; Zhiqiang, Pu ; Jianqiang, Yi ; Huimu, Wang
Adobe PDF(1185Kb)  |  Views/Downloads: 176/1  |  Submitted: 2021/06/24 |
| STGA-LSTM: A Spatial-Temporal Graph Attentional LSTM Scheme for Multi-Agent Cooperation Conference paper, Online, 2020-11 Authors: Huimu Wang ; Zhen Liu ; Zhiqiang Pu ; Jianqiang Yi
Adobe PDF(916Kb)  |  Views/Downloads: 106/0  |  Submitted: 2021/06/24 |
| Multi-Agent Formation Control with Obstacles Avoidance under Restricted Communication through Graph Reinforcement Learning Conference paper, Online, 2020-06 Authors: Huimu, Wang ; Tenghai, Qiu ; Zhen, Liu ; Zhiqiang, Pu ; Jianqiang, Yi
Adobe PDF(1461Kb)  |  Views/Downloads: 228/47  |  Submitted: 2021/06/24 |