已选(0)清除
条数/页: 排序方式: |
| An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文 , 线上, 2020-5 作者: Liu MS(刘民颂); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(727Kb)  |  收藏  |  浏览/下载:11/5  |  提交时间:2024/07/04 |
| Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking 会议论文 , Virtual, United States, 2020-06-14至2020-06-19 作者: Gao, Jin; Hu, Weiming; Lu, Yan Adobe PDF(468Kb)  |  收藏  |  浏览/下载:37/12  |  提交时间:2024/06/21 |
| Potential Driven Reinforcement Learning for Hard Exploration Tasks 会议论文 , 线上, 2020-4 作者: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮) Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:111/42  |  提交时间:2023/06/29 |
| Consensus Control of Multi-Agent Systems With Two-Way Switching Directed Topology 会议论文 , 北京, 2020-12-5 作者: Wang Xin; Wei Qinglai; Song Ruizhuo Adobe PDF(898Kb)  |  收藏  |  浏览/下载:98/40  |  提交时间:2023/06/28 |
| Stable Training of Bellman Error in Reinforcement Learning 会议论文 , Thailand, November 18–22 作者: Gong C(龚晨); Bai YP(白云鹏); Hou XW(侯新文); Ji XH(季晓慧) Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:125/34  |  提交时间:2023/06/27 |
| Wd3: Taming the estimation bias in deep reinforcement learning 会议论文 , Baltimore, MD, USA, 2020-12 作者: He Q(何强); Hou XW(侯新文) Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:228/45  |  提交时间:2022/06/27 deep reinforcement learning estimation bias neural networks |
| Clas-Maze: An Edutainment Tool Combining Tangible Programming and Living Knowledge 会议论文 , 线上会议, 2020年11月10日 作者: Xing Q(邢倩); Wang DL(王丹力); Zhao YY(赵燕艳); Wang XY(王雪钰) Adobe PDF(1195Kb)  |  收藏  |  浏览/下载:147/35  |  提交时间:2022/06/17 |
| Multi-Agent Cooperation and Competition with Two-Level Ggraph Attention Network 会议论文 , 线上, 2020-11 作者: Shiguang, Wu; Zhiqiang, Pu; Jianqiang, Yi; Huimu, Wang Adobe PDF(1185Kb)  |  收藏  |  浏览/下载:170/1  |  提交时间:2021/06/24 |
| STGA-LSTM: A Spatial-Temporal Graph Attentional LSTM Scheme for Multi-Agent Cooperation 会议论文 , 线上, 2020-11 作者: Huimu Wang; Zhen Liu; Zhiqiang Pu; Jianqiang Yi Adobe PDF(916Kb)  |  收藏  |  浏览/下载:102/0  |  提交时间:2021/06/24 |
| Multi-Agent Formation Control with Obstacles Avoidance under Restricted Communication through Graph Reinforcement Learning 会议论文 , 线上, 2020.06 作者: Huimu, Wang; Tenghai, Qiu; Zhen, Liu; Zhiqiang, Pu; Jianqiang, Yi Adobe PDF(1461Kb)  |  收藏  |  浏览/下载:211/43  |  提交时间:2021/06/24 |