CASIA OpenIR

Browse/Search results: 15 items in total, showing items 1-10

An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game (Conference paper)
, Online, 2020-5
Authors:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF (727Kb)  |  Views/Downloads: 11/5  |  Submitted: 2024/07/04
Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking (Conference paper)
, Virtual, United States, 2020-06-14 to 2020-06-19
Authors:  Gao, Jin;  Hu, Weiming;  Lu, Yan
Adobe PDF (468Kb)  |  Views/Downloads: 37/12  |  Submitted: 2024/06/21
Potential Driven Reinforcement Learning for Hard Exploration Tasks (Conference paper)
, Online, 2020-4
Authors:  Zhao EM(赵恩民);  Deng SH(邓诗弘);  Zang YF(臧一凡);  Kang YX(康永欣);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF (1999Kb)  |  Views/Downloads: 111/42  |  Submitted: 2023/06/29
Consensus Control of Multi-Agent Systems With Two-Way Switching Directed Topology (Conference paper)
, Beijing, 2020-12-5
Authors:  Wang Xin;  Wei Qinglai;  Song Ruizhuo
Adobe PDF (898Kb)  |  Views/Downloads: 98/40  |  Submitted: 2023/06/28
Stable Training of Bellman Error in Reinforcement Learning (Conference paper)
, Thailand, November 18–22
Authors:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF (2416Kb)  |  Views/Downloads: 125/34  |  Submitted: 2023/06/27
Wd3: Taming the estimation bias in deep reinforcement learning (Conference paper)
, Baltimore, MD, USA, 2020-12
Authors:  He Q(何强);  Hou XW(侯新文)
Adobe PDF (2006Kb)  |  Views/Downloads: 228/45  |  Submitted: 2022/06/27
Keywords:  deep reinforcement learning;  estimation bias;  neural networks
Clas-Maze: An Edutainment Tool Combining Tangible Programming and Living Knowledge (Conference paper)
, Online conference, 2020-11-10
Authors:  Xing Q(邢倩);  Wang DL(王丹力);  Zhao YY(赵燕艳);  Wang XY(王雪钰)
Adobe PDF (1195Kb)  |  Views/Downloads: 147/35  |  Submitted: 2022/06/17
Multi-Agent Cooperation and Competition with Two-Level Graph Attention Network (Conference paper)
, Online, 2020-11
Authors:  Shiguang, Wu;  Zhiqiang, Pu;  Jianqiang, Yi;  Huimu, Wang
Adobe PDF (1185Kb)  |  Views/Downloads: 170/1  |  Submitted: 2021/06/24
STGA-LSTM: A Spatial-Temporal Graph Attentional LSTM Scheme for Multi-Agent Cooperation (Conference paper)
, Online, 2020-11
Authors:  Huimu Wang;  Zhen Liu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF (916Kb)  |  Views/Downloads: 102/0  |  Submitted: 2021/06/24
Multi-Agent Formation Control with Obstacles Avoidance under Restricted Communication through Graph Reinforcement Learning (Conference paper)
, Online, 2020.06
Authors:  Huimu, Wang;  Tenghai, Qiu;  Zhen, Liu;  Zhiqiang, Pu;  Jianqiang, Yi
Adobe PDF (1461Kb)  |  Views/Downloads: 211/43  |  Submitted: 2021/06/24