CASIA OpenIR

Browse/Search Results: 6 items in total, showing items 1-6

MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning (conference paper)
Shenzhen, China, 18-22 July 2021
Authors: Zhiwei Xu; Dapeng Li; Yunpeng Bai; Guoliang Fan
Adobe PDF(3892Kb) | Views/Downloads: 16/10 | Submitted: 2024/05/28
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning (conference paper)
Online, 2022-02-22
Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(2593Kb) | Views/Downloads: 198/74 | Submitted: 2023/06/29
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation (conference paper)
Online, 2021-02
Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li K(李凯); Li LJ(李丽娟); Xing JL(兴军亮)
Adobe PDF(413Kb) | Views/Downloads: 173/61 | Submitted: 2023/06/28
Hierarchical Cooperative Swarm Policy Learning with Role Emergence (conference paper)
Online, 05-07 December 2021
Authors: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Qiu TH(丘腾海); Yi JQ(易建强)
Adobe PDF(327Kb) | Views/Downloads: 154/65 | Submitted: 2023/06/12
Marine autonomous navigation for biomimetic underwater robots based on deep stereo attention network (conference paper)
Prague, Czech Republic, 27 September-1 October 2021
Authors: Yan, Shuaizheng; Wu, Zhengxing; Wang, Jian; Tan, Min; Yu, Junzhi
Adobe PDF(4783Kb) | Views/Downloads: 170/64 | Submitted: 2023/06/12
Keywords: Autonomous underwater vehicles; Visualization; Navigation; Biological system modeling; Real-time systems
Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation (conference paper)
Online, 2021-07
Authors: Huimu, Wang; Tenghai, Qiu; Zhen, Liu; Zhiqiang, Pu; Jianqiang, Yi; Wanmai Yuan
Adobe PDF(478Kb) | Views/Downloads: 305/65 | Submitted: 2021/06/24