CASIA OpenIR

Browse/Search results: 8 items in total, items 1-8

AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning (Conference paper)
, Online, 2022-02-22
Authors:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  Views/Downloads: 199/74  |  Submitted: 2023/06/29
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation (Conference paper)
, Online, 2021-02
Authors:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  Views/Downloads: 173/61  |  Submitted: 2023/06/28
Hierarchical Cooperative Swarm Policy Learning with Role Emergence (Conference paper)
, Online, 05-07 December 2021
Authors:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  Views/Downloads: 155/66  |  Submitted: 2023/06/12
Semantic Perception Swarm Policy with Deep Reinforcement Learning (Conference paper)
, Online, 05 December 2021
Authors:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(523Kb)  |  Views/Downloads: 134/52  |  Submitted: 2023/06/12
Multi-agent Collaborative Learning with Relational Graph Reasoning in Adversarial Environments (Conference paper)
, Online conference, 2021-9
Authors:  Wu Shiguang;  Qiu Tenghai;  Pu Zhiqiang;  Yi Jianqiang
Adobe PDF(1396Kb)  |  Views/Downloads: 264/77  |  Submitted: 2022/06/16
Multi-target Coverage with Connectivity Maintenance using Knowledge-incorporated Policy Framework (Conference paper)
, Xi'an, China, May 31-Jun. 4
Authors:  Shiguang Wu;  Zhiqiang Pu;  Zhen Liu;  Tenghai Qiu;  Jianqiang Yi;  Tianle Zhang
Adobe PDF(12862Kb)  |  Views/Downloads: 291/48  |  Submitted: 2022/04/06
Multi-Agent Cognition Difference Reinforcement Learning for MultiAgent Cooperation (Conference paper)
, Online, 2021-07
Authors:  Huimu Wang;  Tenghai Qiu;  Zhen Liu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(478Kb)  |  Views/Downloads: 305/65  |  Submitted: 2021/06/24
Multi-Agent Hierarchical Cognition Difference Policy for Multi-Agent Cooperation (Journal article)
Algorithms, 2021, Issue: 14, Page: 98
Authors:  Huimu Wang;  Zhen Liu;  Jianqiang Yi;  Zhiqiang Pu
Adobe PDF(1155Kb)  |  Views/Downloads: 261/53  |  Submitted: 2021/06/24
Keywords:  multiagent system;  deep reinforcement learning;  variational autoencoder;  attention mechanism