CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:43/12  |  提交时间:2024/06/05
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control 会议论文
Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou
Adobe PDF(1457Kb)  |  收藏  |  浏览/下载:41/12  |  提交时间:2024/05/30
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:61/22  |  提交时间:2024/05/29
A brain-inspired theory of mind spiking neural network improves multi-agent cooperation and competition 期刊论文
Patterns, 2023, 页码: 8
作者:  Zhao,Zhuoya;  Zhao,Feifei;  Zhao,Yuxuan;  Sun,Yinqian;  Zeng,Yi
Adobe PDF(4502Kb)  |  收藏  |  浏览/下载:60/16  |  提交时间:2024/05/28
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:53/15  |  提交时间:2024/05/28
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:47/19  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:37/9  |  提交时间:2024/05/28
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:152/5  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
A Survey on Reinforcement Learning Methods in Bionic Underwater Robots 期刊论文
BIOMIMETICS, 2023, 卷号: 8, 期号: 2, 页码: 29
作者:  Tong, Ru;  Feng, Yukai;  Wang, Jian;  Wu, Zhengxing;  Tan, Min;  Yu, Junzhi
Adobe PDF(1260Kb)  |  收藏  |  浏览/下载:148/19  |  提交时间:2023/11/17
bionic underwater robot  reinforcement learning  robotic fish  intelligent control  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:62/3  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow