CASIA OpenIR

HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism (Conference paper)
Washington, DC, USA, February 7-14, 2023
Authors: Zhiwei Xu; Yunpeng Bai; Bin Zhang; Dapeng Li; Guoliang Fan
Adobe PDF(3345Kb) | Submit date: 2024/05/28
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning (Conference paper)
New Orleans, LA, USA, November 28 - December 9, 2022
Authors: Zhiwei Xu; Dapeng Li; Bin Zhang; Yuan Zhan; Yunpeng Bai; Guoliang Fan
Adobe PDF(4367Kb) | Submit date: 2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning (Conference paper)
Auckland, New Zealand, May 9-13, 2022
Authors: Zhiwei Xu; Yunpeng Bai; Dapeng Li; Bin Zhang; Guoliang Fan
Adobe PDF(2965Kb) | Submit date: 2024/05/28
Learning to Coordinate via Multiple Graph Neural Networks (Conference paper)
Bali, Indonesia, December 8-12, 2021
Authors: Zhiwei Xu; Bin Zhang; Yunpeng Bai; Dapeng Li; Guoliang Fan
Adobe PDF(2047Kb) | Submit date: 2024/05/28
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning (Conference paper)
Shenzhen, China, 18-22 July 2021
Authors: Zhiwei Xu; Dapeng Li; Yunpeng Bai; Guoliang Fan
Adobe PDF(3892Kb) | Submit date: 2024/05/28
Stable Training of Bellman Error in Reinforcement Learning (Conference paper)
Thailand, November 18–22
Authors: Gong C(龚晨); Bai YP(白云鹏); Hou XW(侯新文); Ji XH(季晓慧)
Adobe PDF(2416Kb) | Submit date: 2023/06/27
Curiosity-Driven and Victim-Aware Adversarial Policies (Conference paper)
Austin, TX, USA, December 5-9, 2022
Authors: Gong C(龚晨); Yang Z(杨洲); Bai YP(白云鹏); Shi JK(史杰克); Sinha Arunesh; Xu BW(徐博文); Lo David; Hou XW(侯新文); Fan GL(范国梁)
Adobe PDF(4090Kb) | Submit date: 2023/06/27
Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution (Conference paper)
Padua, Italy, 18-23 July 2022
Authors: Yunpeng Bai; Chen Gong; Bin Zhang; Guoliang Fan; Xinwen Hou; Yu Liu
Adobe PDF(8946Kb) | Submit date: 2023/06/14
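Research on Cooperative Multi-Agent Exploration in Sparse-Reward Environments (面向稀疏奖励环境的多智能体协同探索问题研究) (Thesis)
2023
Authors: Bai YP(白云鹏)
Adobe PDF(36141Kb) | Submit date: 2023/06/13
Keywords: Multi-Agent; Reinforcement Learning; Hypergraph; Variational Inference; Curiosity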
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games (Conference paper)
Shenzhen, China, 05-09 July 2021
Authors: Gong C(龚晨); He Q(何强); Bai YP(白云鹏); Hou XW(侯新文); Fan GL(范国梁); Liu Y(刘禹)
Adobe PDF(2780Kb) | Submit date: 2022/06/27
Keywords: Video Game; Reinforcement Learning; Quantile Regression; Bellman Residual; Wasserstein Distance