已选(0)清除
条数/页: 排序方式: |
| HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文 , Washington, DC, USA, February 7-14, 2023 作者: Zhiwei Xu ; Yunpeng Bai ; Bin Zhang; Dapeng Li ; Guoliang Fan![](/image/person.jpg)
Adobe PDF(3345Kb)  |   收藏  |  浏览/下载:29/7  |  提交时间:2024/05/28 |
| Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文 , New Orleans, LA, USA,, November 28 - December 9, 2022 作者: Zhiwei Xu ; Dapeng Li ; Bin Zhang; Yuan Zhan ; Yunpeng Bai ; Guoliang Fan![](/image/person.jpg)
Adobe PDF(4367Kb)  |   收藏  |  浏览/下载:28/5  |  提交时间:2024/05/28 |
| SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文 , Auckland, New Zealand, May 9-13, 2022 作者: Zhiwei Xu ; Yunpeng Bai ; Dapeng Li ; Bin Zhang; Guoliang Fan![](/image/person.jpg)
Adobe PDF(2965Kb)  |   收藏  |  浏览/下载:31/7  |  提交时间:2024/05/28 |
| Learning to Coordinate via Multiple Graph Neural Networks 会议论文 , BALI, Indonesia, December 8-12, 2021 作者: Zhiwei Xu ; Bin Zhang; Yunpeng Bai ; Dapeng Li ; Guoliang Fan![](/image/person.jpg)
Adobe PDF(2047Kb)  |   收藏  |  浏览/下载:38/15  |  提交时间:2024/05/28 |
| MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文 , Shenzhen, China, 18-22 July 2021 作者: Zhiwei Xu ; Dapeng Li ; Yunpeng Bai ; Guoliang Fan![](/image/person.jpg)
Adobe PDF(3892Kb)  |   收藏  |  浏览/下载:16/10  |  提交时间:2024/05/28 |
| Stable Training of Bellman Error in Reinforcement Learning 会议论文 , Thailand, November 18–22 作者: Gong C(龚晨) ; Bai YP(白云鹏) ; Hou XW(侯新文) ; Ji XH(季晓慧)
Adobe PDF(2416Kb)  |   收藏  |  浏览/下载:126/35  |  提交时间:2023/06/27 |
| Curiosity-Driven and Victim-Aware Adversarial Policies 会议论文 , Austin TX, USA, December 5-9, 2022 作者: Gong C(龚晨) ; Yang Z(杨洲); Bai YP(白云鹏) ; Shi JK(史杰克); Sinha Arunesh; Xu BW(徐博文); Lo David; Hou XW(侯新文) ; Fan GL(范国梁)![](/image/person.jpg)
Adobe PDF(4090Kb)  |   收藏  |  浏览/下载:126/50  |  提交时间:2023/06/27 |
| Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution 会议论文 , Padua, Italy, 18-23 July 2022 作者: Yunpeng Bai ; Chen Gong ; Bin Zhang; Guoliang Fan ; Xinwen Hou ; Yu Liu![](/image/person.jpg)
Adobe PDF(8946Kb)  |   收藏  |  浏览/下载:141/36  |  提交时间:2023/06/14 |
| 面向稀疏奖励环境的多智能体协同探索问题研究 学位论文 , 2023 作者: 白云鹏![](/image/person.jpg)
Adobe PDF(36141Kb)  |   收藏  |  浏览/下载:185/9  |  提交时间:2023/06/13 多智能体,强化学习,超图,变分推断,好奇心 |
| Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文 , Shenzhen, China, 05-09 July 2021 作者: Gong C(龚晨) ; He Q(何强) ; Bai YP(白云鹏) ; Hou XW(侯新文) ; Fan GL(范国梁) ; Liu Y(刘禹)![](/image/person.jpg)
Adobe PDF(2780Kb)  |   收藏  |  浏览/下载:246/44  |  提交时间:2022/06/27 Video Game Reinforcement Learning Quantile Regression Bellman residual Wasserstein Distance |