CASIA OpenIR

浏览/检索结果: 共26条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:8/2  |  提交时间:2024/05/29
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:4/0  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:7/1  |  提交时间:2024/05/28
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文
, New Orleans, LA, USA,, November 28 - December 9, 2022
作者:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:4/1  |  提交时间:2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文
, Auckland, New Zealand, May 9-13, 2022
作者:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/05/28
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Zhiwei Xu;  Dapeng Li;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/05/28
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and causal Relationship 会议论文
, New Orleans, 2023-12
作者:  Shiyu, Hu;  Dailing, Zhang;  Meiqi, Wu;  Xiaokun, Feng;  Xuchen, Li;  Xin, Zhao;  Kaiqi, Huang
Adobe PDF(6215Kb)  |  收藏  |  浏览/下载:88/18  |  提交时间:2024/01/22
Commander-Soldiers Reinforcement Learning for Cooperative Multi-Agent Systems 会议论文
, 意大利, 2022-7
作者:  Chen YQ(陈逸群);  Yang Wei;  Tianle Zhang;  Shiguang Wu;  Hongxing Chang
Adobe PDF(15907Kb)  |  收藏  |  浏览/下载:141/32  |  提交时间:2023/06/28
Ristretto: An Atomized Processing Architecture for Sparsity-Condensed Stream Flow in CNN 会议论文
, Westin Chicago, 2022-10
作者:  Gang Li;  Weixiang Xu;  Zhuoran Song;  Naifeng Jing;  Naifeng Jing;  Xiaoyao Liang
Adobe PDF(850Kb)  |  收藏  |  浏览/下载:80/28  |  提交时间:2023/06/21
Stacking More Linear Operations with Orthogonal Regularization to Learn Better 会议论文
, 线上会议, 2022-7
作者:  Xu WX(许伟翔);  Cheng J(程健)
Adobe PDF(1126Kb)  |  收藏  |  浏览/下载:98/32  |  提交时间:2023/06/21