CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件                            
已选(0)清除 条数/页:   排序方式:
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:53/15  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:36/9  |  提交时间:2024/05/28
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:206/47  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Pseudo Value Network Distillation for High-Performance Exploration 会议论文
, 澳大利亚, 2023-06
作者:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:160/45  |  提交时间:2023/06/28
TBERT: Dynamic BERT Inference with Top-k Based Predictors 会议论文
, Antwerp, Belgium, 2023-4-17
作者:  Liu, Zejian;  Zhao, Kun;  Cheng, Jian
Adobe PDF(3426Kb)  |  收藏  |  浏览/下载:119/31  |  提交时间:2023/06/19
Transformer  Dynamic Inference  Pruning