CASIA OpenIR

浏览/检索结果: 共149条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:48/22  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文
, Greece, 2023-5
作者:  Xu YF(徐一凡);  Pu ZQ(蒲志强);  Cai QA(蔡奇昂);  Li FM(李非墨);  Chai XH(柴兴华)
Adobe PDF(7610Kb)  |  收藏  |  浏览/下载:33/13  |  提交时间:2024/06/21
Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering 会议论文
, Torino, Italia, 2024-5
作者:  Wang, Chenhao;  Cao, Pengfei;  Li, Jiachun;  Chen, Yubo;  Liu, Kang;  Jiang, Xiaojian;  Xu, Jiexin;  Li, Qiuxia;  Jun Zhao
Adobe PDF(909Kb)  |  收藏  |  浏览/下载:48/13  |  提交时间:2024/05/30
CN-AutoMIC: Distilling Chinese Commonsense Knowledge from Pretrained Language Models 会议论文
, Abu Dhabi, United Arab Emirates, 2022-12
作者:  Wang, Chenhao;  Li, Jiachun;  Chen, Yubo;  Liu, Kang;  Zhao, Jun
Adobe PDF(848Kb)  |  收藏  |  浏览/下载:50/13  |  提交时间:2024/05/30
Leveraging Explicit Lexico-logical Alignments in Text-to-SQL Parsing 会议论文
, Dublin, May 22–27, 2022
作者:  Sun, Runxin;  He, Shizhu;  Zhu, Chong;  He, Yaohan;  Li, Jinlong;  Zhao, Jun;  Liu, Kang
Adobe PDF(528Kb)  |  收藏  |  浏览/下载:73/26  |  提交时间:2024/05/28
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:58/16  |  提交时间:2024/05/28
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:53/21  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:43/12  |  提交时间:2024/05/28
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文
, New Orleans, LA, USA,, November 28 - December 9, 2022
作者:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:41/9  |  提交时间:2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文
, Auckland, New Zealand, May 9-13, 2022
作者:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:45/10  |  提交时间:2024/05/28