CASIA OpenIR

浏览/检索结果: 共41条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:26/9  |  提交时间:2024/06/25
强化学习,分层强化学习  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:43/17  |  提交时间:2024/06/25
强化学习,分层强化学习  
Feature Comparison Based Channel Attention For Fine-Grained Visual Classification 会议论文
, Abu Dhabi, United Arab Emirates, 25-28 October 2020
作者:  Shukun Jia;  Yan Bai;  Zhang Jing
Adobe PDF(163Kb)  |  收藏  |  浏览/下载:22/11  |  提交时间:2024/06/21
Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文
, Greece, 2023-5
作者:  Xu YF(徐一凡);  Pu ZQ(蒲志强);  Cai QA(蔡奇昂);  Li FM(李非墨);  Chai XH(柴兴华)
Adobe PDF(7610Kb)  |  收藏  |  浏览/下载:31/13  |  提交时间:2024/06/21
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:75/26  |  提交时间:2024/06/05
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process 会议论文
, Singapore, 2023-12
作者:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(592Kb)  |  收藏  |  浏览/下载:62/27  |  提交时间:2024/05/30
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:50/20  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:43/12  |  提交时间:2024/05/28
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文
, New Orleans, LA, USA,, November 28 - December 9, 2022
作者:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:40/9  |  提交时间:2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文
, Auckland, New Zealand, May 9-13, 2022
作者:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:44/9  |  提交时间:2024/05/28