CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件                            
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:19/7  |  提交时间:2024/06/25
强化学习,分层强化学习  
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文
, Gold Coast, Australia, 2023-6
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:34/5  |  提交时间:2024/06/05
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文
, 北京华腾美居酒店, 2023-12-9
作者:  Zhourui Guo;  Meng Yao;  Yang Yu;  Qiyue Yin
Adobe PDF(2302Kb)  |  收藏  |  浏览/下载:23/8  |  提交时间:2024/06/03
Learning Individual Difference Rewards in Multi-Agent Reinforcement Learning 会议论文
, London, United Kingdom, 2023-5
作者:  Yang, Chen;  Yang, Guangkai;  Zhang, Junge
Adobe PDF(2419Kb)  |  收藏  |  浏览/下载:39/15  |  提交时间:2024/05/29
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:46/19  |  提交时间:2024/05/29
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:41/12  |  提交时间:2024/05/28
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:36/13  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:29/7  |  提交时间:2024/05/28
A Performance Optimization Strategy Based on Improved NSGA-II for a Flexible Robotic Fish 会议论文
, 英国伦敦, 2023.5.29
作者:  Lu, Ben;  Wang, Jian;  Liao, Xiaocun;  Zou, Qianqian;  Tan, Min;  Zhou, Chao
Adobe PDF(1449Kb)  |  收藏  |  浏览/下载:53/13  |  提交时间:2024/05/28
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and causal Relationship 会议论文
, New Orleans, 2023-12
作者:  Shiyu, Hu;  Dailing, Zhang;  Meiqi, Wu;  Xiaokun, Feng;  Xuchen, Li;  Xin, Zhao;  Kaiqi, Huang
Adobe PDF(6215Kb)  |  收藏  |  浏览/下载:115/26  |  提交时间:2024/01/22