CASIA OpenIR

浏览/检索结果: 共10条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:42/16  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:23/11  |  提交时间:2024/06/25
Continuous Exploration via Multiple Perspectives in Sparse Reward Environment 会议论文
, 厦门国际会议中心, 2023-10-13
作者:  Chen ZP(陈忠鹏);  Guan Q(关强)
Adobe PDF(2260Kb)  |  收藏  |  浏览/下载:40/12  |  提交时间:2024/06/04
Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation  
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process 会议论文
, Singapore, 2023-12
作者:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(592Kb)  |  收藏  |  浏览/下载:61/26  |  提交时间:2024/05/30
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:57/15  |  提交时间:2024/05/28
A Performance Optimization Strategy Based on Improved NSGA-II for a Flexible Robotic Fish 会议论文
, 英国伦敦, 2023.5.29
作者:  Lu, Ben;  Wang, Jian;  Liao, Xiaocun;  Zou, Qianqian;  Tan, Min;  Zhou, Chao
Adobe PDF(1449Kb)  |  收藏  |  浏览/下载:72/21  |  提交时间:2024/05/28
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and causal Relationship 会议论文
, New Orleans, 2023-12
作者:  Shiyu, Hu;  Dailing, Zhang;  Meiqi, Wu;  Xiaokun, Feng;  Xuchen, Li;  Xin, Zhao;  Kaiqi, Huang
Adobe PDF(6215Kb)  |  收藏  |  浏览/下载:133/31  |  提交时间:2024/01/22
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:213/51  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Exploration via Joint Policy Diversity for Sparse-Reward Multi-Agent Tasks 会议论文
, Macao, China, 2023-8
作者:  Pei Xu;  Junge Zhang;  Kaiqi Huang
Adobe PDF(1369Kb)  |  收藏  |  浏览/下载:279/87  |  提交时间:2023/06/19
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文
, Washington DC, USA, 2023-2-7
作者:  Pei Xu;  Junge Zhang;  Qiyue Yin;  Chao Yu;  Yaodong Yang;  Kaiqi Huang
Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:262/79  |  提交时间:2023/06/19
deep reinforcement learning  sparse reward  exploration  multi-agent