CASIA OpenIR

Browse/Search Results:  1-3 of 3 Help

Selected(0)Clear Items/Page:    Sort:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
Authors:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  Favorite  |  View/Download:32/13  |  Submit date:2024/06/25
强化学习,分层强化学习  
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
Authors:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  Favorite  |  View/Download:30/8  |  Submit date:2024/06/11
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
Authors:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  Favorite  |  View/Download:41/17  |  Submit date:2024/06/11