| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs. Conference paper, Australia, 2023-6. Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo. Adobe PDF(7948Kb)  |  Views/Downloads: 17/7  |  Submitted: 2024/06/25  |  Keywords: reinforcement learning, hierarchical reinforcement learning |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning. Journal article, Machine Intelligence Research, 2023, Pages: 158. Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu. Adobe PDF(9639Kb)  |  Views/Downloads: 11/6  |  Submitted: 2024/06/25 |
| LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation. Conference paper, Singapore, 2023-12. Authors: Zhitao He; Pengfei Cao; Yubo Chen; Kang Liu; Jun Zhao. Adobe PDF(1153Kb)  |  Views/Downloads: 8/3  |  Submitted: 2024/06/25 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction. Conference paper, Greece, 2023-5. Authors: Xu YF (徐一凡); Pu ZQ (蒲志强); Cai QA (蔡奇昂); Li FM (李非墨); Chai XH (柴兴华). Adobe PDF(7610Kb)  |  Views/Downloads: 10/5  |  Submitted: 2024/06/21 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training. Conference paper, London, United Kingdom, 2023.5.29-2023.6.2. Authors: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo. Adobe PDF(1302Kb)  |  Views/Downloads: 20/5  |  Submitted: 2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning. Conference paper, Turin, Italy, 2023.9.18-2023.9.22. Authors: Meng Linghui; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi; Xing Dengpeng; Xu Bo. Adobe PDF(841Kb)  |  Views/Downloads: 31/12  |  Submitted: 2024/06/11 |
| Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach. Conference paper, Singapore, 2023/8/24-27. Authors: Chen, Shuo; Yang, Ning; Zhang, Meng; Wang, Jun. Adobe PDF(1413Kb)  |  Views/Downloads: 39/10  |  Submitted: 2024/06/05 |
| Multi-objective Deep Reinforcement Learning for Mobile Edge Computing. Conference paper, Singapore, 2023/8/24-27. Authors: Yang, Ning; Wen, Junrui; Zhang, Meng; Tang, Ming. Adobe PDF(499Kb)  |  Views/Downloads: 39/14  |  Submitted: 2024/06/05  |  Keywords: mobile edge computing, multi-objective reinforcement learning, resource scheduling |
| Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning. Conference paper, Gold Coast, Australia, 2023-6. Authors: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Xiaolin Ai; Wanmai Yuan. Adobe PDF(25675Kb)  |  Views/Downloads: 31/5  |  Submitted: 2024/06/05 |
| Learning Heterogeneous Agent Cooperation via Multiagent League Training. Journal article, IFAC World Congress, 2023, Pages: IFAC PapersOnLine 56-2 (2023) 3033-3040. Authors: Qingxu, Fu; Xiaolin Ai; Jianqiang Yi; Tenghai Qiu; Wanmai Yuan; Zhiqiang Pu. Adobe PDF(996Kb)  |  Views/Downloads: 29/8  |  Submitted: 2024/06/05 |