| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs Conference paper, Australia, 2023-6 Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  Views/Downloads: 11/5  |  Submitted: 2024/06/25 Keywords: reinforcement learning, hierarchical reinforcement learning |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning Journal article, Machine Intelligence Research, 2023, page: 158 Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  Views/Downloads: 8/5  |  Submitted: 2024/06/25 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction Conference paper, Greece, 2023-5 Authors: Xu YF(徐一凡); Pu ZQ(蒲志强); Cai QA(蔡奇昂); Li FM(李非墨); Chai XH(柴兴华) Adobe PDF(7610Kb)  |  Views/Downloads: 8/4  |  Submitted: 2024/06/21 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training Conference paper, London, United Kingdom, 2023.5.29-2023.6.2 Authors: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo Adobe PDF(1302Kb)  |  Views/Downloads: 20/5  |  Submitted: 2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning Conference paper, Turin, Italy, 2023.9.18-2023.9.22 Authors: Meng Linghui; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi; Xing Dengpeng; Xu Bo Adobe PDF(841Kb)  |  Views/Downloads: 26/10  |  Submitted: 2024/06/11 |
| Generative Calibration for In-context Learning Conference paper, Singapore, 2023-10-6 Authors: Zhongtao Jiang; Yuanzhe Zhang; Cao Liu; Jun Zhao; Kang Liu Adobe PDF(763Kb)  |  Views/Downloads: 21/7  |  Submitted: 2024/06/06 |
| Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach Conference paper, Singapore, 2023/8/24-27 Authors: Chen, Shuo; Yang, Ning; Zhang, Meng; Wang, Jun Adobe PDF(1413Kb)  |  Views/Downloads: 36/9  |  Submitted: 2024/06/05 |
| Multi-objective Deep Reinforcement Learning for Mobile Edge Computing Conference paper, Singapore, 2023/8/24-27 Authors: Yang, Ning; Wen, Junrui; Zhang, Meng; Tang, Ming Adobe PDF(499Kb)  |  Views/Downloads: 35/12  |  Submitted: 2024/06/05 Keywords: mobile edge computing multi-objective reinforcement learning resource scheduling |
| Fault Diagnosis for Robotic Fish Sensors based on Spatial Domain Image Fusion and Convolution Neural Network Conference paper, Tianjin, China, 2023-7 Authors: Xuqing Fan; Sai Deng; Junfeng Fan; Chao Zhou; Zhengxing Wu; Yaming Ou; Bin Zhang Adobe PDF(1492Kb)  |  Views/Downloads: 26/8  |  Submitted: 2024/06/05 Keywords: Fault Diagnosis GAF Fusion CNN Robotic Fish |
| Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning Conference paper, Gold Coast, Australia, 2023-6 Authors: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Xiaolin Ai; Wanmai Yuan Adobe PDF(25675Kb)  |  Views/Downloads: 27/4  |  Submitted: 2024/06/05 |