已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:33/13  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang ; Zhang Hongming; Xing Dengpeng ; Bo Xu![](/image/person.jpg)
Adobe PDF(9639Kb)  |   收藏  |  浏览/下载:19/9  |  提交时间:2024/06/25 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui ; Ruan Jingqing; Xiong Xuantang; Li Xiyun ; Zhang Xi; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:30/8  |  提交时间:2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文 , Turin, Italy, 2023.9.18-2023.9.22 作者: Meng Linghui ; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi ; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(841Kb)  |   收藏  |  浏览/下载:42/17  |  提交时间:2024/06/11 |
| Explainable Reinforcement Learning via a Causal World Model 会议论文 Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22 作者: Yu ZY(余忠蔚) ; Ruan JQ(阮景晴); Xing DP(邢登鹏)![](/image/person.jpg)
Adobe PDF(850Kb)  |   收藏  |  浏览/下载:48/22  |  提交时间:2024/05/28 强化学习 可解释人工智能 因果推理 |
| Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文 , 长沙, 2023-11 作者: Kaishen Wang ; Jingqing Ruan; Qingyang Zhang ; Dengpeng Xing![](/image/person.jpg)
Adobe PDF(2044Kb)  |   收藏  |  浏览/下载:38/21  |  提交时间:2024/05/28 |
| Offline Pre-trained Multi-agent Decision Transformer 期刊论文 Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 233-248 作者: Linghui Meng ; Muning Wen; Chenyang Le; Xiyun Li ; Dengpeng Xing ; Weinan Zhang; Ying Wen; Haifeng Zhang; Jun Wang; Yaodong Yang; Bo Xu![](/image/person.jpg)
Adobe PDF(2121Kb)  |   收藏  |  浏览/下载:58/14  |  提交时间:2024/04/23 Pre-training model multi-agent reinforcement learning (MARL) decision making transformer offline reinforcement
learning |