已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:19/7  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang ; Zhang Hongming; Xing Dengpeng ; Bo Xu![](/image/person.jpg)
Adobe PDF(9639Kb)  |   收藏  |  浏览/下载:14/7  |  提交时间:2024/06/25 |
| Learning in bi-level markov games 会议论文 , Padua, Italy, 2022.7.18-2022.7.23 作者: Meng Linghui ; Ruan Jingqing; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1450Kb)  |   收藏  |  浏览/下载:35/12  |  提交时间:2024/06/11 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui ; Ruan Jingqing; Xiong Xuantang; Li Xiyun ; Zhang Xi; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:23/5  |  提交时间:2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文 , Turin, Italy, 2023.9.18-2023.9.22 作者: Meng Linghui ; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi ; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(841Kb)  |   收藏  |  浏览/下载:33/13  |  提交时间:2024/06/11 |
| A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文 , Seoul, Korea, 2024.4.14-2024.4.19 作者: Meng Linghui ; Zhang Xi; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(964Kb)  |   收藏  |  浏览/下载:29/10  |  提交时间:2024/06/11 |
| Learning Playing Piano with Bionic-Constrained Diffusion Policy for Anthropomorphic Hand 期刊论文 Cyborg and Bionic Systems, 2024, 卷号: 5, 页码: 0104 作者: Yang YM(杨依明) ; Wang ZC(王泽昌); Xing DP(邢登鹏) ; Wang P(王鹏)![](/image/person.jpg)
Adobe PDF(3500Kb)  |   收藏  |  浏览/下载:26/10  |  提交时间:2024/05/30 |
| Efficient Spatiotemporal Transformer for Robotic Reinforcement Learning 期刊论文 IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 卷号: 7, 期号: 3, 页码: 7982-7989 作者: Yang YM(杨依明) ; Xing DP(邢登鹏) ; Xu B(徐波)![](/image/person.jpg)
Adobe PDF(2469Kb)  |   收藏  |  浏览/下载:39/13  |  提交时间:2024/05/29 |
| Explainable Reinforcement Learning via a Causal World Model 会议论文 Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22 作者: Yu ZY(余忠蔚) ; Ruan JQ(阮景晴); Xing DP(邢登鹏)![](/image/person.jpg)
Adobe PDF(850Kb)  |   收藏  |  浏览/下载:32/13  |  提交时间:2024/05/28 强化学习 可解释人工智能 因果推理 |
| Learning Causal Dynamics Models in Object-Oriented Environments 会议论文 Proceedings of the 41st International Conference on Machine Learning, 奥地利, 维也纳, 2024-07-21 作者: Yu ZY(余忠蔚) ; Ruan JQ(阮景晴); Xing DP(邢登鹏)![](/image/person.jpg)
Adobe PDF(2176Kb)  |   收藏  |  浏览/下载:33/11  |  提交时间:2024/05/28 强化学习 因果模型 |