CASIA OpenIR

浏览/检索结果: 共38条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Learning Top-K Subtask Planning Tree Based on Discriminative Representation Pretraining for Decision-making 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 782-800
作者:  Jingqing Ruan;   Kaishen Wang;   Qingyang Zhang;   Dengpeng Xing;   Bo Xu
Adobe PDF(4577Kb)  |  收藏  |  浏览/下载:5/3  |  提交时间:2024/07/18
Reinforcement learning  representation learning  subtask planning  task decomposition  pretraining.  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:31/13  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:18/9  |  提交时间:2024/06/25
Learning in bi-level markov games 会议论文
, Padua, Italy, 2022.7.18-2022.7.23
作者:  Meng Linghui;  Ruan Jingqing;  Xing Dengpeng;  Xu Bo
Adobe PDF(1450Kb)  |  收藏  |  浏览/下载:40/16  |  提交时间:2024/06/11
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:29/8  |  提交时间:2024/06/11
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
作者:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  收藏  |  浏览/下载:40/17  |  提交时间:2024/06/11
A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文
, Seoul, Korea, 2024.4.14-2024.4.19
作者:  Meng Linghui;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(964Kb)  |  收藏  |  浏览/下载:43/17  |  提交时间:2024/06/11
Learning Playing Piano with Bionic-Constrained Diffusion Policy for Anthropomorphic Hand 期刊论文
Cyborg and Bionic Systems, 2024, 卷号: 5, 页码: 0104
作者:  Yang YM(杨依明);  Wang ZC(王泽昌);  Xing DP(邢登鹏);  Wang P(王鹏)
Adobe PDF(3500Kb)  |  收藏  |  浏览/下载:30/13  |  提交时间:2024/05/30
Efficient Spatiotemporal Transformer for Robotic Reinforcement Learning 期刊论文
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 卷号: 7, 期号: 3, 页码: 7982-7989
作者:  Yang YM(杨依明);  Xing DP(邢登鹏);  Xu B(徐波)
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:45/16  |  提交时间:2024/05/29
Explainable Reinforcement Learning via a Causal World Model 会议论文
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22
作者:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  收藏  |  浏览/下载:43/19  |  提交时间:2024/05/28
强化学习  可解释人工智能  因果推理