已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:37/14  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Learning in bi-level markov games 会议论文 , Padua, Italy, 2022.7.18-2022.7.23 作者: Meng Linghui ; Ruan Jingqing; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1450Kb)  |   收藏  |  浏览/下载:45/18  |  提交时间:2024/06/11 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui ; Ruan Jingqing; Xiong Xuantang; Li Xiyun ; Zhang Xi; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:36/11  |  提交时间:2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文 , Turin, Italy, 2023.9.18-2023.9.22 作者: Meng Linghui ; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi ; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(841Kb)  |   收藏  |  浏览/下载:46/18  |  提交时间:2024/06/11 |
| A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文 , Seoul, Korea, 2024.4.14-2024.4.19 作者: Meng Linghui ; Zhang Xi; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(964Kb)  |   收藏  |  浏览/下载:47/18  |  提交时间:2024/06/11 |
| Efficient Spatiotemporal Transformer for Robotic Reinforcement Learning 期刊论文 IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 卷号: 7, 期号: 3, 页码: 7982-7989 作者: Yang YM(杨依明) ; Xing DP(邢登鹏) ; Xu B(徐波)![](/image/person.jpg)
Adobe PDF(2469Kb)  |   收藏  |  浏览/下载:47/16  |  提交时间:2024/05/29 |
| Enhancing Multi-agent Coordination via Dual-channel Consensus 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 349-368 作者: Qingyang Zhang ; Kaishen Wang ; Jingqing Ruan; Yiming Yang ; Dengpeng Xing ; Bo Xu![](/image/person.jpg)
Adobe PDF(4997Kb)  |   收藏  |  浏览/下载:77/27  |  提交时间:2024/04/23 Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency |
| Offline Pre-trained Multi-agent Decision Transformer 期刊论文 Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 233-248 作者: Linghui Meng ; Muning Wen; Chenyang Le; Xiyun Li ; Dengpeng Xing ; Weinan Zhang; Ying Wen; Haifeng Zhang; Jun Wang; Yaodong Yang; Bo Xu![](/image/person.jpg)
Adobe PDF(2121Kb)  |   收藏  |  浏览/下载:62/15  |  提交时间:2024/04/23 Pre-training model multi-agent reinforcement learning (MARL) decision making transformer offline reinforcement
learning |
| DMRM: A Dual-Channel Multi-Hop Reasoning Model for Visual Dialog 会议论文 , 美国纽约, 2020.2 作者: Feilong Chen ; Fandong Meng; Jiaming Xu ; Peng Li; Bo Xu ; Jie Zhou
Adobe PDF(3052Kb)  |   收藏  |  浏览/下载:150/35  |  提交时间:2023/06/07 |
| A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments 会议论文 , Shanghai, China, October 25–29, 2020 作者: Yunzhe Hao ; Jiaming Xu ; Jing Shi ; Peng Zhang ; Lei Qin; Bo Xu![](/image/person.jpg)
Adobe PDF(399Kb)  |   收藏  |  浏览/下载:253/63  |  提交时间:2022/06/23 |