| Tacit Commitments Emergence in Multi-agent Reinforcement Learning Conference paper, New Delhi, India, 2023-7 Authors: Liu BY(刘博寅); Zhiqiang Pu; Junlong Gao; Jianqiang Yi; Zhenyu Guo
Adobe PDF(932Kb)  |  Views/Downloads: 5/3  |  Submitted: 2024/07/15 |
| Improved Self-Propelled Swarms Model with Enhanced Convergence Efficiency Conference paper, Tianjin, China, 2020 Authors: Boyin Liu; Zhiqiang Pu; Shiguang Wu; Lele Wang
Adobe PDF(210Kb)  |  Views/Downloads: 14/7  |  Submitted: 2024/07/12 |
| Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL Conference paper, Nanjing, 2023-11-27 Authors: Yuqiao Wu; Haifeng Zhang; Jun Wang
Adobe PDF(1339Kb)  |  Views/Downloads: 19/4  |  Submitted: 2024/07/12 |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs Conference paper, Australia, 2023-6 Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF(7948Kb)  |  Views/Downloads: 33/13  |  Submitted: 2024/06/25 Keywords: reinforcement learning, hierarchical reinforcement learning |
| MULFE: A Multi-Level Benchmark for Free Text Model Editing Conference paper, Bangkok, Thailand, 2024-08 Authors: Wang, Chenhao; Cao, Pengfei; Jin, Zhuoran; Chen, Yubo; Zeng, Daojian; Liu, Kang; Zhao, Jun
Adobe PDF(571Kb)  |  Views/Downloads: 19/8  |  Submitted: 2024/06/25 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts Conference paper, Torino (Italia), 2024.5.20 - 2024.5.25 Authors: Xiang Li; Shizhu He; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao
Adobe PDF(1062Kb)  |  Views/Downloads: 30/6  |  Submitted: 2024/06/20 |
| Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering Conference paper, Bangkok, Thailand, 2024.08.11-2024.08.16 Authors: Xiang Li; Shizhu He; Fangyu Lei; Jun Yang; Tianhuang Su; Kang Liu; Jun Zhao
Adobe PDF(873Kb)  |  Views/Downloads: 37/13  |  Submitted: 2024/06/20 |
| Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR Conference paper, Indore, India, 2022.11.28 Authors: Wang FY(王方圆); Xu B(徐波)
Adobe PDF(1374Kb)  |  Views/Downloads: 42/14  |  Submitted: 2024/06/13 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training Conference paper, London, United Kingdom, 2023.5.29-2023.6.2 Authors: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo
Adobe PDF(1302Kb)  |  Views/Downloads: 30/8  |  Submitted: 2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning Conference paper, Turin, Italy, 2023.9.18-2023.9.22 Authors: Meng Linghui; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi; Xing Dengpeng; Xu Bo
Adobe PDF(841Kb)  |  Views/Downloads: 42/17  |  Submitted: 2024/06/11 |