| Tacit Commitments Emergence in Multi-agent Reinforcement Learning. Conference paper, New Delhi, India, 2023-7. Authors: Liu BY(刘博寅); Zhiqiang Pu; Junlong Gao; Jianqiang Yi; Zhenyu Guo
Adobe PDF(932Kb)  |  Views/Downloads: 28/10  |  Submitted: 2024/07/15 |
| Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL. Conference paper, Nanjing, 2023-11-27. Authors: Yuqiao Wu; Haifeng Zhang; Jun Wang
Adobe PDF(1339Kb)  |  Views/Downloads: 33/10  |  Submitted: 2024/07/12 |
| On the Effects of Structural Modeling for Neural Semantic Parsing. Conference paper, Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12. Authors: Zhang X(张翔); He SZ(何世柱); Liu K(刘康); Zhao J(赵军)
Adobe PDF(730Kb)  |  Views/Downloads: 45/23  |  Submitted: 2024/06/27 |
| A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance. Conference paper, online, 2022-2. Authors: Xu YF(徐一凡); Pu ZQ(蒲志强); Wu SG(吴士广); Liu BY(刘博寅); Yi JQ(易建强); Geng HJ(耿虎军); Chai XH(柴兴华)
Adobe PDF(9582Kb)  |  Views/Downloads: 31/10  |  Submitted: 2024/06/21 |
| Query2Triple: Unified Query Encoding for Answering Diverse Complex Queries over Knowledge Graphs. Conference paper, Singapore, 2023.11.06-2023.11.10. Authors: Yao Xu; Shizhu HE; Cunguang Wang; Li Cai; Kang Liu; Jun Zhao
Adobe PDF(811Kb)  |  Views/Downloads: 43/15  |  Submitted: 2024/06/20 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts. Conference paper, Torino (Italia), 2024.5.20 - 2024.5.25. Authors: Xiang Li; Shizhu He; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao
Adobe PDF(1062Kb)  |  Views/Downloads: 44/12  |  Submitted: 2024/06/20 |
| Learning in bi-level markov games. Conference paper, Padua, Italy, 2022.7.18-2022.7.23. Authors: Meng Linghui; Ruan Jingqing; Xing Dengpeng; Xu Bo
Adobe PDF(1450Kb)  |  Views/Downloads: 54/23  |  Submitted: 2024/06/11 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training. Conference paper, London, United Kingdom, 2023.5.29-2023.6.2. Authors: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo
Adobe PDF(1302Kb)  |  Views/Downloads: 42/12  |  Submitted: 2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning. Conference paper, Turin, Italy, 2023.9.18-2023.9.22. Authors: Meng Linghui; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi; Xing Dengpeng; Xu Bo
Adobe PDF(841Kb)  |  Views/Downloads: 57/22  |  Submitted: 2024/06/11 |
| Alignment Rationale for Natural Language Inference. Conference paper, Online, 2021-8-1. Authors: Zhongtao Jiang; Yuanzhe Zhang; Zhao Yang; Jun Zhao; Kang Liu
Adobe PDF(1280Kb)  |  Views/Downloads: 55/20  |  Submitted: 2024/06/06 |