已选(0)清除
条数/页: 排序方式: |
| Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning 期刊 创刊日期: 2018, 主办者: Liu BY(刘博寅)
Adobe PDF(5797Kb)  |   收藏  |  浏览/下载:31/8  |  提交时间:2024/07/12 |
| Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文 , Nanjing, 2023-11-27 作者: Yuqiao Wu ; Haifeng Zhang; Jun Wang
Adobe PDF(1339Kb)  |   收藏  |  浏览/下载:32/10  |  提交时间:2024/07/12 |
| On the Effects of Structural Modeling for Neural Semantic Parsing 会议论文 Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12 作者: Zhang X(张翔) ; He SZ(何世柱) ; Liu K(刘康) ; Zhao J(赵军)![](/image/person.jpg)
Adobe PDF(730Kb)  |   收藏  |  浏览/下载:45/23  |  提交时间:2024/06/27 |
| MULFE: A Multi-Level Benchmark for Free Text Model Editing 会议论文 , Bangkok, Thailand, 2024-08 作者: Wang, Chenhao ; Cao, Pengfei ; Jin, Zhuoran ; Chen, Yubo ; Zeng, Daojian; Liu, Kang ; Zhao, Jun![](/image/person.jpg)
Adobe PDF(571Kb)  |   收藏  |  浏览/下载:33/13  |  提交时间:2024/06/25 |
| Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning 会议论文 , Torino, Italia, 2024-5 作者: Zhitao He ; Pengfei Cao ; Zhuoran Jin ; Yubo Chen ; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(794Kb)  |   收藏  |  浏览/下载:50/25  |  提交时间:2024/06/25 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文 , Torino (Italia), 2024.5.20 - 2024.5.25 作者: Xiang Li ; Shizhu He ; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(1062Kb)  |   收藏  |  浏览/下载:44/12  |  提交时间:2024/06/20 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui ; Ruan Jingqing; Xiong Xuantang; Li Xiyun ; Zhang Xi; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:42/12  |  提交时间:2024/06/11 |
| A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文 , Seoul, Korea, 2024.4.14-2024.4.19 作者: Meng Linghui ; Zhang Xi; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(964Kb)  |   收藏  |  浏览/下载:54/22  |  提交时间:2024/06/11 |
| Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文 Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9 作者: Wang, Zhipeng ; Lin, Runji ; Zhao, Zhiyu; Chen, Xu; Guo, Pengming; Yang, Ning ; Wang,Zhicheng; Fan, Dixia
Adobe PDF(1892Kb)  |   收藏  |  浏览/下载:61/17  |  提交时间:2024/06/07 |
| 稀疏奖励环境下基于自博弈框架的智能空战算法研究 学位论文 , 2024 作者: 何少钦![](/image/person.jpg)
Adobe PDF(4570Kb)  |   收藏  |  浏览/下载:58/1  |  提交时间:2024/05/30 强化学习,离线强化学习,空战,智能决策,好奇心机制 |