已选(0)清除
条数/页: 排序方式: |
| Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文 , Nanjing, 2023-11-27 作者: Yuqiao Wu; Haifeng Zhang; Jun Wang Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/07/12 |
| Learning State-Specific Action Masks for Reinforcement Learning 期刊论文 Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60 作者: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:50/22  |  提交时间:2024/07/05 reinforcement learning exploration efficiency space reduction |
| On the Effects of Structural Modeling for Neural Semantic Parsing 会议论文 Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12 作者: Zhang X(张翔); He SZ(何世柱); Liu K(刘康); Zhao J(赵军) Adobe PDF(730Kb)  |  收藏  |  浏览/下载:45/23  |  提交时间:2024/06/27 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:29/13  |  提交时间:2024/06/25 |
| MULFE: A Multi-Level Benchmark for Free Text Model Editing 会议论文 , Bangkok, Thailand, 2024-08 作者: Wang, Chenhao; Cao, Pengfei; Jin, Zhuoran; Chen, Yubo; Zeng, Daojian; Liu, Kang; Zhao, Jun Adobe PDF(571Kb)  |  收藏  |  浏览/下载:33/13  |  提交时间:2024/06/25 |
| LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation 会议论文 , Singapore, 2023-12 作者: Zhitao He; Pengfei Cao; Yubo Chen; Kang Liu; Jun Zhao Adobe PDF(1153Kb)  |  收藏  |  浏览/下载:28/8  |  提交时间:2024/06/25 |
| BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision 期刊论文 International Journal of Computer Vision, 2024, 卷号: 132, 页码: 1659-1684 作者: Xin Zhao; Shiyu Hu; Yipei Wang; Zhang Jing; Yimin Hu; Rongshuai Liu; Haibin Ling; Yin Li; Renshu Li; Kun Liu; Jiadong Li Adobe PDF(9076Kb)  |  收藏  |  浏览/下载:40/11  |  提交时间:2024/06/21 |
| Bidirectional Sentence Ordering with Interactive Decoding 期刊论文 ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 2, 页码: 1-15 作者: Guirong Bai; Shizhu HE; Kang Liu; Jun Zhao Adobe PDF(1080Kb)  |  收藏  |  浏览/下载:44/16  |  提交时间:2024/06/20 |
| Query2Triple: Unified Query Encoding for Answering Diverse Complex Queries over Knowledge Graphs 会议论文 , Singapore, 2023.11.06-2023.11.10 作者: Yao Xu; Shizhu HE; Cunguang Wang; Li Cai; Kang Liu; Jun Zhao Adobe PDF(811Kb)  |  收藏  |  浏览/下载:43/15  |  提交时间:2024/06/20 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文 , Torino (Italia), 2024.5.20 - 2024.5.25 作者: Xiang Li; Shizhu He; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao Adobe PDF(1062Kb)  |  收藏  |  浏览/下载:44/12  |  提交时间:2024/06/20 |