| Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning. Journal. First published: 2018. Sponsor: Liu BY(刘博寅)
Adobe PDF(5797Kb)  |  Views/Downloads: 23/5  |  Submitted: 2024/07/12 |
| Learning State-Specific Action Masks for Reinforcement Learning. Journal article. Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60. Authors: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang
Adobe PDF(2976Kb)  |  Views/Downloads: 36/15  |  Submitted: 2024/07/05  |  Keywords: reinforcement learning; exploration efficiency; space reduction |
| On the Effects of Structural Modeling for Neural Semantic Parsing. Conference paper. Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12. Authors: Zhang X(张翔); He SZ(何世柱); Liu K(刘康); Zhao J(赵军)
Adobe PDF(730Kb)  |  Views/Downloads: 36/20  |  Submitted: 2024/06/27 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning. Journal article. Machine Intelligence Research, 2023, Pages: 158. Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF(9639Kb)  |  Views/Downloads: 21/9  |  Submitted: 2024/06/25 |
| MULFE: A Multi-Level Benchmark for Free Text Model Editing. Conference paper, Bangkok, Thailand, 2024-08. Authors: Wang, Chenhao; Cao, Pengfei; Jin, Zhuoran; Chen, Yubo; Zeng, Daojian; Liu, Kang; Zhao, Jun
Adobe PDF(571Kb)  |  Views/Downloads: 23/9  |  Submitted: 2024/06/25 |
| Cluster Constraint Based Sparse NMF for Hyperspectral Imagery Unmixing. Conference paper, Paris, France, October 27-30. Authors: Jiang XW(蒋心为); Ma L(马雷); Yang YP(杨一平)
Adobe PDF(261Kb)  |  Views/Downloads: 31/14  |  Submitted: 2024/06/24 |
| BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision. Journal article. International Journal of Computer Vision, 2024, Volume: 132, Pages: 1659-1684. Authors: Xin Zhao; Shiyu Hu; Yipei Wang; Zhang Jing; Yimin Hu; Rongshuai Liu; Haibin Ling; Yin Li; Renshu Li; Kun Liu; Jiadong Li
Adobe PDF(9076Kb)  |  Views/Downloads: 32/8  |  Submitted: 2024/06/21 |
| Learning to Align Question and Answer Utterances in Customer Service Conversation with Recurrent Pointer Networks. Conference paper, Honolulu, Hawaii, USA, 2019.01.27 - 2019.02.01. Authors: Shizhu HE; Kang Liu; Weiting An
Adobe PDF(1562Kb)  |  Views/Downloads: 48/18  |  Submitted: 2024/06/20 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts. Conference paper, Torino (Italia), 2024.5.20 - 2024.5.25. Authors: Xiang Li; Shizhu He; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao
Adobe PDF(1062Kb)  |  Views/Downloads: 35/9  |  Submitted: 2024/06/20 |
| Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering. Conference paper, Bangkok, Thailand, 2024.08.11-2024.08.16. Authors: Xiang Li; Shizhu HE; Fangyu Lei; Jun Yang; Tianhuang Su; Kang Liu; Jun Zhao
Adobe PDF(873Kb)  |  Views/Downloads: 42/15  |  Submitted: 2024/06/20 |