已选(0)清除
条数/页: 排序方式: |
| Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning 期刊 创刊日期: 2018, 主办者: Liu BY(刘博寅) Adobe PDF(5797Kb)  |  收藏  |  浏览/下载:23/5  |  提交时间:2024/07/12 |
| Learning to Play Football from Sports Perspective: A Knowledge-embedded Deep Reinforcement Learning Framework 期刊论文 IEEE Transactions on Games, 2022, 页码: 12 作者: Liu BY(刘博寅) Adobe PDF(2957Kb)  |  收藏  |  浏览/下载:30/6  |  提交时间:2024/07/12 |
| QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12 作者: Liu BY(刘博寅) Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:19/2  |  提交时间:2024/07/12 |
| Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文 , Nanjing, 2023-11-27 作者: Yuqiao Wu; Haifeng Zhang; Jun Wang Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:21/5  |  提交时间:2024/07/12 |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:37/14  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:21/9  |  提交时间:2024/06/25 |
| A Survey of Recent Advances in Commonsense Knowledge Acquisition: Methods and Resources 期刊论文 Machine Intelligence Research, 2024, 页码: 1 作者: Wang, Chenhao; Li, Jiachun; Chen, Yubo; Liu, Kang; Zhao, Jun Adobe PDF(1228Kb)  |  收藏  |  浏览/下载:22/5  |  提交时间:2024/06/25 |
| Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning 会议论文 , Torino, Italia, 2024-5 作者: Zhitao He; Pengfei Cao; Zhuoran Jin; Yubo Chen; Kang Liu; Jun Zhao Adobe PDF(794Kb)  |  收藏  |  浏览/下载:37/18  |  提交时间:2024/06/25 |
| A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance 会议论文 , online, 2022-2 作者: Xu YF(徐一凡); Pu ZQ(蒲志强); Wu SG(吴士广); Liu BY(刘博寅); Yi JQ(易建强); Geng HJ(耿虎军); Chai XH(柴兴华) Adobe PDF(9582Kb)  |  收藏  |  浏览/下载:23/7  |  提交时间:2024/06/21 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文 , Torino (Italia), 2024.5.20 - 2024.5.25 作者: Xiang Li; Shizhu He; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao Adobe PDF(1062Kb)  |  收藏  |  浏览/下载:35/9  |  提交时间:2024/06/20 |