| Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning. Journal, Founding date: 2018, Sponsor: Liu BY(刘博寅)
Adobe PDF(5797Kb)  |  Views/Downloads: 23/5  |  Submitted: 2024/07/12 |
| Learning to Play Football from Sports Perspective: A Knowledge-embedded Deep Reinforcement Learning Framework. Journal article, IEEE Transactions on Games, 2022, Pages: 12. Authors: Liu BY(刘博寅)
Adobe PDF(2957Kb)  |  Views/Downloads: 30/6  |  Submitted: 2024/07/12 |
| QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning. Journal article, IEEE Transactions on Cognitive and Developmental Systems, 2024, Pages: 12. Authors: Liu BY(刘博寅)
Adobe PDF(6675Kb)  |  Views/Downloads: 19/2  |  Submitted: 2024/07/12 |
| Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL. Conference paper, Nanjing, 2023-11-27. Authors: Yuqiao Wu; Haifeng Zhang; Jun Wang
Adobe PDF(1339Kb)  |  Views/Downloads: 21/5  |  Submitted: 2024/07/12 |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs. Conference paper, Australia, 2023-6. Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF(7948Kb)  |  Views/Downloads: 37/14  |  Submitted: 2024/06/25  |  Keywords: reinforcement learning, hierarchical reinforcement learning |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning. Journal article, Machine Intelligence Research, 2023, Pages: 158. Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF(9639Kb)  |  Views/Downloads: 21/9  |  Submitted: 2024/06/25 |
| A Survey of Recent Advances in Commonsense Knowledge Acquisition: Methods and Resources. Journal article, Machine Intelligence Research, 2024, Pages: 1. Authors: Wang, Chenhao; Li, Jiachun; Chen, Yubo; Liu, Kang; Zhao, Jun
Adobe PDF(1228Kb)  |  Views/Downloads: 22/5  |  Submitted: 2024/06/25 |
| Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning. Conference paper, Torino, Italia, 2024-5. Authors: Zhitao He; Pengfei Cao; Zhuoran Jin; Yubo Chen; Kang Liu; Jun Zhao
Adobe PDF(794Kb)  |  Views/Downloads: 37/18  |  Submitted: 2024/06/25 |
| A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance. Conference paper, online, 2022-2. Authors: Xu YF(徐一凡); Pu ZQ(蒲志强); Wu SG(吴士广); Liu BY(刘博寅); Yi JQ(易建强); Geng HJ(耿虎军); Chai XH(柴兴华)
Adobe PDF(9582Kb)  |  Views/Downloads: 23/7  |  Submitted: 2024/06/21 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts. Conference paper, Torino (Italia), 2024.5.20 - 2024.5.25. Authors: Xiang Li; Shizhu He; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao
Adobe PDF(1062Kb)  |  Views/Downloads: 35/9  |  Submitted: 2024/06/20 |