已选(0)清除
条数/页: 排序方式: |
| Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning 期刊 创刊日期: 2018, 主办者: Liu BY(刘博寅) Adobe PDF(5797Kb)  |  收藏  |  浏览/下载:22/5  |  提交时间:2024/07/12 |
| Learning to Play Football from Sports Perspective: A Knowledge-embedded Deep Reinforcement Learning Framework 期刊论文 IEEE Transactions on Games, 2022, 页码: 12 作者: Liu BY(刘博寅) Adobe PDF(2957Kb)  |  收藏  |  浏览/下载:22/5  |  提交时间:2024/07/12 |
| QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12 作者: Liu BY(刘博寅) Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:16/2  |  提交时间:2024/07/12 |
| Learning State-Specific Action Masks for Reinforcement Learning 期刊论文 Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60 作者: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:30/12  |  提交时间:2024/07/05 reinforcement learning exploration efficiency space reduction |
| Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文 , 日本, 2024-6 作者: Zhang Qingyang; Xu Bo Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:20/7  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:34/13  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:19/9  |  提交时间:2024/06/25 |
| Hitch-Hiking Motion of Multiple Bionic Robotic Remoras with Enhanced Multimodal Locomotion 期刊论文 IEEE-ASME Transactions on Mechatronics, 2024, 页码: 1-11 作者: Wu, Zhengxing; Yu, Lianyi; Wang, Jian; Dai, Shijie; Tan, Min; Yu, Junzhi Adobe PDF(4893Kb)  |  收藏  |  浏览/下载:54/30  |  提交时间:2024/06/24 |
| BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision 期刊论文 International Journal of Computer Vision, 2024, 卷号: 132, 页码: 1659-1684 作者: Xin Zhao; Shiyu Hu; Yipei Wang; Zhang Jing; Yimin Hu; Rongshuai Liu; Haibin Ling; Yin Li; Renshu Li; Kun Liu; Jiadong Li Adobe PDF(9076Kb)  |  收藏  |  浏览/下载:27/7  |  提交时间:2024/06/21 |
| A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance 会议论文 , online, 2022-2 作者: Xu YF(徐一凡); Pu ZQ(蒲志强); Wu SG(吴士广); Liu BY(刘博寅); Yi JQ(易建强); Geng HJ(耿虎军); Chai XH(柴兴华) Adobe PDF(9582Kb)  |  收藏  |  浏览/下载:20/5  |  提交时间:2024/06/21 |