| Learning State-Specific Action Masks for Reinforcement Learning. Journal article. Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60. Authors: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang. Adobe PDF(2976Kb) | Views/Downloads: 30/12 | Submitted: 2024/07/05 | Keywords: reinforcement learning; exploration efficiency; space reduction |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs. Conference paper, Australia, 2023-6. Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo. Adobe PDF(7948Kb) | Views/Downloads: 34/13 | Submitted: 2024/06/25 | Keywords: reinforcement learning; hierarchical reinforcement learning |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning. Journal article. Machine Intelligence Research, 2023, Pages: 158. Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu. Adobe PDF(9639Kb) | Views/Downloads: 19/9 | Submitted: 2024/06/25 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction. Conference paper, Greece, 2023-5. Authors: Xu YF(徐一凡); Pu ZQ(蒲志强); Cai QA(蔡奇昂); Li FM(李非墨); Chai XH(柴兴华). Adobe PDF(7610Kb) | Views/Downloads: 21/10 | Submitted: 2024/06/21 |
| Multi-objective Deep Reinforcement Learning for Mobile Edge Computing. Conference paper, Singapore, 2023/8/24-27. Authors: Yang, Ning; Wen, Junrui; Zhang, Meng; Tang, Ming. Adobe PDF(499Kb) | Views/Downloads: 48/18 | Submitted: 2024/06/05 | Keywords: mobile edge computing; multi-objective reinforcement learning; resource scheduling |
| Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning. Conference paper. Advanced Intelligent Computing Technology and Applications, Zhengzhou, China, 2023-8. Authors: He SQ(何少钦); Gao Y(高阳); Zhang BF(张保丰); Chang H(常惠); Zhang XC(张鑫辰). Adobe PDF(1496Kb) | Views/Downloads: 57/21 | Submitted: 2024/05/31 | Keywords: Air Combat; Reinforcement Learning; Neural Fictitious Self-Play |
| SA-MPF: A Status-Aware Mask Prediction Framework for Online Disease Diagnosis. Conference paper, Yokohama, Japan, 2024-6-30 to 2024-7-5. Authors: Zefa Hu; Linghui Meng; Yunlong Zhao; Yuanyuan Zhao; Shuang Xu; Bo Xu. Adobe PDF(307Kb) | Views/Downloads: 59/13 | Submitted: 2024/05/29 |
| Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning. Conference paper, Queensland, Australia, 2023-6. Authors: Yang, Chen; Yang, Guangkai; Chen, Hao; Zhang, Junge. Adobe PDF(3027Kb) | Views/Downloads: 58/22 | Submitted: 2024/05/29 |
| Analysis of the Total Orientation Workspace of a Type of n-PPPS Parallel Manipulator. Conference paper, Chongqing, China, 2021-7-3 to 2021-7-5. Authors: Liu Zhaoyang; Fan Junfeng; Wang Zhe; Jing Fengshui. Adobe PDF(4857Kb) | Views/Downloads: 20/7 | Submitted: 2024/05/28 |
| Design of a Robotic Fish Based on a Passive Flexible Mechanism. Conference paper, Dali, Yunnan, 2019.12.6. Authors: Lu Ben; Yuzhuo Fu; Qianqian Zou; Sai Deng; Chao Zhou. Adobe PDF(10378Kb) | Views/Downloads: 38/12 | Submitted: 2024/05/28 |