已选(0)清除
条数/页: 排序方式: |
| Learning State-Specific Action Masks for Reinforcement Learning 期刊论文 Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60 作者: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:37/15  |  提交时间:2024/07/05 reinforcement learning exploration efficiency space reduction |
| NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks 会议论文 , Queensland, Australia, 2023-6 作者: Hu GZ(胡光政); Li HR(李浩然); Liu SS(刘莎莎); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(2785Kb)  |  收藏  |  浏览/下载:33/9  |  提交时间:2024/07/04 |
| An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文 , 线上, 2020-5 作者: Liu MS(刘民颂); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(727Kb)  |  收藏  |  浏览/下载:24/10  |  提交时间:2024/07/04 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:21/9  |  提交时间:2024/06/25 |
| User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文 , 新加坡, May 13 - 17, 2024 作者: Zhang, Zhiyuan; Zhang, Qichao; Wu, Xiaoxu; Shi, Xiaowen; Liao, Guogang; Wang, Yongkong; Wang, xingxing; Zhao, Dongbin Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:37/16  |  提交时间:2024/06/25 Ads Allocation Reinforcement Learning User Response Modeling |
| Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning 会议论文 , Chongqing, China, 2023-11 作者: Shen Liancheng; Su Jianhua; Zhang Xiaodong Adobe PDF(254Kb)  |  收藏  |  浏览/下载:40/20  |  提交时间:2024/06/24 —Robot Peg-in-hole Insertion Reinforcement Learning Meta-Reinforcement Learning |
| 基于视觉表征的深度强化学习方法 学位论文 , 2024 作者: 刘民颂 Adobe PDF(10778Kb)  |  收藏  |  浏览/下载:46/4  |  提交时间:2024/06/22 深度强化学习,视觉表征学习,自监督学习,状态抽象,Transformer神经网络 |
| Joint caching and transmission in the mobile edge network: An multi-agent learning approach 会议论文 , Madrid, Spain, 2021-12-7 作者: Mi,Qirui; Yang,Ning; Zhang,Haifeng; Zhang,Haijun; Wang,Jun Adobe PDF(1724Kb)  |  收藏  |  浏览/下载:44/11  |  提交时间:2024/06/05 |
| Continuous Exploration via Multiple Perspectives in Sparse Reward Environment 会议论文 , 厦门国际会议中心, 2023-10-13 作者: Chen ZP(陈忠鹏); Guan Q(关强) Adobe PDF(2260Kb)  |  收藏  |  浏览/下载:39/12  |  提交时间:2024/06/04 Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation |
| Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文 Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8 作者: He SQ(何少钦); Gao Y(高阳); Zhang BF(张保丰); Chang H(常惠); Zhang XC(张鑫辰) Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:62/21  |  提交时间:2024/05/31 Air Combat, Reinforcement Learning, Neural Fictitious Self-Play. |