已选(0)清除
条数/页: 排序方式: |
| An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文 , 线上, 2020-5 作者: Liu MS(刘民颂); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(727Kb)  |  收藏  |  浏览/下载:15/7  |  提交时间:2024/07/04 |
| 面向多机器人博弈的深度强化学习方法 学位论文 , 2024 作者: 胡光政 Adobe PDF(17740Kb)  |  收藏  |  浏览/下载:21/0  |  提交时间:2024/07/04 多智能体深度强化学习 多机器人博弈 极小极大Q学习 值分解 最大熵 |
| Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文 , 日本, 2024-6 作者: Zhang Qingyang; Xu Bo Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:14/5  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:19/7  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:13/6  |  提交时间:2024/06/25 |
| User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文 , 新加坡, May 13 - 17, 2024 作者: Zhang, Zhiyuan; Zhang, Qichao; Wu, Xiaoxu; Shi, Xiaowen; Liao, Guogang; Wang, Yongkong; Wang, xingxing; Zhao, Dongbin Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:21/8  |  提交时间:2024/06/25 Ads Allocation Reinforcement Learning User Response Modeling |
| 基于用户行为预测和强化学习的推荐策略研究 学位论文 , 2024 作者: 张志远 Adobe PDF(3505Kb)  |  收藏  |  浏览/下载:12/1  |  提交时间:2024/06/25 强化学习 推荐系统 用户行为建模 |
| A Portable Robot-Assisted Device With Built-In Intelligence for Autonomous Ultrasound Acquisitions in Follow-Up Diagnosis 期刊论文 IEEE Transactions on Instrumentation and Measurement, 2024, 页码: 1-10 作者: Deng ZK(邓兆锟); Hou XL(侯西龙); Chen C(陈晨); Gu XL(谷晓林); Hou ZG(侯增广); Wang SY(王双翌) Adobe PDF(6984Kb)  |  收藏  |  浏览/下载:16/7  |  提交时间:2024/06/25 |
| Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning 会议论文 , Chongqing, China, 2023-11 作者: Shen Liancheng; Su Jianhua; Zhang Xiaodong Adobe PDF(254Kb)  |  收藏  |  浏览/下载:24/11  |  提交时间:2024/06/24 —Robot Peg-in-hole Insertion Reinforcement Learning Meta-Reinforcement Learning |
| Enhancing Reinforcement Learning via Transformer-based State Predictive Representations 期刊论文 IEEE Transactions on Artificial Intelligence, 2024, 页码: 1 - 12 作者: Liu MS(刘民颂); Zhu YH(朱圆恒); Chen YR(陈亚冉); Zhao DB(赵冬斌) Adobe PDF(1162Kb)  |  收藏  |  浏览/下载:22/5  |  提交时间:2024/06/24 |