已选(0)清除
条数/页: 排序方式: |
| An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文 , 线上, 2020-5 作者: Liu MS(刘民颂) ; Zhu YH(朱圆恒) ; Zhao DB(赵冬斌)![](/image/person.jpg)
Adobe PDF(727Kb)  |   收藏  |  浏览/下载:23/10  |  提交时间:2024/07/04 |
| User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文 , 新加坡, May 13 - 17, 2024 作者: Zhang, Zhiyuan ; Zhang, Qichao ; Wu, Xiaoxu; Shi, Xiaowen; Liao, Guogang; Wang, Yongkong; Wang, xingxing; Zhao, Dongbin![](/image/person.jpg)
Adobe PDF(2077Kb)  |   收藏  |  浏览/下载:37/16  |  提交时间:2024/06/25 Ads Allocation Reinforcement Learning User Response Modeling |
| Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning 会议论文 , Chongqing, China, 2023-11 作者: Shen Liancheng ; Su Jianhua ; Zhang Xiaodong
Adobe PDF(254Kb)  |   收藏  |  浏览/下载:38/20  |  提交时间:2024/06/24 —Robot Peg-in-hole Insertion Reinforcement Learning Meta-Reinforcement Learning |
| Learning Playing Piano with Bionic-Constrained Diffusion Policy for Anthropomorphic Hand 期刊论文 Cyborg and Bionic Systems, 2024, 卷号: 5, 页码: 0104 作者: Yang YM(杨依明) ; Wang ZC(王泽昌); Xing DP(邢登鹏) ; Wang P(王鹏)![](/image/person.jpg)
Adobe PDF(3500Kb)  |   收藏  |  浏览/下载:35/15  |  提交时间:2024/05/30 |
| D2AH-PPO: Playing ViZDoom With Object-Aware Hierarchical Reinforcement Learning 会议论文 , 中国重庆, 2024.5.7-5.9 作者: Niu LY(钮龙宇) ; Wan J(万军)![](/image/person.jpg)
Adobe PDF(1645Kb)  |   收藏  |  浏览/下载:48/11  |  提交时间:2024/05/28 深度强化学习 表征学习 分层学习 |
| Explainable Reinforcement Learning via a Causal World Model 会议论文 Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22 作者: Yu ZY(余忠蔚) ; Ruan JQ(阮景晴); Xing DP(邢登鹏)![](/image/person.jpg)
Adobe PDF(850Kb)  |   收藏  |  浏览/下载:53/24  |  提交时间:2024/05/28 强化学习 可解释人工智能 因果推理 |
| Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文 , 长沙, 2023-11 作者: Kaishen Wang ; Jingqing Ruan; Qingyang Zhang ; Dengpeng Xing![](/image/person.jpg)
Adobe PDF(2044Kb)  |   收藏  |  浏览/下载:40/22  |  提交时间:2024/05/28 |
| Learning Transformer-based Cooperation for Networked Traffic Signal Control 会议论文 , Macau, China, 2022-10 作者: Zhao, Chen ; Dai, Xingyuan ; Wang, Xiao ; Li, Lingxi; Lv, Yisheng ; Wang, Fei-Yue![](/image/person.jpg)
Adobe PDF(1431Kb)  |   收藏  |  浏览/下载:45/17  |  提交时间:2024/05/28 |
| Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information 会议论文 , Jinghong, China, 05-09 December 2022 作者: Junhang Wei ; Shaowei Cui ; Peng Hao ; Shuo Wang
Adobe PDF(933Kb)  |   收藏  |  浏览/下载:191/61  |  提交时间:2023/10/25 |
| Stable Training of Bellman Error in Reinforcement Learning 会议论文 , Thailand, November 18–22 作者: Gong C(龚晨) ; Bai YP(白云鹏) ; Hou XW(侯新文) ; Ji XH(季晓慧)
Adobe PDF(2416Kb)  |   收藏  |  浏览/下载:132/37  |  提交时间:2023/06/27 |