已选(0)清除
条数/页: 排序方式: |
| Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information 会议论文 , Jinghong, China, 05-09 December 2022 作者: Junhang Wei; Shaowei Cui; Peng Hao; Shuo Wang Adobe PDF(933Kb)  |  收藏  |  浏览/下载:136/51  |  提交时间:2023/10/25 |
| Position Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文 , Liuzhou, China, 20-22 November 2020 作者: Ma, Ruichen; Wang, Yu; Gao, Zisen; Zhao, Tianzi; Wang, Rui; Wang, Shuo; Zhou, Chao Adobe PDF(927Kb)  |  收藏  |  浏览/下载:63/28  |  提交时间:2023/08/03 |
| Second-Order Global Attention Networks for Graph Classification and Regression 会议论文 , Beijing, China, August 27-28, 2022 作者: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:172/67  |  提交时间:2023/07/06 |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博) Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:152/35  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| 基于噪声对比估计的权重自适应对抗生成式模仿学习 期刊论文 模式识别与人工智能, 2023, 卷号: 36, 期号: 4, 页码: 300-312 作者: 关伟凡; 张希 Adobe PDF(1849Kb)  |  收藏  |  浏览/下载:117/38  |  提交时间:2023/06/29 强化学习 模仿学习 噪声对比估计 自适应权重 |
| Potential Driven Reinforcement Learning for Hard Exploration Tasks 会议论文 , 线上, 2020-4 作者: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮) Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:67/23  |  提交时间:2023/06/29 |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文 , 线上, 2022-02-22 作者: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮) Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:101/39  |  提交时间:2023/06/29 |
| Pseudo Value Network Distillation for High-Performance Exploration 会议论文 , 澳大利亚, 2023-06 作者: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品) Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:126/37  |  提交时间:2023/06/28 |
| Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文 , 线上, 2021-02 作者: Zhao EM(赵恩民); Yan RY(闫仁业); Li K(李凯); Li LJ(李丽娟); Xing JL(兴军亮) Adobe PDF(413Kb)  |  收藏  |  浏览/下载:122/50  |  提交时间:2023/06/28 |
| Stable Training of Bellman Error in Reinforcement Learning 会议论文 , Thailand, November 18–22 作者: Gong C(龚晨); Bai YP(白云鹏); Hou XW(侯新文); Ji XH(季晓慧) Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:87/28  |  提交时间:2023/06/27 |