已选(0)清除
条数/页: 排序方式: |
| Exploration via Joint Policy Diversity for Sparse-Reward Multi-Agent Tasks 会议论文 , Macao, China, 2023-8 作者: Pei Xu; Junge Zhang; Kaiqi Huang Adobe PDF(1369Kb)  |  收藏  |  浏览/下载:213/67  |  提交时间:2023/06/19 |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博) Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:156/35  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Counterfactual Debiasing for Fact Verification 会议论文 , Toronto, Canada, 7.9-7.14, 2023 作者: Xu WZ(许伟志); Liu Q(刘强); Wu S(吴书); Wang L(王亮) Adobe PDF(1287Kb)  |  收藏  |  浏览/下载:151/43  |  提交时间:2023/06/26 |
| 基于噪声对比估计的权重自适应对抗生成式模仿学习 期刊论文 模式识别与人工智能, 2023, 卷号: 36, 期号: 4, 页码: 300-312 作者: 关伟凡; 张希 Adobe PDF(1849Kb)  |  收藏  |  浏览/下载:117/38  |  提交时间:2023/06/29 强化学习 模仿学习 噪声对比估计 自适应权重 |
| Pseudo Value Network Distillation for High-Performance Exploration 会议论文 , 澳大利亚, 2023-06 作者: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品) Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:127/37  |  提交时间:2023/06/28 |
| Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文 , Washington DC, USA, 2023-2-7 作者: Pei Xu; Junge Zhang; Qiyue Yin; Chao Yu; Yaodong Yang; Kaiqi Huang Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:188/60  |  提交时间:2023/06/19 deep reinforcement learning sparse reward exploration multi-agent |
| Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information 会议论文 , Jinghong, China, 05-09 December 2022 作者: Junhang Wei; Shaowei Cui; Peng Hao; Shuo Wang Adobe PDF(933Kb)  |  收藏  |  浏览/下载:136/51  |  提交时间:2023/10/25 |
| Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning 会议论文 , 中国桂林, 2022-7-9 作者: Shan QF(单钦锋); Wang WJ(王伟杰); Guo DF(郭丁飞); Sun XR(孙向荣); Jia LH(贾立好) Adobe PDF(494Kb)  |  收藏  |  浏览/下载:100/29  |  提交时间:2023/06/05 Deep learning Mechatronics Navigation Reinforcement learning Cost function Real-time systems Trajectory |
| Learning Hierarchical Graph Convolutional Neural Network for Object Navigation 会议论文 , 西英格兰大学计算机科学与创新技术系, 2022年9月6日-2022年9月9日 作者: Tao Xu; Xu Yang; Suiwu Zheng Adobe PDF(1220Kb)  |  收藏  |  浏览/下载:136/46  |  提交时间:2023/05/31 |
| Second-Order Global Attention Networks for Graph Classification and Regression 会议论文 , Beijing, China, August 27-28, 2022 作者: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:174/67  |  提交时间:2023/07/06 |