已选(0)清除
条数/页: 排序方式: |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博) Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:140/34  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Counterfactual Debiasing for Fact Verification 会议论文 , Toronto, Canada, 7.9-7.14, 2023 作者: Xu WZ(许伟志); Liu Q(刘强); Wu S(吴书); Wang L(王亮) Adobe PDF(1287Kb)  |  收藏  |  浏览/下载:143/42  |  提交时间:2023/06/26 |
| Pseudo Value Network Distillation for High-Performance Exploration 会议论文 , 澳大利亚, 2023-06 作者: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品) Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:120/37  |  提交时间:2023/06/28 |
| Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文 , Washington DC, USA, 2023-2-7 作者: Pei Xu; Junge Zhang; Qiyue Yin; Chao Yu; Yaodong Yang; Kaiqi Huang Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:176/59  |  提交时间:2023/06/19 deep reinforcement learning sparse reward exploration multi-agent |
| Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information 会议论文 , Jinghong, China, 05-09 December 2022 作者: Junhang Wei; Shaowei Cui; Peng Hao; Shuo Wang Adobe PDF(933Kb)  |  收藏  |  浏览/下载:130/50  |  提交时间:2023/10/25 |
| Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning 会议论文 , 中国桂林, 2022-7-9 作者: Shan QF(单钦锋); Wang WJ(王伟杰); Guo DF(郭丁飞); Sun XR(孙向荣); Jia LH(贾立好) Adobe PDF(494Kb)  |  收藏  |  浏览/下载:98/28  |  提交时间:2023/06/05 Deep learning Mechatronics Navigation Reinforcement learning Cost function Real-time systems Trajectory |
| Learning Hierarchical Graph Convolutional Neural Network for Object Navigation 会议论文 , 西英格兰大学计算机科学与创新技术系, 2022年9月6日-2022年9月9日 作者: Tao Xu; Xu Yang; Suiwu Zheng Adobe PDF(1220Kb)  |  收藏  |  浏览/下载:129/45  |  提交时间:2023/05/31 |
| LEARN EFFECTIVE REPRESENTATION FOR DEEP REINFORCEMENT LEARNING 会议论文 , Taipei, Taiwan, 26 August 2022 作者: Zhan Yuan; Xu Zhiwei; Fan Guoliang Adobe PDF(2093Kb)  |  收藏  |  浏览/下载:114/42  |  提交时间:2023/06/08 |
| POPO: Pessimistic Offline Policy Optimization 会议论文 , Singapore, Singapore, 23-27 May 2022 作者: He Q(何强); Hou XW(侯新文); Liu Y(刘禹) Adobe PDF(1200Kb)  |  收藏  |  浏览/下载:165/34  |  提交时间:2022/06/27 reinforcement learning offline optimization out-of-distribution |
| Intrinsic Reward with Peer Incentives for Cooperative Multi-Agent Reinforcement Learning 会议论文 , Online, 18-23 July 2022 作者: Zhang TL(张天乐); Liu Z(刘振); Wu SG(吴士广); Pu ZQ(蒲志强); Yi JQ(易建强) Adobe PDF(2189Kb)  |  收藏  |  浏览/下载:140/39  |  提交时间:2023/06/12 |