| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction Conference paper, Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14 Authors: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博) Adobe PDF(1663Kb)  |  Views/Downloads: 155/35  |  Submitted: 2023/07/05 Keywords: Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
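| Weight-Adaptive Generative Adversarial Imitation Learning Based on Noise Contrastive Estimation (基于噪声对比估计的权重自适应对抗生成式模仿学习) Journal article, Pattern Recognition and Artificial Intelligence (模式识别与人工智能), 2023, Volume: 36, Issue: 4, Pages: 300-312 Authors: Guan Weifan (关伟凡); Zhang Xi (张希) Adobe PDF(1849Kb)  |  Views/Downloads: 117/38  |  Submitted: 2023/06/29 Keywords: reinforcement learning; imitation learning; noise contrastive estimation; adaptive weights |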
| Pseudo Value Network Distillation for High-Performance Exploration Conference paper, Australia, 2023-06 Authors: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品) Adobe PDF(5874Kb)  |  Views/Downloads: 126/37  |  Submitted: 2023/06/28 |
| Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks Conference paper, Washington DC, USA, 2023-2-7 Authors: Pei Xu; Junge Zhang; Qiyue Yin; Chao Yu; Yaodong Yang; Kaiqi Huang Adobe PDF(2037Kb)  |  Views/Downloads: 187/60  |  Submitted: 2023/06/19 Keywords: deep reinforcement learning sparse reward exploration multi-agent |
| Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems Conference paper, Queensland, Australia, June 18-23, 2023 Authors: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强); Ai XL(艾晓琳); Yuan GM(袁莞迈) Adobe PDF(612Kb)  |  Views/Downloads: 131/45  |  Submitted: 2023/06/12 |
| Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information Conference paper, Jinghong, China, 05-09 December 2022 Authors: Junhang Wei; Shaowei Cui; Peng Hao; Shuo Wang Adobe PDF(933Kb)  |  Views/Downloads: 136/51  |  Submitted: 2023/10/25 |
| Second-Order Global Attention Networks for Graph Classification and Regression Conference paper, Beijing, China, August 27-28, 2022 Authors: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu Adobe PDF(69424Kb)  |  Views/Downloads: 174/67  |  Submitted: 2023/07/06 |
| DPNAS: Neural Architecture Search for Deep Learning with Differential Privacy Conference paper, Online, 2022-2 Authors: Cheng AD(程安达); Wang JX(王家兴); Zhang X(张希); Chen Q(谌强); Wang PS(王培松); Cheng J(程健) Adobe PDF(1135Kb)  |  Views/Downloads: 86/21  |  Submitted: 2023/06/05 |
| POPO: Pessimistic Offline Policy Optimization Conference paper, Singapore, Singapore, 23-27 May 2022 Authors: He Q(何强); Hou XW(侯新文); Liu Y(刘禹) Adobe PDF(1200Kb)  |  Views/Downloads: 172/35  |  Submitted: 2022/06/27 Keywords: reinforcement learning offline optimization out-of-distribution |
| MTLDesc: Looking Wider to Describe Better Conference paper, Virtual, February 22-28, 2022 Authors: Changwei Wang; Rongtao Xu; Yuyang Zhang; Shibiao Xu; Weiliang Meng; Xiaopeng Zhang Adobe PDF(7473Kb)  |  Views/Downloads: 178/38  |  Submitted: 2022/04/06 |