已选(0)清除
条数/页: 排序方式: |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博) Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:161/37  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文 , Washington DC, USA, 2023-2-7 作者: Pei Xu; Junge Zhang; Qiyue Yin; Chao Yu; Yaodong Yang; Kaiqi Huang Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:193/60  |  提交时间:2023/06/19 deep reinforcement learning sparse reward exploration multi-agent |
| Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems 会议论文 , Queensland, Australia, June 18-23, 2023 作者: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强); Ai XL(艾晓琳); Yuan GM(袁莞迈) Adobe PDF(612Kb)  |  收藏  |  浏览/下载:138/45  |  提交时间:2023/06/12 |
| Deep Contrastive Multiview Network Embedding 会议论文 Proceedings of the 31st ACM International Conference on Information and Knowledge Management, New York, NY, USA, 2022-10-17 作者: Mengqi Zhang; Yanqiao Zhu; Qiang Liu; Shu Wu; Liang Wang Adobe PDF(1307Kb)  |  收藏  |  浏览/下载:108/33  |  提交时间:2023/07/03 |
| POPO: Pessimistic Offline Policy Optimization 会议论文 , Singapore, Singapore, 23-27 May 2022 作者: He Q(何强); Hou XW(侯新文); Liu Y(刘禹) Adobe PDF(1200Kb)  |  收藏  |  浏览/下载:176/36  |  提交时间:2022/06/27 reinforcement learning offline optimization out-of-distribution |
| Intrinsic Reward with Peer Incentives for Cooperative Multi-Agent Reinforcement Learning 会议论文 , Online, 18-23 July 2022 作者: Zhang TL(张天乐); Liu Z(刘振); Wu SG(吴士广); Pu ZQ(蒲志强); Yi JQ(易建强) Adobe PDF(2189Kb)  |  收藏  |  浏览/下载:160/42  |  提交时间:2023/06/12 |
| Multi-UAV Cooperative Short-Range Combat via Attention-Based Reinforcement Learning using Individual Reward Shaping 会议论文 , Kyoto, Japan, October 23-27, 2022 作者: Zhang TL(张天乐); Qiu TH(丘腾海); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强) Adobe PDF(896Kb)  |  收藏  |  浏览/下载:113/40  |  提交时间:2023/06/12 |
| Multi-Target Encirclement with Collision Avoidance via Deep Reinforcement Learning using Relational Graphs 会议论文 , Philadelphia, PA, USA, May 23-27, 2022 作者: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强) Adobe PDF(4277Kb)  |  收藏  |  浏览/下载:116/30  |  提交时间:2023/06/12 |
| DIMSAN: Fast Exploration with the Synergy between Density-based Intrinsic Motivation and Self-adaptive Action Noise 会议论文 , 西安, 2021.5.30-2021.6.5 作者: Li, Jiayi; Li, Boyao; Lu, Tao; Lu, Ning; Cai, Yinghao; Wang, Shuo Adobe PDF(5599Kb)  |  收藏  |  浏览/下载:179/34  |  提交时间:2022/06/14 |
| Multi-agent Collaborative Learning with Relational Graph Reasoning in Adversarial Environments 会议论文 , 线上会议, 2021-9 作者: Wu Shiguang; Qiu Tenghai; Pu Zhiqiang; Yi Jianqiang Adobe PDF(1396Kb)  |  收藏  |  浏览/下载:216/63  |  提交时间:2022/06/16 |