已选(0)清除
条数/页: 排序方式: |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博) Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:169/38  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文 , 昆士兰, 2023-6 作者: Li WF(李伟凡); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:218/71  |  提交时间:2023/06/29 multi-agent reinforcement learning policy gradient |
| Pseudo Value Network Distillation for High-Performance Exploration 会议论文 , 澳大利亚, 2023-06 作者: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品) Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:136/39  |  提交时间:2023/06/28 |
| Curiosity-Driven and Victim-Aware Adversarial Policies 会议论文 , Austin TX, USA, December 5-9, 2022 作者: Gong C(龚晨); Yang Z(杨洲); Bai YP(白云鹏); Shi JK(史杰克); Sinha Arunesh; Xu BW(徐博文); Lo David; Hou XW(侯新文); Fan GL(范国梁) Adobe PDF(4090Kb)  |  收藏  |  浏览/下载:111/46  |  提交时间:2023/06/27 |