已选(0)清除
条数/页: 排序方式: |
| Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2023, 卷号: 15, 期号: 3, 页码: 1463 - 1473 作者: Liu MS(刘民颂); Li LT(李伦通); Hao S(郝帅); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(4197Kb)  |  收藏  |  浏览/下载:8/1  |  提交时间:2024/06/24 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文 , Greece, 2023-5 作者: Xu YF(徐一凡); Pu ZQ(蒲志强); Cai QA(蔡奇昂); Li FM(李非墨); Chai XH(柴兴华) Adobe PDF(7610Kb)  |  收藏  |  浏览/下载:5/3  |  提交时间:2024/06/21 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:12/3  |  提交时间:2024/06/11 |
| Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文 , Gold Coast, Australia, 2023-6 作者: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Xiaolin Ai; Wanmai Yuan Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:21/3  |  提交时间:2024/06/05 |
| Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文 IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040 作者: Qingxu, Fu; Xiaolin Ai; Jianqiang Yi; Tenghai Qiu; Wanmai Yuan; Zhiqiang Pu Adobe PDF(996Kb)  |  收藏  |  浏览/下载:20/5  |  提交时间:2024/06/05 |
| Centralized Cooperative Exploration Policy for Continuous Control Tasks 会议论文 Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023 作者: Chao Li; Chen Gong; Qiang He; Xinwen Hou; Yu Liu Adobe PDF(2175Kb)  |  收藏  |  浏览/下载:25/9  |  提交时间:2024/05/30 continuous control tasks cooperative exploration |
| Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control 会议论文 Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10 作者: Chao Li; Chen Gong; Qiang He; Xinwen Hou Adobe PDF(1457Kb)  |  收藏  |  浏览/下载:24/7  |  提交时间:2024/05/30 |
| Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文 Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078 作者: Qiu JY(邱俊彦); Haidong Zhang; Yiping Yang Adobe PDF(831Kb)  |  收藏  |  浏览/下载:26/9  |  提交时间:2024/05/29 reinforcement learning dialogue policy learning curriculum learning knowledge distillation |
| Learning Individual Difference Rewards in Multi-Agent Reinforcement Learning 会议论文 , London, United Kingdom, 2023-5 作者: Yang, Chen; Yang, Guangkai; Zhang, Junge Adobe PDF(2419Kb)  |  收藏  |  浏览/下载:28/9  |  提交时间:2024/05/29 |
| Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文 , Queensland, Australia, 2023-6 作者: Yang, Chen; Yang, Guangkai; Chen, Hao; Zhang, Junge Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:35/14  |  提交时间:2024/05/29 |