已选(0)清除
条数/页: 排序方式: |
| Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文 , Gold Coast, Australia, 2023-6 作者: Qingxu Fu ; Tenghai Qiu ; Zhiqiang Pu ; Jianqiang Yi ; Xiaolin Ai ; Wanmai Yuan
Adobe PDF(25675Kb)  |   收藏  |  浏览/下载:13/1  |  提交时间:2024/06/05 |
| A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文 , Padua, Italy, 2022年07月 作者: Qingxu Fu ; Tenghai Qiu ; Zhiqiang Pu ; Jianqiang Yi ; Wanmai Yuan
Adobe PDF(2650Kb)  |   收藏  |  浏览/下载:10/2  |  提交时间:2024/06/05 |
| Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文 IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040 作者: Qingxu, Fu ; Xiaolin Ai ; Jianqiang Yi ; Tenghai Qiu ; Wanmai Yuan; Zhiqiang Pu![](/image/person.jpg)
Adobe PDF(996Kb)  |   收藏  |  浏览/下载:10/1  |  提交时间:2024/06/05 |
| Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems 会议论文 , online, 2022 作者: Qingxu Fu ; Tenghai Qiu ; Jianqiang Yi ; Zhiqiang Pu ; Shiguang Wu![](/image/person.jpg)
Adobe PDF(5807Kb)  |   收藏  |  浏览/下载:12/2  |  提交时间:2024/06/05 |
| Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文 , Queensland, Australia, 2023-6 作者: Yang, Chen ; Yang, Guangkai ; Chen, Hao ; Zhang, Junge![](/image/person.jpg)
Adobe PDF(3027Kb)  |   收藏  |  浏览/下载:23/8  |  提交时间:2024/05/29 |
| Cooperative Object Transportation for Second-order Multi-robot Systems in Dynamic Environment 会议论文 Proceedings of the 42nd Chinese Control Conference, 天津, 2023-7-24 作者: Cai, Qiang ; Ai, Xiaolin ; Liu, Tianqi; Pu, zhiqiang![](/image/person.jpg)
Adobe PDF(3418Kb)  |   收藏  |  浏览/下载:14/3  |  提交时间:2024/05/28 |
| SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文 , Auckland, New Zealand, May 9-13, 2022 作者: Zhiwei Xu ; Yunpeng Bai ; Dapeng Li ; Bin Zhang; Guoliang Fan![](/image/person.jpg)
Adobe PDF(2965Kb)  |   收藏  |  浏览/下载:12/2  |  提交时间:2024/05/28 |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕) ; Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨) ; Wang YN(王燕娜) ; Xu B(徐博)![](/image/person.jpg)
Adobe PDF(1663Kb)  |   收藏  |  浏览/下载:181/40  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文 , 线上, 2022-02-22 作者: Zhao EM(赵恩民) ; Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯) ; Xing JL(兴军亮)![](/image/person.jpg)
Adobe PDF(2593Kb)  |   收藏  |  浏览/下载:166/64  |  提交时间:2023/06/29 |
| VLP: A Survey on Vision-language Pre-training 期刊论文 Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56 作者: Feilong Chen ; Duzhen Zhang ; Minglun Han ; Xiuyi Chen ; Jing Shi ; Shuang Xu ; Bo Xu![](/image/person.jpg)
Adobe PDF(969Kb)  |   收藏  |  浏览/下载:151/29  |  提交时间:2023/06/21 |