已选(0)清除
条数/页: 排序方式: |
| Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文 , Washington, DC, USA, February 7-14, 2023 作者: Zhiwei Xu ; Bin Zhang; Dapeng Li ; Zeren Zhang; Guangchong Zhou; Hao Chen ; Guoliang Fan![](/image/person.jpg)
Adobe PDF(4141Kb)  |   收藏  |  浏览/下载:21/7  |  提交时间:2024/05/28 |
| Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文 , New Orleans, LA, USA,, November 28 - December 9, 2022 作者: Zhiwei Xu ; Dapeng Li ; Bin Zhang; Yuan Zhan ; Yunpeng Bai ; Guoliang Fan![](/image/person.jpg)
Adobe PDF(4367Kb)  |   收藏  |  浏览/下载:21/4  |  提交时间:2024/05/28 |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕) ; Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨) ; Wang YN(王燕娜) ; Xu B(徐博)![](/image/person.jpg)
Adobe PDF(1663Kb)  |   收藏  |  浏览/下载:188/42  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Potential Driven Reinforcement Learning for Hard Exploration Tasks 会议论文 , 线上, 2020-4 作者: Zhao EM(赵恩民) ; Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣) ; Li K(李凯) ; Xing JL(兴军亮)![](/image/person.jpg)
Adobe PDF(1999Kb)  |   收藏  |  浏览/下载:105/41  |  提交时间:2023/06/29 |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文 , 线上, 2022-02-22 作者: Zhao EM(赵恩民) ; Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯) ; Xing JL(兴军亮)![](/image/person.jpg)
Adobe PDF(2593Kb)  |   收藏  |  浏览/下载:183/70  |  提交时间:2023/06/29 |
| Pseudo Value Network Distillation for High-Performance Exploration 会议论文 , 澳大利亚, 2023-06 作者: Zhao EM(赵恩民) ; Xing JL(兴军亮) ; Li K(李凯) ; Kang YX(康永欣) ; Tao P(陶品)
Adobe PDF(5874Kb)  |   收藏  |  浏览/下载:149/43  |  提交时间:2023/06/28 |
| Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文 , 线上, 2021-02 作者: Zhao EM(赵恩民) ; Yan RY(闫仁业); Li K(李凯) ; Li LJ(李丽娟) ; Xing JL(兴军亮)![](/image/person.jpg)
Adobe PDF(413Kb)  |   收藏  |  浏览/下载:159/57  |  提交时间:2023/06/28 |
| Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network 会议论文 , Online, June 6–11, 2021 作者: Wu HR(吴浩然) ; Chen W(陈炜) ; Xu S(徐爽) ; Xu B(徐波)![](/image/person.jpg)
Adobe PDF(1394Kb)  |   收藏  |  浏览/下载:185/61  |  提交时间:2023/06/26 |
| Multi-Granularity Pruning for Model Acceleration on Mobile Devices 会议论文 , 线上, 2022-07 作者: Zhao TL(赵天理) ; Zhang X(张希); Zhu WT(朱文涛); Wang JX(王家兴) ; Yang S(杨森); Liu J(刘季); Cheng J(程健)![](/image/person.jpg)
Adobe PDF(1919Kb)  |   收藏  |  浏览/下载:122/51  |  提交时间:2023/06/21 Deep Neural Networks Network Pruning Structured Pruning Non-structured Pruning Single Instruction Multiple Data |
| Semantic Perception Swarm Policy with Deep Reinforcement Learning 会议论文 , Online, 05 December 2021 作者: Zhang TL(张天乐) ; Liu Z(刘振) ; Pu ZQ(蒲志强) ; Yi JQ(易建强)![](/image/person.jpg)
Adobe PDF(523Kb)  |   收藏  |  浏览/下载:128/51  |  提交时间:2023/06/12 |