| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs. Conference paper, Australia, 2023-6. Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF(7948Kb) | Views/Downloads: 35/14 | Submitted: 2024/06/25 | Keywords: Reinforcement Learning, Hierarchical Reinforcement Learning |
| Token-level Direct Preference Optimization. Conference paper, Vienna, Austria, 2024/7/21-27. Authors: Zeng, Yongcheng; Liu, Guoqing; Ma, Weiyu; Yang, Ning; Zhang, Haifeng; Wang, Jun
Adobe PDF(883Kb) | Views/Downloads: 66/22 | Submitted: 2024/06/05 |
| Learning Heterogeneous Agent Cooperation via Multiagent League Training. Journal article, IFAC World Congress, 2023, Pages: IFAC PapersOnLine 56-2 (2023) 3033-3040. Authors: Qingxu, Fu; Xiaolin Ai; Jianqiang Yi; Tenghai Qiu; Wanmai Yuan; Zhiqiang Pu
Adobe PDF(996Kb) | Views/Downloads: 41/12 | Submitted: 2024/06/05 |
| Continuous Exploration via Multiple Perspectives in Sparse Reward Environment. Conference paper, Xiamen International Conference Center, 2023-10-13. Authors: Chen ZP(陈忠鹏); Guan Q(关强)
Adobe PDF(2260Kb) | Views/Downloads: 36/11 | Submitted: 2024/06/04 | Keywords: Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation |
| Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning. Conference paper, Advanced Intelligent Computing Technology and Applications, Zhengzhou, China, 2023-8. Authors: He SQ(何少钦); Gao Y(高阳); Zhang BF(张保丰); Chang H(常惠); Zhang XC(张鑫辰)
Adobe PDF(1496Kb) | Views/Downloads: 60/21 | Submitted: 2024/05/31 | Keywords: Air Combat, Reinforcement Learning, Neural Fictitious Self-Play |
| Consensus Learning for Cooperative Multi-Agent Reinforcement Learning. Conference paper, Washington, DC, USA, February 7-14, 2023. Authors: Zhiwei Xu; Bin Zhang; Dapeng Li; Zeren Zhang; Guangchong Zhou; Hao Chen; Guoliang Fan
Adobe PDF(4141Kb) | Views/Downloads: 46/19 | Submitted: 2024/05/28 |
| SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning. Conference paper, Auckland, New Zealand, May 9-13, 2022. Authors: Zhiwei Xu; Yunpeng Bai; Dapeng Li; Bin Zhang; Guoliang Fan
Adobe PDF(2965Kb) | Views/Downloads: 37/7 | Submitted: 2024/05/28 |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction. Conference paper, Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14. Authors: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博)
Adobe PDF(1663Kb) | Views/Downloads: 206/47 | Submitted: 2023/07/05 | Keywords: Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning. Conference paper, Online, 2022-02-22. Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(2593Kb) | Views/Downloads: 211/78 | Submitted: 2023/06/29 |
| Multi-Granularity Pruning for Model Acceleration on Mobile Devices. Conference paper, Online, 2022-07. Authors: Zhao TL(赵天理); Zhang X(张希); Zhu WT(朱文涛); Wang JX(王家兴); Yang S(杨森); Liu J(刘季); Cheng J(程健)
Adobe PDF(1919Kb) | Views/Downloads: 148/60 | Submitted: 2023/06/21 | Keywords: Deep Neural Networks, Network Pruning, Structured Pruning, Non-structured Pruning, Single Instruction Multiple Data |