已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:19/7  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Learning in bi-level markov games 会议论文 , Padua, Italy, 2022.7.18-2022.7.23 作者: Meng Linghui ; Ruan Jingqing; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1450Kb)  |   收藏  |  浏览/下载:35/12  |  提交时间:2024/06/11 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui ; Ruan Jingqing; Xiong Xuantang; Li Xiyun ; Zhang Xi; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:23/5  |  提交时间:2024/06/11 |
| A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文 , Padua, Italy, 2022年07月 作者: Qingxu Fu ; Tenghai Qiu ; Zhiqiang Pu ; Jianqiang Yi ; Wanmai Yuan
Adobe PDF(2650Kb)  |   收藏  |  浏览/下载:31/11  |  提交时间:2024/06/05 |
| Continuous Exploration via Multiple Perspectives in Sparse Reward Environment 会议论文 , 厦门国际会议中心, 2023-10-13 作者: Chen ZP(陈忠鹏) ; Guan Q(关强)![](/image/person.jpg)
Adobe PDF(2260Kb)  |   收藏  |  浏览/下载:29/9  |  提交时间:2024/06/04 Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation |
| Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文 , 北京华腾美居酒店, 2023-12-9 作者: Zhourui Guo ; Meng Yao; Yang Yu ; Qiyue Yin![](/image/person.jpg)
Adobe PDF(2302Kb)  |   收藏  |  浏览/下载:23/8  |  提交时间:2024/06/03 |
| Learning Individual Difference Rewards in Multi-Agent Reinforcement Learning 会议论文 , London, United Kingdom, 2023-5 作者: Yang, Chen ; Yang, Guangkai ; Zhang, Junge![](/image/person.jpg)
Adobe PDF(2419Kb)  |   收藏  |  浏览/下载:39/15  |  提交时间:2024/05/29 |
| Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文 , Queensland, Australia, 2023-6 作者: Yang, Chen ; Yang, Guangkai ; Chen, Hao ; Zhang, Junge![](/image/person.jpg)
Adobe PDF(3027Kb)  |   收藏  |  浏览/下载:46/19  |  提交时间:2024/05/29 |
| Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文 , New Orleans, LA, USA, December 10-16, 2023 作者: Zhiwei Xu ; Bin Zhang; Dapeng Li ; Guangchong Zhou; Zeren Zhang; Guoliang Fan![](/image/person.jpg)
Adobe PDF(8700Kb)  |   收藏  |  浏览/下载:41/12  |  提交时间:2024/05/28 |
| HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文 , Washington, DC, USA, February 7-14, 2023 作者: Zhiwei Xu ; Yunpeng Bai ; Bin Zhang; Dapeng Li ; Guoliang Fan![](/image/person.jpg)
Adobe PDF(3345Kb)  |   收藏  |  浏览/下载:29/7  |  提交时间:2024/05/28 |