| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs. Conference paper, Australia, 2023-6. Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF(7948Kb)  |  Views/Downloads: 19/7  |  Submitted: 2024/06/25  |  Keywords: Reinforcement Learning, Hierarchical Reinforcement Learning |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning. Journal article, Machine Intelligence Research, 2023, page 158. Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF(9639Kb)  |  Views/Downloads: 14/7  |  Submitted: 2024/06/25 |
| A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning. Conference paper, Padua, Italy, 2022-07. Authors: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Wanmai Yuan
Adobe PDF(2650Kb)  |  Views/Downloads: 32/11  |  Submitted: 2024/06/05 |
| Continuous Exploration via Multiple Perspectives in Sparse Reward Environment. Conference paper, Xiamen International Conference Center, 2023-10-13. Authors: Chen ZP(陈忠鹏); Guan Q(关强)
Adobe PDF(2260Kb)  |  Views/Downloads: 29/9  |  Submitted: 2024/06/04  |  Keywords: Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation |
| Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. Conference paper, Beijing Huateng Meiju Hotel (北京华腾美居酒店), 2023-12-9. Authors: Zhourui Guo; Meng Yao; Yang Yu; Qiyue Yin
Adobe PDF(2302Kb)  |  Views/Downloads: 23/8  |  Submitted: 2024/06/03 |
| Learning Individual Difference Rewards in Multi-Agent Reinforcement Learning. Conference paper, London, United Kingdom, 2023-5. Authors: Yang, Chen; Yang, Guangkai; Zhang, Junge
Adobe PDF(2419Kb)  |  Views/Downloads: 39/15  |  Submitted: 2024/05/29 |
| Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning. Conference paper, Queensland, Australia, 2023-6. Authors: Yang, Chen; Yang, Guangkai; Chen, Hao; Zhang, Junge
Adobe PDF(3027Kb)  |  Views/Downloads: 46/19  |  Submitted: 2024/05/29 |
| Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL. Conference paper, New Orleans, LA, USA, December 10-16, 2023. Authors: Zhiwei Xu; Bin Zhang; Dapeng Li; Guangchong Zhou; Zeren Zhang; Guoliang Fan
Adobe PDF(8700Kb)  |  Views/Downloads: 41/12  |  Submitted: 2024/05/28 |
| HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism. Conference paper, Washington, DC, USA, February 7-14, 2023. Authors: Zhiwei Xu; Yunpeng Bai; Bin Zhang; Dapeng Li; Guoliang Fan
Adobe PDF(3345Kb)  |  Views/Downloads: 29/7  |  Submitted: 2024/05/28 |
| Potential Driven Reinforcement Learning for Hard Exploration Tasks. Conference paper, online, 2020-4. Authors: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  Views/Downloads: 115/44  |  Submitted: 2023/06/29 |