已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:19/7  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang ; Zhang Hongming; Xing Dengpeng ; Bo Xu![](/image/person.jpg)
Adobe PDF(9639Kb)  |   收藏  |  浏览/下载:13/6  |  提交时间:2024/06/25 |
| Generative Calibration for In-context Learning 会议论文 , Singapore, 2023-10-6 作者: Zhongtao Jiang ; Yuanzhe Zhang ; Cao Liu ; Jun Zhao ; Kang Liu
Adobe PDF(763Kb)  |   收藏  |  浏览/下载:28/10  |  提交时间:2024/06/06 |
| Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文 , Gold Coast, Australia, 2023-6 作者: Qingxu Fu ; Tenghai Qiu ; Zhiqiang Pu ; Jianqiang Yi ; Xiaolin Ai ; Wanmai Yuan
Adobe PDF(25675Kb)  |   收藏  |  浏览/下载:34/5  |  提交时间:2024/06/05 |
| Continuous Exploration via Multiple Perspectives in Sparse Reward Environment 会议论文 , 厦门国际会议中心, 2023-10-13 作者: Chen ZP(陈忠鹏) ; Guan Q(关强)![](/image/person.jpg)
Adobe PDF(2260Kb)  |   收藏  |  浏览/下载:29/9  |  提交时间:2024/06/04 Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation |
| Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文 Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8 作者: He SQ(何少钦) ; Gao Y(高阳) ; Zhang BF(张保丰); Chang H(常惠) ; Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |   收藏  |  浏览/下载:45/14  |  提交时间:2024/05/31 Air Combat, Reinforcement Learning, Neural Fictitious Self-Play. |
| Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文 , New Orleans, LA, USA, December 10-16, 2023 作者: Zhiwei Xu ; Bin Zhang; Dapeng Li ; Guangchong Zhou; Zeren Zhang; Guoliang Fan![](/image/person.jpg)
Adobe PDF(8700Kb)  |   收藏  |  浏览/下载:41/12  |  提交时间:2024/05/28 |
| Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文 , Washington, DC, USA, February 7-14, 2023 作者: Zhiwei Xu ; Bin Zhang; Dapeng Li ; Zeren Zhang; Guangchong Zhou; Hao Chen ; Guoliang Fan![](/image/person.jpg)
Adobe PDF(4141Kb)  |   收藏  |  浏览/下载:36/13  |  提交时间:2024/05/28 |
| Large sequence models for sequential decision-making: a survey 期刊论文 FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18 作者: Wen, Muning; Lin, Runji ; Wang, Hanjing; Yang, Yaodong; Wen, Ying; Mai, Luo; Wang, Jun; Zhang, Haifeng; Zhang, Weinan
Adobe PDF(1351Kb)  |   收藏  |  浏览/下载:144/4  |  提交时间:2023/11/17 sequential decision-making sequence modeling the Transformer training system |
| 无权访问的条目 学位论文 作者: 樊晨晨![](/image/person.jpg)
Adobe PDF(23181Kb)  |   收藏  |  浏览/下载:18/11  |  提交时间:2023/06/15 |