已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:41/16  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:23/11  |  提交时间:2024/06/25 |
| Global and local multi-modal feature mutual learning for retinal vessel segmentation 期刊论文 Pattern Recognition, 2024, 卷号: 151, 页码: 110376 作者: Xin Zhao; Zhang Jing; Qiaozhe Li; Tengfei Zhao; Yi Li; Zifeng Wu Adobe PDF(4182Kb)  |  收藏  |  浏览/下载:41/16  |  提交时间:2024/06/21 Mutual learning Multi-modal learning OCTA images Retinal vessel segmentation |
| Continuous Exploration via Multiple Perspectives in Sparse Reward Environment 会议论文 , 厦门国际会议中心, 2023-10-13 作者: Chen ZP(陈忠鹏); Guan Q(关强) Adobe PDF(2260Kb)  |  收藏  |  浏览/下载:40/12  |  提交时间:2024/06/04 Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation |
| 稀疏奖励环境下基于自博弈框架的智能空战算法研究 学位论文 , 2024 作者: 何少钦 Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:55/1  |  提交时间:2024/05/30 强化学习,离线强化学习,空战,智能决策,好奇心机制 |
| Explanation Guided Knowledge Distillation for Pre-trained Language Model Compression 期刊论文 ACM Transactions on Asian and Low-Resource Language Information Processing, 2024, 卷号: 23, 期号: 2, 页码: 1-19 作者: Zhao Yang; Yuanzhe Zhang; Dianbo Sui; Yiming Ju; Jun Zhao; Kang Liu Adobe PDF(1250Kb)  |  收藏  |  浏览/下载:60/21  |  提交时间:2024/05/30 Explanation knowledge distillation model compression |
| Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文 , New Orleans, LA, USA, December 10-16, 2023 作者: Zhiwei Xu; Bin Zhang; Dapeng Li; Guangchong Zhou; Zeren Zhang; Guoliang Fan Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:55/15  |  提交时间:2024/05/28 |
| Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文 , New Orleans, LA, USA,, November 28 - December 9, 2022 作者: Zhiwei Xu; Dapeng Li; Bin Zhang; Yuan Zhan; Yunpeng Bai; Guoliang Fan Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:39/8  |  提交时间:2024/05/28 |
| SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文 , Auckland, New Zealand, May 9-13, 2022 作者: Zhiwei Xu; Yunpeng Bai; Dapeng Li; Bin Zhang; Guoliang Fan Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:41/8  |  提交时间:2024/05/28 |
| Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文 , Washington DC, USA, 2023-2-7 作者: Pei Xu; Junge Zhang; Qiyue Yin; Chao Yu; Yaodong Yang; Kaiqi Huang Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:261/79  |  提交时间:2023/06/19 deep reinforcement learning sparse reward exploration multi-agent |