Knowledge Commons of Institute of Automation,CAS
Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification | |
Chenghao Liu; Fei Zhu; Quan Liu; Yuchen Fu | |
发表期刊 | IEEE/CAA Journal of Automatica Sinica |
ISSN | 2329-9266 |
2021 | |
卷号 | 8期号:10页码:1686-1696 |
摘要 | In reinforcement learning an agent may explore ineffectively when dealing with sparse reward tasks where finding a reward point is difficult. To solve the problem, we propose an algorithm called hierarchical deep reinforcement learning with automatic sub-goal identification via computer vision (HADS) which takes advantage of hierarchical reinforcement learning to alleviate the sparse reward problem and improve efficiency of exploration by utilizing a sub-goal mechanism. HADS uses a computer vision method to identify sub-goals automatically for hierarchical deep reinforcement learning. Due to the fact that not all sub-goal points are reachable, a mechanism is proposed to remove unreachable sub-goal points so as to further improve the performance of the algorithm. HADS involves contour recognition to identify sub-goals from the state image where some salient states in the state image may be recognized as sub-goals, while those that are not will be removed based on prior knowledge. Our experiments verified the effect of the algorithm. |
关键词 | Hierarchical control hierarchical reinforcement learning option sparse reward sub-goal |
DOI | 10.1109/JAS.2021.1004141 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/45382 |
专题 | 学术期刊_IEEE/CAA Journal of Automatica Sinica |
推荐引用方式 GB/T 7714 | Chenghao Liu,Fei Zhu,Quan Liu,et al. Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification[J]. IEEE/CAA Journal of Automatica Sinica,2021,8(10):1686-1696. |
APA | Chenghao Liu,Fei Zhu,Quan Liu,&Yuchen Fu.(2021).Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification.IEEE/CAA Journal of Automatica Sinica,8(10),1686-1696. |
MLA | Chenghao Liu,et al."Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification".IEEE/CAA Journal of Automatica Sinica 8.10(2021):1686-1696. |
条目包含的文件 | 下载所有文件 | |||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
JAS-2020-0603.pdf(5095KB) | 期刊论文 | 出版稿 | 开放获取 | CC BY-NC-SA | 浏览 下载 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论