Time-sequence Action-Decision and Navigation Through Stage Deep Reinforcement Learning in Complex Dynamic Environments
Huimu, Wang1,2; Tenghai, Qiu2; Zhen, Liu2; Zhiqiang, Pu1,2; Jianqiang, Yi1,2; Zhaoyang, Liu3
2019
Conference name: 2019 IEEE Symposium Series on Computational Intelligence
Conference date: 2019.12
Conference location: Xiamen
Abstract

Navigation in a complex dynamic environment is one of the most attractive tasks. Although most existing navigation algorithms can complete navigation tasks effectively, they ignore the need for mission planning during navigation. To address this, this paper proposes a novel end-to-end two-stage deep reinforcement learning architecture for time-sequence navigation and action-decision in a dynamic environment with randomly and rapidly moving obstacles. During the first-stage training, a network that uses spatial and temporal information is designed to handle the navigation task, while a conventional recurrent fully connected network is adopted to handle the action-decision task. During the second-stage training, the two networks are integrated and trained online with dynamic entropy to obtain a stable policy for dynamic missions. Simulations demonstrate that navigation and action-decision in different environments can be completed effectively under the proposed architecture.
 

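Indexed by: EI
Language: English
Seven major directions, sub-direction classification: Reinforcement and evolutionary learning
Document type: Conference paper
Item identifier: http://ir.ia.ac.cn/handle/173211/44953
Collection: Laboratory of Complex System Cognition and Decision-Making_Aircraft Intelligent Technology
Corresponding author: Tenghai, Qiu
Author affiliations:
1. School of Artificial Intelligence, University of Chinese Academy of Sciences
2. Institute of Automation, Chinese Academy of Sciences
3. Department of Automation, Tsinghua University
First author affiliation: Institute of Automation, Chinese Academy of Sciences
Corresponding author affiliation: Institute of Automation, Chinese Academy of Sciences
Recommended citation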
GB/T 7714
Huimu, Wang,Tenghai, Qiu,Zhen, Liu,et al. Time-sequence Action-Decision and Navigation Through Stage Deep Reinforcement Learning in Complex Dynamic Environments[C],2019.
Files in this item
File name/size: Time-sequence Action(2178KB)
Document type: Conference paper
Access type: Open access
License: CC BY-NC-SA
File name: Time-sequence Action-Decision and Navigation.pdf
Format: Adobe PDF
 
