Convolutional fitted Q iteration for vision-based control problems
Zhao Dongbin; Zhu Yuanheng; Lv Le; Chen Yaran; Zhang Qichao
2016-11
Conference Name: The 2016 International Joint Conference on Neural Networks
Conference Date: 24-29 July 2016
Conference Place: Vancouver, BC, Canada
Abstract: This paper proposes a deep reinforcement learning (DRL) method for control problems that take raw image pixels as input states. A convolutional neural network (CNN), termed Q-CNN, is used to approximate the Q function. The parameters of Q-CNN are initialized from a pretrained network obtained from a classification challenge on a vast set of natural images. This initialization gives Q-CNN image-representation features from the start, so training can concentrate on the control task. The weights are then tuned by fitted Q iteration (FQI), an offline reinforcement learning method with a stable convergence property. To demonstrate performance, a modified Food-Poison problem is simulated, in which the agent determines its movements based on its forward view. The algorithm learns a satisfactory policy that outperforms the results of previous research.
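The fitted Q iteration loop mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' Q-CNN: the convolutional network is swapped for a plain lookup-table regressor, and the five-state chain environment, the discount factor, and every function name below are assumptions made purely for illustration. What the sketch keeps is the FQI structure itself: a fixed offline batch of transitions, regression targets built from the previous Q estimate, and a supervised fitting step repeated for a number of iterations.

```python
from collections import defaultdict

GAMMA = 0.9           # assumed discount factor
N_STATES = 5          # hypothetical chain: states 0..4, state 4 is the goal
ACTIONS = (0, 1)      # 0 = move left, 1 = move right


def step(state, action):
    """Deterministic toy dynamics; returns (reward, next_state, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return (1.0 if done else 0.0), next_state, done


def collect_batch():
    """Fixed offline batch: every non-terminal state paired with every action."""
    batch = []
    for s in range(N_STATES - 1):
        for a in ACTIONS:
            r, s2, done = step(s, a)
            batch.append((s, a, r, s2, done))
    return batch


def fit_table(inputs, targets):
    """Stand-in regressor: average the targets seen for each (state, action)."""
    sums, counts = defaultdict(float), defaultdict(int)
    for key, y in zip(inputs, targets):
        sums[key] += y
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}


def fitted_q_iteration(batch, n_iterations=20):
    q = {}  # Q_0 is zero everywhere (missing keys read as 0.0)
    for _ in range(n_iterations):
        inputs, targets = [], []
        for s, a, r, s2, done in batch:
            # Bootstrapped target from the previous iteration's Q estimate.
            best_next = 0.0 if done else max(q.get((s2, b), 0.0) for b in ACTIONS)
            inputs.append((s, a))
            targets.append(r + GAMMA * best_next)
        q = fit_table(inputs, targets)  # supervised regression step
    return q


def greedy_policy(q):
    """Pick the highest-valued action in each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q.get((s, a), 0.0))
            for s in range(N_STATES - 1)]


q_values = fitted_q_iteration(collect_batch())
policy = greedy_policy(q_values)
```

On this toy chain the greedy policy moves right in every state. In a vision-based setting, `fit_table` would be replaced by training a function approximator such as a CNN on the same (input, target) pairs.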
DOI: 10.1109/IJCNN.2016.7727794
Document Type: Conference paper
Identifier: http://ir.ia.ac.cn/handle/173211/14476
Collection: State Key Laboratory of Management and Control for Complex Systems_Deep Reinforcement Learning
Affiliation: The State Key Laboratory of Management and Control for Complex Systems, Institution of Automation, Chinese Academy of Sciences, Beijing 100190, China.
Recommended Citation
GB/T 7714
Zhao Dongbin, Zhu Yuanheng, Lv Le, et al. Convolutional fitted Q iteration for vision-based control problems[C], 2016.
Files in This Item:
File Name/Size: 07727794.pdf (240KB)
DocType: Conference paper
Access: Open access
License: CC BY-NC-SA
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.