CASIA OpenIR  > 复杂系统管理与控制国家重点实验室  > 深度强化学习
Comprehensive comparison of online ADP algorithms for continuous-time optimal control
Zhu, Yuanheng1,2; Zhao, Dongbin1,2
Source PublicationARTIFICIAL INTELLIGENCE REVIEW
2018-04-01
Volume49Issue:4Pages:531-547
SubtypeArticle
AbstractOnline learning is an important property of adaptive dynamic programming (ADP). Online observations contain plentiful dynamics information, and ADP algorithms can utilize them to learn the optimal control policy. This paper reviews the research of online ADP algorithms for the optimal control of continuous-time systems. With the intensive study, ADP has been developed towards model free and data efficient. After separately introducing the algorithms, we compare their performance on the same problem. This paper is desired to provide a comprehensive understanding of continuous-time online ADP algorithms.
KeywordAdaptive Dynamic Programming Policy Iteration Integral Reinforcement Learning Experience Replay Off-policy
WOS HeadingsScience & Technology ; Technology
DOI10.1007/s10462-017-9548-4
WOS KeywordNONLINEAR-SYSTEMS ; EXPERIENCE REPLAY
Indexed BySCI
Language英语
Funding OrganizationNational Natural Science Foundation of China(61533017 ; Early Career Development Award of SKLMCCS ; 61573353 ; 61603382)
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence
WOS IDWOS:000426912500004
Citation statistics
Cited Times:5[WOS]   [WOS Record]     [Related Records in WOS]
Document Type期刊论文
Identifierhttp://ir.ia.ac.cn/handle/173211/15287
Collection复杂系统管理与控制国家重点实验室_深度强化学习
Affiliation1.Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Beijing, Peoples R China
Recommended Citation
GB/T 7714
Zhu, Yuanheng,Zhao, Dongbin. Comprehensive comparison of online ADP algorithms for continuous-time optimal control[J]. ARTIFICIAL INTELLIGENCE REVIEW,2018,49(4):531-547.
APA Zhu, Yuanheng,&Zhao, Dongbin.(2018).Comprehensive comparison of online ADP algorithms for continuous-time optimal control.ARTIFICIAL INTELLIGENCE REVIEW,49(4),531-547.
MLA Zhu, Yuanheng,et al."Comprehensive comparison of online ADP algorithms for continuous-time optimal control".ARTIFICIAL INTELLIGENCE REVIEW 49.4(2018):531-547.
Files in This Item: Download All
File Name/Size DocType Version Access License
art%3A10.1007%2Fs104(766KB)期刊论文作者接受稿开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Zhu, Yuanheng]'s Articles
[Zhao, Dongbin]'s Articles
Baidu academic
Similar articles in Baidu academic
[Zhu, Yuanheng]'s Articles
[Zhao, Dongbin]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Zhu, Yuanheng]'s Articles
[Zhao, Dongbin]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: art%3A10.1007%2Fs10462-017-9548-4.pdf
Format: Adobe PDF
This file does not support browsing at this time
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.