Knowledge Commons of Institute of Automation, CAS
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control | |
Zhu, Yuanheng (1,2) | |
Source Publication | IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
ISSN | 2168-2216 |
Publication Date | 2020-11-01 |
Volume | 50, Issue: 11, Pages: 3959-3971 |
Corresponding Author | He, Haibo(he@ele.uri.edu) |
Abstract | For systems that can only be locally stabilized, both the control law and the region in which it is effective are important. In this paper, invariant policy iteration is proposed to solve the optimal control problem for discrete-time systems. At each iteration, a given policy is evaluated in its invariantly admissible region, and a new policy and a new region are produced for the next iteration. Theoretical analysis shows that the method is regionally convergent to the optimal value and the optimal policy. Combined with sum-of-squares polynomials, the method achieves near-optimal control of a class of discrete-time systems. An invariant adaptive dynamic programming algorithm is developed to extend the method to scenarios where the system dynamics are not available. Online data are used to learn the near-optimal policy and the invariantly admissible region. Simulated experiments verify the effectiveness of the method. |
Keyword | Optimal control ; Discrete-time systems ; Heuristic algorithms ; Dynamic programming ; Convergence ; Artificial intelligence ; Nonlinear systems ; Adaptive dynamic programming ; discrete-time systems ; invariant admissibility ; optimal control ; policy iteration ; sum of squares |
DOI | 10.1109/TSMC.2019.2911900 |
WOS Keyword | NONLINEAR-SYSTEMS ; POLICY ITERATION ; STABILITY ; SUM |
Indexed By | SCI |
Language | English |
Funding Project | National Natural Science Foundation of China[61603382] |
Funding Organization | National Natural Science Foundation of China |
WOS Research Area | Automation & Control Systems ; Computer Science |
WOS Subject | Automation & Control Systems ; Computer Science, Cybernetics |
WOS ID | WOS:000578826300003 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC |
Document Type | Journal article |
Identifier | http://ir.ia.ac.cn/handle/173211/42180 |
Collection | Institute of Automation, Chinese Academy of Sciences |
Affiliation | 1. Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China ; 2. Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China ; 3. Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA |
First Author Affiliation | Institute of Automation, Chinese Academy of Sciences |
Recommended Citation GB/T 7714 | Zhu, Yuanheng, Zhao, Dongbin, He, Haibo. Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50(11): 3959-3971. |
APA | Zhu, Yuanheng, Zhao, Dongbin, & He, Haibo. (2020). Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 50(11), 3959-3971. |
MLA | Zhu, Yuanheng, et al. "Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control." IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 50.11 (2020): 3959-3971. |
Files in This Item: | There are no files associated with this item. |