Optimal Output Regulation for Model-Free Quanser Helicopter With Multistep Q-Learning
Luo, Biao (1); Wu, Huai-Ning (2); Huang, Tingwen (3)
Journal: IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS
2018-06-01
Volume: 65, Issue: 6, Pages: 4953-4961
Article type: Article
Abstract: In this paper, the optimal output regulation problem is considered for the model-free 2-degree-of-freedom (2-DOF) helicopter. A multistep Q-learning (MsQL) method is developed with multistep policy evaluation. First, by introducing the Q-function, the optimal output regulation problem is converted to finding the optimal Q-function. Therefore, the MsQL algorithm is proposed and its convergence theory is established by showing that it generates a non-increasing Q-function sequence that converges to the optimal Q-function. In the MsQL, the step-size of multistep policy evaluation can be different at each iteration and an adaptive tuning rule is proposed. The MsQL learns the optimal Q-function by using real system data rather than using a system model. Finally, the developed MsQL method is employed to solve the optimal output regulation problem of the model-free 2-DOF helicopter, and its effectiveness is verified.
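The multistep policy evaluation idea in the abstract can be illustrated on a toy problem. The following is a minimal sketch, not the authors' implementation: it assumes a five-state tabular problem instead of the 2-DOF helicopter, a hypothetical plant `black_box_step` that is queried only to record data, and a fixed hypothetical schedule `step_sizes` in place of the adaptive tuning rule, which the abstract does not specify. The initial Q-function is that of a simple admissible policy, which is the kind of starting point under which the iterates can be non-increasing.

```python
import numpy as np

N_STATES, N_ACTIONS = 5, 2  # toy sizes (assumption), state 0 is the target


def black_box_step(x, u):
    """Hypothetical plant: returns (stage cost, next state)."""
    if x == 0:
        return 0.0, 0  # target state is absorbing and free
    move, cost = (1, 1.0) if u == 0 else (2, 1.5)
    return cost, max(x - move, 0)


# Recorded data (x, u, cost, x_next); the learner below uses only this list.
data = [(x, u, *black_box_step(x, u))
        for x in range(N_STATES) for u in range(N_ACTIONS)]


def multistep_evaluation(Q, policy, data, n_steps):
    """Apply the policy-evaluation backup n_steps times using data only."""
    for _ in range(n_steps):
        Q_new = Q.copy()
        for x, u, cost, x_next in data:
            Q_new[x, u] = cost + Q[x_next, policy[x_next]]
        Q = Q_new
    return Q


def msql(data, step_sizes):
    """Multistep Q-learning sketch: greedy policy, then multistep evaluation."""
    # Initial Q-function: that of the policy "always take action 0".
    # N_STATES sweeps from zero are enough because this policy reaches
    # the target in at most N_STATES - 1 steps.
    policy0 = np.zeros(N_STATES, dtype=int)
    Q = multistep_evaluation(np.zeros((N_STATES, N_ACTIONS)),
                             policy0, data, N_STATES)
    history = [Q]
    for n_steps in step_sizes:
        policy = Q.argmin(axis=1)  # costs are minimized
        Q = multistep_evaluation(Q, policy, data, n_steps)
        history.append(Q)
    return Q, history


Q_final, history = msql(data, step_sizes=[1, 2, 3, 3, 3])
print(Q_final)
```

With a step size of 1 at every iteration the loop reduces to value iteration, and with a very large step size it approaches policy iteration; a varying schedule sits between the two.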
Keywords: Helicopter ; Model-free ; Multistep Policy Evaluation ; Optimal Output Regulation ; Q-learning
WOS headings: Science & Technology ; Technology
DOI: 10.1109/TIE.2017.2772162
Keywords [WOS]: H-INFINITY CONTROL ; DISCRETE-TIME-SYSTEMS ; ADAPTIVE OPTIMAL-CONTROL ; UNKNOWN DYNAMICS ; POLICY ITERATION ; CONTROL DESIGN ; TRACKING CONTROL
Indexed by: SCI
Language: English
Funding: National Natural Science Foundation of China (61503377 ; 61533017 ; 61625302 ; 61473011 ; U1501251) ; Qatar National Research Fund under National Priority Research Project (NPRP 9 166-1-031)
WOS research areas: Automation & Control Systems ; Engineering ; Instruments & Instrumentation
WOS categories: Automation & Control Systems ; Engineering, Electrical & Electronic ; Instruments & Instrumentation
WOS accession number: WOS:000425618900051
Citation statistics
Times cited: 69 [WOS]
Document type: Journal article
Item identifier: http://ir.ia.ac.cn/handle/173211/21957
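Collection: State Key Laboratory of Multimodal Artificial Intelligence Systems_Complex Systems Intelligent Mechanism and Parallel Control Team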
Author affiliations:
1. Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
2. Beihang Univ, Beijing Univ Aeronaut & Astronaut, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
3. Texas A&M Univ Qatar, Doha, Qatar
First author affiliation: Institute of Automation, Chinese Academy of Sciences
Recommended citation
GB/T 7714: Luo, Biao, Wu, Huai-Ning, Huang, Tingwen. Optimal Output Regulation for Model-Free Quanser Helicopter With Multistep Q-Learning[J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2018, 65(6): 4953-4961.
APA: Luo, Biao, Wu, Huai-Ning, & Huang, Tingwen. (2018). Optimal Output Regulation for Model-Free Quanser Helicopter With Multistep Q-Learning. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 65(6), 4953-4961.
MLA: Luo, Biao, et al. "Optimal Output Regulation for Model-Free Quanser Helicopter With Multistep Q-Learning". IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS 65.6 (2018): 4953-4961.