CASIA OpenIR

浏览/检索结果: 共75条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 13
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
收藏  |  浏览/下载:35/0  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Value Iteration-Based Cooperative Adaptive Optimal Control for Multi-Player Differential Games With Incomplete Information 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 3, 页码: 690-697
作者:  Yun Zhang;  Lulu Zhang;  Yunze Cai
Adobe PDF(6850Kb)  |  收藏  |  浏览/下载:72/29  |  提交时间:2024/02/19
Adaptive dynamic programming  incomplete information  multi-player differential game  value iteration  
Adaptive Optimal Discrete-Time Output-Feedback Using an Internal Model Principle and Adaptive Dynamic Programming 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 131-140
作者:  Zhongyang Wang;  Youqing Wang;  Zdzisław Kowalczuk
Adobe PDF(1538Kb)  |  收藏  |  浏览/下载:134/72  |  提交时间:2024/01/02
Adaptive dynamic programming (ADP)  internal model principle (IMP)  output feedback problem  policy iteration (PI)  value iteration (VI)  
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 13
作者:  Wei, Qinglai;  Zhou, Tianmin;  Lu, Jingwei;  Liu, Yu;  Su, Shuai;  Xiao, Jun
收藏  |  浏览/下载:101/0  |  提交时间:2023/11/17
Adaptive dynamic programming (ADP)  Hamilton-Jacobi-Bellman equation (HJBE)  nonlinear stochastic system  stochastic policy iteration (PI)  
Data-Driven Optimal Output Cluster Synchronization Control of Heterogeneous Multi-Agent Systems 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 页码: 11
作者:  Li, Hongyang;  Wei, Qinglai
收藏  |  浏览/下载:52/0  |  提交时间:2023/11/17
Index Terms- Output cluster synchronization control  data-driven control  adaptive dynamic programming  policy iteration  heterogeneous multi-agent systems  optimal control  
Enhancing Iterative Learning Control With Fractional Power Update Law 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 5, 页码: 1137-1149
作者:  Zihan Li;  Dong Shen;  Xinghuo Yu
Adobe PDF(2070Kb)  |  收藏  |  浏览/下载:102/32  |  提交时间:2023/04/26
Asymptotic convergence  convergence rate  finite-iteration tracking  fractional power learning rule  limit cycles  
Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 3, 页码: 781-791
作者:  Guangyu Zhu;  Xiaolu Li;  Ranran Sun;  Yiyuan Yang;  Peng Zhang
Adobe PDF(2432Kb)  |  收藏  |  浏览/下载:167/62  |  提交时间:2023/03/02
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  optimal control  policy iteration  time-varying  
A Novel Data-Based Fault-Tolerant Control Method for Multicontroller Linear Systems via Distributed Policy Iteration 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 页码: 11
作者:  Wei, Qinglai;  Li, Hongyang;  Li, Tao;  Wang, Fei-Yue
收藏  |  浏览/下载:188/0  |  提交时间:2023/02/22
Fault tolerant systems  Fault tolerance  Linear systems  Control systems  Actuators  Optimal control  Mathematical models  Distributed policy iteration  fault-tolerant control  multicontroller linear systems  optimal control  
Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 7, 页码: 1262-1272
作者:  Mingming Ha;  Ding Wang;  Derong Liu
Adobe PDF(1832Kb)  |  收藏  |  浏览/下载:198/66  |  提交时间:2022/06/27
Adaptive critic design  adaptive dynamic programming (ADP)  approximate dynamic programming  discrete-time nonlinear systems  reinforcement learning  stability analysis  tracking control  value iteration (VI)  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
收藏  |  浏览/下载:196/0  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum