CASIA OpenIR

Browse/search results: 16 items in total, showing items 1-10

Isoperimetric Constraint Inference for Discrete-Time Nonlinear Systems Based on Inverse Optimal Control (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2024, Pages: 1-13
Authors: Wei, Qinglai; Li, Tao; Zhang, Jie; Li, Hongyang; Wang, Xin; Xiao, Jun
Adobe PDF (1700Kb) | Views/Downloads: 51/22 | Submitted: 2024/05/28
Multiagent Adversarial Collaborative Learning via Mean-Field Theory (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 10, Pages: 4994-5007
Authors: Luo, Guiyang; Zhang, Hui; He, Haibo; Li, Jinglin; Wang, Fei-Yue
Views/Downloads: 221/0 | Submitted: 2021/12/28
Games  Training  Collaborative work  Task analysis  Nash equilibrium  Sociology  Statistics  Adversarial collaborative learning (ACL)  friend-or-foe Q-learning  mean-field theory  multiagent reinforcement learning (MARL)  
Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 6, Pages: 2929-2943
Authors: Song, Ruizhuo; Wei, Qinglai; Zhang, Huaguang; Lewis, Frank L.
Views/Downloads: 250/0 | Submitted: 2021/08/15
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  discrete-time  nonzero-sum (NZS)  off-policy  reinforcement learning (RL)  
Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 5, Pages: 2372-2383
Authors: Wei, Qinglai; Li, Hongyang; Yang, Xiong; He, Haibo
Adobe PDF (1246Kb) | Views/Downloads: 285/54 | Submitted: 2021/06/07
Optimal control  Nonlinear systems  Decentralized control  Mathematical model  Convergence  Multi-agent systems  Adaptive dynamic programming (ADP)  approximate dynamic programming  distributed policy iteration  nonlinear systems  optimal control  
Continuous-Time Time-Varying Policy Iteration (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2020, Volume: 50, Issue: 12, Pages: 4958-4971
Authors: Wei, Qinglai; Liao, Zehua; Yang, Zhanyu; Li, Benkai; Liu, Derong
Adobe PDF (3149Kb) | Views/Downloads: 303/56 | Submitted: 2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors: Zhang, Qichao; Zhao, Dongbin
Adobe PDF (1021Kb) | Views/Downloads: 460/137 | Submitted: 2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2018, Volume: 48, Issue: 2, Pages: 500-509
Authors: Zhu, Yuanheng; Zhao, Dongbin; Yang, Xiong; Zhang, Qichao
Adobe PDF (892Kb) | Views/Downloads: 328/52 | Submitted: 2018/10/10
Adaptive Dynamic Programming (ADP)  H Infinity Optimal Control  Policy Iteration (PI)  Polynomial Nonlinear Systems  Sum Of Squares (SOS)  
Adaptive Critic Nonlinear Robust Control: A Survey (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3429-3451
Authors: Wang, Ding; He, Haibo; Liu, Derong
Adobe PDF (1954Kb) | Views/Downloads: 450/151 | Submitted: 2018/03/03
Adaptive Critic Designs  Adaptive/approximate Dynamic Programming (ADP)  Boundedness  Convergence  Neural Networks  Optimal Control  Reinforcement Learning  Robust Control  Stability  
Improving the Critic Learning for Event-Based Nonlinear H-infinity Control Design (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3417-3428
Authors: Wang, Ding; He, Haibo; Liu, Derong
Adobe PDF (1068Kb) | Views/Downloads: 469/127 | Submitted: 2018/03/03
H-infinity Control  Adaptive Systems  Adaptive/approximate Dynamic Programming  Critic Network  Event-based Design  Learning Criterion  Neural Control  
Discrete-Time Deterministic Q-Learning: A Novel Convergence Analysis (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 5, Pages: 1224-1237
Authors: Wei, Qinglai; Lewis, Frank L.; Sun, Qiuye; Yan, Pengfei; Song, Ruizhuo
Views/Downloads: 245/0 | Submitted: 2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Neural Networks (NNs)  Neuro-dynamic Programming  Optimal Control  Q-learning