CASIA OpenIR

Browse/Search Results: 19 items in total, showing items 1-10

Multiagent Adversarial Collaborative Learning via Mean-Field Theory [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 10, Pages: 4994-5007
Authors:  Luo, Guiyang;  Zhang, Hui;  He, Haibo;  Li, Jinglin;  Wang, Fei-Yue
Views/Downloads: 181/0  |  Submitted: 2021/12/28
Games  Training  Collaborative work  Task analysis  Nash equilibrium  Sociology  Statistics  Adversarial collaborative learning (ACL)  friend-or-foe Q-learning  mean-field theory  multiagent reinforcement learning (MARL)  
Sliding-Mode Surface-Based Approximate Optimal Control for Uncertain Nonlinear Systems With Asymptotically Stable Critic Structure [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 6, Pages: 2858-2869
Authors:  Zhao, Bo;  Liu, Derong;  Alippi, Cesare
Views/Downloads: 213/0  |  Submitted: 2021/08/15
Optimal control  Perturbation methods  Nonlinear systems  Uncertainty  Cost function  Stability analysis  Adaptive systems  Adaptive critic designs  adaptive dynamic programming (ADP)  optimal control  sliding mode surface (SMS)  unknown mismatched perturbations  
Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 6, Pages: 2929-2943
Authors:  Song, Ruizhuo;  Wei, Qinglai;  Zhang, Huaguang;  Lewis, Frank L.
Views/Downloads: 197/0  |  Submitted: 2021/08/15
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  discrete-time  nonzero-sum (NZS)  off-policy  reinforcement learning (RL)  
Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 5, Pages: 2372-2383
Authors:  Wei, Qinglai;  Li, Hongyang;  Yang, Xiong;  He, Haibo
Adobe PDF(1246Kb)  |  Views/Downloads: 214/36  |  Submitted: 2021/06/07
Optimal control  Nonlinear systems  Decentralized control  Mathematical model  Convergence  Multi-agent systems  Adaptive dynamic programming (ADP)  approximate dynamic programming  distributed policy iteration  nonlinear systems  optimal control  
Continuous-Time Time-Varying Policy Iteration [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2020, Volume: 50, Issue: 12, Pages: 4958-4971
Authors:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  Views/Downloads: 259/49  |  Submitted: 2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1021Kb)  |  Views/Downloads: 407/120  |  Submitted: 2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
A Survey of Cognitive Architectures in the Past 20 Years [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2018, Volume: 48, Issue: 12, Pages: 3280-3290
Authors:  Ye, Peijun;  Wang, Tao;  Wang, Fei-Yue
Adobe PDF(734Kb)  |  Views/Downloads: 700/255  |  Submitted: 2019/01/08
Agent  artificial intelligence (AI)  cognitive architecture (CA)  survey  
Policy Iteration for H∞ Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2018, Volume: 48, Issue: 2, Pages: 500-509
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  Views/Downloads: 289/36  |  Submitted: 2018/10/10
Adaptive Dynamic Programming (ADP)  H∞ Optimal Control  Policy Iteration (PI)  Polynomial Nonlinear Systems  Sum of Squares (SOS)  
Adaptive Critic Nonlinear Robust Control: A Survey [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3429-3451
Authors:  Wang, Ding;  He, Haibo;  Liu, Derong
Adobe PDF(1954Kb)  |  Views/Downloads: 409/143  |  Submitted: 2018/03/03
Adaptive Critic Designs  Adaptive/Approximate Dynamic Programming (ADP)  Boundedness  Convergence  Neural Networks  Optimal Control  Reinforcement Learning  Robust Control  Stability  
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3341-3354
Authors:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning;  Wang, Ding;  Lewis, Frank L.
Adobe PDF(3217Kb)  |  Views/Downloads: 575/205  |  Submitted: 2016/11/09
Adaptive Control  Adaptive Dynamic Programming (ADP)  Data-based  Off-policy Learning  Optimal Control  Policy Gradient