CASIA OpenIR

Browse/Search Results: 7 items in total, showing items 1-7

Isoperimetric Constraint Inference for Discrete-Time Nonlinear Systems Based on Inverse Optimal Control (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2024, Pages: 1-13
Authors:  Wei, Qinglai;  Li, Tao;  Zhang, Jie;  Li, Hongyang;  Wang, Xin;  Xiao, Jun
Adobe PDF(1700Kb)  |  Views/Downloads: 20/8  |  Submitted: 2024/05/28
Multiagent Adversarial Collaborative Learning via Mean-Field Theory (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 10, Pages: 4994-5007
Authors:  Luo, Guiyang;  Zhang, Hui;  He, Haibo;  Li, Jinglin;  Wang, Fei-Yue
Views/Downloads: 206/0  |  Submitted: 2021/12/28
Games;  Training;  Collaborative work;  Task analysis;  Nash equilibrium;  Sociology;  Statistics;  Adversarial collaborative learning (ACL);  friend-or-foe Q-learning;  mean-field theory;  multiagent reinforcement learning (MARL)
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1021Kb)  |  Views/Downloads: 440/130  |  Submitted: 2019/07/12
Integral reinforcement learning (IRL);  neural network (NN);  nonzero-sum (NZS) games;  off-policy;  single-critic;  unknown drift dynamics
Adaptive Critic Nonlinear Robust Control: A Survey (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3429-3451
Authors:  Wang, Ding;  He, Haibo;  Liu, Derong
Adobe PDF(1954Kb)  |  Views/Downloads: 432/147  |  Submitted: 2018/03/03
Adaptive Critic Designs;  Adaptive/approximate Dynamic Programming (ADP);  Boundedness;  Convergence;  Neural Networks;  Optimal Control;  Reinforcement Learning;  Robust Control;  Stability
Improving the Critic Learning for Event-Based Nonlinear H-infinity Control Design (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3417-3428
Authors:  Wang, Ding;  He, Haibo;  Liu, Derong
Adobe PDF(1068Kb)  |  Views/Downloads: 450/123  |  Submitted: 2018/03/03
H-infinity Control;  Adaptive Systems;  Adaptive/approximate Dynamic Programming;  Critic Network;  Event-based Design;  Learning Criterion;  Neural Control
FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 6, Pages: 1367-1379
Authors:  Zhang, Zhen;  Zhao, Dongbin;  Gao, Junwei;  Wang, Dongqing;  Dai, Yujie
Views/Downloads: 293/0  |  Submitted: 2017/07/18
Multiagent Reinforcement Learning (MARL);  Nash Equilibrium;  Q-learning;  Repeated Game
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2016, Volume: 46, Issue: 3, Pages: 854-865
Authors:  Zhao, Dongbin;  Zhang, Qichao;  Wang, Ding;  Zhu, Yuanheng
Adobe PDF(1769Kb)  |  Views/Downloads: 524/200  |  Submitted: 2016/06/14
Adaptive Dynamic Programming (ADP);  Experience Replay;  Nonzero-sum (NZS) Games;  Optimal Control;  Unknown Dynamics