CASIA OpenIR

Browse/Search Results:  1-10 of 10 Help

Filters    
Selected(0)Clear Items/Page:    Sort:
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 8, 页码: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
View  |  Adobe PDF(1021Kb)  |  Favorite  |  View/Download:13/0  |  Submit date:2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 12, 页码: 3337-3348
Authors:  Luo, Biao;  Yang, Yin;  Liu, Derong
Favorite  |  View/Download:26/0  |  Submit date:2019/01/08
Data-based  experience replay  neural networks (NNs)  off-policy  optimal control  Q-learning (QL)  
Improving the Critic Learning for Event-Based Nonlinear H-infinity Control Design 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 10, 页码: 3417-3428
Authors:  Wang, Ding;  He, Haibo;  Liu, Derong
View  |  Adobe PDF(1068Kb)  |  Favorite  |  View/Download:64/10  |  Submit date:2018/03/03
H-infinity Control  Adaptive Systems  Adaptive/approximate Dynamic Programming  Critic Network  Event-based Design  Learning Criterion  Neural Control  
Adaptive Critic Nonlinear Robust Control: A Survey 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 10, 页码: 3429-3451
Authors:  Wang, Ding;  He, Haibo;  Liu, Derong
View  |  Adobe PDF(1954Kb)  |  Favorite  |  View/Download:79/23  |  Submit date:2018/03/03
Adaptive Critic Designs  Adaptive/approximate Dynamic Programming (Adp)  Boundedness  Convergence  Neural Networks  Optimal Control  Reinforcement Learning  Robust Control  Stability  
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 10, 页码: 3341-3354
Authors:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning;  Wang, Ding;  Lewis, Frank L.
View  |  Adobe PDF(3217Kb)  |  Favorite  |  View/Download:148/51  |  Submit date:2016/11/09
Adaptive Control  Adaptive Dynamic Programming (Adp)  Data-based  Off-policy Learning  Optimal Control  Policy Gradient  
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 3, 页码: 854-865
Authors:  Zhao, Dongbin;  Zhang, Qichao;  Wang, Ding;  Zhu, Yuanheng
View  |  Adobe PDF(1769Kb)  |  Favorite  |  View/Download:143/70  |  Submit date:2016/06/14
Adaptive Dynamic Programming (Adp)  Experience Replay  Nonzero-sum (Nzs) Games  Optimal Control  Unknown Dynamics  
Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 3, 页码: 840-853
Authors:  Wei, Qinglai;  Liu, Derong;  Lin, Hanquan;  Derong Liu
View  |  Adobe PDF(2015Kb)  |  Favorite  |  View/Download:107/38  |  Submit date:2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neural Networks  Neuro-dynamic Programming  Optimal Control  Reinforcement Learning  Value Iteration  
Reinforcement-Learning-Based Robust Controller Design for Continuous-Time Uncertain Nonlinear Systems Subject to Input Constraints 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 卷号: 45, 期号: 7, 页码: 1372-1385
Authors:  Liu, Derong;  Yang, Xiong;  Wang, Ding;  Wei, Qinglai
View  |  Adobe PDF(1179Kb)  |  Favorite  |  View/Download:152/73  |  Submit date:2015/09/17
Approximate Dynamic Programming (Adp)  Neural Networks (Nns)  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning (Rl)  Robust Control  
Neural-Network-Based Online HJB Solution for Optimal Robust Guaranteed Cost Control of Continuous-Time Uncertain Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 卷号: 44, 期号: 12, 页码: 2834-2847
Authors:  Liu, Derong;  Wang, Ding;  Wang, Fei-Yue;  Li, Hongliang;  Yang, Xiong
View  |  Adobe PDF(780Kb)  |  Favorite  |  View/Download:113/53  |  Submit date:2015/08/12
Adaptive Critic Designs  Adaptive/approximate Dynamic Programming (Adp)  Hamilton-jacobi-bellman (Hjb) Equation  Neural Networks  Optimal Robust Guaranteed Cost Control  Uncertain Nonlinear Systems  
Finite-Approximation-Error-Based Discrete-Time Iterative Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 卷号: 44, 期号: 12, 页码: 2820-2833
Authors:  Wei, Qinglai;  Wang, Fei-Yue;  Liu, Derong;  Yang, Xiong
View  |  Adobe PDF(1826Kb)  |  Favorite  |  View/Download:89/30  |  Submit date:2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Approximation Error  Neural Networks  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  Value Iteration