CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Convergence Analysis 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 卷号: 48, 期号: 6, 页码: 875-891
作者:  Wei, Qinglai;  Lewis, Frank L.;  Liu, Derong;  Song, Ruizhuo;  Lin, Hanquan
收藏  |  浏览/下载:267/0  |  提交时间:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Local Iteration  Neural Networks  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  
Neural-network-based synchronous iteration learning method for multi-player zero-sum games 期刊论文
NEUROCOMPUTING, 2017, 卷号: 242, 页码: 73-82
作者:  Song, Ruizhuo;  Wei, Qinglai;  Song, Biao
收藏  |  浏览/下载:338/0  |  提交时间:2017/09/12
Adaptive Dynamic Programming  Approximate Dynamic Programming  Adaptive Critic Designs  Multi-player  Iteration Learning  Neural Network  
Off-policy neuro-optimal control for unknown complex-valued nonlinear systems based on policy iteration 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2017, 卷号: 28, 期号: 6, 页码: 1435-1441
作者:  Song, Ruizhuo;  Wei, Qinglai;  Xiao, Wendong
收藏  |  浏览/下载:189/0  |  提交时间:2017/02/23
Adaptive Dynamic Programming  Approximate Dynamic Programming  Adaptive Critic Designs  Optimal Control  
Discrete-Time Deterministic Q-Learning: A Novel Convergence Analysis 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 5, 页码: 1224-1237
作者:  Wei, Qinglai;  Lewis, Frank L.;  Sun, Qiuye;  Yan, Pengfei;  Song, Ruizhuo
收藏  |  浏览/下载:208/0  |  提交时间:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neural Networks (Nns)  Neuro-dynamic Programming  Optimal Control  Q-learning  
Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 3, 页码: 704-713
作者:  Song, Ruizhuo;  Lewis, Frank L.;  Wei, Qinglai
收藏  |  浏览/下载:174/0  |  提交时间:2017/05/05
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Integral Reinforcement Learning (Irl)  Nonlinear Systems  Nonzero Sum (Nzs)  Off-policy