CASIA OpenIR

浏览/检索结果: 共2条,第1-2条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Continuous-Time Time-Varying Policy Iteration 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 卷号: 50, 期号: 12, 页码: 4958-4971
作者:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  收藏  |  浏览/下载:268/50  |  提交时间:2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 12, 页码: 3337-3348
作者:  Luo, Biao;  Yang, Yin;  Liu, Derong
收藏  |  浏览/下载:293/0  |  提交时间:2019/01/08
Data-based  experience replay  neural networks (NNs)  off-policy  optimal control  Q-learning (QL)