CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共2条,第1-2条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Discrete-Time Deterministic Q-Learning: A Novel Convergence Analysis 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 5, 页码: 1224-1237
作者:  Wei, Qinglai;  Lewis, Frank L.;  Sun, Qiuye;  Yan, Pengfei;  Song, Ruizhuo
收藏  |  浏览/下载:217/0  |  提交时间:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neural Networks (Nns)  Neuro-dynamic Programming  Optimal Control  Q-learning  
Off-Policy Actor-Critic Structure for Optimal Control of Unknown Systems With Disturbances 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 5, 页码: 1041-1050
作者:  Song, Ruizhuo;  Lewis, Frank L.;  Wei, Qinglai;  Zhang, Huaguang
收藏  |  浏览/下载:209/0  |  提交时间:2016/10/20
Adaptive Critic Designs  Adaptive/approximate Dynamic Programming (Adp)  Dynamic Programming  Off-policy  Optimal Control  Unknown System