CASIA OpenIR

Browse/Search Results: 19 items in total, showing items 1-10

Neuro-Optimal Trajectory Tracking With Value Iteration of Discrete-Time Nonlinear Dynamics (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors:  Wang, Ding;  Ha, Mingming;  Cheng, Long
Views/Downloads: 258/0  |  Submitted: 2022/01/27
Trajectory  Heuristic algorithms  Convergence  Trajectory tracking  Stability criteria  Optimal control  Dynamic programming  Adaptive critic design  discrete-time nonlinear plants  neuro-optimal trajectory tracking  uniformly ultimately bounded stability  value iteration  
Multiagent Reinforcement Learning: Rollout and Policy Iteration (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2021, Volume: 8, Issue: 2, Pages: 249-272
Authors:  Dimitri Bertsekas
Adobe PDF (2312 KB)  |  Views/Downloads: 138/31  |  Submitted: 2021/04/09
Dynamic programming  multiagent problems  neuro-dynamic programming  policy iteration  reinforcement learning  rollout
A partial policy iteration ADP algorithm for nonlinear neuro-optimal control with discounted total reward (Journal article)
NEUROCOMPUTING, 2021, Volume: 424, Pages: 23-34
Authors:  Liang, Mingming;  Wei, Qinglai
Views/Downloads: 208/0  |  Submitted: 2021/03/15
Adaptive critic designs  Adaptive dynamic programming  Policy iteration  Neural networks  Neuro-dynamic programming  Nonlinear systems  Optimal control  
Adaptive Dynamic Programming for Control: A Survey and Recent Advances (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, Volume: 51, Issue: 1, Pages: 142-160
Authors:  Liu, Derong;  Xue, Shan;  Zhao, Bo;  Luo, Biao;  Wei, Qinglai
Views/Downloads: 220/0  |  Submitted: 2021/03/08
Adaptive critic designs (ACDs)  adaptive dynamic programming  approximate dynamic programming  intelligent control  learning control  neural dynamic programming  neuro-dynamic programming  optimal control  reinforcement learning (RL)  
Continuous-Time Time-Varying Policy Iteration (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2020, Volume: 50, Issue: 12, Pages: 4958-4971
Authors:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF (3149 KB)  |  Views/Downloads: 271/50  |  Submitted: 2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
Neuro-optimal control for discrete stochastic processes via a novel policy iteration algorithm (Journal article)
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, Volume: 50, Issue: 11, Pages: 3972-3985
Authors:  Liang, Mingming;  Wang, Ding;  Liu, Derong
Adobe PDF (1604 KB)  |  Views/Downloads: 199/67  |  Submitted: 2020/10/23
Adaptive critic designs  adaptive dynamic programming (ADP)  local policy iteration  neuro-dynamic programming  optimal control  stochastic processes  
Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3367-3379
Authors:  Wei, Qinglai;  Liu, Derong;  Lin, Qiao;  Song, Ruizhuo
Views/Downloads: 232/0  |  Submitted: 2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Local Policy Iteration  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control
Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Convergence Analysis (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, Volume: 48, Issue: 6, Pages: 875-891
Authors:  Wei, Qinglai;  Lewis, Frank L.;  Liu, Derong;  Song, Ruizhuo;  Lin, Hanquan
Views/Downloads: 273/0  |  Submitted: 2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Local Iteration  Neural Networks  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control
Discrete-Time Deterministic Q-Learning: A Novel Convergence Analysis (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 5, Pages: 1224-1237
Authors:  Wei, Qinglai;  Lewis, Frank L.;  Sun, Qiuye;  Yan, Pengfei;  Song, Ruizhuo
Views/Downloads: 218/0  |  Submitted: 2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Neural Networks (NNs)  Neuro-dynamic Programming  Optimal Control  Q-learning
ADP-based optimal sensor scheduling for target tracking in energy harvesting wireless sensor networks (Journal article)
NEURAL COMPUTING & APPLICATIONS, 2016, Volume: 27, Issue: 6, Pages: 1543-1551
Authors:  Song, Ruizhuo;  Wei, Qinglai;  Xiao, Wendong
Views/Downloads: 181/0  |  Submitted: 2016/10/20
Adaptive Critic Designs  Adaptive Dynamic Programming  Approximate Dynamic Programming  Neuro-dynamic Programming  Neural Networks  Wireless Sensor Networks  Scheduling