CASIA OpenIR

浏览/检索结果: 共7条,第1-7条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Continuous-Time Time-Varying Policy Iteration 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 卷号: 50, 期号: 12, 页码: 4958-4971
作者:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  收藏  |  浏览/下载:235/49  |  提交时间:2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 12, 页码: 3337-3348
作者:  Luo, Biao;  Yang, Yin;  Liu, Derong
收藏  |  浏览/下载:257/0  |  提交时间:2019/01/08
Data-based  experience replay  neural networks (NNs)  off-policy  optimal control  Q-learning (QL)  
Adaptive Graph Matching 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 5, 页码: 1432-1445
作者:  Yang, Xu;  Liu, Zhi-Yong
浏览  |  Adobe PDF(2213Kb)  |  收藏  |  浏览/下载:294/85  |  提交时间:2017/10/12
Graduated Projection  Graph Matching  Point Correspondence  Regularization Method  
A Semi-Supervised Method for Surveillance-Based Visual Location Recognition 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 11, 页码: 3719-3732
作者:  Liu, Pengcheng;  Yang, Peipei;  Wang, Chong;  Huang, Kaiqi;  Tan, Tieniu
浏览  |  Adobe PDF(5404Kb)  |  收藏  |  浏览/下载:515/164  |  提交时间:2016/10/31
Cross-device (C-d) Recognition  Semi-supervised Learning  Visual Localization  
Reinforcement-Learning-Based Robust Controller Design for Continuous-Time Uncertain Nonlinear Systems Subject to Input Constraints 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 卷号: 45, 期号: 7, 页码: 1372-1385
作者:  Liu, Derong;  Yang, Xiong;  Wang, Ding;  Wei, Qinglai
浏览  |  Adobe PDF(1179Kb)  |  收藏  |  浏览/下载:459/241  |  提交时间:2015/09/17
Approximate Dynamic Programming (Adp)  Neural Networks (Nns)  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning (Rl)  Robust Control  
Finite-Approximation-Error-Based Discrete-Time Iterative Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 卷号: 44, 期号: 12, 页码: 2820-2833
作者:  Wei, Qinglai;  Wang, Fei-Yue;  Liu, Derong;  Yang, Xiong
浏览  |  Adobe PDF(1826Kb)  |  收藏  |  浏览/下载:290/107  |  提交时间:2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Approximation Error  Neural Networks  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  Value Iteration  
Neural-Network-Based Online HJB Solution for Optimal Robust Guaranteed Cost Control of Continuous-Time Uncertain Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 卷号: 44, 期号: 12, 页码: 2834-2847
作者:  Liu, Derong;  Wang, Ding;  Wang, Fei-Yue;  Li, Hongliang;  Yang, Xiong
浏览  |  Adobe PDF(780Kb)  |  收藏  |  浏览/下载:426/203  |  提交时间:2015/08/12
Adaptive Critic Designs  Adaptive/approximate Dynamic Programming (Adp)  Hamilton-jacobi-bellman (Hjb) Equation  Neural Networks  Optimal Robust Guaranteed Cost Control  Uncertain Nonlinear Systems