CASIA OpenIR

Browse/search results: 9 items in total, showing items 1-9

Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 2, Pages: 879-892
Authors:  Wei, Qinglai;  Zhu, Liao;  Song, Ruizhuo;  Zhang, Pinjia;  Liu, Derong;  Xiao, Jun
Views/Downloads: 236/0  |  Submitted: 2022/03/17
Heuristic algorithms  Nonlinear systems  Optimal control  Mathematical model  Dynamic programming  Games  Adaptive systems  Adaptive dynamic programming (ADP)  globalized dual-heuristic dynamic programming (GDHP)  multiplayer nonzero-sum game (MP-NZSG)  neural network (NN)  
Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management [Journal article]
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, Volume: 17, Issue: 10, Pages: 6614-6623
Authors:  Wei, Qinglai;  Liao, Zehua;  Shi, Guang
Adobe PDF(1229Kb)  |  Views/Downloads: 247/33  |  Submitted: 2021/11/02
Optimal control  Process control  Smart homes  Dynamic programming  Numerical models  Iterative methods  Informatics  Actor-critic learning  adaptive critic designs  adaptive dynamic programming (ADP)  approximate dynamic programming  energy management  optimal control  smart grid  
Self-Learning Optimal Control for Ice-Storage Air Conditioning Systems via Data-Based Adaptive Dynamic Programming [Journal article]
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, Volume: 68, Issue: 4, Pages: 3599-3608
Authors:  Wei, Qinglai;  Liao, Zehua;  Song, Ruizhuo;  Zhang, Pinjia;  Wang, Zhuo;  Xiao, Jun
Adobe PDF(3395Kb)  |  Views/Downloads: 350/35  |  Submitted: 2021/03/02
Optimal control  Air conditioning  Load modeling  Neural networks  Dynamic programming  Predictive models  Adaptive dynamic programming (ADP)  cooling load prediction  ice-storage air conditioning (IAC)  neural network  optimal control  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 12, Pages: 5245-5256
Authors:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  Views/Downloads: 319/75  |  Submitted: 2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor-critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Continuous-Time Time-Varying Policy Iteration [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2020, Volume: 50, Issue: 12, Pages: 4958-4971
Authors:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  Views/Downloads: 262/49  |  Submitted: 2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
Discrete-Time Impulsive Adaptive Dynamic Programming [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2020, Volume: 50, Issue: 10, Pages: 4293-4306
Authors:  Wei, Qinglai;  Song, Ruizhuo;  Liao, Zehua;  Li, Benkai;  Lewis, Frank L.
Views/Downloads: 251/0  |  Submitted: 2021/01/07
Optimal control  Performance analysis  Nonlinear systems  Dynamic programming  Heuristic algorithms  Indexes  Adaptive critic designs  adaptive dynamic programming (ADP)  approximate dynamic programming  impulsive control  nonlinear systems  optimal control  
Neuro-optimal control for discrete stochastic processes via a novel policy iteration algorithm [Journal article]
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, Volume: 50, Issue: 11, Pages: 3972-3985
Authors:  Liang, Mingming;  Wang, Ding;  Liu, Derong
Adobe PDF(1604Kb)  |  Views/Downloads: 184/63  |  Submitted: 2020/10/23
Adaptive critic designs  adaptive dynamic programming (ADP)  local policy iteration  neuro-dynamic programming  optimal control  stochastic processes  
Output Tracking Control Based on Adaptive Dynamic Programming With Multistep Policy Evaluation [Journal article]
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, Volume: 49, Issue: 10, Pages: 2155-2165
Authors:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Liu, Jiangjiang
Views/Downloads: 220/0  |  Submitted: 2019/12/16
Adaptive dynamic programming (ADP)  Bellman equation  heuristic dynamic programming  neural networks (NNs)  output tracking control  
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1021Kb)  |  Views/Downloads: 407/120  |  Submitted: 2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics