CASIA OpenIR

Browse/Search results: 9 items in total, showing items 1-9

Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management (Journal article)
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, Volume: 17, Issue: 10, Pages: 6614-6623
Authors:  Wei, Qinglai;  Liao, Zehua;  Shi, Guang
Adobe PDF(1229Kb)  |  Views/Downloads: 232/32  |  Submitted: 2021/11/02
Optimal control  Process control  Smart homes  Dynamic programming  Numerical models  Iterative methods  Informatics  Actor-critic learning  adaptive critic designs  adaptive dynamic programming (ADP)  approximate dynamic programming  energy management  optimal control  smart grid  
Consensus Control of Leader-Following Multi-Agent Systems in Directed Topology With Heterogeneous Disturbances (Journal article)
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2021, Volume: 8, Issue: 2, Pages: 423-431
Authors:  Wei, Qinglai;  Wang, Xin;  Zhong, Xiangnan;  Wu, Naiqi
Adobe PDF(4423Kb)  |  Views/Downloads: 267/42  |  Submitted: 2021/03/08
Consensus control  directed topology  external disturbance  multi-agent (MA) systems  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 12, Pages: 5245-5256
Authors:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  Views/Downloads: 297/75  |  Submitted: 2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor-critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Continuous-Time Time-Varying Policy Iteration (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2020, Volume: 50, Issue: 12, Pages: 4958-4971
Authors:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  Views/Downloads: 241/49  |  Submitted: 2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
Parallel Control for Optimal Tracking via Adaptive Dynamic Programming (Journal article)
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, Volume: 7, Issue: 6, Pages: 1662-1674
Authors:  Lu, Jingwei;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(7214Kb)  |  Views/Downloads: 291/53  |  Submitted: 2021/01/06
Adaptive dynamic programming (ADP)  nonlinear optimal control  parallel controller  parallel control theory  parallel system  tracking control  neural network (NN)  
Event-Triggered Adaptive Critic Control Design for Discrete-Time Constrained Nonlinear Systems (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, Volume: 50, Issue: 9, Pages: 3158-3168
Authors:  Ha, Mingming;  Wang, Ding;  Liu, Derong
Views/Downloads: 134/0  |  Submitted: 2020/09/28
Nonlinear systems  Actuators  Discrete-time systems  Dynamic programming  Optimal control  Adaptive systems  Adaptive dynamic programming (ADP)  control constraints  event-triggered control  heuristic dynamic programming (HDP)  neural networks  nonlinear discrete-time system  
Improved value iteration for neural-network-based stochastic optimal control design (Journal article)
NEURAL NETWORKS, 2020, Volume: 124, Pages: 280-295
Authors:  Liang, Mingming;  Wang, Ding;  Liu, Derong
Adobe PDF(5875Kb)  |  Views/Downloads: 218/50  |  Submitted: 2020/06/02
Adaptive critic designs  Adaptive dynamic programming  Neural networks  Optimal control  Stochastic processes  Value iteration  
Neuro-optimal control for discrete stochastic processes via a novel policy iteration algorithm (Journal article)
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, Volume: 50, Issue: 11, Pages: 3972-3985
Authors:  Liang, Mingming;  Wang, Ding;  Liu, Derong
Adobe PDF(1604Kb)  |  Views/Downloads: 172/63  |  Submitted: 2020/10/23
Adaptive critic designs  adaptive dynamic programming (ADP)  local policy iteration  neuro-dynamic programming  optimal control  stochastic processes  
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1021Kb)  |  Views/Downloads: 388/119  |  Submitted: 2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics