CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:331/75  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 6, 页码: 2064-2076
作者:  Li, Haoran;  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(4274Kb)  |  收藏  |  浏览/下载:369/114  |  提交时间:2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation  
Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 1226-1238
作者:  Wei, Qinglai;  Li, Benkai;  Song, Ruizhuo
浏览  |  Adobe PDF(2475Kb)  |  收藏  |  浏览/下载:386/125  |  提交时间:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration (Gpi)  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
Model-Free Optimal Tracking Control via Critic-Only Q-Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 10, 页码: 2134-2144
作者:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Wang, Ding;  Luo,Biao
浏览  |  Adobe PDF(1521Kb)  |  收藏  |  浏览/下载:579/284  |  提交时间:2016/10/24
Critic-only Q-learning (Coql)  Model-free  Nonaffine Nonlinear Systems  Optimal Tracking Control  
The Twist Tensor Nuclear Norm for Video Completion 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 12, 页码: 2961-2973
作者:  Hu, Wenrui;  Tao, Dacheng;  Zhang, Wensheng;  Xie, Yuan;  Yang, Yehui;  Wensheng Zhang
浏览  |  Adobe PDF(24685Kb)  |  收藏  |  浏览/下载:479/144  |  提交时间:2016/10/22
Low-rank Tensor Estimation (Lrte)  Tensor Multirank  Tensor Nuclear Norm (Tnn)  Twist Tensor  Video Completion  
GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 3, 页码: 614-627
作者:  Ni, Zhen;  He, Haibo;  Zhao, Dongbin;  Xu, Xin;  Prokhorov, Danil V.
收藏  |  浏览/下载:189/0  |  提交时间:2015/09/21
Adaptive Control  Adaptive Dynamic Programming (Adp)  Dual Heuristic Dynamic Programming (Dhp)  General Utility Function  Goal Representation  Reinforcement Learning (Rl)  
Multiple Actor-Critic Structures for Continuous-Time Optimal Control Using Input-Output Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 851-865
作者:  Song, Ruizhuo;  Lewis, Frank;  Wei, Qinglai;  Zhang, Hua-Guang;  Jiang, Zhong-Ping;  Levine, Dan;  Qinglai Wei
浏览  |  Adobe PDF(3455Kb)  |  收藏  |  浏览/下载:404/175  |  提交时间:2015/09/21
Actor-critic  Approximate Dynamic Programming (Adp)  Category  Optimal Control  Shunting Inhibitory Artificial Neural Network (Siann)  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
作者:  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2156Kb)  |  收藏  |  浏览/下载:267/110  |  提交时间:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation  
Error Bounds of Adaptive Dynamic Programming Algorithms for Solving Undiscounted Optimal Control Problems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 6, 页码: 1323-1334
作者:  Liu, Derong;  Li, Hongliang;  Wang, Ding
Adobe PDF(1114Kb)  |  收藏  |  浏览/下载:292/88  |  提交时间:2015/09/17
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control