CASIA OpenIR

Browse/Search Results:  1-9 of 9 Help

Filters    
Selected(0)Clear Items/Page:    Sort:
Feature Aggregation With Reinforcement Learning for Video-Based Person Re-Identification 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 卷号: 30, 期号: 12, 页码: 3847-3852
Authors:  Zhang, Wei;  He, Xuanyu;  Lu, Weizhi;  Qiao, Hong;  Li, Yibin
Favorite  |  View/Download:0/0  |  Submit date:2020/03/30
Feature extraction  Task analysis  Cameras  Noise measurement  Learning systems  Reinforcement learning  Feature aggregation  reinforcement learning (RL)  sequential decision making  video-based person re-identification (re-id)  
Manifold Regularized Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 932-943
Authors:  Li, Hongliang;  Liu, Derong;  Wang, Ding
Favorite  |  View/Download:45/0  |  Submit date:2018/10/10
Adaptive Dynamic Programming  Approximate Dynamic Programming  Approximate Policy Iteration (Api)  Manifold Regularization  Reinforcement Learning (Rl)  
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 3, 页码: 714-725
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
View  |  Adobe PDF(547Kb)  |  Favorite  |  View/Download:179/85  |  Submit date:2017/05/05
Adaptive Dynamic Programming (Adp)  H-infinity Control  Policy Iteration (Pi)  Zero-sum Game (Zsg)  
Model-Free Optimal Tracking Control via Critic-Only Q-Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 10, 页码: 2134-2144
Authors:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Wang, Ding;  Luo,Biao
View  |  Adobe PDF(1521Kb)  |  Favorite  |  View/Download:216/111  |  Submit date:2016/10/24
Critic-only Q-learning (Coql)  Model-free  Nonaffine Nonlinear Systems  Optimal Tracking Control  
Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 2, 页码: 444-458
Authors:  Wei, Qinglai;  Song, Ruizhuo;  Yan, Pengfei
View  |  Adobe PDF(2204Kb)  |  Favorite  |  View/Download:143/49  |  Submit date:2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Recurrent Neural Network (Rnn)  Reinforcement Learning  
Adaptive Optimal Control of Highly Dissipative Nonlinear Spatially Distributed Processes With Neuro-Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 684-696
Authors:  Luo, Biao;  Wu, Huai-Ning;  Li, Han-Xiong
View  |  Adobe PDF(2465Kb)  |  Favorite  |  View/Download:95/19  |  Submit date:2016/03/30
Adaptive Optimal Control  Empirical Eigenfunction (Eef)  Highly Dissipative Partial Differential Equations (Pdes)  Neuro-dynamic Programming (Ndp)  Spatially Distributed Processes (Sdps)  
GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 3, 页码: 614-627
Authors:  Ni, Zhen;  He, Haibo;  Zhao, Dongbin;  Xu, Xin;  Prokhorov, Danil V.
Favorite  |  View/Download:63/0  |  Submit date:2015/09/21
Adaptive Control  Adaptive Dynamic Programming (Adp)  Dual Heuristic Dynamic Programming (Dhp)  General Utility Function  Goal Representation  Reinforcement Learning (Rl)  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
Authors:  Zhao, Dongbin;  Zhu, Yuanheng
View  |  Adobe PDF(2156Kb)  |  Favorite  |  View/Download:96/36  |  Submit date:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation  
Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 卷号: 25, 期号: 3, 页码: 621-634
Authors:  Liu, Derong;  Wei, Qinglai
Adobe PDF(2635Kb)  |  Favorite  |  View/Download:42/8  |  Submit date:2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Discrete-time Policy Iteration  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning