CASIA OpenIR

Browse/Search Results:  1-10 of 10 Help

Selected(0)Clear Items/Page:    Sort:
Feature Aggregation With Reinforcement Learning for Video-Based Person Re-Identification 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 卷号: 30, 期号: 12, 页码: 3847-3852
Authors:  Zhang, Wei;  He, Xuanyu;  Lu, Weizhi;  Qiao, Hong;  Li, Yibin
Favorite  |  View/Download:0/0  |  Submit date:2020/03/30
Feature extraction  Task analysis  Cameras  Noise measurement  Learning systems  Reinforcement learning  Feature aggregation  reinforcement learning (RL)  sequential decision making  video-based person re-identification (re-id)  
Optimized Adaptive Nonlinear Tracking Control Using Actor-Critic Reinforcement Learning Strategy 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 卷号: 15, 期号: 9, 页码: 4969-4977
Authors:  Wen, Guoxing;  Chen, C. L. Philip;  Ge, Shuzhi Sam;  Yang, Hongli;  Liu, Xiaoguang
Favorite  |  View/Download:9/0  |  Submit date:2019/12/16
Lyapunov function  neural networks (NNs)  nonlinear systems  optimized tracking control  reinforcement learning (RL) of actor-critic architecture  
Adaptive Tracking Control of Surface Vessel Using Optimized Backstepping Technique 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 9, 页码: 3420-3431
Authors:  Wen, Guoxing;  Ge, Shuzhi Sam;  Chen, C. L. Philip;  Tu, Fangwen;  Wang, Shengnan
Favorite  |  View/Download:7/0  |  Submit date:2019/12/16
Actor-critic architecture  Lyapunov stability  optimized backstepping (OB)  reinforcement learning (RL)  surface vessel  
Guided Policy Search for Sequential Multitask Learning 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 卷号: 49, 期号: 1, 页码: 216-226
Authors:  Xiong, Fangzhou;  Sun, Biao;  Yang, Xu;  Qiao, Hong;  Huang, Kaizhu;  Hussain, Amir;  Liu, Zhiyong
Favorite  |  View/Download:22/0  |  Submit date:2019/07/12
Elastic weight consolidation (EWC)  guided policy search (GPS)  reinforcement learning (RL)  sequential multitask learning  
Optimized Multi-Agent Formation Control Based on an Identifier-Actor--Critic Reinforcement Learning Algorithm 期刊论文
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 卷号: 26, 期号: 5, 页码: 2719-2731
Authors:  Wen, Guoxing;  Chen, C. L. Philip;  Feng, Jun;  Zhou, Ning
Favorite  |  View/Download:12/0  |  Submit date:2019/12/16
Fuzzy logic systems (FLSs)  identifier-actor-critic architecture  multi-agent formation  optimized formation control  reinforcement learning (RL)  
Manifold Regularized Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 932-943
Authors:  Li, Hongliang;  Liu, Derong;  Wang, Ding
Favorite  |  View/Download:45/0  |  Submit date:2018/10/10
Adaptive Dynamic Programming  Approximate Dynamic Programming  Approximate Policy Iteration (Api)  Manifold Regularization  Reinforcement Learning (Rl)  
Reinforcement Learning Optimized Look-Ahead Energy Management of a Parallel Hybrid Electric Vehicle 期刊论文
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2017, 卷号: 22, 期号: 4, 页码: 1497-1507
Authors:  Liu, Teng;  Hu, Xiaosong;  Li, Shengbo Eben;  Cao, Dongpu
Favorite  |  View/Download:4/0  |  Submit date:2019/12/16
Energy management  hybrid electric vehicle (HEV)  Markov chain (MC)  predictive control  reinforcement learning (RL)  
Reinforcement-Learning-Based Robust Controller Design for Continuous-Time Uncertain Nonlinear Systems Subject to Input Constraints 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 卷号: 45, 期号: 7, 页码: 1372-1385
Authors:  Liu, Derong;  Yang, Xiong;  Wang, Ding;  Wei, Qinglai
View  |  Adobe PDF(1179Kb)  |  Favorite  |  View/Download:211/120  |  Submit date:2015/09/17
Approximate Dynamic Programming (Adp)  Neural Networks (Nns)  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning (Rl)  Robust Control  
GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 3, 页码: 614-627
Authors:  Ni, Zhen;  He, Haibo;  Zhao, Dongbin;  Xu, Xin;  Prokhorov, Danil V.
Favorite  |  View/Download:63/0  |  Submit date:2015/09/21
Adaptive Control  Adaptive Dynamic Programming (Adp)  Dual Heuristic Dynamic Programming (Dhp)  General Utility Function  Goal Representation  Reinforcement Learning (Rl)  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
Authors:  Zhao, Dongbin;  Zhu, Yuanheng
View  |  Adobe PDF(2156Kb)  |  Favorite  |  View/Download:96/36  |  Submit date:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation