CASIA OpenIR

Browse/Search Results:  1-10 of 78 Help

Selected(0)Clear Items/Page:    Sort:
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 8, 页码: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
View  |  Adobe PDF(1021Kb)  |  Favorite  |  View/Download:36/10  |  Submit date:2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
Reinforcement Learning and Deep Learning based Lateral Control for Autonomous Driving 期刊论文
IEEE Computational Intelligence Magazine, IEEE Computational Intelligence Magazine, 2019, 2019, 卷号: 14, 14, 期号: 2, 页码: 83-98, 83-98
Authors:  Dong Li;  Dongbin Zhao;  Qichao Zhang;  Yaran Chen
View  |  Adobe PDF(2205Kb)  |  Favorite  |  View/Download:53/16  |  Submit date:2019/04/25
Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  
基于自适应动态规划的可重构机器人系统分散控制方法研究 研究报告
2019
Authors:  董博
Adobe PDF(2806Kb)  |  Favorite  |  View/Download:71/1  |  Submit date:2019/03/12
可重构机器人  分散控制  自适应动态规划  滑模控制  最优控制  动力学耦合效应  关节力矩估计  谐波传动  
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, 卷号: 3, 期号: 1, 页码: 73-84
Authors:  Kun Shao;  Yuanheng Zhu;  Dongbin Zhao
View  |  Adobe PDF(4125Kb)  |  Favorite  |  View/Download:75/45  |  Submit date:2019/04/22
Reinforcement Learning, Transfer Learning, Curriculum Learning, Neural Network, Game Ai  
Deep reinforcement learning based automatic exploration for navigation in unknown environment 期刊论文
IEEE Transactions on Neural Network and Learning Systems, 2019, 期号: early acess, 页码: 1-13
Authors:  Li Haoran;  Zhang Qichao;  Zhao Dongbin
View  |  Adobe PDF(2946Kb)  |  Favorite  |  View/Download:77/44  |  Submit date:2019/10/09
Automatic Exploration  Deep Reinforcement Learning  Optimal Decision  Partial Observation  
Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions 期刊论文
IEEE ACCESS, 2019, 卷号: 7, 页码: 96148-96165
Authors:  Dong, Bo;  An, Tianjiao;  Zhou, Fan;  Liu, Keping;  Yu, Weibo;  Li, Yuanchun
Favorite  |  View/Download:6/0  |  Submit date:2019/12/16
Adaptive dynamic programming  collision identification  decentralized optimal control  modular robot manipulators  zero-sum game  
Optimized Multi-Agent Formation Control Based on an Identifier-Actor--Critic Reinforcement Learning Algorithm 期刊论文
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 卷号: 26, 期号: 5, 页码: 2719-2731
Authors:  Wen, Guoxing;  Chen, C. L. Philip;  Feng, Jun;  Zhou, Ning
Favorite  |  View/Download:3/0  |  Submit date:2019/12/16
Fuzzy logic systems (FLSs)  identifier-actor-critic architecture  multi-agent formation  optimized formation control  reinforcement learning (RL)  
Adaptive dynamic programming-based stabilization of nonlinear systems with unknown actuator saturation 期刊论文
NONLINEAR DYNAMICS, 2018, 卷号: 93, 期号: 4, 页码: 2089-2103
Authors:  Zhao, Bo;  Jia, Lihao;  Xia, Hongbing;  Li, Yuanchun
View  |  Adobe PDF(1069Kb)  |  Favorite  |  View/Download:86/37  |  Submit date:2018/10/10
Adaptive Dynamic Programming  Unknown Actuator Saturation  Continuous-time Nonlinear Systems  Stabilizing Control  Neural Networks  
Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 6, 页码: 2099-2111
Authors:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning
View  |  Adobe PDF(1045Kb)  |  Favorite  |  View/Download:86/34  |  Submit date:2018/10/10
Adaptive Control  Adaptive Dynamic Programming  Constraints  Critic-only  Data-based  Optimal Control  Q-learning  
Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 1226-1238
Authors:  Wei, Qinglai;  Li, Benkai;  Song, Ruizhuo
View  |  Adobe PDF(2475Kb)  |  Favorite  |  View/Download:86/18  |  Submit date:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration (Gpi)  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning