CASIA OpenIR

Browse/Search Results:  1-10 of 115

Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics  Journal article
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
Submit date:2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
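Deep Reinforcement Learning Methods for Vision-Based Control in Intelligent Driving (面向智能驾驶视觉控制的深度强化学习方法)  Thesis
Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2019
Authors:  Li, Dong (李栋)
Adobe PDF(6681Kb)  |  Submit date:2019/06/27
Deep reinforcement learning  Intelligent driving  Vision-based control  Object detection  Graph attention network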
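Research on Decentralized Control Methods for Reconfigurable Robot Systems Based on Adaptive Dynamic Programming (基于自适应动态规划的可重构机器人系统分散控制方法研究)  Research report
2019
Authors:  Dong, Bo (董博)
Adobe PDF(2806Kb)  |  Submit date:2019/03/12
Reconfigurable robots  Decentralized control  Adaptive dynamic programming  Sliding mode control  Optimal control  Dynamic coupling effects  Joint torque estimation  Harmonic drive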
Decentralized control for large-scale nonlinear systems with unknown mismatched interconnections via policy iteration  Journal article
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2018, Volume: 48, Issue: 10, Pages: 1725-1735
Authors:  Zhao, Bo (赵博);  Wang, Ding
Adobe PDF(768Kb)  |  Submit date:2018/10/14
Adaptive Dynamic Programming  Decentralized Control  Large-scale Systems  Neural Networks  
Adaptive dynamic programming-based stabilization of nonlinear systems with unknown actuator saturation  Journal article
NONLINEAR DYNAMICS, 2018, Volume: 93, Issue: 4, Pages: 2089-2103
Authors:  Zhao, Bo;  Jia, Lihao;  Xia, Hongbing;  Li, Yuanchun
Adobe PDF(1069Kb)  |  Submit date:2018/10/10
Adaptive Dynamic Programming  Unknown Actuator Saturation  Continuous-time Nonlinear Systems  Stabilizing Control  Neural Networks  
Policy Iteration Algorithm Based Fault Tolerant Tracking Control: An Implementation on Reconfigurable Manipulators  Journal article
JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2018, Volume: 13, Issue: 4, Pages: 1739-1750
Authors:  Li, Yuanchun;  Xia, Hongbing;  Zhao, Bo
Adobe PDF(708Kb)  |  Submit date:2018/10/10
Adaptive Dynamic Programming  Policy Iteration  Fault Tolerant Tracking Control  Reconfigurable Manipulators  Neural Network  
Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure  Journal article
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, Volume: 29, Issue: 6, Pages: 2099-2111
Authors:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning
Adobe PDF(1045Kb)  |  Submit date:2018/10/10
Adaptive Control  Adaptive Dynamic Programming  Constraints  Critic-only  Data-based  Optimal Control  Q-learning  
Model-free Adaptive Dynamic Programming Based Near-optimal Decentralized Tracking Control of Reconfigurable Manipulators  Journal article
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2018, Volume: 16, Issue: 2, Pages: 478-490
Authors:  Zhao, Bo;  Li, Yuanchun
Adobe PDF(974Kb)  |  Submit date:2018/10/10
Adaptive Dynamic Programming  Decentralized Tracking Control  Model-free  Near-optimal  Neural Networks  Reconfigurable Manipulators  
Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors  Journal article
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, Volume: 29, Issue: 4, Pages: 1226-1238
Authors:  Wei, Qinglai;  Li, Benkai;  Song, Ruizhuo
Adobe PDF(2475Kb)  |  Submit date:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Generalized Policy Iteration (GPI)  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning
Data-Based Optimal Control for Weakly Coupled Nonlinear Systems Using Policy Iteration  Journal article
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS, 2018, Volume: 48, Issue: 4, Pages: 511-521
Authors:  Li, Chao;  Liu, Derong;  Wang, Ding
Adobe PDF(892Kb)  |  Submit date:2017/05/03
Adaptive Dynamic Programming (ADP)  Neural Networks (NNs)  Optimal Control  Policy Iteration (PI)  Unknown Dynamics  Weakly Coupled Systems