CASIA OpenIR

Browse/Search Results:  1-10 of 129 Help

Selected(0)Clear Items/Page:    Sort:
面向智能驾驶视觉控制的深度强化学习方法 学位论文
工学博士, 中国科学院自动化研究所: 中国科学院大学, 2019
Authors:  李栋
Adobe PDF(6681Kb)  |  Favorite  |  View/Download:71/3  |  Submit date:2019/06/27
深度强化学习  智能驾驶  视觉控制  目标检测  图注意力网络  
Reinforcement Learning and Deep Learning based Lateral Control for Autonomous Driving 期刊论文
IEEE Computational Intelligence Magazine, IEEE Computational Intelligence Magazine, 2019, 2019, 卷号: 14, 14, 期号: 2, 页码: 83-98, 83-98
Authors:  Dong Li;  Dongbin Zhao;  Qichao Zhang;  Yaran Chen
View  |  Adobe PDF(2205Kb)  |  Favorite  |  View/Download:51/16  |  Submit date:2019/04/25
Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  
基于自适应动态规划的可重构机器人系统分散控制方法研究 研究报告
2019
Authors:  董博
Adobe PDF(2806Kb)  |  Favorite  |  View/Download:69/1  |  Submit date:2019/03/12
可重构机器人  分散控制  自适应动态规划  滑模控制  最优控制  动力学耦合效应  关节力矩估计  谐波传动  
Decentralized control for large-scale nonlinear systems with unknown mismatched interconnections via policy iteration 期刊论文
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2018, 卷号: 48, 期号: 10, 页码: 1725-1735
Authors:  Zhao B(赵博);  Ding Wang
View  |  Adobe PDF(768Kb)  |  Favorite  |  View/Download:121/21  |  Submit date:2018/10/14
Adaptive Dynamic Programming  Decentralized Control  Large-scale Systems  Neural Networks  
Adaptive dynamic programming-based stabilization of nonlinear systems with unknown actuator saturation 期刊论文
NONLINEAR DYNAMICS, 2018, 卷号: 93, 期号: 4, 页码: 2089-2103
Authors:  Zhao, Bo;  Jia, Lihao;  Xia, Hongbing;  Li, Yuanchun
View  |  Adobe PDF(1069Kb)  |  Favorite  |  View/Download:85/36  |  Submit date:2018/10/10
Adaptive Dynamic Programming  Unknown Actuator Saturation  Continuous-time Nonlinear Systems  Stabilizing Control  Neural Networks  
Policy Iteration Algorithm Based Fault Tolerant Tracking Control: An Implementation on Reconfigurable Manipulators 期刊论文
JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2018, 卷号: 13, 期号: 4, 页码: 1739-1750
Authors:  Li, Yuanchun;  Xia, Hongbing;  Zhao, Bo
View  |  Adobe PDF(708Kb)  |  Favorite  |  View/Download:83/7  |  Submit date:2018/10/10
Adaptive dynamic programming  Policy iteration  Fault tolerant tracking control  Reconfigurable manipulators  Neural network  
Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 6, 页码: 2099-2111
Authors:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning
View  |  Adobe PDF(1045Kb)  |  Favorite  |  View/Download:85/33  |  Submit date:2018/10/10
Adaptive Control  Adaptive Dynamic Programming  Constraints  Critic-only  Data-based  Optimal Control  Q-learning  
Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 1226-1238
Authors:  Wei, Qinglai;  Li, Benkai;  Song, Ruizhuo
View  |  Adobe PDF(2475Kb)  |  Favorite  |  View/Download:86/18  |  Submit date:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration (Gpi)  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
Comprehensive comparison of online ADP algorithms for continuous-time optimal control 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  Favorite  |  View/Download:106/39  |  Submit date:2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Data-Based Optimal Control for Weakly Coupled Nonlinear Systems Using Policy Iteration 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 卷号: 48, 期号: 4, 页码: 511-521
Authors:  Li, Chao;  Liu, Derong;  Wang, Ding
View  |  Adobe PDF(892Kb)  |  Favorite  |  View/Download:119/38  |  Submit date:2017/05/03
Adaptive Dynamic Programming (Adp)  Neural Networks (Nns)  Optimal Control  Policy Iteration (Pi)  Unknown Dynamics  Weakly Coupled Systems