CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共15条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Boosting On-Policy Actor–Critic With Shallow Updates in Critic 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2024, 页码: 1-10
作者:  Luntong Li;  Yuanheng Zhu
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:14/6  |  提交时间:2024/06/05
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12
作者:  Boyu Li;  Haran Li;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:10/5  |  提交时间:2024/06/05
Vision-based control in the open racing car simulator with deep and reinforcement learning 期刊论文
Journal of Ambient Intelligence and Humanized Computing, 2019, 页码: doi={10.1007/s12652-019-01503-y}
作者:  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(2210Kb)  |  收藏  |  浏览/下载:54/14  |  提交时间:2023/04/26
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 15
作者:  Yang, Xiong;  Zhu, Yuanheng;  Dong, Na;  Wei, Qinglai
Adobe PDF(1578Kb)  |  收藏  |  浏览/下载:217/9  |  提交时间:2022/01/27
Adaptive critic designs (ACDs)  adaptive dynamic programming (ADP)  decentralized event-driven control  input constraint  reinforcement learning (RL)  
LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 卷号: 21, 期号: 11, 页码: 4516-4525
作者:  Zhu, Yuanheng;  He, Haibo;  Zhao, Dongbin
Adobe PDF(1648Kb)  |  收藏  |  浏览/下载:165/8  |  提交时间:2021/01/06
Cooperative adaptive cruise control  string stability  time-delay system  H-infinity control  linear matrix inequality  
Synthesis of Cooperative Adaptive Cruise Control With Feedforward Strategies 期刊论文
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 卷号: 69, 期号: 4, 页码: 3615-3627
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2462Kb)  |  收藏  |  浏览/下载:180/8  |  提交时间:2020/06/22
Cooperative cruise control  H-infinity-norm  L-2-gain  time-delay system  state-space model  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:309/45  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2017, 卷号: 11, 期号: 14, 页码: 2307-2316
作者:  Yang, Xiong;  He, Haibo;  Liu, Derong;  Zhu, Yuanheng
Adobe PDF(2123Kb)  |  收藏  |  浏览/下载:455/146  |  提交时间:2017/09/13
Dynamic Programming  Robust Control  Neurocontrollers  Continuous Time Systems  Control System Synthesis  Nonlinear Control Systems  Optimal Control  Function Approximation  Monte Carlo Methods  Closed Loop Systems  Asymptotic Stability  Adaptive Dynamic Programming  Robust Neural Control Design  Unknown Continuous-time Nonlinear Systems  Ct Nonlinear Systems  Adp-based Robust Neural Control Scheme  Robust Nonlinear Control Problem  Nonlinear Optimal Control Problem  Nominal System  Adp Algorithm  Actor-critic Dual Networks  Control Policy Approximation  Value Function Approximation  Actor Neural Network Weights  Critic Nn Weights  Monte Carlo Integration Method  Closed-loop System  Asymptotically Stability  
Comprehensive comparison of online ADP algorithms for continuous-time optimal control 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  收藏  |  浏览/下载:419/184  |  提交时间:2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Policy Iteration for Hinfinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE Transactions on Cybernetics, 2017, 期号: PP, 页码: 1-9
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
Adobe PDF(894Kb)  |  收藏  |  浏览/下载:354/165  |  提交时间:2017/09/13
Adaptive Dynamic Programming (Adp)  H∞ Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)