CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共19条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Boosting On-Policy Actor–Critic With Shallow Updates in Critic 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2024, 页码: 1-10
作者:  Luntong Li;  Yuanheng Zhu
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:9/4  |  提交时间:2024/06/05
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12
作者:  Boyu Li;  Haran Li;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:6/3  |  提交时间:2024/06/05
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:220/2  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 卷号: 18, 期号: 3, 页码: 1097-1108
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2666Kb)  |  收藏  |  浏览/下载:195/2  |  提交时间:2021/08/15
Microscopy  Feedback control  Mathematical model  Data models  Dynamic programming  Psychology  Computational modeling  Adaptive dynamic programming (ADP)  heterogeneous corridors  macroscopic pedestrian dynamics  optimal feedback control  pedestrian flow  
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 11, 页码: 3959-3971
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2079Kb)  |  收藏  |  浏览/下载:182/3  |  提交时间:2021/01/07
Optimal control  Discrete-time systems  Heuristic algorithms  Dynamic programming  Convergence  Artificial intelligence  Nonlinear systems  Adaptive dynamic programming  discrete-time systems  invariant admissibility  optimal control  policy iteration  sum of squares  
LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 卷号: 21, 期号: 11, 页码: 4516-4525
作者:  Zhu, Yuanheng;  He, Haibo;  Zhao, Dongbin
Adobe PDF(1648Kb)  |  收藏  |  浏览/下载:158/4  |  提交时间:2021/01/06
Cooperative adaptive cruise control  string stability  time-delay system  H-infinity control  linear matrix inequality  
Control-Limited Adaptive Dynamic Programming for Multi-Battery Energy Storage Systems 期刊论文
IEEE TRANSACTIONS ON SMART GRID, 2019, 卷号: 10, 期号: 4, 页码: 4235-4244
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun;  Wang, Ding
Adobe PDF(973Kb)  |  收藏  |  浏览/下载:282/4  |  提交时间:2019/09/30
Microgrid  energy storage system  multi-battery management system  adaptive dynamic programming  control-limited optimization  
Adaptive Optimal Control of Heterogeneous CACC System With Uncertain Dynamics 期刊论文
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2019, 卷号: 27, 期号: 4, 页码: 1772-1779
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Zhong, Zhiguang
Adobe PDF(1189Kb)  |  收藏  |  浏览/下载:260/2  |  提交时间:2019/09/30
Adaptive optimal control  cooperative adaptive cruise control (CACC)  heterogeneous platoon  string stability  sum-of-squares polynomial  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:305/42  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2017, 卷号: 11, 期号: 14, 页码: 2307-2316
作者:  Yang, Xiong;  He, Haibo;  Liu, Derong;  Zhu, Yuanheng
浏览  |  Adobe PDF(2123Kb)  |  收藏  |  浏览/下载:451/145  |  提交时间:2017/09/13
Dynamic Programming  Robust Control  Neurocontrollers  Continuous Time Systems  Control System Synthesis  Nonlinear Control Systems  Optimal Control  Function Approximation  Monte Carlo Methods  Closed Loop Systems  Asymptotic Stability  Adaptive Dynamic Programming  Robust Neural Control Design  Unknown Continuous-time Nonlinear Systems  Ct Nonlinear Systems  Adp-based Robust Neural Control Scheme  Robust Nonlinear Control Problem  Nonlinear Optimal Control Problem  Nominal System  Adp Algorithm  Actor-critic Dual Networks  Control Policy Approximation  Value Function Approximation  Actor Neural Network Weights  Critic Nn Weights  Monte Carlo Integration Method  Closed-loop System  Asymptotically Stability