CASIA OpenIR

浏览/检索结果: 共104条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Boosting On-Policy Actor-Critic With Shallow Updates in Critic 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 10
作者:  Li, Luntong;  Zhu, Yuanheng
收藏  |  浏览/下载:14/0  |  提交时间:2024/07/03
Artificial neural networks  Vectors  Task analysis  Training  Representation learning  Approximation algorithms  Optimization  Actor-critic  deep reinforcement learning (DRL)  proximal policy optimization (PPO)  shallow reinforcement learning (SRL)  
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms 期刊论文
NEURAL NETWORKS, 2024, 卷号: 169, 页码: 764-777
作者:  Chen, Yurou;  Zhang, Fengyi;  Liu, Zhiyong
收藏  |  浏览/下载:63/0  |  提交时间:2024/02/22
Reinforcement Learning  Policy gradient  Actor-critic  Value function  Bias-variance trade-off  
Path Planning and Tracking Control for Parking via Soft Actor-Critic Under Non-Ideal Scenarios 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 181-195
作者:  Xiaolin Tang;  Yuyou Yang;  Teng Liu;  Xianke Lin;  Kai Yang;  Shen Li
Adobe PDF(4905Kb)  |  收藏  |  浏览/下载:246/138  |  提交时间:2024/01/02
Automatic parking  control strategy  parking deviation (APS)  soft actor-critic (SAC)  
Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot-RoboDact 期刊论文
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 卷号: 72, 页码: 13
作者:  Zhang, Tiandong;  Wang, Rui;  Wang, Shuo;  Wang, Yu;  Zheng, Gang;  Tan, Min
收藏  |  浏览/下载:124/0  |  提交时间:2023/11/17
Active disturbance rejection control (ADRC)  bionic exploration robot  motion control  residual reinforcement learning (RRL)  soft actor-critic (SAC)  
Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 9, 页码: 1797-1809
作者:  Ding Wang;  Jiangyu Wang;  Mingming Zhao;  Peng Xin;  Junfei Qiao
Adobe PDF(5140Kb)  |  收藏  |  浏览/下载:180/65  |  提交时间:2023/08/10
Adaptive critic  artificial neural networks  Hamilton-Jacobi-Bellman (HJB) equation  multi-step heuristic dynamic programming  multi-step reinforcement learning  optimal control  
Position and Attitude Tracking Control of a Biomimetic Underwater Vehicle via Deep Reinforcement Learning 期刊论文
IEEE/ASME Transactions on Mechatronics, 2023, 页码: 1-10
作者:  Ma, Ruichen;  Wang, Yu;  Tang, Chong;  Wang, Shuo;  Wang, Rui
收藏  |  浏览/下载:119/0  |  提交时间:2023/08/03
Biomimetic underwater vehicle (BUV)  Deep reinforcement learning (DRL)  Soft actor-critic (SAC)  Undulatory fin  
Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 3, 页码: 781-791
作者:  Guangyu Zhu;  Xiaolu Li;  Ranran Sun;  Yiyuan Yang;  Peng Zhang
Adobe PDF(2432Kb)  |  收藏  |  浏览/下载:273/78  |  提交时间:2023/03/02
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  optimal control  policy iteration  time-varying  
Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 7, 页码: 1262-1272
作者:  Mingming Ha;  Ding Wang;  Derong Liu
Adobe PDF(1832Kb)  |  收藏  |  浏览/下载:262/88  |  提交时间:2022/06/27
Adaptive critic design  adaptive dynamic programming (ADP)  approximate dynamic programming  discrete-time nonlinear systems  reinforcement learning  stability analysis  tracking control  value iteration (VI)  
Self-Learning Robust Control Synthesis and Trajectory Tracking of Uncertain Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 卷号: 52, 期号: 1, 页码: 278-286
作者:  Wang, Ding;  Cheng, Long;  Yan, Jun
收藏  |  浏览/下载:265/0  |  提交时间:2022/03/17
Robust control  Optimal control  Cost function  Trajectory tracking  Nonlinear systems  Feedback control  Dynamical systems  Adaptive critic learning  control synthesis  neural networks  optimization  robust stabilization  tracking design  
Neuro-Optimal Trajectory Tracking With Value Iteration of Discrete-Time Nonlinear Dynamics 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Wang, Ding;  Ha, Mingming;  Cheng, Long
收藏  |  浏览/下载:300/0  |  提交时间:2022/01/27
Trajectory  Heuristic algorithms  Convergence  Trajectory tracking  Stability criteria  Optimal control  Dynamic programming  Adaptive critic design  discrete-time nonlinear plants  neuro-optimal trajectory tracking  uniformly ultimately bounded stability  value iteration