CASIA OpenIR

Browse/Search results: 20 items in total, showing items 1-10

Boosting On-Policy Actor-Critic With Shallow Updates in Critic (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, Pages: 10
Authors:  Li, Luntong;  Zhu, Yuanheng
Views/Downloads: 14/0  |  Submitted: 2024/07/03
Artificial neural networks  Vectors  Task analysis  Training  Representation learning  Approximation algorithms  Optimization  Actor-critic  deep reinforcement learning (DRL)  proximal policy optimization (PPO)  shallow reinforcement learning (SRL)  
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms (journal article)
NEURAL NETWORKS, 2024, Volume: 169, Pages: 764-777
Authors:  Chen, Yurou;  Zhang, Fengyi;  Liu, Zhiyong
Views/Downloads: 63/0  |  Submitted: 2024/02/22
Reinforcement Learning  Policy gradient  Actor-critic  Value function  Bias-variance trade-off  
Path Planning and Tracking Control for Parking via Soft Actor-Critic Under Non-Ideal Scenarios (journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 1, Pages: 181-195
Authors:  Xiaolin Tang;  Yuyou Yang;  Teng Liu;  Xianke Lin;  Kai Yang;  Shen Li
Adobe PDF(4905Kb)  |  Views/Downloads: 246/138  |  Submitted: 2024/01/02
Automatic parking  control strategy  parking deviation (APS)  soft actor-critic (SAC)  
Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot-RoboDact (journal article)
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, Volume: 72, Pages: 13
Authors:  Zhang, Tiandong;  Wang, Rui;  Wang, Shuo;  Wang, Yu;  Zheng, Gang;  Tan, Min
Views/Downloads: 124/0  |  Submitted: 2023/11/17
Active disturbance rejection control (ADRC)  bionic exploration robot  motion control  residual reinforcement learning (RRL)  soft actor-critic (SAC)  
Mixture of personality improved spiking actor network for efficient multi-agent cooperation (journal article)
FRONTIERS IN NEUROSCIENCE, 2023, Volume: 17, Pages: 14
Authors:  Li, Xiyun;  Ni, Ziyi;  Ruan, Jingqing;  Meng, Linghui;  Shi, Jing;  Zhang, Tielin;  Xu, Bo
Views/Downloads: 104/0  |  Submitted: 2023/11/17
multi-agent cooperation  personality theory  spiking actor networks  multi-agent reinforcement learning  theory of mind  
Position and Attitude Tracking Control of a Biomimetic Underwater Vehicle via Deep Reinforcement Learning (journal article)
IEEE/ASME Transactions on Mechatronics, 2023, Pages: 1-10
Authors:  Ma, Ruichen;  Wang, Yu;  Tang, Chong;  Wang, Shuo;  Wang, Rui
Views/Downloads: 119/0  |  Submitted: 2023/08/03
Biomimetic underwater vehicle (BUV)  Deep reinforcement learning (DRL)  Soft actor-critic (SAC)  Undulatory fin  
Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space (journal article)
ARTIFICIAL INTELLIGENCE REVIEW, 2021, Pages: 36
Authors:  Yang, Yongliang;  Zhu, Hufei;  Zhang, Qichao;  Zhao, Bo;  Li, Zhenning;  Wunsch, Donald C.
Views/Downloads: 238/0  |  Submitted: 2021/11/02
Reproducing kernel Hilbert space  Actor-critic learning  Value function approximation  Online sparsification  Non-parametric learning  
Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management (journal article)
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, Volume: 17, Issue: 10, Pages: 6614-6623
Authors:  Wei, Qinglai;  Liao, Zehua;  Shi, Guang
Adobe PDF(1229Kb)  |  Views/Downloads: 294/39  |  Submitted: 2021/11/02
Optimal control  Process control  Smart homes  Dynamic programming  Numerical models  Iterative methods  Informatics  Actor-critic learning  adaptive critic designs  adaptive dynamic programming (ADP)  approximate dynamic programming  energy management  optimal control  smart grid  
A Novel Heterogeneous Actor-critic Algorithm with Recent Emphasizing Replay Memory (journal article)
International Journal of Automation and Computing, 2021, Volume: 18, Issue: 4, Pages: 619-631
Authors:  Bao Xi;  Rui Wang;  Ying-Hao Cai;  Tao Lu;  Shuo Wang
Adobe PDF(2505Kb)  |  Views/Downloads: 212/61  |  Submitted: 2021/07/20
Reinforcement learning (RL)  actor-critic  experience replay  training efficiency  manipulation skill learning  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 12, Pages: 5245-5256
Authors:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  Views/Downloads: 390/88  |  Submitted: 2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor–critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)