CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms 期刊论文
NEURAL NETWORKS, 2024, 卷号: 169, 页码: 764-777
作者:  Chen, Yurou;  Zhang, Fengyi;  Liu, Zhiyong
收藏  |  浏览/下载:15/0  |  提交时间:2024/02/22
Reinforcement Learning  Policy gradient  Actor-critic  Value function  Bias-variance trade-off  
Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot-RoboDact 期刊论文
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 卷号: 72, 页码: 13
作者:  Zhang, Tiandong;  Wang, Rui;  Wang, Shuo;  Wang, Yu;  Zheng, Gang;  Tan, Min
收藏  |  浏览/下载:52/0  |  提交时间:2023/11/17
Active disturbance rejection control (ADRC)  bionic exploration robot  motion control  residual reinforcement learning (RRL)  soft actor-critic (SAC)  
Position and Attitude Tracking Control of a Biomimetic Underwater Vehicle via Deep Reinforcement Learning 期刊论文
IEEE/ASME Transactions on Mechatronics, 2023, 页码: 1-10
作者:  Ma, Ruichen;  Wang, Yu;  Tang, Chong;  Wang, Shuo;  Wang, Rui
收藏  |  浏览/下载:65/0  |  提交时间:2023/08/03
Biomimetic underwater vehicle (BUV)  Deep reinforcement learning (DRL)  Soft actor-critic (SAC)  Undulatory fin  
Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 页码: 36
作者:  Yang, Yongliang;  Zhu, Hufei;  Zhang, Qichao;  Zhao, Bo;  Li, Zhenning;  Wunsch, Donald C.
收藏  |  浏览/下载:180/0  |  提交时间:2021/11/02
Reproducing kernel Hilbert space  Actor-critic learning  Value function approximation  Online sparsification  Non-parametric learning  
Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 卷号: 17, 期号: 10, 页码: 6614-6623
作者:  Wei, Qinglai;  Liao, Zehua;  Shi, Guang
Adobe PDF(1229Kb)  |  收藏  |  浏览/下载:231/32  |  提交时间:2021/11/02
Optimal control  Process control  Smart homes  Dynamic programming  Numerical models  Iterative methods  Informatics  Actor-critic learning  adaptive critic designs  adaptive dynamic programming (ADP)  approximate dynamic programming  energy management  optimal control  smart grid  
A Novel Heterogeneous Actor-critic Algorithm with Recent Emphasizing Replay Memory 期刊论文
International Journal of Automation and Computing, 2021, 卷号: 18, 期号: 4, 页码: 619-631
作者:  Bao Xi
Adobe PDF(2505Kb)  |  收藏  |  浏览/下载:148/47  |  提交时间:2021/07/20
Reinforcement learning (RL)  actor-critic  experience replay  training efficiency  manipulation skill learning  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:294/75  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Siamese Regression Tracking With Reinforced Template Updating 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 卷号: 30, 页码: 628-640
作者:  Zhao, Fei;  Zhang, Ting;  Song, Yibing;  Tang, Ming;  Wang, Xiaobo;  Wang, Jinqiao
收藏  |  浏览/下载:187/0  |  提交时间:2021/03/02
Target tracking  Training  Reinforcement learning  Visualization  Task analysis  Benchmark testing  Head  Siamese regression tracking  actor-critic network  reinforcement learning  
Optimized Multi-Agent Formation Control Based on an Identifier-Actor--Critic Reinforcement Learning Algorithm 期刊论文
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 卷号: 26, 期号: 5, 页码: 2719-2731
作者:  Wen, Guoxing;  Chen, C. L. Philip;  Feng, Jun;  Zhou, Ning
收藏  |  浏览/下载:221/0  |  提交时间:2019/12/16
Fuzzy logic systems (FLSs)  identifier-actor-critic architecture  multi-agent formation  optimized formation control  reinforcement learning (RL)  
Adaptive Tracking Control of Surface Vessel Using Optimized Backstepping Technique 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 9, 页码: 3420-3431
作者:  Wen, Guoxing;  Ge, Shuzhi Sam;  Chen, C. L. Philip;  Tu, Fangwen;  Wang, Shengnan
收藏  |  浏览/下载:163/0  |  提交时间:2019/12/16
Actor-critic architecture  Lyapunov stability  optimized backstepping (OB)  reinforcement learning (RL)  surface vessel