CASIA OpenIR

浏览/检索结果: 共171条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 13
作者:  Wei, Qinglai;  Zhou, Tianmin;  Lu, Jingwei;  Liu, Yu;  Su, Shuai;  Xiao, Jun
收藏  |  浏览/下载:118/0  |  提交时间:2023/11/17
Adaptive dynamic programming (ADP)  Hamilton-Jacobi-Bellman equation (HJBE)  nonlinear stochastic system  stochastic policy iteration (PI)  
Sample-Observed Soft Actor-Critic Learning for Path Following of a Biomimetic Underwater Vehicle 期刊论文
IEEE Transactions on Automation Science and Engineering, 2023, 页码: 1-10
作者:  Ma, Ruichen;  Wang, Yu;  Wang, Shuo;  Cheng, Long;  Wang, Rui;  Tan, Ming
Adobe PDF(2902Kb)  |  收藏  |  浏览/下载:174/57  |  提交时间:2023/08/03
A Routing Optimization Method for Software-Defined Optical Transport Network Based on Ensembles and Reinforcement Learning 期刊论文
Sensors, 2022, 页码: 8139
作者:  Junyan Chen;  Yang Zheng
Adobe PDF(3250Kb)  |  收藏  |  浏览/下载:100/32  |  提交时间:2023/05/04
Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game 期刊论文
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 页码: 10
作者:  Li, Hongyang;  Wei, Qinglai
Adobe PDF(716Kb)  |  收藏  |  浏览/下载:193/46  |  提交时间:2022/06/14
A New Neuro-Optimal Nonlinear Tracking Control Method via Integral Reinforcement Learning with Applications to Nuclear Systems 期刊论文
NEUROCOMPUTING, 2022, 卷号: 483, 页码: 361-369
作者:  Zhong, Weifeng;  Wang, Mengxuan;  Wei, Qinglai;  Lu, Jingwei
收藏  |  浏览/下载:206/0  |  提交时间:2022/06/10
Integral reinforcement learning  Nuclear power reactor  Nonlinear system  Optimal tracking control  Neural networks  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
收藏  |  浏览/下载:206/0  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Tracking of Uncertain Robotic Manipulators Using Event-Triggered Model Predictive Control With Learning Terminal Cost 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 页码: 15
作者:  Kang, Erlong;  Qiao, Hong;  Chen, Ziyu;  Gao, Jie
Adobe PDF(4203Kb)  |  收藏  |  浏览/下载:420/155  |  提交时间:2022/06/06
Model predictive control  robotic manipulator  leaning terminal cost  neural networks  event-triggered mechanism  unknown dynamics  
Meta-Residual Policy Learning: Zero-Trial Robot Skill Adaptation via Knowledge Fusion 期刊论文
IEEE Robotics and Automation Letters, 2022, 卷号: 7, 期号: 7, 页码: 3656-3663
作者:  Peng Hao;  Tao Lu;  Shaowei Cui;  Junhang Wei;  YInghao Cai;  Shuo Wang
Adobe PDF(1750Kb)  |  收藏  |  浏览/下载:211/40  |  提交时间:2022/04/08
meta-learning  residual learning  
An Approximate Neuro-Optimal Solution of Discounted Guaranteed Cost Control Design 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 卷号: 52, 期号: 1, 页码: 77-86
作者:  Wang, Ding;  Qiao, Junfei;  Cheng, Long
收藏  |  浏览/下载:241/0  |  提交时间:2022/03/17
Control design  Cost function  Optimal control  Nonlinear systems  Adaptive systems  Switches  Adaptive learning system  discount factor  guaranteed cost function  neuro-optimal control  uncertainty  
SADRL: Merging human experience with machine intelligence via supervised assisted deep reinforcement learning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 467, 页码: 300-309
作者:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Jin, Junchen;  Huang, Yanhao;  Zhang, Jun Jason;  Wang, Fei-Yue
Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:292/65  |  提交时间:2021/12/28
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Double DQN