CASIA OpenIR

浏览/检索结果: 共173条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:60/1  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 13
作者:  Wei, Qinglai;  Zhou, Tianmin;  Lu, Jingwei;  Liu, Yu;  Su, Shuai;  Xiao, Jun
收藏  |  浏览/下载:126/0  |  提交时间:2023/11/17
Adaptive dynamic programming (ADP)  Hamilton-Jacobi-Bellman equation (HJBE)  nonlinear stochastic system  stochastic policy iteration (PI)  
Sample-Observed Soft Actor-Critic Learning for Path Following of a Biomimetic Underwater Vehicle 期刊论文
IEEE Transactions on Automation Science and Engineering, 2023, 页码: 1-10
作者:  Ma, Ruichen;  Wang, Yu;  Wang, Shuo;  Cheng, Long;  Wang, Rui;  Tan, Ming
Adobe PDF(2902Kb)  |  收藏  |  浏览/下载:183/59  |  提交时间:2023/08/03
A Routing Optimization Method for Software-Defined Optical Transport Network Based on Ensembles and Reinforcement Learning 期刊论文
Sensors, 2022, 页码: 8139
作者:  Junyan Chen;  Yang Zheng
Adobe PDF(3250Kb)  |  收藏  |  浏览/下载:110/35  |  提交时间:2023/05/04
Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game 期刊论文
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 页码: 10
作者:  Li, Hongyang;  Wei, Qinglai
Adobe PDF(716Kb)  |  收藏  |  浏览/下载:197/48  |  提交时间:2022/06/14
A New Neuro-Optimal Nonlinear Tracking Control Method via Integral Reinforcement Learning with Applications to Nuclear Systems 期刊论文
NEUROCOMPUTING, 2022, 卷号: 483, 页码: 361-369
作者:  Zhong, Weifeng;  Wang, Mengxuan;  Wei, Qinglai;  Lu, Jingwei
收藏  |  浏览/下载:213/0  |  提交时间:2022/06/10
Integral reinforcement learning  Nuclear power reactor  Nonlinear system  Optimal tracking control  Neural networks  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
收藏  |  浏览/下载:210/0  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Tracking of Uncertain Robotic Manipulators Using Event-Triggered Model Predictive Control With Learning Terminal Cost 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 页码: 15
作者:  Kang, Erlong;  Qiao, Hong;  Chen, Ziyu;  Gao, Jie
Adobe PDF(4203Kb)  |  收藏  |  浏览/下载:425/155  |  提交时间:2022/06/06
Model predictive control  robotic manipulator  leaning terminal cost  neural networks  event-triggered mechanism  unknown dynamics  
Meta-Residual Policy Learning: Zero-Trial Robot Skill Adaptation via Knowledge Fusion 期刊论文
IEEE Robotics and Automation Letters, 2022, 卷号: 7, 期号: 7, 页码: 3656-3663
作者:  Peng Hao;  Tao Lu;  Shaowei Cui;  Junhang Wei;  YInghao Cai;  Shuo Wang
Adobe PDF(1750Kb)  |  收藏  |  浏览/下载:217/42  |  提交时间:2022/04/08
meta-learning  residual learning  
An Approximate Neuro-Optimal Solution of Discounted Guaranteed Cost Control Design 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 卷号: 52, 期号: 1, 页码: 77-86
作者:  Wang, Ding;  Qiao, Junfei;  Cheng, Long
收藏  |  浏览/下载:246/0  |  提交时间:2022/03/17
Control design  Cost function  Optimal control  Nonlinear systems  Adaptive systems  Switches  Adaptive learning system  discount factor  guaranteed cost function  neuro-optimal control  uncertainty