CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:126/24  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
A Survey on Reinforcement Learning Methods in Bionic Underwater Robots 期刊论文
BIOMIMETICS, 2023, 卷号: 8, 期号: 2, 页码: 29
作者:  Tong, Ru;  Feng, Yukai;  Wang, Jian;  Wu, Zhengxing;  Tan, Min;  Yu, Junzhi
Adobe PDF(1260Kb)  |  收藏  |  浏览/下载:154/20  |  提交时间:2023/11/17
bionic underwater robot  reinforcement learning  robotic fish  intelligent control  
Neural event-triggered optimal filtering co-design of Markovian jump systems with hidden mode detections 期刊论文
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2023, 页码: 11
作者:  Ma, Chao;  Lu, Yanfeng;  Wu, Wei
Adobe PDF(1257Kb)  |  收藏  |  浏览/下载:153/16  |  提交时间:2023/03/20
Markovian jump system  neural event-triggered scheme  optimal filtering  unknown nonlinearity  hidden mode detections  
Asynchronous fault detection for delayed semi-markov jump systems with mismatched mode-dependent nonlinearities 期刊论文
INFORMATION SCIENCES, 2022, 卷号: 587, 页码: 679-691
作者:  Ma, Chao;  Fu, Hang;  Wu, Wei
收藏  |  浏览/下载:192/0  |  提交时间:2022/07/25
Asynchronous fault detection  Semi-Markov jump systems  Mismatched mode-dependent  nonlinearities  Time-varying delay  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:269/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 11
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(2904Kb)  |  收藏  |  浏览/下载:242/18  |  提交时间:2022/01/27
Maximum likelihood estimation (MLE)  Nonlinear systems  Optimal control  Poisson process  Spike train  Spiking Adaptive dynamic programming(SADP)  
Neuro-optimal control for discrete stochastic processes via a novel policy iteration algorithm 期刊论文
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, 卷号: 50, 期号: 11, 页码: 3972-3985
作者:  Liang, Mingming;  Wang, Ding;  Liu, Derong
浏览  |  Adobe PDF(1604Kb)  |  收藏  |  浏览/下载:226/74  |  提交时间:2020/10/23
Adaptive critic designs  adaptive dynamic programming (ADP)  local policy iteration  neuro-dynamic programming  optimal control  stochastic processes  
Simulation and field testing of multiple vehicles collision avoidance algorithms 期刊论文
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 卷号: 7, 期号: 4, 页码: 1045-1063
作者:  Zu, Chaoyue;  Yang, Chao;  Wang, Jian;  Gao, Wenbin;  Cao, Dongpu;  Wang, Fei-Yue
收藏  |  浏览/下载:230/0  |  提交时间:2020/08/03
Collision avoidance  intelligent vehicles  inter-vehicle communication  simulation  testing  trajectory planning  
Parallel reinforcement learning-based energy efficiency improvement for a cyber-physical system 期刊论文
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 卷号: 7, 期号: 2, 页码: 617-626
作者:  Liu, Teng;  Tian, Bin;  Ai, Yunfeng;  Wang, Fei-Yue
Adobe PDF(5784Kb)  |  收藏  |  浏览/下载:268/8  |  提交时间:2020/06/02
Bidirectional long short-term memory (LSTM) network  cyber-physical system (CPS)  energy management  parallel system  reinforcement learning (RL)