CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:48/0  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:219/1  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 11
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(2904Kb)  |  收藏  |  浏览/下载:198/3  |  提交时间:2022/01/27
Maximum likelihood estimation (MLE)  Nonlinear systems  Optimal control  Poisson process  Spike train  Spiking Adaptive dynamic programming(SADP)  
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 15
作者:  Yang, Xiong;  Zhu, Yuanheng;  Dong, Na;  Wei, Qinglai
Adobe PDF(1578Kb)  |  收藏  |  浏览/下载:205/0  |  提交时间:2022/01/27
Adaptive critic designs (ACDs)  adaptive dynamic programming (ADP)  decentralized event-driven control  input constraint  reinforcement learning (RL)  
Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 6, 页码: 2099-2111
作者:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning
浏览  |  Adobe PDF(1045Kb)  |  收藏  |  浏览/下载:383/117  |  提交时间:2018/10/10
Adaptive Control  Adaptive Dynamic Programming  Constraints  Critic-only  Data-based  Optimal Control  Q-learning  
Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 1226-1238
作者:  Wei, Qinglai;  Li, Benkai;  Song, Ruizhuo
浏览  |  Adobe PDF(2475Kb)  |  收藏  |  浏览/下载:394/127  |  提交时间:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration (Gpi)  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
Model-Free Optimal Tracking Control via Critic-Only Q-Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 10, 页码: 2134-2144
作者:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Wang, Ding;  Luo,Biao
浏览  |  Adobe PDF(1521Kb)  |  收藏  |  浏览/下载:589/288  |  提交时间:2016/10/24
Critic-only Q-learning (Coql)  Model-free  Nonaffine Nonlinear Systems  Optimal Tracking Control  
Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 2, 页码: 444-458
作者:  Wei, Qinglai;  Song, Ruizhuo;  Yan, Pengfei
浏览  |  Adobe PDF(2204Kb)  |  收藏  |  浏览/下载:420/138  |  提交时间:2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Recurrent Neural Network (Rnn)  Reinforcement Learning  
Infinite Horizon Self-Learning Optimal Control of Nonaffine Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 866-879
作者:  Wei, Qinglai;  Liu, Derong;  Yang, Xiong
浏览  |  Adobe PDF(2408Kb)  |  收藏  |  浏览/下载:294/116  |  提交时间:2015/09/21
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration  Neural Networks (Nns)  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
Multiple Actor-Critic Structures for Continuous-Time Optimal Control Using Input-Output Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 851-865
作者:  Song, Ruizhuo;  Lewis, Frank;  Wei, Qinglai;  Zhang, Hua-Guang;  Jiang, Zhong-Ping;  Levine, Dan;  Qinglai Wei
浏览  |  Adobe PDF(3455Kb)  |  收藏  |  浏览/下载:411/176  |  提交时间:2015/09/21
Actor-critic  Approximate Dynamic Programming (Adp)  Category  Optimal Control  Shunting Inhibitory Artificial Neural Network (Siann)