CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:62/3  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:250/12  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:263/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:381/86  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 6, 页码: 2099-2111
作者:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning
浏览  |  Adobe PDF(1045Kb)  |  收藏  |  浏览/下载:400/122  |  提交时间:2018/10/10
Adaptive Control  Adaptive Dynamic Programming  Constraints  Critic-only  Data-based  Optimal Control  Q-learning  
Model-Free Optimal Tracking Control via Critic-Only Q-Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 10, 页码: 2134-2144
作者:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Wang, Ding;  Luo,Biao
浏览  |  Adobe PDF(1521Kb)  |  收藏  |  浏览/下载:614/295  |  提交时间:2016/10/24
Critic-only Q-learning (Coql)  Model-free  Nonaffine Nonlinear Systems  Optimal Tracking Control  
Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 2, 页码: 444-458
作者:  Wei, Qinglai;  Song, Ruizhuo;  Yan, Pengfei
浏览  |  Adobe PDF(2204Kb)  |  收藏  |  浏览/下载:442/142  |  提交时间:2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Recurrent Neural Network (Rnn)  Reinforcement Learning  
Adaptive Optimal Control of Highly Dissipative Nonlinear Spatially Distributed Processes With Neuro-Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 684-696
作者:  Luo, Biao;  Wu, Huai-Ning;  Li, Han-Xiong
浏览  |  Adobe PDF(2465Kb)  |  收藏  |  浏览/下载:375/122  |  提交时间:2016/03/30
Adaptive Optimal Control  Empirical Eigenfunction (Eef)  Highly Dissipative Partial Differential Equations (Pdes)  Neuro-dynamic Programming (Ndp)  Spatially Distributed Processes (Sdps)  
Multiple Actor-Critic Structures for Continuous-Time Optimal Control Using Input-Output Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 851-865
作者:  Song, Ruizhuo;  Lewis, Frank;  Wei, Qinglai;  Zhang, Hua-Guang;  Jiang, Zhong-Ping;  Levine, Dan;  Qinglai Wei
浏览  |  Adobe PDF(3455Kb)  |  收藏  |  浏览/下载:427/180  |  提交时间:2015/09/21
Actor-critic  Approximate Dynamic Programming (Adp)  Category  Optimal Control  Shunting Inhibitory Artificial Neural Network (Siann)