CASIA OpenIR

浏览/检索结果: 共7条,第1-7条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:63/3  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:291/38  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:266/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 1226-1238
作者:  Wei, Qinglai;  Li, Benkai;  Song, Ruizhuo
浏览  |  Adobe PDF(2475Kb)  |  收藏  |  浏览/下载:422/136  |  提交时间:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration (Gpi)  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 2, 页码: 444-458
作者:  Wei, Qinglai;  Song, Ruizhuo;  Yan, Pengfei
浏览  |  Adobe PDF(2204Kb)  |  收藏  |  浏览/下载:447/144  |  提交时间:2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Recurrent Neural Network (Rnn)  Reinforcement Learning  
Infinite Horizon Self-Learning Optimal Control of Nonaffine Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 866-879
作者:  Wei, Qinglai;  Liu, Derong;  Yang, Xiong
浏览  |  Adobe PDF(2408Kb)  |  收藏  |  浏览/下载:311/125  |  提交时间:2015/09/21
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration  Neural Networks (Nns)  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
Multiple Actor-Critic Structures for Continuous-Time Optimal Control Using Input-Output Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 851-865
作者:  Song, Ruizhuo;  Lewis, Frank;  Wei, Qinglai;  Zhang, Hua-Guang;  Jiang, Zhong-Ping;  Levine, Dan;  Qinglai Wei
浏览  |  Adobe PDF(3455Kb)  |  收藏  |  浏览/下载:429/181  |  提交时间:2015/09/21
Actor-critic  Approximate Dynamic Programming (Adp)  Category  Optimal Control  Shunting Inhibitory Artificial Neural Network (Siann)