CASIA OpenIR
(This search is based on the user's claimed works)

Browse/Search results: 4 items in total, items 1-4

NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 13
Authors: Chai, Jiajun; Zhu, Yuanheng; Zhao, Dongbin
Adobe PDF(2469Kb) | Views/Downloads: 61/3 | Submitted: 2023/11/16
Keywords: Large-scale multiagent; neighboring communication; reinforcement learning (RL); variational information flow
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 13
Authors: Hu, Guangzheng; Zhu, Yuanheng; Zhao, Dongbin; Zhao, Mengchen; Hao, Jianye
Adobe PDF(4187Kb) | Views/Downloads: 255/10 | Submitted: 2022/01/27
Keywords: Bandwidth; Protocols; Reinforcement learning; Task analysis; Optimization; Communication networks; Multi-agent systems; Event trigger; limited bandwidth; multi-agent communication; multi-agent reinforcement learning (MARL)
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, Volume: 50, Issue: 11, Pages: 3959-3971
Authors: Zhu, Yuanheng; Zhao, Dongbin; He, Haibo
Adobe PDF(2079Kb) | Views/Downloads: 204/14 | Submitted: 2021/01/07
Keywords: Optimal control; Discrete-time systems; Heuristic algorithms; Dynamic programming; Convergence; Artificial intelligence; Nonlinear systems; Adaptive dynamic programming; discrete-time systems; invariant admissibility; optimal control; policy iteration; sum of squares
Adaptive cruise control via adaptive dynamic programming with experience replay (Journal article)
SOFT COMPUTING, 2019, Volume: 23, Issue: 12, Pages: 4131-4144
Authors: Wang, Bin; Zhao, Dongbin; Cheng, Jin
Views/Downloads: 240/0 | Submitted: 2019/07/11
Keywords: Adaptive cruise control; Adaptive dynamic programming; Experience replay; Reinforcement learning; Neural networks