CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共20条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:51/0  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:220/2  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:259/29  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:229/4  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 卷号: 18, 期号: 3, 页码: 1097-1108
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2666Kb)  |  收藏  |  浏览/下载:196/2  |  提交时间:2021/08/15
Microscopy  Feedback control  Mathematical model  Data models  Dynamic programming  Psychology  Computational modeling  Adaptive dynamic programming (ADP)  heterogeneous corridors  macroscopic pedestrian dynamics  optimal feedback control  pedestrian flow  
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 11, 页码: 3959-3971
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2079Kb)  |  收藏  |  浏览/下载:182/3  |  提交时间:2021/01/07
Optimal control  Discrete-time systems  Heuristic algorithms  Dynamic programming  Convergence  Artificial intelligence  Nonlinear systems  Adaptive dynamic programming  discrete-time systems  invariant admissibility  optimal control  policy iteration  sum of squares  
LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 卷号: 21, 期号: 11, 页码: 4516-4525
作者:  Zhu, Yuanheng;  He, Haibo;  Zhao, Dongbin
Adobe PDF(1648Kb)  |  收藏  |  浏览/下载:158/4  |  提交时间:2021/01/06
Cooperative adaptive cruise control  string stability  time-delay system  H-infinity control  linear matrix inequality  
Control-Limited Adaptive Dynamic Programming for Multi-Battery Energy Storage Systems 期刊论文
IEEE TRANSACTIONS ON SMART GRID, 2019, 卷号: 10, 期号: 4, 页码: 4235-4244
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun;  Wang, Ding
Adobe PDF(973Kb)  |  收藏  |  浏览/下载:282/4  |  提交时间:2019/09/30
Microgrid  energy storage system  multi-battery management system  adaptive dynamic programming  control-limited optimization  
Adaptive Optimal Control of Heterogeneous CACC System With Uncertain Dynamics 期刊论文
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2019, 卷号: 27, 期号: 4, 页码: 1772-1779
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Zhong, Zhiguang
Adobe PDF(1189Kb)  |  收藏  |  浏览/下载:260/2  |  提交时间:2019/09/30
Adaptive optimal control  cooperative adaptive cruise control (CACC)  heterogeneous platoon  string stability  sum-of-squares polynomial  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:305/42  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)