CASIA OpenIR

浏览/检索结果: 共10条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
收藏  |  浏览/下载:41/0  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Data Generation Feedback Relearning Control for Unmodeled Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 页码: 12
作者:  Zhang, Yong;  Mu, Chaoxu;  Zhao, Dongbin
收藏  |  浏览/下载:66/0  |  提交时间:2023/11/16
Data models  Real-time systems  Heuristic algorithms  Mathematical models  Adaptation models  Approximation algorithms  Cost function  Data generation model  feedback relearning control  delayed neural network  reinforcement learning  unmodeled nonlinear system  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
收藏  |  浏览/下载:191/0  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 11, 页码: 3959-3971
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
收藏  |  浏览/下载:159/0  |  提交时间:2021/01/07
Optimal control  Discrete-time systems  Heuristic algorithms  Dynamic programming  Convergence  Artificial intelligence  Nonlinear systems  Adaptive dynamic programming  discrete-time systems  invariant admissibility  optimal control  policy iteration  sum of squares  
A spatial-temporal attention model for human trajectory prediction 期刊论文
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 卷号: 7, 期号: 4, 页码: 965-974
作者:  Zhao, Xiaodong;  Chen, Yaran;  Guo, Jin;  Zhao, Dongbin
收藏  |  浏览/下载:94/0  |  提交时间:2020/08/03
Attention mechanism  long-short term memory (LSTM)  spatial-temporal model  trajectory prediction  
Hierarchical optimal control for input-affine nonlinear systems through the formulation of Stackelberg game 期刊论文
INFORMATION SCIENCES, 2020, 卷号: 517, 页码: 1-17
作者:  Mu, Chaoxu;  Wang, Ke;  Zhang, Qichao;  Zhao, Dongbin
收藏  |  浏览/下载:205/0  |  提交时间:2020/04/07
Nonzero-sum differential game  Hierarchical optimization  Nonlinear dynamics  Stackelberg equilibrium  Neural network  
Adaptive Optimal Control of Heterogeneous CACC System With Uncertain Dynamics 期刊论文
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2019, 卷号: 27, 期号: 4, 页码: 1772-1779
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Zhong, Zhiguang
收藏  |  浏览/下载:238/0  |  提交时间:2019/09/30
Adaptive optimal control  cooperative adaptive cruise control (CACC)  heterogeneous platoon  string stability  sum-of-squares polynomial  
Control-Limited Adaptive Dynamic Programming for Multi-Battery Energy Storage Systems 期刊论文
IEEE TRANSACTIONS ON SMART GRID, 2019, 卷号: 10, 期号: 4, 页码: 4235-4244
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun;  Wang, Ding
收藏  |  浏览/下载:252/0  |  提交时间:2019/09/30
Microgrid  energy storage system  multi-battery management system  adaptive dynamic programming  control-limited optimization  
Adaptive cruise control via adaptive dynamic programming with experience replay 期刊论文
SOFT COMPUTING, 2019, 卷号: 23, 期号: 12, 页码: 4131-4144
作者:  Wang, Bin;  Zhao, Dongbin;  Cheng, Jin
收藏  |  浏览/下载:210/0  |  提交时间:2019/07/11
Adaptive cruise control  Adaptive dynamic programming  Experience replay  Reinforcement learning  Neural networks  
A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents 期刊论文
IEEE ACCESS, 2018, 卷号: 6, 页码: 70223-70235
作者:  Zhang, Zhen;  Wang, Dongqing;  Zhao, Dongbin;  Han, Qiaoni;  Song, Tingting
收藏  |  浏览/下载:236/0  |  提交时间:2019/07/12
Multi-agent reinforcement learning  gradient ascent  Q-learning  cooperative tasks