CASIA OpenIR

Browse/Search Results:  1-10 of 52 Help

  Show only claimed items
Selected(0)Clear Items/Page:    Sort:
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Favorite  |  View/Download:16/0  |  Submit date:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 12
Authors:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Favorite  |  View/Download:15/0  |  Submit date:2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
Authors:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Favorite  |  View/Download:10/0  |  Submit date:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
Authors:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Favorite  |  View/Download:12/0  |  Submit date:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 卷号: 18, 期号: 3, 页码: 1097-1108
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Favorite  |  View/Download:38/0  |  Submit date:2021/08/15
Microscopy  Feedback control  Mathematical model  Data models  Dynamic programming  Psychology  Computational modeling  Adaptive dynamic programming (ADP)  heterogeneous corridors  macroscopic pedestrian dynamics  optimal feedback control  pedestrian flow  
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 15
Authors:  Yang, Xiong;  Zhu, Yuanheng;  Dong, Na;  Wei, Qinglai
Favorite  |  View/Download:13/0  |  Submit date:2022/01/27
Adaptive critic designs (ACDs)  adaptive dynamic programming (ADP)  decentralized event-driven control  input constraint  reinforcement learning (RL)  
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 11, 页码: 3959-3971
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Favorite  |  View/Download:40/0  |  Submit date:2021/01/07
Optimal control  Discrete-time systems  Heuristic algorithms  Dynamic programming  Convergence  Artificial intelligence  Nonlinear systems  Adaptive dynamic programming  discrete-time systems  invariant admissibility  optimal control  policy iteration  sum of squares  
LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 卷号: 21, 期号: 11, 页码: 4516-4525
Authors:  Zhu, Yuanheng;  He, Haibo;  Zhao, Dongbin
Favorite  |  View/Download:39/0  |  Submit date:2021/01/06
Cooperative adaptive cruise control  string stability  time-delay system  H-infinity control  linear matrix inequality  
Synthesis of Cooperative Adaptive Cruise Control With Feedforward Strategies 期刊论文
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 卷号: 69, 期号: 4, 页码: 3615-3627
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Favorite  |  View/Download:54/0  |  Submit date:2020/06/22
Cooperative cruise control  H-infinity-norm  L-2-gain  time-delay system  state-space model  
Enhanced Rolling Horizon Evolution Algorithm with Opponent Model Learning 期刊论文
IEEE TRANSACTIONS ON GAMES, 2020, 期号: Early Access, 页码: Early Access
Authors:  Zhentao Tang;  Yuanheng Zhu;  Dongbin Zhao;  Simon M. Lucas
Adobe PDF(7686Kb)  |  Favorite  |  View/Download:54/16  |  Submit date:2021/07/05
Rolling horizon evolution  opponent model  reinforcement learning  supervised learning  fighting game