CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共20条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文
, 昆士兰, 2023-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:209/68  |  提交时间:2023/06/29
multi-agent  reinforcement learning  policy gradient  
Enhanced Rolling Horizon Evolution Algorithm With Opponent Model Learning: Results for the Fighting Game AI Competition 期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 5, 期号: 1, 页码: 5 - 15
作者:  Zhentao Tang;  Yuanheng Zhu;  Dongbin Zhao;  Simon M. Lucas
Adobe PDF(7686Kb)  |  收藏  |  浏览/下载:223/63  |  提交时间:2021/07/05
Rolling horizon evolution  opponent model  reinforcement learning  supervised learning  fighting game  
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat 期刊论文
IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, 页码: DOI: 10.1109/TSMC.2023.3270444
作者:  Jiajun Chai;  Wenzhang Chen;  Yuanheng Zhu;  Zong-xin Yao,;  Dongbin Zhao
Adobe PDF(9249Kb)  |  收藏  |  浏览/下载:202/108  |  提交时间:2023/04/26
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 11, 页码: 3959-3971
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
收藏  |  浏览/下载:165/0  |  提交时间:2021/01/07
Optimal control  Discrete-time systems  Heuristic algorithms  Dynamic programming  Convergence  Artificial intelligence  Nonlinear systems  Adaptive dynamic programming  discrete-time systems  invariant admissibility  optimal control  policy iteration  sum of squares  
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, 卷号: 3, 期号: 1, 页码: 73-84
作者:  Kun Shao;  Yuanheng Zhu;  Dongbin Zhao
浏览  |  Adobe PDF(4125Kb)  |  收藏  |  浏览/下载:337/131  |  提交时间:2019/04/22
Reinforcement Learning, Transfer Learning, Curriculum Learning, Neural Network, Game Ai  
A Review of Computational Intelligence for StarCraft AI 会议论文
, Bangalore, India, 18-21 Nov. 2018
作者:  Tang, Zhentao;  Shao, Kun;  Zhu, Yuanheng;  Li, Dong;  Zhao, Dongbin;  Huang, Tingwen
浏览  |  Adobe PDF(131Kb)  |  收藏  |  浏览/下载:484/223  |  提交时间:2019/04/25
An Autonomous Driving Experience Platform with Learning-Based Functions 会议论文
, Bangalore, India, 18-21 Nov. 2018
作者:  Li, Dong;  Zhao, Dongbin;  Zhang, Qichao;  Zhu, Yuanheng
浏览  |  Adobe PDF(215Kb)  |  收藏  |  浏览/下载:271/69  |  提交时间:2019/04/25
Visual navigation with Actor-Critic deep reinforcement learning 会议论文
, Rio, Brazil, 2018-01
作者:  Kun Shao;  Dongbin Zhao;  Yuanheng Zhu;  Qichao Zhang
浏览  |  Adobe PDF(1827Kb)  |  收藏  |  浏览/下载:298/123  |  提交时间:2019/04/22
Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2017, 卷号: 11, 期号: 14, 页码: 2307-2316
作者:  Yang, Xiong;  He, Haibo;  Liu, Derong;  Zhu, Yuanheng
浏览  |  Adobe PDF(2123Kb)  |  收藏  |  浏览/下载:432/141  |  提交时间:2017/09/13
Dynamic Programming  Robust Control  Neurocontrollers  Continuous Time Systems  Control System Synthesis  Nonlinear Control Systems  Optimal Control  Function Approximation  Monte Carlo Methods  Closed Loop Systems  Asymptotic Stability  Adaptive Dynamic Programming  Robust Neural Control Design  Unknown Continuous-time Nonlinear Systems  Ct Nonlinear Systems  Adp-based Robust Neural Control Scheme  Robust Nonlinear Control Problem  Nonlinear Optimal Control Problem  Nominal System  Adp Algorithm  Actor-critic Dual Networks  Control Policy Approximation  Value Function Approximation  Actor Neural Network Weights  Critic Nn Weights  Monte Carlo Integration Method  Closed-loop System  Asymptotically Stability  
Event-Triggered H-infinity Control for Continuous-Time Nonlinear System via Concurrent Learning 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 卷号: 47, 期号: 7, 页码: 1071-1081
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2937Kb)  |  收藏  |  浏览/下载:529/238  |  提交时间:2017/05/04
Concurrent Learning  Event-triggered Control  H-infinity Optimal Control  Neural Networks (Nns)  Zero-sum (Zs) Game