CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共14条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat 期刊论文
IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, 页码: DOI: 10.1109/TSMC.2023.3270444
作者:  Jiajun Chai;  Wenzhang Chen;  Yuanheng Zhu;  Zong-xin Yao,;  Dongbin Zhao
Adobe PDF(9249Kb)  |  收藏  |  浏览/下载:198/107  |  提交时间:2023/04/26
Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2017, 卷号: 11, 期号: 14, 页码: 2307-2316
作者:  Yang, Xiong;  He, Haibo;  Liu, Derong;  Zhu, Yuanheng
浏览  |  Adobe PDF(2123Kb)  |  收藏  |  浏览/下载:426/141  |  提交时间:2017/09/13
Dynamic Programming  Robust Control  Neurocontrollers  Continuous Time Systems  Control System Synthesis  Nonlinear Control Systems  Optimal Control  Function Approximation  Monte Carlo Methods  Closed Loop Systems  Asymptotic Stability  Adaptive Dynamic Programming  Robust Neural Control Design  Unknown Continuous-time Nonlinear Systems  Ct Nonlinear Systems  Adp-based Robust Neural Control Scheme  Robust Nonlinear Control Problem  Nonlinear Optimal Control Problem  Nominal System  Adp Algorithm  Actor-critic Dual Networks  Control Policy Approximation  Value Function Approximation  Actor Neural Network Weights  Critic Nn Weights  Monte Carlo Integration Method  Closed-loop System  Asymptotically Stability  
Event-Triggered H-infinity Control for Continuous-Time Nonlinear System via Concurrent Learning 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 卷号: 47, 期号: 7, 页码: 1071-1081
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2937Kb)  |  收藏  |  浏览/下载:521/238  |  提交时间:2017/05/04
Concurrent Learning  Event-triggered Control  H-infinity Optimal Control  Neural Networks (Nns)  Zero-sum (Zs) Game  
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs 期刊论文
NEUROCOMPUTING, 2017, 卷号: 238, 期号: *, 页码: 377-386
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(1508Kb)  |  收藏  |  浏览/下载:608/265  |  提交时间:2017/05/04
Adaptive Dynamic Programming  Optimal Control  Neural Network  Fully Cooperative Games  Data-driven  Constrained Input  
Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 卷号: 64, 期号: 5, 页码: 4101-4109
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
浏览  |  Adobe PDF(2325Kb)  |  收藏  |  浏览/下载:512/205  |  提交时间:2017/09/12
Actor-critic-identifier  Concurrent Learning  Constrained Input  Event-triggered (Et) Control  Hamilton-jacobi-bellman (Hjb) Equation  
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 3, 页码: 714-725
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
浏览  |  Adobe PDF(547Kb)  |  收藏  |  浏览/下载:436/182  |  提交时间:2017/05/05
Adaptive Dynamic Programming (Adp)  H-infinity Control  Policy Iteration (Pi)  Zero-sum Game (Zsg)  
深度强化学习进展: 从 AlphaGo 到 AlphaGo Zero 期刊论文
控 制 理 论 与 应 用, 2017, 卷号: 34, 期号: 12, 页码: 1529-1546
作者:  唐振韬;  邵 坤;  赵冬斌;  朱圆恒
Adobe PDF(8232Kb)  |  收藏  |  浏览/下载:217/33  |  提交时间:2021/07/05
深度强化学习  AlphaGo Zero  深度学习  强化学习  人工智能  
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 3, 页码: 854-865
作者:  Zhao, Dongbin;  Zhang, Qichao;  Wang, Ding;  Zhu, Yuanheng
浏览  |  Adobe PDF(1769Kb)  |  收藏  |  浏览/下载:499/194  |  提交时间:2016/06/14
Adaptive Dynamic Programming (Adp)  Experience Replay  Nonzero-sum (Nzs) Games  Optimal Control  Unknown Dynamics  
深度强化学习综述:兼论计算机围棋的发展 期刊论文
控制理论与应用, 2016, 卷号: 33, 期号: 6, 页码: 701-717
作者:  赵冬斌;  邵坤;  朱圆恒;  李栋;  陈亚冉;  王海涛;  刘德荣;  周彤;  王成红
浏览  |  Adobe PDF(2816Kb)  |  收藏  |  浏览/下载:1727/634  |  提交时间:2017/09/13
深度强化学习  初弈号  深度学习  强化学习  人工智能  
连续状态系统的近似最优在线强化学习 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2015
作者:  朱圆恒
Adobe PDF(2679Kb)  |  收藏  |  浏览/下载:498/0  |  提交时间:2015/09/02
强化学习  最优控制  近似策略迭代  概率近似最优  连续状态系统  收敛性  在线学习  Kd树  Reinforcement Learning  Optimal Control  Approximate Policy Iteration  Probably Approximately Correct  Continuous-state System  Convergence  Online Learning  Kd-tree