CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共19条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 8, 页码: 2874-2885
作者:  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(1021Kb)  |  收藏  |  浏览/下载:383/118  |  提交时间:2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
Comprehensive comparison of online ADP algorithms for continuous-time optimal control 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  收藏  |  浏览/下载:397/180  |  提交时间:2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:274/33  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Event-Based Robust Control for Uncertain Nonlinear Systems Using Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 1, 页码: 37-50
作者:  Zhang, Qichao;  Zhao, Dongbin;  Wang, Ding
浏览  |  Adobe PDF(2233Kb)  |  收藏  |  浏览/下载:507/238  |  提交时间:2017/05/04
Adaptive Dynamic Programming (Adp)  Event-based Control  Neural Network (Nn)  Robust Control  Unmatched Uncertainties  
Event-Triggered H-infinity Control for Continuous-Time Nonlinear System via Concurrent Learning 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 卷号: 47, 期号: 7, 页码: 1071-1081
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2937Kb)  |  收藏  |  浏览/下载:508/238  |  提交时间:2017/05/04
Concurrent Learning  Event-triggered Control  H-infinity Optimal Control  Neural Networks (Nns)  Zero-sum (Zs) Game  
Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 卷号: 64, 期号: 5, 页码: 4101-4109
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
浏览  |  Adobe PDF(2325Kb)  |  收藏  |  浏览/下载:496/203  |  提交时间:2017/09/12
Actor-critic-identifier  Concurrent Learning  Constrained Input  Event-triggered (Et) Control  Hamilton-jacobi-bellman (Hjb) Equation  
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 3, 页码: 714-725
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
浏览  |  Adobe PDF(547Kb)  |  收藏  |  浏览/下载:426/179  |  提交时间:2017/05/05
Adaptive Dynamic Programming (Adp)  H-infinity Control  Policy Iteration (Pi)  Zero-sum Game (Zsg)  
Policy Iteration for Hinfinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE Transactions on Cybernetics, 2017, 期号: PP, 页码: 1-9
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(894Kb)  |  收藏  |  浏览/下载:319/157  |  提交时间:2017/09/13
Adaptive Dynamic Programming (Adp)  H∞ Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations 会议论文
, Vancouver, Canada, 2016-7
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng;  Chen, Xi
浏览  |  Adobe PDF(339Kb)  |  收藏  |  浏览/下载:260/85  |  提交时间:2017/05/04
Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2016, 卷号: 46, 期号: 11, 页码: 1544-1555
作者:  Wang, Ding;  Liu, Derong;  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(1082Kb)  |  收藏  |  浏览/下载:424/193  |  提交时间:2017/02/14
Adaptive Critic Designs  Adaptive Dynamic Programming  Intelligent Control  Neural Networks  Policy Iteration  Robust Optimal Control  System Identification  Uncertain Nonlinear Systems