CASIA OpenIR

浏览/检索结果: 共13条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Comprehensive comparison of online ADP algorithms for continuous-time optimal control 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  收藏  |  浏览/下载:399/180  |  提交时间:2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:279/34  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
A pdf-Free Change Detection Test Based on Density Difference Estimation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 2, 页码: 324-334
作者:  Bu, Li;  Alippi, Cesare;  Zhao, Dongbin
浏览  |  Adobe PDF(2468Kb)  |  收藏  |  浏览/下载:359/104  |  提交时间:2017/05/04
Concept Drift  Least Squares Density-difference (Lsdd)-based Method  Probability Density Function (Pdf)-free  Three-level Threshold Mechanism  
Event-Based Robust Control for Uncertain Nonlinear Systems Using Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 1, 页码: 37-50
作者:  Zhang, Qichao;  Zhao, Dongbin;  Wang, Ding
浏览  |  Adobe PDF(2233Kb)  |  收藏  |  浏览/下载:513/238  |  提交时间:2017/05/04
Adaptive Dynamic Programming (Adp)  Event-based Control  Neural Network (Nn)  Robust Control  Unmatched Uncertainties  
Deep Reinforcement Learning With Visual Attention for Vehicle Classification 期刊论文
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2017, 卷号: 9, 期号: 4, 页码: 356-367
作者:  Zhao, Dongbin;  Chen, Yaran;  Lv, Le
浏览  |  Adobe PDF(3192Kb)  |  收藏  |  浏览/下载:1017/535  |  提交时间:2017/05/08
Convolutional Neural Network (Cnn)  Reinforcement Learning  Vehicle Classification  Visual Attention  
Event-Triggered H-infinity Control for Continuous-Time Nonlinear System via Concurrent Learning 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 卷号: 47, 期号: 7, 页码: 1071-1081
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2937Kb)  |  收藏  |  浏览/下载:512/238  |  提交时间:2017/05/04
Concurrent Learning  Event-triggered Control  H-infinity Optimal Control  Neural Networks (Nns)  Zero-sum (Zs) Game  
FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 6, 页码: 1367-1379
作者:  Zhang, Zhen;  Zhao, Dongbin;  Gao, Junwei;  Wang, Dongqing;  Dai, Yujie
收藏  |  浏览/下载:280/0  |  提交时间:2017/07/18
Multiagent Reinforcement Learning (Marl)  Nash Equilibrium  Q-learning  Repeated Game  
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs 期刊论文
NEUROCOMPUTING, 2017, 卷号: 238, 期号: *, 页码: 377-386
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(1508Kb)  |  收藏  |  浏览/下载:591/262  |  提交时间:2017/05/04
Adaptive Dynamic Programming  Optimal Control  Neural Network  Fully Cooperative Games  Data-driven  Constrained Input  
Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 卷号: 64, 期号: 5, 页码: 4101-4109
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
浏览  |  Adobe PDF(2325Kb)  |  收藏  |  浏览/下载:503/204  |  提交时间:2017/09/12
Actor-critic-identifier  Concurrent Learning  Constrained Input  Event-triggered (Et) Control  Hamilton-jacobi-bellman (Hjb) Equation  
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 3, 页码: 714-725
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
Adobe PDF(547Kb)  |  收藏  |  浏览/下载:428/179  |  提交时间:2017/05/05
Adaptive Dynamic Programming (Adp)  H-infinity Control  Policy Iteration (Pi)  Zero-sum Game (Zsg)