CASIA OpenIR
(Note: the search results are based on claimed items)

Browse/Search Results:  1-6 of 6

Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1021Kb)  |  View/Download:59/17  |  Submit date:2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
Event-Based Robust Control for Uncertain Nonlinear Systems Using Adaptive Dynamic Programming (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, Volume: 29, Issue: 1, Pages: 37-50
Authors:  Zhang, Qichao;  Zhao, Dongbin;  Wang, Ding
Adobe PDF(2233Kb)  |  View/Download:232/136  |  Submit date:2017/05/04
Adaptive Dynamic Programming (ADP)  Event-based Control  Neural Network (NN)  Robust Control  Unmatched Uncertainties  
Policy Iteration for H∞ Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming (Journal article)
IEEE Transactions on Cybernetics, 2017, Issue: PP, Pages: 1-9
Authors:  Yuanheng Zhu;  Zhao DB(赵冬斌)
Adobe PDF(894Kb)  |  View/Download:114/55  |  Submit date:2017/09/13
Adaptive Dynamic Programming (ADP)  H∞ Optimal Control  Policy Iteration (PI)  Polynomial Nonlinear Systems  Sum Of Squares (SOS)  
Data-driven adaptive dynamic programming for two-player nonzero-sum game (Conference paper)
, Chongqing, China, 2017-5
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(141Kb)  |  View/Download:152/76  |  Submit date:2017/05/04
Online reinforcement learning control by Bayesian inference (Journal article)
IET CONTROL THEORY AND APPLICATIONS, 2016, Volume: 10, Issue: 12, Pages: 1331-1338
Authors:  Xia, Zhongpu;  Zhao, Dongbin
Adobe PDF(1559Kb)  |  View/Download:132/46  |  Submit date:2016/06/15
Learning Systems  Bayes Methods  Gaussian Processes  Optimal Control  Online Reinforcement Learning Control  Bayesian Inference  Self-learning Control  Probability  Action Value Function  Gaussian Process  Bayesian-state-action-reward-state-action Algorithm  
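A Survey of Deep Reinforcement Learning: With a Discussion of the Development of Computer Go (Journal article)
Control Theory & Applications (控制理论与应用), 2016, Volume: 33, Issue: 6, Pages: 701-717
Authors:  Zhao, Dongbin;  Shao, Kun;  Zhu, Yuanheng;  Li, Dong;  Chen, Yaran;  Wang, Haitao;  Liu, Derong;  Zhou, Tong;  Wang, Chenghong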
Adobe PDF(2816Kb)  |  View/Download:883/347  |  Submit date:2017/09/13
Deep Reinforcement Learning  AlphaGo  Deep Learning  Reinforcement Learning  Artificial Intelligence