CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 8 items in total, items 1-8

Dynamic-horizon model-based value estimation with latent imagination  Journal article
IEEE Transactions on Neural Networks and Learning Systems, 2022, Pages: 1-14
Authors:  Wang JJ (王俊杰);  Zhang QC (张启超);  Zhao DB (赵冬斌)
Adobe PDF(2305Kb)  |  Views/Downloads: 197/69  |  Submitted: 2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment  Journal article
IEEE Transactions on Neural Networks and Learning Systems, 2020, Volume: 31, Issue: 6, Pages: 2064-2076
Authors:  Li, Haoran;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(4274Kb)  |  Views/Downloads: 412/126  |  Submitted: 2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming  Journal article
IEEE Transactions on Cybernetics, 2018, Volume: 48, Issue: 2, Pages: 500-509
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  Views/Downloads: 328/52  |  Submitted: 2018/10/10
Adaptive Dynamic Programming (ADP)  H Infinity Optimal Control  Policy Iteration (PI)  Polynomial Nonlinear Systems  Sum of Squares (SOS)
Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games  Conference paper
Guangzhou, China, November 14–18
Authors:  Zhang, Qichao;  Zhao, Dongbin;  Zhang, Sibo
Adobe PDF(119Kb)  |  Views/Downloads: 278/106  |  Submitted: 2017/12/28
Data-driven adaptive dynamic programming for two-player nonzero-sum game  Conference paper
Chongqing, China, 2017-5
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(141Kb)  |  Views/Downloads: 375/197  |  Submitted: 2017/05/04
Event-Based Robust Control for Uncertain Nonlinear Systems Using Adaptive Dynamic Programming  Journal article
IEEE Transactions on Neural Networks and Learning Systems, 2018, Volume: 29, Issue: 1, Pages: 37-50
Authors:  Zhang, Qichao;  Zhao, Dongbin;  Wang, Ding
Adobe PDF(2233Kb)  |  Views/Downloads: 568/255  |  Submitted: 2017/05/04
Adaptive Dynamic Programming (ADP)  Event-based Control  Neural Network (NN)  Robust Control  Unmatched Uncertainties
Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics  Journal article
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2016, Volume: 46, Issue: 11, Pages: 1544-1555
Authors:  Wang, Ding;  Liu, Derong;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1082Kb)  |  Views/Downloads: 488/210  |  Submitted: 2017/02/14
Adaptive Critic Designs  Adaptive Dynamic Programming  Intelligent Control  Neural Networks  Policy Iteration  Robust Optimal Control  System Identification  Uncertain Nonlinear Systems
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics  Journal article
IEEE Transactions on Cybernetics, 2016, Volume: 46, Issue: 3, Pages: 854-865
Authors:  Zhao, Dongbin;  Zhang, Qichao;  Wang, Ding;  Zhu, Yuanheng
Adobe PDF(1769Kb)  |  Views/Downloads: 539/204  |  Submitted: 2016/06/14
Adaptive Dynamic Programming (ADP)  Experience Replay  Nonzero-sum (NZS) Games  Optimal Control  Unknown Dynamics