CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共24条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:159/59  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 页码: 36
作者:  Yang, Yongliang;  Zhu, Hufei;  Zhang, Qichao;  Zhao, Bo;  Li, Zhenning;  Wunsch, Donald C.
收藏  |  浏览/下载:201/0  |  提交时间:2021/11/02
Reproducing kernel Hilbert space  Actor-critic learning  Value function approximation  Online sparsification  Non-parametric learning  
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 6, 页码: 2064-2076
作者:  Li, Haoran;  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(4274Kb)  |  收藏  |  浏览/下载:370/115  |  提交时间:2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation  
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 8, 页码: 2874-2885
作者:  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(1021Kb)  |  收藏  |  浏览/下载:415/122  |  提交时间:2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:295/39  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Event-triggered hinfinity control for continuous-time nonlinear system 会议论文
, *, 2015
作者:  Zhao,Dongbin(赵冬斌);  Zhang,Qichao;  Li,Xiangjun;  Kong,Lingda
浏览  |  Adobe PDF(365Kb)  |  收藏  |  浏览/下载:237/85  |  提交时间:2018/01/04
Comparison of methods to efficient graph SLAM under general optimization framework 会议论文
YAC 2017
作者:  Haoran Li;  Qichao Zhang;  Dongbin Zhao
浏览  |  Adobe PDF(151Kb)  |  收藏  |  浏览/下载:901/517  |  提交时间:2017/12/31
Optimization  Slam  Pose Graph  
Policy Gradient Methods with Gaussian Process Modelling Acceleration 会议论文
, Anchorage, AK, USA, 14-19 May 2017
作者:  Li, Dong;  Zhao, Dongbin;  Zhang, Qichao;  Luo, Chaomin
浏览  |  Adobe PDF(720Kb)  |  收藏  |  浏览/下载:305/96  |  提交时间:2017/12/28
Event-Triggered H∞ Control for Continuous-Time Nonlinear System 会议论文
, Jeju, South Korea, October 15-18
作者:  Zhao,Dongbin;  Zhang,Qichao;  Li,Xiangjun;  Kong,Lingda
浏览  |  Adobe PDF(365Kb)  |  收藏  |  浏览/下载:181/47  |  提交时间:2017/12/28
Event-Triggered Adaptive Dynamic Programming for Uncertain Nonlinear Systems 会议论文
, Beijing, China, November 19–23
作者:  Zhang,Qichao;  Zhao,Dongbin;  Wang,Ding
浏览  |  Adobe PDF(153Kb)  |  收藏  |  浏览/下载:193/78  |  提交时间:2017/12/28