CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 10 items in total, showing items 1-10

Highway Lane Change Decision-Making via Attention-Based Deep Reinforcement Learning (journal article)
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, Volume: 9, Issue: 3, Pages: 567-569
Authors: Wang, Junjie; Zhang, Qichao; Zhao, Dongbin
Adobe PDF (803 KB) | Views/Downloads: 242/56 | Submitted: 2022/02/16
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics (journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors: Zhang, Qichao; Zhao, Dongbin
Adobe PDF (1021 KB) | Views/Downloads: 407/120 | Submitted: 2019/07/12
Keywords: Integral reinforcement learning (IRL); neural network (NN); nonzero-sum (NZS) games; off-policy; single-critic; unknown drift dynamics
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming (journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2018, Volume: 48, Issue: 2, Pages: 500-509
Authors: Zhu, Yuanheng; Zhao, Dongbin; Yang, Xiong; Zhang, Qichao
Adobe PDF (892 KB) | Views/Downloads: 290/37 | Submitted: 2018/10/10
Keywords: Adaptive Dynamic Programming (ADP); H Infinity Optimal Control; Policy Iteration (PI); Polynomial Nonlinear Systems; Sum of Squares (SOS)
Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games (conference paper)
Guangzhou, China, November 14–18
Authors: Zhang, Qichao; Zhao, Dongbin; Zhang, Sibo
Adobe PDF (119 KB) | Views/Downloads: 239/85 | Submitted: 2017/12/28
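Adaptive Dynamic Programming Methods for Several Classes of Differential Games (thesis)
Beijing: Graduate University of the Chinese Academy of Sciences, 2017
Author: Zhang, Qichao
Adobe PDF (4868 KB) | Views/Downloads: 418/12 | Submitted: 2017/06/07
Keywords: Adaptive dynamic programming; neural networks; differential games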
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs (journal article)
NEUROCOMPUTING, 2017, Volume: 238, Issue: *, Pages: 377-386
Authors: Zhang, Qichao; Zhao, Dongbin; Zhu, Yuanheng
Adobe PDF (1508 KB) | Views/Downloads: 610/266 | Submitted: 2017/05/04
Keywords: Adaptive Dynamic Programming; Optimal Control; Neural Network; Fully Cooperative Games; Data-driven; Constrained Input
Model-free Optimal Control based Intelligent Cruise Control with Hardware-in-the-loop Demonstration (journal article)
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2017, Volume: 12, Issue: 2, Pages: 56-69
Authors: Zhao, Dongbin; Xia, Zhongpu; Zhang, Qichao
Adobe PDF (4525 KB) | Views/Downloads: 507/182 | Submitted: 2017/05/04
Keywords: Intelligent Cruise Control
Data-driven adaptive dynamic programming for two-player nonzero-sum game (conference paper)
Chongqing, China, 2017-5
Authors: Zhang, Qichao; Zhao, Dongbin
Adobe PDF (141 KB) | Views/Downloads: 338/183 | Submitted: 2017/05/04
Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations (conference paper)
Vancouver, Canada, 2016-7
Authors: Zhang, Qichao; Zhao, Dongbin; Zhu, Yuanheng; Chen, Xi
Adobe PDF (339 KB) | Views/Downloads: 268/88 | Submitted: 2017/05/04
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics (journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2016, Volume: 46, Issue: 3, Pages: 854-865
Authors: Zhao, Dongbin; Zhang, Qichao; Wang, Ding; Zhu, Yuanheng
Adobe PDF (1769 KB) | Views/Downloads: 500/195 | Submitted: 2016/06/14
Keywords: Adaptive Dynamic Programming (ADP); Experience Replay; Nonzero-sum (NZS) Games; Optimal Control; Unknown Dynamics