CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 12 items in total, showing items 1-10

Vision-based control in the open racing car simulator with deep and reinforcement learning (Journal article)
Journal of Ambient Intelligence and Humanized Computing, 2019, DOI: 10.1007/s12652-019-01503-y
Authors:  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF (2210 KB)  |  Views/Downloads: 42/9  |  Submitted: 2023/04/26
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target (Journal article)
COMPLEX & INTELLIGENT SYSTEMS, 2021, Pages: 12
Authors:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF (1431 KB)  |  Views/Downloads: 273/48  |  Submitted: 2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, Volume: 21, Issue: 11, Pages: 4516-4525
Authors:  Zhu, Yuanheng;  He, Haibo;  Zhao, Dongbin
Views/Downloads: 141/0  |  Submitted: 2021/01/06
Cooperative adaptive cruise control  string stability  time-delay system  H-infinity control  linear matrix inequality  
An Autonomous Driving Experience Platform with Learning-Based Functions (Conference paper)
Bangalore, India, 18-21 Nov. 2018
Authors:  Li, Dong;  Zhao, Dongbin;  Zhang, Qichao;  Zhu, Yuanheng
Adobe PDF (215 KB)  |  Views/Downloads: 271/69  |  Submitted: 2019/04/25
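Review of deep reinforcement learning and discussions on the development of computer Go (深度强化学习综述:兼论计算机围棋的发展) (Journal article)
Control Theory & Applications (控制理论与应用), 2016, Volume: 33, Issue: 6, Pages: 701-717
Authors:  Zhao Dongbin (赵冬斌);  Shao Kun (邵坤);  Zhu Yuanheng (朱圆恒);  Li Dong (李栋);  Chen Yaran (陈亚冉);  Wang Haitao (王海涛);  Liu Derong (刘德荣);  Zhou Tong (周彤);  Wang Chenghong (王成红)
Adobe PDF (2816 KB)  |  Views/Downloads: 1727/634  |  Submitted: 2017/09/13
Deep reinforcement learning  AlphaGo (初弈号)  Deep learning  Reinforcement learning  Artificial intelligence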
A data-based online reinforcement learning algorithm with high-efficient exploration (Conference paper)
Orlando, FL, USA, Dec, 2014
Authors:  Yuanheng Zhu;  Zhao DB (赵冬斌)
Adobe PDF (407 KB)  |  Views/Downloads: 201/79  |  Submitted: 2017/09/13
Online Model-Free RLSPI Algorithm for Nonlinear Discrete-Time Non-affine Systems (Conference paper)
Daegu, Korea, November 3-7, 2013
Authors:  Yuanheng Zhu;  Zhao DB (赵冬斌)
Adobe PDF (276 KB)  |  Views/Downloads: 159/51  |  Submitted: 2017/09/13
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs (Journal article)
NEUROCOMPUTING, 2017, Volume: 238, Issue: *, Pages: 377-386
Authors:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
Adobe PDF (1508 KB)  |  Views/Downloads: 613/267  |  Submitted: 2017/05/04
Adaptive Dynamic Programming  Optimal Control  Neural Network  Fully Cooperative Games  Data-driven  Constrained Input  
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2016, Volume: 46, Issue: 3, Pages: 854-865
Authors:  Zhao, Dongbin;  Zhang, Qichao;  Wang, Ding;  Zhu, Yuanheng
Adobe PDF (1769 KB)  |  Views/Downloads: 500/195  |  Submitted: 2016/06/14
Adaptive Dynamic Programming (Adp)  Experience Replay  Nonzero-sum (Nzs) Games  Optimal Control  Unknown Dynamics  
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle (Journal article)
NEURAL COMPUTING & APPLICATIONS, 2015, Volume: 26, Issue: 4, Pages: 775-787
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF (1331 KB)  |  Views/Downloads: 252/60  |  Submitted: 2015/09/21
Reinforcement Learning  Probably Approximately Correct  Kd-tree