CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 11 items in total, showing items 1-10

Driving Control with Deep and Reinforcement Learning in The Open Racing Car Simulator (conference paper)
, Siem Reap, Cambodia, 2018, 12, 13-16
Authors: Yuanheng Zhu; Dongbin Zhao
Adobe PDF(697Kb) | Views/Downloads: 20/9 | Submitted: 2024/06/05
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games (conference paper)
, Xiamen, China, 2019-12-6
Authors: Li WF(李伟凡); Zhu YH(朱圆恒); Zhao DB(赵冬斌)
Adobe PDF(488Kb) | Views/Downloads: 151/47 | Submitted: 2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Vision-based control in the open racing car simulator with deep and reinforcement learning (journal article)
Journal of Ambient Intelligence and Humanized Computing, 2019, DOI: 10.1007/s12652-019-01503-y
Authors: Yuanheng Zhu; Dongbin Zhao
Adobe PDF(2210Kb) | Views/Downloads: 59/14 | Submitted: 2023/04/26
Empirical Policy Optimization for n-Player Markov Games (journal article)
IEEE Transactions on Cybernetics, 2022, DOI: 10.1109/TCYB.2022.3179775
Authors: Yuanheng Zhu; Weifan Li; Mengchen Zhao; Jianye Hao; Dongbin Zhao
Adobe PDF(1739Kb) | Views/Downloads: 107/42 | Submitted: 2023/04/26
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors: Zhu, Yuanheng; Zhao, Dongbin
Adobe PDF(2838Kb) | Views/Downloads: 236/6 | Submitted: 2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target (journal article)
COMPLEX & INTELLIGENT SYSTEMS, 2021, Pages: 12
Authors: Li, Weifan; Zhu, Yuanheng; Zhao, Dongbin
Adobe PDF(1431Kb) | Views/Downloads: 313/59 | Submitted: 2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Synthesis of Cooperative Adaptive Cruise Control With Feedforward Strategies (journal article)
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, Volume: 69, Issue: 4, Pages: 3615-3627
Authors: Zhu, Yuanheng; Zhao, Dongbin; He, Haibo
Adobe PDF(2462Kb) | Views/Downloads: 190/9 | Submitted: 2020/06/22
Cooperative cruise control  H-infinity-norm  L-2-gain  time-delay system  state-space model  
Adaptive Optimal Control of Heterogeneous CACC System With Uncertain Dynamics (journal article)
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2019, Volume: 27, Issue: 4, Pages: 1772-1779
Authors: Zhu, Yuanheng; Zhao, Dongbin; Zhong, Zhiguang
Adobe PDF(1189Kb) | Views/Downloads: 281/10 | Submitted: 2019/09/30
Adaptive optimal control  cooperative adaptive cruise control (CACC)  heterogeneous platoon  string stability  sum-of-squares polynomial  
An Autonomous Driving Experience Platform with Learning-Based Functions (conference paper)
, Bangalore, India, 18-21 Nov. 2018
Authors: Li, Dong; Zhao, Dongbin; Zhang, Qichao; Zhu, Yuanheng
Adobe PDF(215Kb) | Views/Downloads: 300/76 | Submitted: 2019/04/25
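Review of deep reinforcement learning and discussions on the development of computer Go (深度强化学习综述:兼论计算机围棋的发展) (journal article)
Control Theory & Applications (控制理论与应用), 2016, Volume: 33, Issue: 6, Pages: 701-717
Authors: Zhao Dongbin (赵冬斌); Shao Kun (邵坤); Zhu Yuanheng (朱圆恒); Li Dong (李栋); Chen Yaran (陈亚冉); Wang Haitao (王海涛); Liu Derong (刘德荣); Zhou Tong (周彤); Wang Chenghong (王成红)
Adobe PDF(2816Kb) | Views/Downloads: 1792/656 | Submitted: 2017/09/13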
Deep reinforcement learning  AlphaGo (初弈号)  Deep learning  Reinforcement learning  Artificial intelligence