CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共20条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:258/15  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 12
作者:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1431Kb)  |  收藏  |  浏览/下载:331/63  |  提交时间:2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Enhanced Rolling Horizon Evolution Algorithm With Opponent Model Learning: Results for the Fighting Game AI Competition 期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 5, 期号: 1, 页码: 5 - 15
作者:  Zhentao Tang;  Yuanheng Zhu;  Dongbin Zhao;  Simon M. Lucas
Adobe PDF(7686Kb)  |  收藏  |  浏览/下载:370/79  |  提交时间:2021/07/05
Rolling horizon evolution  opponent model  reinforcement learning  supervised learning  fighting game  
Control-Limited Adaptive Dynamic Programming for Multi-Battery Energy Storage Systems 期刊论文
IEEE TRANSACTIONS ON SMART GRID, 2019, 卷号: 10, 期号: 4, 页码: 4235-4244
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun;  Wang, Ding
Adobe PDF(973Kb)  |  收藏  |  浏览/下载:327/19  |  提交时间:2019/09/30
Microgrid  energy storage system  multi-battery management system  adaptive dynamic programming  control-limited optimization  
Reinforcement Learning and Deep Learning based Lateral Control for Autonomous Driving 期刊论文
IEEE Computational Intelligence Magazine, IEEE Computational Intelligence Magazine, 2019, 2019, 卷号: 14, 14, 期号: 2, 页码: 83-98, 83-98
作者:  Dong Li;  Dongbin Zhao;  Qichao Zhang;  Yaran Chen
Adobe PDF(2205Kb)  |  收藏  |  浏览/下载:403/116  |  提交时间:2019/04/25
Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:334/56  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Policy Iteration for Hinfinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE Transactions on Cybernetics, 2017, 期号: PP, 页码: 1-9
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(894Kb)  |  收藏  |  浏览/下载:381/177  |  提交时间:2017/09/13
Adaptive Dynamic Programming (Adp)  H∞ Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Model-free Optimal Control based Intelligent Cruise Control with Hardware-in-the-loop Demonstration 期刊论文
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2017, 卷号: 12, 期号: 2, 页码: 56-69
作者:  Zhao, Dongbin;  Xia, Zhongpu;  Zhang, Qichao
浏览  |  Adobe PDF(4525Kb)  |  收藏  |  浏览/下载:624/198  |  提交时间:2017/05/04
Intelligent Cruise Control  
An Incremental Change Detection Test Based on Density Difference Estimation 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 卷号: 49, 期号: 10, 页码: 2714-2726
作者:  Bu, Li;  Zhao, Dongbin;  Alippi, Cesare
Adobe PDF(1601Kb)  |  收藏  |  浏览/下载:471/169  |  提交时间:2017/05/04
Change Detection  Incremental Computing  Incremental Least Squares Density Difference Change Detection Method (Lsdd-inc)  Probability Density Function (Pdf)-free  
A pdf-Free Change Detection Test Based on Density Difference Estimation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 2, 页码: 324-334
作者:  Bu, Li;  Alippi, Cesare;  Zhao, Dongbin
Adobe PDF(2468Kb)  |  收藏  |  浏览/下载:418/122  |  提交时间:2017/05/04
Concept Drift  Least Squares Density-difference (Lsdd)-based Method  Probability Density Function (Pdf)-free  Three-level Threshold Mechanism