CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共9条,第1-9条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Vision-based control in the open racing car simulator with deep and reinforcement learning 期刊论文
Journal of Ambient Intelligence and Humanized Computing, 2019, 页码: doi={10.1007/s12652-019-01503-y}
作者:  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(2210Kb)  |  收藏  |  浏览/下载:64/17  |  提交时间:2023/04/26
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:250/12  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 11, 页码: 3959-3971
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2079Kb)  |  收藏  |  浏览/下载:216/17  |  提交时间:2021/01/07
Optimal control  Discrete-time systems  Heuristic algorithms  Dynamic programming  Convergence  Artificial intelligence  Nonlinear systems  Adaptive dynamic programming  discrete-time systems  invariant admissibility  optimal control  policy iteration  sum of squares  
Control-Limited Adaptive Dynamic Programming for Multi-Battery Energy Storage Systems 期刊论文
IEEE TRANSACTIONS ON SMART GRID, 2019, 卷号: 10, 期号: 4, 页码: 4235-4244
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun;  Wang, Ding
Adobe PDF(973Kb)  |  收藏  |  浏览/下载:318/16  |  提交时间:2019/09/30
Microgrid  energy storage system  multi-battery management system  adaptive dynamic programming  control-limited optimization  
Online reinforcement learning for continuous-state systems 专著章节/文集论文
出自: Frontiers of Intelligent Control and Information Processing, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore:World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, 2014
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
Adobe PDF(24150Kb)  |  收藏  |  浏览/下载:277/35  |  提交时间:2017/09/13
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs 期刊论文
NEUROCOMPUTING, 2017, 卷号: 238, 期号: *, 页码: 377-386
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(1508Kb)  |  收藏  |  浏览/下载:666/280  |  提交时间:2017/05/04
Adaptive Dynamic Programming  Optimal Control  Neural Network  Fully Cooperative Games  Data-driven  Constrained Input  
Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems 期刊论文
COGNITIVE COMPUTATION, 2015, 卷号: 7, 期号: 6, 页码: 763-771
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
Adobe PDF(809Kb)  |  收藏  |  浏览/下载:271/41  |  提交时间:2016/01/18
Approximate Policy Iteration  Approximation Error  Optimal Control  Fuzzy Approximator  
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2015, 卷号: 26, 期号: 4, 页码: 775-787
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1331Kb)  |  收藏  |  浏览/下载:281/63  |  提交时间:2015/09/21
Reinforcement Learning  Probably Approximately Correct  Kd-tree  
Full-range adaptive cruise control based on supervised adaptive dynamic programming 期刊论文
NEUROCOMPUTING, 2014, 卷号: 125, 页码: 57-67
作者:  Zhao, Dongbin;  Hu, Zhaohui;  Xia, Zhongpu;  Alippi, Cesare;  Zhu, Yuanheng;  Wang, Ding
浏览  |  Adobe PDF(2228Kb)  |  收藏  |  浏览/下载:434/124  |  提交时间:2015/08/12
Adaptive Dynamic Programming  Supervised Reinforcement Learning  Neural Networks  Adaptive Cruise Control  Stop And Go