CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 13 items in total, showing items 1-10

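Progress in Deep Reinforcement Learning: From AlphaGo to AlphaGo Zero [Journal article]
Control Theory & Applications (控制理论与应用), 2017, Vol. 34, No. 12, pp. 1529-1546
Authors: Tang Zhentao (唐振韬); Shao Kun (邵坤); Zhao Dongbin (赵冬斌); Zhu Yuanheng (朱圆恒)
Adobe PDF (8232 KB)  |  Views/Downloads: 237/38  |  Submitted: 2021/07/05
Deep Reinforcement Learning  AlphaGo Zero  Deep Learning  Reinforcement Learning  Artificial Intelligence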
Event-triggered integral reinforcement learning for nonlinear continuous-time systems [Conference paper]
Honolulu, Hawaii, USA, Nov. 27 to Dec. 1, 2017
Authors: Qichao Zhang; Dongbin Zhao
Views/Downloads: 66/0  |  Submitted: 2019/10/09
Policy Gradient Methods with Gaussian Process Modelling Acceleration [Conference paper]
Anchorage, AK, USA, 14-19 May 2017
Authors: Li, Dong; Zhao, Dongbin; Zhang, Qichao; Luo, Chaomin
Adobe PDF (720 KB)  |  Views/Downloads: 316/100  |  Submitted: 2017/12/28
Event-Triggered Adaptive Dynamic Programming for Uncertain Nonlinear Systems [Conference paper]
Beijing, China, November 19–23
Authors: Zhang, Qichao; Zhao, Dongbin; Wang, Ding
Adobe PDF (153 KB)  |  Views/Downloads: 200/82  |  Submitted: 2017/12/28
Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games [Conference paper]
Guangzhou, China, November 14–18
Authors: Zhang, Qichao; Zhao, Dongbin; Zhang, Sibo
Adobe PDF (119 KB)  |  Views/Downloads: 259/95  |  Submitted: 2017/12/28
Policy Iteration for H∞ Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming [Journal article]
IEEE Transactions on Cybernetics, 2017, No. PP, pp. 1-9
Authors: Yuanheng Zhu; Zhao DB (赵冬斌)
Adobe PDF (894 KB)  |  Views/Downloads: 349/164  |  Submitted: 2017/09/13
Adaptive Dynamic Programming (ADP)  H∞ Optimal Control  Policy Iteration (PI)  Polynomial Nonlinear Systems  Sum of Squares (SOS)
Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming [Journal article]
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, Vol. 64, No. 5, pp. 4101-4109
Authors: Zhu, Yuanheng; Zhao, Dongbin; He, Haibo; Ji, Junhong
Adobe PDF (2325 KB)  |  Views/Downloads: 542/213  |  Submitted: 2017/09/12
Actor-Critic-Identifier  Concurrent Learning  Constrained Input  Event-Triggered (ET) Control  Hamilton-Jacobi-Bellman (HJB) Equation
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, Vol. 28, No. 3, pp. 714-725
Authors: Zhu, Yuanheng; Zhao, Dongbin; Li, Xiangjun
Adobe PDF (547 KB)  |  Views/Downloads: 452/188  |  Submitted: 2017/05/05
Adaptive Dynamic Programming (ADP)  H-infinity Control  Policy Iteration (PI)  Zero-Sum Game (ZSG)
Data-driven adaptive dynamic programming for two-player nonzero-sum game [Conference paper]
Chongqing, China, 2017-5
Authors: Zhang, Qichao; Zhao, Dongbin
Adobe PDF (141 KB)  |  Views/Downloads: 353/187  |  Submitted: 2017/05/04
Event-triggered adaptive dynamic programming for uncertain nonlinear systems [Conference paper]
Beijing, China, 2016-12
Authors: Zhang, Qichao; Zhao, Dongbin; Wang, Ding
Adobe PDF (153 KB)  |  Views/Downloads: 197/108  |  Submitted: 2017/05/04