CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 8 items in total, showing items 1-8

Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games [Conference paper]
Guangzhou, China, November 14–18
Authors: Zhang, Qichao; Zhao, Dongbin; Zhang, Sibo
Adobe PDF (119Kb) | Views/Downloads: 232/83 | Submitted: 2017/12/28
An Incremental Change Detection Test Based on Density Difference Estimation [Journal article]
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, Volume: 49, Issue: 10, Pages: 2714-2726
Authors: Bu, Li; Zhao, Dongbin; Alippi, Cesare
Adobe PDF (1601Kb) | Views/Downloads: 398/142 | Submitted: 2017/05/04
Keywords: Change Detection; Incremental Computing; Incremental Least Squares Density Difference Change Detection Method (LSDD-Inc); Probability Density Function (PDF)-free
Policy Gradient Methods with Gaussian Process Modelling Acceleration [Conference paper]
Anchorage, AK, USA, 14-19 May 2017
Authors: Li, Dong; Zhao, Dongbin; Zhang, Qichao; Luo, Chaomin
Adobe PDF (720Kb) | Views/Downloads: 290/92 | Submitted: 2017/12/28
Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming [Journal article]
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, Volume: 64, Issue: 5, Pages: 4101-4109
Authors: Zhu, Yuanheng; Zhao, Dongbin; He, Haibo; Ji, Junhong
Adobe PDF (2325Kb) | Views/Downloads: 503/204 | Submitted: 2017/09/12
Keywords: Actor-Critic-Identifier; Concurrent Learning; Constrained Input; Event-Triggered (ET) Control; Hamilton-Jacobi-Bellman (HJB) Equation
A Kolmogorov-Smirnov test to detect changes in stationarity in big data [Conference paper]
Toulouse, France, July 9-14, 2017
Authors: Zhao, Dongbin; Bu, Li; Alippi, Cesare; Wei, Qinglai
Adobe PDF (1666Kb) | Views/Downloads: 301/112 | Submitted: 2017/05/04
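Recent progress of deep reinforcement learning: from AlphaGo to AlphaGo Zero [Journal article]
Control Theory & Applications (控制理论与应用), 2017, Volume: 34, Issue: 12, Pages: 1529-1546
Authors: Tang, Zhentao (唐振韬); Shao, Kun (邵坤); Zhao, Dongbin (赵冬斌); Zhu, Yuanheng (朱圆恒)
Adobe PDF (8232Kb) | Views/Downloads: 207/33 | Submitted: 2021/07/05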
Keywords: Deep Reinforcement Learning; AlphaGo Zero; Deep Learning; Reinforcement Learning; Artificial Intelligence
Policy Iteration for H∞ Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming [Journal article]
IEEE Transactions on Cybernetics, 2017, Issue: PP, Pages: 1-9
Authors: Yuanheng Zhu; Zhao DB (赵冬斌)
Adobe PDF (894Kb) | Views/Downloads: 321/157 | Submitted: 2017/09/13
Keywords: Adaptive Dynamic Programming (ADP); H∞ Optimal Control; Policy Iteration (PI); Polynomial Nonlinear Systems; Sum of Squares (SOS)
Comparison of methods to efficient graph SLAM under general optimization framework [Conference paper]
YAC 2017
Authors: Haoran Li; Qichao Zhang; Dongbin Zhao
Adobe PDF (151Kb) | Views/Downloads: 874/508 | Submitted: 2017/12/31
Keywords: Optimization; SLAM; Pose Graph