CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Error Bound Analysis of Q-Function for Discounted Optimal Control Problems With Policy Iteration 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 卷号: 47, 期号: 7, 页码: 1207-1216
作者:  Yan, Pengfei;  Wang, Ding;  Li, Hongliang;  Liu, Derong
Adobe PDF(625Kb)  |  收藏  |  浏览/下载:314/76  |  提交时间:2017/09/12
Adaptive Dynamic Programming (Adp)  Error Analysis  Nonlinear Systems  Policy Iteration  Q-function  
Event-Driven Adaptive Robust Control of Nonlinear Systems With Uncertainties Through NDP Strategy 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 卷号: 47, 期号: 7, 页码: 1358-1370
作者:  Wang, Ding;  Mu, Chaoxu;  He, Haibo;  Liu, Derong
Adobe PDF(1425Kb)  |  收藏  |  浏览/下载:366/132  |  提交时间:2017/09/12
Adaptive Dynamic Programming (Adp)  Adaptive Robust Control  Critic Neural Network  Event-driven Control  Neural Dynamic Programming (Ndp)  Uncertain Nonlinear Systems  
Generalized Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 卷号: 45, 期号: 12, 页码: 1577-1591
作者:  Liu, Derong;  Wei, Qinglai;  Yan, Pengfei
浏览  |  Adobe PDF(1540Kb)  |  收藏  |  浏览/下载:217/66  |  提交时间:2016/03/19
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration  Neural Networks  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
Analyzing Positioning Strategies in Sponsored Search Auctions Under CTR-Based Quality Scoring 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 卷号: 45, 期号: 4, 页码: 688-701
作者:  Yuan, Yong;  Zeng, Daniel;  Zhao, Huimin;  Li, Linjing
浏览  |  Adobe PDF(1156Kb)  |  收藏  |  浏览/下载:353/110  |  提交时间:2015/09/21
Terms-click-through Rate (Ctr)  Optimal Control  Polarization  Quality Score (Qs)  Sponsored Search  
Online Synchronous Approximate Optimal Learning Algorithm for Multiplayer Nonzero-Sum Games With Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2014, 卷号: 44, 期号: 8, 页码: 1015-1027
作者:  Liu, Derong;  Li, Hongliang;  Wang, Ding
Adobe PDF(20912Kb)  |  收藏  |  浏览/下载:205/79  |  提交时间:2015/08/12
Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Multiplayer Nonzero-sum Games  Neural Networks  Neuro-dynamic Programming  Policy Iteration