CASIA OpenIR

Browse/Search results: 7 items in total, showing items 1-7

A Probabilistic Mechanism Design for Online Auctions (Journal Article)
IEEE ACCESS, 2017, Volume: 2017, Issue: .5, Pages: 10782-10794
Authors:  Zhang, Jie;  Li, Linjing;  Wang, Fei-Yue
Adobe PDF(3721Kb)  |  Views/Downloads: 391/101  |  Submitted: 2017/09/12
Mechanism Design  Online Auctions  Randomized Mechanisms  E-commerce  Computational Experiments  Probabilistic Mechanism Design  
Multi-step heuristic dynamic programming for optimal control of nonlinear discrete-time systems (Journal Article)
INFORMATION SCIENCES, 2017, Volume: 411, Issue: 0, Pages: 66-83
Authors:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Yang, Xiong;  Ma, Hongwen
Adobe PDF(1092Kb)  |  Views/Downloads: 382/125  |  Submitted: 2017/09/12
Optimal Control  Multi-step Heuristic Dynamic Programming  Adaptive Dynamic Programming  Nonlinear Systems  Discrete-time  Neural Networks  
Error Bound Analysis of Q-Function for Discounted Optimal Control Problems With Policy Iteration (Journal Article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, Volume: 47, Issue: 7, Pages: 1207-1216
Authors:  Yan, Pengfei;  Wang, Ding;  Li, Hongliang;  Liu, Derong
Adobe PDF(625Kb)  |  Views/Downloads: 337/82  |  Submitted: 2017/09/12
Adaptive Dynamic Programming (ADP)  Error Analysis  Nonlinear Systems  Policy Iteration  Q-function  
Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics (Journal Article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2016, Volume: 46, Issue: 11, Pages: 1544-1555
Authors:  Wang, Ding;  Liu, Derong;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1082Kb)  |  Views/Downloads: 454/199  |  Submitted: 2017/02/14
Adaptive Critic Designs  Adaptive Dynamic Programming  Intelligent Control  Neural Networks  Policy Iteration  Robust Optimal Control  System Identification  Uncertain Nonlinear Systems  
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control (Journal Article)
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3341-3354
Authors:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning;  Wang, Ding;  Lewis, Frank L.
Adobe PDF(3217Kb)  |  Views/Downloads: 589/210  |  Submitted: 2016/11/09
Adaptive Control  Adaptive Dynamic Programming (ADP)  Data-based  Off-policy Learning  Optimal Control  Policy Gradient  
Model-Free Optimal Tracking Control via Critic-Only Q-Learning (Journal Article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, Volume: 27, Issue: 10, Pages: 2134-2144
Authors:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Wang, Ding
Adobe PDF(1521Kb)  |  Views/Downloads: 580/284  |  Submitted: 2016/10/24
Critic-only Q-learning (CoQL)  Model-free  Nonaffine Nonlinear Systems  Optimal Tracking Control  
Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems (Journal Article)
IEEE TRANSACTIONS ON CYBERNETICS, 2016, Volume: 46, Issue: 3, Pages: 840-853
Authors:  Wei, Qinglai;  Liu, Derong;  Lin, Hanquan
Adobe PDF(2015Kb)  |  Views/Downloads: 378/164  |  Submitted: 2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Neural Networks  Neuro-dynamic Programming  Optimal Control  Reinforcement Learning  Value Iteration