CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

已选(0)清除 条数/页:   排序方式:
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 13
作者:  Wei, Qinglai;  Zhou, Tianmin;  Lu, Jingwei;  Liu, Yu;  Su, Shuai;  Xiao, Jun
收藏  |  浏览/下载:96/0  |  提交时间:2023/11/17
Adaptive dynamic programming (ADP)  Hamilton-Jacobi-Bellman equation (HJBE)  nonlinear stochastic system  stochastic policy iteration (PI)  
Neural-Network-Based Control for Discrete-Time Nonlinear Systems with Input Saturation Under Stochastic Communication Protocol 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2021, 卷号: 8, 期号: 4, 页码: 766-778
作者:  Xueli Wang;  Derui Ding;  Hongli Dong;  Xian-Ming Zhang
Adobe PDF(1995Kb)  |  收藏  |  浏览/下载:184/40  |  提交时间:2021/04/09
Adaptive dynamic programming (ADP)  constrained inputs  neural network (NN)  stochastic communication protocols (SCPs)  suboptimal control  
Neuro-optimal control for discrete stochastic processes via a novel policy iteration algorithm 期刊论文
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, 卷号: 50, 期号: 11, 页码: 3972-3985
作者:  Liang, Mingming;  Wang, Ding;  Liu, Derong
浏览  |  Adobe PDF(1604Kb)  |  收藏  |  浏览/下载:180/63  |  提交时间:2020/10/23
Adaptive critic designs  adaptive dynamic programming (ADP)  local policy iteration  neuro-dynamic programming  optimal control  stochastic processes  
Improved value iteration for neural-network-based stochastic optimal control design 期刊论文
NEURAL NETWORKS, 2020, 卷号: 124, 页码: 280-295
作者:  Liang, Mingming;  Wang, Ding;  Liu, Derong
Adobe PDF(5875Kb)  |  收藏  |  浏览/下载:227/50  |  提交时间:2020/06/02
Adaptive critic designs  Adaptive dynamic programming  Neural networks  Optimal control  Stochastic processes  Value iteration  
Primal Averaging: A New Gradient Evaluation Step to Attain the Optimal Individual Convergence 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 卷号: 50, 期号: 2, 页码: 835-845
作者:  Tao, Wei;  Pan, Zhisong;  Wu, Gaowei;  Tao, Qing
收藏  |  浏览/下载:189/0  |  提交时间:2020/03/30
Convergence  Convex functions  Machine learning  Optimization methods  Linear programming  Cybernetics  Individual convergence  machine learning  mirror descent (MD) methods  regularized learning problems  stochastic gradient descent (SGD)  stochastic optimization  
A stochastic model for budget distribution over time in search advertisements 会议论文
Proceedings of 2014 IEEE International Conference on Service Operations and Logistics, and Informatic, Qingdao, China, October 8-10, 2014
作者:  Qin, Rui;  Yuan, Yong;  Li, Juanjuan;  Yang, Yanwu;  Rui Qin
浏览  |  Adobe PDF(1043Kb)  |  收藏  |  浏览/下载:268/62  |  提交时间:2016/06/20
Search Advertisement  Budget Distribution  Optimal Budget  Stochastic Programming  Budget Constraint