CASIA OpenIR

浏览/检索结果: 共3条,第1-3条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 10, 页码: 3341-3354
作者:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning;  Wang, Ding;  Lewis, Frank L.
浏览  |  Adobe PDF(3217Kb)  |  收藏  |  浏览/下载:573/205  |  提交时间:2016/11/09
Adaptive Control  Adaptive Dynamic Programming (Adp)  Data-based  Off-policy Learning  Optimal Control  Policy Gradient  
An efficient realization of deep learning for traffic data imputation 期刊论文
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2016, 卷号: 72, 页码: 168-181
作者:  Duan, Yanjie;  Lv, Yisheng;  Liu, Yu-Liang;  Wang, Fei-Yue
Adobe PDF(824Kb)  |  收藏  |  浏览/下载:558/272  |  提交时间:2017/02/14
Traffic Data Imputation  Deep Learning  Missing Data  
Reinforcement learning solution for HJB equation arising in constrained optimal control problem 期刊论文
NEURAL NETWORKS, 2015, 卷号: 71, 期号: 0, 页码: 150-158
作者:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen;  Liu, Derong
浏览  |  Adobe PDF(530Kb)  |  收藏  |  浏览/下载:450/196  |  提交时间:2016/03/30
Constrained Optimal Control  Data-based  Off-policy Reinforcement Learning  Hamilton-jacobi-bellman Equation  The Method Of Weighted Residuals