CASIA OpenIR

浏览/检索结果: 共3条,第1-3条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Policy Iteration Algorithm Based Fault Tolerant Tracking Control: An Implementation on Reconfigurable Manipulators 期刊论文
JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2018, 卷号: 13, 期号: 4, 页码: 1739-1750
作者:  Li, Yuanchun;  Xia, Hongbing;  Zhao, Bo
浏览  |  Adobe PDF(708Kb)  |  收藏  |  浏览/下载:378/67  |  提交时间:2018/10/10
Adaptive dynamic programming  Policy iteration  Fault tolerant tracking control  Reconfigurable manipulators  Neural network  
Comprehensive comparison of online ADP algorithms for continuous-time optimal control 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  收藏  |  浏览/下载:429/189  |  提交时间:2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Reinforcement learning solution for HJB equation arising in constrained optimal control problem 期刊论文
NEURAL NETWORKS, 2015, 卷号: 71, 期号: 0, 页码: 150-158
作者:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen;  Liu, Derong
浏览  |  Adobe PDF(530Kb)  |  收藏  |  浏览/下载:486/207  |  提交时间:2016/03/30
Constrained Optimal Control  Data-based  Off-policy Reinforcement Learning  Hamilton-jacobi-bellman Equation  The Method Of Weighted Residuals